BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017050
         (378 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255552315|ref|XP_002517202.1| Protein C9orf114, putative [Ricinus communis]
 gi|223543837|gb|EEF45365.1| Protein C9orf114, putative [Ricinus communis]
          Length = 369

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 267/371 (71%), Positives = 312/371 (84%), Gaps = 6/371 (1%)

Query: 1   MGNKKKRGGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPI-- 58
           MG KKKR   E +  E  T ENHE  N++ + NGDS     KKKK+++R++   +     
Sbjct: 1   MGKKKKRAEAEAQ-TETETVENHEPVNDV-VVNGDSDRKKKKKKKEKERNERKKEENESK 58

Query: 59  EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
           E  T+S+AVPGSIIDN QSLELATRLAGQIARA TIFRIDEVVVFDN+SSS    R++  
Sbjct: 59  ETATISIAVPGSIIDNAQSLELATRLAGQIARAATIFRIDEVVVFDNESSSVKEDRTTMI 118

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
              N SDENESGAAFL+R+L+YLETPQYLRKALF   +SLRFVG+LPPLDAPHHLRKHEW
Sbjct: 119 T-GNNSDENESGAAFLIRILRYLETPQYLRKALFPRLNSLRFVGLLPPLDAPHHLRKHEW 177

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
           APFREGVTLKE APNS+GTLVDVGL+K+VV+DQV++PG+RVTV MGT+RNLD++ PRQVV
Sbjct: 178 APFREGVTLKEKAPNSIGTLVDVGLSKNVVIDQVVEPGIRVTVEMGTDRNLDSELPRQVV 237

Query: 239 PPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
             SKP+E +GMYWGY+VRYA NIS+VF +C YKGGYDHL+GTSEHG I+N+S L+LPTFR
Sbjct: 238 SLSKPREEAGMYWGYRVRYASNISTVFNDCPYKGGYDHLVGTSEHGQIINASKLSLPTFR 297

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           HLLIAFGGLAGLEESIEED+ LK KN REVF+SY NTCPHQGSRTIRTEEAIFISLQYFQ
Sbjct: 298 HLLIAFGGLAGLEESIEEDNSLKGKNVREVFNSYLNTCPHQGSRTIRTEEAIFISLQYFQ 357

Query: 358 EPISRALRRVE 368
           EPI+RAL+R++
Sbjct: 358 EPINRALQRIQ 368


>gi|224099761|ref|XP_002311607.1| predicted protein [Populus trichocarpa]
 gi|222851427|gb|EEE88974.1| predicted protein [Populus trichocarpa]
          Length = 366

 Score =  525 bits (1351), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 268/372 (72%), Positives = 303/372 (81%), Gaps = 11/372 (2%)

Query: 1   MGNKKKRGGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPI-- 58
           MG K+K+   E    +A   EN +++NEL L NGDS     KKK K + +  +D   I  
Sbjct: 1   MGKKQKKAEAE---TDAERVENDKAENELELTNGDSHKKKKKKKNKERNE--SDKEVIKA 55

Query: 59  -EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSA 117
            E+PTVSVA+ GSII+N QSLELATRLAGQIARA TIFRIDEVVVFDNKS+ +   R+  
Sbjct: 56  KEIPTVSVAISGSIINNAQSLELATRLAGQIARAATIFRIDEVVVFDNKSNYEKEDRTLT 115

Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
            +  N SDENESGAAF VR+L+YLETPQYLRKALF  H +LRFVGMLPPLDAPHHLRKHE
Sbjct: 116 TD--NYSDENESGAAFFVRILRYLETPQYLRKALFPKHCNLRFVGMLPPLDAPHHLRKHE 173

Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
           WAPFREGVTL E  PNS  TLVDVGL+K+V ++QVL+PG+RVTVAMGTNRNLD+DSPRQV
Sbjct: 174 WAPFREGVTLNEKVPNSGETLVDVGLSKNVSINQVLEPGIRVTVAMGTNRNLDSDSPRQV 233

Query: 238 VPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           V   KP+E +GMYWGY+VRYA NISSVFK+C YKGGYDHLIGTSEHG I+NSS+L+LP F
Sbjct: 234 VSLLKPREEAGMYWGYRVRYASNISSVFKDCPYKGGYDHLIGTSEHGLIINSSELSLPAF 293

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           RHLLIAFGGLAGLEE IEED  LK KN REVF SY NTCPHQGSRTIRTEEAIFISLQYF
Sbjct: 294 RHLLIAFGGLAGLEECIEEDSNLKGKNVREVFDSYLNTCPHQGSRTIRTEEAIFISLQYF 353

Query: 357 QEPISRALRRVE 368
           QEPI+RAL RV+
Sbjct: 354 QEPINRALHRVQ 365


>gi|147790065|emb|CAN75987.1| hypothetical protein VITISV_012192 [Vitis vinifera]
          Length = 363

 Score =  517 bits (1331), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 262/374 (70%), Positives = 300/374 (80%), Gaps = 16/374 (4%)

Query: 1   MGNKKKR-GGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIE 59
           MG KKKR    E E       EN+ ++NE  L NGDS S   K K  + + Q  D     
Sbjct: 1   MGKKKKRRSDFEAET------ENNTAENETELINGDSRSKKKKNKTHKDKYQATD----- 49

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           +PT+++AVPGSII N QSLELATRLAGQ+ARA TIFRIDEVVVFD KS+S +   +S   
Sbjct: 50  IPTLTIAVPGSIIHNAQSLELATRLAGQVARAATIFRIDEVVVFDCKSTSGD---NSTVA 106

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
             + SDENE+GAAFL+R+L+YLETPQYLRK LF  H+SL+FVGMLPPLDAPHHLRKHEW 
Sbjct: 107 TPDASDENETGAAFLIRILRYLETPQYLRKILFPKHNSLKFVGMLPPLDAPHHLRKHEWG 166

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
           P+REGVTLKE AP+SVGTLVDVGLNK+VV+DQVL+PG+RVTVAMGTNRNLDAD   QVV 
Sbjct: 167 PYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDADFVHQVVS 226

Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
            SKP+E  G YWGYKVRYA NISSVFK C +KGGYDHLIGTSEHG IV SS+L +P+FRH
Sbjct: 227 SSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSELDIPSFRH 286

Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           LLIAFGGLAGLEES+EED  LK KN RE+F SY NTCP+QGSRTIRTEEAI ISLQYFQE
Sbjct: 287 LLIAFGGLAGLEESVEEDHSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAILISLQYFQE 346

Query: 359 PISRALRRVEYKSK 372
           PI+RAL+R EY+S+
Sbjct: 347 PINRALQRFEYQSQ 360


>gi|225432580|ref|XP_002277845.1| PREDICTED: uncharacterized protein C9orf114 isoform 1 [Vitis
           vinifera]
 gi|297737011|emb|CBI26212.3| unnamed protein product [Vitis vinifera]
          Length = 363

 Score =  515 bits (1326), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 261/374 (69%), Positives = 299/374 (79%), Gaps = 16/374 (4%)

Query: 1   MGNKKKR-GGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIE 59
           MG KKKR    E E       EN+ ++NE  L NGDS S   K K  + + Q  D     
Sbjct: 1   MGKKKKRRSDFEAET------ENNTAENETELINGDSRSKKKKNKTHKDKYQATD----- 49

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           +PT+++AVPGSII N QSLELATRLAGQIARA TIFRIDEVVVFD KS+S +    S   
Sbjct: 50  IPTLTIAVPGSIIHNAQSLELATRLAGQIARAATIFRIDEVVVFDCKSTSGD---DSTVA 106

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
             + SDENE+G AFL+R+L+YLETPQYLRK LF  H+SL+FVGMLPP+DAPHHLRKHEW 
Sbjct: 107 TPDASDENETGPAFLIRILRYLETPQYLRKTLFPKHNSLKFVGMLPPVDAPHHLRKHEWG 166

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
           P+REGVTLKE AP+SVGTLVDVGLNK+VV+DQVL+PG+RVTVAMGTNRNLDAD   QVV 
Sbjct: 167 PYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDADFVHQVVS 226

Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
            SKP+E  G YWGYKVRYA NISSVFK C +KGGYDHLIGTSEHG IV SS+L +P+FRH
Sbjct: 227 SSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSELDIPSFRH 286

Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           LLIAFGGLAGLEES+EED+ LK KN RE+F SY NTCP+QGSRTIRTEEAI ISLQYFQE
Sbjct: 287 LLIAFGGLAGLEESVEEDNSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAILISLQYFQE 346

Query: 359 PISRALRRVEYKSK 372
           PI+RAL+R EY+S+
Sbjct: 347 PINRALQRFEYRSQ 360


>gi|359477498|ref|XP_003631986.1| PREDICTED: uncharacterized protein C9orf114 isoform 2 [Vitis
           vinifera]
          Length = 359

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 257/374 (68%), Positives = 293/374 (78%), Gaps = 20/374 (5%)

Query: 1   MGNKKKR-GGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIE 59
           MG KKKR    E E       EN+ ++NE  L NGDS S   K K  + + Q  D     
Sbjct: 1   MGKKKKRRSDFEAET------ENNTAENETELINGDSRSKKKKNKTHKDKYQATD----- 49

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           +PT+++AVPGSII N QSLELATRLAGQIARA TIFRIDEV +  N S S          
Sbjct: 50  IPTLTIAVPGSIIHNAQSLELATRLAGQIARAATIFRIDEVCIVMNSSRSVEL------- 102

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
             N SDENE+G AFL+R+L+YLETPQYLRK LF  H+SL+FVGMLPP+DAPHHLRKHEW 
Sbjct: 103 LINASDENETGPAFLIRILRYLETPQYLRKTLFPKHNSLKFVGMLPPVDAPHHLRKHEWG 162

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
           P+REGVTLKE AP+SVGTLVDVGLNK+VV+DQVL+PG+RVTVAMGTNRNLDAD   QVV 
Sbjct: 163 PYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDADFVHQVVS 222

Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
            SKP+E  G YWGYKVRYA NISSVFK C +KGGYDHLIGTSEHG IV SS+L +P+FRH
Sbjct: 223 SSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSELDIPSFRH 282

Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           LLIAFGGLAGLEES+EED+ LK KN RE+F SY NTCP+QGSRTIRTEEAI ISLQYFQE
Sbjct: 283 LLIAFGGLAGLEESVEEDNSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAILISLQYFQE 342

Query: 359 PISRALRRVEYKSK 372
           PI+RAL+R EY+S+
Sbjct: 343 PINRALQRFEYRSQ 356


>gi|449432714|ref|XP_004134144.1| PREDICTED: uncharacterized protein C9orf114 homolog [Cucumis
           sativus]
          Length = 367

 Score =  490 bits (1261), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 249/374 (66%), Positives = 292/374 (78%), Gaps = 21/374 (5%)

Query: 1   MGNKKKRGGLEPELK-EAATGENHESQNELSLANGDS--------SSCDNKKKKKRKRDQ 51
           MG KK +    PE + EA   ++ E+ +E  L NG S           +N   +  KR  
Sbjct: 1   MGKKKHK---RPEQESEALDRDDFEANHETELINGCSPEKKKKKKKKVENGSIEAEKRK- 56

Query: 52  LNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDN 111
                PI  PTVS+AV GSIIDN QSLELATRLAGQIARA TIFRI+EVVVFD+  SS  
Sbjct: 57  -----PISKPTVSIAVSGSIIDNAQSLELATRLAGQIARAATIFRINEVVVFDSGRSSTT 111

Query: 112 YSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
            S  +AAN S   DE+ESGAAFL+R+L+YLETPQYLRKALF  H++LRFVGMLPPLDAPH
Sbjct: 112 GSEVAAANNS---DEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 168

Query: 172 HLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDA 231
           HLRKHEW P+REGVTLKE AP++ GT VDVGL+K+VVVD++L+PG RVTVAMGT+RNL +
Sbjct: 169 HLRKHEWGPYREGVTLKERAPDAKGTSVDVGLSKNVVVDEILEPGTRVTVAMGTDRNLFS 228

Query: 232 DSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
           D PRQVV  SKP E G+YWGY+VRYA ++S+VFK  SY+GGYDHLIGTSEHG ++ SS+L
Sbjct: 229 DLPRQVVSSSKPVEEGLYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGMVIKSSEL 288

Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
           TLP FRHLLIAFGGLAGLEESIEED+  K KNA E+F SY NTCP QGSRTIRTEEAIFI
Sbjct: 289 TLPPFRHLLIAFGGLAGLEESIEEDNNFKSKNAHEIFSSYLNTCPLQGSRTIRTEEAIFI 348

Query: 352 SLQYFQEPISRALR 365
           SLQYFQEPI++A++
Sbjct: 349 SLQYFQEPINKAMQ 362


>gi|449520559|ref|XP_004167301.1| PREDICTED: uncharacterized protein C9orf114 homolog [Cucumis
           sativus]
          Length = 368

 Score =  489 bits (1260), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 249/375 (66%), Positives = 292/375 (77%), Gaps = 22/375 (5%)

Query: 1   MGNKKKRGGLEPELK-EAATGENHESQNELSLANGDS---------SSCDNKKKKKRKRD 50
           MG KK +    PE + EA   ++ E+ +E  L NG S            +N   +  KR 
Sbjct: 1   MGKKKHK---RPEQESEALDRDDFEANHETELINGCSPEKKKKKKKKKVENGSIEAEKRK 57

Query: 51  QLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD 110
                 PI  PTVS+AV GSIIDN QSLELATRLAGQIARA TIFRI+EVVVFD+  SS 
Sbjct: 58  ------PISKPTVSIAVSGSIIDNAQSLELATRLAGQIARAATIFRINEVVVFDSGRSST 111

Query: 111 NYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAP 170
             S  +AAN S   DE+ESGAAFL+R+L+YLETPQYLRKALF  H++LRFVGMLPPLDAP
Sbjct: 112 TGSEVAAANNS---DEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAP 168

Query: 171 HHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
           HHLRKHEW P+REGVTLKE AP++ GT VDVGL+K+VVVD++L+PG RVTVAMGT+RNL 
Sbjct: 169 HHLRKHEWGPYREGVTLKERAPDAKGTSVDVGLSKNVVVDEILEPGTRVTVAMGTDRNLF 228

Query: 231 ADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
           +D PRQVV  SKP E G+YWGY+VRYA ++S+VFK  SY+GGYDHLIGTSEHG ++ SS+
Sbjct: 229 SDLPRQVVSSSKPVEEGLYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGMVIKSSE 288

Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
           LTLP FRHLLIAFGGLAGLEESIEED+  K KNA E+F SY NTCP QGSRTIRTEEAIF
Sbjct: 289 LTLPPFRHLLIAFGGLAGLEESIEEDNNFKSKNAHEIFSSYLNTCPLQGSRTIRTEEAIF 348

Query: 351 ISLQYFQEPISRALR 365
           ISLQYFQEPI++A++
Sbjct: 349 ISLQYFQEPINKAMQ 363


>gi|42567956|ref|NP_197431.2| uncharacterized protein [Arabidopsis thaliana]
 gi|45825145|gb|AAS77480.1| At5g19300 [Arabidopsis thaliana]
 gi|110741745|dbj|BAE98818.1| hypothetical protein [Arabidopsis thaliana]
 gi|332005299|gb|AED92682.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 398

 Score =  475 bits (1222), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 230/307 (74%), Positives = 265/307 (86%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+ GSII NTQSLELATRLAGQIARA TIFRIDE+VVFDNKSSS+     SAA  +
Sbjct: 99  TVSIAIAGSIIHNTQSLELATRLAGQIARAATIFRIDEIVVFDNKSSSEI---ESAA--T 153

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
           N SD NESGA+FLVR+L+YLETPQYLRK+LF   + LR+VGMLPPLDAPHHLRKHEW  +
Sbjct: 154 NASDSNESGASFLVRILKYLETPQYLRKSLFPKQNDLRYVGMLPPLDAPHHLRKHEWEQY 213

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGVTL E APNS GTLVDVGL+K VVVDQVL PG+RVTVAMGT+ +LD    RQ+VPPS
Sbjct: 214 REGVTLSEKAPNSEGTLVDVGLSKSVVVDQVLGPGIRVTVAMGTDHDLDL--VRQIVPPS 271

Query: 242 KPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
           KP+E +GMYWGYKVRYA  +SSVFK C ++GGYD+LIGTSEHG +++SS+L +PTFRHLL
Sbjct: 272 KPREEAGMYWGYKVRYASQLSSVFKECPFEGGYDYLIGTSEHGLVISSSELKIPTFRHLL 331

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
           IAFGGLAGLEESIE+D+  K KN R+VF+ Y NTCPHQGSRTIR EEA+FISLQYFQEPI
Sbjct: 332 IAFGGLAGLEESIEDDNQYKGKNVRDVFNVYLNTCPHQGSRTIRAEEAMFISLQYFQEPI 391

Query: 361 SRALRRV 367
           SRA+RR+
Sbjct: 392 SRAVRRL 398


>gi|297807989|ref|XP_002871878.1| hypothetical protein ARALYDRAFT_909964 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317715|gb|EFH48137.1| hypothetical protein ARALYDRAFT_909964 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 381

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 230/307 (74%), Positives = 264/307 (85%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+ GSII NTQSLELATRLAGQIARA TIFRIDE+VVFDNKSSS+     SAA   
Sbjct: 82  TVSIAIAGSIIHNTQSLELATRLAGQIARAATIFRIDEIVVFDNKSSSEI---ESAA--M 136

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
           N SD NESGA+FLVR+L+YLETPQYLRK+LF   + LR+VGMLPPLDAPHHLRKHEW  +
Sbjct: 137 NPSDSNESGASFLVRILKYLETPQYLRKSLFPKQNDLRYVGMLPPLDAPHHLRKHEWEQY 196

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGVTL E APNS GT+VDVGL+K VVVDQVL PGVRVTVAMGT+ +LD    RQ+VPPS
Sbjct: 197 REGVTLNEKAPNSEGTMVDVGLSKSVVVDQVLSPGVRVTVAMGTDHDLDL--VRQIVPPS 254

Query: 242 KPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
           KP+E +GMYWGYKVRYA  +SSVFK C ++GGYD+LIGTSEHG +++SS+L +PTFRHLL
Sbjct: 255 KPREEAGMYWGYKVRYASQLSSVFKECPFQGGYDYLIGTSEHGVVISSSELKIPTFRHLL 314

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
           IAFGGLAGLEESIE+D+  K KN R+VF+ Y NTCPHQGSRTIR EEA+FISLQYFQEPI
Sbjct: 315 IAFGGLAGLEESIEDDNQYKGKNVRDVFNIYLNTCPHQGSRTIRAEEAMFISLQYFQEPI 374

Query: 361 SRALRRV 367
           SRA+RR+
Sbjct: 375 SRAVRRL 381


>gi|224111238|ref|XP_002315789.1| predicted protein [Populus trichocarpa]
 gi|222864829|gb|EEF01960.1| predicted protein [Populus trichocarpa]
          Length = 338

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 230/318 (72%), Positives = 262/318 (82%), Gaps = 9/318 (2%)

Query: 54  DDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEV-VVFDNKSSSDNY 112
           D    E+PTVS+AV GSII+N QSLELATRLAGQIARA TIFRIDEV ++  N  +S + 
Sbjct: 28  DKEAKEIPTVSIAVSGSIINNAQSLELATRLAGQIARAATIFRIDEVSILIFNGFTSSSE 87

Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH 172
                   +N SDENESG AF +R+L+YLETPQYLRKALF  H++LRFVGMLPPLDAPHH
Sbjct: 88  KEDPTVTTNNNSDENESGGAFFIRILRYLETPQYLRKALFPKHNNLRFVGMLPPLDAPHH 147

Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
           LRKHEWAPFREGVTL E   NS  TLVDVGL+K+VVV+Q L+PG+RVTVAMGTNRNLD D
Sbjct: 148 LRKHEWAPFREGVTLNEKVSNSGETLVDVGLSKNVVVNQALEPGIRVTVAMGTNRNLDDD 207

Query: 233 SPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
           SPRQVV  SKP+ E+G+YWGY+VRYA NISSVFK+C Y+GGYD LIGTSEHG I+NSS+L
Sbjct: 208 SPRQVVSLSKPREEAGLYWGYRVRYASNISSVFKDCPYRGGYDLLIGTSEHGLIINSSEL 267

Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
           +LPTFRHLLIAFGGLAGLEESIEED  LK K+ REVF SY NTCPHQGSRTIRTE     
Sbjct: 268 SLPTFRHLLIAFGGLAGLEESIEEDSNLKGKDVREVFDSYLNTCPHQGSRTIRTE----- 322

Query: 352 SLQYFQEPISRALRRVEY 369
              YFQEPISRA++RV++
Sbjct: 323 --VYFQEPISRAMQRVQH 338


>gi|356575809|ref|XP_003556029.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           C9orf114-like [Glycine max]
          Length = 353

 Score =  450 bits (1158), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 216/293 (73%), Positives = 247/293 (84%), Gaps = 7/293 (2%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           +PTVS+AVP SIIDN  +LELATRLAGQIARA TIFRI+EVVVFDNKS+ DN S      
Sbjct: 43  IPTVSIAVPASIIDNVPTLELATRLAGQIARAATIFRINEVVVFDNKSNPDNDSVLDNVG 102

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
                  +ESGAAFL+R+LQYLETPQYLRKALF MH+SLRFVG+LPPLDAPHHLRKHEW 
Sbjct: 103 -------DESGAAFLMRILQYLETPQYLRKALFPMHNSLRFVGLLPPLDAPHHLRKHEWF 155

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
           P+REGVT+KE   NS  TLVDVGL K+V+VDQ+ +PG RVTVAMGT+RNLD+D PRQV+ 
Sbjct: 156 PYREGVTVKERDSNSGATLVDVGLVKNVIVDQIFEPGRRVTVAMGTDRNLDSDLPRQVIS 215

Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
            SKP+E G YWGY+VRYA NIS+VFK+C+YK GYD +IGTSEHG I+ SSDL +P+FRHL
Sbjct: 216 SSKPREEGTYWGYQVRYAHNISAVFKDCAYKRGYDFIIGTSEHGQIIKSSDLEIPSFRHL 275

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
           LIAFGGLAGLEESIEEDD LK K A++ F+ Y NTCPHQGSRTIRTEEAIFIS
Sbjct: 276 LIAFGGLAGLEESIEEDDNLKGKKAQDAFNLYLNTCPHQGSRTIRTEEAIFIS 328


>gi|363807614|ref|NP_001242411.1| uncharacterized protein LOC100786821 [Glycine max]
 gi|255639023|gb|ACU19812.1| unknown [Glycine max]
          Length = 352

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 217/294 (73%), Positives = 248/294 (84%), Gaps = 8/294 (2%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           +PTVS+AVPGSIIDN  +LELATRLAGQIARA TIFRI+EVVVFDNKS+ DN    S  +
Sbjct: 43  IPTVSIAVPGSIIDNVPTLELATRLAGQIARAATIFRINEVVVFDNKSNPDN---DSVLD 99

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
            SN    +ESGA+FL+++LQYLETPQYLRKALF MH+SLRFVG+LPPLDAPHHLRKHEW 
Sbjct: 100 NSN----DESGASFLMQILQYLETPQYLRKALFPMHNSLRFVGLLPPLDAPHHLRKHEWG 155

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
           P+REGVT KE   NS  TLVDVGL K+V+VDQ+  PG RVTVAMGT+RNLD+D PRQVV 
Sbjct: 156 PYREGVTAKERHLNSGATLVDVGLVKNVIVDQIFKPGRRVTVAMGTDRNLDSDLPRQVVS 215

Query: 240 PSKP-KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
            SKP +E G YWGY+VRYA NIS+VFK+C+YK GYD +IGTSEHG I+ SSDL +P+FRH
Sbjct: 216 SSKPTREEGTYWGYQVRYAHNISAVFKDCAYKRGYDFIIGTSEHGQIIKSSDLEIPSFRH 275

Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
           LLIAFGGLAGLEESIEEDD LK K A++ F+ Y NTCPHQGSRTIRTEEAI IS
Sbjct: 276 LLIAFGGLAGLEESIEEDDNLKGKKAQDAFNLYLNTCPHQGSRTIRTEEAILIS 329


>gi|226510311|ref|NP_001149839.1| conserved protein [Zea mays]
 gi|195634967|gb|ACG36952.1| conserved protein [Zea mays]
          Length = 365

 Score =  437 bits (1123), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 215/308 (69%), Positives = 250/308 (81%), Gaps = 12/308 (3%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
           PTVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDEVVVFD+  +++N S       
Sbjct: 69  PTVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEVVVFDSTPAAENGSAG----- 123

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
               D  ESGA FLVR+L+YLETPQYLR+ LF MH +L+FVG+LPPLDAPHH+RKHEW+ 
Sbjct: 124 ----DGEESGARFLVRILEYLETPQYLRRRLFPMHKNLKFVGLLPPLDAPHHVRKHEWSE 179

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
           FREGVTL+ +   S GTLVDVGL+K+V+V+Q L+PG RVTVAMGTNR+L     R+VVPP
Sbjct: 180 FREGVTLESDP--SKGTLVDVGLSKNVLVEQKLEPGKRVTVAMGTNRDLTTACIRKVVPP 237

Query: 241 SKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
           S P+E  G YWGYKVRY  N+S VFKN  +K  YDH+IGTSEHG IVNSS+LTLPTFRHL
Sbjct: 238 STPREQMGSYWGYKVRYTSNLSGVFKNSPFKEEYDHIIGTSEHGQIVNSSELTLPTFRHL 297

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           LIAFGGLAGLEESIEED  LK K A +VF SYFNTCP+QGSRTIRTEEA+ ISLQYFQ+P
Sbjct: 298 LIAFGGLAGLEESIEEDTNLKGKRADDVFTSYFNTCPNQGSRTIRTEEALLISLQYFQDP 357

Query: 360 ISRALRRV 367
           I RA +++
Sbjct: 358 IRRAEQKL 365


>gi|242092392|ref|XP_002436686.1| hypothetical protein SORBIDRAFT_10g007030 [Sorghum bicolor]
 gi|241914909|gb|EER88053.1| hypothetical protein SORBIDRAFT_10g007030 [Sorghum bicolor]
          Length = 367

 Score =  429 bits (1104), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 211/308 (68%), Positives = 251/308 (81%), Gaps = 14/308 (4%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
           PTVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDEVVVFD+  +++N         
Sbjct: 73  PTVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEVVVFDSTPAAEN--------- 123

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
              +D+ ESGA FLVR+L+YLETPQYLR+ LF MH +L+FVG+LPPLDAPHH+RKHEW+ 
Sbjct: 124 GGAADDEESGARFLVRILEYLETPQYLRRRLFPMHKNLKFVGLLPPLDAPHHVRKHEWSE 183

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
           FREGVTL+ +   S GTLVDVGL+K+V+V+Q L+PG RVTVAMGTNR+L   + R+VVPP
Sbjct: 184 FREGVTLESDP--SKGTLVDVGLSKNVLVEQTLEPGKRVTVAMGTNRDLT--TARKVVPP 239

Query: 241 SKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
           S P+E  G YWGYKVRY  N+S V KN  +K  YDH+IGTSEHG I+NSS+LTLPTFRHL
Sbjct: 240 STPREQMGSYWGYKVRYTSNLSGVLKNSPFKEEYDHIIGTSEHGQIINSSELTLPTFRHL 299

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           LIAFGGLAGLEESIEED  LK K+A +VF SY NTCP+QGSRTIRTEEA+ ISLQYFQ+P
Sbjct: 300 LIAFGGLAGLEESIEEDTNLKGKHADDVFTSYLNTCPNQGSRTIRTEEALLISLQYFQDP 359

Query: 360 ISRALRRV 367
           I RA +++
Sbjct: 360 IRRAEQKM 367


>gi|357132223|ref|XP_003567731.1| PREDICTED: uncharacterized protein C9orf114-like [Brachypodium
           distachyon]
          Length = 362

 Score =  427 bits (1098), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 213/324 (65%), Positives = 249/324 (76%), Gaps = 13/324 (4%)

Query: 43  KKKKRKRDQLNDDAPIE--VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEV 100
           KKKKR +D        +   PTVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDE+
Sbjct: 48  KKKKRNKDTEEGKKTDQNRKPTVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEI 107

Query: 101 VVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF 160
           VVFD+    +N S + A        E ESGA FL+R+LQYLETPQYLR+ LF MH + +F
Sbjct: 108 VVFDSCPPVENASGAEA--------EEESGAQFLIRILQYLETPQYLRRRLFPMHKNFKF 159

Query: 161 VGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVT 220
           VG+LPPLDAPHH+R+HEW+ FREGVTL      S GTLVDVGL+K ++V+Q+L+PG RVT
Sbjct: 160 VGLLPPLDAPHHVRRHEWSEFREGVTL--GGDRSKGTLVDVGLSKDILVEQILEPGKRVT 217

Query: 221 VAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGT 279
           VAMGTNR++     R+VVPPS P  E G YWGYKVRYA N+S VF    YK GYDH+IGT
Sbjct: 218 VAMGTNRDITTACVRKVVPPSSPSDEMGSYWGYKVRYASNLSGVFNGSPYKEGYDHIIGT 277

Query: 280 SEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQG 339
           SEHG+ + SS+LTLP FRHLLIAFGGLAGLEESIEED  LK K A++VF SY NTCP+QG
Sbjct: 278 SEHGEAIGSSELTLPAFRHLLIAFGGLAGLEESIEEDINLKGKGAKDVFTSYLNTCPNQG 337

Query: 340 SRTIRTEEAIFISLQYFQEPISRA 363
           SRTIRTEEA+ ISLQYFQ+PI RA
Sbjct: 338 SRTIRTEEALLISLQYFQDPIRRA 361


>gi|115457406|ref|NP_001052303.1| Os04g0244500 [Oryza sativa Japonica Group]
 gi|113563874|dbj|BAF14217.1| Os04g0244500 [Oryza sativa Japonica Group]
 gi|116310997|emb|CAH67931.1| H0211F06-OSIGBa0153M17.3 [Oryza sativa Indica Group]
 gi|215686888|dbj|BAG89738.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 358

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 202/303 (66%), Positives = 242/303 (79%), Gaps = 14/303 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDEVVVFD+ SS +N          
Sbjct: 68  TVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEVVVFDSNSSVEN---------- 117

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
              D+ ESGA FLVR+LQYLETPQYLR+ LF MH++L+FVG+LPPLDAPHHLRKHEW+ F
Sbjct: 118 -SGDDVESGARFLVRILQYLETPQYLRRRLFPMHNNLKFVGLLPPLDAPHHLRKHEWSEF 176

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGVTL  +   S+GT VDVGL+K+V+V+Q+L+PG RVT+AMGTNR++     R++V PS
Sbjct: 177 REGVTL--DGDRSMGTFVDVGLSKNVLVEQMLEPGKRVTIAMGTNRDITTACKRKIVSPS 234

Query: 242 KPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
            P+ E  +YWGYKVRYA N+  VF +  YK GYD++IGTSEHG I++SS+L LP+FRHLL
Sbjct: 235 SPRDEMELYWGYKVRYASNLGGVFSDSPYKEGYDYIIGTSEHGKIISSSELILPSFRHLL 294

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
           IAFGGLAGLEE IEED  LK K   +VF++Y NTCP QGSRTIRTEEA+ ISLQYFQ+PI
Sbjct: 295 IAFGGLAGLEECIEEDRNLKGKGVDDVFNTYLNTCPSQGSRTIRTEEALLISLQYFQDPI 354

Query: 361 SRA 363
            RA
Sbjct: 355 RRA 357


>gi|255648220|gb|ACU24563.1| unknown [Glycine max]
          Length = 307

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 196/273 (71%), Positives = 225/273 (82%), Gaps = 9/273 (3%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           +PTVS+AVP SIIDN  +LELATRLAGQIARA TIFRI+EVVVFDNKS+ DN S      
Sbjct: 43  IPTVSIAVPASIIDNVPTLELATRLAGQIARAATIFRINEVVVFDNKSNPDNDSVLDNVG 102

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
                  +ESGAAFL+R+LQYLETPQYLRKALF MH+SLRFVG+LPPLDAPHHLRKHEW 
Sbjct: 103 -------DESGAAFLMRILQYLETPQYLRKALFPMHNSLRFVGLLPPLDAPHHLRKHEWF 155

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
           P+REGVT+KE   NS  TLVDVGL K+V+VDQ+ +PG RVTVAMGT+RNLD+D PRQV+ 
Sbjct: 156 PYREGVTVKERDSNSGATLVDVGLVKNVIVDQIFEPGRRVTVAMGTDRNLDSDLPRQVIS 215

Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
            SKP+E G YWGY+VRYA NIS+VFK+C+YK GYD +IGTSEHG I+ SSDL +P+FRHL
Sbjct: 216 SSKPREEGTYWGYQVRYAHNISAVFKDCAYKRGYDFIIGTSEHGQIIKSSDLEIPSFRHL 275

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYF 332
           LIAFGGLAGLEESIEEDD LK K  R   HS +
Sbjct: 276 LIAFGGLAGLEESIEEDDNLKGKRHR--VHSIY 306


>gi|294461137|gb|ADE76133.1| unknown [Picea sitchensis]
          Length = 364

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 195/307 (63%), Positives = 233/307 (75%), Gaps = 9/307 (2%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN--KSSSDNYSRSSA 117
           V TVS+AV GSIIDN QS ELATRLAGQIARA TIFRIDEV+VFD+  K+++ N S   A
Sbjct: 62  VQTVSIAVAGSIIDNAQSSELATRLAGQIARAATIFRIDEVIVFDDQAKATTSNISIWDA 121

Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
                  ++N+SGA F+ R+L YL+ PQYLR+ L   H+ LR VG+LPPLDAPHH RKHE
Sbjct: 122 ------DEKNKSGALFMARILNYLDIPQYLRRTLVPRHNDLRHVGLLPPLDAPHHARKHE 175

Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
           W P+REGVTL   +    GT VDVGL+K V + Q  +PG+RVTVAMG+ R+L+ D  RQ 
Sbjct: 176 WVPYREGVTLDRTSVAGGGTFVDVGLHKAVCIQQTFEPGLRVTVAMGSKRSLEDDCIRQA 235

Query: 238 VPPSKP-KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           V  S+P K+ G+YWGY VRY+ NIS VFK C Y+GGYD+ IGTSEHGD ++S DL +P F
Sbjct: 236 VDRSEPNKKMGLYWGYNVRYSSNISCVFKECPYQGGYDYSIGTSEHGDKISSGDLEIPKF 295

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           RHLLIAFGGLAGLEE+IE+D  L  KN  ++F  Y NTCP QGSRTIRTEEAIFISLQYF
Sbjct: 296 RHLLIAFGGLAGLEENIEQDGDLMGKNVTQLFSRYLNTCPKQGSRTIRTEEAIFISLQYF 355

Query: 357 QEPISRA 363
           Q+PI RA
Sbjct: 356 QDPIQRA 362


>gi|38346841|emb|CAD39923.2| OSJNBa0091C12.1 [Oryza sativa Japonica Group]
          Length = 361

 Score =  353 bits (907), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 172/266 (64%), Positives = 209/266 (78%), Gaps = 14/266 (5%)

Query: 82  TRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYL 141
           T LAGQIARA T+FRIDEVVVFD+ SS +N             D+ ESGA FLVR+LQYL
Sbjct: 67  TTLAGQIARAATVFRIDEVVVFDSNSSVEN-----------SGDDVESGARFLVRILQYL 115

Query: 142 ETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDV 201
           ETPQYLR+ LF MH++L+FVG+LPPLDAPHHLRKHEW+ FREGVTL  +   S+GT VDV
Sbjct: 116 ETPQYLRRRLFPMHNNLKFVGLLPPLDAPHHLRKHEWSEFREGVTL--DGDRSMGTFVDV 173

Query: 202 GLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNI 260
           GL+K+V+V+Q+L+PG RVT+AMGTNR++     R++V PS P+ E  +YWGYKVRYA N+
Sbjct: 174 GLSKNVLVEQMLEPGKRVTIAMGTNRDITTACKRKIVSPSSPRDEMELYWGYKVRYASNL 233

Query: 261 SSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLK 320
             VF +  YK GYD++IGTSEHG I++SS+L LP+FRHLLIAFGGLAGLEE IEED  LK
Sbjct: 234 GGVFSDSPYKEGYDYIIGTSEHGKIISSSELILPSFRHLLIAFGGLAGLEECIEEDRNLK 293

Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTE 346
            K   +VF++Y NTCP QGSRTIRTE
Sbjct: 294 GKGVDDVFNTYLNTCPSQGSRTIRTE 319


>gi|302756383|ref|XP_002961615.1| hypothetical protein SELMODRAFT_76235 [Selaginella moellendorffii]
 gi|300170274|gb|EFJ36875.1| hypothetical protein SELMODRAFT_76235 [Selaginella moellendorffii]
          Length = 329

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 175/308 (56%), Positives = 221/308 (71%), Gaps = 10/308 (3%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN-KSSSDNYSRSSAANR 120
           TVS+AV GSII+N QSLELATRLAGQIARA TIFRIDE+VVFD+ +SS+ ++ R+     
Sbjct: 25  TVSLAVSGSIINNAQSLELATRLAGQIARAATIFRIDEIVVFDDGESSASSFQRTGWGRE 84

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
           +  +    SG  FL R+L+YLE PQYLR+AL  MH SLR+ G LPPLD+PHHLRK  W P
Sbjct: 85  TGEA----SGGEFLARVLRYLEVPQYLRRALVPMHKSLRYAGQLPPLDSPHHLRKQAWLP 140

Query: 181 FREGVTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPRQV 237
           FREGV L ++   S G     DVGL + V++ + ++ G R+TVAMG T++NL       V
Sbjct: 141 FREGVALDKSFEKSRGNGCYADVGLGQDVLIQETVNAGTRITVAMGSTDKNLK-RGKLSV 199

Query: 238 VPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           VP S P+ES G+YWGY VR    ++  F + S++ GYD+ IGTSEHG+ V +SDL +P+F
Sbjct: 200 VPSSTPRESAGLYWGYTVRVVDRLTKAFSDSSFEEGYDYTIGTSEHGEKVLTSDLRVPSF 259

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           RH LI FGG AGLEES+E D     ++ RE+F  Y NTCP QGSRTIRTEEAIFISLQ+ 
Sbjct: 260 RHALIVFGGPAGLEESLELDGSSAVEDVRELFTRYLNTCPLQGSRTIRTEEAIFISLQFL 319

Query: 357 QEPISRAL 364
           QEP+ RA+
Sbjct: 320 QEPLLRAV 327


>gi|302775540|ref|XP_002971187.1| hypothetical protein SELMODRAFT_95276 [Selaginella moellendorffii]
 gi|300161169|gb|EFJ27785.1| hypothetical protein SELMODRAFT_95276 [Selaginella moellendorffii]
          Length = 324

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 168/306 (54%), Positives = 211/306 (68%), Gaps = 15/306 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN-KSSSDNYSRSSAANR 120
           TVS+AV GSII+N QSLELATRLAGQIARA TIFRIDE+VVFD+ +SS+ ++ R+     
Sbjct: 31  TVSLAVSGSIINNAQSLELATRLAGQIARAATIFRIDEIVVFDDGESSASSFQRTGWGRE 90

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
           +  +    SG  FL R+L+YLE PQYLR+AL  MH SLR+ G LPPLD+PHHLRK     
Sbjct: 91  TGEA----SGGEFLARVLRYLEVPQYLRRALVPMHKSLRYAGQLPPLDSPHHLRK----- 141

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPRQVVP 239
             +     E      G   DVGL + V++ + +  G R+TVAMG T++NL       VVP
Sbjct: 142 --QACCHSEKVARGNGCYADVGLGQDVLIQETVKAGTRITVAMGSTDKNLK-RGKLSVVP 198

Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
            S P+ES G+YWGY VR    ++  F + S++ GYD+ IGTSEHG+ V +SDL +P+FRH
Sbjct: 199 SSTPRESAGLYWGYTVRVVDRLTKAFSDSSFEEGYDYTIGTSEHGEKVLTSDLRVPSFRH 258

Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
            LI FGG AGLEES+E D     K+ RE+F  Y NTCP QGSRTIRTEEAIFISLQ+ QE
Sbjct: 259 ALIVFGGPAGLEESLELDGSSTVKDVRELFTRYLNTCPLQGSRTIRTEEAIFISLQFLQE 318

Query: 359 PISRAL 364
           P+ RA+
Sbjct: 319 PLLRAV 324


>gi|167998478|ref|XP_001751945.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697043|gb|EDQ83380.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 296

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 159/303 (52%), Positives = 199/303 (65%), Gaps = 10/303 (3%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AV GSI+DN QSL LA+RLAGQIARA  IFRIDE+VVFD+        + S     
Sbjct: 4   TISMAVAGSIVDNAQSLVLASRLAGQIARAAAIFRIDEIVVFDDGDEGSRVPKWSEG--- 60

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                 E+  +FL RLL+Y+ETPQYLR  L   H SL+  G LPPLD PH  RKHEW P+
Sbjct: 61  ----ATETSGSFLCRLLKYMETPQYLRLTLCPKHRSLQCAGSLPPLDVPHQARKHEWIPY 116

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGV          G  V   L + VV+ Q L PG RVTV MG +R        +VVP S
Sbjct: 117 REGVNPTRMYCIEKGISVSSILVQDVVIRQNLQPGTRVTVKMGESRE---QRILEVVPKS 173

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
           +P++ G YWGY VR+A ++S+      +  GYD++IGTSEHG  V+SS+L LP F+HLL+
Sbjct: 174 EPRDKGHYWGYAVRHASSLSTAISESPFPNGYDYVIGTSEHGVKVSSSELVLPKFKHLLV 233

Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPIS 361
            FGG+AGLEES E D  ++  + R++F+ Y N CP QGSRTIRTEEAI ISLQY Q+P+ 
Sbjct: 234 VFGGVAGLEESKELDKTIEVSDVRDLFNIYLNVCPGQGSRTIRTEEAILISLQYLQDPLM 293

Query: 362 RAL 364
           RAL
Sbjct: 294 RAL 296


>gi|260800799|ref|XP_002595284.1| hypothetical protein BRAFLDRAFT_232338 [Branchiostoma floridae]
 gi|229280529|gb|EEN51296.1| hypothetical protein BRAFLDRAFT_232338 [Branchiostoma floridae]
          Length = 327

 Score =  298 bits (762), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 153/307 (49%), Positives = 200/307 (65%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGSI+DN QS EL T LAGQIARA  IF +DE+V+FD   S+   +  +     
Sbjct: 15  TVSVALPGSILDNAQSPELRTYLAGQIARACVIFNVDEIVIFDESGSTQPSTEGTFTGVG 74

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            + + N      + R+LQYLE PQYLRK  F  H  L++ G+L PLD PHH+R+HE + +
Sbjct: 75  KKGNANVQ----IARILQYLECPQYLRKVFFPKHQDLQYAGLLNPLDCPHHVREHELSEY 130

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRN-LDADSPRQVVPP 240
           REGV L        G+ V+VG+ + V +D+VL PG+RVTV +    N +      +VV P
Sbjct: 131 REGVVLDRPTKEGKGSFVNVGMRREVRIDKVLQPGLRVTVQLDEPGNTVSRVLTGKVVSP 190

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
           S P+ E+G+YWGY VR AP++ +VF    +KGGYD  IGTSE G+  ++ ++ LP+F HL
Sbjct: 191 SVPRTEAGLYWGYTVRLAPSLGAVFTESPFKGGYDLTIGTSERGE--STDEVFLPSFSHL 248

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           LI FGGL GLE S+E D+ L   +AR +FH Y NTCP QGSRTIRTEEAI ISL   +  
Sbjct: 249 LIVFGGLKGLEYSVECDEALDIADARVMFHYYLNTCPTQGSRTIRTEEAILISLSGLRPK 308

Query: 360 ISRALRR 366
           I  AL R
Sbjct: 309 IHEALIR 315


>gi|193718507|ref|XP_001944777.1| PREDICTED: uncharacterized protein C9orf114 homolog [Acyrthosiphon
           pisum]
          Length = 383

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 163/353 (46%), Positives = 218/353 (61%), Gaps = 20/353 (5%)

Query: 12  PELKEAATGENHESQNELSLANGDS------SSCDNKKKKKRKRDQLNDDAPIEVPTVSV 65
           PE+ E   G   E +N++ + + +        S D  K  +   D++      ++PTV++
Sbjct: 30  PEVIEINEGLETEHENKIKITDPEKVEVIIDDSIDTSKTTEIS-DKIKPSKVRQLPTVTI 88

Query: 66  AVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSD 125
           AVPGSI+DN QS E  T LAGQIARA  I++IDE+VVFD+     N   S+      +++
Sbjct: 89  AVPGSILDNAQSPEFRTYLAGQIARAACIYKIDEIVVFDDVGEQTNVMVSN-----KKTE 143

Query: 126 ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGV 185
                   L  +LQYLE PQYLR+ LF +H+ L++ G+L PLDAPHH R+ +  PFREGV
Sbjct: 144 GGSKTCKQLAIILQYLECPQYLRRILFPIHNFLKYTGVLNPLDAPHHFRQEDNVPFREGV 203

Query: 186 TLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ---VVPPSK 242
                     G  VDVGL KHV+VD+ L P VRVTV +  N +   DS RQ   VV PS 
Sbjct: 204 VSTLPPKKGKGCNVDVGLRKHVLVDKCLQPYVRVTVKLIPNSD---DSKRQKGIVVAPST 260

Query: 243 PK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT-LPTFRHLL 300
           PK ESG+YWGY VR A +I+ VF  C Y GGYD  IGTS+ G  ++ +D T L +F+H L
Sbjct: 261 PKNESGIYWGYTVRNAESINEVFTKCPYPGGYDLKIGTSDKGVDIDQTDHTQLSSFKHAL 320

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           I FGG+ GLE ++E D  +  +N   +F+ Y NTCP+QGSRTIRTEEA+FI+L
Sbjct: 321 ICFGGVHGLEAALEADQSIDEENPSTLFNVYLNTCPNQGSRTIRTEEAVFITL 373


>gi|239790536|dbj|BAH71823.1| ACYPI008697 [Acyrthosiphon pisum]
          Length = 375

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 163/353 (46%), Positives = 218/353 (61%), Gaps = 20/353 (5%)

Query: 12  PELKEAATGENHESQNELSLANGDS------SSCDNKKKKKRKRDQLNDDAPIEVPTVSV 65
           PE+ E   G   E +N++ + + +        S D  K  +   D++      ++PTV++
Sbjct: 22  PEVMEINEGLETEHENKIKITDPEKVKVIIDDSIDTSKTTEIS-DKVKPSKVRQLPTVTI 80

Query: 66  AVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSD 125
           AVPGSI+DN QS E  T LAGQIARA  I++IDE+VVFD+     N   S+      +++
Sbjct: 81  AVPGSILDNAQSPEFRTYLAGQIARAACIYKIDEIVVFDDVGEQTNVMISN-----KKTE 135

Query: 126 ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGV 185
                   L  +LQYLE PQYLR+ LF +H+ L++ G+L PLDAPHH R+ +  PFREGV
Sbjct: 136 GGSKTCKQLAIILQYLECPQYLRRILFPIHNFLKYTGVLNPLDAPHHFRQEDNVPFREGV 195

Query: 186 TLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ---VVPPSK 242
                     G  VDVGL KHV+VD+ L P VRVTV +  N +   DS RQ   VV PS 
Sbjct: 196 VSTLPPKKGKGCNVDVGLRKHVLVDKCLQPYVRVTVKLIPNSD---DSKRQKGIVVAPST 252

Query: 243 PK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT-LPTFRHLL 300
           PK ESG+YWGY VR A +I+ VF  C Y GGYD  IGTS+ G  ++ +D T L +F+H L
Sbjct: 253 PKNESGIYWGYTVRNAESINEVFTKCPYPGGYDLKIGTSDKGVDIDQTDHTQLSSFKHAL 312

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           I FGG+ GLE ++E D  +  +N   +F+ Y NTCP+QGSRTIRTEEA+FI+L
Sbjct: 313 ICFGGVHGLEAALEADQSIDEENPFTLFNVYLNTCPNQGSRTIRTEEAVFITL 365


>gi|384245829|gb|EIE19321.1| DUF171-domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 318

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 152/314 (48%), Positives = 193/314 (61%), Gaps = 24/314 (7%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AV  S+IDN QS ELAT +AGQIAR   IF +DEV+V D+   +D+     +    
Sbjct: 4   TLSIAVAASVIDNVQSAELATLVAGQIARTAAIFNVDEVMVIDD---ADDICVGCSGRVG 60

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                   GAAFL R+LQ++ETPQYLRKAL  MH  LR  G LPPLDAPHH+R  EW PF
Sbjct: 61  K-------GAAFLARVLQFMETPQYLRKALCPMHRDLRLAGQLPPLDAPHHMRASEWRPF 113

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG------------TNRNL 229
           REGV ++       G+L+DVGL+K   V + L   VRVTV MG              + +
Sbjct: 114 REGVVVRSAV--GAGSLLDVGLDKVAFVPEALKQSVRVTVEMGEQPRPHRVEGGPLGKGV 171

Query: 230 DADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
           D      V+P     ESG YWGY VR A  +S+VF  C Y+ GYD  +GTSE G + +++
Sbjct: 172 DTLEGSVVLPSDPRAESGTYWGYTVRIASGLSAVFSECPYQEGYDLKVGTSERGSVQSAA 231

Query: 290 DLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
           +L LP  +H LIAFGG  GLEE +++D G    +A E+F  + NTCP QGSRTIRTEEA+
Sbjct: 232 ELELPPHQHALIAFGGPQGLEECLKKDPGKAAGDAEELFDLWLNTCPGQGSRTIRTEEAL 291

Query: 350 FISLQYFQEPISRA 363
            ISL YFQ  +  A
Sbjct: 292 LISLAYFQPALQSA 305


>gi|299470201|emb|CBN79505.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 577

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 146/319 (45%), Positives = 197/319 (61%), Gaps = 24/319 (7%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS--------DNYS 113
           T+S+A+PGS++DN QS EL T L G+IARA  +F +DE+VV+D+ S S        D+ +
Sbjct: 241 TLSIAIPGSVVDNAQSRELKTYLVGEIARAAAVFEVDEIVVYDDLSVSRKGQGGVRDDSA 300

Query: 114 RSSAANRSNRS-----------DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
             +AA   NR            D+N     F+ RLLQY ETPQYLRK LF MHSSLR  G
Sbjct: 301 SVAAAGGFNRGGGSANGSGGGGDKNNPNV-FMARLLQYAETPQYLRKQLFPMHSSLRLAG 359

Query: 163 MLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVA 222
           +L PLDAPHH+R  +  PFREGV L        G+ VD+G  K   +D+ LDPGVRVTV 
Sbjct: 360 LLNPLDAPHHMRADDACPFREGVVLDRTIKKGAGSFVDIGKRKEARIDRPLDPGVRVTVK 419

Query: 223 MGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSE 281
           +        +   + VPPS P+E  G+YWGY+ R A  ++ +   C YKGGYD  +GTSE
Sbjct: 420 LDKGDPCKKNHKGRAVPPSAPREELGLYWGYQTRLASGMADLLAGCPYKGGYDLTVGTSE 479

Query: 282 HGDI-VNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNARE--VFHSYFNTCPHQ 338
            G++ V+S D  LP +RH L+ FGG+ G+E +++ D+ L+   +    +F  + NTCP Q
Sbjct: 480 RGNVTVDSPDFALPEYRHALVVFGGVQGIEATVDADESLQLPGSESSALFDMWVNTCPGQ 539

Query: 339 GSRTIRTEEAIFISLQYFQ 357
           GSRTIRTEEA+ I+L   +
Sbjct: 540 GSRTIRTEEAVLITLARLR 558


>gi|156368817|ref|XP_001627888.1| predicted protein [Nematostella vectensis]
 gi|156214850|gb|EDO35825.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 152/309 (49%), Positives = 197/309 (63%), Gaps = 15/309 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+PGSI+DN QS EL T LAGQIARAV +F +DEV+VF+    S + S+   +N  
Sbjct: 5   TVSLALPGSILDNAQSAELRTYLAGQIARAVVVFNVDEVIVFNESRKSLDKSKDGVSNFH 64

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
           +++D N      L R+LQYLE PQYLRKA F  H  L++ G+L PLD PHH+R  +  P+
Sbjct: 65  SKADANLQ----LARILQYLECPQYLRKAFFPRHKDLQYAGLLNPLDCPHHMRIDDDMPY 120

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM-----GTNRNLDADSPRQ 236
           REGV L      S G+LV+ G+ K V +D+ + PG+RVTV +       N+ +   +   
Sbjct: 121 REGVVLDRPVGQSGGSLVNAGMRKEVKIDKCIKPGIRVTVKLKPTSKSGNKQIFFGN--- 177

Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
           VV PS P+ ++G+YWGY VR AP+ S VF    Y  GYD +IGTSE G  V+   L L  
Sbjct: 178 VVSPSAPRVDAGLYWGYSVRLAPSFSDVFTESPYPQGYDVMIGTSERGAPVDC--LELQP 235

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           F H+LI FGGL GLE S+E D+ L   +  E+FH Y NTCP QGSRTIRTEEAI IS+  
Sbjct: 236 FSHMLIVFGGLKGLEASLESDESLDVSDPSELFHHYVNTCPGQGSRTIRTEEAILISMAA 295

Query: 356 FQEPISRAL 364
            +  I R L
Sbjct: 296 LKPVIERTL 304


>gi|340377485|ref|XP_003387260.1| PREDICTED: uncharacterized protein C9orf114 homolog [Amphimedon
           queenslandica]
          Length = 445

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 146/293 (49%), Positives = 194/293 (66%), Gaps = 16/293 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+PGSI++N QS +L T LAGQIAR   IF ++E+V+F  + +           + 
Sbjct: 157 TVSIALPGSILNNVQSFQLKTYLAGQIARIAAIFCVNEIVIFKEEGTD--------TRKG 208

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
           + +D +     F+ R+LQY+ETPQYLRKAL+ MH  L++ G+L PLDAPHH+R HE +PF
Sbjct: 209 SETDPD----VFMCRILQYMETPQYLRKALYGMHPDLKYAGVLNPLDAPHHMRAHEDSPF 264

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG+ L      S  ++VD GLN +V V++ L+ GVRVTV M        +   ++V PS
Sbjct: 265 REGIVLSSTDDGS--SMVDCGLNNNVTVNKKLETGVRVTVKMKKRSESGNNLRGKIVAPS 322

Query: 242 KPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
            P  S G+YWGY+VRYA +   VF + SYK  YD +IGTSE GD ++S  L  P ++HLL
Sbjct: 323 IPLTSKGIYWGYQVRYARSFGEVFTSSSYKESYDVIIGTSERGDPIDSV-LPFPQYKHLL 381

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           I FGG  GLE S+E+D  L+  + R +FH Y NTC HQGSRTIRTEEAI ISL
Sbjct: 382 IVFGGQHGLEYSLEKDRTLEVDDVRSLFHFYLNTCTHQGSRTIRTEEAITISL 434


>gi|242018127|ref|XP_002429532.1| protein C9orf114, putative [Pediculus humanus corporis]
 gi|212514480|gb|EEB16794.1| protein C9orf114, putative [Pediculus humanus corporis]
          Length = 371

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 152/305 (49%), Positives = 195/305 (63%), Gaps = 12/305 (3%)

Query: 53  NDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNY 112
           +D  P E+ T+S+AVPGSI+ N QS EL   +AGQIARA  I++I+EV+VFD     D  
Sbjct: 62  SDCKPCELITMSIAVPGSILQNAQSNELRAYVAGQIARAACIYKINEVIVFD-----DAE 116

Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH 172
               A+N  +    ++    F  RLLQYLE PQYLRK  F +HS L++ G++ PLDAPHH
Sbjct: 117 EEPLASNEKSLVKCSKYCLQF-GRLLQYLECPQYLRKYFFPLHSDLQYAGIMNPLDAPHH 175

Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
           LR+ E   FREGVTL +   N  GT V+VGL K V+V + L PG+RVTV +   +     
Sbjct: 176 LRQDENNDFREGVTLDKPVKNDKGTFVEVGLTKQVLVYKHLQPGLRVTVKLNPPKENSKK 235

Query: 233 SPRQVVPPSKP-KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG---DIVNS 288
              ++V P+ P  +S +YWGY VR A +IS VF NC YKGGYD  IGTS+ G   D +  
Sbjct: 236 LKGEIVSPNFPLLDSNLYWGYTVRIAHSISEVFLNCPYKGGYDLSIGTSDKGKNIDEIKK 295

Query: 289 SDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
            DL    F+H LI FGGL GLE  +E+D  L  +N   +F  Y NTCP+QGSRTIRTEEA
Sbjct: 296 KDLR--NFKHGLIVFGGLQGLENIVEKDTTLNVENVSHLFDFYINTCPNQGSRTIRTEEA 353

Query: 349 IFISL 353
           +FI+L
Sbjct: 354 VFITL 358


>gi|52219102|ref|NP_001004627.1| uncharacterized protein LOC447888 [Danio rerio]
 gi|51858842|gb|AAH81398.1| Zgc:101748 [Danio rerio]
          Length = 381

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 147/302 (48%), Positives = 190/302 (62%), Gaps = 8/302 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGS++DN QS EL T LAGQIARA  +F +DEVV+FD        ++S      
Sbjct: 77  TVSVALPGSVMDNAQSPELRTYLAGQIARACAVFCVDEVVIFDELGED---AKSVEGEFK 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               + +S    L R+LQYLE PQYLRK  F  H  L+F G+L PLD+PHH+R  E A F
Sbjct: 134 GVGKKGQSCVQ-LARILQYLECPQYLRKFFFPKHQDLQFAGLLNPLDSPHHMRIDEEAEF 192

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
           REGV L        G+ V+ G+ K V +D+ L  G+RVTV +  ++N D    +  VV P
Sbjct: 193 REGVVLDRPVKAGKGSFVNCGMRKEVQIDKQLKTGLRVTVQLNESQNKDGRVYKGVVVAP 252

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ E+G+YWGY VR A  +S+VF  C YK GYD  IGTSE G   N+  +TLP FRH+
Sbjct: 253 HIPRTEAGLYWGYSVRLASCLSAVFSECPYKEGYDLTIGTSEKGQ--NTDHVTLPAFRHM 310

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE S++ D+ L   +   +F  Y NTCP QGSRTIRTEEA+ ISL   ++ 
Sbjct: 311 LVVFGGLQGLEASVDADENLTISDPSVLFDLYLNTCPAQGSRTIRTEEALLISLSGLRQK 370

Query: 360 IS 361
           I+
Sbjct: 371 IA 372


>gi|322784360|gb|EFZ11331.1| hypothetical protein SINV_00732 [Solenopsis invicta]
          Length = 369

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 142/300 (47%), Positives = 189/300 (63%), Gaps = 10/300 (3%)

Query: 59  EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
           +V T+S+AVPGSI+DN QS EL T LAGQIARA  I+R+DE+VVFD++        +   
Sbjct: 62  DVCTISIAVPGSILDNAQSPELRTYLAGQIARAACIYRVDEIVVFDDRGEV-----TEDE 116

Query: 119 NRSNRSD----ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
            R  R D    E   G   L R+LQYLE PQYLRK  F +H  L++ G+L PLDAPHHLR
Sbjct: 117 KRKVRKDDVLGEGRVGCLQLARILQYLECPQYLRKYFFPIHKDLQYAGLLNPLDAPHHLR 176

Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
           + + + +REG+   +      G+ V+VGL   + VD+VL PG+RVTV +   +       
Sbjct: 177 QEDESLYREGIVTNKPIKVGKGSQVNVGLLNEIHVDKVLMPGIRVTVKIPAGQPNPKKLK 236

Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
             VVPP+ P+ E+G+YWGY V+   N++ V  NC YKGGYD  IGTS+ G +++  +   
Sbjct: 237 GIVVPPNVPRAETGIYWGYTVKLVKNLTEVLTNCQYKGGYDLTIGTSDKGTLIDEIEARS 296

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             +RH LI FGGL GLE +++ D  L   +   VFH Y N CP QGSRTIRTEEAI ++L
Sbjct: 297 LKYRHCLIVFGGLRGLEAAVDVDPHLNIDDPSLVFHKYVNMCPQQGSRTIRTEEAILLTL 356


>gi|307181843|gb|EFN69283.1| Uncharacterized protein C9orf114-like protein [Camponotus
           floridanus]
          Length = 368

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 138/296 (46%), Positives = 189/296 (63%), Gaps = 2/296 (0%)

Query: 59  EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
           ++ TVS+AVPGSI+DN QS EL T LAGQIARA  I+R+DE++VFD+K       R    
Sbjct: 60  DICTVSIAVPGSILDNAQSPELRTYLAGQIARAACIYRVDEIIVFDDKGDVTESERKKI- 118

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
            + +   E   G   L R+LQYLE PQYLRK  F +H  L++ G+L PLDAPHHLR+ + 
Sbjct: 119 RKDDALGEARLGCLQLARILQYLECPQYLRKYFFPIHKDLQYAGVLNPLDAPHHLRQMDE 178

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
           + +REG+   +      G+ V+VGL   V VD+VL PG+RVTV +   +         +V
Sbjct: 179 SLYREGIVTNKPIKVGKGSQVNVGLLNEVHVDKVLMPGLRVTVKIPMEQPNSKKLKGIIV 238

Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
           PP+ P+ E+G+YWGY V+   +++ V  NC YKGGYD  IGTS+ G +++  +     +R
Sbjct: 239 PPNVPRAEAGIYWGYTVKLVKHLTEVLTNCQYKGGYDLTIGTSDKGTLIDEIEAKSLEYR 298

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           H LI FGG+ GLE +++ D  L   +   +FH Y NTCP QGSRTIRTEEAI ++L
Sbjct: 299 HCLIVFGGVTGLEAAVDVDPNLDVDDPSLLFHKYVNTCPQQGSRTIRTEEAILLTL 354


>gi|157128149|ref|XP_001661329.1| hypothetical protein AaeL_AAEL011052 [Aedes aegypti]
 gi|108872678|gb|EAT36903.1| AAEL011052-PA [Aedes aegypti]
          Length = 438

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 146/315 (46%), Positives = 197/315 (62%), Gaps = 3/315 (0%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-RSSAA 118
           V T+S+AVPGSI+DN QS EL T LAGQIARA  IF++DEVVVFD+  + +  + +    
Sbjct: 69  VSTLSIAVPGSILDNAQSPELRTYLAGQIARAACIFQVDEVVVFDDCGTGNQATEKLGTV 128

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
           + S  S         L R+LQYLE PQYLRK  F +H+ L+F G+L PLD+ HHLR+   
Sbjct: 129 DTSEGSTSARRCCVQLARILQYLECPQYLRKFFFPIHNDLKFCGLLSPLDSQHHLRQASE 188

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QV 237
             FREG+   +   N   + V+VGL   V+VD  LDPGVRVTV M    +L +   R +V
Sbjct: 189 FQFREGIVTNKPTKNDKSSFVNVGLLNDVLVDTHLDPGVRVTVKMPPGVDLKSKKIRAKV 248

Query: 238 VPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           VPP++P+ E+G+YWGY VR A ++S VF    YKGGYD  IGTS+ G  V   +     +
Sbjct: 249 VPPAQPRQETGIYWGYTVRIANSLSQVFTKSPYKGGYDMTIGTSDTGTNVQELEPQSLKY 308

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           +H L+ FGG+ GLE +++ D+ L   +  ++F  Y NT P QGSRT+RTEEAI ISL   
Sbjct: 309 KHALVVFGGVLGLEPALDNDNKLNVDSVEQLFDQYLNTVPRQGSRTVRTEEAILISLAAL 368

Query: 357 QEPISRALRRVEYKS 371
            E +   ++  E+ S
Sbjct: 369 GEKLDPTVKPKEFTS 383


>gi|148236641|ref|NP_001084926.1| uncharacterized protein LOC431981 [Xenopus laevis]
 gi|47123001|gb|AAH70663.1| MGC82267 protein [Xenopus laevis]
          Length = 375

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 147/306 (48%), Positives = 189/306 (61%), Gaps = 7/306 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGSI+DN QS EL T LAGQIARA  IF IDE+V+FD        S+S   +  
Sbjct: 72  TVSVALPGSIMDNAQSPELRTYLAGQIARACAIFCIDEIVIFDETGEG---SKSVEGDFE 128

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               + ++    L R+LQYLE PQYLRK+ F  H  L+F G+L PLD+PHH+R  E + +
Sbjct: 129 GVGKKGQACVQ-LARILQYLECPQYLRKSFFPKHPDLQFAGLLNPLDSPHHVRMDEESEY 187

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGV L     +S G+ V+ G+ K V +D+ L  GVRVTV +             VV P 
Sbjct: 188 REGVVLDRPTKSSKGSFVNCGMRKEVQIDKQLQAGVRVTVHLQQENVEQRVQKGIVVSPQ 247

Query: 242 KPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
            P+ E G+YWGY+VR A  +S+VF  C +K GYD  +GTSEHG   N   ++LP FRH +
Sbjct: 248 HPRTEGGLYWGYRVRLASCLSAVFTECPFKDGYDLTMGTSEHGS--NVETVSLPDFRHAV 305

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
           I FGGL GLE S+E D  L       +F+ Y NTCP QGSRTIRTEEAI ISL   +  +
Sbjct: 306 IIFGGLKGLEASVESDQNLDIAEPSILFNHYLNTCPDQGSRTIRTEEAILISLAALRPRL 365

Query: 361 SRALRR 366
             A+++
Sbjct: 366 QAAIKK 371


>gi|156545293|ref|XP_001605386.1| PREDICTED: uncharacterized protein C9orf114 homolog [Nasonia
           vitripennis]
          Length = 404

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 146/298 (48%), Positives = 192/298 (64%), Gaps = 6/298 (2%)

Query: 59  EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
           +V TVS+AVPGSI+DN QS EL T +AGQIARA  I+++DEV+VFD+K       +    
Sbjct: 60  KVSTVSIAVPGSILDNAQSHELRTYVAGQIARAACIYKVDEVIVFDDKGEITEDEKKKL- 118

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
            R     E   G   L R+LQYLE PQYLRK  F +H  L++ G+L PLDAPHHLR+ + 
Sbjct: 119 RRDEVLGEARPGCLQLARILQYLECPQYLRKYFFPIHKDLQYAGLLNPLDAPHHLRQQDA 178

Query: 179 APFREGVTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
           + +REGV    N P  VG  +L++VGL   V VD+VL  G+RVTV +  ++         
Sbjct: 179 SLYREGVV--SNKPLKVGKGSLINVGLLTDVRVDKVLTSGLRVTVKIPESQENPKKPKGI 236

Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
           +VPPS P+ ++G+YWGY V+ A ++S +F  C Y GGYD  IGTS+ G  V+        
Sbjct: 237 IVPPSLPRADTGIYWGYSVKLASSLSDIFAKCPYPGGYDVSIGTSDKGTSVDDIAPKSLK 296

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           ++H LI FGGLAG+EE+IE D  L   +A  VF+ Y NTCP QGSRTIRTEEAI ++L
Sbjct: 297 YKHALIVFGGLAGIEEAIEADSNLDIDDASLVFNMYLNTCPQQGSRTIRTEEAILLTL 354


>gi|259155393|ref|NP_001158758.1| CI114 protein [Salmo salar]
 gi|223647122|gb|ACN10319.1| C9orf114 [Salmo salar]
 gi|223672995|gb|ACN12679.1| C9orf114 [Salmo salar]
          Length = 380

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 142/305 (46%), Positives = 195/305 (63%), Gaps = 8/305 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGS++DN QS EL T LAGQIARA  +F +DE++VFD +    +    +     
Sbjct: 75  TVSVALPGSVLDNAQSTELRTYLAGQIARACVVFCVDEIIVFDEQ----DEDVKTVEGEF 130

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
           N   +       L R+LQ+LE PQYLRK+ F MH  L++ G+L PLD+PHH+RK +   +
Sbjct: 131 NGVGKKGQACIQLARILQFLECPQYLRKSFFPMHKDLQYAGLLNPLDSPHHMRKDDECEY 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
           REGV L   +    G+LV+ G+ K V +D+ L  G+RVTV M   +N D    +  VV P
Sbjct: 191 REGVVLDRPSKPGKGSLVNCGMWKEVQIDKQLQSGLRVTVHMNKIQNKDGRLHKGVVVAP 250

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
            KP+ E G+YWGY VR A ++S+VF  C +K GYD  IGTSE G  V+++ L+   F+H+
Sbjct: 251 HKPRTEGGLYWGYSVRLASSLSNVFTECPHKEGYDLTIGTSEKGVDVDTTSLS--PFKHM 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE S++ D  L+  +   +F  Y NTCP+QGSRTIRTEEAI IS+   ++ 
Sbjct: 309 LVVFGGLQGLEASVDADQNLEVTDPGVLFDLYLNTCPNQGSRTIRTEEAILISMSSLRQK 368

Query: 360 ISRAL 364
           I+ A 
Sbjct: 369 ITAAF 373


>gi|348664786|gb|EGZ04626.1| hypothetical protein PHYSODRAFT_551704 [Phytophthora sojae]
          Length = 301

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 140/306 (45%), Positives = 204/306 (66%), Gaps = 12/306 (3%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           +A+PGSI+DN Q+ EL T LAGQ+ARA  IF++DE+VVFD++  +   S S+ A +    
Sbjct: 1   MAIPGSILDNAQTKELRTYLAGQVARAAVIFQVDEIVVFDDQLGN---SVSTGAPKKRAH 57

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREG 184
           D +     F+ R+LQYLETPQYLR+ALF MHS L   G+L PLD PHHLR  EW+ +REG
Sbjct: 58  DCH----VFMARILQYLETPQYLRRALFPMHSDLSCAGLLNPLDCPHHLRAQEWSVYREG 113

Query: 185 VTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPRQVVPPSKP 243
           V       +  G+ V VGL + VVVD+ + PG+RVTV +   +++       ++V  ++P
Sbjct: 114 VVTDRPLKDKEGSHVYVGLQREVVVDKRITPGIRVTVKIDEASKDFKKKMVGKLVSSAEP 173

Query: 244 KES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI-VNSSDLTLPTFRHLLI 301
           +E  G+YWGY  R+AP++S+V+ +C + GGYD  +GTSE G++ V+    +LP FRH LI
Sbjct: 174 REHLGVYWGYTTRFAPSLSNVWSDCPFPGGYDLKVGTSERGNVSVDDPSFSLPKFRHALI 233

Query: 302 AFGGLAGLEESIEEDDG--LKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
            FGG+AG+EE ++ D+   +  +++  +F  + NTCP QGSRTIR+EEA+ ISL   +  
Sbjct: 234 VFGGVAGIEECVDADEKVPVSGEDSNTLFDMWVNTCPEQGSRTIRSEEALLISLAALRPH 293

Query: 360 ISRALR 365
           + R+ R
Sbjct: 294 VKRSGR 299


>gi|158285430|ref|XP_001687890.1| AGAP007570-PA [Anopheles gambiae str. PEST]
 gi|157019988|gb|EDO64539.1| AGAP007570-PA [Anopheles gambiae str. PEST]
          Length = 439

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 147/298 (49%), Positives = 190/298 (63%), Gaps = 4/298 (1%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           V TVS+AVPGSI+DN QS EL T LAGQIARA  IF++DEV+VFD+  SS+  +      
Sbjct: 61  VSTVSIAVPGSIMDNAQSPELRTYLAGQIARAACIFQVDEVIVFDDCGSSNQVTDKLGTI 120

Query: 120 RSNRSDENESGAAF-LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
           ++N    +       L R+LQYLE PQYLRK  F +H+ L+F G+L PLD+ HHLR+   
Sbjct: 121 QTNEGSTSARRCCIQLARILQYLECPQYLRKYFFPLHNDLKFTGLLNPLDSQHHLRQQSE 180

Query: 179 APFREG-VTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-Q 236
             FREG VT K    N  G  V+VGL   V+VD  LDP +RVTV +    +L +   R +
Sbjct: 181 FVFREGIVTTKPTKGNKPGAFVNVGLLNDVLVDTKLDPNLRVTVKLPAGADLKSKKLRGK 240

Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
           VVPPS+P+ E+G+YWGY VR A ++S VF    YKGGYD  IGTS+ G  V+       +
Sbjct: 241 VVPPSQPRQETGIYWGYTVRIANSLSQVFTKSPYKGGYDMTIGTSDRGKNVHELQPASLS 300

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           ++H LI FGG+ GLE +++ D  L      ++F  Y NT P QGSRTIRTEEAI IS+
Sbjct: 301 YQHALIVFGGVLGLEPALDNDQKLNVDTVEDLFEEYLNTVPTQGSRTIRTEEAILISM 358


>gi|62859635|ref|NP_001016729.1| uncharacterized protein LOC549483 [Xenopus (Silurana) tropicalis]
 gi|115530318|emb|CAL49394.1| novel protein [Xenopus (Silurana) tropicalis]
          Length = 318

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 148/307 (48%), Positives = 192/307 (62%), Gaps = 9/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGSI+DN QS EL T LAGQIARA  IF IDE+V+FD        S+S   N  
Sbjct: 15  TVSVALPGSIMDNAQSPELRTYLAGQIARACAIFCIDEIVIFDETGEG---SKSVEGNFE 71

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               + ++    L R+LQYLE PQYLRK+ F  H  L+F G+L PLD+PHH+R  E + +
Sbjct: 72  GVGKKGQA-CVQLARILQYLECPQYLRKSFFPKHPDLQFAGLLNPLDSPHHVRIDEESEY 130

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L     +  G+ V+ G+ K V +D+ L  GVRVTV +    N++    +  VV P
Sbjct: 131 REGVVLDRPTKSGKGSFVNCGMRKEVQIDRQLQTGVRVTVKL-QQENVELKVQKGTVVSP 189

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+  SG+YWGY+VR A  +S+VF  C +K GYD  IGTSE G  V +  ++LP FRH 
Sbjct: 190 QHPRTASGLYWGYRVRLASCLSAVFTECPFKDGYDLTIGTSERGSSVET--VSLPDFRHA 247

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           +I FGGL GLE S+E D  L       +F+ Y NTCP QGSRTIRTEEAI ISL   +  
Sbjct: 248 MIVFGGLKGLEASVESDQNLDIAEPSILFNHYLNTCPDQGSRTIRTEEAILISLAALKAR 307

Query: 360 ISRALRR 366
           +  A+++
Sbjct: 308 LQAAIKK 314


>gi|427779265|gb|JAA55084.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 376

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 145/299 (48%), Positives = 192/299 (64%), Gaps = 8/299 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN-R 120
           T+S+AVPGSI+DN Q+ EL T LAGQIARA  +F +DE+VV+D+  S+   +  +A +  
Sbjct: 70  TLSIAVPGSIMDNAQNAELRTYLAGQIARAAVVFCVDEIVVYDDDGSTIRGANGTAVDGE 129

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
            +   ++  G   L R+LQYLE PQYLRK LF +H  L+F G+L PLD PHHLR  E +P
Sbjct: 130 FSGLAKSGQGVVQLARVLQYLECPQYLRKHLFPLHRDLQFAGLLNPLDCPHHLRATEESP 189

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVP 239
           +REG+  +       G+ V+ GLNK V +D+ L PGVRVTV M     +     R   V 
Sbjct: 190 YREGIVARLPVKEGSGSYVNCGLNKEVKIDRCLQPGVRVTVKM--KEGVPGKKLRGTAVS 247

Query: 240 PSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP-TFR 297
           PS P+ E+G+YWGY+VR A ++ +V   C + GGYD  +GTSE G  V+  +L LP  F 
Sbjct: 248 PSCPRTEAGLYWGYEVRVARSLGAVLAECPFPGGYDITLGTSERGTPVD--ELELPHRFE 305

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           H L+ FGGL GLE ++E D+ L   +   +F+ Y NTCP QGSRTIRTEEAI ISL  F
Sbjct: 306 HALVVFGGLKGLEAALEADEALDVDDPEYLFNHYLNTCPSQGSRTIRTEEAILISLSXF 364


>gi|427784287|gb|JAA57595.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 381

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 146/306 (47%), Positives = 193/306 (63%), Gaps = 8/306 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AVPGSI+DN Q+ EL T LAGQIARA  +F +DE+VV+D+  S+   +  +A +  
Sbjct: 70  TLSIAVPGSIMDNAQNAELRTYLAGQIARAAVVFCVDEIVVYDDDGSTIRGANGTAVDGE 129

Query: 122 NRS-DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
                ++  G   L R+LQYLE PQYLRK LF +H  L+F G+L PLD PHHLR  E +P
Sbjct: 130 FSGLAKSGQGVVQLARVLQYLECPQYLRKHLFPLHRDLQFAGLLNPLDCPHHLRATEESP 189

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVP 239
           +REG+  +       G+ V+ GLNK V +D+ L PGVRVTV M     +     R   V 
Sbjct: 190 YREGIVARLPVKEGSGSYVNCGLNKEVKIDRCLQPGVRVTVKM--KEGVPGKKLRGTAVS 247

Query: 240 PSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP-TFR 297
           PS P+ E+G+YWGY+VR A ++ +V   C + GGYD  +GTSE G  V+  +L LP  F 
Sbjct: 248 PSCPRTEAGLYWGYEVRVARSLGAVLAECPFPGGYDITLGTSERGTPVD--ELELPHRFE 305

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           H L+ FGGL GLE ++E D+ L   +   +F+ Y NTCP QGSRTIRTEEAI ISL   +
Sbjct: 306 HALVVFGGLKGLEAALEADEALDVDDPEYLFNHYLNTCPSQGSRTIRTEEAILISLSALR 365

Query: 358 EPISRA 363
             I  A
Sbjct: 366 PKIVAA 371


>gi|410979354|ref|XP_003996050.1| PREDICTED: uncharacterized protein C9orf114 homolog [Felis catus]
          Length = 398

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 147/318 (46%), Positives = 194/318 (61%), Gaps = 8/318 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRALRRVEYKSKPEDGK 377
           +++A  R  Y SK   GK
Sbjct: 368 LTQAGARQAYDSKAAAGK 385


>gi|328784069|ref|XP_003250387.1| PREDICTED: uncharacterized protein C9orf114 homolog [Apis
           mellifera]
          Length = 378

 Score =  275 bits (702), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 143/319 (44%), Positives = 198/319 (62%), Gaps = 9/319 (2%)

Query: 43  KKKKRKRDQLNDDAPIE-------VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIF 95
           K++ +K+ Q     PIE       + T+S+AVPGSI+DN QS EL T LAGQIARA  ++
Sbjct: 36  KEETKKKLQKEVVKPIEKHFEKKDISTISIAVPGSILDNAQSPELRTYLAGQIARAACVY 95

Query: 96  RIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMH 155
           +I+E++VFD+K       +    N      E+E G   L R+LQYLE PQYLRK  F +H
Sbjct: 96  KINEIIVFDDKGEIIESEKKKIKNDEILG-EDELGCLQLARILQYLECPQYLRKYFFPIH 154

Query: 156 SSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDP 215
             L++ G+L P+DAPHHLR+ + + +REGV   +   +  G+ V+VGL   + VD+VL  
Sbjct: 155 KDLQYAGILNPVDAPHHLRQQDISLYREGVVTNKPVKSGKGSHVNVGLLNDICVDKVLTA 214

Query: 216 GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYD 274
           G+RVTV +   ++        +VPP  P+ ++G+YWGY VR A N++ V   C YK GYD
Sbjct: 215 GLRVTVKIPEEQSNLKKLKGFIVPPDIPRSDTGIYWGYTVRLANNLTEVLTQCPYKSGYD 274

Query: 275 HLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNT 334
             IGTS+ G  ++  +     + H LI FGGL+GLE +++ D  L   +A  VFH Y NT
Sbjct: 275 LTIGTSDKGKSIDDIESKGIEYHHALIVFGGLSGLEATVDCDPNLDVDDASLVFHQYLNT 334

Query: 335 CPHQGSRTIRTEEAIFISL 353
           CP QGSRTIRTEEAI I+L
Sbjct: 335 CPQQGSRTIRTEEAILITL 353


>gi|112419371|gb|AAI21999.1| hypothetical protein LOC549483 [Xenopus (Silurana) tropicalis]
          Length = 318

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 147/307 (47%), Positives = 191/307 (62%), Gaps = 9/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV +PGSI+DN QS EL T LAGQIARA  IF IDE+V+FD        S+S   N  
Sbjct: 15  TVSVVLPGSIMDNAQSPELRTYLAGQIARACAIFCIDEIVIFDETGEG---SKSVEGNFE 71

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               + ++    L R+LQYLE PQYLRK+ F  H  L+F G+L PLD+PHH+R  E + +
Sbjct: 72  GVGKKGQA-CVQLARILQYLECPQYLRKSFFPKHPDLQFAGLLNPLDSPHHVRIDEESEY 130

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L     +  G+ V+ G+ K V +D+ L  GVRVTV +    N++    +  VV P
Sbjct: 131 REGVVLDRPTKSGKGSFVNCGMRKEVQIDRQLQTGVRVTVKL-QQENVELKVQKGTVVSP 189

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+  SG+YWGY+VR A  +S+VF  C +K GYD  IGTSE G  V +  ++LP FRH 
Sbjct: 190 QHPRTASGLYWGYRVRLASCLSAVFTECPFKDGYDLTIGTSERGSSVET--VSLPDFRHA 247

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           +I FGGL GLE S+E D  L       +F+ Y NTCP QGSRTIRTEEAI ISL   +  
Sbjct: 248 MIVFGGLKGLEASVESDQNLDIAEPSILFNHYLNTCPDQGSRTIRTEEAILISLAALKPR 307

Query: 360 ISRALRR 366
           +  A+++
Sbjct: 308 LQAAIKK 314


>gi|312381995|gb|EFR27594.1| hypothetical protein AND_05627 [Anopheles darlingi]
          Length = 430

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 145/307 (47%), Positives = 193/307 (62%), Gaps = 3/307 (0%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-RSSAA 118
           V T+S+AVPGSI+DN QS EL T LAGQIARA  IF+IDEV+VFD+   S+  + R S  
Sbjct: 61  VSTISLAVPGSIMDNAQSPELRTYLAGQIARAACIFQIDEVIVFDDCGPSNQVTDRLSTI 120

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
             +  S         L R+LQYLE PQYLRK  F +H+ L+F G+L PLD+ HHLR+   
Sbjct: 121 ETAEGSTSARRCCIQLARILQYLECPQYLRKYFFPIHNDLKFCGLLNPLDSQHHLRQQSE 180

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QV 237
             FREG+   +   +  G+ V+VGL   V +D  L+P VRVTV +    +L +   R +V
Sbjct: 181 FVFREGIVTSKPTKHGKGSYVNVGLLNDVRIDTQLEPNVRVTVKLPDGADLKSKKIRAKV 240

Query: 238 VPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           VPPS+P+ E+G+YWGY VR A ++S VF    YKGGYD  IGTS+ G  V+       ++
Sbjct: 241 VPPSQPRQETGIYWGYSVRIANSLSQVFTKSPYKGGYDLTIGTSDRGTNVHEVAAKSLSY 300

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           RH L+ FGG+ GLE +++ D  L  +   ++F  Y NT P QGSRT+RTEEAI IS+   
Sbjct: 301 RHALVVFGGVLGLEPALDNDTKLTVETVEDLFDEYLNTVPAQGSRTVRTEEAILISMAAL 360

Query: 357 QEPISRA 363
            E +S A
Sbjct: 361 GEKVSPA 367


>gi|344271297|ref|XP_003407476.1| PREDICTED: uncharacterized protein C9orf114-like [Loxodonta
           africana]
          Length = 376

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 144/304 (47%), Positives = 189/304 (62%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDPKTVEGEFMGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L        G+ V+ G+ K V +D+ L+PG+RVTV +   + L++ + R +VV  
Sbjct: 190 REGVVLDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQKQLLESKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    +K GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFKDGYDLTIGTSERGSDVASAQ--LPSFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP+QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLEGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           +++A
Sbjct: 368 LTQA 371


>gi|291415402|ref|XP_002723940.1| PREDICTED: CG12128-like [Oryctolagus cuniculus]
          Length = 374

 Score =  272 bits (695), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 144/304 (47%), Positives = 187/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DEVVVFD +               
Sbjct: 72  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEVVVFDEEGQDAKTVEGEFKGVG 131

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E A F
Sbjct: 132 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEEAEF 187

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  +  + R +VV  
Sbjct: 188 REGVVVDRPTRPGHGSFVNCGMKKEVKIDRNLEPGLRVTVQLSQKQLPECKTYRGKVVSA 247

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 248 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSEVASAQ--LPSFRHA 305

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE +++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 306 LVVFGGLQGLEAAVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 365

Query: 360 ISRA 363
           ++RA
Sbjct: 366 LARA 369


>gi|339242005|ref|XP_003376928.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316974332|gb|EFV57827.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 770

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 134/294 (45%), Positives = 187/294 (63%), Gaps = 7/294 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+PGSI++N QS EL T LAGQIARA +IF +DE+++FD  ++  +       N  
Sbjct: 184 TVSIAIPGSIVENAQSFELRTYLAGQIARAASIFEVDEIIIFDETATMTSKEVEMFENGE 243

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW-AP 180
            R+    +   FLVR+L+Y+E PQYLRK LF + S  ++VG+L PLD  HH R  E   P
Sbjct: 244 LRAF---NCNCFLVRILRYIECPQYLRKHLFPLCSDFKYVGLLNPLDCFHHFRMDEVEIP 300

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
           +RE V + +   +  G+  + GL K V +D+ L PG+RVTV    +         +VV P
Sbjct: 301 YREAVVVNKPVKDGRGSFCNAGLAKDVEIDRALKPGIRVTVKFENSDQKQKRLVAKVVNP 360

Query: 241 SKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
           ++P+ + G YWGY VR A ++ SVF +C Y  GYD ++GTS+ G   N  ++ LP+F+HL
Sbjct: 361 TEPRTAMGYYWGYNVRLAKSLKSVFSDCPYVEGYDLIVGTSDKG--TNVDNVQLPSFKHL 418

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           LI FGG+ G+E  IE D  L+  +   +F  Y NTCP+QGSRTIRTEEAI I+L
Sbjct: 419 LIVFGGVKGIEACIENDQHLQENDPESLFQFYLNTCPNQGSRTIRTEEAILITL 472


>gi|325182840|emb|CCA17295.1| conserved unknown protein putative [Albugo laibachii Nc14]
          Length = 314

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 140/306 (45%), Positives = 193/306 (63%), Gaps = 6/306 (1%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TV++A+PGSIIDN Q+ EL T LAGQIARA  IF++DE+V++D+       + S A+   
Sbjct: 3   TVTIALPGSIIDNAQTKELKTYLAGQIARAAVIFQVDEIVIYDDHLGQKTKNASQASETL 62

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            R  +      FL+R+LQYLETPQYLRK LF  H+ L   G+L PLD PHH+R  E + +
Sbjct: 63  QR--DAYDCHVFLMRILQYLETPQYLRKQLFPKHTDLSCAGLLNPLDTPHHVRSSESSLY 120

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGV       +  G+ V VGL K V VD+ + PG+RVTV +    N        +V PS
Sbjct: 121 REGVVSDRMLKDKEGSYVYVGLPKEVRVDKRIHPGIRVTVKLDQTTNNSKKLLGTLVSPS 180

Query: 242 KPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD-IVNSSDLTLPTFRHL 299
           +P++ +G+YWGY  R+AP++  V++ C Y+ GYD  IGTSE G+   N    +LP F+H 
Sbjct: 181 EPRQKTGIYWGYTCRFAPSLRHVWQECPYESGYDLKIGTSERGNQSTNDPAFSLPAFKHG 240

Query: 300 LIAFGGLAGLEESIEEDD--GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           LI FGG+AG+EE +  D+   L  + + ++F  + NTCP QGSRTIRTEEA+ ISL    
Sbjct: 241 LIVFGGVAGIEECVGNDETISLSSEESNKLFDMWVNTCPSQGSRTIRTEEAVLISLSALM 300

Query: 358 EPISRA 363
             +S +
Sbjct: 301 PHVSMS 306


>gi|326930329|ref|XP_003211300.1| PREDICTED: uncharacterized protein C9orf114 homolog [Meleagris
           gallopavo]
          Length = 385

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 143/306 (46%), Positives = 187/306 (61%), Gaps = 8/306 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI++N QSLEL T LAGQIARA  IF +DE+VVFD                 
Sbjct: 75  TLSVALPGSILNNAQSLELRTYLAGQIARACAIFCVDEIVVFDESGEDAKSVEGEFEGIG 134

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            R          L R+LQYLE PQYLRK+ F  H  L+F G+L PLD+PHH+R  E + +
Sbjct: 135 KRG----KACVQLARILQYLECPQYLRKSFFPKHGDLQFAGLLNPLDSPHHMRVDEDSEY 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L        G+ V+ G+ K V +D+ L PG+RVTV +   +N +    +  VV  
Sbjct: 191 REGVVLDRPTKPGRGSFVNCGMKKEVQIDKQLKPGLRVTVRLQEPQNPETKVRKGTVVSS 250

Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+  SG+YWGY VR A  +S+VF  C +K GYD  IGTSE G+ V+   +TLP+FRH+
Sbjct: 251 HHPRTVSGLYWGYSVRLASCLSAVFSECPFKEGYDLSIGTSERGNSVDQ--VTLPSFRHI 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+  +   +F  Y NTCP QGSRTIRTEEA+ ISL   +  
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVTDPSVLFDFYLNTCPSQGSRTIRTEEALLISLSALRPH 368

Query: 360 ISRALR 365
           I  A++
Sbjct: 369 IDEAVK 374


>gi|363740343|ref|XP_003642310.1| PREDICTED: uncharacterized protein C9orf114 homolog [Gallus gallus]
 gi|363740439|ref|XP_003642329.1| PREDICTED: uncharacterized protein C9orf114 homolog [Gallus gallus]
          Length = 379

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 143/306 (46%), Positives = 188/306 (61%), Gaps = 8/306 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI++N QSLEL T LAGQIARA  IF +DE+VVFD                 
Sbjct: 75  TLSVALPGSILNNAQSLELRTYLAGQIARACAIFCVDEIVVFDEHGEDAKSVEGEFEGIG 134

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            R          L R+LQYLE PQYLRK+ F  H  L+F G+L PLD+PHH+R  E + +
Sbjct: 135 KRG----KACVQLARILQYLECPQYLRKSFFPKHGDLQFAGLLNPLDSPHHMRVDEDSEY 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L   +    G+ V+ G+ K V +D+ L PG+RVTV +   +N +    +  VV  
Sbjct: 191 REGVVLDRPSKPGRGSFVNCGMKKEVQIDKQLKPGLRVTVRLQEPQNPETKVRKGTVVSS 250

Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+  SG+YWGY VR A  +S+VF  C +K GYD  IGTSE G+ V+   +TLP+FRH+
Sbjct: 251 HHPRTVSGLYWGYSVRLASCLSAVFSECPFKEGYDVSIGTSERGNSVDQ--VTLPSFRHM 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+  +   +F  Y NTCP QGSRTIRTEEA+ ISL   +  
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVTDPSVLFDFYLNTCPSQGSRTIRTEEALLISLSALRPH 368

Query: 360 ISRALR 365
           I  A++
Sbjct: 369 IDEAVK 374


>gi|345805985|ref|XP_851147.2| PREDICTED: uncharacterized protein C9orf114 [Canis lupus
           familiaris]
          Length = 376

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDVKTVEGEFRGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVTSAQ--LPSFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPSQGSRTIRTEEAILISLATLQAG 367

Query: 360 ISRALRR 366
           +S+A  R
Sbjct: 368 LSQAGAR 374


>gi|301758806|ref|XP_002915276.1| PREDICTED: uncharacterized protein C9orf114-like [Ailuropoda
           melanoleuca]
          Length = 376

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 142/308 (46%), Positives = 190/308 (61%), Gaps = 8/308 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVTSAQ--LPSFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPSQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRALRRV 367
           +++A  R+
Sbjct: 368 LTQAGARL 375


>gi|432884800|ref|XP_004074593.1| PREDICTED: uncharacterized protein C9orf114 homolog [Oryzias
           latipes]
          Length = 370

 Score =  270 bits (690), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 140/305 (45%), Positives = 191/305 (62%), Gaps = 8/305 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGS++DN QS EL T LAGQIARA  +F +DE++VFD +       +S   +  
Sbjct: 72  TVSVALPGSVLDNAQSPELRTYLAGQIARACAVFCVDEIIVFDEQGED---VKSVEGDFK 128

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               + ++    L R+LQY+E PQYLRK  F +H  L++ G+L PLD+PHH+R  E + +
Sbjct: 129 GVGKKGQA-CIQLARILQYVECPQYLRKCFFPVHKDLQYAGLLNPLDSPHHMRMDEESDY 187

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
           REGV L        G+LV+ G+ K V +D+ L  G+RVTV +   +N D+   +  VVPP
Sbjct: 188 REGVVLNRPPKQGKGSLVNCGMRKEVRIDKQLQSGLRVTVKLNKMQNQDSKLYKGVVVPP 247

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ E+ +YWGY VR A  +S+VF    +KGGYD  +GTSE G+ +  +  TL  F+HL
Sbjct: 248 HVPRTEAALYWGYTVRLASCLSAVFTEGPHKGGYDLTVGTSEKGNTLEQT--TLSPFKHL 305

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE S++ D  L   +   +F  Y NTCP QGSRTIRTEEAI IS+   Q  
Sbjct: 306 LVVFGGLHGLEASVDADQNLNVTDPSLLFDLYLNTCPGQGSRTIRTEEAILISMAALQPK 365

Query: 360 ISRAL 364
           I+ A 
Sbjct: 366 IAAAF 370


>gi|297480783|ref|XP_002691684.1| PREDICTED: uncharacterized protein C9orf114 [Bos taurus]
 gi|296482120|tpg|DAA24235.1| TPA: CG12128-like [Bos taurus]
          Length = 376

 Score =  270 bits (690), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 142/304 (46%), Positives = 188/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +  N+  ++ + R +VV  
Sbjct: 190 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQNQLPESKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP+QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           +++A
Sbjct: 368 LTQA 371


>gi|194671747|ref|XP_001789343.1| PREDICTED: uncharacterized protein C9orf114 [Bos taurus]
          Length = 397

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 95  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 154

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 155 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 210

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +  N+  ++ + R +VV  
Sbjct: 211 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQNQLPESKTYRGKVVSS 270

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 271 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 328

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP+QGSRTIRTEEAI ISL   Q  
Sbjct: 329 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 388

Query: 360 ISRALRR 366
           +++A  R
Sbjct: 389 LTQAGAR 395


>gi|405974466|gb|EKC39109.1| hypothetical protein CGI_10020686 [Crassostrea gigas]
          Length = 371

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 144/315 (45%), Positives = 203/315 (64%), Gaps = 8/315 (2%)

Query: 43  KKKKRKRDQLNDDAPIE--VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEV 100
           K++K  ++++ +++P +  + TVS+A+PGSI+DN QS EL T LAGQIARA  IF IDE+
Sbjct: 48  KRQKEGQEEITEESPKQGRLYTVSIALPGSILDNAQSPELRTYLAGQIARAAVIFNIDEI 107

Query: 101 VVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF 160
           ++FD   S D+           +   N      + R+LQYLE PQYLRK+ F  H  L++
Sbjct: 108 IIFDEIFSKDSTIEGEFKGVGKKGQAN----VQMARILQYLECPQYLRKSFFPHHKDLQY 163

Query: 161 VGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKH-VVVDQVLDPGVRV 219
            G+L PLD+PHH+R  + +P+REGV   +   +  G+LV+ G+ K  V +D++L PG+RV
Sbjct: 164 AGILNPLDSPHHMRGEDESPYREGVVTNKPVKSGRGSLVNCGIGKQDVQIDKLLTPGIRV 223

Query: 220 TVAMGTNRNLDADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIG 278
           TV +    +       +VV P+ P+E  G+YWGY +R A ++ +VF  C YK GYD  IG
Sbjct: 224 TVKINQYDSSRRILKGRVVSPAAPREEGGLYWGYSIRLAKSLGAVFTECPYKEGYDLAIG 283

Query: 279 TSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQ 338
           TSE GD V+S ++    FRH LI FGG+ GLE S+E D+ L  ++   +F  Y NTCP+Q
Sbjct: 284 TSEKGDNVDSVEMDTSNFRHALIVFGGVQGLEASLESDEALDIEDPSLLFQYYLNTCPNQ 343

Query: 339 GSRTIRTEEAIFISL 353
           GSRTIRTEEAI ISL
Sbjct: 344 GSRTIRTEEAILISL 358


>gi|91091892|ref|XP_970427.1| PREDICTED: similar to CG12128 CG12128-PA [Tribolium castaneum]
 gi|270001126|gb|EEZ97573.1| hypothetical protein TcasGA2_TC011435 [Tribolium castaneum]
          Length = 360

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 141/302 (46%), Positives = 189/302 (62%), Gaps = 7/302 (2%)

Query: 53  NDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNY 112
           N  AP   PTVS+AVPGSI++  QS EL   L+GQIARA  IF++DE+VVFD+       
Sbjct: 50  NKKAP---PTVSIAVPGSILETAQSPELRAYLSGQIARAACIFQVDEIVVFDDYGDESTA 106

Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH 172
            ++   +        ES    L R+LQY+E PQYLRK  F +H+ L+F G+L PL+APHH
Sbjct: 107 KKAELEDNYGLKTMRESCVQ-LGRILQYMECPQYLRKHFFPIHNHLKFCGILNPLNAPHH 165

Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
           L K E   FREGV L +      G++V+VGL K V VD++L PG+R TV +   +     
Sbjct: 166 LTKDEEFEFREGVVLNKPVKPGRGSIVNVGLLKEVHVDKLLTPGIRCTVKLSAQQESSKK 225

Query: 233 SPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
               VV PS PK E+G+YWGY +R A ++S VF    YK GYD  IGTS+ G  V+  + 
Sbjct: 226 LKGTVVAPSTPKSETGVYWGYTIRLANSLSKVFSQSPYKDGYDLTIGTSDKGTSVD--EF 283

Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
             PTF+H+L+ FGG+ GLE ++E D  L   + + +F  Y NT P+QGS+TIRTEEAI +
Sbjct: 284 KCPTFKHVLVVFGGVQGLELALESDSVLNVDDPKFLFDHYLNTLPNQGSKTIRTEEAILV 343

Query: 352 SL 353
           +L
Sbjct: 344 TL 345


>gi|297685497|ref|XP_002820324.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C9orf114
           homolog [Pongo abelii]
          Length = 473

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 143/304 (47%), Positives = 191/304 (62%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +      +++   N  
Sbjct: 171 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQD---AKTVEGNSQ 227

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               + ++    L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 228 GVGKKGQA-CVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 286

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  D  + R +VV  
Sbjct: 287 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPDCKTYRGKVVSS 346

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 347 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 404

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 405 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 464

Query: 360 ISRA 363
           ++RA
Sbjct: 465 LTRA 468


>gi|225706296|gb|ACO08994.1| C9orf114 [Osmerus mordax]
          Length = 378

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 142/305 (46%), Positives = 190/305 (62%), Gaps = 8/305 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TV+VA+PGS++DN QS EL T LAGQIARA  +F IDE+VVFD         RS     +
Sbjct: 76  TVTVALPGSVMDNAQSTELRTYLAGQIARACGVFCIDEIVVFDEHGED---VRSVEGEFT 132

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               + ++    L R+LQ+LE PQYLRK+ F  H  L++ G+L PLD+PHH+R  E + +
Sbjct: 133 GVKKKGQA-CIQLARILQFLECPQYLRKSFFPKHQDLQYAGLLNPLDSPHHMRIDEESEY 191

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
           REGV L   +    G+ V+ G+ K V +D+ L PG+RVTV +  N+N ++   +  VV P
Sbjct: 192 REGVVLDRPSRPGKGSFVNCGMWKEVQIDKQLQPGLRVTVHLNKNQNNESRIYKGVVVAP 251

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ + G+YWGY VR A  +S+VF  C YK GYD  IGTSE G   N    ++  F+HL
Sbjct: 252 HVPRSQGGLYWGYSVRLASGLSTVFTECPYKEGYDLTIGTSEKG--CNIDQASIKPFKHL 309

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE S++ D  L   +   +F  Y NTCP QGSRTIRTEEAI IS+   ++ 
Sbjct: 310 LVVFGGLQGLEASVDADQNLDVTDPSLLFDLYLNTCPSQGSRTIRTEEAILISMAGLRQK 369

Query: 360 ISRAL 364
           IS A 
Sbjct: 370 ISIAF 374


>gi|431898882|gb|ELK07252.1| hypothetical protein PAL_GLEAN10012500 [Pteropus alecto]
          Length = 412

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 141/304 (46%), Positives = 189/304 (62%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +               
Sbjct: 110 TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGEFKGIG 169

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 170 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 225

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R +VV  
Sbjct: 226 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 285

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 286 QDPRTKAGLYWGYTVRLASCLSAVFAETPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 343

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP+QGSRTIRTEEAI ISL   Q  
Sbjct: 344 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 403

Query: 360 ISRA 363
           +++A
Sbjct: 404 LTQA 407


>gi|432095359|gb|ELK26558.1| hypothetical protein MDA_GLEAN10019279, partial [Myotis davidii]
          Length = 372

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 142/304 (46%), Positives = 187/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD     D     +     
Sbjct: 62  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDE----DGQEAKTVEGEF 117

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 118 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 177

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R +VV  
Sbjct: 178 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLSQKQLPESKTYRGKVVSS 237

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 238 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 295

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  +  D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 296 LVVFGGLQGLEAGVNADPNLEVTEPSVLFDLYVNTCPSQGSRTIRTEEAILISLAALQPG 355

Query: 360 ISRA 363
           +++A
Sbjct: 356 LTKA 359


>gi|354505303|ref|XP_003514710.1| PREDICTED: uncharacterized protein C9orf114 homolog, partial
           [Cricetulus griseus]
          Length = 303

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 142/304 (46%), Positives = 187/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +               
Sbjct: 4   TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTKTVEGEFKGVG 63

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 64  KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 119

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+LV+ GL K V +D+ L+PG+RVTV +   +  +  + +  VV  
Sbjct: 120 REGIVVDRPTKAGHGSLVNCGLKKEVKIDKKLEPGLRVTVRLNQKQLPECKTYKGTVVSS 179

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 180 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSNVASAQ--LPSFRHA 237

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 238 LVVFGGLQGLEAGVDADPNLEVSEPSVLFDFYVNTCPSQGSRTIRTEEAILISLAALQPG 297

Query: 360 ISRA 363
           +++A
Sbjct: 298 LTQA 301


>gi|332024865|gb|EGI65053.1| Uncharacterized protein C9orf114-like protein [Acromyrmex
           echinatior]
          Length = 370

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 137/296 (46%), Positives = 185/296 (62%), Gaps = 3/296 (1%)

Query: 59  EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
           ++ T+S+AVPGSI+DN QS EL T LAGQIARA  I+++DE+VVFD+K            
Sbjct: 62  DICTISIAVPGSILDNAQSPELRTYLAGQIARAACIYKVDEIVVFDDKGEITE-DEKRKV 120

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
            + +   E   G   L R+LQYLE PQYLRK  F +H  L++ G+L PLDAPHHLR+   
Sbjct: 121 KKDDMLGEARLGCLQLARILQYLECPQYLRKYFFPIHKDLQYAGVLNPLDAPHHLRQENE 180

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
             +REG+   +      G+ V+VGL   V VD+VL  G+RVTV +  ++         +V
Sbjct: 181 FLYREGIVTNKPIKAGRGSQVNVGLLNEVHVDKVLMTGLRVTVKIPPDQPNRKKMKGIIV 240

Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
           PP+ P+ E+G+YWGY V+ A N++ V  NC YK GYD +IGTS+ G  ++        + 
Sbjct: 241 PPNMPRAETGIYWGYNVKLAKNLTEVLTNCQYKEGYDLMIGTSDKGTSIDEIKAKSLEYH 300

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           H LI FGGLAGLE +I+ D  L   ++  +F  Y NTCP QGSRTIRTEEAI ++L
Sbjct: 301 HCLIVFGGLAGLEAAIDVDPHL-NDDSSVIFDKYINTCPQQGSRTIRTEEAILLTL 355


>gi|149039118|gb|EDL93338.1| rCG45649, isoform CRA_a [Rattus norvegicus]
          Length = 377

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 142/304 (46%), Positives = 186/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +         S     
Sbjct: 75  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDTK----SVEGEF 130

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 131 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+LV+ G+ K V +D+ LDPG+RVTV +   +  +  + +  VV  
Sbjct: 191 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 250

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 251 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSSVASTQ--LPNFRHA 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+  +   +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVADPSVLFDFYVNTCPSQGSRTIRTEEAILISLAALQPG 368

Query: 360 ISRA 363
           +++ 
Sbjct: 369 LTQG 372


>gi|338720320|ref|XP_001499881.3| PREDICTED: uncharacterized protein C9orf114-like [Equus caballus]
          Length = 376

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 141/304 (46%), Positives = 188/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R +VV  
Sbjct: 190 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQRQLPESKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP+QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPNQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           +++A
Sbjct: 368 LTQA 371


>gi|307205236|gb|EFN83616.1| Uncharacterized protein C9orf114-like protein [Harpegnathos
           saltator]
          Length = 331

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 142/308 (46%), Positives = 191/308 (62%), Gaps = 13/308 (4%)

Query: 54  DDAPIEVP---TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD 110
           +DA  E P   TVS+AVPGSI+DN QS EL T LAGQIARA  I+ +DE++VFD+K    
Sbjct: 15  EDAKYERPDICTVSIAVPGSILDNAQSPELRTYLAGQIARAACIYNVDEIIVFDDKGDV- 73

Query: 111 NYSRSSAANRSNRSD----ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP 166
               +    R  R D    E   G   L R+LQY+E PQYLRK  F +H  L++ G+L P
Sbjct: 74  ----TEDEKRKIRKDDVLGEARIGCLQLARILQYMECPQYLRKYFFPIHKDLQYAGVLNP 129

Query: 167 LDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTN 226
           L+APHHL++ + + +REG+   +      G+ V+VGL   V VD+VL PG+RVTV +  N
Sbjct: 130 LNAPHHLKQDDISLYREGIVTNKPIKVGKGSQVNVGLVNDVHVDKVLIPGLRVTVKIPPN 189

Query: 227 RNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI 285
           +         VVPP+ P+ E+G YWGY V+ A N++ V  NC  K GYD +IGTS+ G +
Sbjct: 190 QPNSKKLNGIVVPPNVPRAETGTYWGYTVKLAKNLTEVITNCQCKDGYDLIIGTSDKGTL 249

Query: 286 VNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
           ++  +     + H LI FGGLAGLE  ++ D  L   +   +FH Y NTCP QGS+TIRT
Sbjct: 250 IDKIEAKSLEYHHCLIVFGGLAGLEAIVDVDPYLDIDDPSLMFHKYLNTCPQQGSQTIRT 309

Query: 346 EEAIFISL 353
           EEA+ I+L
Sbjct: 310 EEAVLITL 317


>gi|357611423|gb|EHJ67480.1| hypothetical protein KGM_03544 [Danaus plexippus]
          Length = 405

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 139/300 (46%), Positives = 186/300 (62%), Gaps = 2/300 (0%)

Query: 59  EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
           E+ TVS+AVPGSI++N QS EL T LAGQIARA  +F +DE++VFD+     +  +S   
Sbjct: 64  EISTVSIAVPGSILENAQSAELRTYLAGQIARAACVFCVDEIIVFDDIGDKVDTKKSKLE 123

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
           + S      +S    L R+LQYLE PQYLRK  F +H  L F G+L PLDAPHHLR    
Sbjct: 124 DNSGTKIARKSCVQ-LARILQYLECPQYLRKHFFPLHKDLEFAGLLNPLDAPHHLRMSND 182

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
             FREG+T+ +N     G+ V+VGL + V  D++L+PG+RVTV +            ++V
Sbjct: 183 FQFREGITMNKNVKPGKGSQVNVGLLQDVSTDKLLNPGIRVTVKLLPLTEGKKKLKGKIV 242

Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
             S P+ E+G+YWGY VR A N+S VF  C YK GYD  IGTS+ G  ++        + 
Sbjct: 243 SLSTPRAETGVYWGYTVRIANNLSQVFTQCPYKDGYDLTIGTSDKGTPIDDLPNKEVRYN 302

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           H LI FGGL G+E ++E D+ L+   A  +F+ Y N  P+QGSRTIRTEEAI I++   Q
Sbjct: 303 HALIVFGGLHGIEAALESDEQLQVDEASLLFNHYVNVLPNQGSRTIRTEEAILIAMSCLQ 362


>gi|351697028|gb|EHA99946.1| hypothetical protein GW7_01906 [Heterocephalus glaber]
          Length = 377

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 143/304 (47%), Positives = 185/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +         S     
Sbjct: 75  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQEAK----SVEGEF 130

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E   F
Sbjct: 131 RGVGKKGQACVQLARVLQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQEEECEF 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R  V  S
Sbjct: 191 REGVVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLKQQQLPESKTYRGTVVSS 250

Query: 242 K-PK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
           K P+  +G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 251 KEPRTRAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPGFRHA 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVSEPSVLFDLYINTCPGQGSRTIRTEEAILISLAALQPS 368

Query: 360 ISRA 363
           +++A
Sbjct: 369 LTQA 372


>gi|348524370|ref|XP_003449696.1| PREDICTED: uncharacterized protein C9orf114 homolog [Oreochromis
           niloticus]
          Length = 379

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 140/302 (46%), Positives = 182/302 (60%), Gaps = 8/302 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+PGSI+DN QS EL T LAGQIARA  +F +DE+VVFD +           +   
Sbjct: 72  TVSLALPGSILDNAQSPELRTYLAGQIARACVVFSVDEIVVFDERGEDVKSVEGEFSGVG 131

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            R      G   L R+LQYLE PQYLRKA F  H  L++ G+L PLD+PHH+R  E + +
Sbjct: 132 KRG----HGCIQLARILQYLECPQYLRKAFFPKHQDLQYAGLLNPLDSPHHMRIDEESDY 187

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
           REGV +      + G+ VD G+ K + +D+ L PG+RVTV +   +N +    +  VV P
Sbjct: 188 REGVVVDRPTKQNKGSFVDCGMRKEIQIDKQLQPGLRVTVRLSKTQNPEGKVYKGVVVAP 247

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ E G YWGY VR A  +S+VF    +K GYD  IGTSE G   N+    L  F+HL
Sbjct: 248 HVPRTEGGYYWGYSVRLASCLSAVFTESPFKEGYDLTIGTSERGS--NADQALLTPFKHL 305

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE S++ D  L   +   +F  Y NTCP QGSRTIRTEEAI IS+   +  
Sbjct: 306 LVVFGGLQGLEASLDADQKLDVTDPSVLFDLYLNTCPGQGSRTIRTEEAILISMAVLRPK 365

Query: 360 IS 361
           I+
Sbjct: 366 IT 367


>gi|170042367|ref|XP_001848900.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167865860|gb|EDS29243.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 433

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 140/305 (45%), Positives = 187/305 (61%), Gaps = 3/305 (0%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-RSSAA 118
           + T+S+AVPGSI+DN QS EL   LAGQIARA  IF++DEVVVFD+  S +  + +    
Sbjct: 68  ISTLSIAVPGSILDNAQSPELRACLAGQIARAACIFQVDEVVVFDDCGSGNQVTDKVGTI 127

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
           + +  +         L R+LQYLE PQYLRK  F +H+ L+F G+L PLD+ HHLR+   
Sbjct: 128 DTAEGTTSARRCCVQLARILQYLECPQYLRKFFFPIHNDLKFCGLLAPLDSQHHLRQASE 187

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-V 237
             FREG+   +   N   + V+VGL   V+VD  LD GVRVTV M    +L +   R  V
Sbjct: 188 FEFREGIVTNKPTKNDKSSFVNVGLLNDVLVDTKLDVGVRVTVKMPAGVDLKSKKIRAIV 247

Query: 238 VPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           VPPS+P++ +G+YWGY VR A ++S VF    YKGGYD  IGTS+ G  V   +     +
Sbjct: 248 VPPSQPRQDTGIYWGYTVRIANSLSQVFTKSPYKGGYDLTIGTSDTGTNVQELEPKSLKY 307

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           +H L+ FGG+ GLE +++ D  L       +F  Y NT P QGSRT+RTEEAI ISL   
Sbjct: 308 KHALVVFGGVLGLEPALDNDAKLTVDKVDMLFDEYLNTVPRQGSRTVRTEEAILISLAAL 367

Query: 357 QEPIS 361
            E ++
Sbjct: 368 GEKLA 372


>gi|305682577|ref|NP_766248.3| uncharacterized protein C9orf114 homolog [Mus musculus]
 gi|109861106|sp|Q3UHX9.1|CI114_MOUSE RecName: Full=Uncharacterized protein C9orf114 homolog
 gi|74151214|dbj|BAE27727.1| unnamed protein product [Mus musculus]
 gi|148676498|gb|EDL08445.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
           CRA_d [Mus musculus]
          Length = 385

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +         S     
Sbjct: 75  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 130

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 131 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+LV+ G+ K V +D+ LDPG+RVTV +   +  +  + +  VV  
Sbjct: 191 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 250

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 251 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE +++ D  L+  +   +F  Y NTC  QGSRTIRTEEAI ISL   Q  
Sbjct: 309 LVVFGGLQGLEAAVDADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPG 368

Query: 360 ISRALRR 366
           +++   R
Sbjct: 369 LTQVGSR 375


>gi|148676495|gb|EDL08442.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
           CRA_b [Mus musculus]
 gi|148676497|gb|EDL08444.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
           CRA_b [Mus musculus]
          Length = 379

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +         S     
Sbjct: 69  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 124

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 125 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 184

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+LV+ G+ K V +D+ LDPG+RVTV +   +  +  + +  VV  
Sbjct: 185 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 244

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 245 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 302

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE +++ D  L+  +   +F  Y NTC  QGSRTIRTEEAI ISL   Q  
Sbjct: 303 LVVFGGLQGLEAAVDADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPG 362

Query: 360 ISRALRR 366
           +++   R
Sbjct: 363 LTQVGSR 369


>gi|417410219|gb|JAA51586.1| Hypothetical protein, partial [Desmodus rotundus]
          Length = 379

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 186/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF ++E+VVFD +               
Sbjct: 77  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVEEIVVFDEEGQDAKTVEGEFRGVG 136

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 137 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQEEESEF 192

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  +  + R +VV  
Sbjct: 193 REGVVVDRPTRPGRGSFVNCGMKKEVQIDKNLEPGLRVTVRLNQEQLPECKTYRGKVVSS 252

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 253 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 310

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 311 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYLNTCPGQGSRTIRTEEAILISLAALQPS 370

Query: 360 ISRA 363
           +++A
Sbjct: 371 LTQA 374


>gi|296190958|ref|XP_002743410.1| PREDICTED: uncharacterized protein C9orf114 [Callithrix jacchus]
 gi|166183804|gb|ABY84165.1| hypothetical protein [Callithrix jacchus]
          Length = 376

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 142/304 (46%), Positives = 185/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGQFIGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  +  + R +VV  
Sbjct: 190 REGVVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPECKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYINTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           + +A
Sbjct: 368 LIQA 371


>gi|148676496|gb|EDL08443.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
           CRA_c [Mus musculus]
          Length = 323

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 189/307 (61%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +         S     
Sbjct: 13  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 68

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 69  RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 128

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+LV+ G+ K V +D+ LDPG+RVTV +   +  +  + +  VV  
Sbjct: 129 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 188

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 189 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 246

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE +++ D  L+  +   +F  Y NTC  QGSRTIRTEEAI ISL   Q  
Sbjct: 247 LVVFGGLQGLEAAVDADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPG 306

Query: 360 ISRALRR 366
           +++   R
Sbjct: 307 LTQVGSR 313


>gi|311246579|ref|XP_003122255.1| PREDICTED: uncharacterized protein C9orf114-like [Sus scrofa]
          Length = 377

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 141/304 (46%), Positives = 187/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 75  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 134

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 135 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEDSEF 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R +VV  
Sbjct: 191 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVWLNQKQLPESKTYRGKVVSS 250

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 251 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE +++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 309 LVVFGGLQGLEAAVDADPNLEVAEPSVLFDLYVNTCPSQGSRTIRTEEAILISLAALQPG 368

Query: 360 ISRA 363
           +++A
Sbjct: 369 LTQA 372


>gi|449477941|ref|XP_002194797.2| PREDICTED: uncharacterized protein C9orf114 homolog [Taeniopygia
           guttata]
          Length = 379

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 142/306 (46%), Positives = 187/306 (61%), Gaps = 8/306 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI++N QS  L T LAGQIARA TIF +DE+VVFD                 
Sbjct: 75  TLSVALPGSILNNAQSPALRTYLAGQIARACTIFCVDEIVVFDEHGEDVKTVEGDFEGIG 134

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRK+ F  H  L+F G+L PLD+PHH+R  E + +
Sbjct: 135 KKG----KACVQLARILQYLECPQYLRKSFFPKHEDLQFAGLLNPLDSPHHMRMDEDSEY 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L   A    G+ V+ G+ K V +D+ L+PG+RVTV +   +N +A   +  VV  
Sbjct: 191 REGVVLDRPAKPGRGSFVNCGMRKEVQIDKQLNPGLRVTVRLNEPQNPEAKVRKGTVVSS 250

Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+  SG+YWGY VR A  +S+VF  C +K GYD  IGTSE G  V+   +TLP+FRH 
Sbjct: 251 HHPRTVSGLYWGYSVRLASCLSAVFSECPFKEGYDLSIGTSERGSSVDQ--VTLPSFRHA 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+  +   +F  Y NTCP QGSRTIRTEEA+ ISL   +  
Sbjct: 309 LVVFGGLEGLEAGVDVDPNLEVTDPSVLFDFYLNTCPSQGSRTIRTEEALLISLSALRPH 368

Query: 360 ISRALR 365
           +  A++
Sbjct: 369 MEEAVK 374


>gi|332230194|ref|XP_003264272.1| PREDICTED: uncharacterized protein C9orf114 homolog [Nomascus
           leucogenys]
          Length = 376

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 141/304 (46%), Positives = 185/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACIQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ LDPG+RVTV +   +  +  + R +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLDPGLRVTVRLNQQQLPECKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           +++A
Sbjct: 368 LTQA 371


>gi|395506105|ref|XP_003757376.1| PREDICTED: uncharacterized protein C9orf114 homolog [Sarcophilus
           harrisii]
          Length = 374

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 188/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI++N QSLEL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 71  TLSVALPGSILNNAQSLELRTYLAGQIARACAIFCVDEIVVFDEEGEDVKSVEGEFVGIG 130

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +       +  L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E   +
Sbjct: 131 KKGQ----ASVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQEEVLEY 186

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ L        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ S + +VV  
Sbjct: 187 REGIVLHRPTRPGRGSFVNCGMKKEVQIDKNLEPGLRVTVRLDQQQIPESKSLKGRVVSS 246

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
            +P+  +G+YWGY VR A  +S+VF    +K GYD  IGTSE G+  + + + LP F H 
Sbjct: 247 QEPRTHAGLYWGYSVRLASCLSAVFAEGPFKDGYDLTIGTSERGE--DLASVQLPNFNHA 304

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           ++ FGGL GLE S+E D  L+  +   +F  Y NTCP QGSRTIRTEEAI ISL   + P
Sbjct: 305 IVVFGGLQGLEASVEADPNLEVTDPSVLFDFYLNTCPGQGSRTIRTEEAILISLATLRPP 364

Query: 360 ISRA 363
           I  A
Sbjct: 365 IIAA 368


>gi|410903718|ref|XP_003965340.1| PREDICTED: uncharacterized protein C9orf114 homolog [Takifugu
           rubripes]
          Length = 381

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 138/305 (45%), Positives = 183/305 (60%), Gaps = 8/305 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVAVPGS++DN QS EL T LAGQIARA  +F +DE++VFD     D     S     
Sbjct: 72  TVSVAVPGSVLDNAQSPELRTYLAGQIARACVVFCVDEIIVFDE----DGEELKSVEGEF 127

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +  +    L R+LQYLE PQYLRK  F  H  L++ G+L PLD+PHH+R  + + +
Sbjct: 128 KGVGKKGNACIQLARILQYLECPQYLRKWFFPKHHDLQYAGLLNPLDSPHHMRTDDESEY 187

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-VVPP 240
           REG+ L        G+LV+ G+ K V +D+ L  G+RVTV +    + ++   +  VV P
Sbjct: 188 REGIVLDRPTKQGQGSLVNCGMRKDVRIDKQLQSGLRVTVQLSQTHSSESKHCKGVVVAP 247

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ E G+YWGY VR A  +S+VF    +K GYD  IGTSE G+  N    T+P F+HL
Sbjct: 248 HVPRTEGGLYWGYSVRLASCLSAVFTESQFKEGYDVTIGTSERGN--NLDKATVPPFKHL 305

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE S++ D  L   +   +F  Y NTCP QGSRTIRTEEAI +S+   +  
Sbjct: 306 LVVFGGLQGLEASVDADQNLDVTDPSVLFDLYLNTCPSQGSRTIRTEEAILVSMSGLRPK 365

Query: 360 ISRAL 364
           I  A 
Sbjct: 366 IEAAF 370


>gi|403298554|ref|XP_003940082.1| PREDICTED: uncharacterized protein C9orf114 homolog [Saimiri
           boliviensis boliviensis]
          Length = 376

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 185/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSIALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGQFTGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  +  + R +VV  
Sbjct: 190 REGVVVDRPTRPGHGSFVNCGMKKEVKIDRNLEPGLRVTVRLNQQQLPECKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           +++A
Sbjct: 368 LTQA 371


>gi|395844433|ref|XP_003794966.1| PREDICTED: uncharacterized protein C9orf114 homolog [Otolemur
           garnettii]
          Length = 376

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 139/304 (45%), Positives = 186/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  +  + R +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQKQLPECKTYRGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   +P+FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--IPSFRHG 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           +++A
Sbjct: 368 LTQA 371


>gi|198436996|ref|XP_002131245.1| PREDICTED: similar to Uncharacterized protein C9orf114 [Ciona
           intestinalis]
          Length = 361

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 141/298 (47%), Positives = 188/298 (63%), Gaps = 14/298 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS--DNYSRSSAAN 119
           TVSVA+PGSI+DN QS EL T LAGQIARA  +F++DE++VFD  +SS  D+ ++     
Sbjct: 60  TVSVALPGSILDNAQSPELRTYLAGQIARACVVFQVDEIIVFDESASSGKDSSTQGDFTG 119

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
              + + N      L R+LQYLE PQYLRK+ F  H  L++ G+L PLD PHH+R+ + A
Sbjct: 120 VGKKGNCNIQ----LARILQYLECPQYLRKSFFPKHKDLQYAGVLNPLDCPHHMREQDNA 175

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
            FREGV +        G+ VD G  K V +D+VL P +RVTV +   +        +VVP
Sbjct: 176 EFREGVVVDRPIKPGKGSFVDCGTLKTVQIDKVLQPNIRVTVKIKEQKKKYILG--EVVP 233

Query: 240 PSKPK-ESGMYWGYKVRYAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLPTF 296
           P  P+ + G+YWGY VR A ++SS   +C Y   G YD  IGTS+ G+ V  +D  +PTF
Sbjct: 234 PHYPRTKGGLYWGYNVRLANSLSSAIVDCPYSDTGKYDITIGTSDKGEPV--ADFKMPTF 291

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRK-NAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +HLLI FGGL GLE  +E D+ L        +F  Y NTCP+QGSRTIRTEEAI +++
Sbjct: 292 KHLLIVFGGLQGLEAGLEADEALGGDIELSLLFDKYLNTCPNQGSRTIRTEEAILVTM 349


>gi|348569715|ref|XP_003470643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           C9orf114-like [Cavia porcellus]
          Length = 377

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 142/304 (46%), Positives = 185/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DEVVVFD +         S     
Sbjct: 75  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEVVVFDEEGQEAK----SVEGEF 130

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 131 RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQDEESEF 190

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  +  + R  VV  
Sbjct: 191 REGVVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPECKTYRGTVVSS 250

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
            +P+  +G+YWGY  R A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 251 QEPRTRAGLYWGYTXRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 308

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 309 LVVFGGLQGLEAGVDADPNLEVSEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 368

Query: 360 ISRA 363
           +++A
Sbjct: 369 LTQA 372


>gi|46255718|gb|AAH10579.1| Chromosome 9 open reading frame 114 [Homo sapiens]
 gi|55777177|gb|AAH46133.1| Chromosome 9 open reading frame 114 [Homo sapiens]
          Length = 376

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 184/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   ++ D  +   +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE   + D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           +++A
Sbjct: 368 LTQA 371


>gi|26337855|dbj|BAC32613.1| unnamed protein product [Mus musculus]
          Length = 345

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 142/307 (46%), Positives = 188/307 (61%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +         S     
Sbjct: 35  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 90

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 91  RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 150

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+LV+ G+ K V +D+ LDPG+RVTV +   +  +  + +  VV  
Sbjct: 151 REGVVVDRPTKAGHGSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSS 210

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++ +YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 211 QDPRTKACLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHA 268

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE +++ D  L+  +   +F  Y NTC  QGSRTIRTEEAI ISL   Q  
Sbjct: 269 LVVFGGLQGLEAAVDADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPG 328

Query: 360 ISRALRR 366
           +++   R
Sbjct: 329 LTQVGSR 335


>gi|196000250|ref|XP_002109993.1| hypothetical protein TRIADDRAFT_53475 [Trichoplax adhaerens]
 gi|190588117|gb|EDV28159.1| hypothetical protein TRIADDRAFT_53475 [Trichoplax adhaerens]
          Length = 373

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 140/296 (47%), Positives = 196/296 (66%), Gaps = 12/296 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD--NKSSSDNYSRSSAAN 119
           TVSVAVPGSI++N QS EL T LAGQIARA+ IF +DEVV+FD   K+ +++    +   
Sbjct: 69  TVSVAVPGSILNNAQSPELRTYLAGQIARALVIFEVDEVVIFDESGKTVNEDVGEFTGVT 128

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
           R +  D N+     L  +LQYLE PQYLRK  F  H  LR+ G+L PL++ HH+   E  
Sbjct: 129 RKDY-DPNK----VLANILQYLECPQYLRKFFFPKHPDLRYAGLLNPLNSTHHMLIDEDV 183

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVV 238
           P+REGVTL +      G+LV+ G+ K V +D+ L  G+RVTV + + ++ D+ + +  VV
Sbjct: 184 PYREGVTL-DRPVGKKGSLVNCGMRKEVRIDRQLQSGLRVTVKLDSEQSSDSKTYKGTVV 242

Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
               P+ ++G+YWGY+VR A ++ +    C Y+GGYD  IGTS+ GD  +  +L L +F+
Sbjct: 243 SADAPRMDAGIYWGYRVRVARSLGAAITECPYEGGYDLSIGTSDGGD--SMEELNLSSFK 300

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           H+LIAFGGL+GLE ++E D+ L+  + R +F  Y NTCP QGSR IRTEEA+ ISL
Sbjct: 301 HILIAFGGLSGLEAALESDESLRLSDIRLLFDYYLNTCPGQGSRIIRTEEAMLISL 356


>gi|119608245|gb|EAW87839.1| chromosome 9 open reading frame 114, isoform CRA_b [Homo sapiens]
          Length = 437

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 141/307 (45%), Positives = 185/307 (60%), Gaps = 8/307 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 135 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 194

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 195 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 250

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   ++ D  +   +VV  
Sbjct: 251 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 310

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 311 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 368

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE   + D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 369 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 428

Query: 360 ISRALRR 366
           +++A  R
Sbjct: 429 LTQAGAR 435


>gi|402896354|ref|XP_003911267.1| PREDICTED: uncharacterized protein C9orf114 homolog [Papio anubis]
          Length = 514

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 186/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 212 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 271

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 272 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 327

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  +  + R +VV  
Sbjct: 328 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPECKTYRGKVVSS 387

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 388 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAH--LPSFRHA 445

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 446 LVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 505

Query: 360 ISRA 363
           +++A
Sbjct: 506 LTQA 509


>gi|390337040|ref|XP_792707.2| PREDICTED: uncharacterized protein C9orf114-like
           [Strongylocentrotus purpuratus]
          Length = 537

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 191/312 (61%), Gaps = 22/312 (7%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSI+DN QS EL T LAGQIARA+ IF IDE+++FD          S AA R 
Sbjct: 233 TLSIALPGSILDNAQSPELRTYLAGQIARALVIFNIDEIIIFD---------ESGAAARE 283

Query: 122 NRSDENESGAA-------FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
           N +D   SG          L R+LQYLE PQYLRK+ F  H+ L++ G+L PLD PHH+R
Sbjct: 284 N-TDGVYSGIGKKGNANVQLARILQYLECPQYLRKSFFPKHNDLQYAGLLNPLDCPHHVR 342

Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD-S 233
             E   +REGV +        G+ V+VGL K V +D+ L   +RVTV M T + +    +
Sbjct: 343 ADEEIAYREGVVVDRPVAKGRGSHVNVGLYKEVEIDKKLQAQLRVTVKMNTKKGISPKHA 402

Query: 234 PRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYK-GGYDHLIGTSEHGDIVNSSDL 291
           P +VV PS P+ E+G+YWGY VR A ++ +VF  C +   GYD  IGTSE G    + D+
Sbjct: 403 PCKVVSPSAPRLEAGLYWGYSVRLASSLGAVFTGCPFSDNGYDLTIGTSERG--TKADDV 460

Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
            LP F+H LI FGG+ GLE S+E D+ L   +   +F  Y N CP QGSRTIRTEEA+ I
Sbjct: 461 DLPHFKHALIIFGGVHGLELSLETDEELLVSDPAVLFDHYINVCPDQGSRTIRTEEAVLI 520

Query: 352 SLQYFQEPISRA 363
           ++   +  I  A
Sbjct: 521 TMATLRPKILSA 532


>gi|397503802|ref|XP_003822507.1| PREDICTED: uncharacterized protein C9orf114 homolog [Pan paniscus]
          Length = 438

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 136 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 195

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 196 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 251

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  D  +   +VV  
Sbjct: 252 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPDCKTYHGKVVSS 311

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 312 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 369

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE   + D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 370 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 429

Query: 360 ISRA 363
           +++A
Sbjct: 430 LTQA 433


>gi|383850844|ref|XP_003700984.1| PREDICTED: uncharacterized protein C9orf114 homolog [Megachile
           rotundata]
          Length = 376

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 190/308 (61%), Gaps = 10/308 (3%)

Query: 59  EVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
           +V T+S+AVPGSI+DN QS EL T LAGQIARA  +++IDE+VVFD+K       +    
Sbjct: 57  DVSTISIAVPGSILDNAQSPELRTYLAGQIARAACVYQIDEIVVFDDKGEITEDEKKKVR 116

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
           N      E   G   L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR+ + 
Sbjct: 117 N-DEVLGERRIGCLQLARILQYLECPQYLRKYFFPLHKDLQYAGVLNPLDTPHHLRQSDV 175

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV- 237
           + +REGV   +      G+ V+VGL  +V VD+VL  G+RVTV +      D  +P+++ 
Sbjct: 176 SLYREGVITNKPVKGDKGSYVNVGLLNNVHVDKVLTAGLRVTVKIPP----DQPNPKKIK 231

Query: 238 ---VPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
              VPP+ P+ ESG+YWGY VR A N++       Y+ GYD  IGTS+ G  V+  ++  
Sbjct: 232 GFIVPPNIPRAESGIYWGYNVRLANNLTEALTQGPYEDGYDLTIGTSDKGKSVDDIEVKS 291

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             + H LI FGGL+GLE +++ D  L   +   +F  Y NTCP QGSRTIRTEEAI I+L
Sbjct: 292 IKYHHALIVFGGLSGLEAALDGDPSLDVDDPSLIFDKYLNTCPQQGSRTIRTEEAILITL 351

Query: 354 QYFQEPIS 361
              +  +S
Sbjct: 352 AELRTKLS 359


>gi|38679912|ref|NP_057474.2| uncharacterized protein C9orf114 [Homo sapiens]
 gi|126302532|sp|Q5T280.3|CI114_HUMAN RecName: Full=Uncharacterized protein C9orf114
          Length = 376

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   ++ D  +   +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE   + D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           + +A
Sbjct: 368 LIQA 371


>gi|332832960|ref|XP_520300.3| PREDICTED: uncharacterized protein C9orf114 homolog [Pan
           troglodytes]
 gi|410221478|gb|JAA07958.1| chromosome 9 open reading frame 114 [Pan troglodytes]
 gi|410221480|gb|JAA07959.1| chromosome 9 open reading frame 114 [Pan troglodytes]
 gi|410251074|gb|JAA13504.1| chromosome 9 open reading frame 114 [Pan troglodytes]
 gi|410251076|gb|JAA13505.1| chromosome 9 open reading frame 114 [Pan troglodytes]
 gi|410297534|gb|JAA27367.1| chromosome 9 open reading frame 114 [Pan troglodytes]
 gi|410297536|gb|JAA27368.1| chromosome 9 open reading frame 114 [Pan troglodytes]
 gi|410337681|gb|JAA37787.1| chromosome 9 open reading frame 114 [Pan troglodytes]
 gi|410337683|gb|JAA37788.1| chromosome 9 open reading frame 114 [Pan troglodytes]
          Length = 376

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  D  +   +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQLPDCKTYHGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE   + D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           +++A
Sbjct: 368 LTQA 371


>gi|224000970|ref|XP_002290157.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973579|gb|EED91909.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
          Length = 328

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 139/323 (43%), Positives = 197/323 (60%), Gaps = 36/323 (11%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
           PTVS+A+PGS++ N Q+ EL T+LAGQIARA  ++R+DE+VVFD+   S      S   +
Sbjct: 12  PTVSIAIPGSVVSNAQTRELQTQLAGQIARAAAVYRVDEIVVFDDGLGS------SLKTQ 65

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
                 +    AFL R+LQY E PQYLR+  F MH  L+F G+LPPLDAPHHLR+ ++A 
Sbjct: 66  PPHMQPSTDPHAFLARILQYCECPQYLRRKFFPMHPDLQFAGLLPPLDAPHHLRRGDFAA 125

Query: 181 FREGVTLKENA-------PNSVGTLVDVGL-NKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
           FREGV ++  +           G+ VD G+ N+ V +D+ + PG+R TV + T   L   
Sbjct: 126 FREGVVVERKSGDGGKGGDEDGGSFVDCGVPNRLVQIDRDITPGIRCTVKLDTKSYLTKK 185

Query: 233 SPRQ--VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI-VNS 288
              +   V P+ P+ E GMYWGY  R A +I+ +F+ C Y+GGYD  +GTSE GDI ++ 
Sbjct: 186 GVMEGKAVSPTTPRDEEGMYWGYTTRLASSINGIFEECPYEGGYDLKVGTSERGDISIDD 245

Query: 289 SDLTLPT----------------FRHLLIAFGGLAGLEESIEEDDG--LKRKNAREVFHS 330
            +  L                  F HLLI FGG+AG+EE ++ D+   L  ++++++F  
Sbjct: 246 INFGLKKKRQYTKSKNKSGGDERFSHLLIVFGGVAGIEECVDADESMTLPGEDSKKMFDV 305

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + N CP+QGSRTIRTEEA+ I+L
Sbjct: 306 WVNICPYQGSRTIRTEEAVLITL 328


>gi|47219853|emb|CAF97123.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 394

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 141/306 (46%), Positives = 181/306 (59%), Gaps = 8/306 (2%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
           PTVSVAVPGS++DN QS EL + LAGQIARA  +F +DE+VVFD     D     S    
Sbjct: 71  PTVSVAVPGSVLDNAQSPELRSYLAGQIARACVVFCVDEIVVFDE----DGEDLKSVEGE 126

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
                +  +    L R+LQYLE PQYLRK  F  H  L++ G+L PLD+PHH+R  +   
Sbjct: 127 FKGVGKKGNACIQLARILQYLECPQYLRKWFFPKHQDLQYAGLLNPLDSPHHMRTDDECE 186

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPRQVVP 239
           +REGV L        G+LV+ G+ K V +D+ L  G+RVTV +G T+   +      VV 
Sbjct: 187 YREGVVLDRPTKAGQGSLVNCGMRKDVRIDKQLQAGLRVTVRLGPTHSPENKHCKGVVVA 246

Query: 240 PSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
           P  P+ E G+YWGY VR A  +S+VF    +K GYD  IGTSE G   N     +P F+H
Sbjct: 247 PHVPRTEGGLYWGYSVRLASCLSAVFTESQFKDGYDVTIGTSERG--TNLDQTAVPPFKH 304

Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           LL+ FGGL GLE S++ D  L   +   +F  Y NTCP QGSRTIRTEEAI +S+   + 
Sbjct: 305 LLVVFGGLQGLEASVDADPNLDVTDPSVLFDLYLNTCPGQGSRTIRTEEAILVSMSGLRP 364

Query: 359 PISRAL 364
            I  A 
Sbjct: 365 KIEAAF 370


>gi|327291021|ref|XP_003230220.1| PREDICTED: uncharacterized protein C9orf114-like [Anolis
           carolinensis]
          Length = 326

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 140/306 (45%), Positives = 185/306 (60%), Gaps = 8/306 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVAVPGSI+DN QS EL T LAGQIAR+  +F +DEVVVFD +               
Sbjct: 19  TLSVAVPGSILDNAQSPELRTYLAGQIARSCAVFSVDEVVVFDERGEDAKTVEGEFGGLK 78

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            R          L R+LQYLE PQYLRK+ F  H  L+F G+L PLD+PHH+R  E + +
Sbjct: 79  ARGQ----ACVQLARILQYLECPQYLRKSFFPKHQDLQFAGLLNPLDSPHHVRIDEVSQY 134

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L        G+ V+ G+ K V +D+ L+ G+RVTV +   +N ++ + +  VV  
Sbjct: 135 REGVVLARPVKPGRGSFVNCGMRKEVQIDKQLEAGLRVTVKLNEQQNPESKTQKGTVVSS 194

Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+  SG+YWGY VR A  +S+VF    +  GYD  +GTSE G  V+ +  TLP FRH 
Sbjct: 195 HHPRTVSGLYWGYTVRLASCLSAVFAESPFTDGYDLSVGTSERGTPVDQA--TLPAFRHA 252

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           LI FGGL GLE +++ D  L+  +   +F  Y NTCP QGSRTIRTEEA+ ISL   +  
Sbjct: 253 LIVFGGLQGLEAAVDADPHLEVSDPSTLFDLYLNTCPRQGSRTIRTEEAMLISLSALRPR 312

Query: 360 ISRALR 365
           I+ A +
Sbjct: 313 IAEAAK 318


>gi|18204644|gb|AAH21273.1| Chromosome 9 open reading frame 114 [Homo sapiens]
 gi|39794258|gb|AAH63644.1| Chromosome 9 open reading frame 114 [Homo sapiens]
 gi|55249540|gb|AAH39590.1| Chromosome 9 open reading frame 114 [Homo sapiens]
 gi|312152042|gb|ADQ32533.1| chromosome 9 open reading frame 114 [synthetic construct]
          Length = 376

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   ++ D  +   +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE   + D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           + +A
Sbjct: 368 LIQA 371


>gi|7512808|pir||T14751 hypothetical protein DKFZp566D143.1 - human (fragment)
          Length = 389

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 141/304 (46%), Positives = 183/304 (60%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 87  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 146

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 147 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 202

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +   ++ D  +   +VV  
Sbjct: 203 REGVVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 262

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 263 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 320

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE   + D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  
Sbjct: 321 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPG 380

Query: 360 ISRA 363
           + +A
Sbjct: 381 LIQA 384


>gi|291232700|ref|XP_002736285.1| PREDICTED: CG12128-like [Saccoglossus kowalevskii]
          Length = 377

 Score =  262 bits (669), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 138/305 (45%), Positives = 184/305 (60%), Gaps = 8/305 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSI+DN QS EL T LAGQIARA  IF++DE+++FD    S            
Sbjct: 77  TISIALPGSILDNAQSPELRTYLAGQIARACVIFKVDEIIIFDEAGFSSESITGVYDGVG 136

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            R + N      + R+LQYLE PQYLRK  F  HS L++ G+L PLD PHH+R  +   F
Sbjct: 137 KRGNANVQ----MARILQYLECPQYLRKHFFPKHSDLQYAGVLNPLDTPHHMRIDDECEF 192

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD-SPRQVVPP 240
           REGV L        G+ V+ GL K + +D+ L  G+RVTV +   +  + +    +VV P
Sbjct: 193 REGVVLDRPVKGGKGSFVNAGLRKEIRIDRQLRDGLRVTVRLDNKQTDEGEYYTGKVVSP 252

Query: 241 SKP-KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P  + G+YWGY VR A ++S VF   SYK GYD  IGTS+ GD ++ +  T   F H 
Sbjct: 253 RCPCTDRGLYWGYTVRLASSLSKVFTESSYKNGYDLTIGTSDSGDDIDHA--TFSKFNHA 310

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE S++ D+ LK     ++F  Y NTCP QGSRTIRTEEAI I++   +  
Sbjct: 311 LVVFGGLKGLEYSLDCDEKLKIHEVNDLFDMYINTCPKQGSRTIRTEEAILITMAALRPR 370

Query: 360 ISRAL 364
           I +AL
Sbjct: 371 IIQAL 375


>gi|334311915|ref|XP_001367954.2| PREDICTED: uncharacterized protein C9orf114-like [Monodelphis
           domestica]
          Length = 422

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 139/304 (45%), Positives = 186/304 (61%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI++N QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 118 TLSVALPGSILNNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGEDVKSIEGEFVGIG 177

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +       +  L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E   +
Sbjct: 178 KKGQ----ASVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHVRQEEVLEY 233

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ L        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ S + +VV  
Sbjct: 234 REGIVLDRPTRPGRGSFVNCGMKKEVQIDKNLEPGLRVTVRLDQQQIPESKSLKGRVVSS 293

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
            +P+  +G+YWGY VR A  +S+VF    YK GYD  IGTSE G+ + S  + LP F H 
Sbjct: 294 QEPRTHAGLYWGYSVRLASCLSAVFAEGPYKDGYDLTIGTSERGEGLAS--IQLPNFNHA 351

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           ++ FGGL GLE  +E D  L+  +   +F  Y NTCP QGSRTIRTEEAI ISL   +  
Sbjct: 352 IVVFGGLQGLEAGVEADPNLEVTDPSVLFDFYLNTCPGQGSRTIRTEEAILISLATLRPH 411

Query: 360 ISRA 363
           I+ A
Sbjct: 412 ITGA 415


>gi|449266744|gb|EMC77760.1| Putative protein C9orf114 like protein, partial [Columba livia]
          Length = 354

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 140/294 (47%), Positives = 180/294 (61%), Gaps = 8/294 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI++N QS EL T LAGQIARA  IF +DE+VVFD                 
Sbjct: 62  TLSVALPGSILNNAQSPELRTYLAGQIARACAIFCVDEIVVFDEHGEDVKTVEGDFEGIG 121

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            R          L R+LQYLE PQYLRK+ F  H  L+F G+L PLD+PHH+R  E + +
Sbjct: 122 RRG----KACVQLARILQYLECPQYLRKSFFPKHEDLQFAGLLNPLDSPHHMRVDEDSEY 177

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L        G+ V+ G+ K V +D+ L+PG+RVTV +   +  +A   +  VV  
Sbjct: 178 REGVVLDRPTKPGRGSFVNCGMRKEVQIDRQLNPGLRVTVRLEEPQKPEAKVRKGTVVSS 237

Query: 241 SKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+  SG+YWGY VR A  +S+VF  C +K GYD  IGTSE G  V+ +  TLP+FRH 
Sbjct: 238 HHPRTVSGLYWGYSVRLASCLSAVFSECPFKEGYDLSIGTSERGSSVDQA--TLPSFRHA 295

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           L+ FGGL GLE  ++ D  L+  +   +F  Y NTCP QGSRTIRTEEA+ ISL
Sbjct: 296 LVVFGGLEGLEAGVDVDPNLEVTDPSVLFDFYLNTCPGQGSRTIRTEEALLISL 349


>gi|390366072|ref|XP_003730960.1| PREDICTED: uncharacterized protein C9orf114-like
           [Strongylocentrotus purpuratus]
          Length = 492

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 144/309 (46%), Positives = 189/309 (61%), Gaps = 22/309 (7%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSI+DN QS EL T LAGQIARA+ IF IDE+++FD          S AA R 
Sbjct: 188 TLSIALPGSILDNAQSPELRTYLAGQIARALVIFNIDEIIIFD---------ESGAAARD 238

Query: 122 NRSDENESGAA-------FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
           N +D   SG          L R+LQYLE PQYLRK+ F  H+ L++ G+L PLD PHH+R
Sbjct: 239 N-TDGVYSGIGKKGNANVQLARILQYLECPQYLRKSFFPKHNDLQYAGLLNPLDCPHHVR 297

Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD-S 233
             E   +REGV +        G+ V+VGL K V +D+ L   +RVTV M T +      +
Sbjct: 298 ADEEIAYREGVVVDRPVAKGRGSHVNVGLYKEVEIDKKLQAQLRVTVKMNTKKGTPTKHA 357

Query: 234 PRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYK-GGYDHLIGTSEHGDIVNSSDL 291
           P +VV PS P+ E+G+YWGY VR A ++ +VF  C +   GYD  IGTSE G    + D+
Sbjct: 358 PCKVVSPSAPRLEAGLYWGYSVRLASSLGAVFTGCPFSDNGYDLTIGTSERG--TKADDV 415

Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
            LP F+H LI FGG+ GLE S+E D+ L   +   +F  Y N CP QGSRTIRTEEA+ I
Sbjct: 416 DLPHFKHALIIFGGVQGLELSLEMDEELLVSDPGVLFDHYINVCPDQGSRTIRTEEAVLI 475

Query: 352 SLQYFQEPI 360
           ++   +  I
Sbjct: 476 TMATLRPKI 484


>gi|355736575|gb|AES12043.1| hypothetical protein [Mustela putorius furo]
          Length = 356

 Score =  257 bits (656), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 134/287 (46%), Positives = 177/287 (61%), Gaps = 8/287 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +               
Sbjct: 76  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDAKTVEGEFRGVG 135

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 136 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 191

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + R +VV  
Sbjct: 192 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVQLNQKQLPESKTYRGKVVSS 251

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP+FRH 
Sbjct: 252 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVTSAQ--LPSFRHA 309

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
           L+ FGGL GLE  +E D  L+      +F  Y NTCP QGSRTIRTE
Sbjct: 310 LVVFGGLQGLEAGVEADPNLEVAEPSVLFDLYVNTCPSQGSRTIRTE 356


>gi|46255825|gb|AAH33677.1| Chromosome 9 open reading frame 114 [Homo sapiens]
          Length = 376

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 139/304 (45%), Positives = 182/304 (59%), Gaps = 8/304 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 134 KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 189

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   ++ D  +   +VV  
Sbjct: 190 REGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYHGKVVSS 249

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+   LP FRH 
Sbjct: 250 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPNFRHA 307

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEP 359
           L+ FGGL GLE   + D  L+      +F  Y NT P QGSRTIRTEEAI ISL   Q  
Sbjct: 308 LVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTYPGQGSRTIRTEEAILISLAALQPG 367

Query: 360 ISRA 363
           + +A
Sbjct: 368 LIQA 371


>gi|195150735|ref|XP_002016306.1| GL10562 [Drosophila persimilis]
 gi|194110153|gb|EDW32196.1| GL10562 [Drosophila persimilis]
          Length = 501

 Score =  254 bits (648), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 189/314 (60%), Gaps = 11/314 (3%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSDNYSRS 115
           T+S+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+      + +  +Y + 
Sbjct: 74  TLSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDVGVATARETKRSYEQD 133

Query: 116 SAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRK 175
           +  N +       S +  L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR+
Sbjct: 134 AEGNATGTV---RSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQ 190

Query: 176 HEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
                +REGV   + A +   +  +VGL   V+VD+ ++PGVRVTV M    +       
Sbjct: 191 QSKFRYREGVISDKKAKDG-QSYANVGLLNDVLVDKAIEPGVRVTVKMDPASDTSRKQRG 249

Query: 236 QVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP 294
            +V P +P+ E+G+YWGY+VR A ++S +F    Y+  YD  +GTS+ G  V+       
Sbjct: 250 TLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYESSYDVTLGTSDRGTNVHEVPNRSI 309

Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
            ++HLLI FGGL GLEE++  DD L   +   +F  Y N  P QGSRTIRTEEA+ I+L 
Sbjct: 310 QYKHLLIVFGGLQGLEEALANDDKLTVDDPEMLFDHYVNVLPRQGSRTIRTEEALLIALA 369

Query: 355 YFQEPISRALRRVE 368
             QE +   L  VE
Sbjct: 370 SLQEKLQPELADVE 383


>gi|195120365|ref|XP_002004699.1| GI20065 [Drosophila mojavensis]
 gi|193909767|gb|EDW08634.1| GI20065 [Drosophila mojavensis]
          Length = 491

 Score =  253 bits (647), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 134/310 (43%), Positives = 188/310 (60%), Gaps = 12/310 (3%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSDNYSRS 115
           TVS+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+      + +   Y + 
Sbjct: 69  TVSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDVGVTTARETKRIYEQD 128

Query: 116 SAANRSNRSDEN----ESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
            A N  N + +      S +  L R+LQYLE PQYLRK  F +H  L++ G+L PLD PH
Sbjct: 129 DADNNGNGAGKATSTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPH 188

Query: 172 HLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDA 231
           HLR+     +REGV   + A     +  +VGL   V++D+ L+PGVRVTV M  + +   
Sbjct: 189 HLRQQSKFRYREGVVCDKQAKEG-HSYANVGLLNDVLIDKALEPGVRVTVKMEPSSDGSR 247

Query: 232 DSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
                +V P +P+ E+G+YWGY+VR A ++S +F    Y GGYD ++GTS+ G  V +  
Sbjct: 248 KQRGTLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYDGGYDVMVGTSDRGTSVQNVP 307

Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
                + H LI FGGL GLEE++  D+ L   + + +F  Y N  P QGSRTIRTEEA+ 
Sbjct: 308 NQSIGYNHALIVFGGLQGLEEALANDEKLTVDDPQLLFDHYINVLPKQGSRTIRTEEALL 367

Query: 351 ISLQYFQEPI 360
           I++   QE +
Sbjct: 368 IAMAALQEKL 377


>gi|125808292|ref|XP_001360698.1| GA11422 [Drosophila pseudoobscura pseudoobscura]
 gi|54635870|gb|EAL25273.1| GA11422 [Drosophila pseudoobscura pseudoobscura]
          Length = 501

 Score =  253 bits (647), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 190/314 (60%), Gaps = 11/314 (3%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSDNYSRS 115
           T+S+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+      + +  +Y + 
Sbjct: 74  TLSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDVGVATARETKRSYEQD 133

Query: 116 SAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRK 175
           +  N +       S +  L R+LQYLE PQYLRK  F +H+ L++ G+L PLD PHHLR+
Sbjct: 134 AEGNATGTV---RSSSLQLARILQYLECPQYLRKYFFPLHNDLKYSGLLNPLDTPHHLRQ 190

Query: 176 HEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
                +REGV   + A +   +  +VGL   V+VD+ ++PGVRVTV M    +       
Sbjct: 191 QSKFRYREGVISDKKAKDG-QSYANVGLLNDVLVDKAIEPGVRVTVKMDPASDTSRKQRG 249

Query: 236 QVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP 294
            +V P +P+ E+G+YWGY+VR A ++S +F    Y+  YD  +GTS+ G  V+       
Sbjct: 250 TLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYESSYDVTLGTSDRGTNVHEVPNRSI 309

Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
            ++HLLI FGGL GLEE++  DD L   +   +F  Y N  P QGSRTIRTEEA+ I+L 
Sbjct: 310 QYKHLLIVFGGLQGLEEALANDDKLTVDDPEMLFDHYVNVLPRQGSRTIRTEEALLIALA 369

Query: 355 YFQEPISRALRRVE 368
             QE +   L  VE
Sbjct: 370 SLQEKLQPELADVE 383


>gi|149418281|ref|XP_001505833.1| PREDICTED: uncharacterized protein C9orf114-like [Ornithorhynchus
           anatinus]
          Length = 405

 Score =  250 bits (638), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 133/287 (46%), Positives = 182/287 (63%), Gaps = 8/287 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI++N QS EL T LAGQIARA  IF +DE+VVFD +         S   + 
Sbjct: 82  TLSVALPGSILNNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGEDSK----SVEGKF 137

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
           +   +       L R+LQYLE PQYLRK+ F  H  L+F G+L PLD PHH+R+ E + +
Sbjct: 138 DGIGKKGRATVQLARILQYLECPQYLRKSFFPKHQDLQFAGLLNPLDGPHHVRQDEDSEY 197

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV L      + G+ V+ G+ K V +D+ L+PG+RVTV +   +  ++ + + +VV  
Sbjct: 198 REGVVLARPFKPARGSFVNCGMRKEVQIDKQLEPGLRVTVRLDRQQIPESKNHKGRVVSS 257

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
            +P+ E+G+YWGY VR A  +S+VF    +K GYD  IGTSE G  ++S+   LP FRH 
Sbjct: 258 QEPRTEAGLYWGYSVRLASCLSAVFAEVPFKDGYDLTIGTSERGGDLDSA--RLPDFRHA 315

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
           LI FGGL GLE +++ D  L+      +F  Y NTCP+QGSRTIRTE
Sbjct: 316 LIVFGGLQGLEAAVDADPNLEVVEPSVLFDLYLNTCPNQGSRTIRTE 362


>gi|307111555|gb|EFN59789.1| hypothetical protein CHLNCDRAFT_133484 [Chlorella variabilis]
          Length = 319

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 137/312 (43%), Positives = 178/312 (57%), Gaps = 45/312 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+AV  S I+N Q+LELAT LAGQIARA  IF +DEVVV D+  +      SSAA   
Sbjct: 38  TVSIAVAASCIENAQNLELATLLAGQIARAAAIFNVDEVVVLDDSPAKQPGHVSSAA--- 94

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                     A   R+LQ++ETPQYL+KAL  MH  L++ G+LPPLDAPHHLR  EW P+
Sbjct: 95  ----------ALFARVLQFMETPQYLKKALIPMHPDLKYAGVLPPLDAPHHLRSTEWGPY 144

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQ----VLDPGVRVTVAMGTNRNLDADSPRQV 237
           REGV     +    G+ +DVGL+K   + Q        GVR+T++MG          +QV
Sbjct: 145 REGVV--RRSAKGEGSFIDVGLDKDAHIPQASLMAARQGVRLTLSMGEQPLPATVQGQQV 202

Query: 238 V-----PPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
           +      P+ P+E  G+YWGY  R AP++ ++ + C + GGY   +GTSE G+   +  L
Sbjct: 203 LRGQLALPTDPRERGGLYWGYITRIAPSLQAMLEQCPFYGGYSLTVGTSERGERTPACQL 262

Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
            L  FRHLL+ FGG  GLE                    Y NTC  QGSRTIR+EEAI I
Sbjct: 263 DLGAFRHLLVVFGGPQGLE--------------------YLNTCFDQGSRTIRSEEAILI 302

Query: 352 SLQYFQEPISRA 363
           SL + Q  ++ A
Sbjct: 303 SLAFLQPAVTAA 314


>gi|21428350|gb|AAM49835.1| GM02932p [Drosophila melanogaster]
          Length = 485

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 136/311 (43%), Positives = 186/311 (59%), Gaps = 5/311 (1%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK--SSSDNYSRSSAAN 119
           T+S+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+   +++    RS  A+
Sbjct: 73  TLSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDVGIATARETKRSYEAD 132

Query: 120 RSNRSDEN-ESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
               S     S +  L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR+   
Sbjct: 133 GEGSSTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSK 192

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
             +REGV   + A     +  +VGL   V+VD+ ++PGVRVTV M             +V
Sbjct: 193 FRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLV 251

Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
            P +P+ E+G+YWGY+VR A ++S +F    Y  GYD  +GTS+ G  V+        F 
Sbjct: 252 SPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYNFN 311

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           H+LI FGGL GLE ++  D+ L   +   +F  Y N  P QGSRTIRTEEA+ I+L  FQ
Sbjct: 312 HMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAAFQ 371

Query: 358 EPISRALRRVE 368
           E +   +  VE
Sbjct: 372 EKLQPQVADVE 382


>gi|290491270|gb|ADD31614.1| GM22547p [Drosophila melanogaster]
          Length = 503

 Score =  248 bits (633), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 136/311 (43%), Positives = 186/311 (59%), Gaps = 5/311 (1%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK--SSSDNYSRSSAAN 119
           T+S+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+   +++    RS  A+
Sbjct: 91  TLSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDVGIATARETKRSYEAD 150

Query: 120 RSNRSDEN-ESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
               S     S +  L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR+   
Sbjct: 151 GEGSSTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSK 210

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
             +REGV   + A     +  +VGL   V+VD+ ++PGVRVTV M             +V
Sbjct: 211 FRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLV 269

Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
            P +P+ E+G+YWGY+VR A ++S +F    Y  GYD  +GTS+ G  V+        F 
Sbjct: 270 SPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYNFN 329

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           H+LI FGGL GLE ++  D+ L   +   +F  Y N  P QGSRTIRTEEA+ I+L  FQ
Sbjct: 330 HMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAAFQ 389

Query: 358 EPISRALRRVE 368
           E +   +  VE
Sbjct: 390 EKLQPQVADVE 400


>gi|219122472|ref|XP_002181568.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406844|gb|EEC46782.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 330

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 130/319 (40%), Positives = 194/319 (60%), Gaps = 21/319 (6%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+PGS++ N Q+ EL T++ GQ+ARA TI+ +DE++VFD+K + +         RS
Sbjct: 14  TVSIAIPGSVVSNCQTRELRTQMVGQLARAATIYHVDEIIVFDDKLAKEMKPDRGYYQRS 73

Query: 122 N----RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
           N    RSD +E    F+ R+ QY E PQYLR+  F MH  L++ G+L P+DAPHH+R ++
Sbjct: 74  NHHGGRSDPHE----FMARVFQYCECPQYLRRDFFPMHGDLQYAGLLAPMDAPHHVRVND 129

Query: 178 WAPFREGVTLKENAPNSVGTLVDVGL-NKHVVVDQVLDPGVRVTVAMGTNRNLDADSP-- 234
            A FREG+ L++ +  +  +LV  G+  + V +D  L PG+R TV +    + +      
Sbjct: 130 RARFREGIVLEKTSSTNGNSLVHCGIRGRPVEIDVKLTPGIRCTVQLDPKASYETGGKPN 189

Query: 235 -----RQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI-VN 287
                 +VV PS P++  G YWGY  R A +I +VF  C + G YD  +GTSE G   ++
Sbjct: 190 SIIRGGKVVSPSAPRKFDGTYWGYTTRLASSIKAVFDECPF-GVYDLKVGTSERGSTSLD 248

Query: 288 SSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR--KNAREVFHSYFNTCPHQGSRTIRT 345
                LP+++H LI FGG+AG+EE ++ D+ L      +R++F  + N CP QGSRTIRT
Sbjct: 249 DGKFRLPSYQHALIVFGGVAGIEECVDADESLSLPGSQSRKLFDLWVNICPFQGSRTIRT 308

Query: 346 EEAIFISLQYFQEPISRAL 364
           EEA+ I+L      +S A+
Sbjct: 309 EEAVLIALAKLSPLLSTAV 327


>gi|195379965|ref|XP_002048741.1| GJ21159 [Drosophila virilis]
 gi|194143538|gb|EDW59934.1| GJ21159 [Drosophila virilis]
          Length = 483

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 137/307 (44%), Positives = 182/307 (59%), Gaps = 13/307 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+       + +    RS
Sbjct: 71  TVSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDV----GVATARETKRS 126

Query: 122 NRSDENESGAAF-------LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
              D   +G A        L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR
Sbjct: 127 YEQDAEGNGTATVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLR 186

Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
           +     +REGV   + A +   +  +VGL   V++D+ LDPGVRVTV M           
Sbjct: 187 QQSKFQYREGVVCDKQAKDG-HSYANVGLLNDVLIDKALDPGVRVTVKMDPPNETSRKQR 245

Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
             +V P +P+ E+G+YWGY+VR A ++S +F    Y GGYD  IGTS+ G  V+      
Sbjct: 246 GTLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYAGGYDVTIGTSDRGASVHDVPSKS 305

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             + H+LI FGGL GLEE++  D+ L       +F  Y N  P QGSRTIRTEEA+ I+L
Sbjct: 306 IEYNHILIVFGGLQGLEEALANDEKLTVDEPELLFDHYVNVLPKQGSRTIRTEEALLIAL 365

Query: 354 QYFQEPI 360
              QE +
Sbjct: 366 AALQEKL 372


>gi|195028012|ref|XP_001986876.1| GH21612 [Drosophila grimshawi]
 gi|193902876|gb|EDW01743.1| GH21612 [Drosophila grimshawi]
          Length = 490

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 136/305 (44%), Positives = 183/305 (60%), Gaps = 9/305 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+AVPGSI++N QS EL   +AGQIARA  IFR+ EV+VFD+   +   +R +  +  
Sbjct: 69  TVSIAVPGSILENAQSAELRAYVAGQIARAACIFRVHEVIVFDDVGVAT--ARETKRSYE 126

Query: 122 NRSDENESGAAF-----LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
             +D   +G        L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR+ 
Sbjct: 127 QDADGGTTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQ 186

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
               +REGV   + A N   +  +VGL   V+VD+ L+PGVRVTV M    +        
Sbjct: 187 SKFRYREGVVCDKQAKNG-HSYANVGLLNDVLVDKALEPGVRVTVKMEPTSDASRKQRGT 245

Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
           +V P +P+ E+G+YWGY+VR A ++S +F    YKGGYD  +GTS+ G  V+        
Sbjct: 246 LVSPEEPRRETGVYWGYQVRIAHSLSEIFTRSPYKGGYDVTVGTSDRGTSVHDVPSQSIQ 305

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           + H+LI FGGL GLEE++  D  L       +F  Y N  P QGSRTIRTEEA+ I+L  
Sbjct: 306 YNHMLIVFGGLQGLEEALANDSKLTVDEPELLFDHYVNVLPKQGSRTIRTEEALLIALAA 365

Query: 356 FQEPI 360
            QE +
Sbjct: 366 LQEKL 370


>gi|453080110|gb|EMF08162.1| DUF171-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 347

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 138/314 (43%), Positives = 186/314 (59%), Gaps = 24/314 (7%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSII N  + +  T LAGQIARA  +F +DEVVVFD+  +            +
Sbjct: 37  TLSLALPGSIIANALTHDQKTSLAGQIARACAVFCVDEVVVFDDGQAETRAPEH--GGYT 94

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +D N     FL  +L YLETP  LRKALF MH  LR  G LP LD PHHLR  EW  +
Sbjct: 95  AFADPN----FFLFHVLTYLETPPNLRKALFPMHPDLRTAGALPSLDMPHHLRAEEWCEY 150

Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG-TNRNLDADSPR 235
           REG+T      K   P    T VD GL++ V +   +DP  RVTV +G T+ +    S  
Sbjct: 151 REGITTHSSSGKGKMPQ---TFVDCGLSQQVAIPGAIDPNTRVTVKLGHTSSDGYTISGL 207

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDL 291
            V P +  +E+G YWGY VR A ++S++F  C + GGYD  +GTSE G     I++ S  
Sbjct: 208 PVSPETPREEAGYYWGYSVRQASSLSAIFTECPFDGGYDVSVGTSERGKPLKSIIDQSSP 267

Query: 292 TL--PTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCPHQGSRTIRTE 346
           T   PT++HL++ FGG+AGLE +++ D  L+      A EVF S+ N  P+QGSRTIRTE
Sbjct: 268 TYVEPTWKHLIVVFGGVAGLEAALKADSELQSAGVSQAEEVFDSWVNLVPNQGSRTIRTE 327

Query: 347 EAIFISLQYFQEPI 360
           EA+++ L   +E +
Sbjct: 328 EAVWVGLSGLREAV 341


>gi|167517265|ref|XP_001742973.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778072|gb|EDQ91687.1| predicted protein [Monosiga brevicollis MX1]
          Length = 289

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 126/291 (43%), Positives = 175/291 (60%), Gaps = 9/291 (3%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+PGS++ N QS EL   L GQIARA  IFR DE+V+F +K +    +R +    S
Sbjct: 1   TVSLAIPGSLVANAQSAELRAYLVGQIARAAVIFRADEIVIFKDKCTQPGLARGTTQGHS 60

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               E E    F+ R+L+YL+ PQYLRK LF +H  L+F G+LPPLDAPHH R  + + F
Sbjct: 61  KTMKEPE---LFMARVLEYLDCPQYLRKHLFPIHRDLKFAGLLPPLDAPHHTRSSDISVF 117

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGT----NRNLDADSPRQV 237
           REGVTL+E      GTLV+VGL K  ++ +VL P +RVT+ +       + L      + 
Sbjct: 118 REGVTLEERPRKGTGTLVEVGLRKAAIISKVLPPKMRVTIKLNEAQLHKQKLKTLYRGEA 177

Query: 238 VPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL-TLPT 295
           V P++P+++  +YWGY+VR A +I  VFK   Y GGYD  IG S+ G  +N   L     
Sbjct: 178 VAPTEPRQAMDLYWGYQVRLAEDIEQVFKQSPYPGGYDLTIGISDKGSPLNKLHLHKHAA 237

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
           + HLL+ FGG   LE  +  D  L+ ++  E+F +Y N     GSR++RTE
Sbjct: 238 YNHLLVVFGGTVSLEAIVAADSELEARDPTELFDAYLNPLSGHGSRSLRTE 288


>gi|194754479|ref|XP_001959522.1| GF12917 [Drosophila ananassae]
 gi|190620820|gb|EDV36344.1| GF12917 [Drosophila ananassae]
          Length = 485

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 135/315 (42%), Positives = 186/315 (59%), Gaps = 13/315 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+       + +    RS
Sbjct: 71  TLSIAVPGSILENAQSAELRAYVAGQIARAACIFRVNEVIVFDDV----GIATARETKRS 126

Query: 122 NRSDENESGAAF-------LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
              DEN  G          L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR
Sbjct: 127 YEQDENGGGTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLR 186

Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
           +     +REGV   + A     +  +VGL   V+VD+ ++PGVRVTV M           
Sbjct: 187 QQSKFRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAVEPGVRVTVKMDPPSETTRKQR 245

Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
             +V P +P+ E+G+YWGY+VR A ++S +F    Y+ GYD  +GTS+ G  V+      
Sbjct: 246 GSLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYETGYDVTLGTSDRGTNVHEVPSRS 305

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            +++H+LI FGGL GLE ++  D+ L   +   +F  Y N  P QGSRTIRTEEA+ I+L
Sbjct: 306 YSYKHMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIAL 365

Query: 354 QYFQEPISRALRRVE 368
              QE +   +  VE
Sbjct: 366 AALQEKLQPEVADVE 380


>gi|452987275|gb|EME87031.1| hypothetical protein MYCFIDRAFT_187007 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 327

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 178/308 (57%), Gaps = 30/308 (9%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS------DNYSRS 115
           T+S+A+PGSII N  + +  T LAGQIARA  +F IDE+VVFD+  +       D Y+  
Sbjct: 21  TLSIALPGSIIANAVTPDQKTSLAGQIARACAVFCIDEIVVFDDGQAETRPPERDGYTAF 80

Query: 116 SAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRK 175
           +  N             FL  +L YLETP  LRKALF MH  LR  G LP LD PHHLR 
Sbjct: 81  ADPNF------------FLYHVLTYLETPPNLRKALFPMHPDLRTAGALPSLDMPHHLRS 128

Query: 176 HEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
            EW  FREGV    +   S GTLVD GL + V +   LD   RVTV + ++   D     
Sbjct: 129 DEWCEFREGVA--TSPARSGGTLVDCGLPQKVSIPVELDSNTRVTVQLESDVAQDGLFMG 186

Query: 236 QVVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV------NS 288
             V P  P+ES G YWGY VR A ++ SVF  CS+ GGYD  IGTSE G  +      +S
Sbjct: 187 TAVSPEMPRESAGYYWGYNVRQAASLGSVFTECSFDGGYDVSIGTSERGKPLSAIIQNDS 246

Query: 289 SDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCPHQGSRTIRT 345
            D   PT+ HLL+ FGG+AGLE +++ D  L+    + A ++F  + N  P QGSRTIRT
Sbjct: 247 PDHVEPTWNHLLVVFGGVAGLEAALKADKELQAAGVQKAADLFDCWVNLVPGQGSRTIRT 306

Query: 346 EEAIFISL 353
           EEA+++ L
Sbjct: 307 EEAVWVGL 314


>gi|406864343|gb|EKD17388.1| deoxyribose-phosphate aldolase 2 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 387

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 142/317 (44%), Positives = 195/317 (61%), Gaps = 27/317 (8%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD-----NKSSSDNYSRS- 115
           TVSVA+PGSII N Q+ +  T LAG IARA+ +F IDE+ +F      NKS+  +++R+ 
Sbjct: 60  TVSVALPGSIIANAQTHDQRTSLAGHIARALAVFCIDEIAIFSDGHGLNKSNKRHHNRTH 119

Query: 116 SAANRSNRSDENESGAA----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
           S A++++ + +  +G +    FLV LL YLETP +LRK LF +H +LR  G L  LD PH
Sbjct: 120 SFAHQADPAQDEYTGTSDPDHFLVHLLSYLETPPHLRKHLFPLHGNLRTAGTLSSLDLPH 179

Query: 172 HLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTV-----AMGTN 226
           HLR  EW P+REGVTL     +  GT V+ GL   V V + +    RVT+     A   N
Sbjct: 180 HLRSDEWCPYREGVTLP--GADEHGTYVEAGLRIPVTVKEQIPEKTRVTLKFSEGAEAAN 237

Query: 227 RNLDADS-PRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD 284
           ++L+A +   + V P+ P+ E+G YWG+ VR A ++S VF  C Y+GGYD  IG SE G 
Sbjct: 238 KDLNAPTIDAEPVNPNDPREEAGYYWGFNVRKAGSLSDVFTECPYEGGYDVTIGLSERGI 297

Query: 285 IV-----NSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCP 336
            V      S    L  F+HLLI  GG+AGLE +++ D  L++     A +VF  + N CP
Sbjct: 298 DVEKLYSGSEGQKLGKFKHLLIVLGGVAGLEVAVKNDKELQKLGVMKAEDVFDRWVNVCP 357

Query: 337 HQGSRTIRTEEAIFISL 353
           +QGSRTIRTEEA++I L
Sbjct: 358 NQGSRTIRTEEAVWIGL 374


>gi|195582066|ref|XP_002080849.1| GD10707 [Drosophila simulans]
 gi|194192858|gb|EDX06434.1| GD10707 [Drosophila simulans]
          Length = 482

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 141/345 (40%), Positives = 199/345 (57%), Gaps = 11/345 (3%)

Query: 28  ELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQ 87
           EL  A    S  +    + + +D+LN        T+S+AVPGSI++N QS EL   +AGQ
Sbjct: 42  ELQQAKEAESQAEKAANEVQAKDKLNPS------TLSIAVPGSILENAQSNELRAYVAGQ 95

Query: 88  IARAVTIFRIDEVVVFDNK--SSSDNYSRSSAANRSNRSDEN-ESGAAFLVRLLQYLETP 144
           IARA  IFR++EV+VFD+   +++    RS  A+    S     S +  L R+LQYLE P
Sbjct: 96  IARAACIFRVNEVIVFDDVGIATARETKRSYEADAEGSSTGTVRSSSLQLARILQYLECP 155

Query: 145 QYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLN 204
           QYLRK  F +H  L++ G+L PLD PHHLR+     +REGV   + A     +  +VGL 
Sbjct: 156 QYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSKFRYREGVICDKKAKEG-HSYANVGLL 214

Query: 205 KHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSV 263
             V+VD+ ++PGVRVTV M             +V P +P+ E+G+YWGY+VR A ++S +
Sbjct: 215 NDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLVSPDEPRRETGVYWGYQVRIAHSMSEI 274

Query: 264 FKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN 323
           F    Y  GYD  +GTS+ G  V+       +F+H+LI FGGL GLE ++  D+ L   +
Sbjct: 275 FTKSPYANGYDVTVGTSDRGTNVHEVPNRSYSFKHMLIVFGGLQGLESALANDEKLTVDD 334

Query: 324 AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVE 368
              +F  Y N  P QGSRTIRT+EA+ I+L   QE +   +  VE
Sbjct: 335 PELLFDHYLNVLPRQGSRTIRTDEALLIALAALQEKLQPQVADVE 379


>gi|195475216|ref|XP_002089880.1| GE19326 [Drosophila yakuba]
 gi|194175981|gb|EDW89592.1| GE19326 [Drosophila yakuba]
          Length = 482

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 142/349 (40%), Positives = 197/349 (56%), Gaps = 19/349 (5%)

Query: 28  ELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQ 87
           EL LA    S  +    + + ++Q N        T+S+AVPGSI++N QS EL   +AGQ
Sbjct: 42  ELQLAKEAESQAEQVANEVQAKEQANPS------TLSIAVPGSILENAQSNELRAYVAGQ 95

Query: 88  IARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAF-------LVRLLQY 140
           IARA  IFR++EV+VFD+       + +    RS  +D   SG          L R+LQY
Sbjct: 96  IARAACIFRVNEVIVFDDV----GIATARETKRSYEADAEGSGTGTVRSSSLQLARILQY 151

Query: 141 LETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVD 200
           LE PQYLRK  F +H  L++ G+L PLD PHHLR+     +REGV   + A     +  +
Sbjct: 152 LECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSKFRYREGVICDKKAKEG-HSYAN 210

Query: 201 VGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPN 259
           VGL   V+VD+ ++PGVRVTV M             +V P +P+ E+G+YWGY+VR A +
Sbjct: 211 VGLLNDVLVDKAIEPGVRVTVKMEHQSENCRKQRGTLVSPDEPRRETGVYWGYQVRIAHS 270

Query: 260 ISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGL 319
           +S +F    Y  GYD  +GTS+ G  V+       +F+H+LI FGGL GLE ++  D+ L
Sbjct: 271 LSEIFTKSPYASGYDVTLGTSDRGTNVHEVPNRSYSFKHMLIVFGGLQGLESALANDEKL 330

Query: 320 KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVE 368
              +   +F  Y N  P QGSRTIRTEEA+ I+L   QE +   +  VE
Sbjct: 331 TVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAALQEKLQPQVADVE 379


>gi|194858153|ref|XP_001969113.1| GG24128 [Drosophila erecta]
 gi|190660980|gb|EDV58172.1| GG24128 [Drosophila erecta]
          Length = 482

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 135/315 (42%), Positives = 185/315 (58%), Gaps = 13/315 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+       + +    RS
Sbjct: 70  TLSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDV----GIATARETKRS 125

Query: 122 NRSDENESGAAF-------LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
             +D   SG          L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR
Sbjct: 126 YEADAEGSGTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLR 185

Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
           +     +REGV   + A     +  +VGL   V+VD+ ++PGVRVTV M           
Sbjct: 186 QQSKFRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAIEPGVRVTVKMEPQSENCRKQR 244

Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
             +V P +P+ E+G+YWGY+VR A ++S +F    Y  GYD  +GTS+ G  V+      
Sbjct: 245 GTLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYASGYDVTLGTSDRGTNVHEVPNRS 304

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            +F+H+LI FGGL GLE ++  D+ L   +   +F  Y N  P QGSRTIRTEEA+ I+L
Sbjct: 305 YSFKHMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIAL 364

Query: 354 QYFQEPISRALRRVE 368
              QE +   +  VE
Sbjct: 365 AALQEKLQPQVAEVE 379


>gi|195430968|ref|XP_002063520.1| GK21364 [Drosophila willistoni]
 gi|194159605|gb|EDW74506.1| GK21364 [Drosophila willistoni]
          Length = 516

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 134/310 (43%), Positives = 183/310 (59%), Gaps = 11/310 (3%)

Query: 56  APIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSS 109
           A +   TVS+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+      + + 
Sbjct: 68  ATVNPSTVSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDVGVATARETK 127

Query: 110 DNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDA 169
            +Y   +  N S       S +  L R+LQYLE PQYLRK  F +H  L++ G+L PLD 
Sbjct: 128 KSYEEDAEGNASGTV---RSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDT 184

Query: 170 PHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNL 229
           PHHLR+     +REGV   + A     +  +VGL   V+VD+ ++PGVRVTV M      
Sbjct: 185 PHHLRQQSKFRYREGVICDKKAKEG-QSYANVGLLNDVLVDKAIEPGVRVTVKMDQPNEA 243

Query: 230 DADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS 288
                  +V P +P+ E+G+YWGY+VR A ++S +F    Y  GYD  +GTS+ G  V+ 
Sbjct: 244 QRKQRGTLVSPDEPRRETGVYWGYQVRIAHSLSEIFTKSPYDSGYDVTLGTSDRGKNVHE 303

Query: 289 SDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
                 ++ HLLI FGGL GLE ++  D+ L   +   +F  Y N  P QGSRTIRTEEA
Sbjct: 304 VPAKSFSYNHLLIVFGGLQGLESALSNDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEA 363

Query: 349 IFISLQYFQE 358
           + I+L   QE
Sbjct: 364 LLIALAALQE 373


>gi|154323528|ref|XP_001561078.1| hypothetical protein BC1G_00163 [Botryotinia fuckeliana B05.10]
          Length = 385

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 136/316 (43%), Positives = 182/316 (57%), Gaps = 26/316 (8%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGSII N QS +L T +AGQIARA+ +F +DEV+VFD+ ++           +S
Sbjct: 61  TVSVALPGSIIANAQSHDLKTSMAGQIARALAVFCVDEVIVFDDGNARPAKKPKHPPTQS 120

Query: 122 NRSD-----ENESGAA-----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
           + S      E+E  A      FL  LL +LETP  LRK LF +H +LR  G LP +D PH
Sbjct: 121 HNSQQKPPLEDEYTAKSDPDHFLTHLLSFLETPPNLRKHLFPIHENLRTAGALPSIDLPH 180

Query: 172 HLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDA 231
           HLR  EW  +REG TL     +  GT V+ GL   V V + +    RVT+         A
Sbjct: 181 HLRADEWCMYREGTTLP--GADEHGTFVEAGLRIPVTVVEQIPANNRVTLKFSVEGEKAA 238

Query: 232 DS------PRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG- 283
                   P + V P +P+ E G YWG+ VR A  +S VF  C Y GGYD  I TSE G 
Sbjct: 239 KDKSCEIIPAEAVSPEEPREEGGYYWGWNVRRAGCLSDVFTECGYDGGYDMTIATSERGV 298

Query: 284 ---DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN---AREVFHSYFNTCPH 337
              D+ +  +  +  F+HLL+ FGG+AGLE +++ D+ L++     A++VF  + N CP 
Sbjct: 299 DVQDLYDDEEQKVGKFKHLLVVFGGVAGLEVAVKNDEELQKLGVVEAKDVFDRWINVCPG 358

Query: 338 QGSRTIRTEEAIFISL 353
           QGSRTIRTEEA++I L
Sbjct: 359 QGSRTIRTEEAVWIGL 374


>gi|328863664|gb|EGG12763.1| hypothetical protein MELLADRAFT_32413 [Melampsora larici-populina
           98AG31]
          Length = 316

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 136/308 (44%), Positives = 181/308 (58%), Gaps = 22/308 (7%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD------NYSRS 115
           T+S+A+P SI++N Q+ EL T L GQIARA  IF +DE+V+F+    +D      NY R 
Sbjct: 5   TISIALPSSIVNNAQTWELKTALVGQIARACAIFSVDEIVIFEETLPADSAIAPSNYVRG 64

Query: 116 SA--ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL 173
                +   +S+E    + F+ R+L+YLE PQYLRK+LF +H  LR  G+LPPLD PHH 
Sbjct: 65  KYRDVDEGQQSEEPFQPSQFMARILEYLECPQYLRKSLFPLHPDLRLAGLLPPLDLPHHF 124

Query: 174 RKHEWAPFREGVTL-KENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
           R+    P+REG  L +E A N    L     NK     Q   P V V V +     +  D
Sbjct: 125 RRDHQTPWREGCVLPREKADN---YLAGNKPNKRKKDGQ--SPTVWVDVGLAEPVEVKLD 179

Query: 233 SPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSY--KGGYDHLIGTSEHGDIVNSS 289
            P ++V P +P E SG YWGY VR A +IS VF    Y   GGYD  IGTSE G  +  +
Sbjct: 180 -PARIVSPRQPTETSGTYWGYSVRLATSISKVFTESPYLKTGGYDLTIGTSERGQNITEA 238

Query: 290 DLTLPTFRHLLIAFGGLAGLEESIEEDD----GLKRKNAREVFHSYFNTCPHQGSRTIRT 345
              + +F+HLL+AFGGL+GLE  I  D+     L  ++A  +F  + NT PHQGSRT+RT
Sbjct: 239 LQDMGSFKHLLLAFGGLSGLELCIASDETLTPNLSAEDASLLFDHWVNTLPHQGSRTVRT 298

Query: 346 EEAIFISL 353
           EEA+ +SL
Sbjct: 299 EEALLVSL 306


>gi|398388217|ref|XP_003847570.1| hypothetical protein MYCGRDRAFT_15728, partial [Zymoseptoria
           tritici IPO323]
 gi|339467443|gb|EGP82546.1| hypothetical protein MYCGRDRAFT_15728 [Zymoseptoria tritici IPO323]
          Length = 313

 Score =  237 bits (605), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 136/307 (44%), Positives = 180/307 (58%), Gaps = 28/307 (9%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSII N  + +  T LAGQIARA  +F +DE+VVF++  +       S    +
Sbjct: 16  TLSIALPGSIIANAVTPDQKTSLAGQIARACAVFCVDEIVVFNDGQAETRAPEPS--GYT 73

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +D N     FL  +L YLETP  LRK+LF MH  LR  G LP LD PHHLR  EW  F
Sbjct: 74  AFTDPN----FFLYHVLTYLETPPNLRKSLFPMHPDLRTAGALPSLDMPHHLRSDEWCEF 129

Query: 182 REGVTLKENAPNSVG-TLVDVGLNKHVVVDQVLDPGVRVTV----AMGTNRNLDADSPRQ 236
           REG+T     P   G TLVD GL + V +   +DP  RVTV    ++ T   +  D+   
Sbjct: 130 REGIT---TGPGKRGTTLVDCGLGRKVSIPVEIDPSTRVTVQLPQSLPTGDTISGDA--- 183

Query: 237 VVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--- 292
            V P  P+E +G YWGY+VR AP++ S+F  C Y GGYD  IGTSE G  ++S   T   
Sbjct: 184 -VAPDTPREQAGYYWGYQVRQAPSLGSIFTECPYDGGYDISIGTSERGTSLDSLTDTSSP 242

Query: 293 ---LPTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCPHQGSRTIRTE 346
               PT+ HL++ FGG+AGLE +++ D  L+      A +VF  + N    QGSRTIRTE
Sbjct: 243 SHVAPTWNHLVVVFGGVAGLESALKADAELQAAGVTQAADVFDCWINLVSGQGSRTIRTE 302

Query: 347 EAIFISL 353
           EA+++ L
Sbjct: 303 EAVWVGL 309


>gi|426226131|ref|XP_004023663.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C9orf114
           homolog, partial [Ovis aries]
          Length = 350

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 135/306 (44%), Positives = 176/306 (57%), Gaps = 17/306 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 53  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFRGVG 112

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 113 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 168

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPP 240
           REGV +        G+ V+ G+ K V +D+ L+PG+RVTV +  N+  ++ + R +VV  
Sbjct: 169 REGVVVDRPTRPGQGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQNQLPESKTYRGKVVSS 228

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+    P FRH 
Sbjct: 229 QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--FPNFRHA 286

Query: 300 LIAFGGLAGLE--ESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           L+ FGGL GLE       D G     A           P Q  RTIRTEEAI ISL   Q
Sbjct: 287 LVVFGGLQGLEAVAGAXGDPGTLSPWAA-------TPSPTQAGRTIRTEEAILISLAALQ 339

Query: 358 EPISRA 363
             +++A
Sbjct: 340 PGLTQA 345


>gi|20129839|ref|NP_610543.1| CG12128, isoform A [Drosophila melanogaster]
 gi|7303814|gb|AAF58861.1| CG12128, isoform A [Drosophila melanogaster]
          Length = 485

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 135/311 (43%), Positives = 185/311 (59%), Gaps = 5/311 (1%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK--SSSDNYSRSSAAN 119
           T+S+AVPGSI++N QS EL   +AGQIARA  IFR++EV+VFD+   +++    RS  A+
Sbjct: 73  TLSIAVPGSILENAQSNELRAYVAGQIARAACIFRVNEVIVFDDVGIATARETKRSYEAD 132

Query: 120 RSNRSDEN-ESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
               S     S +  L R+LQYLE PQYLRK  F +H  L++ G+L PLD PHHLR+   
Sbjct: 133 GEGSSTGTVRSSSLQLARILQYLECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSK 192

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
             +REGV   + A     +  +VGL   V+VD+ ++PGVRVTV M             +V
Sbjct: 193 FRYREGVICDKKAKEG-HSYANVGLLNDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLV 251

Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
            P +P+ E+G+YWGY+VR A ++S +F    Y  GYD  +GTS+ G  V+        F 
Sbjct: 252 SPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYNFN 311

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           H+LI FGGL GLE ++  D+ L   +   +F  Y N  P QGSRTIRTEEA+ I+L   Q
Sbjct: 312 HMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAALQ 371

Query: 358 EPISRALRRVE 368
           E +   +  VE
Sbjct: 372 EKLQPQVADVE 382


>gi|71021553|ref|XP_761007.1| hypothetical protein UM04860.1 [Ustilago maydis 521]
 gi|46100927|gb|EAK86160.1| hypothetical protein UM04860.1 [Ustilago maydis 521]
          Length = 491

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 151/364 (41%), Positives = 203/364 (55%), Gaps = 83/364 (22%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD-----NKSSSDNYSRSS 116
           T+SVA+PGSI+ N QS EL +RLA  IAR   IF +DE+VVFD      ++ +D Y    
Sbjct: 123 TISVAIPGSIVLNAQSPELQSRLAAHIARTCAIFNVDEIVVFDEGQVRTEAGADLYQHKH 182

Query: 117 AANRSNR--------------SDENES------------GA------AFLVRLLQYLETP 144
             N+ NR              +DE E             GA       FL R+LQYLETP
Sbjct: 183 --NQQNRGTKRRFNQPQGNPNADEQEESTEQRDARAEAEGARGFDPHTFLARVLQYLETP 240

Query: 145 QYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL------KENAPNSVGTL 198
           QYLRKALF MH  LR  G++PPLD PHHLR  + + +REGVT+      +    NSV   
Sbjct: 241 QYLRKALFPMHRDLRLAGLMPPLDCPHHLRFEDQSEYREGVTVDPPHWARRGGQNSV--Y 298

Query: 199 VDVGLNKHVVVDQV-----LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGY 252
           V+VGL + V+  ++     ++ G RVTV+M     +DA    ++V P +P E  G YWGY
Sbjct: 299 VNVGL-RDVIEAKLPEGARVEAGTRVTVSM----PMDAYGAGEIVSPREPVEQLGWYWGY 353

Query: 253 KVRYAPNISSVFKNCSYK-GGYDHLIGTSEHGDIVNSSDLTLPT---------------- 295
            VR A ++S+V   C +  GGYD +IGTSE G  V+ +DLTL T                
Sbjct: 354 SVRLANSLSAVLTGCPFAGGGYDLVIGTSERG--VSLTDLTLATSNNTPVALASGETVQP 411

Query: 296 ----FRHLLIAFGGLAGLEESIEEDDGLK--RKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
               F H L+ FGGL+GLE ++E+D  +K  R  A+++F ++ N   +QGSRT+RTEEAI
Sbjct: 412 MSEQFNHALLVFGGLSGLEVAVEQDAAIKLDRHTAKDLFDAWINVVENQGSRTVRTEEAI 471

Query: 350 FISL 353
            I+L
Sbjct: 472 MIAL 475


>gi|443900037|dbj|GAC77364.1| uncharacterized conserved protein [Pseudozyma antarctica T-34]
          Length = 462

 Score =  234 bits (597), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 147/352 (41%), Positives = 190/352 (53%), Gaps = 68/352 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSDNYSRS 115
           TVSVA+PGSI+ N QS EL +RLA  IARA  IF +DE+VVFD          + +  R 
Sbjct: 105 TVSVAIPGSIVVNAQSPELQSRLAAHIARACAIFNVDEIVVFDEGDVRSIDPDAHSQQRP 164

Query: 116 SAANRSNRSDENESGAA---------------------FLVRLLQYLETPQYLRKALFSM 154
            A     R  E +S AA                     FL R+LQYLETPQYLRKALF M
Sbjct: 165 PARGSKRRWSEPQSAAANEGEGEEQQEESDNKGFDPQTFLARVLQYLETPQYLRKALFPM 224

Query: 155 HSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSV-----GTLVDVGLNKHVV- 208
           H  LR  G++PPLD PHHLR  +   +REGVT+  + P+       G  V+VGL   +  
Sbjct: 225 HRDLRLAGLMPPLDCPHHLRFEDECEYREGVTV--DPPHWARGRGQGVYVNVGLRDPIQA 282

Query: 209 -----VDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISS 262
                V   ++ G RVT+ M       A    ++V P +P E  G YWGY VR AP++S+
Sbjct: 283 TLPGGVPARVETGTRVTIKM----PYSAHEAGEIVSPREPVEKLGWYWGYTVRLAPSLSA 338

Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF-------------------RHLLIAF 303
           V  +C +   YD +IGTSE G  V+ +DL L T                    +H LI F
Sbjct: 339 VLTDCPFGSEYDLVIGTSERG--VSLTDLALATSQATPVATPEGQTVQPLAQPQHALIVF 396

Query: 304 GGLAGLEESIEEDDG--LKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           GGL+GLE ++E D G  L R NA E+F ++ N   +QGSRT+R+EEAI I+L
Sbjct: 397 GGLSGLELAVEADAGIQLDRNNAAELFDAWVNVVDNQGSRTVRSEEAIMIAL 448


>gi|322696041|gb|EFY87839.1| deoxyribose-phosphate aldolase [Metarhizium acridum CQMa 102]
          Length = 340

 Score =  234 bits (597), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 125/298 (41%), Positives = 179/298 (60%), Gaps = 13/298 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+P SI+ N  + +    + G+IARA+ +F +DEVVVFD+  +S    ++  A  +
Sbjct: 40  TVSVAIPTSILTNLATADQRMSIPGRIARALAVFAVDEVVVFDDSPASSRPRQTDTAGYT 99

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +D     A FL  +L +LE P ++RKALF +H +LR   +LP LD PHH    EW P+
Sbjct: 100 GDTDP----AHFLTHILSFLEAPPFMRKALFPLHPNLRLTALLPSLDMPHHPNPKEWIPY 155

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGVT+        GTLV+VGL++ V +++ + P  R+T+    ++   +  P+ V P +
Sbjct: 156 REGVTVPGKTSTGSGTLVEVGLDEPVEIEEDIPPKTRLTLLFPDDQ---SQYPQCVDPQA 212

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP-TFRHLL 300
              E G YWGY VR  P++SSVF    Y+ GYD  IGTSE G  ++ +  + P  F+HLL
Sbjct: 213 PRTEGGYYWGYTVRKCPSLSSVFTESPYEDGYDFSIGTSERGTPISKAFSSKPLNFKHLL 272

Query: 301 IAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           I FGG  GLE +   D+ L + +      RE+F  + N  P QGSRTIRTEEA+FI+L
Sbjct: 273 IVFGGPRGLEFASMNDEELVKMDIQGGRTRELFDQWVNVLPGQGSRTIRTEEALFIAL 330


>gi|221484612|gb|EEE22906.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 833

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 139/347 (40%), Positives = 189/347 (54%), Gaps = 60/347 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+P SIIDN Q+ EL   L GQIAR +T+F +DE++V+++ ++        A +R 
Sbjct: 484 TLSIALPASIIDNAQTAELRAALVGQIARTLTVFGVDEIIVYEDVAA--------AISRG 535

Query: 122 NRSDENESGA-AFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
              DE  S A  F VR L+YLETPQ+LRK+LF +H  LRF G+  PLDAPHHLR++EW P
Sbjct: 536 ANKDEGHSPALEFFVRNLRYLETPQFLRKSLFPIHPDLRFAGLQNPLDAPHHLRRNEWLP 595

Query: 181 FREGVTL-----KENAP--------------------------NSVGTLVDVGLNKHVVV 209
           +REGV +     +E  P                          N  G  VD GL  HV V
Sbjct: 596 YREGVVIASSKTREETPQGKSESRAPAKGEEVKLTAAEKKRLKNKKGAWVDCGLPAHVWV 655

Query: 210 -DQVLDPGVRVTVAMGTN-RNL--------DADSP----RQVVPPSKPKESGMYWGYKVR 255
            D  L  G+RVT+ +  + R L        DA+ P      V P   P ++G+YWGY+VR
Sbjct: 656 PDTRLQDGMRVTIRLDPSVRQLQRQPRERGDAEPPLMKGAAVSPDEPPVKAGLYWGYRVR 715

Query: 256 YAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLP-TFRHLLIAFGGLAGLEES 312
            A +   VF NC +   G YD  +GTSE G  V   D   P  ++H+L+ FGGL GLE  
Sbjct: 716 IAQHFQDVFSNCPFSSDGRYDLTVGTSERGTCVG-RDFAFPMNYKHMLLVFGGLQGLEAV 774

Query: 313 I--EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           +   + +    ++   +F  Y NTCP Q SRTIR EEA+ I+L   +
Sbjct: 775 LLDRQSNCAPCRDPSTLFDMYLNTCPFQRSRTIRAEEAVPITLALLR 821


>gi|237839871|ref|XP_002369233.1| hypothetical protein TGME49_085950 [Toxoplasma gondii ME49]
 gi|211966897|gb|EEB02093.1| hypothetical protein TGME49_085950 [Toxoplasma gondii ME49]
          Length = 833

 Score =  233 bits (595), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 139/347 (40%), Positives = 189/347 (54%), Gaps = 60/347 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+P SIIDN Q+ EL   L GQIAR +T+F +DE++V+++ ++        A +R 
Sbjct: 484 TLSIALPASIIDNAQTAELRAALVGQIARTLTVFGVDEIIVYEDVAA--------AISRG 535

Query: 122 NRSDENESGA-AFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
              DE  S A  F VR L+YLETPQ+LRK+LF +H  LRF G+  PLDAPHHLR++EW P
Sbjct: 536 ANKDEGHSPALEFFVRNLRYLETPQFLRKSLFPIHPDLRFAGLQNPLDAPHHLRRNEWLP 595

Query: 181 FREGVTL-----KENAP--------------------------NSVGTLVDVGLNKHVVV 209
           +REGV +     +E  P                          N  G  VD GL  HV V
Sbjct: 596 YREGVVIASSKTREETPQGKSESRAPAKGEEVKLTAAEKKRLKNKKGAWVDCGLPAHVWV 655

Query: 210 -DQVLDPGVRVTVAMGTN-RNL--------DADSP----RQVVPPSKPKESGMYWGYKVR 255
            D  L  G+RVT+ +  + R L        DA+ P      V P   P ++G+YWGY+VR
Sbjct: 656 PDTRLQDGMRVTIRLDPSVRQLQRQPRERGDAEPPLMKGAAVSPDEPPVKAGLYWGYRVR 715

Query: 256 YAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLP-TFRHLLIAFGGLAGLEES 312
            A +   VF NC +   G YD  +GTSE G  V   D   P  ++H+L+ FGGL GLE  
Sbjct: 716 IAQHFQDVFSNCPFSSDGRYDLTVGTSERGTCVG-RDFAFPMNYKHMLLVFGGLQGLEAV 774

Query: 313 I--EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           +   + +    ++   +F  Y NTCP Q SRTIR EEA+ I+L   +
Sbjct: 775 LLDRQSNCAPCRDPSTLFDMYLNTCPFQRSRTIRAEEAVPITLALLR 821


>gi|221504806|gb|EEE30471.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 833

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 139/347 (40%), Positives = 189/347 (54%), Gaps = 60/347 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+P SIIDN Q+ EL   L GQIAR +T+F +DE++V+++ ++        A +R 
Sbjct: 484 TLSIALPASIIDNAQTAELRAALVGQIARTLTVFGVDEIIVYEDVAA--------AISRG 535

Query: 122 NRSDENESGA-AFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
              DE  S A  F VR L+YLETPQ+LRK+LF +H  LRF G+  PLDAPHHLR++EW P
Sbjct: 536 ANKDEGHSPALEFFVRNLRYLETPQFLRKSLFPIHPDLRFAGLQNPLDAPHHLRRNEWLP 595

Query: 181 FREGVTL-----KENAP--------------------------NSVGTLVDVGLNKHVVV 209
           +REGV +     +E  P                          N  G  VD GL  HV V
Sbjct: 596 YREGVVIASSKTREETPQGKSESRAPAKGEEVKLTAAEKKRLKNKKGAWVDCGLPAHVWV 655

Query: 210 -DQVLDPGVRVTVAMGTN-RNL--------DADSP----RQVVPPSKPKESGMYWGYKVR 255
            D  L  G+RVT+ +  + R L        DA+ P      V P   P ++G+YWGY+VR
Sbjct: 656 PDTRLQDGMRVTIRLDPSVRQLQRQPRERGDAEPPLMKGAAVSPDEPPVKAGLYWGYRVR 715

Query: 256 YAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLPT-FRHLLIAFGGLAGLEES 312
            A +   VF NC +   G YD  +GTSE G  V   D   P  ++H+L+ FGGL GLE  
Sbjct: 716 IAQHFQDVFSNCPFSSDGRYDLTVGTSERGTCVG-RDFAFPVNYKHMLLVFGGLQGLEAV 774

Query: 313 I--EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           +   + +    ++   +F  Y NTCP Q SRTIR EEA+ I+L   +
Sbjct: 775 LLDRQSNCAPCRDPSTLFDMYLNTCPFQRSRTIRAEEAVPITLALLR 821


>gi|378731584|gb|EHY58043.1| hypothetical protein HMPREF1120_06061 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 373

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 136/309 (44%), Positives = 183/309 (59%), Gaps = 27/309 (8%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSII N +S E  T LAGQIARA+ +F +DE+V+FD++S      R + A   
Sbjct: 54  TLSVALPGSIIANAKSHEQKTYLAGQIARALAVFCVDEIVIFDDESPDVQRRRPAIAEHD 113

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +  +     FL  LL YLETP +LRK LF MH +LR  G LP LD PHHLR +EW  +
Sbjct: 114 YTAFSHPD--HFLAHLLSYLETPPHLRKTLFPMHPNLRTAGTLPSLDMPHHLRANEWCEY 171

Query: 182 REGVTLKENAP---NSVGTLVDVGLNK-HVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
           REGV+L   +     S  TLVDVGL++   + D  +  G RVTV +    +  A +    
Sbjct: 172 REGVSLGPTSSPRGTSTATLVDVGLSEPREIPDSDIPRGTRVTVELEEPPSKLATA---- 227

Query: 238 VPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----------DIV 286
           V P  P+E+ G YWGY VR   ++S+VF  C + GGYD  IGTSE G          +  
Sbjct: 228 VSPDTPREAKGYYWGYAVRQCSSLSNVFTECPFDGGYDISIGTSERGVPLSEVFARPEFA 287

Query: 287 NSSDLTLPTFRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSR 341
           ++    LP F+H+LI FGG+AGLE + + D      G+  KN +++F  + N  P QGSR
Sbjct: 288 DADSKPLP-FQHVLIVFGGVAGLETAAKNDPQLQEMGITPKNVKDLFDYWVNVLPGQGSR 346

Query: 342 TIRTEEAIF 350
           TIRTEEA++
Sbjct: 347 TIRTEEAVW 355


>gi|313212170|emb|CBY17791.1| unnamed protein product [Oikopleura dioica]
          Length = 458

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 124/301 (41%), Positives = 181/301 (60%), Gaps = 17/301 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSI+DN QS EL T LAGQIARA  +F +DE+V+FD    +   +    +   
Sbjct: 158 TLSIALPGSILDNAQSFELRTYLAGQIARACVVFNVDEIVIFDESGEAAKDTTGEFSGVK 217

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            + + N++    L R+LQYLE PQYLRK LF +H  L+  G+L PLD PHHLR +E++ +
Sbjct: 218 KKGNPNQT----LGRILQYLECPQYLRKDLFPVHKDLKNAGLLNPLDTPHHLRANEFSKY 273

Query: 182 REGVTLKENAPN----SVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD--SPR 235
           REG+   ++ P+      G+ ++ G+   V V++      RVTV +   +  D    + +
Sbjct: 274 REGIV--QDRPSGKKKGKGSWINCGIEHDVRVEKEAPAFSRVTVKLDDEQREDGRFLTGK 331

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
            V      +ESG YWGY VR A ++  VF +  ++G YD +IGTSE G   N   +    
Sbjct: 332 LVTAAQVREESGTYWGYNVRLADSLGQVFMHSPWEGNYDTIIGTSERG--TNIKKMPPQK 389

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
            +H +I FGG+ GLE S E+D  LK    ++  ++F  + NTCP+QGSRTIRTEEAI ++
Sbjct: 390 AKHYMIVFGGVKGLEHSFEQDPELKENGIESVTDLFDMWLNTCPNQGSRTIRTEEAILVT 449

Query: 353 L 353
           L
Sbjct: 450 L 450


>gi|346465421|gb|AEO32555.1| hypothetical protein [Amblyomma maculatum]
          Length = 284

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 120/271 (44%), Positives = 163/271 (60%), Gaps = 4/271 (1%)

Query: 95  FRIDEVVVFDNKSSSDNYSRSSAANRS-NRSDENESGAAFLVRLLQYLETPQYLRKALFS 153
           F +DE+VV+D+  S+    + +A +       E+  G   L R+LQYLE PQYLRK LF 
Sbjct: 6   FVVDEIVVYDDDGSTVRGKKGAAVDGEFGGLGESGQGVVQLARVLQYLECPQYLRKHLFP 65

Query: 154 MHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVL 213
           +H  L+F G+L PLD PHHLR  E +P+REG+  +       G+ V+ GL+K V +D+ L
Sbjct: 66  LHRDLQFAGLLNPLDCPHHLRATEESPYREGIVARLPIKEGSGSYVNCGLSKEVKIDRCL 125

Query: 214 DPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGY 273
            PGVRVTV +             V P     E+G+YWGY+VR A ++ +V   C + GGY
Sbjct: 126 QPGVRVTVKLKEGTPGXXLRGTAVSPSCPRTEAGLYWGYEVRVAGSLGAVLAECPFPGGY 185

Query: 274 DHLIGTSEHGDIVNSSDLTLP-TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYF 332
           D  +GTSE G  V+  +L LP  F+H L+ FGGL GLE ++E D+ L   +   +F  Y 
Sbjct: 186 DITLGTSERGTPVD--ELELPHCFQHALVVFGGLKGLEAALEADEALDVDDPAYLFSHYL 243

Query: 333 NTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
           NTCP+QGSRTIRTEEAI ISL   +  I+ A
Sbjct: 244 NTCPNQGSRTIRTEEAILISLSTLRPKIAVA 274


>gi|388857119|emb|CCF49334.1| uncharacterized protein [Ustilago hordei]
          Length = 484

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 145/362 (40%), Positives = 193/362 (53%), Gaps = 76/362 (20%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----KSSSDNYS-RSS 116
           T+SVA+PGSI+ N QS EL +RLA  IARA  IF +DE+VVF+       +SD Y  R +
Sbjct: 114 TISVAIPGSIVLNAQSPELQSRLAAHIARACAIFNVDEIVVFNEGELRSETSDPYQHRHN 173

Query: 117 AANRSNRSDENESGAA----------------------------FLVRLLQYLETPQYLR 148
             +R  +   NE   A                            FL R+LQYLETPQYLR
Sbjct: 174 QQHRGTKRRYNEPQNAKTQQGEGEVQEGGEGEEEGKGGGFEPHTFLARVLQYLETPQYLR 233

Query: 149 KALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL--------KENAPNSVGTLVD 200
           KALF MH  LR  G++PPLD PHHLR  + + +REGVT+        +          V+
Sbjct: 234 KALFPMHRDLRLAGLMPPLDCPHHLRFEDESEYREGVTVDTPHWARNRGAGGGGNSVYVN 293

Query: 201 VGLNKHV----VVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVR 255
           +GL   +         L+ GVRVTV M  N       P  +V P +P E  G YWGY VR
Sbjct: 294 IGLRDPIEASLPTTAKLESGVRVTVRMPIN----TYEPGSIVSPREPVERLGWYWGYSVR 349

Query: 256 YAPNISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLTLPT------------------ 295
            A ++SSV  +C +    GYD +IGTSE G  V+ +DL + T                  
Sbjct: 350 LASSLSSVLTSCPFSSSSGYDLVIGTSERG--VSLADLAIATSAQTPVALPSGETIQPLP 407

Query: 296 --FRHLLIAFGGLAGLEESIEEDDGLK--RKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
             F H L+ FGGL+GLE ++E+D+G+K  R  A+++F ++ N   +QGSRT+RTEEAI I
Sbjct: 408 SSFNHALVVFGGLSGLEVAVEQDEGIKLDRHTAKDLFDAWVNVVENQGSRTVRTEEAIMI 467

Query: 352 SL 353
           +L
Sbjct: 468 AL 469


>gi|322703851|gb|EFY95453.1| deoxyribose-phosphate aldolase 2 [Metarhizium anisopliae ARSEF 23]
          Length = 339

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 123/298 (41%), Positives = 178/298 (59%), Gaps = 13/298 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+P SI+ N  + +    + G+IARA+ +F +DEVVVFD+  +S    ++  A  +
Sbjct: 39  TVSVAIPTSILTNLATADQRMSIPGRIARALAVFAVDEVVVFDDSPASSRPRQTDTAGYT 98

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +D     A FL  +L +LE P ++RKALF +H +LR   +LP LD PHH    +W  +
Sbjct: 99  GDTDP----AHFLTHILSFLEAPPFMRKALFPLHPNLRLTALLPSLDMPHHPNPKDWISY 154

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGVT+        GTLV+VGL++ V +++ + P  R+T+    ++   +  P+ V P +
Sbjct: 155 REGVTIPGKTSTGSGTLVEVGLDEPVEIEEDIPPKTRLTLLFPDDQ---SQYPQCVDPQA 211

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-FRHLL 300
              E G YWGY VR  P++SSVF    Y+ GYD  IGTSE G  ++ +  + P  F+HLL
Sbjct: 212 PRTEGGYYWGYTVRKCPSLSSVFTESPYEDGYDFSIGTSERGTPISKAFSSKPLKFKHLL 271

Query: 301 IAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           I FGG  GLE +   D+ L + +      RE+F  + N  P QGSRTIRTEEA+FI+L
Sbjct: 272 IVFGGPRGLEFASMNDEELVKMDIQGGRTRELFDQWVNVLPGQGSRTIRTEEALFIAL 329


>gi|397575650|gb|EJK49808.1| hypothetical protein THAOC_31275, partial [Thalassiosira oceanica]
          Length = 532

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 139/366 (37%), Positives = 197/366 (53%), Gaps = 80/366 (21%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS-----DNYSRS 115
           PTVS+A+PGS++ N Q+ EL T+LAGQIARA  +FR+DEVVV+D+   S      NY R 
Sbjct: 148 PTVSIAIPGSVVSNAQTRELQTQLAGQIARAAAVFRVDEVVVYDDGLGSTLKTMSNYRRG 207

Query: 116 SAANR-------------SNRSDENESGA---------------AFLVRLLQYLETPQYL 147
           +   R             + R ++   G                 FL R+LQY E     
Sbjct: 208 NQRRRDGEGGDDEGKNTIAKRKEDPRQGVDTEKPAHLQASTDPHTFLARILQYCE----- 262

Query: 148 RKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGL-NKH 206
                 +H  L+F G+LPPLDAPHHLR+ + A +REG+ +     N   +LVD G+ N+ 
Sbjct: 263 ------LHPDLQFCGLLPPLDAPHHLRRGDVATYREGIVVDNEDANDSTSLVDCGVPNRL 316

Query: 207 VVVDQVLDPGVRVTVAM-------GTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAP 258
           V +D+ + PGVR TV +       G   ++      +VV P+ P+ E G+YWGY  R A 
Sbjct: 317 VKIDRKVPPGVRCTVRLEPKAYETGKKGHMKG----EVVSPTCPRDEEGVYWGYTTRMAS 372

Query: 259 NISSVFKNCSYKGGYDHLIGTSEHGDI-VNSSDLTL--------------PT-----FRH 298
            I ++F  C Y G YD  +GTSE GD+ ++     L              P+     F H
Sbjct: 373 TIDAIFAECPY-GSYDMKVGTSERGDVSIDDPKFCLRKRRENDGKPARRPPSAESEGFDH 431

Query: 299 LLIAFGGLAGLEESIE--EDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           LLI FGG+AG+EES++  E   L  +++R++F  + N CP+QGSRTIR+EEA+FI+L   
Sbjct: 432 LLIVFGGVAGIEESVDAHESMHLSGRDSRKLFDVWVNVCPYQGSRTIRSEEAVFITLARL 491

Query: 357 QEPISR 362
              I+R
Sbjct: 492 SPYIAR 497


>gi|156057771|ref|XP_001594809.1| hypothetical protein SS1G_04617 [Sclerotinia sclerotiorum 1980]
 gi|154702402|gb|EDO02141.1| hypothetical protein SS1G_04617 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 383

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 134/321 (41%), Positives = 177/321 (55%), Gaps = 31/321 (9%)

Query: 62  TVSVAVPGSIIDN-----TQSLELATRLAGQIARAVTIFRIDEVVVFDN------KSSSD 110
           TVSVA+PGSII        QS +L T +AG IARA+ +F +DEV+VFD+      K    
Sbjct: 54  TVSVALPGSIISKYVASLAQSHDLKTSMAGHIARALAVFCVDEVIVFDDGNARSPKKPRQ 113

Query: 111 NYSRSSAANRSNRSDENESGAA----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP 166
           +  +  A      S++  +G      FL  LL YLETP  LRK LF MH +LR  G LP 
Sbjct: 114 SPCQPPAIQPKAPSEDKYTGTLDPDHFLTHLLSYLETPPNLRKYLFPMHENLRTAGTLPS 173

Query: 167 LDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM--- 223
           +D PHHLR  EW  +REG TL     +  GT V+ GL   V V + +    RVT+     
Sbjct: 174 IDLPHHLRADEWCMYREGTTLP--GADEHGTFVEAGLRIPVTVPEQIPENNRVTLKFDLE 231

Query: 224 ----GTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGT 279
                 +++ +      V P    +E G YWGY VR A  +S VF  C Y GGYD  IGT
Sbjct: 232 AEKAAKDKSYEVIKAEAVRPEEPREEGGYYWGYNVRRAGCLSDVFTECGYDGGYDMTIGT 291

Query: 280 SEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR---KNAREVFHSYF 332
           SE G    D+ +  +  +  F+HLL+ FGG+AGLE +++ D+ L +     A++VF  + 
Sbjct: 292 SERGVDVQDLYDDKEQKVGKFKHLLVVFGGVAGLEVAVKNDEELHKLGVTEAKDVFDRWI 351

Query: 333 NTCPHQGSRTIRTEEAIFISL 353
           N CP QGSRTIRTEEA +I L
Sbjct: 352 NVCPGQGSRTIRTEEAAWIGL 372


>gi|320169134|gb|EFW46033.1| hypothetical protein CAOG_04001 [Capsaspora owczarzaki ATCC 30864]
          Length = 543

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 188/348 (54%), Gaps = 48/348 (13%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGS++   QS EL T +AGQIAR++++F +DE+V++++ S     S S      
Sbjct: 182 TVSVAIPGSVLHLAQSAELKTYMAGQIARSLSLFNVDEIVIYNDGSGGKQASSSIVTTEG 241

Query: 122 NRSDENESGAA----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
                 +   +    FL R+LQYLETPQYLR+ LF MH  L+F G+L P+++ HH ++ E
Sbjct: 242 EFEGAGKQATSNPDVFLARVLQYLETPQYLRRQLFPMHRDLKFAGLLAPIESAHHPKQDE 301

Query: 178 WAPFREGVTLK---------------------------ENAPNSVGTLVDVGLNKHVVVD 210
           +  FREG+                               +AP    + VDVGL     ++
Sbjct: 302 FVEFREGLVQSRSKGQRGNAHSHGPASAWDDQEGGEQHHSAPLPTSSFVDVGLRTEAKIN 361

Query: 211 QVLDPGVRVTVAMGT----NRNLDADSPRQ-------VVPPSKPKE-SGMYWGYKVRYAP 258
             L PG RVTV        N++     P         VV PS P+E  G+YWGY VR  P
Sbjct: 362 VALQPGTRVTVQFPQWSVDNKDGHFSQPSHHRYPSGNVVTPSLPRELRGLYWGYSVRLVP 421

Query: 259 NISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLT-LPTFRHLLIAFGGLAGLEESIEE 315
            +S    +C +  +G YD LIG  + GD V+  D++ LP F+H LI FGG++GL  +IE 
Sbjct: 422 TLSRALVDCPFNERGRYDLLIGAGDEGDSVD--DMSELPEFKHALIVFGGVSGLPAAIEA 479

Query: 316 DDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
           D+ ++  N R VF  + +TCP +G R++R +E++ ISL   +  ++ A
Sbjct: 480 DESMEAHNPRAVFSKFLSTCPSRGCRSVRPDESLLISLAALRPHLAHA 527


>gi|343426289|emb|CBQ69820.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 455

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 146/349 (41%), Positives = 190/349 (54%), Gaps = 67/349 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----KSSSDNYSRSSA 117
           T+SVA+PGSI+ N QS EL +RLA  IARA  IF +DE+VVFD         D Y     
Sbjct: 103 TLSVAIPGSIVLNAQSPELQSRLAAHIARACAIFNVDEIVVFDEGEVRTGEGDLYQHRH- 161

Query: 118 ANRSNRSDENESGA--------------------AFLVRLLQYLETPQYLRKALFSMHSS 157
            N+ NR  +    A                     +L R+LQYLETPQYLRKALF MH  
Sbjct: 162 -NQQNRGTKRRFLADGEQEQQQEGEGEGKGFDPHTYLARVLQYLETPQYLRKALFPMHRD 220

Query: 158 LRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSV-----GTLVDVGLNKHVVVD-- 210
           LR  G++PPLD PHHLR  + + +REGVT+  +AP+          V+VGL   +     
Sbjct: 221 LRLAGLMPPLDCPHHLRFEDHSAYREGVTV--DAPHWATRARDSVYVNVGLRDPIEASLP 278

Query: 211 --QVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFKNC 267
               +D G RVTV M     L++    QVV P +P E  G YWGY VR AP++SSV    
Sbjct: 279 AGAHVDAGTRVTVKM----PLNSYDRGQVVSPREPVEHRGWYWGYSVRLAPSLSSVLTTN 334

Query: 268 SYKG-GYDHLIGTSEHGDIVNSSDLTL--------------------PTFRHLLIAFGGL 306
            +    YD +IGTSE G  V  +DL L                    P F H L+ FGGL
Sbjct: 335 PFASEPYDLVIGTSERG--VPLTDLALATASATPVALASGETVQPLQPRFAHALLVFGGL 392

Query: 307 AGLEESIEEDDGLK--RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +GLE ++E+D+G++  R  A+E+F ++ N   +QGSRT+RTEEAI I+L
Sbjct: 393 SGLEVAVEQDEGIRLDRHTAKELFDAWVNVVENQGSRTVRTEEAIMIAL 441


>gi|225555276|gb|EEH03568.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 380

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 137/338 (40%), Positives = 186/338 (55%), Gaps = 54/338 (15%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSII N QS +  T LAG IARA+ +F +DEVV+FD+ +      R    N +
Sbjct: 41  TLSIALPGSIIANAQSHDQKTFLAGSIARALAVFCVDEVVIFDDDA------RHIHLNGN 94

Query: 122 NRSDENESGAA-----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
           +  +E+E  A      FL  +L YLETP YLRK LF MH +LR  G LP LD PHH+R +
Sbjct: 95  HNINEDEYTAYSDPSHFLAHVLSYLETPPYLRKYLFPMHKNLRTAGTLPSLDMPHHIRAN 154

Query: 177 EWAPFREGVTLK--------------------------------ENAPNSVGTLVDVGLN 204
           EW  +REGVTL                                 +++P S  T V+ GL 
Sbjct: 155 EWCEYREGVTLSAAEEQGEVGMAGEHDNGRHGKKKGKRNKDNTGKDSPTSY-TSVNTGLP 213

Query: 205 KHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISS 262
           + V V  +  P   RVTV  G+  ++D     +VV P  P+ E+G YWGY VR   ++S+
Sbjct: 214 EKVRVPYIPTPENTRVTVKFGS--SIDPSDGAEVVSPFAPREETGYYWGYSVRRCASLSA 271

Query: 263 VFKNCSYKGGYDHLIGTSEHG-DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
           VF  C + GGYD   GTSE G  +  +    +P F HL++ FGG+AGLE ++  D  L+ 
Sbjct: 272 VFTECPFDGGYDLSFGTSERGCPLSEAVAGEVPKFEHLIVVFGGVAGLEAAVTADKDLRD 331

Query: 322 K-----NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
           K     +A  VF  + N  P QGSRTIRTEEA+++ L 
Sbjct: 332 KGLGPSDAERVFDYWVNVLPGQGSRTIRTEEAVWLGLM 369


>gi|209879415|ref|XP_002141148.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209556754|gb|EEA06799.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 397

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 135/330 (40%), Positives = 183/330 (55%), Gaps = 43/330 (13%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS--DNYSRSSAAN 119
           TVSVA+P SII N QSLEL   L GQIAR + IF IDE+VV+++K  +  DN  R S   
Sbjct: 55  TVSVALPSSIISNAQSLELKAYLVGQIARILAIFGIDEIVVYEDKYKNIVDNSERQSDYI 114

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
             N S   E    F V+ L+YLETPQYLRK+LF   + L+F G+  P+DAPHH+R  EW 
Sbjct: 115 GFNCSRSME----FFVKNLRYLETPQYLRKSLFKFEADLKFAGLQNPIDAPHHMRISEWL 170

Query: 180 PFREGVT---------LKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM------- 223
           P+R+GV          L ++     G+ V+ GL     +D+ L    R+T+ M       
Sbjct: 171 PYRQGVVVSKPRLFKGLIKSEHKKSGSWVNCGLPVEAWIDESLPDNTRITIKMIDNSENL 230

Query: 224 -----------GTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKG 271
                      G N +       +VV    P+ + G+YWGYKVR A ++ SV  N  YK 
Sbjct: 231 HKKLCDEFSQKGKNEDSKTYFIGKVVSDETPRTKKGIYWGYKVRPANSLKSVLFNSPYKK 290

Query: 272 GYDHLIGTSEHGDIVNSSDLTL------PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAR 325
           GYD +IGTSE G  + + +  L       +F+HLLI FGGL GLE+ + +   L   + +
Sbjct: 291 GYDLVIGTSERGCCI-TDNFKLEKSPDKKSFKHLLIVFGGLGGLEDVLGDPQSLFNSDTK 349

Query: 326 --EVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             ++F  Y N CP Q SRTIRTEEA+ ++L
Sbjct: 350 PEDLFDMYVNICPEQKSRTIRTEEALGLTL 379


>gi|222628479|gb|EEE60611.1| hypothetical protein OsJ_14024 [Oryza sativa Japonica Group]
          Length = 175

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 104/163 (63%), Positives = 129/163 (79%), Gaps = 1/163 (0%)

Query: 202 GLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNI 260
           G   +V+V+Q+L+PG RVT+AMGTNR++     R++V PS P+ E  +YWGYKVRYA N+
Sbjct: 12  GKRMNVLVEQMLEPGKRVTIAMGTNRDITTACKRKIVSPSSPRDEMELYWGYKVRYASNL 71

Query: 261 SSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLK 320
             VF +  YK GYD++IGTSEHG I++SS+L LP+FRHLLIAFGGLAGLEE IEED  LK
Sbjct: 72  GGVFSDSPYKEGYDYIIGTSEHGKIISSSELILPSFRHLLIAFGGLAGLEECIEEDRNLK 131

Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
            K   +VF++Y NTCP QGSRTIRTEEA+ ISLQYFQ+PI RA
Sbjct: 132 GKGVDDVFNTYLNTCPSQGSRTIRTEEALLISLQYFQDPIRRA 174


>gi|85074739|ref|XP_965737.1| hypothetical protein NCU00597 [Neurospora crassa OR74A]
 gi|28927550|gb|EAA36501.1| conserved hypothetical protein [Neurospora crassa OR74A]
 gi|38567310|emb|CAE76598.1| conserved hypothetical protein [Neurospora crassa]
 gi|336465334|gb|EGO53574.1| hypothetical protein NEUTE1DRAFT_133922 [Neurospora tetrasperma
           FGSC 2508]
          Length = 349

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 126/299 (42%), Positives = 173/299 (57%), Gaps = 14/299 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVAVP S+I +  + E  T  AG+IARA  +F +DE+V++D+   S   SR ++ +  
Sbjct: 48  TISVAVPTSVITDCVTREQRTTWAGRIARACAVFSVDEIVIYDD---SPVESRPASTDHE 104

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             + + +  A FL  LL YLETP ++RK LF MH +LR  G+LP LD PHH  K EW P+
Sbjct: 105 GYTGDVDP-AHFLEHLLNYLETPPFMRKLLFPMHPNLRSQGLLPNLDMPHHPYKGEWMPY 163

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG+TL+       GT+VD+GL   V +D+ L P  RVT+ M      +  SP  V P  
Sbjct: 164 REGLTLEAQPKGGKGTVVDIGLEDTVTIDEELPPKTRVTLKMPVG---EYGSPEPVHPAV 220

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--DLTLPTFRHL 299
              E G +WGY VR A  +S VF    Y+ GYD  IGTSE G  ++ +  +     F HL
Sbjct: 221 PRTEGGYFWGYTVRKAATLSDVFTQSPYEEGYDLSIGTSERGQSLHRAFPEHKEANFTHL 280

Query: 300 LIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           L+AFGG  GLE +   D      G+     +E+F  + N  P+QGSR IRT+EA+ I+L
Sbjct: 281 LVAFGGPRGLEYAAMNDPELSKIGIHGPRTKELFEHWVNVLPNQGSRGIRTDEAMLIAL 339


>gi|336275401|ref|XP_003352453.1| hypothetical protein SMAC_01287 [Sordaria macrospora k-hell]
 gi|380094341|emb|CCC07720.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 349

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 125/299 (41%), Positives = 174/299 (58%), Gaps = 14/299 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVAVP S+I +  + E  T  AG+IARA  +F +DE+V++D+   S   SR ++ +  
Sbjct: 48  TISVAVPTSVITDCVTREQRTTWAGRIARACAVFSVDEIVIYDD---SPVESRPASTDHE 104

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             + + +  A FL  LL YLETP ++RK LF MH +LR  G+LP LD PHH  K EW P+
Sbjct: 105 GYTGDVDP-AHFLEHLLNYLETPPFMRKLLFPMHPNLRSQGLLPNLDMPHHPYKGEWMPY 163

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG+TL+       GT+VD+GL + V +D+ L P  RVT+ M      +  SP  + P  
Sbjct: 164 REGLTLEAQPKGGKGTVVDIGLEETVTIDEELPPKTRVTLKMPVG---EYGSPEPIHPAV 220

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--DLTLPTFRHL 299
              E G +WGY VR A  +S VF    Y+ GYD  IGTSE G  ++ +  +     F HL
Sbjct: 221 PRTEGGYFWGYTVRKAATLSDVFTQSPYEEGYDLSIGTSERGQSLHRAFPEHKEANFTHL 280

Query: 300 LIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           L+AFGG  GLE +   D      G+     +E+F  + N  P+QGSR IRT+EA+ I+L
Sbjct: 281 LVAFGGPRGLEYAAMNDPELSKIGIHGPRTKELFEHWVNVLPNQGSRGIRTDEAMLIAL 339


>gi|350295631|gb|EGZ76608.1| DUF171-domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 349

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 126/299 (42%), Positives = 173/299 (57%), Gaps = 14/299 (4%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVAVP S+I +  + E  T  AG+IARA  +F +DE+V++D+   S   SR ++ +  
Sbjct: 48  TISVAVPTSVITDCVTREQRTTWAGRIARACAVFSVDEIVIYDD---SPVESRPASTDHE 104

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             + + +  A FL  LL YLETP ++RK LF MH +LR  G+LP LD PHH  K EW P+
Sbjct: 105 GYTGDVDP-AHFLEHLLNYLETPPFMRKLLFPMHPNLRSQGLLPNLDMPHHPYKGEWMPY 163

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG+TL+       GT+VD+GL   V +D+ L P  RVT+ M      +  SP  V P  
Sbjct: 164 REGLTLEAQPKGGKGTVVDIGLEDTVTIDEELPPKTRVTLKMPVG---EYGSPEPVHPAV 220

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--DLTLPTFRHL 299
              E G +WGY VR A  +S VF    Y+ GYD  IGTSE G  ++ +  +     F HL
Sbjct: 221 PRTEGGYFWGYTVRKAATLSDVFTQSPYEEGYDLSIGTSERGQSLHRAFPEHKEANFTHL 280

Query: 300 LIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           L+AFGG  GLE +   D      G+     +E+F  + N  P+QGSR IRT+EA+ I+L
Sbjct: 281 LVAFGGPRGLEYAAMNDPELSKIGIHGPRTKELFEHWVNVLPNQGSRGIRTDEAMLIAL 339


>gi|358057838|dbj|GAA96340.1| hypothetical protein E5Q_03006 [Mixia osmundae IAM 14324]
          Length = 526

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 138/312 (44%), Positives = 188/312 (60%), Gaps = 28/312 (8%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKS---SSDN---YSRS 115
           T+S+A+PGSI+ N Q+ EL T LAGQIAR+  IF +DE++VF   +   SS+N     R 
Sbjct: 213 TLSIALPGSILLNAQTPELRTILAGQIARSCAIFNVDEIIVFHEPTFSPSSENGHEQKRM 272

Query: 116 SAANRSNRSDENESGAA---FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH 172
               R    +E++S A     LV++L YLETPQYLRKAL  MHS+LR  G+LPPLD PHH
Sbjct: 273 RGKYRQTAEEESQSTAQADEMLVKILTYLETPQYLRKALIPMHSALRQAGILPPLDMPHH 332

Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRN 228
           LR+ + +PFRE V L  +  +     +D+GL   V        + + G RVT A  TN  
Sbjct: 333 LRRDDPSPFREAVVLSHDD-DGRSVQIDIGLRDPVRARTKSSALPERGTRVT-AHYTNAE 390

Query: 229 LDADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGG---YDHLIGTSEHGD 284
                   +V P  P+E  G+ WGY+VR+   IS +F + ++      YD +IGTSE G 
Sbjct: 391 WS------LVSPLLPRELLGISWGYRVRHVSQISRIFLDPAHGQPNEEYDLVIGTSERGQ 444

Query: 285 IVNSS-DLTLPTFRHLLIAFGGLAGLEESI--EEDDGLKRKNAREVFHSYFNTCPHQGSR 341
            + S+ D   P  +H+LI  GGLAGLE +I  +ED GL  + A ++F  + N CPHQGSR
Sbjct: 445 PLQSALDSFPPHPKHVLIVLGGLAGLELTIGADEDTGLTAETALDLFDHWINLCPHQGSR 504

Query: 342 TIRTEEAIFISL 353
           TIR+EEA+ I+L
Sbjct: 505 TIRSEEALPIAL 516


>gi|326426771|gb|EGD72341.1| hypothetical protein PTSG_00362 [Salpingoeca sp. ATCC 50818]
          Length = 484

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 128/322 (39%), Positives = 184/322 (57%), Gaps = 18/322 (5%)

Query: 36  SSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIF 95
           SS+  +++   +K     D  P    TVS+A+PGS+I   QS E+ + LAGQIARA  +F
Sbjct: 150 SSTARDERLPSKKAKVKQDHGPGRHYTVSIAIPGSLIAGVQSQEVRSHLAGQIARAAVLF 209

Query: 96  RIDEVVVFDNKSSSDNYSRSSAA----NRSNRSDENESGAAFLVRLLQYLETPQYLRKAL 151
           ++DE+VV+   + S + + +       N            AFL R+L+YL+ P +LRK  
Sbjct: 210 QVDEIVVYRESTISLDNTVTEGIPVLHNEHGTGKMRGDPCAFLARVLEYLDCPPFLRKHF 269

Query: 152 FSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ 211
           F +H  L+F  +LPPL+APHH      +PFREGVTLK+      G+ VD+GL +   +D+
Sbjct: 270 FGIHRDLKFASLLPPLEAPHHTVADAPSPFREGVTLKKYVKPGKGSFVDIGLTRPAKIDR 329

Query: 212 VLDPGVRVTVAMGTNR--NLDADSPR-QVVPPSKPKES-GMYWGYKVRYAPNISSVFKNC 267
            L PGVRVTVA+   +       + R + V PS P+E  GMYWGY VR A + + V   C
Sbjct: 330 HLAPGVRVTVALDVTQYHKKQVHTFRGKAVSPSAPREQKGMYWGYSVRVAESAADVLSEC 389

Query: 268 SYKGGYDHLIG-TSEHGDIVNSSDLTLP-TFRHLLIAFGGLAGLEESIEEDDGLKRKNAR 325
            +K GYD  +G T++ G    +  +T+P +F+H L+ FGG  GL+E +E      R+  +
Sbjct: 390 PWKTGYDLCMGITTKRGKAPEA--VTMPKSFKHALVIFGGAEGLDEFVE-----SRQAQK 442

Query: 326 EVFHSYFNTCPHQGSRTIRTEE 347
            V H   NTC H G+R IRTEE
Sbjct: 443 RVLHR-INTCKHAGTRNIRTEE 463


>gi|261193745|ref|XP_002623278.1| hypothetical protein BDBG_06717 [Ajellomyces dermatitidis SLH14081]
 gi|239588883|gb|EEQ71526.1| hypothetical protein BDBG_06717 [Ajellomyces dermatitidis SLH14081]
 gi|239613796|gb|EEQ90783.1| hypothetical protein BDCG_05903 [Ajellomyces dermatitidis ER-3]
          Length = 380

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 136/337 (40%), Positives = 186/337 (55%), Gaps = 54/337 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN---KSSSDNYSRSSAA 118
           T+S+A+PGSII N QS +  T LAG IARA+ +F +DEVV+FD+   ++ S  Y  ++  
Sbjct: 41  TLSIALPGSIIANAQSHDQKTFLAGSIARALAVFCVDEVVIFDDDARQTRSKPYHNANED 100

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
             +  SD +     FL  +L YLETP YLRK LF MH +LR  G LP LD PHH+R +EW
Sbjct: 101 EYTAYSDPSH----FLAHVLSYLETPPYLRKYLFPMHRNLRTAGTLPSLDMPHHIRANEW 156

Query: 179 APFREGVTL---KENA----------------------------PNSVGTLVDVGLNKHV 207
             +REGVT+   +E A                             ++  TLV+ GL +  
Sbjct: 157 CEYREGVTVSASEEQADARTAREYGNTNHERKKGKKNKEKKGQGSSTSYTLVNTGLPEKS 216

Query: 208 VVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFK 265
            V  +  P   RVTV  G ++  D     +VV P  P+E  G YWGY VR   ++S+VF 
Sbjct: 217 RVPHIPAPENTRVTVKFGASK--DPSEVAEVVSPVTPREEMGYYWGYSVRRCSSLSTVFT 274

Query: 266 NCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED----- 316
            C ++GGYD   GTSE G    D++      +P   HL+I FGG+AGLE +++ D     
Sbjct: 275 ECPFEGGYDLSFGTSERGAPLSDVIAGH---VPKSEHLIIVFGGVAGLEAAVKADKELQE 331

Query: 317 DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            GLK   A ++F  + N  P QGSRTIRTEEA+++ L
Sbjct: 332 KGLKPSEAEKMFDYWINVVPGQGSRTIRTEEAVWLGL 368


>gi|240275221|gb|EER38736.1| hypothetical protein HCDG_07605 [Ajellomyces capsulatus H143]
 gi|325094582|gb|EGC47892.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 380

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 136/338 (40%), Positives = 187/338 (55%), Gaps = 54/338 (15%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSII N QS +  + LAG IARA+ +F +DEVV+FD+ +      R    N +
Sbjct: 41  TLSIALPGSIIANAQSHDQKSFLAGSIARALAVFCVDEVVIFDDDA------RHIHLNGN 94

Query: 122 NRSDENESGAA-----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
           +  +E+E  A      FL  +L YLETP YLRK LF MH +LR  G LP LD PHH+R +
Sbjct: 95  HNINEDEYTAYSDPSHFLAHVLSYLETPPYLRKYLFPMHKNLRTAGTLPSLDMPHHIRAN 154

Query: 177 EWAPFREGVTLK--------------------------------ENAPNSVGTLVDVGLN 204
           EW  +REGVTL                                 +++P S  TLV+ GL 
Sbjct: 155 EWCEYREGVTLSAAEEQGEVGMVGEHDNGRHGKKKGKRNKDNMGKDSPTSY-TLVNTGLP 213

Query: 205 KHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISS 262
           + V V  +  P   RVTV  G+  ++D     +VV P  P+ E+G YWGY VR   ++S+
Sbjct: 214 EKVRVPYIPTPENTRVTVKFGS--SIDPSDGAEVVSPFAPREETGYYWGYSVRRCASLSA 271

Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNSSDL-TLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
           VF  C + GGYD   GTSE G  ++ +    +  F HL++ FGG+AGLE ++  D  L+ 
Sbjct: 272 VFTECPFDGGYDLSFGTSERGCPLSEAIAGEVLKFEHLIVVFGGVAGLEAAVTADKNLRD 331

Query: 322 K-----NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
           K     +A  VF  + N  P QGSRTIRTEEA+++ L 
Sbjct: 332 KGLGASDAERVFDYWVNVLPGQGSRTIRTEEAVWLGLM 369


>gi|401404268|ref|XP_003881686.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325116099|emb|CBZ51653.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 821

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 132/347 (38%), Positives = 185/347 (53%), Gaps = 59/347 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+P SIIDN Q+ EL   L GQIAR +T+F +DE++V+++ ++        A +R 
Sbjct: 471 TLSVALPASIIDNAQTAELRAALVGQIARTLTVFGVDEIIVYEDVAA--------AISRG 522

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
           N  D +     F VR L+YLETPQ+LRK+LF +HS LRF G+  PLDAPHHLR++EW P+
Sbjct: 523 NAEDGHSRALEFFVRNLRYLETPQFLRKSLFPIHSDLRFAGLQNPLDAPHHLRRNEWLPY 582

Query: 182 REGVTL---------------------------------KENAPNSVGTLVDVGLNKHVV 208
           REGV +                                 K+   N  G  V+ GL   V 
Sbjct: 583 REGVVVASSKTRQERQHAKATECDRAKGPGDDVKLTAAEKKRLKNKQGAWVECGLPALVW 642

Query: 209 V-DQVLDPGVRVTVAMGTN-RNL--------DADSPRQ----VVPPSKPKESGMYWGYKV 254
           + +  L+ G+RVTV +  + R L        DA+ P      V P   P ++G+YWGY+V
Sbjct: 643 IPNSRLEDGMRVTVRLDASVRQLQRQPQERGDAEPPLMRGVAVSPDEPPVKAGLYWGYRV 702

Query: 255 RYAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEES 312
           R A +   VF +C +   G YD  +GTSE G  V         +RH+L+ FGGL GLE  
Sbjct: 703 RIAQHFQDVFSSCPFSADGRYDLTVGTSERGACVGRGFEFPRNYRHMLLVFGGLQGLEAV 762

Query: 313 I--EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           +   + +    ++   +F  Y NTC  Q SRTIR EEA+ I+L   +
Sbjct: 763 LLDRQSNCAPCRDPSTLFDMYLNTCAFQRSRTIRAEEAVPITLALLR 809


>gi|449302535|gb|EMC98544.1| hypothetical protein BAUCODRAFT_23290 [Baudoinia compniacensis UAMH
           10762]
          Length = 382

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 133/302 (44%), Positives = 168/302 (55%), Gaps = 46/302 (15%)

Query: 79  ELATRLAGQIARAVTIFRIDEVVVFDNKS------SSDNYSRSSAANRSNRSDENESGAA 132
           EL T LAGQIARA  +F +DEVVVFD+ +        D Y+          +D N     
Sbjct: 86  ELKTVLAGQIARACAVFCVDEVVVFDDGAVKARPPDEDGYT--------GYTDPNN---- 133

Query: 133 FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLK--EN 190
           FLV++L YLETP +LR+ L   H+  +  G+ P LD PHHLR  EW  FREGV L   EN
Sbjct: 134 FLVQMLTYLETPPHLRRTLIPKHADFQCAGVAPSLDMPHHLRSDEWCIFREGVALHPGEN 193

Query: 191 APNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR---------QVVPPS 241
              + GTLVD GL + V V   L+   RVTV +        D P+         + V P 
Sbjct: 194 RDGNPGTLVDCGLQQKVFVGVPLEERSRVTVKL-------PDQPQAGRQDMLHGEAVSPD 246

Query: 242 KPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV------NSSDLTLP 294
            P+ E+G YWGY VR A ++ +VF  C Y GGYD  IGTSE G  V      +S     P
Sbjct: 247 TPREEAGYYWGYSVRQASSLGAVFTECQYDGGYDISIGTSERGMSVQSVLSADSVSCVQP 306

Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKN---AREVFHSYFNTCPHQGSRTIRTEEAIFI 351
           T++HLLI FGG+AGLE ++  D  L+      A+EVF  + N  P QGSRTIRTEEA +I
Sbjct: 307 TWQHLLIVFGGVAGLEAALSADAELQAAGVSEAKEVFDRWLNVLPQQGSRTIRTEEATWI 366

Query: 352 SL 353
           +L
Sbjct: 367 AL 368


>gi|327350021|gb|EGE78878.1| deoxyribose-phosphate aldolase [Ajellomyces dermatitidis ATCC
           18188]
          Length = 380

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 135/337 (40%), Positives = 186/337 (55%), Gaps = 54/337 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN---KSSSDNYSRSSAA 118
           T+S+A+PGSII N QS +  T LAG IARA+ +F +DEVV+FD+   ++ S  Y  ++  
Sbjct: 41  TLSIALPGSIIANAQSHDQKTFLAGSIARALAVFCVDEVVIFDDDARQTRSKPYHNANED 100

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
             +  SD +     FL  +L YLETP YLRK LF MH +LR  G LP LD PHH+R +EW
Sbjct: 101 EYTAYSDPSH----FLAHVLSYLETPPYLRKYLFPMHRNLRTAGTLPSLDMPHHIRANEW 156

Query: 179 APFREGVTL---KENA----------------------------PNSVGTLVDVGLNKHV 207
             +REGVT+   +E A                             ++  TLV+ GL +  
Sbjct: 157 CEYREGVTVSASEEQADARTAREYGNTNHERKKGKKNKEKKGQGSSTSYTLVNTGLPEKS 216

Query: 208 VVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFK 265
            V  +  P   RVTV  G ++  D     +VV P  P+E  G YWGY VR   ++S+VF 
Sbjct: 217 RVPHIPAPENTRVTVKFGASK--DPSEVAEVVSPVTPREEMGYYWGYSVRRCSSLSTVFT 274

Query: 266 NCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED----- 316
            C ++GGYD   GTSE G    D++      +P   HL++ FGG+AGLE +++ D     
Sbjct: 275 ECPFEGGYDLSFGTSERGAPLSDVIAGH---VPKSEHLILVFGGVAGLEAAVKADKELQE 331

Query: 317 DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            GLK   A ++F  + N  P QGSRTIRTEEA+++ L
Sbjct: 332 KGLKPSEAEKMFDYWINMVPGQGSRTIRTEEAVWLGL 368


>gi|226294556|gb|EEH49976.1| deoxyribose-phosphate aldolase [Paracoccidioides brasiliensis Pb18]
          Length = 380

 Score =  220 bits (561), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 137/335 (40%), Positives = 182/335 (54%), Gaps = 50/335 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSII N QS +  T LAG IARA+ +F +DEVV+F++    D   R S A+  
Sbjct: 41  TLSVALPGSIIANAQSHDQKTFLAGSIARALAVFCVDEVVIFED----DVRQRQSNAHYD 96

Query: 122 NRSDENES---GAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
              DE  +    + FL  +L YLETP YLRK LF MH +LR  G LP LD PHH+R +E 
Sbjct: 97  LSKDEYTAYSDPSHFLAHVLSYLETPPYLRKFLFPMHKNLRTAGTLPSLDMPHHIRANES 156

Query: 179 APFREGVTLKENAPNSVG-------------------------------TLVDVGLNKHV 207
             +REGVT+  N  +  G                               TLVD GL   V
Sbjct: 157 CEYREGVTVPANEEHGFGQSNAENHHNQHKRKKGKKNMEEGVQDTVMSYTLVDTGLPNKV 216

Query: 208 VVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNISSVFK 265
            +  +  P   RVTV + ++   D     +VV PS P+E  G YWGY VR   ++S VF 
Sbjct: 217 CIPHIPVPENTRVTVKLCSSE--DPSQGAEVVSPSTPREEMGYYWGYSVRCCDSLSDVFT 274

Query: 266 NCSYKGGYDHLIGTSEHGDIVNSSDLT--LPTFRHLLIAFGGLAGLEESIEEDD-----G 318
            C + GGYD   GTSE G  + S  +T  +P F HL++ FGG+AGLE +++ D      G
Sbjct: 275 ECPFDGGYDLSFGTSERGSPL-SQVITEGVPNFEHLIVVFGGVAGLEVAVKADKELLARG 333

Query: 319 LKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           L+ +    +F  + N  P QGSRTIRTEEA+++ L
Sbjct: 334 LRPETVGNMFDYWVNVLPGQGSRTIRTEEAVWLGL 368


>gi|115402537|ref|XP_001217345.1| deoxyribose-phosphate aldolase 2 [Aspergillus terreus NIH2624]
 gi|114189191|gb|EAU30891.1| deoxyribose-phosphate aldolase 2 [Aspergillus terreus NIH2624]
          Length = 661

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 145/377 (38%), Positives = 194/377 (51%), Gaps = 72/377 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSS-SDNYSR------ 114
           T+SVA+PGSI+ N  S+E  T LAG IARA+ +F +DEVV+FD++   S NY        
Sbjct: 285 TLSVAIPGSIVANAHSIEQKTFLAGIIARALAVFCVDEVVIFDDEEHRSRNYRHYDDGYD 344

Query: 115 SSAANRSNRSDENESG-------AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
           S  A  S+  + +  G       + FL  +L YLETP YLRK LF MH +LR  G+LP L
Sbjct: 345 SPIAKTSDELNASTKGYTAYSDPSHFLAHVLSYLETPPYLRKHLFPMHPNLRGAGLLPSL 404

Query: 168 DAPHHLRKHEWAPFREGVTLKEN--------------APN-------------------- 193
           D PHHLR HEW  +REG+ L                 +PN                    
Sbjct: 405 DMPHHLRAHEWCDYREGIVLSSTPRDGRGRSGSQMSLSPNRHSYRNLSPSSNSSQASTNP 464

Query: 194 SVGTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWG 251
           +  T+VD GL++ VV+  +  P   RVTV    + +   D   + V PS P+ E+G YWG
Sbjct: 465 TTATVVDTGLSEKVVLPDIQLPEHARVTVRFSHHGH---DHYAEPVHPSAPRAEAGYYWG 521

Query: 252 YKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-----------SDLTLPTFRHLL 300
           Y VR   ++SSVF  C + GGYD   GTSE G  V                  P ++HLL
Sbjct: 522 YYVRRCRSLSSVFTECPFDGGYDLSFGTSERGAPVGEVLEDEAPYHRPPHTHRPDYKHLL 581

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAR-----EVFHSYFNTCPHQGSRTIRTEEAIFI---S 352
           + FGG+AG+E +I  D  L+  + R     ++F  + N  P QGSRTIRTEEA+++   S
Sbjct: 582 VVFGGVAGIEAAIRNDPQLRDMDIRPTDAGKLFDYWINLLPGQGSRTIRTEEAVWLGLTS 641

Query: 353 LQYFQEPISRALRRVEY 369
           L+   E   RA R   Y
Sbjct: 642 LRGLVEGTHRAKRSYRY 658


>gi|258573617|ref|XP_002540990.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237901256|gb|EEP75657.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 369

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 131/334 (39%), Positives = 182/334 (54%), Gaps = 48/334 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGSII N +S +  T LAG IARA+ +F +DE+V+FD++  +  +        +
Sbjct: 40  TLSIALPGSIIANAKSHDQKTYLAGYIARALAVFCVDEIVIFDDEPRAQGH-------EN 92

Query: 122 NRSDENESGA-----AFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
            +  ENE  A      FL  +L YLETP +LR+ LF MH +LR  G LP LD PHHLR +
Sbjct: 93  KQPFENEYTAFSDPSHFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHLRAN 152

Query: 177 EWAPFREGVTLK----------------------ENAPNSVGTLVDVGLNKHVVVDQV-L 213
           EW  +REGVT+                       EN   ++ TL + G +  V +  + +
Sbjct: 153 EWCEYREGVTIPGENYGSEESTSNKKKNKRSNKGENGSETL-TLANAGFSHPVRISNLSI 211

Query: 214 DPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGG 272
            P  R+T+   T    D   P   V PS P+ E+G YWGY +R   ++SSV   C + GG
Sbjct: 212 PPHTRLTLKFKTENPSDGADP---VAPSAPREEAGYYWGYSIRRCSSLSSVLTECPFDGG 268

Query: 273 YDHLIGTSEHGD---IVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK-----NA 324
           YD   GTSE G     V      +P +RHLL+ FGG+AGLE +++ D  L  K     N 
Sbjct: 269 YDLSFGTSERGTPLATVLQCPEEIPQYRHLLLVFGGVAGLETAVKADKELADKGIGPGNV 328

Query: 325 REVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
            E+F  + N  P QGSRTIRTEEA+++ L   +E
Sbjct: 329 GELFDYWVNVLPGQGSRTIRTEEAVWLGLMGLRE 362


>gi|134079630|emb|CAK40846.1| unnamed protein product [Aspergillus niger]
          Length = 843

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 142/367 (38%), Positives = 190/367 (51%), Gaps = 76/367 (20%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK---SSSDNYSR---- 114
           T+SVA+PGSIIDN  S+E  T LAG IARA+ +F +DEVVVFD+    +  D Y      
Sbjct: 463 TLSVAIPGSIIDNAHSVEQKTFLAGVIARALAVFCVDEVVVFDDNEYGAHCDYYDHRVSY 522

Query: 115 -SSAANRSNRSD----------ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
            S  A   ++ D           N   + FL  +L YLETP YLRK LF +H +LR  G+
Sbjct: 523 DSPIAKTRDQVDGDTSSPKAYTANSDPSHFLTHVLSYLETPPYLRKHLFRIHPNLRTAGL 582

Query: 164 LPPLDAPHHLRKHEWAPFREGVTLKENA-------------------------PNSVGTL 198
           LP LD PHHLR +EW  FREG+ +  +A                          N   T+
Sbjct: 583 LPSLDMPHHLRANEWCDFREGIVVPSSARKSNPASKITQMSKDHPFHRRNSSTNNGPATI 642

Query: 199 VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSP-RQVVPPSKPK-ESGMYWGYKVR 255
           VD GL++ +V+  +  P   RVTV     R L  + P  + V PS P+ E+G YWGY VR
Sbjct: 643 VDTGLSQKIVLPNIHLPEHARVTV-----RFLQHEFPYAEPVHPSTPRAEAGYYWGYYVR 697

Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS--------------------SDLTLPT 295
              ++SSVF  C Y GGYD   GTSE G  V++                     D   P 
Sbjct: 698 RCRSLSSVFTECPYDGGYDLSFGTSERGVPVSAVLEEKPQQAHDRYKQGHSFCRDQISPD 757

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKR-----KNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
           F+H+L+ FGG+AG+E ++  D  L+       +A ++F  + N  P QGSRTIRTEEA++
Sbjct: 758 FQHILVVFGGVAGIEAAVRNDPQLRDMAIRPSDAGKLFDYWVNFLPGQGSRTIRTEEAVW 817

Query: 351 ISLQYFQ 357
           + L   Q
Sbjct: 818 MGLTSLQ 824


>gi|317033146|ref|XP_001394931.2| deoxyribose-phosphate aldolase [Aspergillus niger CBS 513.88]
          Length = 456

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 142/367 (38%), Positives = 190/367 (51%), Gaps = 76/367 (20%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK---SSSDNYSR---- 114
           T+SVA+PGSIIDN  S+E  T LAG IARA+ +F +DEVVVFD+    +  D Y      
Sbjct: 76  TLSVAIPGSIIDNAHSVEQKTFLAGVIARALAVFCVDEVVVFDDNEYGAHCDYYDHRVSY 135

Query: 115 -SSAANRSNRSD----------ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
            S  A   ++ D           N   + FL  +L YLETP YLRK LF +H +LR  G+
Sbjct: 136 DSPIAKTRDQVDGDTSSPKAYTANSDPSHFLTHVLSYLETPPYLRKHLFRIHPNLRTAGL 195

Query: 164 LPPLDAPHHLRKHEWAPFREGVTLKENA-------------------------PNSVGTL 198
           LP LD PHHLR +EW  FREG+ +  +A                          N   T+
Sbjct: 196 LPSLDMPHHLRANEWCDFREGIVVPSSARKSNPASKITQMSKDHPFHRRNSSTNNGPATI 255

Query: 199 VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWGYKVR 255
           VD GL++ +V+  +  P   RVTV     R L  + P  + V PS P+ E+G YWGY VR
Sbjct: 256 VDTGLSQKIVLPNIHLPEHARVTV-----RFLQHEFPYAEPVHPSTPRAEAGYYWGYYVR 310

Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS--------------------SDLTLPT 295
              ++SSVF  C Y GGYD   GTSE G  V++                     D   P 
Sbjct: 311 RCRSLSSVFTECPYDGGYDLSFGTSERGVPVSAVLEEKPQQAHDRYKQGHSFCRDQISPD 370

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKR-----KNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
           F+H+L+ FGG+AG+E ++  D  L+       +A ++F  + N  P QGSRTIRTEEA++
Sbjct: 371 FQHILVVFGGVAGIEAAVRNDPQLRDMAIRPSDAGKLFDYWVNFLPGQGSRTIRTEEAVW 430

Query: 351 ISLQYFQ 357
           + L   Q
Sbjct: 431 MGLTSLQ 437


>gi|380494662|emb|CCF32981.1| hypothetical protein CH063_05257 [Colletotrichum higginsianum]
          Length = 335

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 127/304 (41%), Positives = 180/304 (59%), Gaps = 27/304 (8%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+P SI+ +  + E  T +AG+IARA+ +F +DE+V+FD     D+   S   N  
Sbjct: 37  TVSVAIPSSILTDAITAERRTAVAGRIARALAVFSVDEIVLFD-----DSPVESRPKNVD 91

Query: 122 NRSDENE-SGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
            ++   +  G  FL  LL YLE P ++RK LF +H++LR    LP L+ PHH    +W P
Sbjct: 92  TKAYTGDVDGCHFLEHLLGYLEVPPFMRKMLFPLHANLRLANQLPALEMPHHPNPADWLP 151

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
           +REGV  K   P   GT++DVGL  HV+++  + P  RVT++   + N D     +   P
Sbjct: 152 YREGVASK-GVPQ--GTMIDVGLKNHVLIEDEVPPNTRVTLSY-PDYNTDK---AEACDP 204

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT---- 295
           S P+ E G YWGY+VR A ++SSVF  C Y+GGYD  IGTSE G  ++ +    P+    
Sbjct: 205 SAPREEGGYYWGYQVRRANSLSSVFTECKYEGGYDVSIGTSERGRSLSDA---FPSAAPL 261

Query: 296 -FRHLLIAFGGLAGLEESIEEDDGLK-----RKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
            F+HLLI FGG  G+E +   D  L+        ARE+F  + +  P QGSRTIRT+EAI
Sbjct: 262 DFKHLLIVFGGPRGIEYAALNDKELEGMEMTPAKARELFDHWVDVLPGQGSRTIRTDEAI 321

Query: 350 FISL 353
           +++L
Sbjct: 322 WVAL 325


>gi|452836847|gb|EME38790.1| hypothetical protein DOTSEDRAFT_139652, partial [Dothistroma
           septosporum NZE10]
          Length = 331

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 129/305 (42%), Positives = 172/305 (56%), Gaps = 19/305 (6%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+A+PGS+I N  + +  T LAGQIARA  +F +DEVVVFD     D  +   A  + 
Sbjct: 20  TLSIALPGSVIANAVTHDQKTSLAGQIARACAVFCVDEVVVFD-----DGQAEIRAPEQG 74

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +  ++  + FL  +L YLETP  LRK+LF +H  LR  G LP LD PHHLR  EW  F
Sbjct: 75  GYTAYSDP-SFFLYHVLTYLETPPNLRKSLFPIHPDLRTAGALPSLDMPHHLRSDEWCQF 133

Query: 182 REGVTLKENAPN-SVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADS--PRQVV 238
           REG+T+  +      GTLVD GL   V +   ++P  RVTV +  N  +        + V
Sbjct: 134 REGITIGPSRNGREGGTLVDCGLATKVYIPDEIEPQTRVTVQLPDNGLIHHAEIIIGEAV 193

Query: 239 PPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV------NSSDL 291
            P  P+E  G YWGY VR A ++ SVF  C + GGYD  IGTSE G  +       S+  
Sbjct: 194 SPDTPREDGGYYWGYAVRQASSLGSVFTECPFDGGYDVSIGTSERGRPLRSITEQGSATY 253

Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN---AREVFHSYFNTCPHQGSRTIRTEEA 348
             PT+ H+++  GG++GLE  +  D  L+      A +VF  + N    QGSRTIRTEEA
Sbjct: 254 VEPTWNHMIMVLGGVSGLEAGLRADKELQAAGVSEAAQVFDFWINLVLGQGSRTIRTEEA 313

Query: 349 IFISL 353
           I+I L
Sbjct: 314 IWIGL 318


>gi|403355602|gb|EJY77383.1| hypothetical protein OXYTRI_00986 [Oxytricha trifallax]
          Length = 395

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 120/299 (40%), Positives = 170/299 (56%), Gaps = 16/299 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+ VP SIIDN QSLEL T L GQIA+A   F ++E++V+ N              ++
Sbjct: 92  TISIVVPSSIIDNAQSLELKTYLVGQIAKACGHFGVNEIIVYSNDRQQ--------HMKN 143

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
              D   +   F V+ L+Y+ETPQYLRKALF  HS+LR+ G+  PL+ PHHL+  EW  +
Sbjct: 144 LVGDTRTTTTEFFVKNLEYIETPQYLRKALFPRHSALRYTGLTNPLEGPHHLKITEWCAY 203

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ--VVP 239
           REGV +     +  G+ V++GL K   VD  L+ G RVTV +   +N D        VV 
Sbjct: 204 REGVVINRPTAHGKGSWVNIGLPKDCQVDIQLEEGTRVTVKLN-EKNFDNTRYYSGVVVS 262

Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSY-KGGYDHLIGTSEHGDIVNSSDL-TLPTF 296
            ++P E  G++WGY VR A     +F  C + + GYD  +G SE G  +   D      F
Sbjct: 263 SNEPTEKLGLFWGYTVRVANKFEDIFDECPFEETGYDLKLGVSEQGQEIEHVDFRQHQNF 322

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKN--AREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +H LI FGG+ G+E  IE+D+  K      ++ F S  N  P +G+RTIRTEE+I +++
Sbjct: 323 KHALIFFGGMEGIEGIIEQDERTKMSTEAIKKKFDSLINPVPERGTRTIRTEESILVTM 381


>gi|393908513|gb|EJD75089.1| deoxyribose-phosphate aldolase [Loa loa]
          Length = 365

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 23/318 (7%)

Query: 50  DQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKS-- 107
           D +N+        +S+A+PGSI++N QS EL T LAGQIAR+  IF +DE++V+D  +  
Sbjct: 46  DNMNETVNGRFLMLSIALPGSILNNAQSPELRTYLAGQIARSAAIFCVDEIIVYDETARM 105

Query: 108 ---SSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGML 164
                D+Y              N      L R+L+++E PQYLRK LF +  +LR+ G+L
Sbjct: 106 KPQQRDDYCSGKWCPDLPVHSGNVECNFHLARILEFMECPQYLRKTLFPLQKTLRYAGIL 165

Query: 165 PPLDAPHHLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ--VLDPGVRVTV 221
            PLD PHHLR  + + P+REGV L +      G + ++GL K V +++  VL  G RVTV
Sbjct: 166 NPLDCPHHLRASDLSVPYREGVVLDKPVKAGRGPICNIGLYKEVQINEEVVLQAGTRVTV 225

Query: 222 AMGTNRNLDADSPRQ------VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH 275
            +     LD  S R+      V      +E+G+YWGYKVR A ++          G YD 
Sbjct: 226 KV-----LDIYSERKRYHGCLVNSQQIKRETGLYWGYKVRIALSLHDALN----AGEYDV 276

Query: 276 LIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTC 335
           +IGTSE G  V+  ++ L     +LI FGGL GLE +IE D  +      E+F  Y N  
Sbjct: 277 IIGTSERGKPVSKFEMPLCERNRILIVFGGLEGLEAAIEADKNISCSTPEELFEHYLNVV 336

Query: 336 PHQGSRTIRTEEAIFISL 353
           P QGSR IRTEEAI I+L
Sbjct: 337 PGQGSRIIRTEEAIPITL 354


>gi|444721244|gb|ELW61988.1| Kynurenine--oxoglutarate transaminase 1 [Tupaia chinensis]
          Length = 982

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 113/235 (48%), Positives = 150/235 (63%), Gaps = 4/235 (1%)

Query: 134 LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPN 193
           L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + FREGV +      
Sbjct: 748 LARVLQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEFREGVVVDRPTRP 807

Query: 194 SVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWG 251
             G+ V+ G+ K V +D+ L+PG+RVTV +   +  +  + R +VV    P+ ++G+YWG
Sbjct: 808 GHGSFVNCGMKKEVKIDKNLEPGLRVTVQLSQKQLPECKTYRGKVVSSQDPRTKAGLYWG 867

Query: 252 YKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEE 311
           Y VR A  +S+VF    ++ GYD  IGTSEHG  V S+   LP+FRH L+ FGGL GLE 
Sbjct: 868 YTVRLASCLSAVFAEAPFEDGYDLTIGTSEHGSDVGSAQ--LPSFRHALVVFGGLQGLEA 925

Query: 312 SIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
            ++ D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  ++RA  R
Sbjct: 926 GVDADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPGLNRAGAR 980



 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 68/144 (47%), Positives = 92/144 (63%), Gaps = 4/144 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +      +++     +
Sbjct: 544 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQD---AKTVEGEFT 600

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               + ++    L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 601 GVGKKGQACVQ-LARVLQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 659

Query: 182 REGVTLKENAPNSVGTLVDVGLNK 205
           REGV +        G+ V+ G+ K
Sbjct: 660 REGVVVDRPTRPGHGSFVNCGMKK 683



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/45 (68%), Positives = 37/45 (82%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK 106
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +
Sbjct: 484 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEE 528


>gi|324509698|gb|ADY44067.1| Unknown [Ascaris suum]
          Length = 388

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 126/301 (41%), Positives = 177/301 (58%), Gaps = 15/301 (4%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD------NYSRSS 116
           +S+A+PGSI++N QS EL T LAGQIARA  +F +DEVVVFD  +          +S S 
Sbjct: 82  LSIALPGSIMNNAQSPELRTYLAGQIARAAAVFCVDEVVVFDETARMKPEELKAYHSGSW 141

Query: 117 AANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
            A +  R D  E     + R+L++LE PQYLRK LF +  +L+  G+L PLD  HHLR  
Sbjct: 142 YAEKPIREDNVECNF-HMARILEFLECPQYLRKTLFPLQRALKNAGVLNPLDCMHHLRST 200

Query: 177 EWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLD-PGVRVTVAMGTNRNLDADSP 234
           + + P+REGV L +      G L +VGL K + +D  +D P         T  +LDA+  
Sbjct: 201 DLSIPYREGVVLNKPVKKGRGPLCNVGLEKELQLDSEIDLPPGTRVTVRVTTTDLDANRL 260

Query: 235 RQVVPPSKP--KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT 292
           R  +  ++    ESG+YWGY VR A ++S       Y   YD +IGTSE G+ ++  +L 
Sbjct: 261 RGSLISARRVRDESGIYWGYNVRVASSLSDAI----YGAKYDIIIGTSERGEAIDKFELP 316

Query: 293 LPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
           L     +L+ FGGL GLE ++E D+ ++  +  ++F  Y N+ P QGSRTIRTEEAI I+
Sbjct: 317 LSEGSRILLVFGGLEGLEAAVEADESIRESDPAQLFKHYLNSVPDQGSRTIRTEEAIPIT 376

Query: 353 L 353
           L
Sbjct: 377 L 377


>gi|119192166|ref|XP_001246689.1| hypothetical protein CIMG_00460 [Coccidioides immitis RS]
          Length = 370

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/342 (38%), Positives = 185/342 (54%), Gaps = 51/342 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSII N +S +  T L G IARA+ +F +DE+V+F+++  S  +   S A   
Sbjct: 28  TLSVALPGSIIANAKSHDQKTYLVGSIARALAVFCVDEIVIFEDEPESQQFGNRSRAENE 87

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +  + S   FL  +L YLETP +LR+ LF MH +LR  G LP LD PHH+R +EW  +
Sbjct: 88  YTAFSDPS--HFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHIRANEWCEY 145

Query: 182 REGVTL------KENAP----------------------------NSVGTLVDVGLNKHV 207
           REGVT+       E+ P                            N+  TLV+ G+ + V
Sbjct: 146 REGVTIPNDNYGAEDGPSERSNKKKKSKKSKRNQQGGQEHENSSSNNTFTLVNTGIPEPV 205

Query: 208 VVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFK 265
            +  + + P  R+T+   T    D   P   V PS P+ E+G YWGY +R   ++SSVF 
Sbjct: 206 RIPNLSILPHTRLTLKFKTQDYSDGADP---VAPSTPREEAGYYWGYSIRRCTSLSSVFT 262

Query: 266 NCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
            C ++GGYD   GTSE G     I+   +  +P ++HLL+ FGG+AGLE + + D  L  
Sbjct: 263 ECPFEGGYDLSFGTSERGLPLAHILQHPE-KIPQYKHLLLVFGGVAGLETAAKADKELAA 321

Query: 322 KN-----AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           K        E+F  + N  P QGSRTIRTEEA+++ L   +E
Sbjct: 322 KGIGAGEVGELFDFWVNVLPGQGSRTIRTEEAVWLGLMGLRE 363


>gi|302895721|ref|XP_003046741.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256727668|gb|EEU41028.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 340

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 119/307 (38%), Positives = 168/307 (54%), Gaps = 30/307 (9%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+P S++ N  + +      G+IARA+ +F +DEVVVFD+   S    ++   + +
Sbjct: 39  TVSIAIPSSVLTNLATADQRMTQPGRIARALAVFSVDEVVVFDDSPLSTRPRQTDPNSYT 98

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +D       F   +L YLE P ++RK LF +H +LR   +LP LD PHH    EW  +
Sbjct: 99  GDTDP----CHFFTHILSYLEAPPFMRKTLFPLHPNLRLTALLPSLDMPHHPHPKEWVSY 154

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGVT+        GTLV+VGL++ V +++ + P  R+T+    +   +   P  V P +
Sbjct: 155 REGVTIAGQTSTGSGTLVEVGLDQPVEIEEEIPPKTRITLKFPDD---ETQYPEPVHPAA 211

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSE----------HGDIVNSSDL 291
              E G YWGY VR A ++S VF  C Y+ GYD  IGTSE          HG  VN    
Sbjct: 212 PRTEGGYYWGYTVRKANSLSGVFTECPYENGYDISIGTSERGVPASRAFPHGKRVN---- 267

Query: 292 TLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTE 346
               F HLLI FGG  GLE +   D+ L   +      +E+F  + N  P+QGSRTIRT+
Sbjct: 268 ----FNHLLIVFGGPRGLEFASMNDEELSSMDIQGSKTKELFDHWINVLPNQGSRTIRTD 323

Query: 347 EAIFISL 353
           EA+FI+L
Sbjct: 324 EAVFIAL 330


>gi|392864074|gb|EAS35126.2| hypothetical protein CIMG_00460 [Coccidioides immitis RS]
          Length = 382

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 130/342 (38%), Positives = 185/342 (54%), Gaps = 51/342 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSII N +S +  T L G IARA+ +F +DE+V+F+++  S  +   S A   
Sbjct: 40  TLSVALPGSIIANAKSHDQKTYLVGSIARALAVFCVDEIVIFEDEPESQQFGNRSRAENE 99

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +  + S   FL  +L YLETP +LR+ LF MH +LR  G LP LD PHH+R +EW  +
Sbjct: 100 YTAFSDPS--HFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHIRANEWCEY 157

Query: 182 REGVTL------KENAP----------------------------NSVGTLVDVGLNKHV 207
           REGVT+       E+ P                            N+  TLV+ G+ + V
Sbjct: 158 REGVTIPNDNYGAEDGPSERSNKKKKSKKSKRNQQGGQEHENSSSNNTFTLVNTGIPEPV 217

Query: 208 VVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFK 265
            +  + + P  R+T+   T    D   P   V PS P+ E+G YWGY +R   ++SSVF 
Sbjct: 218 RIPNLSILPHTRLTLKFKTQDYSDGADP---VAPSTPREEAGYYWGYSIRRCTSLSSVFT 274

Query: 266 NCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
            C ++GGYD   GTSE G     I+   +  +P ++HLL+ FGG+AGLE + + D  L  
Sbjct: 275 ECPFEGGYDLSFGTSERGLPLAHILQHPE-KIPQYKHLLLVFGGVAGLETAAKADKELAA 333

Query: 322 KN-----AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           K        E+F  + N  P QGSRTIRTEEA+++ L   +E
Sbjct: 334 KGIGAGEVGELFDFWVNVLPGQGSRTIRTEEAVWLGLMGLRE 375


>gi|66363284|ref|XP_628608.1| Mth1. SpoU superfamily - SPOUT methylase [Cryptosporidium parvum
           Iowa II]
 gi|46229613|gb|EAK90431.1| Mth1. SpoU superfamily - SPOUT methylase [Cryptosporidium parvum
           Iowa II]
          Length = 402

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 131/350 (37%), Positives = 191/350 (54%), Gaps = 53/350 (15%)

Query: 53  NDDAPIEVP--------TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD 104
           NDD   ++P        T+SVA+P SI  N QS EL   L GQIAR +++F +DE+++++
Sbjct: 43  NDDILSKIPFAGSPSKATLSVAIPASIAGNAQSFELRAYLVGQIARIISVFGVDEIIIYE 102

Query: 105 NKSS--SDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
           +K    +DN +RSS   +            F V+ L+YLETPQYLRK LF   +  +F G
Sbjct: 103 DKCKDVTDNENRSS---KEWVEFACSKWMEFFVKNLKYLETPQYLRKTLFKFDNDFKFAG 159

Query: 163 MLPPLDAPHHLRKHEWAPFREGV----------TLKENAPNSVGTLVDVGLNKHVVVDQV 212
           +  P+DAPHH+R  EW P+R+GV          T+     N  G+ V+ GL     ++  
Sbjct: 160 LQNPIDAPHHMRISEWLPYRQGVIVPGPKLFKGTISPEHKNK-GSWVNCGLPVEAWIETK 218

Query: 213 LDPGVRVTVAMGT-----NRNLDADSPR-------------QVVPPSKP-KESGMYWGYK 253
           L+   R+T+ M       ++ L +D  +             ++V  S P ++ G+YWGYK
Sbjct: 219 LENNTRITIKMSKESESLHKKLCSDFNKFGRYSEIKSYFTGKLVDDSTPYRKKGIYWGYK 278

Query: 254 VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--------DLTLPTFRHLLIAFGG 305
           VR A ++ SVF +  Y+ GYD  IGTSE G+ V+++        D    +F+H+LI FGG
Sbjct: 279 VRPANSLKSVFNDSEYEEGYDLKIGTSERGEPVDNNFKLSTKGKDSKSQSFKHILIVFGG 338

Query: 306 LAGLEESIEEDDGL--KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           L GLE+ + +      K KN   +F  Y N CP Q SRTIRTEEAI ++L
Sbjct: 339 LGGLEDVLSDPQCSLDKVKNPSSLFDMYINICPEQRSRTIRTEEAIGLTL 388


>gi|67601352|ref|XP_666391.1| DKFZp566D143.1 [Cryptosporidium hominis TU502]
 gi|54657381|gb|EAL36163.1| DKFZp566D143.1 [Cryptosporidium hominis]
          Length = 402

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 132/349 (37%), Positives = 193/349 (55%), Gaps = 51/349 (14%)

Query: 53  NDDAPIEVP--------TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD 104
           NDD   ++P        T+SVA+P SI  N QS EL   L GQIAR +++F +DE+++++
Sbjct: 43  NDDILSKIPFAGSPSKSTLSVAIPASITGNAQSFELRAYLVGQIARIISVFGVDEIIIYE 102

Query: 105 NKSS--SDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
           +K    +DN +RSS         +      F V+ L+YLETPQYLRK LF   +  +F G
Sbjct: 103 DKCKDVTDNENRSSKEWVEFACSK---WMEFFVKNLKYLETPQYLRKTLFKFDNDFKFAG 159

Query: 163 MLPPLDAPHHLRKHEWAPFREGVT------LKENA-PN--SVGTLVDVGLNKHVVVDQVL 213
           +  P+DAPHH+R  EW P+R+GV        K N  P+  + G+ V+ GL     ++  L
Sbjct: 160 LQNPIDAPHHMRISEWLPYRQGVIVPGPKFFKGNILPDHKNKGSWVNCGLPVEAWIETKL 219

Query: 214 DPGVRVTVAMGT-----NRNLDADSPR-------------QVVPPSKP-KESGMYWGYKV 254
           +   R+T+ M       ++ L +D  +             ++V  S P ++ G+YWGYKV
Sbjct: 220 ENNTRITIKMSKESESLHKKLCSDFNKFGRYSEIKSYFTGKLVDDSTPYRKKGIYWGYKV 279

Query: 255 RYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS--------DLTLPTFRHLLIAFGGL 306
           R A ++ SVF +  Y+ GYD  IGTSE G+ V+++        D    +F+H+LI FGGL
Sbjct: 280 RPANSLKSVFNDSEYEEGYDLKIGTSERGEPVDNNFKLSTKGKDSKSQSFKHILIVFGGL 339

Query: 307 AGLEESIEEDDGL--KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            GLE+ + +      K KN   +F  Y N CP Q SRTIRTEEAI ++L
Sbjct: 340 GGLEDVLSDPQCSLDKVKNPSSLFDMYINICPEQRSRTIRTEEAIGLTL 388


>gi|310797700|gb|EFQ32593.1| hypothetical protein GLRG_07607 [Glomerella graminicola M1.001]
          Length = 335

 Score =  214 bits (544), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 128/304 (42%), Positives = 180/304 (59%), Gaps = 27/304 (8%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+P SI+ +  + E  T +AG+IARA+ +F IDE+V+FD     D+   S   N  
Sbjct: 37  TVSVAIPSSILTDAITAERRTAVAGRIARALAVFSIDEIVLFD-----DSPVESRPKNVD 91

Query: 122 NRSDENE-SGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
            +S   +  G  FL  LL YLE P ++RK LF +H +LR    LP L+ PHH    +W P
Sbjct: 92  TKSYTGDIDGCHFLEHLLGYLEVPPFMRKMLFPLHPNLRLANQLPALEMPHHPNPTDWLP 151

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
           +REGV  K   P   GTL++VGL   V+++  + P  RVT++   + N D     +   P
Sbjct: 152 YREGVAGK-GTPQ--GTLIEVGLKDPVLIEDEVPPNTRVTLSY-PDYNTDK---AEACDP 204

Query: 241 SKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT---- 295
           S P+ E G YWGY+VR A ++S+VF  C Y+GGYD  IGTSE G  ++ +    P+    
Sbjct: 205 SAPREEGGYYWGYQVRRADSLSAVFTECKYEGGYDVSIGTSERGRTLSEA---FPSAAPM 261

Query: 296 -FRHLLIAFGGLAGLEESIEED---DGLKRKNA--REVFHSYFNTCPHQGSRTIRTEEAI 349
            F+HLLI FGG  G+E +   D   +G++   A  RE+F  + N  P QGSRTIRT+EAI
Sbjct: 262 DFKHLLIVFGGPRGIEFAALNDKELEGMEMTPAKTRELFDHWVNVLPGQGSRTIRTDEAI 321

Query: 350 FISL 353
           +++L
Sbjct: 322 WVAL 325


>gi|350631644|gb|EHA20015.1| hypothetical protein ASPNIDRAFT_39432 [Aspergillus niger ATCC 1015]
          Length = 867

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 138/347 (39%), Positives = 184/347 (53%), Gaps = 55/347 (15%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS---------DNY 112
           T+SVA+PGSII N  S+E  T LAG IARA+ +F +DEVVVFD+             D+Y
Sbjct: 506 TLSVAIPGSIITNAHSVEQKTFLAGVIARALAVFCVDEVVVFDDNEYGAHCDYYDHRDSY 565

Query: 113 SRSSAANRSNRSDENESGAA---------FLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
           +   A NR     +  S  A         FL  +L YLETP YLRK LF +H +LR  G+
Sbjct: 566 NSPIAKNRDQVDGDTSSPKAYTANSDPSHFLTHVLSYLETPPYLRKHLFRIHPNLRTAGL 625

Query: 164 LPPLDAPHHLRKHEWAPFREGVTL----KENAPNSVGTLV--DVGLNKHVVVDQVLDP-G 216
           LP LD PHHLR +EW  FREG+ +    ++N P S  T +  D+     +V+  +  P  
Sbjct: 626 LPSLDMPHHLRANEWCDFREGIVVSSSARKNNPASKITQMSKDILSTVKIVLPNIHLPEH 685

Query: 217 VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDH 275
            RVTV    +    A+     V PS P+ E+G YWGY VR   + SSVF  C Y GGYD 
Sbjct: 686 ARVTVRFPQHEFPYAEP----VHPSTPRAEAGYYWGYYVRRCRSHSSVFTECPYDGGYDL 741

Query: 276 LIGTSEHGDIVNS--------------------SDLTLPTFRHLLIAFGGLAGLEESIEE 315
             GTSE G  V++                     D   P F+H+L+ FGG+AG+E ++  
Sbjct: 742 SFGTSERGVPVSAVLEEKPQQAHDRYKQGHSFCRDQISPDFQHILVVFGGVAGIEAAVRN 801

Query: 316 DDGLKR-----KNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           D  L+       +A ++F  + N  P QGSRTIRTEEA+++ L   Q
Sbjct: 802 DPQLRDMAIRPSDAGKLFDYWVNFLPGQGSRTIRTEEAVWMGLTSLQ 848


>gi|341038421|gb|EGS23413.1| hypothetical protein CTHT_0001020 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 333

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 116/302 (38%), Positives = 173/302 (57%), Gaps = 23/302 (7%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+AVP SI+ +  + E  T   G+IARA  IF IDE+VVFD+       SR  + +  
Sbjct: 35  TVSIAVPTSILTDRVTREQKTTTVGRIARAAAIFSIDEIVVFDDSPPD---SRLPSVDPD 91

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             + + + G  F+  LL YLETP ++RK LF +H +L+  G+LP +D PHH  K EW P+
Sbjct: 92  AYTGDTDPGH-FMEHLLNYLETPPFMRKVLFPLHPNLKSQGLLPNMDMPHHPHKDEWMPY 150

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG+T++       GT VD+GL   V + + + P  RVT+ M       + +P  + P  
Sbjct: 151 REGMTIEGKG---KGTTVDIGLPTPVTISESIPPRTRVTLKMPVT---PSGTPEPIHPAQ 204

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-----F 296
              E+G +WG+ VR AP++S++F    +  GYD  IGTSE G  +N++    P+     F
Sbjct: 205 PRTEAGYFWGFTVRRAPSLSAIFTTSPFPDGYDMSIGTSERGKPLNAA---FPSYEQADF 261

Query: 297 RHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
           +HL++ FGG  GLE +   D      G++    RE+F  + N  P QG+R I+TEEA+ +
Sbjct: 262 KHLIVVFGGPRGLEYAAMNDPELVQMGIQGGRTRELFDHWVNVLPGQGTRGIKTEEAVLV 321

Query: 352 SL 353
           +L
Sbjct: 322 AL 323


>gi|429853814|gb|ELA28862.1| deoxyribose-phosphate aldolase [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 334

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 123/298 (41%), Positives = 170/298 (57%), Gaps = 16/298 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+P SII +  + E  T +AG+IARA+ +F IDEVV+FD+       +R    +R 
Sbjct: 37  TVSVAIPTSIIADAVNAERRTAVAGRIARALGVFGIDEVVIFDDSPVE---TRPKTVDRK 93

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             + + + G  FL  LL YLE P ++RK LF +H +LR  G LP LD PHH    +W P+
Sbjct: 94  AYTGDIDPGH-FLEHLLGYLEVPPFMRKMLFPLHPNLRLAGQLPSLDMPHHPNPADWMPY 152

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGV  K+  P   GT +D+GL   V+++  + P  RVTV      + D +      P +
Sbjct: 153 REGVA-KKGTPK--GTELDLGLKTPVIIEDEVPPNTRVTVQFS---DYDTEKAEACDPAA 206

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-FRHLL 300
             +E G YWGY VR A ++S+++  C Y+ GYD  IGTSE G  V  S  + P  F+HLL
Sbjct: 207 PREEGGYYWGYSVRRAASLSALWTECKYEDGYDISIGTSERGRSVGQSFPSTPLDFKHLL 266

Query: 301 IAFGGLAGLEESIEEDDGLKRKNA-----REVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           I FGG  G+E +   D  L   +      +E+F  + N  P Q SR IRT+EA+FI L
Sbjct: 267 IVFGGPRGIEYAAVNDKELDMVDVSGGKTKELFDHWINVLPGQNSRQIRTDEALFIGL 324


>gi|303313045|ref|XP_003066534.1| hypothetical protein CPC735_057590 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240106196|gb|EER24389.1| hypothetical protein CPC735_057590 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 370

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 128/341 (37%), Positives = 182/341 (53%), Gaps = 49/341 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSII N +S +  T L G IARA+ +F +DE+++F+++  S      S A   
Sbjct: 28  TLSVALPGSIIANAKSHDQKTYLVGSIARALAVFCVDEIIIFEDEPESQQSGHRSRAENE 87

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +  + S   FL  +L YLETP +LR+ LF MH +LR  G LP LD PHH+R +EW  +
Sbjct: 88  YTAFSDPS--HFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHIRANEWCEY 145

Query: 182 REGVTL------KENAP----------------------------NSVGTLVDVGLNKHV 207
           REGVT+       E+ P                            N+  TLV+ G+ + V
Sbjct: 146 REGVTIPSDNYGAEDGPSERSNKKKKSKKSKRNQQGGEEHENCSSNNTFTLVNTGIPEPV 205

Query: 208 VVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFK 265
            +  + + P  R+T+   T    D   P   V PS P+ E+G YWGY +R   ++SSVF 
Sbjct: 206 RIPNLSIQPHTRLTLKFKTQDYSDGADP---VAPSTPREEAGYYWGYSIRRCTSLSSVFT 262

Query: 266 NCSYKGGYDHLIGTSEHG---DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK 322
            C ++GGYD   GTSE G     V      +P ++HLL+ FGG+AGLE + + D  L  K
Sbjct: 263 ECPFEGGYDLSFGTSERGLPLAHVLQHPEQIPQYKHLLLVFGGVAGLETAAKADKELAAK 322

Query: 323 N-----AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
                   E+F  + +  P QGSRTIRTEEA+++ L   +E
Sbjct: 323 GIGAGEVGELFDFWVDVLPGQGSRTIRTEEAVWLGLMGLRE 363


>gi|320036589|gb|EFW18528.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 382

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 128/341 (37%), Positives = 182/341 (53%), Gaps = 49/341 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSII N +S +  T L G IARA+ +F +DE+++F+++  S      S A   
Sbjct: 40  TLSVALPGSIIANAKSHDQKTYLVGSIARALAVFCVDEIIIFEDEPESQQSGHRSRAENE 99

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +  + S   FL  +L YLETP +LR+ LF MH +LR  G LP LD PHH+R +EW  +
Sbjct: 100 YTAFSDPS--HFLAHVLSYLETPPHLRRDLFPMHPNLRTAGTLPSLDMPHHIRANEWCEY 157

Query: 182 REGVTL------KENAP----------------------------NSVGTLVDVGLNKHV 207
           REGVT+       E+ P                            N+  TLV+ G+ + V
Sbjct: 158 REGVTIPSDNYGAEDGPSERSNKKKKSKKSKRNQQGGEEHENCSSNNTFTLVNTGIPEPV 217

Query: 208 VVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFK 265
            +  + + P  R+T+   T    D   P   V PS P+ E+G YWGY +R   ++SSVF 
Sbjct: 218 RIPNLSIQPHTRLTLKFKTQDYSDGADP---VAPSTPREEAGYYWGYSIRRCTSLSSVFT 274

Query: 266 NCSYKGGYDHLIGTSEHG---DIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK 322
            C ++GGYD   GTSE G     V      +P ++HLL+ FGG+AGLE + + D  L  K
Sbjct: 275 ECPFEGGYDLSFGTSERGLPLAHVLQHPEKIPQYKHLLLVFGGVAGLETAAKADKELAAK 334

Query: 323 N-----AREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
                   E+F  + +  P QGSRTIRTEEA+++ L   +E
Sbjct: 335 GIGAGEVGELFDFWVDVLPGQGSRTIRTEEAVWLGLMGLRE 375


>gi|358381224|gb|EHK18900.1| hypothetical protein TRIVIDRAFT_43974 [Trichoderma virens Gv29-8]
          Length = 345

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 134/322 (41%), Positives = 181/322 (56%), Gaps = 35/322 (10%)

Query: 62  TVSVAVPGSIIDNTQSLELAT---RLA--GQIARAVTIFRIDEVVVFDNKSSSDNYSRSS 116
           TVSVA P S I    +  LAT   R+A   +IARA+ IF IDEV+VFD+   S   SR  
Sbjct: 39  TVSVAFPSSTIATYNAPSLATAEQRIAVPSRIARALAIFSIDEVIVFDD---SPVKSRPH 95

Query: 117 AANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
             ++S  + + +    FL  +L YLE P ++RKALF +H +LR    LP LD PHH    
Sbjct: 96  NTDKSAYTGDTDP-CHFLTHVLSYLEAPPFMRKALFPLHPNLRLTSSLPGLDTPHHPHIK 154

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
           E   +REGVT+     +S GTLVDVGL+K V + + + P  RVT+ +      D   P +
Sbjct: 155 ESMIYREGVTIAGMTKSSDGTLVDVGLDKPVEIKEDIPPKTRVTLKISE----DESEPAE 210

Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP- 294
            V P  P+ E+G YWGY VR A ++S+VF    Y+GGYD  IGTSE G  V       P 
Sbjct: 211 CVHPHAPRTEAGFYWGYTVRQANSLSAVFTESPYEGGYDVSIGTSERGSPVGKE---FPY 267

Query: 295 ----TFRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRT 345
                F H+LI FGG  G+E +   D+     G++    RE+F  + +  P+QGSRTIRT
Sbjct: 268 GKRVQFNHVLIVFGGPRGIEYAAMNDEELGGMGIQGGKTRELFDHWVDVMPNQGSRTIRT 327

Query: 346 EEAIFISLQYFQEPISRALRRV 367
           EEA++I+L         AL+R+
Sbjct: 328 EEAMYIALT--------ALKRI 341


>gi|195332993|ref|XP_002033176.1| GM21175 [Drosophila sechellia]
 gi|194125146|gb|EDW47189.1| GM21175 [Drosophila sechellia]
          Length = 401

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 120/288 (41%), Positives = 168/288 (58%), Gaps = 5/288 (1%)

Query: 85  AGQIARAVTIFRIDEVVVFDNK--SSSDNYSRSSAANRSNRSDEN-ESGAAFLVRLLQYL 141
            GQIARA  IFR+++V+VFD+   +++    RS  A+    S     S +  L R+LQYL
Sbjct: 12  TGQIARAACIFRVNDVIVFDDVGIATARETKRSYEADAEGSSTGTVRSSSLQLARILQYL 71

Query: 142 ETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDV 201
           E PQYLRK  F +H  L++ G+L PLD PHHLR+     +REGV   + A     +  +V
Sbjct: 72  ECPQYLRKYFFPLHKDLKYSGLLNPLDTPHHLRQQSKFRYREGVICDKKAKEG-HSYANV 130

Query: 202 GLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNI 260
           GL   V+VD+ ++PGVRVTV M             +V P +P+ E+G+YWGY+VR A ++
Sbjct: 131 GLLNDVLVDKAIEPGVRVTVKMEPQSESCRKQRGTLVSPDEPRRETGVYWGYQVRIAHSM 190

Query: 261 SSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLK 320
           S +F    Y  GYD  +GTS+ G  V+       +F+H+LI FGGL GLE ++  D+ L 
Sbjct: 191 SEIFTKSPYANGYDVTVGTSDRGTNVHEVPNRSYSFKHMLIVFGGLQGLESALANDEKLT 250

Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVE 368
             +   +F  Y N  P QGSRTIRTEEA+ I+L   QE +   +  VE
Sbjct: 251 VDDPELLFDHYLNVLPRQGSRTIRTEEALLIALAALQEKLQPQVADVE 298


>gi|340515208|gb|EGR45464.1| predicted protein [Trichoderma reesei QM6a]
          Length = 346

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 129/324 (39%), Positives = 176/324 (54%), Gaps = 37/324 (11%)

Query: 62  TVSVAVPGSIIDNTQS--LELAT---RLA--GQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
           TVSVA P S I  +Q     LAT   R+A   +IARA+ IF +DEV+V+D+        +
Sbjct: 38  TVSVAFPSSTIATSQGQKYSLATAEQRIAVPSRIARALAIFSVDEVIVYDDSPVKTRPHK 97

Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR 174
           +     +  +D       FL  +L YLE P ++RKALF +H +LR    LP LD PHH  
Sbjct: 98  TDKTAYTGDTDP----CHFLTHVLSYLEAPPFMRKALFPLHPNLRMTSSLPGLDTPHHPH 153

Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSP 234
             EW  +REGVT+     +  GTLVDVGL K V + + + P  RVT+ +          P
Sbjct: 154 IKEWMQYREGVTIAGMTRSGEGTLVDVGLEKPVEIKEDIPPKTRVTLKVAEEEG----EP 209

Query: 235 RQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
            + V P+ P+ E+G+YWGY VR A ++S+VF    Y+ GYD  IGTSE G  V+      
Sbjct: 210 AECVHPAAPRTEAGIYWGYTVRQANSLSAVFTESPYENGYDLSIGTSERGSPVSKE---F 266

Query: 294 PT-----FRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTI 343
           P      F H LI FGG  G+E +   D+     G++    RE+F  + +  P+QGSRTI
Sbjct: 267 PASKRVQFNHALIVFGGPRGIEYAAMNDEELGGMGIQGGKTRELFDHWVDVMPNQGSRTI 326

Query: 344 RTEEAIFISLQYFQEPISRALRRV 367
           RTEEA+FI L         AL+R+
Sbjct: 327 RTEEAVFIGLM--------ALKRI 342


>gi|116192931|ref|XP_001222278.1| hypothetical protein CHGG_06183 [Chaetomium globosum CBS 148.51]
 gi|88182096|gb|EAQ89564.1| hypothetical protein CHGG_06183 [Chaetomium globosum CBS 148.51]
          Length = 348

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 126/349 (36%), Positives = 188/349 (53%), Gaps = 33/349 (9%)

Query: 23  HESQNELSLA--------NGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDN 74
           +E Q++L L         +G + +  N K   R        AP E  TVS+AVP S+I +
Sbjct: 5   YEHQSKLMLGQRRRLDTEDGGAGNGSNFKAYPR----FTPTAPREW-TVSIAVPTSLITD 59

Query: 75  TQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFL 134
             + E  T   G++ARA+ IF +DEVV++D+  +    + +     +   D     A F+
Sbjct: 60  CVTREQRTTNVGRVARALAIFSVDEVVIYDDSPTDKRITNADPDAYTGDVDP----AHFM 115

Query: 135 VRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNS 194
             +L YLETP ++RK LF +H +LR  G+LP LD PHH  K EW P+REG+TL+  A   
Sbjct: 116 EHILTYLETPPFMRKVLFPLHPNLRSQGLLPSLDMPHHPHKDEWLPYREGMTLELPAKGG 175

Query: 195 VGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKV 254
            GT+VD+G+   V + + + P  RVT+ M  +    + +P  V P +   E G +WG+ V
Sbjct: 176 KGTVVDIGMPNTVTITESIPPKTRVTLKMPDD---PSGTPEPVHPAAPRTEDGYFWGFSV 232

Query: 255 RYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP-----TFRHLLIAFGGLAGL 309
           R A ++S+V     Y  GYD  IGTSE G  ++S+    P      F HLL+ FGG  GL
Sbjct: 233 RKASSLSNVLTESPYADGYDMSIGTSERGMPLSSA---FPGHEQANFNHLLVIFGGPRGL 289

Query: 310 EESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           E +   D      G+     +E+F  + N  P+QG+R IRT+EA+ I++
Sbjct: 290 EYAAMNDPQLGEMGISGARTKELFDHWVNVLPNQGTRGIRTDEALLIAM 338


>gi|389646437|ref|XP_003720850.1| deoxyribose-phosphate aldolase 2 [Magnaporthe oryzae 70-15]
 gi|86196595|gb|EAQ71233.1| hypothetical protein MGCH7_ch7g640 [Magnaporthe oryzae 70-15]
 gi|351638242|gb|EHA46107.1| deoxyribose-phosphate aldolase 2 [Magnaporthe oryzae 70-15]
 gi|440474736|gb|ELQ43461.1| deoxyribose-phosphate aldolase 2 [Magnaporthe oryzae Y34]
 gi|440484507|gb|ELQ64569.1| deoxyribose-phosphate aldolase 2 [Magnaporthe oryzae P131]
          Length = 347

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 128/360 (35%), Positives = 187/360 (51%), Gaps = 26/360 (7%)

Query: 21  ENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLEL 80
           E H S+       GD +   +     R   +     P +  TVSVA+P SII +    E 
Sbjct: 5   EQHPSKKRKLGRYGDGADLSDTITTFRPSARFAPTVPRDW-TVSVAIPSSIITDCARDEQ 63

Query: 81  ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQY 140
                 +IARA+ +F IDEVVVFD+   S + SR  A +    + + +  A ++  LL Y
Sbjct: 64  RINAPSRIARALAVFSIDEVVVFDD---SPHDSRPPAVDTDAYTGDVDP-AHYIWHLLNY 119

Query: 141 LETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVD 200
           +E P ++RK LF +H +LR  G+ P LD PHH  + EW P+ E +TL        GTLVD
Sbjct: 120 MECPPFMRKTLFPLHPNLRLQGLFPGLDMPHHPHRDEWLPYMEAMTLPGKPEGGKGTLVD 179

Query: 201 VGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPN 259
           +G+ K + + + + P  R+T+        +A++    V P+ P+ E G YWG+ VR   +
Sbjct: 180 IGMRKPITIAESIPPKTRLTLRFPGESYDNAEA----VDPAAPRTEGGYYWGFGVRRCAS 235

Query: 260 ISSVFKNCSYKGGYDHLIGTSEHGDIVNSS---DLTLPTFRHLLIAFGGLAGLEESIEED 316
           +S V     ++GGYD  IGTSE GD+  ++       P F+HLLI  GG  GLE +   D
Sbjct: 236 LSEVLTESPFEGGYDLSIGTSERGDVSANAFPHGKPPPQFKHLLIVLGGPRGLEFAAMND 295

Query: 317 D-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVEYKS 371
                 G+     REVF  + N  P QGSR I+TEE++FI+L          LRR+ Y++
Sbjct: 296 PQLSEIGIGTGRVREVFDHWINVLPDQGSRNIKTEESLFIALT--------DLRRLWYRT 347


>gi|346977837|gb|EGY21289.1| deoxyribose-phosphate aldolase [Verticillium dahliae VdLs.17]
          Length = 355

 Score =  207 bits (527), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 128/326 (39%), Positives = 181/326 (55%), Gaps = 46/326 (14%)

Query: 76  QSLELATRLAGQIARAVTIFRIDEVVVFDN-----------------KSSSDNYSRSSAA 118
           ++L+      G+IARA+ IF +DEVVV+++                 K   DN S     
Sbjct: 38  KTLDQRMTAPGRIARALAIFNVDEVVVYNDLPPLTNEGYQEQPLPHKKDRRDNRSHGGHN 97

Query: 119 NRSNRSDENESGAA-------FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
           +RS+  +++ES          FL  +L YLE P ++RKALF +H +LR  G+LP LD PH
Sbjct: 98  SRSSHRNDSESSITADVDPCHFLTHVLSYLEAPPFMRKALFPIHPNLRLAGLLPGLDMPH 157

Query: 172 HLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
           H    + A P+REGVTL+  APN+ GTLVD+G    V +   + P  RVTV +    N  
Sbjct: 158 HPNPIDIALPYREGVTLEAMAPNAKGTLVDIGNRAPVHLKDEVPPKTRVTVHLPEGDN-- 215

Query: 231 ADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
              P + V P+ P+E  G YWGY VR A ++S+VF+   ++GGYD  IGTSE GD VN +
Sbjct: 216 --GPAEAVHPAAPREEGGYYWGYTVRRAASLSAVFEESPHEGGYDLSIGTSERGDSVNRA 273

Query: 290 --DL-TLPTFRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSR 341
             DL     F+HLLI FGG  G+E +   D      G+  +  R++F  + N  P QGSR
Sbjct: 274 FPDLKKTAKFKHLLIVFGGPKGIEHAAYNDSQLYEMGIGGQGTRQLFDHWVNVLPGQGSR 333

Query: 342 TIRTEEAIFISLQYFQEPISRALRRV 367
           T+RT+EA+++++          LRRV
Sbjct: 334 TMRTDEAVYVAMA--------GLRRV 351


>gi|225685225|gb|EEH23509.1| deoxyribose-phosphate aldolase [Paracoccidioides brasiliensis Pb03]
          Length = 684

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 136/360 (37%), Positives = 183/360 (50%), Gaps = 75/360 (20%)

Query: 62  TVSVAVPGSII-------------------------DNTQSLELATRLAGQIARAVTIFR 96
           T+SVA+PGSII                         ++ QS +  T LAG IARA+ +F 
Sbjct: 320 TLSVALPGSIIAKCALNAGCPSKRIKKRNLPDTSSRNSAQSHDQKTFLAGSIARALAVFC 379

Query: 97  IDEVVVFDNKSSSDNYSRSSAANRSNRSDENES---GAAFLVRLLQYLETPQYLRKALFS 153
           +DEVV+F++    D   R S A+     DE  +    + FL  +L YLETP YLRK LF 
Sbjct: 380 VDEVVIFED----DVRQRQSNAHYDLSKDEYTAYSDPSHFLAHVLSYLETPPYLRKFLFP 435

Query: 154 MHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG----------------- 196
           MH +LR  G LP LD PHH+R +E   +REGVT+  N  +  G                 
Sbjct: 436 MHKNLRTAGALPSLDMPHHIRANESCEYREGVTVPANEEHGFGQSNAENHHNQHKRKKGK 495

Query: 197 --------------TLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPS 241
                         TLVD GL   V +  +  P   RVTV + ++   D     +VV PS
Sbjct: 496 KNTEEGVQDTVMSYTLVDTGLPNKVCIPHIPVPENTRVTVKLCSSE--DPSQGAEVVSPS 553

Query: 242 KPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LPTFRH 298
            P+E  G YWGY VR   ++S VF  C + GGYD   GTSE G  + S  +T  +P F H
Sbjct: 554 TPREEMGYYWGYSVRCCDSLSDVFTECPFDGGYDLSFGTSERGSPL-SQVITEGVPNFEH 612

Query: 299 LLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           L++ FGG+AGLE +++ D      GL+ +    +F  + N  P QGSRTIRTEEA+++ L
Sbjct: 613 LIVVFGGVAGLEVAVKADKELLARGLRPETVGNMFDYWVNVLPGQGSRTIRTEEAVWLGL 672


>gi|302420647|ref|XP_003008154.1| deoxyribose-phosphate aldolase [Verticillium albo-atrum VaMs.102]
 gi|261353805|gb|EEY16233.1| deoxyribose-phosphate aldolase [Verticillium albo-atrum VaMs.102]
          Length = 355

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/326 (39%), Positives = 181/326 (55%), Gaps = 46/326 (14%)

Query: 76  QSLELATRLAGQIARAVTIFRIDEVVVFDN-----------------KSSSDNYSRSSAA 118
           ++L+      G+IARA+ IF +DEVVV+++                 K   DN S     
Sbjct: 38  KTLDQRMTAPGRIARALAIFNVDEVVVYNDLPPLTNEGYQEQPLPHKKDRRDNRSHGGHN 97

Query: 119 NRSNRSDENESGAA-------FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
           +RS+  +++ES          FL  +L YLE P ++RKALF +H +LR  G+LP LD PH
Sbjct: 98  SRSSHRNDSESSITADVDPCHFLTHVLSYLEAPPFMRKALFPIHPNLRLAGLLPGLDMPH 157

Query: 172 HLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
           H    + A P+REGVTL+  APN+ GTLVD+G    V +   + P  RVTV +    N  
Sbjct: 158 HPNPIDIALPYREGVTLEAMAPNAKGTLVDIGNRAPVHLKDEVPPKTRVTVHLPEGDN-- 215

Query: 231 ADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
              P + V P+ P+E  G YWGY VR A ++S+VF+   ++GGYD  IGTSE GD VN +
Sbjct: 216 --GPAEAVHPAAPREEGGYYWGYTVRRAASLSAVFEESPHEGGYDLSIGTSERGDSVNRA 273

Query: 290 --DL-TLPTFRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSR 341
             DL     F+HLLI FGG  G+E +   D      G+  +  R++F  + N  P QGSR
Sbjct: 274 FPDLKKTAKFKHLLIVFGGPKGIEHAAYNDAQLYEMGIGGQGTRQLFDHWVNVLPGQGSR 333

Query: 342 TIRTEEAIFISLQYFQEPISRALRRV 367
           T+RT+EA+++++          LRRV
Sbjct: 334 TMRTDEAVYVAMA--------GLRRV 351


>gi|367043134|ref|XP_003651947.1| hypothetical protein THITE_2112761 [Thielavia terrestris NRRL 8126]
 gi|346999209|gb|AEO65611.1| hypothetical protein THITE_2112761 [Thielavia terrestris NRRL 8126]
          Length = 320

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 119/302 (39%), Positives = 170/302 (56%), Gaps = 20/302 (6%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVAVP SII +  + E  T   G++ARA+ IF +DEVVVFD+       S +     +
Sbjct: 19  TVSVAVPTSIITDCVTREQRTTNVGRVARALAIFSVDEVVVFDDSPVDQRISNADPDAYT 78

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
              D     A F+  LL YLETP ++RK LF +H +LR  G+LP LD PHH  K EW P+
Sbjct: 79  GDVDP----AHFMEHLLTYLETPPFMRKVLFPLHPNLRSQGLLPSLDMPHHPHKDEWLPY 134

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG+TL+       GT VD+G+   V + + + P  RVT+ M  ++     +P  V P +
Sbjct: 135 REGMTLETPPKGGKGTEVDIGMPSTVRIAESIPPRTRVTLKMPDDQ---YGTPEPVHPAA 191

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-----F 296
              E+G +WG+ VR A ++S+V     ++ GYD  IGTSE G  ++ +    P+     F
Sbjct: 192 PRTEAGYFWGFSVRRASSLSNVLTESPFEDGYDMSIGTSERGVPLSKA---FPSYEPANF 248

Query: 297 RHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
            HLLI FGG  GLE +   D      G+     +E+F  + N  P+QG+R I+T+EA+ I
Sbjct: 249 NHLLIVFGGPRGLEFAAMNDPQLGEMGITGPRTKELFDHWVNVLPNQGTRGIKTDEALLI 308

Query: 352 SL 353
           +L
Sbjct: 309 AL 310


>gi|391340918|ref|XP_003744780.1| PREDICTED: uncharacterized protein C9orf114-like [Metaseiulus
           occidentalis]
          Length = 313

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 122/301 (40%), Positives = 177/301 (58%), Gaps = 30/301 (9%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AV  S+++N +      R+  QIARA  IFR++EV+V D+   +  +  S  +N  
Sbjct: 34  TISIAVCASMLENLKGEGSKHRVIAQIARAAAIFRVNEVIVVDDAYKNPGFDSSKDSN-- 91

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                          +L+YL+ PQYLRK+LF M   L+ +G+ PPLDA HH+R  +  P+
Sbjct: 92  -----------LFQLILEYLDCPQYLRKSLFPMCLELKHIGVAPPLDASHHVRMEDEVPY 140

Query: 182 REGVTLKENAPNSVGTL-VDVGLNKHVVVDQ--VLDPGVRVTVAMGTNRNLDADSPRQVV 238
           REG+  +     S+G++ +DVGL++ + ++    +  G+RVTV M     +  D+PR   
Sbjct: 141 REGIVTR----VSLGSVWLDVGLHEDIKIESRAEVKQGLRVTVNMTEKSIVGPDTPR--- 193

Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
                K+ G+YWGY+ R A +IS V   C ++ GYD  IGTSE G+ +   +  L  FRH
Sbjct: 194 -----KKDGLYWGYRTRVAKSISEVIALCPFEKGYDMTIGTSERGESILELE-ALGKFRH 247

Query: 299 LLIAFGGLAGLEESIEEDDGLKR-KNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           LLI FGG  GLE ++E+D  L + K  + VF  Y NTCP QGSRTIRTEEA+ ISL   Q
Sbjct: 248 LLIVFGGPKGLEHALEQDKALTQVKEPKNVFDKYLNTCPLQGSRTIRTEEAVLISLSALQ 307

Query: 358 E 358
           +
Sbjct: 308 K 308


>gi|403177364|ref|XP_003888780.1| hypothetical protein PGTG_22495 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375172839|gb|EHS64743.1| hypothetical protein PGTG_22495 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 441

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 144/359 (40%), Positives = 192/359 (53%), Gaps = 64/359 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK-----------SSSD 110
           TVS+A+PGSI++N Q+ EL T L GQIARA  IF ++E+++FD               S 
Sbjct: 98  TVSIALPGSIVNNAQTWELKTALVGQIARACAIFSVNEIIIFDESVGEADPSAKPHQPSS 157

Query: 111 NYSRSSAANRSNRSDENESG--------AAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
            Y R  A  R+   D   +G        + F+ R+L+YLE PQYLRK+LF +H  LR  G
Sbjct: 158 TYGR--AKYRTPAEDLELAGPEDGPFQPSHFVARILEYLECPQYLRKSLFPIHPDLRLAG 215

Query: 163 MLPPLDAPHHLRKHEWAPFREGVTL-KENAPNSVG-----------------TLVDVGLN 204
           +LPPLD PHH RK    P+REG  L  +   N V                  +  DVGL 
Sbjct: 216 LLPPLDLPHHFRKDHETPWREGCILPADKVENYVAGGHYQKKKNRKGDDRKTSWADVGLA 275

Query: 205 KHVVVDQVLDPGV------RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAP 258
           + V V   LD GV      RVTV M       A   R V P    + +G+YWGY +R A 
Sbjct: 276 EPVQVQ--LDEGVSLPDWTRVTVQMPQGEGKLA---RLVSPRLPTQTTGIYWGYSIRLAS 330

Query: 259 NISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED 316
           +IS VF    Y  +GGYD  IGTSE G+ V+     +  F+H+L+  GGL+GLE +I  D
Sbjct: 331 SISKVFTETPYASEGGYDLTIGTSERGENVDDVVDGIGDFKHMLLVLGGLSGLELTIASD 390

Query: 317 DGLKRK----NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRRVEYKS 371
           D L+ +    +A  +F  + NT P QGSRT+RTEEA+ +SL         ALRR+ ++S
Sbjct: 391 DTLEPRLTAADAPLLFDHWLNTLPFQGSRTVRTEEALLVSLG--------ALRRLLFRS 441


>gi|443711334|gb|ELU05162.1| hypothetical protein CAPTEDRAFT_182660 [Capitella teleta]
          Length = 234

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 104/231 (45%), Positives = 136/231 (58%), Gaps = 3/231 (1%)

Query: 134 LVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPN 193
           + R+LQYLE PQYLRK+ F  H  LR+ G+L PLD+PHH++  E   +REGV +      
Sbjct: 1   MARILQYLECPQYLRKSFFPQHDDLRYAGVLNPLDSPHHMKADETPKYREGVIVNAQIKE 60

Query: 194 SVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGY 252
              +    G+ K+V +D+VL PG+RVT+ M             VV P  P+   G+YWGY
Sbjct: 61  GRNSYALAGILKNVEIDRVLMPGMRVTIQMDYASESKKKVKGYVVSPDTPRTRDGLYWGY 120

Query: 253 KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEES 312
           + R A ++S VF +  +  GYD  IGTSE G  V+  D  LP F HLL+ FGGL GLE +
Sbjct: 121 QTRLAASLSEVFTDSPFSSGYDLTIGTSERGSSVD--DAELPHFNHLLVVFGGLRGLESA 178

Query: 313 IEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
           +E D  L   +   +FH Y NTCP QGS TIRTEEAI IS+   +  I  A
Sbjct: 179 LEADPKLNMDDPSLLFHQYLNTCPDQGSGTIRTEEAILISMAALRTKIKAA 229


>gi|384493430|gb|EIE83921.1| hypothetical protein RO3G_08626 [Rhizopus delemar RA 99-880]
          Length = 595

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 134/340 (39%), Positives = 193/340 (56%), Gaps = 37/340 (10%)

Query: 24  ESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVP---TVSVAVPGSIIDNTQSLEL 80
           ES+   +  N + S  D  K++  K+  +      E P   TV++A+P S+I N  +LEL
Sbjct: 3   ESKKRRAFENKELSGPDYGKERVVKKQDVKKPLVTEKPRKHTVTIAIPSSVISNAPTLEL 62

Query: 81  ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQY 140
            T LAGQIARA+ +F +DEVV++++K            + +++ + N     FL RLLQY
Sbjct: 63  KTILAGQIARALVLFCVDEVVIYEDK----------VPHPTSKVNPN----LFLARLLQY 108

Query: 141 LETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVD 200
           +ETP YLRKAL  + S L+F G+LP LD PHH  + +   +REGVTL  N  +   TLVD
Sbjct: 109 METPPYLRKALVPISSDLKFAGLLPSLDVPHHPARDDMTLYREGVTL--NKADDQSTLVD 166

Query: 201 VGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-------QVVPPSKPKE-SGMYWGY 252
           VGL +   +D+ + P VRVTV + +     AD+ +       +VV P  P+E SG+YWGY
Sbjct: 167 VGLFRRARIDRPVQPNVRVTVEL-SQVVAAADTKKGQKPIQAKVVSPKAPREKSGLYWGY 225

Query: 253 KVRYAPNISSVFKNCSYKGGYDHLIGTSE-HGDIVNSSDL--TLPTFRHLLIAFGGL-AG 308
            +R A + S V  +  Y+  YD L+G S+  GD V    +   +  F H+LIAFG    G
Sbjct: 226 SIRLASSFSKVMTDSPYE--YDFLVGVSDCKGDNVYQPSIKDKVKPFEHILIAFGAPNGG 283

Query: 309 LEESIEEDDGLK--RKNAREVFHSYFNTCPHQGSRTIRTE 346
           LEE+IE D+ LK   +NA E+F  +       G+R+IR E
Sbjct: 284 LEEAIEADEDLKVGEENANELFDLFIQPL-TSGTRSIRLE 322


>gi|367021134|ref|XP_003659852.1| hypothetical protein MYCTH_2115401 [Myceliophthora thermophila ATCC
           42464]
 gi|347007119|gb|AEO54607.1| hypothetical protein MYCTH_2115401 [Myceliophthora thermophila ATCC
           42464]
          Length = 340

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 115/306 (37%), Positives = 173/306 (56%), Gaps = 28/306 (9%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+AVP S+I +  + E  T   G++ARA+ IF +DEVVV+D+       + +     +
Sbjct: 39  TVSIAVPTSLITDCVTREQRTTNVGRVARALAIFSVDEVVVYDDSPMDKRITNADPDAYT 98

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
              D     A F+  +L YLETP ++RK LF +H +LR  G+LP LD PHH  K EW P+
Sbjct: 99  GDVDP----AHFMDHILNYLETPPFMRKVLFPLHPNLRSQGLLPSLDMPHHPHKDEWLPY 154

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR---QVV 238
           REG+TL+       GT+VD+G+   V + + + P  R+T+ M        D PR   + V
Sbjct: 155 REGMTLETPPKGGKGTVVDIGMPNTVTIAESIPPKTRLTLKM-------PDDPRGTPEPV 207

Query: 239 PPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT-- 295
            P+ P+ E G +WG+ VR A ++S+V     Y+ GYD  IGTSE G  ++ +    P+  
Sbjct: 208 HPTAPRTEGGYFWGFSVRRASSLSNVLTESPYEDGYDLSIGTSERGVPLSKA---FPSYE 264

Query: 296 ---FRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEE 347
              F+HL++ FGG  GLE +   D      G+     +E+F  + N  P+QG+R IRT+E
Sbjct: 265 QADFKHLIVIFGGPRGLEYAAMNDPQLSEMGISGSRTKELFDHWVNVLPNQGTRGIRTDE 324

Query: 348 AIFISL 353
           A+ I++
Sbjct: 325 ALLIAM 330


>gi|295663196|ref|XP_002792151.1| deoxyribose-phosphate aldolase [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279326|gb|EEH34892.1| deoxyribose-phosphate aldolase [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 399

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 134/354 (37%), Positives = 183/354 (51%), Gaps = 69/354 (19%)

Query: 62  TVSVAVPGSII-------------------DNTQSLELATRLAGQIARAVTIFRIDEVVV 102
           T+SVA+PGSII                   ++ QS +  T LAG IARA+ +F +DEVV+
Sbjct: 41  TLSVALPGSIIAKYALKEISKRNLPDTNSLNSAQSHDQKTFLAGSIARALAVFCVDEVVI 100

Query: 103 FDNKSSSDNYSRSSAANRSNRSDENES---GAAFLVRLLQYLETPQYLRKALFSMHSSLR 159
           F+++       R S A+     DE  +    + FL  +L YLETP +LRK LF MH +LR
Sbjct: 101 FEDEVRQ----RQSNAHCDLSKDEYTAYSNPSHFLAHVLSYLETPPHLRKFLFPMHKNLR 156

Query: 160 FVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG----------------------- 196
             G LP LD PHH+R +E   +REGVT   N  +  G                       
Sbjct: 157 TAGTLPSLDMPHHIRANELCEYREGVTFPANEEHGFGQSNAENHNNQHKRKKGKKNMEEG 216

Query: 197 --------TLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES- 246
                   TLVD GL   V +  +  P   RVTV + +++  D     +VV PS P+E  
Sbjct: 217 AQDTVMSYTLVDTGLPNKVRIPHIPVPENTRVTVKLCSSK--DPSQVAEVVSPSAPREEM 274

Query: 247 GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LPTFRHLLIAFG 304
           G YWGY VR   ++S VF  C + GGYD   GTSE G  + S  +T  +P F HL++ FG
Sbjct: 275 GYYWGYSVRCCDSLSDVFTECPFDGGYDLSFGTSERGSPL-SQVITEGVPNFEHLIVVFG 333

Query: 305 GLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           G+AGLE +++ D      GL+ +    +F  + N  P QGSRTIRTEEA+++ L
Sbjct: 334 GVAGLEVAVKADKELLARGLRPETVGNMFDYWVNVLPGQGSRTIRTEEAVWLGL 387


>gi|358396408|gb|EHK45789.1| hypothetical protein TRIATDRAFT_152499 [Trichoderma atroviride IMI
           206040]
          Length = 344

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 120/311 (38%), Positives = 172/311 (55%), Gaps = 26/311 (8%)

Query: 62  TVSVAVPGSIIDNTQSL----ELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSA 117
           TVSVA P S I    SL    E    +  ++ARA+ IF +DEV+V+D+  +S    R+  
Sbjct: 39  TVSVAFPSSTIATYASLLPTAEQRMAVPSRVARALAIFSVDEVIVYDDSPASSRPHRTDT 98

Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
           +  +  +D       FL  +L Y+ETP ++RKALF +H +LR    LP LD  HH    E
Sbjct: 99  SVYTGDTDP----VHFLTHVLSYMETPPFMRKALFPLHPNLRLTSSLPGLDTAHHPHPKE 154

Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
             P+REGVT+     +  GTLVD+G  K V + + + P  RVT+ +      D   P + 
Sbjct: 155 SMPYREGVTIVGMTKSGNGTLVDIGQEKPVEIKEDIPPKTRVTLKLSE----DESEPCEC 210

Query: 238 VPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT- 295
           V P  P+ E+G YWGY VR A ++S+VF    Y+ GYD  IGTSE G   + +    P+ 
Sbjct: 211 VHPHAPRTEAGFYWGYTVRKAASLSAVFTESPYENGYDLSIGTSERG---SPASKEFPSS 267

Query: 296 ----FRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
               F H+LI FGG  G+E +   D+     G++    RE+F  + +  P+QGSRTIRTE
Sbjct: 268 KRVQFNHVLIVFGGPRGIEYAAMNDEELGGMGIQGGKTRELFDHWVDVLPNQGSRTIRTE 327

Query: 347 EAIFISLQYFQ 357
           EA++I+L   +
Sbjct: 328 EAMYIALTVLK 338


>gi|169594860|ref|XP_001790854.1| hypothetical protein SNOG_00159 [Phaeosphaeria nodorum SN15]
 gi|111070534|gb|EAT91654.1| hypothetical protein SNOG_00159 [Phaeosphaeria nodorum SN15]
          Length = 443

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 144/412 (34%), Positives = 205/412 (49%), Gaps = 73/412 (17%)

Query: 9   GLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKK---RKRDQLNDDAPIEV----- 60
           G+  + ++A+T   H++QN      G+SSS   KK++    ++ ++++   P  V     
Sbjct: 25  GVGSKKRKASTVVIHQNQN------GESSSSPMKKRRANEDKEEEEIDTSRPTAVFTPKG 78

Query: 61  ---PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSS------SDN 111
               TVS+A+PGS ++N +  +  T   G+IARA  +F +DE+VVFD+         +  
Sbjct: 79  ARDWTVSIALPGSWLNNAKKPDHKTAQVGRIARAAAVFCVDEIVVFDDDPVNVPSVVAPK 138

Query: 112 YSRSSAANRSNRS------DENESGA---AFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
           Y R      S +       +E+E       FL  +L Y E P +LR  LF  H +L   G
Sbjct: 139 YMRKKGKQLSKQEVMDSILEEDEPWQNPDQFLYHVLSYAECPPHLRMKLFPRHKNLEQAG 198

Query: 163 MLPPLDAPHHLRKHEWAPFREGVTL---KENAPNSVG---------TLVDVGLNKHVVV- 209
           MLP +D PHHLR HEW  +REG+ +      AP   G           V+VGL   + V 
Sbjct: 199 MLPSMDMPHHLRSHEWCQYREGMVIGPAPAPAPTQKGKKSQDTLDYAYVEVGLPYPIRVP 258

Query: 210 ---DQVLDPGVRVTVAMGTNRNL---------DADSPRQVVPP-SKPK-ESGMYWGYKVR 255
                 ++ G+R TV       L         D DS   +    S P+ E G YWGY VR
Sbjct: 259 IPAGSPVEEGMRTTVRFSNPDPLASWPNMSKSDCDSIDAIACANSLPREEGGYYWGYTVR 318

Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHG---------DIVNSSDLTLPTFRHLLIAFGGL 306
            AP++SSVF  C Y  GYD+ IGTSE G         +  +S +  L  F+HLL+ FGG+
Sbjct: 319 RAPSLSSVFSECEYPSGYDYTIGTSERGIPIHSILPDNASSSHNKVLEPFKHLLLVFGGV 378

Query: 307 AGLEESIEED-----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           AG+E ++  D      GL +KNA  +F  + N  P QGSRTIRTEEA+   L
Sbjct: 379 AGIEPAVANDPELVAKGLGKKNADTLFDKWVNLVPGQGSRTIRTEEAVEFGL 430


>gi|402075402|gb|EJT70873.1| deoxyribose-phosphate aldolase 2 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 485

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 184/344 (53%), Gaps = 25/344 (7%)

Query: 34  GDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVT 93
           GD +    + K  +   +     P +  TVS+AVP SII      E       +IARA+ 
Sbjct: 18  GDGADTSEEIKSFKPSTRFTPTVPRDW-TVSIAVPSSIITEASRDEQRIGAPSRIARALA 76

Query: 94  IFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFS 153
           +F +DE+VVFD+  +    SR  A +    + + +    ++  LL YLE P ++RK LF 
Sbjct: 77  VFGMDEIVVFDDSPTD---SRPPAVDTDAYTGDVDP-CHYIWHLLTYLECPPFMRKTLFP 132

Query: 154 MHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVL 213
           +H +L+  G+LP LD P H  + EW P+ E +TL        GTLVD+G+ K + + + +
Sbjct: 133 LHPNLKLQGLLPNLDMPSHPHRDEWLPYMEAMTLPGKPDGGNGTLVDIGMRKPITISESI 192

Query: 214 DPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGG 272
            P  R+T+    N++ +     Q V P+ P+ ++G  WG+ VR   ++S V   C Y+GG
Sbjct: 193 PPKTRLTLHF-PNQSYEG---AQAVDPTAPRTDAGYAWGFNVRRCASLSDVLTECPYEGG 248

Query: 273 YDHLIGTSEHGDIVNSSDL-----TLPTFRHLLIAFGGLAGLEESIEEDD-----GLKRK 322
           YD  IGTSE G+  NS+ +       P F+HLLI FGG  GLE +   D      G+   
Sbjct: 249 YDLSIGTSERGE--NSASVFPYSSAPPDFKHLLIVFGGPRGLEYAAMNDPQLSEIGIGAG 306

Query: 323 NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ---EPISRA 363
             REVF  + N  P QGSR I+T+EA++I+L   +   +P+ RA
Sbjct: 307 RVREVFDHWINVLPDQGSRAIKTDEAVYIALTESRRRDQPLGRA 350


>gi|408392531|gb|EKJ71885.1| hypothetical protein FPSE_07986 [Fusarium pseudograminearum CS3096]
          Length = 342

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 116/301 (38%), Positives = 164/301 (54%), Gaps = 18/301 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+P SI+ N  + +       +IARA+ +F +DEVVVFD+   S    ++   + +
Sbjct: 41  TLSVAIPSSILTNLATADQRMTQPARIARALAVFSVDEVVVFDDSHVSTRPRQTDPNSYT 100

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +D       FL  +L +LE P ++RK LF +H +LR   MLP LD PHH    ++  +
Sbjct: 101 GDTDP----CHFLAHILSFLEAPPFMRKTLFPLHPNLRLTAMLPSLDMPHHPNPRDYLSY 156

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
            EGVT+        GTLV+ GL   V ++  + P  R+T+    + N     P  V P +
Sbjct: 157 AEGVTVTGATSTGSGTLVECGLENPVEIEADIPPKTRITLKFPEDEN---QYPEPVHPAA 213

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT----FR 297
              E G YWGY VR A ++S VF    Y+ GYD  IGTSE G  V +S    PT    F 
Sbjct: 214 PRTEGGYYWGYTVRKANSLSKVFTESPYENGYDVSIGTSERG--VPASKAFPPTKKVNFS 271

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
           HLLI FGG  G+E +   D+ L   +      +E+F  + N  P+QGSRTIRT+EA+FI+
Sbjct: 272 HLLIVFGGPRGIEFAAMNDEELSSMDIQGSKTKELFDHWINVLPNQGSRTIRTDEAVFIA 331

Query: 353 L 353
           L
Sbjct: 332 L 332


>gi|342883458|gb|EGU83952.1| hypothetical protein FOXB_05535 [Fusarium oxysporum Fo5176]
          Length = 340

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 115/301 (38%), Positives = 163/301 (54%), Gaps = 18/301 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVAVP SI+ +  S +       +IARA+  F +DEVVVFD+   S     +   + +
Sbjct: 39  TISVAVPSSILSDLASADQRMNQPARIARALAAFSVDEVVVFDDSPLSARPRHTDTNSYT 98

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             +D       FL  +L YLE P ++RKALF +H +LR   MLP LD PHH    +W  +
Sbjct: 99  GDTDP----CHFLTHILTYLEAPPFMRKALFPLHPNLRLTAMLPTLDMPHHPNPKDWISY 154

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG+ +        G+LV+VGL + V V+  + P  R+T+    +   +   P  V P +
Sbjct: 155 REGIVIPGETSTGSGSLVEVGLGEPVEVEAKIPPKTRITLKFSED---ETQYPEPVHPAA 211

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT----FR 297
              E G YWGY VR A ++S+VF    Y+ GYD  IGTSE G  + +S +  P+    F 
Sbjct: 212 PRTEGGYYWGYTVRRASSLSNVFTESPYENGYDVSIGTSERG--IPASRVFPPSKRVNFN 269

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKN-----AREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
           HL+I FGG  GLE +   D  L   +      +E+F  + N  P QG R IRT+EA+FI+
Sbjct: 270 HLIIVFGGPRGLEFAAMNDQELSSMDIQGPRTKELFDHWVNILPGQGCRNIRTDEAVFIA 329

Query: 353 L 353
           L
Sbjct: 330 L 330


>gi|331246527|ref|XP_003335896.1| replication factor C subunit 1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1470

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 137/338 (40%), Positives = 181/338 (53%), Gaps = 56/338 (16%)

Query: 62   TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNK-----------SSSD 110
            TVS+A+PGSI++N Q+ EL T L GQIARA  IF ++E+++FD               S 
Sbjct: 908  TVSIALPGSIVNNAQTWELKTALVGQIARACAIFSVNEIIIFDESVGEADPSAKPHQPSS 967

Query: 111  NYSRSSAANRSNRSDENESG--------AAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
             Y R  A  R+   D   +G        + F+ R+L+YLE PQYLRK+LF +H  LR  G
Sbjct: 968  TYGR--AKYRTPAEDLELAGPEDGPFQPSHFVARILEYLECPQYLRKSLFPIHPDLRLAG 1025

Query: 163  MLPPLDAPHHLRKHEWAPFREGVTL-KENAPNSVG-----------------TLVDVGLN 204
            +LPPLD PHH RK    P+REG  L  +   N V                  +  DVGL 
Sbjct: 1026 LLPPLDLPHHFRKDHETPWREGCILPADKVENYVAGGHYQKKKNRKGDDRKTSWADVGLA 1085

Query: 205  KHVVVDQVLDPGV------RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAP 258
            + V V   LD GV      RVTV M       A   R V P    + +G+YWGY +R A 
Sbjct: 1086 EPVQVQ--LDEGVSLPDWTRVTVQMPQGEGKLA---RLVSPRLPTQTTGIYWGYSIRLAS 1140

Query: 259  NISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED 316
            +IS VF    Y  +GGYD  IGTSE G+ V+     +  F+H+L+  GGL+GLE +I  D
Sbjct: 1141 SISKVFTETPYASEGGYDLTIGTSERGENVDDVVDGIGDFKHMLLVLGGLSGLELTIASD 1200

Query: 317  DGLKRK----NAREVFHSYFNTCPHQGSRTIRTEEAIF 350
            D L+ +    +A  +F  + NT P QGSRT+RTEEA+ 
Sbjct: 1201 DTLEPRLTAADAPLLFDHWLNTLPFQGSRTVRTEEALL 1238


>gi|46135861|ref|XP_389622.1| hypothetical protein FG09446.1 [Gibberella zeae PH-1]
          Length = 344

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 117/303 (38%), Positives = 164/303 (54%), Gaps = 20/303 (6%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLA--GQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           T+SVA+P SI+ N        R+    +IARA+ +F +DEVVVFD+   S    ++   +
Sbjct: 41  TLSVAIPSSILTNPSLATADQRMTQPARIARALAVFSVDEVVVFDDSPVSTRPRQTDPNS 100

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
            +  +D       FL  +L +LE P ++RK LF +H +LR   MLP LD PHH    ++ 
Sbjct: 101 YTGDTDP----CHFLAHILSFLEAPPFMRKTLFPLHPNLRLTAMLPSLDMPHHPNPRDYL 156

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
            + EGVT+        GTLV+ GL   V ++  + P  R+T+    + N     P  V P
Sbjct: 157 SYAEGVTVAGATSTGSGTLVECGLENPVEIEADIPPKTRITLKFPEDEN---QYPEPVHP 213

Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT---- 295
            +   E G YWGY VR A ++S VF    Y+ GYD  IGTSE G  V +S    PT    
Sbjct: 214 AAPRTEGGYYWGYTVRKANSLSKVFTESPYENGYDVSIGTSERG--VPASKAFPPTKKVN 271

Query: 296 FRHLLIAFGGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
           F HLLI FGG  G+E +   D+ L     +    +E+F  + N  P+QGSRTIRT+EA+F
Sbjct: 272 FNHLLIVFGGPRGIEFAAMNDEELSSMEIQGSKTKELFDHWINVLPNQGSRTIRTDEAVF 331

Query: 351 ISL 353
           I+L
Sbjct: 332 IAL 334


>gi|170589912|ref|XP_001899717.1| Uncharacterized ACR, COG2106 family protein [Brugia malayi]
 gi|158592843|gb|EDP31439.1| Uncharacterized ACR, COG2106 family protein [Brugia malayi]
          Length = 391

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 118/316 (37%), Positives = 166/316 (52%), Gaps = 18/316 (5%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+A+PGSI++N QS EL T LAGQIAR+  +F IDE+VV+D  +   +  R    N   
Sbjct: 80  LSIALPGSILNNAQSPELRTYLAGQIARSAAVFCIDEIVVYDETARMKSQQREDYCNGQW 139

Query: 123 RSD-----ENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
             D      N      L ++L++++ PQYLRK LF +  +L++ G+L PLD PHHLR  +
Sbjct: 140 YPDLPVHSGNVECNFHLAKILEFMDCPQYLRKTLFPLQKTLKYAGILNPLDCPHHLRASD 199

Query: 178 WA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
            + P+REGV L +      G + +VGL K    D+ +     V      N  L     R+
Sbjct: 200 LSVPYREGVVLDKPIKAGRGPICNVGLYKEPRKDEAISEFCPVFGLGSFNVPLKTAGIRK 259

Query: 237 --------VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS 288
                   V      +E+G+YWGYKVR A ++            YD +IGTSE G  V+ 
Sbjct: 260 RKRYHGCLVNSKQIKQEAGLYWGYKVRLALSLYDALN----AEEYDVIIGTSERGKPVSK 315

Query: 289 SDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
            +++      +LI FGGL GLE ++E D  +      E+F  Y N  P QGSR IRTEEA
Sbjct: 316 FEMSFSGKNRILIVFGGLEGLETAVEADKNINCSTPEELFEHYLNVVPGQGSRIIRTEEA 375

Query: 349 IFISLQYFQEPISRAL 364
           I I+L   +  I   L
Sbjct: 376 IPITLATLRPMICTDL 391


>gi|407918289|gb|EKG11560.1| hypothetical protein MPH_11053 [Macrophomina phaseolina MS6]
          Length = 436

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 143/388 (36%), Positives = 197/388 (50%), Gaps = 99/388 (25%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD-------NKSSSDN--- 111
           T+S+A+PGSII N QS +L T +AG IARA  +F +DE+V+FD       NK   D+   
Sbjct: 49  TLSIALPGSIIANAQSHDLKTAMAGAIARAAAVFTVDEIVIFDDGQSQVLNKGRRDSANG 108

Query: 112 --YSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDA 169
             Y  S  A+ +  SD +     FL  LL YLETP  LRKALF +H +LR  G LP LD 
Sbjct: 109 HMYGVSDYASYTGYSDPDY----FLYHLLSYLETPPNLRKALFPLHPNLRTAGTLPSLDM 164

Query: 170 PHHLRKHEWAPFREGVTL----------------------------------KEN----- 190
           PHHLR  EW  +REGV L                                  K+N     
Sbjct: 165 PHHLRADEWCQYREGVVLGASVYDDTSSVTGSYAGSYTGSPPAPNKPSSKKHKKNRKAYA 224

Query: 191 APNSV------GTLVDVGLNK--HVVVDQVLDPGVRVTVAM------GTNRNLDADSPRQ 236
           AP+++       TLVD GL    H+ +   + P  RVT+        G   NL+A    Q
Sbjct: 225 APDTMPGAGAPPTLVDAGLPSPIHLDLPSPIPPHTRVTLKFACAEDPGEYDNLEA----Q 280

Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDL 291
            + P+ P+ E+G YWGY+ R A ++S+VF  C + GGYD  +GTSE G     I+ S  +
Sbjct: 281 AIDPALPREEAGYYWGYQTRQASSLSAVFTECPWDGGYDVSVGTSERGVPLSQILGSGGI 340

Query: 292 ---------------TLPT-FRHLLIAFGGLAGLEESIEED-----DGLKRKNAREVFHS 330
                          +LP  F+HL++ FGG++GLE +   D      G+  KN  ++F +
Sbjct: 341 PKEDAGGGGVEGGGSSLPAGFKHLVLVFGGVSGLEVAAMADKELVAKGVTGKNVGDIFDA 400

Query: 331 YFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           + N  P+QGSRTIR EEA+++ L  F +
Sbjct: 401 WVNLVPNQGSRTIRCEEAVWLGLMGFSD 428


>gi|391863037|gb|EIT72351.1| hypothetical protein Ao3042_01340 [Aspergillus oryzae 3.042]
          Length = 700

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 142/374 (37%), Positives = 188/374 (50%), Gaps = 67/374 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF--------DNKSSSDNYS 113
           T++VA+PGSI+ N  S+E  T LAG IARA+ +F +DEVVVF        D      NY 
Sbjct: 327 TLTVAIPGSIVANAHSVEQKTLLAGIIARALAVFCVDEVVVFDDDENSPRDTYHGQGNYH 386

Query: 114 RSSAANRSNRSDENESGAA----------FLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
                  S   + N+S A           FL  +L YLETP YLRK LF MH +LR  G+
Sbjct: 387 EFPIDKTSGALNGNDSSAKRYTAYSDPSNFLAHILSYLETPPYLRKHLFPMHPNLRGAGL 446

Query: 164 LPPLDAPHHLRKHEWAPFREGVTL---------------------KENAPNS----VGTL 198
           LP LD PHHLR +EW  +REG+ +                     + ++P+S      T+
Sbjct: 447 LPSLDMPHHLRANEWCEYREGIVVSSSDGGGQRRHSTQMSNYHNNRRHSPSSPTNFSATV 506

Query: 199 VDVGLNKHVVVDQVLDPG-VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRY 256
           +D GL K VV+  +  P   RVTV      +   +   Q V PS P+ E+G YWGY VR 
Sbjct: 507 IDTGLPKKVVLPDIQLPEYARVTVRF---PDYGREHYAQPVHPSTPRSEAGYYWGYYVRR 563

Query: 257 APNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-------------SDLTLPTFRHLLIAF 303
             ++SSVF  C + GGYD   GTSE G  V S                  P ++HLLI F
Sbjct: 564 CRSLSSVFTECPFDGGYDLSFGTSERGAPVYSVLEEDRQEHDNYDRRKIPPDYKHLLIVF 623

Query: 304 GGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           GG+AG+E ++  D  L     +   A ++F  + N  P QGSRTIRTEEA+++ L     
Sbjct: 624 GGVAGIEAAVHNDPQLCDMDIRATEAGKLFDYWVNLLPGQGSRTIRTEEAVWLGLTSLWG 683

Query: 359 PISRALR-RVEYKS 371
            +    R R  YKS
Sbjct: 684 LVEGTHRPRPSYKS 697


>gi|358369151|dbj|GAA85766.1| deoxyribose-phosphate aldolase 2 [Aspergillus kawachii IFO 4308]
          Length = 430

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 133/389 (34%), Positives = 183/389 (47%), Gaps = 78/389 (20%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIF-----------------RIDEVVVFD 104
           T+SVA+PGSII N  S+E  T LAG IARA+ +F                    E    D
Sbjct: 46  TLSVAIPGSIIANAHSVEQKTLLAGVIARALAVFCVDEVVVFDDDENSAYCDFHEHYYHD 105

Query: 105 NKSSSDNYSR-SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
           N  S+   ++  S A+       N   + FL  +L YLETP YLRK LF +H +LR  G+
Sbjct: 106 NSPSAKTRNQMDSGASSPKAYTANSDPSHFLTHILSYLETPPYLRKHLFPIHPNLRTAGL 165

Query: 164 LPPLDAPHHLRKHEWAPFREGVTLKE-----------------------------NAPNS 194
           LP LD PHHLR +EW  FREG+ +                               +  N 
Sbjct: 166 LPSLDMPHHLRANEWCDFREGIVVSSPGRYSARNRHSTSKITQRSNDHQFHHHSSSTDNG 225

Query: 195 VGTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGY 252
             T+VD GL++ +V+  +  P   RVTV    + +  A+     V  S P+ E+G YWGY
Sbjct: 226 PATIVDTGLSQKIVLPNIHLPEHARVTVRFPQHESPYAEP----VHSSTPRVEAGYYWGY 281

Query: 253 KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS--------------------SDLT 292
            VR   ++SSVF  C Y GGYD   GTSE G  +++                     +  
Sbjct: 282 YVRRCRSLSSVFTECPYDGGYDLSFGTSERGMSISTVLEEKPQQAHDRYEQDHPPDREQH 341

Query: 293 LPTFRHLLIAFGGLAGLEESIEEDDGLKR-----KNAREVFHSYFNTCPHQGSRTIRTEE 347
           LP F+H+L+ FGG+AG+E ++  D  L+       +A ++F  + N  P QGSRTIRTEE
Sbjct: 342 LPDFQHILVVFGGIAGIEAAVRNDPQLRDMAIRPSDADKLFDYWVNFLPGQGSRTIRTEE 401

Query: 348 AIFISLQYFQEPISRALRRVEYKSKPEDG 376
           A+++ L           R       P DG
Sbjct: 402 AVWMGLTSLHGLFDGTHRLKRAYRYPHDG 430


>gi|114052745|ref|NP_001040575.1| uncharacterized protein LOC499770 [Rattus norvegicus]
 gi|68534765|gb|AAH98818.1| Similar to LOC495800 protein [Rattus norvegicus]
          Length = 242

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 102/224 (45%), Positives = 139/224 (62%), Gaps = 4/224 (1%)

Query: 145 QYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLN 204
           +YLRKA F  H  L+F G+L PLD+PHH+R+ E + FREGV +        G+LV+ G+ 
Sbjct: 19  RYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEFREGVVVDRPTKAGHGSLVNCGMK 78

Query: 205 KHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWGYKVRYAPNISS 262
           K V +D+ LDPG+RVTV +   +  +  + +  VV    P+ ++G+YWGY VR A  +S+
Sbjct: 79  KEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSSQDPRTKAGLYWGYTVRLASCLSA 138

Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK 322
           VF    ++ GYD  IGTSE G  V S+   LP FRH L+ FGGL GLE  ++ D  L+  
Sbjct: 139 VFAEAPFQDGYDLTIGTSERGSSVASTQ--LPNFRHALVVFGGLQGLEAGVDADPNLEVA 196

Query: 323 NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
           +   +F  Y NTCP QGSRTIRTEEAI ISL   Q  +++   R
Sbjct: 197 DPSVLFDFYVNTCPSQGSRTIRTEEAILISLAALQPGLTQGGAR 240


>gi|68067088|ref|XP_675514.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56494743|emb|CAH99398.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 392

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 193/367 (52%), Gaps = 58/367 (15%)

Query: 40  DNKKKKKRKRDQLNDDAPIEVPT--------VSVAVPGSIIDNTQSLELATRLAGQIARA 91
           +N+KK K+  +++ +  P  +P         VSVA+P +II+N   + + + ++  +AR 
Sbjct: 31  ENEKKIKKINEEIKEFIPKRIPNISRDLSVDVSVAIPATIINNKNDV-IKSYISSYLARI 89

Query: 92  VTIFRIDEVVVFDN------------------KSSSDNYSRSSA-ANRSNRSDENESGAA 132
            TIF I ++ ++D+                  K++ D+ S SS   N +N+  E      
Sbjct: 90  FTIFSISKIYIYDDGFELNRNERNEIKNSISQKTNKDHPSCSSTEKNENNKKYEYSYLCK 149

Query: 133 FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAP 192
           +L   LQYLETPQYLRK +F + + L+  G++ P+DAPHHLR  EW PFREGV +K+N+ 
Sbjct: 150 YLHFNLQYLETPQYLRKHIFPITNFLKHSGLMSPVDAPHHLRSDEWLPFREGVVIKKNSN 209

Query: 193 NSVGTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTNRNLDADSPR-------QVVPPSKP 243
           N    +VDVGL  +V+V+ +  ++ G RVT+   ++   +  +         +V+ PS P
Sbjct: 210 N---IIVDVGLFSNVLVENIYNINVGTRVTILFNSDGFNNFKNKNTKNLFIGKVIHPSMP 266

Query: 244 KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAF 303
           K   +YWGY ++   NI+ VF         D +IGTSE GD +      +   + +LI F
Sbjct: 267 KLYNIYWGYTIQILKNITDVF-----DIQVDCIIGTSERGDPIQDLKTQIKNAKSILIVF 321

Query: 304 GGLAGLEESIEEDDGLKRKNAR-------------EVFHSYFNTCPHQGSRTIRTEEAIF 350
           G   G+E+   ++  +K+                 + F  + NTCP Q SRTIRTEEAI 
Sbjct: 322 GNRDGVEDLFIKEREIKKNKMYTGKKRIKVLNKILKKFDYFINTCPKQTSRTIRTEEAIS 381

Query: 351 ISLQYFQ 357
           I+L  FQ
Sbjct: 382 ITLSLFQ 388


>gi|83773413|dbj|BAE63540.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 719

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 142/374 (37%), Positives = 187/374 (50%), Gaps = 67/374 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF--------DNKSSSDNYS 113
           T++VA+PGSI+ N  S+E  T LAG IARA+ +F +DEVVVF        D      NY 
Sbjct: 346 TLTVAIPGSIVANAHSVEQKTLLAGIIARALAVFCVDEVVVFDDDENSPRDTYHGQGNYH 405

Query: 114 RSSAANRSNRSDENESGAA----------FLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
                  S   + N+S A           FL  +L YLETP YLRK LF MH +LR  G+
Sbjct: 406 EFPIDKTSGALNGNDSSAKRYTAYSDPSNFLAHILSYLETPPYLRKHLFPMHPNLRGAGL 465

Query: 164 LPPLDAPHHLRKHEWAPFREGVTL---------------------KENAPNS----VGTL 198
           LP LD PHHLR +EW  +REG+ +                     + ++P+S      T+
Sbjct: 466 LPSLDMPHHLRANEWCEYREGIVVSSSDGGGQRRHSTQMSNYHNNRRHSPSSPTNFSATV 525

Query: 199 VDVGLNKHVVVDQVLDPG-VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRY 256
           +D GL K VV+  +  P   RVTV      +   +   Q V PS P+ E+G YWGY VR 
Sbjct: 526 IDTGLPKKVVLPDIQLPEYARVTVRF---PDYGREHYAQPVHPSTPRSEAGYYWGYYVRR 582

Query: 257 APNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-------------SDLTLPTFRHLLIAF 303
             ++SSVF  C + GGYD   GTSE G  V S                  P ++HLLI F
Sbjct: 583 CRSLSSVFTECPFDGGYDLSFGTSERGAPVYSVLEEDRQEHDNYDRRKIPPDYKHLLIVF 642

Query: 304 GGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           GG+AG+E  +  D  L     +   A ++F  + N  P QGSRTIRTEEA+++ L     
Sbjct: 643 GGVAGIEAVVHNDPQLCDMDIRATEAGKLFDYWVNLLPGQGSRTIRTEEAVWLGLTSLWG 702

Query: 359 PISRALR-RVEYKS 371
            +    R R  YKS
Sbjct: 703 LVEGTHRPRPSYKS 716


>gi|317151462|ref|XP_001824673.2| deoxyribose-phosphate aldolase [Aspergillus oryzae RIB40]
          Length = 419

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 142/374 (37%), Positives = 187/374 (50%), Gaps = 67/374 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF--------DNKSSSDNYS 113
           T++VA+PGSI+ N  S+E  T LAG IARA+ +F +DEVVVF        D      NY 
Sbjct: 46  TLTVAIPGSIVANAHSVEQKTLLAGIIARALAVFCVDEVVVFDDDENSPRDTYHGQGNYH 105

Query: 114 RSSAANRSNRSDENESGAA----------FLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
                  S   + N+S A           FL  +L YLETP YLRK LF MH +LR  G+
Sbjct: 106 EFPIDKTSGALNGNDSSAKRYTAYSDPSNFLAHILSYLETPPYLRKHLFPMHPNLRGAGL 165

Query: 164 LPPLDAPHHLRKHEWAPFREGVTL---------------------KENAPNS----VGTL 198
           LP LD PHHLR +EW  +REG+ +                     + ++P+S      T+
Sbjct: 166 LPSLDMPHHLRANEWCEYREGIVVSSSDGGGQRRHSTQMSNYHNNRRHSPSSPTNFSATV 225

Query: 199 VDVGLNKHVVVDQVLDPG-VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRY 256
           +D GL K VV+  +  P   RVTV      +   +   Q V PS P+ E+G YWGY VR 
Sbjct: 226 IDTGLPKKVVLPDIQLPEYARVTVRF---PDYGREHYAQPVHPSTPRSEAGYYWGYYVRR 282

Query: 257 APNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-------------SDLTLPTFRHLLIAF 303
             ++SSVF  C + GGYD   GTSE G  V S                  P ++HLLI F
Sbjct: 283 CRSLSSVFTECPFDGGYDLSFGTSERGAPVYSVLEEDRQEHDNYDRRKIPPDYKHLLIVF 342

Query: 304 GGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQE 358
           GG+AG+E  +  D  L     +   A ++F  + N  P QGSRTIRTEEA+++ L     
Sbjct: 343 GGVAGIEAVVHNDPQLCDMDIRATEAGKLFDYWVNLLPGQGSRTIRTEEAVWLGLTSLWG 402

Query: 359 PISRALR-RVEYKS 371
            +    R R  YKS
Sbjct: 403 LVEGTHRPRPSYKS 416


>gi|159482322|ref|XP_001699220.1| hypothetical protein CHLREDRAFT_152464 [Chlamydomonas reinhardtii]
 gi|158273067|gb|EDO98860.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 396

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 123/344 (35%), Positives = 175/344 (50%), Gaps = 51/344 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV +PG+++D   S+E A   AGQI RA   F++DEVVV+D+  ++  +      + +
Sbjct: 53  TLSVGLPGTLLDAPSSVEFAVATAGQIHRAAANFQVDEVVVYDDSLAATTFQPGQPLSAA 112

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALF-----SMHSSLRFVGMLPPLDAPHHLRKH 176
           + +     G A L R+LQYL+TP +LR  ++       H  LR    LP L  PHH R  
Sbjct: 113 SVT----PGTALLARILQYLDTPPHLRPNMYPEEVLQQHPELRLAASLPVLSPPHHARPS 168

Query: 177 E-WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMG----TNRNLDA 231
             W P+REGV LK  A    G+ VDVGL++ V ++  L  G RVTV +G      R   A
Sbjct: 169 AVWMPYREGVVLKAEA--GTGSYVDVGLDRMVYLEHSLQQGARVTVHLGDMEPVTRFFAA 226

Query: 232 DSPRQ----VVPPSKPKES-GMYWGYKVRYAPNISSVFKNC--SYKGGYDHLIGTSEHGD 284
            S       VVPPS+P+E  G+YWGY VR A  +  VFK+   S  G YD  IG+S  G 
Sbjct: 227 YSETMIVGTVVPPSEPRERLGLYWGYTVRIALGLQRVFKDAAVSKTGSYDLTIGSSPAGA 286

Query: 285 IVNSSDLTLPTFRHLLIAFG---------------------GLAG-------LEESIEED 316
             +   L +P F+H L+ FG                      + G       L++ +   
Sbjct: 287 DTDPGSLVMPRFKHALMVFGGGSPVAAAAAVAEGGAKAKPKAVPGQAVPQQDLDQMVNRV 346

Query: 317 DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPI 360
              + K  ++VF+ Y NT PH G+  IRTE+AI ++L Y   P+
Sbjct: 347 PDWEHKTPQDVFNLYLNTTPHLGTLRIRTEDAIPVALSYLIMPL 390


>gi|70950204|ref|XP_744446.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56524401|emb|CAH78880.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 357

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/340 (35%), Positives = 185/340 (54%), Gaps = 50/340 (14%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----------------- 105
           VSVA+P +II+N   + + + L+  +AR  TIF I  + ++D+                 
Sbjct: 27  VSVAIPATIINNKNDV-IKSYLSSYLARIFTIFSISTIYIYDDGFELNRNERNENRNNIP 85

Query: 106 -KSSSDNYSRSSA-ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGM 163
            K++++  + SS   N +N+  E      +L   LQYLETPQYLRK +F + + L+  G+
Sbjct: 86  PKTNNERPTISSTEKNENNKKYEYSYLCKYLHFNLQYLETPQYLRKHIFPITNFLKHSGL 145

Query: 164 LPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQV--LDPGVRVTV 221
           + P+DAPHHLR  EW PFREGV +K+N  N    +VDVGL  +V+V+ +  ++ G RVT+
Sbjct: 146 MSPVDAPHHLRSDEWLPFREGVVIKKNLNN---IIVDVGLFSNVLVENINNINVGTRVTI 202

Query: 222 AM---GTNRNLDADSPR----QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYD 274
                G NR  + ++      +++ PS PK   +YWGY ++   N++ VF         D
Sbjct: 203 LFDSEGFNRFKNKNTKNLFIGKIIHPSMPKLYNIYWGYNIQILKNLTDVF-----DVKVD 257

Query: 275 HLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLK-------RKNAREV 327
            +IGTSE GD +      + + R +LI FG   G+E+    +  +K       +K A+ +
Sbjct: 258 CIIGTSERGDPIQDLKTQIKSARSILIVFGNRDGVEDLFIREREIKKNKTYTGKKRAKVL 317

Query: 328 ------FHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPIS 361
                 F  + NTCP+Q SRTIRTEEAI I+L  FQ  +S
Sbjct: 318 NKILKKFDYFINTCPNQTSRTIRTEEAISITLSLFQSILS 357


>gi|396469502|ref|XP_003838421.1| hypothetical protein LEMA_P113570.1 [Leptosphaeria maculans JN3]
 gi|312214989|emb|CBX94942.1| hypothetical protein LEMA_P113570.1 [Leptosphaeria maculans JN3]
          Length = 436

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 131/370 (35%), Positives = 177/370 (47%), Gaps = 79/370 (21%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN-------KSSSDNYSR 114
           TVS+A+PGS   N +  +  T   G+IARA  +F +DE+VV+D+       K  S+ Y+R
Sbjct: 54  TVSIALPGSWTLNAKKPDHKTMQVGRIARAAAVFSVDEIVVYDDDPCNIDPKVVSEKYTR 113

Query: 115 SSAANRSNR-----------SDENE---SGAAFLVRLLQYLETPQYLRK-------ALFS 153
           +       +           ++E+E   +   FL  LL + E P +LR        +LF 
Sbjct: 114 AKGGRGKQQKKSKQEILDSIAEEDELWQNPDQFLYHLLAFAECPPHLRYDRDQPGLSLFK 173

Query: 154 MHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL-KENAPNSVGT--------------- 197
            H +L++ G LP +D PHHLR HEW  +REGV +    AP    T               
Sbjct: 174 EHQNLKWTGCLPSMDMPHHLRSHEWCQYREGVVVGPAPAPTPSSTAKSSKKAKMAGDGGL 233

Query: 198 ---LVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR-----------QVVP 239
               V  GL   V V    +  ++ G+R TV   TN       P               P
Sbjct: 234 EYAYVKCGLPYPVRVPIPPESPIEEGMRTTVRF-TNATEPPSWPHLSPEACTNLSATACP 292

Query: 240 PSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLP 294
            S P+E SG YWGY VR AP++S+VF  C Y  GYD+ IGTSE G     I+       P
Sbjct: 293 SSTPREESGYYWGYTVRRAPSLSAVFSECEYPAGYDYTIGTSERGVPVHSILPGGAKPCP 352

Query: 295 T------FRHLLIAFGGLAGLEESIEED-----DGLKRKNAREVFHSYFNTCPHQGSRTI 343
           T      F HLLI FGG+AG+E ++  D      GL +  A E+F ++ N  P QGSRTI
Sbjct: 353 TTRAPASFNHLLIVFGGVAGIEPAVANDPELVAKGLGKATAHELFDAWVNLVPGQGSRTI 412

Query: 344 RTEEAIFISL 353
           RTEEA+   L
Sbjct: 413 RTEEAVEFGL 422


>gi|449019335|dbj|BAM82737.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 380

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 144/383 (37%), Positives = 188/383 (49%), Gaps = 99/383 (25%)

Query: 36  SSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIF 95
           +S+C   ++K R              T++VA+PGS++ N Q+  L T LAGQIARA+ IF
Sbjct: 8   ASNCTKSEQKGRSW------------TLAVALPGSVLANAQTPALQTYLAGQIARALAIF 55

Query: 96  RIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAF---LVRLLQYLETPQYLRKALF 152
            +DEV++F  +       R++ A     +  + S  AF   L RLLQY E PQYLR+ALF
Sbjct: 56  EVDEVIIFAERV------RATIAG----TVSSPSSTAFDTHLARLLQYAECPQYLRQALF 105

Query: 153 SMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGT-----LVDVGLNKHV 207
                L++VG+L PLD PHH R+HE  P+REGV +   AP++  T      V VG +  V
Sbjct: 106 PHTPELQYVGLLNPLDTPHHPRRHEGTPYREGVVVP--APDTGATSDRSRYVHVGWDAPV 163

Query: 208 VV--DQVLDPGVRVTVAM-------------GTNRNLDADS------------PRQV-VP 239
            V  D  L  G RVTV               G+NR    DS             R V V 
Sbjct: 164 AVDTDMPLPVGQRVTVRFEKLERSGHYVAEQGSNRKRKRDSHPSLHMQGCWTWTRAVMVA 223

Query: 240 PSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV----------NS 288
             +P+E  G+YWGYKVR A N+           G D  IGTSE G  +           +
Sbjct: 224 RDEPRERLGLYWGYKVRLAANLVQALST----SGADVKIGTSERGTDLFATLWGSSAERA 279

Query: 289 SDLT----------------LPT--FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHS 330
           +D T                +P   FR LL+AFGG+ GLE S    D  K      +F +
Sbjct: 280 TDSTAAETEIDGASARRASFIPPGGFRKLLVAFGGIHGLEASYGSPDVAK------LFDA 333

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           Y N CPHQGSRTIRTEEA+ + L
Sbjct: 334 YVNACPHQGSRTIRTEEALLVVL 356


>gi|320589534|gb|EFX01995.1| deoxyribose-phosphate aldolase [Grosmannia clavigera kw1407]
          Length = 341

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 108/302 (35%), Positives = 168/302 (55%), Gaps = 18/302 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+P SI+ +  + E       ++ARA+  F IDE+V++D+   S   SR  A +  
Sbjct: 38  TVSVAIPTSILSDCTTSEQRINAPSRVARALAAFSIDELVIYDD---SPMESRPKAVDHD 94

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
             + + +  + +L  LL Y+E P ++RKALF +H +LR  G+ P L+ PHH  + E  P+
Sbjct: 95  GYTGDTDP-SHYLFHLLSYIEAPPFMRKALFPLHPNLRLQGLFPNLEMPHHPHRDEHLPY 153

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
            EG+T+         TLV++GL K V + + +    RVT+        D ++    V P+
Sbjct: 154 MEGLTVAGKPDAGHKTLVELGLRKPVSIKEEIPVKTRVTLRFPRESFEDGEA----VDPA 209

Query: 242 KPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
            P+ E+G YWGY++R   ++S+VF   +Y+ GYD  IGTSE G     +   +      F
Sbjct: 210 APRTEAGYYWGYQLRQCSSLSAVFTESAYENGYDVSIGTSERGLPTSQVFPRATTGTLDF 269

Query: 297 RHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
           +HL++  GG  GLE +   D      G+ + N RE F  + N  P+QG+R IRT+EA+FI
Sbjct: 270 QHLIVVLGGPRGLEYAAMNDPDLAGVGITKANVRETFDHWVNVLPNQGTRNIRTDEAVFI 329

Query: 352 SL 353
           +L
Sbjct: 330 AL 331


>gi|452821207|gb|EME28240.1| hypothetical protein Gasu_42420 [Galdieria sulphuraria]
          Length = 269

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 102/267 (38%), Positives = 153/267 (57%), Gaps = 26/267 (9%)

Query: 44  KKKRKRDQ------LNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRI 97
           KKKR + Q      ++    + + T+SVA+P SI++N Q  +L T + GQ+AR +TIF++
Sbjct: 6   KKKRAKVQPKLTKGIHLGPQLTLGTLSVAIPCSILENIQKAQLQTYVIGQLARTLTIFKV 65

Query: 98  DEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
           DE++++               ++    +E     AFL  LL+YLETPQYLR+ LF M  +
Sbjct: 66  DEIILY---------------SQDRNKEEAAPSIAFLKLLLEYLETPQYLRRHLFPMSEN 110

Query: 158 LRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVV--DQVLDP 215
           L++ G+L P+DAPHHLRK EW P+REGV L+E        LVDVGL++   V     +  
Sbjct: 111 LKYAGLLNPIDAPHHLRKDEWLPWREGVILEEKQGGVY--LVDVGLDRPATVKAQHNVKK 168

Query: 216 GVRVTVAMGTNRNLDADSPRQVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYD 274
           G R+TV+   ++         +   + P+E  G+YWGY+VR A  +   ++ C Y+GGYD
Sbjct: 169 GERITVSFDQSKETSDPLMGSLCFKATPREKGGLYWGYQVRIATALCKTWEECPYEGGYD 228

Query: 275 HLIGTSEHGDIVNSSDLTLPTFRHLLI 301
            +IGTSE G  ++     L   RHLL+
Sbjct: 229 KVIGTSERGKSIDDPSFRLTPCRHLLL 255


>gi|399216712|emb|CCF73399.1| unnamed protein product [Babesia microti strain RI]
          Length = 367

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 125/338 (36%), Positives = 171/338 (50%), Gaps = 61/338 (18%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+P SI+ N Q+ EL   + G IAR   IF I E+V+F++  S          N+ 
Sbjct: 38  TVSIAIPISILANCQTEELRAYVVGNIARTCAIFGIAEIVLFNDFESLSENELKRYKNKP 97

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFS-------MHSSLRFVGMLPPLDAPHHLR 174
           +   +  S   +   +L+YLETPQYLR+++F         +  L+F G+L PLDAPHH+R
Sbjct: 98  HM--QPHSLMNYFTLILEYLETPQYLRRSIFDYEKGKLVCYLDLKFAGLLNPLDAPHHMR 155

Query: 175 KHEWAPFR-------EGVTLKENAPNSVGTLVDVGLNKHVVVD------------QVLDP 215
            +EW PFR       +G+ L   +    G  VD GL   V+VD            QV++ 
Sbjct: 156 ANEWLPFRYINHIYRQGILLPGYSME--GGSVDCGLPVPVLVDNKKVNIENVKGGQVIED 213

Query: 216 G-----VRVTVAMGTN--------RNLDADSPRQV------VPPSKPKE-SGMYWGYKVR 255
                  +VTV +           +  D     +V      VP S P    G+YWGY VR
Sbjct: 214 HKSNRYQQVTVQLDIKSLSLCAKLKYTDGQCQEEVQFSGKIVPNSLPTTLMGLYWGYTVR 273

Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEE 315
                  +F + +Y+  YD +IGTSEHG+++ S    LP F HLLI FGGL GLE  I++
Sbjct: 274 QCQTFEEIFTSVNYE--YDLVIGTSEHGEVI-SGQFKLPKFEHLLIVFGGLKGLEVVIDK 330

Query: 316 DDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            +        + F  Y N C HQ SRTIRTEEAI I L
Sbjct: 331 PE--------KKFKYYINICEHQHSRTIRTEEAIMICL 360


>gi|429328463|gb|AFZ80223.1| hypothetical protein BEWA_030760 [Babesia equi]
          Length = 388

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 125/374 (33%), Positives = 184/374 (49%), Gaps = 80/374 (21%)

Query: 34  GDSSSCDNKKKKKRKRD------QLNDDAPIEVP--------TVSVAVPGSIIDNTQSLE 79
           G+ +  + K+ K +K D      QL  + P ++P        T+SVA+P S++ N QS E
Sbjct: 32  GEVAKPEGKQAKHKKEDDGLSSIQLFKEIPQKIPFMGRPREHTISVALPASLVQNVQSEE 91

Query: 80  LATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQ 139
           L   + G IAR +TI+ I+EVV++++                   +E           L+
Sbjct: 92  LRAYVIGNIARTLTIYGINEVVLYND-----------------MGNEGTKWQDHFALNLR 134

Query: 140 YLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL---KENAPNSVG 196
           YLETPQYLR+ L+ M+ +L++ G+  PLDAPHHLR +EW P+REGV     K        
Sbjct: 135 YLETPQYLRRYLYPMNYALKYAGLQNPLDAPHHLRSNEWLPYREGVIKLIPKSQCGKHNK 194

Query: 197 TLVDVGLNKHVV------------VDQVLDPG-----VRVTV-----------------A 222
              + GL ++V             VD V D G      RVT+                  
Sbjct: 195 FFAECGLFRNVEIKNISALSSIYGVDHVTDDGDEEIYQRVTIRLDGSSLTQCKQYWKNNK 254

Query: 223 MGTNRNLDADSPRQVVPPSKP-KESGMYWGYKVRYAPNISSVFKNCSYK--GGYDHLIGT 279
           +G+  + +      +V P +P + +G+YWGY VR A +   V   C +   G YD+ IGT
Sbjct: 255 LGSTDSGNYSLSGYIVSPEEPLQRAGLYWGYTVREADSFEQVLVECPFNDSGEYDYKIGT 314

Query: 280 SEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQG 339
           SE G++    +  LP +++LL+ FG + GLE  +E        N  + F  Y+N C HQ 
Sbjct: 315 SERGELF-GPNTKLPKYKNLLVVFGPVNGLEHIME--------NPSDKFDKYYNFCLHQK 365

Query: 340 SRTIRTEEAIFISL 353
           SRTIRTEEA+ ISL
Sbjct: 366 SRTIRTEEALSISL 379


>gi|389585659|dbj|GAB68389.1| hypothetical protein PCYB_132630 [Plasmodium cynomolgi strain B]
          Length = 475

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 122/360 (33%), Positives = 178/360 (49%), Gaps = 74/360 (20%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD------------ 110
           V VA+P SII+N   + + + L   +AR  T+F + +V ++D++ +++            
Sbjct: 121 VCVAIPSSIINNRSDV-IKSYLTSYLARIFTVFSVSKVYIYDDQMANERCEQQLNSGNST 179

Query: 111 --------------NY----------SRSSAANRSNRSDENESG-------AAFLVRLLQ 139
                         NY           +  +++ S++  E E           +L   LQ
Sbjct: 180 HGTAHPHKTHEGDQNYHIDRVGSKHQGKRPSSHSSSQEGEGEKSTPQYSYLCQYLHYNLQ 239

Query: 140 YLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLV 199
           YLETPQYLRK LF +   L+  G++ P+DAPHHLR  EW PFREGV +K+    S G +V
Sbjct: 240 YLETPQYLRKHLFPITHFLKHSGIMNPVDAPHHLRSDEWLPFREGVVIKK---KSNGVIV 296

Query: 200 DVGLNKHVVVDQV--LDPGVRVTVAMGTN-------RNLDADSPRQVVPPSKPKESGMYW 250
           DVGL  H  ++ V  +D G RVTV    N       RN +A    +++ PS PK   +YW
Sbjct: 297 DVGLFAHAHIENVTHVDIGTRVTVLFHPNSFLLFQRRNGNALFSGKLINPSTPKLYNLYW 356

Query: 251 GYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLE 310
           GY V     +S VF       G D+++GTSE G  +  +   +     +LI FG   GLE
Sbjct: 357 GYSVELLKRLSDVF-----DLGVDYIVGTSERGHCMGDAADAIRGVSSILIVFGNKQGLE 411

Query: 311 ESIEEDDGLKRKNAREV-------------FHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
           + + ++   ++K +  V             FH + NTCPHQ SRTIRTEEAI I+L  F 
Sbjct: 412 DLLIKEREERKKKSYAVEKRSKVLNKMLKKFHLFVNTCPHQTSRTIRTEEAIAITLSLFH 471


>gi|341900584|gb|EGT56519.1| hypothetical protein CAEBREN_07270 [Caenorhabditis brenneri]
          Length = 378

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 110/325 (33%), Positives = 175/325 (53%), Gaps = 20/325 (6%)

Query: 45  KKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD 104
           KK K ++++   P    T+S+AVPG  ++N QS EL T +AGQIARA T++R+DE++++D
Sbjct: 53  KKAKIEKVDHTPPF---TISIAVPGQFLNNAQSSELRTYMAGQIARAATLYRVDEIIIYD 109

Query: 105 NKSSSDNYSRSSAANRSNR-----SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLR 159
                 N + ++  N + +     ++ N  G  +L ++L+YLE PQYLRK LF +   L+
Sbjct: 110 ESCRMTNEAVNAYYNGTWQGTLLPAETNYEGCFYLAKILEYLECPQYLRKDLFPIQKPLK 169

Query: 160 FVGMLPPLDAPHHLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ---VLDP 215
           + G+L PLDA HHL+  E    FREGV LK+ +    G + ++GL+K   +D     L P
Sbjct: 170 YAGLLNPLDAQHHLKSDEKTFRFREGVVLKKRSKEGRGPICNIGLDKEFEIDSDAVSLPP 229

Query: 216 GVRVTVAMGTNRNLDADSPRQVVP--PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGY 273
             RVTV +  N N      R  +    +  +ESG YWGY VR    +  V +       +
Sbjct: 230 YTRVTVEI-KNLNEQCKLYRGSITSGAAVTRESGQYWGYSVRLMTGLQKVLEG----KKF 284

Query: 274 DHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFN 333
           D + G S  G + +  D+ +     +L+ FGG+AG++ ++E ++  + + A + F     
Sbjct: 285 DVVAGVSPRGKLASQMDVCILNKPKILLVFGGVAGVDAAVESEELAEWRRAEDAFDVLIR 344

Query: 334 TCPH-QGSRTIRTEEAIFISLQYFQ 357
           T     GSR+ R EE +   L   Q
Sbjct: 345 TTSLPNGSRSERVEENVLSVLAQVQ 369


>gi|449669826|ref|XP_002157979.2| PREDICTED: uncharacterized protein LOC100206864 [Hydra
           magnipapillata]
          Length = 767

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 97/218 (44%), Positives = 131/218 (60%), Gaps = 12/218 (5%)

Query: 85  AGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETP 144
           A  IARA  IF +DE+++F N+S S   +       +  SD N      L R+LQYLE P
Sbjct: 557 AVLIARAAVIFNVDEIIIF-NESLSKKTNEGLFKGSTKSSDPN----ILLARILQYLECP 611

Query: 145 QYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLN 204
           QYLRK  F +H  L++ G+L PLD PHH+R  +   +REGVTL        G+ V+VG+ 
Sbjct: 612 QYLRKEFFPVHQDLKYAGILNPLDCPHHMRADDIVAYREGVTLNRPVKKGEGSFVNVGIM 671

Query: 205 KHVVVDQVLDPGVRVTVAMGTNRNLDADSPR----QVVPPSKPK-ESGMYWGYKVRYAPN 259
           K V +D+ L P +RVTV +        +  R    +VV PS P+ E+G+YWGY VR A N
Sbjct: 672 KDVKIDKHLKPNIRVTVKLDPYEQETIEKKRYLTGRVVTPSTPREEAGIYWGYSVRLAEN 731

Query: 260 ISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
            SSVF NC YKGGYD  IGTSE+G  +NS  + +P+++
Sbjct: 732 FSSVFTNCPYKGGYDLTIGTSENGGDINS--IEIPSYK 767


>gi|156086160|ref|XP_001610489.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797742|gb|EDO06921.1| conserved hypothetical protein [Babesia bovis]
          Length = 373

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 144/407 (35%), Positives = 195/407 (47%), Gaps = 104/407 (25%)

Query: 3   NKKKRGGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIE--- 59
            KKKR     +L++ A  E  + Q E +  N    S D +    R   ++    P +   
Sbjct: 12  TKKKRKDAR-KLRKEAPQELRKKQKESTSKNTIPKSTDGEMSMVRLLREIPQRIPFQGKP 70

Query: 60  -VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA 118
              T SVA+P SI+ N Q+ EL   + G IAR++TI+ I EVV++++             
Sbjct: 71  SKKTFSVALPVSILQNIQTDELRAYIIGNIARSLTIYGIHEVVIYND------------- 117

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
                                 LETPQYLRK +F +   LR  G+  PLDAPHHLR +EW
Sbjct: 118 ----------------------LETPQYLRKYMFPIVPHLRNTGLQNPLDAPHHLRSNEW 155

Query: 179 APFREGVTLKE--NAPNSVGTL---VDVGLNKHVVVD--QVLDP--GV------------ 217
            P+REGV   E  +A  S+G L   VD GL   + VD  + L    GV            
Sbjct: 156 LPYREGVVKLEEKSASLSMGNLDMMVDCGLFGRIKVDNPEALSEIYGVERFCLDNEDSVY 215

Query: 218 -RVTVAMGTNRNLDADSPRQ------------------------VVPPSKPKE-SGMYWG 251
            RVTV       LDA+S R+                        +VPP +P+   G+YWG
Sbjct: 216 QRVTVM------LDANSLRECKRRWKEQLNGPSYSDSCTMLSGKLVPPDEPQSLVGLYWG 269

Query: 252 YKVRYAPNISSVFKNCSYK--GGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGL 309
           + VR A +   VF +C +   G YD+ IGTSE GDI    D+++P F+H+L+  G + GL
Sbjct: 270 FVVREAESYEGVFSDCPFSTDGKYDYTIGTSERGDIY-PKDVSVPHFQHMLMVLGPIKGL 328

Query: 310 EESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
            ESI+E       N RE   +Y N CP+Q SRTIRTEEA+ I L  F
Sbjct: 329 -ESIKE-------NPREDLDAYVNFCPNQRSRTIRTEEALTICLSQF 367


>gi|221060084|ref|XP_002260687.1| hypothetical protein, conserved [Plasmodium knowlesi strain H]
 gi|193810761|emb|CAQ42659.1| hypothetical protein, conserved [Plasmodium knowlesi strain H]
          Length = 460

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 136/425 (32%), Positives = 200/425 (47%), Gaps = 95/425 (22%)

Query: 13  ELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAP--IE--VPTVS---- 64
           E+K AA  E  +   +    +G       KK  ++ +  LN+D    IE  +P VS    
Sbjct: 47  EIKAAALREGDDPSTQAHQTDG------TKKSPEKAKAVLNEDIKKCIEKRIPNVSRNLQ 100

Query: 65  ----VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR------ 114
               VA+P SII+N   + + + L   +AR  T+F + +V ++D++  +D + R      
Sbjct: 101 VDLYVAIPSSIINNRSDV-IKSYLTSYLARIFTVFSVSKVYIYDDQMMNDRFERREQQLN 159

Query: 115 ------------------------------------SSAANRSNRSDENESGAAFLVRLL 138
                                               S ++++    D++    ++L + L
Sbjct: 160 SGNSTHGTSHRQTTHEGDKSYHNGRVGSMHLGKRPSSHSSSQEGEGDKSNPEYSYLCQYL 219

Query: 139 ----QYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNS 194
               QYLETPQYLRK LF +   L+  G++ P+DAPHHLR  EW PFREGV +K+    S
Sbjct: 220 HYNLQYLETPQYLRKHLFPITHFLKHSGIMNPVDAPHHLRSDEWLPFREGVVVKK---KS 276

Query: 195 VGTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTN-------RNLDADSPRQVVPPSKPKE 245
            G +VDVGL     ++ V  +D G RVTV    N       RN +     +++ PS PK 
Sbjct: 277 NGVIVDVGLFAPAHIENVNHVDIGTRVTVLFHPNSFVLFQRRNANVLFSGKLINPSTPKL 336

Query: 246 SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGG 305
             +YWGY V     +S VF         D+++GTSE G  +  +   +   R +LI FG 
Sbjct: 337 YNLYWGYSVELLKRLSDVF-----DIDVDYIVGTSERGHSMGDATDAIRGVRSILIVFGN 391

Query: 306 LAGLEESI--EEDDGLKRKNARE-----------VFHSYFNTCPHQGSRTIRTEEAIFIS 352
             GLE+ +  E ++  K+  A E            FH + NTCPHQ SRTIRTEEAI I+
Sbjct: 392 KQGLEDLLIKEREERKKKSYAMEKRNKVLSKMLKKFHLFVNTCPHQTSRTIRTEEAIAIT 451

Query: 353 LQYFQ 357
           L  F 
Sbjct: 452 LSLFH 456


>gi|451851982|gb|EMD65277.1| hypothetical protein COCSADRAFT_35351 [Cochliobolus sativus ND90Pr]
          Length = 443

 Score =  181 bits (459), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 128/360 (35%), Positives = 179/360 (49%), Gaps = 69/360 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-------- 113
           TV++A+PGS   N +  +  T   G+IARA  I+ +DE+VV+D+  S+ +++        
Sbjct: 73  TVTIALPGSWTLNAKKPDHKTIQVGRIARAAAIYCVDEIVVYDDDPSNIDHNVVEQKYIR 132

Query: 114 --RSSAANRSNRSDENESGA---AFLVRLLQYLETPQYLRK-------ALFSMHSSLRFV 161
             R+      + S+++E+      FL  LL + E P +LR        ++F  H +L++V
Sbjct: 133 KGRTKQQVLDSISEQDEAWQNPDQFLYHLLAFAECPPHLRYDREDPRLSIFKEHQNLKWV 192

Query: 162 GMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGT-----------LVDVGLNKHVVV- 209
           G LP +D PHHLR HEW  +REGV L  +AP   G+            V  GL   + V 
Sbjct: 193 GNLPSMDMPHHLRSHEWCQYREGVFLG-SAPPPGGSKSKKVAREDYAYVKCGLPYPIRVP 251

Query: 210 ---DQVLDPGVRVTVAMGTNR------NL---DADS-PRQVVPPSKPKE-SGMYWGYKVR 255
              +  ++ G+R TV            NL   D DS        S P+E  G YWGY VR
Sbjct: 252 IPKEAPVEQGMRTTVRFANQEAPRGWPNLSQQDCDSLEATACAASLPREEGGYYWGYNVR 311

Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS----------SDLTLPT-------FRH 298
            A ++ +VF  C Y GGYD  IGTSE G  V S           D++  T       F H
Sbjct: 312 RAASLGAVFSECEYPGGYDMSIGTSERGQSVYSFLSNTGGVSPGDMSSSTVSGAPESFNH 371

Query: 299 LLIAFGGLAGLEESIEED-----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           LLI FGG AG+E ++  D      GL +  A ++F  + N  P QGSRTIRTEEA+ I L
Sbjct: 372 LLIVFGGQAGIEPAVASDPELTSKGLDKSTAHQLFDVWVNLVPRQGSRTIRTEEAVIIGL 431


>gi|451997668|gb|EMD90133.1| hypothetical protein COCHEDRAFT_1178547 [Cochliobolus
           heterostrophus C5]
          Length = 443

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 132/419 (31%), Positives = 193/419 (46%), Gaps = 84/419 (20%)

Query: 15  KEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPT------------ 62
           + A + E    +   +L + DS     KK+K   ++Q  +D     PT            
Sbjct: 17  RNAKSNEKENGKKRKTLGDNDSPM---KKRKGDGQEQSVEDIDTSRPTAVFNPTKGRNWT 73

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS-------DNYSRS 115
           V++A+PGS   N +  +  T   G+IARA  I+ +DE+VV+D+  S+         Y R 
Sbjct: 74  VTIALPGSWTLNAKKPDHKTIQVGRIARAAAIYCVDEIVVYDDDPSNIDSKVIDQRYIRK 133

Query: 116 SAANR------SNRSDENESGAAFLVRLLQYLETPQYLRK-------ALFSMHSSLRFVG 162
               +      S + +  ++   FL  LL + E P +LR        ++F  H +L++VG
Sbjct: 134 GRTKQQVLDSISEQDEAWQNPDQFLYHLLAFAECPPHLRYDREDPRLSIFKEHQNLKWVG 193

Query: 163 MLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGT-----------LVDVGLNKHVVV-- 209
            LP +D PHHLR HEW  +REGV +  +AP   GT            V  GL   + V  
Sbjct: 194 NLPSMDMPHHLRSHEWCQYREGVFVG-SAPAPGGTKSKKVAQEDYAYVKCGLPYPIRVPI 252

Query: 210 --DQVLDPGVRVTVAMG-----------TNRNLDADSPRQVVPPSKPKESGMYWGYKVRY 256
             +  ++ G+R T+              + ++ D             +E G YWGY VR 
Sbjct: 253 PKEAPVEQGMRTTIRFANYEAPKGWPNLSQQDCDGLEATACAASLPREEGGYYWGYNVRR 312

Query: 257 APNISSVFKNCSYKGGYDHLIGTSEHGDIV-------------NSSDLTLP----TFRHL 299
           A ++ +VF  C Y GGYD  IGTSE G  V             N S  T+     +F HL
Sbjct: 313 AASLGAVFSECEYPGGYDMSIGTSERGQSVYSFLSNTGGVSPGNMSGSTVSGAPESFNHL 372

Query: 300 LIAFGGLAGLEESIEED-----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           LI FGG AG+E ++  D      GL +  A ++F ++ N  P QGSRTIRTEEA+ I L
Sbjct: 373 LIVFGGQAGIEPAVASDPELTSKGLDKSTAHQLFDAWVNLVPRQGSRTIRTEEAVIIGL 431


>gi|82594085|ref|XP_725278.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480219|gb|EAA16843.1| Uncharacterized ACR, COG2106, putative [Plasmodium yoelii yoelii]
          Length = 502

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 114/345 (33%), Positives = 177/345 (51%), Gaps = 59/345 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----KSSSDNYSRSSAA 118
           VSVA+P +II+N   + + + L+  +AR  TIF I ++ ++D+      +  N  +++ +
Sbjct: 163 VSVAIPATIINNKNDV-IKSYLSSYLARIFTIFSISKIYIYDDGFEINCNERNEIKNTIS 221

Query: 119 NRSNRSD------------------------ENESGAAFLVRLLQYLETPQYLRKALFSM 154
            ++N+                          E      +L   LQYLETPQYLRK LF +
Sbjct: 222 QKTNKEHFSCSSRENNEKNENNKNNKNNKKFEYSYLCKYLHFNLQYLETPQYLRKHLFPI 281

Query: 155 HSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQV-- 212
            + L+  G++ P+DAPHHLR  EW PFREGV +K+N+ NS+  +VDVGL  +V+V+ +  
Sbjct: 282 TNFLKHSGIMSPVDAPHHLRSDEWLPFREGVVIKKNS-NSI--IVDVGLFSNVLVENIYN 338

Query: 213 LDPGVRVTVAMGTNRNLDADSPR-------QVVPPSKPKESGMYWGYKVRYAPNISSVFK 265
           ++ G RVT+   +    +  +         +V+ PS PK   +YWGY +    N++ VF 
Sbjct: 339 INIGTRVTILFDSYAFNNFKNKNTKNLFIGKVIHPSMPKLYNIYWGYTIHILKNLTDVF- 397

Query: 266 NCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAR 325
                   D +IGTSE GD +      + + + +LI FG   G+E+    +  +K+    
Sbjct: 398 ----DIQVDCIIGTSERGDPIQDLKTQIKSAKSILIVFGNRDGVEDLFIREREMKKNKMY 453

Query: 326 -------------EVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQ 357
                        + F  + NTCP Q SRTIRTEEAI I+L  FQ
Sbjct: 454 TGKKRIKVLNKILKKFDYFINTCPKQTSRTIRTEEAISITLSLFQ 498


>gi|268576392|ref|XP_002643176.1| Hypothetical protein CBG24142 [Caenorhabditis briggsae]
          Length = 433

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 104/308 (33%), Positives = 165/308 (53%), Gaps = 17/308 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AVPG  ++N QS EL T +AGQIARA T++R+DE++++D      N S ++  N +
Sbjct: 67  TISIAVPGQFLNNAQSSELRTYMAGQIARAATLYRVDEIIIYDESCRMTNESVNAYYNGN 126

Query: 122 NR-----SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
                  ++ N  G  +L ++L+YLE PQYLR+ LF +   L++ G+L PLDA HHL+  
Sbjct: 127 WEGSLLPAESNYEGCFYLAKILEYLECPQYLRRDLFPIQKPLKYAGLLNPLDAQHHLKSD 186

Query: 177 EWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ---VLDPGVRVTVAMGTNRNLDAD 232
           E    FREGV LK+ +    G + ++GL+K   +D     L P  RVTV +  N N    
Sbjct: 187 EKTFKFREGVVLKKRSKEGRGPICNIGLDKEFEIDTDTITLPPYTRVTVEI-KNLNEQCK 245

Query: 233 SPRQVVPPSK--PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
             R  +       +E+G YWGY VR    +  V +       +D + G S  G + +  D
Sbjct: 246 LYRGSITSGTRVTRETGQYWGYSVRLMTGLQKVLEG----KKFDIVAGVSPRGKLASQVD 301

Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPH-QGSRTIRTEEAI 349
           + +     +L+ FGG++G++ ++E ++  + + A + F     T     GSR+ R EE +
Sbjct: 302 VCILNKPKILLVFGGVSGVDAAVESEELAEWRRAEDAFDVLIRTTSLPNGSRSERVEENV 361

Query: 350 FISLQYFQ 357
              L   Q
Sbjct: 362 LSVLAQMQ 369


>gi|346323916|gb|EGX93514.1| deoxyribose-phosphate aldolase 2 [Cordyceps militaris CM01]
          Length = 402

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 171/335 (51%), Gaps = 45/335 (13%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAA-NR 120
           T+SVAVPGSI+ N  + E       ++AR++ IF +DEV+V+D+ +++   SRSS    R
Sbjct: 60  TISVAVPGSILHNLPTAEQRLATVARLARSLAIFCVDEVIVYDDDAAAVPSSRSSTTITR 119

Query: 121 SNRSDENESG----AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR-K 175
            + S +  +G      F+ ++L +LE P ++R+ LF +H++LR   +LP +D PHH R K
Sbjct: 120 LDGSPDAYTGDSDPCHFVAQVLSFLECPPFMRRRLFPLHANLRHTALLPAVDMPHHPRPK 179

Query: 176 HEWAPFREGVTLKENAPNSVGT----LVDVG-LNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
             W P+ EGVT+   A    G     +VD G     V V   + P  RVT+ +GT+    
Sbjct: 180 AAWLPYVEGVTVPPAAEEEGGGGSGSVVDTGPCGGRVCVPDDIPPHTRVTLHLGTHAA-- 237

Query: 231 ADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---- 286
           A  P    P +   E G YWGY VR   ++S VF  C +  GYD  +GTSE G  +    
Sbjct: 238 AQQPVCADPHAPRAEGGYYWGYAVRRCASLSRVFTECPHAAGYDLSVGTSERGTPLAQVM 297

Query: 287 -----------------------NSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGL---- 319
                                  + +    P+F+HL++ FGG  G+E + + D  L    
Sbjct: 298 AEHQQKRAQQKRRGYGHHQADSDDGAAAATPSFQHLVVVFGGPRGIELAADHDPALAAMH 357

Query: 320 -KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            +    RE+F  + +  P QGSR I T+EA+ I+L
Sbjct: 358 IRGPRTRELFDHWVDVLPGQGSRAIATDEAVPIAL 392


>gi|417401122|gb|JAA47457.1| Hypothetical protein [Desmodus rotundus]
          Length = 448

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 128/205 (62%), Gaps = 4/205 (1%)

Query: 161 VGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVT 220
            G+L PLD+PHH+R+ E + FREGV +        G+ V+ G+ K V +D+ L+PG+RVT
Sbjct: 241 TGLLNPLDSPHHVRQEEESEFREGVVVDRPTRPGRGSFVNCGMKKEVQIDKNLEPGLRVT 300

Query: 221 VAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIG 278
           V +   +  +  + R +VV    P+ ++G+YWGY VR A  +S+VF    ++ GYD  IG
Sbjct: 301 VRLNQEQLPECKTYRGKVVSSQDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIG 360

Query: 279 TSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQ 338
           TSE G  V S+ L  P+FRH L+ FGGL GLE  ++ D  L+      +F  Y NTCP Q
Sbjct: 361 TSERGSDVASAQL--PSFRHALVVFGGLQGLEAGVDADPNLEVAEPSVLFDLYLNTCPGQ 418

Query: 339 GSRTIRTEEAIFISLQYFQEPISRA 363
           GSRTIRTEEAI ISL   Q  +++A
Sbjct: 419 GSRTIRTEEAILISLAALQPSLTQA 443



 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 59/101 (58%), Gaps = 4/101 (3%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF ++E+VVFD +               
Sbjct: 74  TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVEEIVVFDEEGQDAKTVEGEFRGVG 133

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVG 162
            +          L R+LQYLE PQYLRKA F  H  L+F G
Sbjct: 134 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAG 170


>gi|218194448|gb|EEC76875.1| hypothetical protein OsI_15076 [Oryza sativa Indica Group]
 gi|218194449|gb|EEC76876.1| hypothetical protein OsI_15077 [Oryza sativa Indica Group]
 gi|222628481|gb|EEE60613.1| hypothetical protein OsJ_14027 [Oryza sativa Japonica Group]
          Length = 212

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 86/122 (70%), Positives = 98/122 (80%), Gaps = 11/122 (9%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+AV GSIIDN QSLELAT LAGQIARA T+FRIDEVVVFD+ SS +N          
Sbjct: 68  TVSIAVAGSIIDNAQSLELATLLAGQIARAATVFRIDEVVVFDSNSSVEN---------- 117

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
              D+ ESGA FLVR+LQYLETPQYLR+ LF MH++L+FVG+LPPLDAPHHLRKHEW+ F
Sbjct: 118 -SGDDVESGARFLVRILQYLETPQYLRRRLFPMHNNLKFVGLLPPLDAPHHLRKHEWSEF 176

Query: 182 RE 183
           RE
Sbjct: 177 RE 178


>gi|308463009|ref|XP_003093783.1| hypothetical protein CRE_24777 [Caenorhabditis remanei]
 gi|308249389|gb|EFO93341.1| hypothetical protein CRE_24777 [Caenorhabditis remanei]
          Length = 378

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 172/325 (52%), Gaps = 20/325 (6%)

Query: 45  KKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFD 104
           KK K ++++   P    T+S+AVPG  ++N QS EL T +AGQIARA T++R+DE++++D
Sbjct: 53  KKAKIEKVDHTPPF---TISIAVPGQFLNNAQSSELRTYMAGQIARAATLYRVDEIIIYD 109

Query: 105 NKSSSDNYSRSSAANRSNR-----SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLR 159
                 + + ++  N + +     ++ N  G  +L ++L+YLE PQYLRK LF +   L+
Sbjct: 110 ESCRMTDEAVNAYYNGTWQGNLLPAENNYEGCFYLAKILEYLECPQYLRKDLFPIQKPLK 169

Query: 160 FVGMLPPLDAPHHLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ---VLDP 215
           + G+L PLDA HHL+  E    FREGV LK+ +    G +  +GL+K   +D     L P
Sbjct: 170 YAGLLNPLDAQHHLKYDEKTFRFREGVVLKKRSKEGRGPICSIGLDKEFEIDSDAVSLPP 229

Query: 216 GVRVTVAMGTNRNLDADSPRQVVP--PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGY 273
             RVTV +  N        R  +    +  +E+G YWGY VR    +  V +       +
Sbjct: 230 YTRVTVEI-KNLTEQCKLYRGSITNGAAVTRETGQYWGYSVRLLTGLQKVLEG----KKF 284

Query: 274 DHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFN 333
           D + G S  G +    ++ +     +L+ FGG+AG++ ++E ++  + + A + F     
Sbjct: 285 DMVAGVSPRGKLATEMNVCILNKPKILLVFGGVAGVDAAVESEELAEWRRAEDAFDMLIR 344

Query: 334 TCPH-QGSRTIRTEEAIFISLQYFQ 357
           T     GSR+ R EE +   L   Q
Sbjct: 345 TTSLPNGSRSERVEENVLSVLAQVQ 369


>gi|124809062|ref|XP_001348481.1| conserved protein, unknown function [Plasmodium falciparum 3D7]
 gi|23497376|gb|AAN36920.1| conserved protein, unknown function [Plasmodium falciparum 3D7]
          Length = 444

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 120/361 (33%), Positives = 181/361 (50%), Gaps = 80/361 (22%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDN----------------- 105
           V VA+P +II+N   + + + L   +AR  TIF I ++ ++D+                 
Sbjct: 86  VCVAIPSTIINNKSDV-IKSYLTSFLARIFTIFSISKIYIYDDYLEQEHLINKQKNYGDY 144

Query: 106 -------------------------KSSSDNYSRSSAANRSNRSDENES-GAAFLVRL-- 137
                                    K S+DN   +     +N  ++ ES   ++L +   
Sbjct: 145 QNEKYKKDIKNNSNNVNKNMHMNLKKDSNDNDHNTFDNIVNNEKNKEESPKYSYLCKYMH 204

Query: 138 --LQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSV 195
             LQYLETPQYLRK LF + + L+  G++ P+DAPHHLR  EW PFREGV L++   +S 
Sbjct: 205 YNLQYLETPQYLRKHLFPITNFLKNSGIMNPVDAPHHLRSDEWLPFREGVVLQK---SSN 261

Query: 196 GTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTN-------RNLDADSPRQVVPPSKPKES 246
           G +VDVGL  +V+++ V  ++ G RVTV   +N       +N +     +++ PS PK  
Sbjct: 262 GIIVDVGLFSNVLIENVNYINIGTRVTVLFQSNSYELFQKKNPNVLFTGKLIHPSVPKLY 321

Query: 247 GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGL 306
            +YWGY +    N+S +F     K   D+LIGTSE G  ++ S ++    + +LI FG  
Sbjct: 322 NLYWGYSIEILKNLSDIF-----KLQVDYLIGTSERGKFIDDSMVSFRDAKSVLIVFGNK 376

Query: 307 AGLEESI--------------EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
            GLE+ +              E+   +  K  ++ F  + NTCP Q SRTIRTEEAI I+
Sbjct: 377 DGLEDLLIKEKEKKKNKVFSPEKKIKVLNKMLKK-FDLFINTCPCQTSRTIRTEEAITIT 435

Query: 353 L 353
           L
Sbjct: 436 L 436


>gi|17551846|ref|NP_498602.1| Protein B0361.6 [Caenorhabditis elegans]
 gi|1730629|sp|Q10950.1|YMP6_CAEEL RecName: Full=Uncharacterized protein B0361.6
 gi|373253800|emb|CCD61821.1| Protein B0361.6 [Caenorhabditis elegans]
          Length = 378

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 103/308 (33%), Positives = 166/308 (53%), Gaps = 17/308 (5%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AVPG  ++N QS EL T +AGQIARA T++R+DE++++D      + + ++  N +
Sbjct: 67  TISIAVPGQFLNNAQSAELRTYMAGQIARAATLYRVDEIIIYDESCRMTDEAVNAYYNGT 126

Query: 122 NR-----SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH 176
            +     ++ N  G  +L ++L+YLE PQYLRK LF +   L+  G+L PLDA HHL+  
Sbjct: 127 WQGNLIPAETNYEGCFYLAKILEYLECPQYLRKDLFPIQKPLKNAGLLNPLDAQHHLKYD 186

Query: 177 EWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ---VLDPGVRVTVAMGTNRNLDAD 232
           E    FREGV LK+ + +  G + ++GL K   +D     L P  RVTV +  N      
Sbjct: 187 EKTLRFREGVVLKKRSKDGRGPICNIGLEKEFEIDSDAVQLPPYTRVTVEI-KNLTEQCK 245

Query: 233 SPRQVVP--PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
             R  +    +  +E+G+YWGY VR    +  V +       +D + G S  G + +  D
Sbjct: 246 LYRGSITSGATVTRETGLYWGYSVRLMTGLQKVLQ----AKKFDIVAGVSPRGKLASQMD 301

Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCP-HQGSRTIRTEEAI 349
           + +     +L+ FGG+AG++ ++E ++  + + A + F     T     GSR+ R EE +
Sbjct: 302 VCILNKPKILLVFGGVAGVDAAVESEELAEWRRAEDAFDVLIRTTSLSNGSRSERVEENV 361

Query: 350 FISLQYFQ 357
              L   Q
Sbjct: 362 LSVLAQVQ 369


>gi|156101914|ref|XP_001616650.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805524|gb|EDL46923.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 484

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 123/373 (32%), Positives = 173/373 (46%), Gaps = 87/373 (23%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD------------ 110
           V VA+P SII+N   + + + L   +AR  T+F +  V ++D++  S+            
Sbjct: 117 VCVAIPSSIINNRSDV-IKSYLTSYLARIFTVFAVARVYIYDDQMGSERDGRHERRDSHE 175

Query: 111 ---------NYSRSSAANRS-----------------------NRSDENESG-------- 130
                    N S S+A  +                        ++S  +E G        
Sbjct: 176 QREQQLKRGNSSHSTALPQKTHEREKTPHSDRDGSNHQRKRPISQSSSHEGGEGENPPPE 235

Query: 131 ----AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVT 186
                 +L   LQYLETPQYLRK LF +   L+  G++ P+DAPHHLR  EW PFREGV 
Sbjct: 236 YSYLCQYLHYNLQYLETPQYLRKHLFPITHFLKHSGIMNPVDAPHHLRSDEWLPFREGVV 295

Query: 187 LKENAPNSVGTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTN-------RNLDADSPRQV 237
           +K+    S G +VDVGL  H  ++ V  +D G RVTV    N       RN D     ++
Sbjct: 296 IKK---KSNGVIVDVGLFAHAHIENVHHVDIGTRVTVLFHPNSFLLFQRRNADVLFSGKL 352

Query: 238 VPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR 297
           + P  PK   +YWGY V     +S VF         D+++GTSE G  +      +    
Sbjct: 353 INPGTPKLYNLYWGYSVELLKRLSDVF-----DLDVDYIVGTSERGRFMGEVADAVRGVS 407

Query: 298 HLLIAFGGLAGLEESIEEDDGLKRKNAREV-------------FHSYFNTCPHQGSRTIR 344
            +LI FG   GLE+ + ++   ++K +  V             FH + NTCPHQ SRTIR
Sbjct: 408 SILIVFGNKQGLEDLLIKEREEQKKKSYAVEKRSKVLSKMLKKFHLFVNTCPHQTSRTIR 467

Query: 345 TEEAIFISLQYFQ 357
           TEEAI I+L  F 
Sbjct: 468 TEEAIAITLSLFH 480


>gi|125547448|gb|EAY93270.1| hypothetical protein OsI_15078 [Oryza sativa Indica Group]
          Length = 204

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 84/150 (56%), Positives = 108/150 (72%), Gaps = 8/150 (5%)

Query: 205 KHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISSV 263
           ++V+V+Q+L+PG RVT+AMGTNR++     R++V PS P+ E  +YWGYKVRYA N+  V
Sbjct: 13  RNVLVEQMLEPGKRVTIAMGTNRDITTACKRKIVSPSSPRDEMELYWGYKVRYASNLGGV 72

Query: 264 FKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI-------AFGGLAGLEESIEED 316
           F +  YK GYD++IGTSEHG I++SS+L LP+FR             GGLAGLEE IEED
Sbjct: 73  FSDSPYKEGYDYIIGTSEHGKIISSSELILPSFRVAAFGALAPFNCIGGLAGLEECIEED 132

Query: 317 DGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
             LK K   +VF++Y NTCP QGSRTIRTE
Sbjct: 133 RNLKGKGVDDVFNTYLNTCPSQGSRTIRTE 162


>gi|189202400|ref|XP_001937536.1| deoxyribose-phosphate aldolase 2 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187984635|gb|EDU50123.1| deoxyribose-phosphate aldolase 2 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 440

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 133/406 (32%), Positives = 182/406 (44%), Gaps = 88/406 (21%)

Query: 33  NGDSSSCDNKKKKKRKRDQLNDDAPIEVPT------------VSVAVPGSIIDNTQSLEL 80
           N   +S   KK+K   ++Q  +D     PT            V++A+PGS   N +  + 
Sbjct: 25  NSHDNSSPTKKRKSDNQEQSMEDVDTSKPTAIFNPTKGRDWTVTIALPGSWTLNAKKPDH 84

Query: 81  ATRLAGQIARAVTIFRIDEVVVFDN------------KSSSDNYSRSSAANRSNRSDEN- 127
            T   G+IARA  I+ +DE+VVFD+            K      S+    +     DE  
Sbjct: 85  KTIQVGRIARAAAIYCVDEIVVFDDDPVNIDPRVIDPKYIRKGRSKQQVLDSILEQDEAW 144

Query: 128 ESGAAFLVRLLQYLETPQYLRK-------ALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
           ++   FL  LL + E P +LR        ++F  H +L++VG LP +D PHHLR HEW  
Sbjct: 145 QNPDQFLYHLLSFAECPPHLRYDREDHRLSIFKEHQNLKWVGNLPSMDMPHHLRSHEWCQ 204

Query: 181 FREGVTLKENAP--------------NSVG-TLVDVGLNKHVVV----DQVLDPGVRVTV 221
           +REGV +    P              +  G   V  GL   V V    D  ++ G+R TV
Sbjct: 205 YREGVFVGPAPPLPSTPKSKKSKKSKDEQGYAYVKCGLPYPVRVPTPKDVPIEEGMRTTV 264

Query: 222 AMGT--------------NRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNC 267
                               NLDA +    +P    +E G YWGY VR A  +S V+  C
Sbjct: 265 RFANIDPPNNWPHLSQVDCDNLDATACAASLPR---EEGGYYWGYTVRRAACLSDVYSEC 321

Query: 268 SYKGGYDHLIGTSEHGDIV--------NSSDLTLP--------TFRHLLIAFGGLAGLEE 311
            Y GGYD  IGTSE G  V         S  +T          +F+HLLI FGG AG+E 
Sbjct: 322 EYPGGYDFSIGTSERGQSVYSFISGVGPSQSVTWTGDVAKAPESFKHLLIVFGGQAGIEP 381

Query: 312 SIEED----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           ++  D      L +  A  +F ++ N  P QGSRTIRTEEA+ I L
Sbjct: 382 AVANDPVLGKELNKSTANSIFDAWINLVPAQGSRTIRTEEAVTIGL 427


>gi|330922064|ref|XP_003299680.1| hypothetical protein PTT_10728 [Pyrenophora teres f. teres 0-1]
 gi|311326537|gb|EFQ92220.1| hypothetical protein PTT_10728 [Pyrenophora teres f. teres 0-1]
          Length = 437

 Score =  173 bits (439), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 126/365 (34%), Positives = 168/365 (46%), Gaps = 76/365 (20%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS-DNYSRSSAANR 120
           TV++A+PGS   N +  +  T   G+IAR   I+ +DE+VVFD+   + D+        R
Sbjct: 63  TVTIALPGSWTLNAKKPDHKTIQVGRIARTAAIYCVDEIVVFDDDPVNIDSRVIDPKYIR 122

Query: 121 SNRSDEN------------ESGAAFLVRLLQYLETPQYLRK-------ALFSMHSSLRFV 161
             RS +             ++   FL  LL + E P +LR        ++F  H +L++V
Sbjct: 123 KGRSKQQVLDSILEQDEAWQNPDQFLYHLLSFAECPPHLRYDREDPRLSIFKEHQNLKWV 182

Query: 162 GMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG---------------TLVDVGLNKH 206
           G LP +D PHHLR HEW  +REGV +    P S                   V  GL   
Sbjct: 183 GNLPSMDMPHHLRSHEWCQYREGVFVGPAPPLSSTPKSKKSKKPTDEQEYAYVKCGLPYP 242

Query: 207 VVV----DQVLDPGVRVTVAMGT--------------NRNLDADSPRQVVPPSKPKESGM 248
           V V    D  ++ G+R TV                    NLDA +    +P    +E G 
Sbjct: 243 VRVPTPKDVPIEEGMRTTVRFANVNPPRNWPHLSQVDCDNLDATACAASLPR---EEGGY 299

Query: 249 YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS-------SDLTLPT------ 295
           YWGY VR A  +S V+  C Y GGYD  IGTSE G  V S       S    PT      
Sbjct: 300 YWGYTVRRAACLSDVYSECEYPGGYDFAIGTSERGQSVYSFISGVGPSQSFTPTGGITKA 359

Query: 296 ---FRHLLIAFGGLAGLEESIEED----DGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
              F+HLLI FGG AG+E ++  D      L +  A  +F ++ N  P QGSRTIRTEEA
Sbjct: 360 PESFKHLLIVFGGQAGIEPAVANDPVLGKELNKSTANSIFDAWVNLVPAQGSRTIRTEEA 419

Query: 349 IFISL 353
           + I L
Sbjct: 420 VTIGL 424


>gi|71032507|ref|XP_765895.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68352852|gb|EAN33612.1| hypothetical protein, conserved [Theileria parva]
          Length = 396

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 132/392 (33%), Positives = 196/392 (50%), Gaps = 91/392 (23%)

Query: 23  HESQNELSLANGD-SSSCDNKKKKKRKRDQ------------LNDDAPIEVP-------- 61
           H+S   + L+N +  +  DNK  K+ KR +            L  + P  +P        
Sbjct: 26  HKSNIHVELSNKNLDNVTDNKVLKENKRPKKDTNKYNLNYVRLFKEIPQRIPFLGKPRDY 85

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF---DNKSSSDNYSRSSAA 118
           T+SVA+P SI++N QS +L   + G IAR +TI+ ++EVV++   D+KSS  N+    + 
Sbjct: 86  TISVALPVSIMENIQSDDLRAYVIGNIARTLTIYGVNEVVLYNDLDDKSS--NWMEYFSL 143

Query: 119 NRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
           N                  L+YLETPQYLRK LF M ++L+F G+  P++APHHLR  EW
Sbjct: 144 N------------------LRYLETPQYLRKFLFPMDNALKFAGLQNPINAPHHLRSTEW 185

Query: 179 APFREGVT--LKENAPNSVGTLVDVGLNKHV----------------VVDQVLDPGVRVT 220
            P+REGV   +++++ N+     D G+   V                + D+  D   RVT
Sbjct: 186 LPYREGVVGLIRKDSTNTT-LYADCGVFSSVKIHNKEDLKEFYGIEYMTDEDYDVYQRVT 244

Query: 221 VAMG------TNRNL-DADSPRQ---------VVPPSKP-KESGMYWGYKVRYAPNISSV 263
           V +        ++N  +  S  Q         +V P +P K +G+YWGY VR    +   
Sbjct: 245 VRLDDESIDKCHKNWKEGKSLSQSETGSLSAYLVHPEEPLKVAGLYWGYIVRECNTVVES 304

Query: 264 FKNCSY--KGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKR 321
            K C +     YD  +GTSE G++  S +  LP F+++LI FG + GLE  ++E +    
Sbjct: 305 LKGCPFNENARYDLKVGTSERGELY-SLNTKLPKFKNMLIYFGPVLGLEHVMQEPE---- 359

Query: 322 KNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
               + F  YFN C  Q SRTIRTEEA+ I L
Sbjct: 360 ----KKFDKYFNFCNQQKSRTIRTEEALLIVL 387


>gi|301113670|ref|XP_002998605.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262111906|gb|EEY69958.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 293

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 87/182 (47%), Positives = 123/182 (67%), Gaps = 9/182 (4%)

Query: 44  KKKRKRDQLNDD--APIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVV 101
           +KK+++DQL+++  A     T+S+A+PGSI+DN Q+ EL T LAGQ+ARA  IF++DE+V
Sbjct: 96  EKKQQQDQLDNERAAKGRAYTLSMAIPGSILDNAQTKELKTYLAGQVARAAVIFQVDEIV 155

Query: 102 VFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFV 161
           VFD++   +N S  +   R++          F+ R+LQYLETPQYLR+ALF MH+ L   
Sbjct: 156 VFDDQLG-NNVSTGAPKKRTH------DCHVFMARILQYLETPQYLRRALFPMHTDLSCA 208

Query: 162 GMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTV 221
           G+L PLD PHHLR  EW+ +REGV       +  G+ V VGL + VVVD+ + PG+RVTV
Sbjct: 209 GLLNPLDCPHHLRAQEWSVYREGVVTDRPLKDKEGSHVYVGLQREVVVDKRIQPGIRVTV 268

Query: 222 AM 223
            +
Sbjct: 269 KI 270


>gi|171684083|ref|XP_001906983.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942002|emb|CAP67654.1| unnamed protein product [Podospora anserina S mat+]
          Length = 368

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 117/317 (36%), Positives = 160/317 (50%), Gaps = 64/317 (20%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+S+AVP SII +  + E  T  AG IARA+ IF +DEVV+FD+ S  +   R+   +  
Sbjct: 101 TISIAVPTSIITSCVTREQRTTAAGSIARALAIFSVDEVVIFDD-SPIEQRPRNFDPDAY 159

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               E    A FL  LL YLETP ++RK LF +H +L+  G+L  LD PHH  K EW P+
Sbjct: 160 TGDIEP---AHFLEHLLNYLETPPFMRKVLFPIHPNLKSQGLLHGLDMPHHPHKDEWLPY 216

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG+TL+  AP   G                                     P  V P  
Sbjct: 217 REGLTLE--APPRSG------------------------------------KPEPVNPAE 238

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLP------T 295
              E G +WGY VR A ++S VF + +Y+ GYD  IGTSE G  ++ +    P      T
Sbjct: 239 PRTEGGYFWGYSVRKAKSLSDVFTSSAYEDGYDLSIGTSERGVPLSKA---FPNHNQTAT 295

Query: 296 FRHLLIAFGGLAGLEESIEEDD-----GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
           F H+LI FGG  GLE +   D      G++    +E+F  + N  P+QG+R IRT+E++ 
Sbjct: 296 FNHMLIVFGGPRGLEFAAMNDPDLGQMGIQGARTKELFDHWVNVLPNQGTRGIRTDESLL 355

Query: 351 ISLQYFQEPISRALRRV 367
           I+L         ALRR+
Sbjct: 356 IALT--------ALRRL 364


>gi|312087671|ref|XP_003145563.1| hypothetical protein LOAG_09988 [Loa loa]
          Length = 282

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 98/258 (37%), Positives = 141/258 (54%), Gaps = 23/258 (8%)

Query: 50  DQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKS-- 107
           D +N+        +S+A+PGSI++N QS EL T LAGQIAR+  IF +DE++V+D  +  
Sbjct: 30  DNMNETVNGRFLMLSIALPGSILNNAQSPELRTYLAGQIARSAAIFCVDEIIVYDETARM 89

Query: 108 ---SSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGML 164
                D+Y              N      L R+L+++E PQYLRK LF +  +LR+ G+L
Sbjct: 90  KPQQRDDYCSGKWCPDLPVHSGNVECNFHLARILEFMECPQYLRKTLFPLQKTLRYAGIL 149

Query: 165 PPLDAPHHLRKHEWA-PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ--VLDPGVRVTV 221
            PLD PHHLR  + + P+REGV L +      G + ++GL K V +++  VL  G RVTV
Sbjct: 150 NPLDCPHHLRASDLSVPYREGVVLDKPVKAGRGPICNIGLYKEVQINEEVVLQAGTRVTV 209

Query: 222 AMGTNRNLDADSPRQ------VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH 275
            +     LD  S R+      V      +E+G+YWGYKVR A ++          G YD 
Sbjct: 210 KV-----LDIYSERKRYHGCLVNSQQIKRETGLYWGYKVRIALSLHDALN----AGEYDV 260

Query: 276 LIGTSEHGDIVNSSDLTL 293
           +IGTSE G  V+  ++ L
Sbjct: 261 IIGTSERGKPVSKFEMPL 278


>gi|344255280|gb|EGW11384.1| Uncharacterized protein C9orf114-like [Cricetulus griseus]
          Length = 193

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 84/193 (43%), Positives = 119/193 (61%), Gaps = 4/193 (2%)

Query: 173 LRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD 232
           +R+ E + FREG+ +        G+LV+ GL K V +D+ L+PG+RVTV +   +  +  
Sbjct: 1   MRQDEESEFREGIVVDRPTKAGHGSLVNCGLKKEVKIDKKLEPGLRVTVRLNQKQLPECK 60

Query: 233 SPRQVVPPSK-PK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
           + +  V  S+ P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+ 
Sbjct: 61  TYKGTVVSSQDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSNVASAQ 120

Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
           L  P+FRH L+ FGGL GLE  ++ D  L+      +F  Y NTCP QGSRTIRTEEAI 
Sbjct: 121 L--PSFRHALVVFGGLQGLEAGVDADPNLEVSEPSVLFDFYVNTCPSQGSRTIRTEEAIL 178

Query: 351 ISLQYFQEPISRA 363
           ISL   Q  +++A
Sbjct: 179 ISLAALQPGLTQA 191


>gi|440632788|gb|ELR02707.1| hypothetical protein GMDG_05656 [Geomyces destructans 20631-21]
          Length = 345

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/340 (35%), Positives = 172/340 (50%), Gaps = 40/340 (11%)

Query: 41  NKKKKKRKRD-QLNDDAPIEVP------TVSVAVPGSIIDNTQSLELATRLAGQIARAVT 93
           N KK+ R  D + + DA I  P      T S+A+PG+IID   SLE  + +A QIARA++
Sbjct: 7   NNKKRNRNGDVKGSQDAAIYGPGKGRSWTTSLAIPGNIIDMNMSLESRSIIASQIARAIS 66

Query: 94  IFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALF- 152
            FRIDEV++    S+S +   S+AA+  +        A FL+R+L YLETP Y+R  LF 
Sbjct: 67  TFRIDEVIIL---STSTSPLDSTAASTDS--------APFLLRVLSYLETPPYIRDRLFQ 115

Query: 153 ----SMHSSLRFVGMLPPLDA--PHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKH 206
               ++H  L  +  L       PHH    E   +REGVTLK     +   LV+ GL   
Sbjct: 116 PGKSAIHPDLAHISTLSAASTALPHHASFAETL-YREGVTLKAQEEGTYHRLVNCGLPNG 174

Query: 207 VVVDQVLDP---GVRVTVAMG--TNRNLDADSPRQVVPPSKPKES-GMYWGYKVRYAPNI 260
           ++   + DP     RVTV +   T        P +VV PS+P+E  G Y GY+ R +P++
Sbjct: 175 ILA-SIPDPIPKSTRVTVKLSSQTPSGDRTSIPAEVVAPSEPREEIGQYLGYETRTSPSL 233

Query: 261 SSVFKNCSYKGGYDHLIGTSEH-GDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEED--- 316
            S F++C Y GGYD  I  S+  GD   +       F+H+L+AFGG    EE  + D   
Sbjct: 234 VSFFEDCPYDGGYDISILVSDSDGDDAGNISAANSKFQHILLAFGGAENFEELTKNDPKL 293

Query: 317 DGLKRKNAREVFHSYFNTC---PHQGSRTIRTEEAIFISL 353
             L   N  ++F    N     P+    ++R EE + + L
Sbjct: 294 GSLDLDNFADLFDHTVNVAKGLPYVNPGSLRVEELVLVGL 333


>gi|349806181|gb|AEQ18563.1| hypothetical protein [Hymenochirus curtipes]
          Length = 166

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 86/205 (41%), Positives = 108/205 (52%), Gaps = 39/205 (19%)

Query: 148 RKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHV 207
           R++ F  H  L+F G L PLD+PHH+R  E + +REGV L        G+ V+ G+ K V
Sbjct: 1   RRSFFPKHPDLQFAGALNPLDSPHHVRIDEESEYREGVVLDRPTKPGRGSFVNCGMRKEV 60

Query: 208 VVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNC 267
            +D+                                      WGYKVR A  +S+VF  C
Sbjct: 61  QIDK-------------------------------------QWGYKVRLAACLSAVFTEC 83

Query: 268 SYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREV 327
            YK GYD  IGTSE G  V +  ++LP FRH +I FGGL GLE S+E D  L       +
Sbjct: 84  PYKDGYDLSIGTSERGSSVET--VSLPKFRHAVIVFGGLKGLEYSVEGDQTLDISEPSII 141

Query: 328 FHSYFNTCPHQGSRTIRTEEAIFIS 352
           F+ Y NTCP+QGSRTIRTEEAI IS
Sbjct: 142 FNHYLNTCPNQGSRTIRTEEAILIS 166


>gi|149039119|gb|EDL93339.1| rCG45649, isoform CRA_b [Rattus norvegicus]
          Length = 233

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/173 (45%), Positives = 108/173 (62%), Gaps = 4/173 (2%)

Query: 196 GTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK-PK-ESGMYWGYK 253
           G+LV+ G+ K V +D+ LDPG+RVTV +   +  +  + +  V  S+ P+ ++G+YWGY 
Sbjct: 61  GSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSSQDPRTKAGLYWGYT 120

Query: 254 VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESI 313
           VR A  +S+VF    ++ GYD  IGTSE G  V S+ L  P FRH L+ FGGL GLE  +
Sbjct: 121 VRLASCLSAVFAEAPFQDGYDLTIGTSERGSSVASTQL--PNFRHALVVFGGLQGLEAGV 178

Query: 314 EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
           + D  L+  +   +F  Y NTCP QGSRTIRTEEAI ISL   Q  +++   R
Sbjct: 179 DADPNLEVADPSVLFDFYVNTCPSQGSRTIRTEEAILISLAALQPGLTQGGAR 231



 Score = 37.7 bits (86), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 18/25 (72%), Positives = 21/25 (84%)

Query: 62 TVSVAVPGSIIDNTQSLELATRLAG 86
          T+SVA+PGSI+DN QS EL T LAG
Sbjct: 35 TLSVALPGSILDNAQSPELRTYLAG 59


>gi|118396877|ref|XP_001030775.1| hypothetical protein TTHERM_01016150 [Tetrahymena thermophila]
 gi|89285089|gb|EAR83112.1| hypothetical protein TTHERM_01016150 [Tetrahymena thermophila
           SB210]
          Length = 1437

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 96/289 (33%), Positives = 151/289 (52%), Gaps = 30/289 (10%)

Query: 51  QLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD 110
           QLN+   I++  +++ +P SII   QS EL +    ++AR   IF +DE+++  + S   
Sbjct: 303 QLNE--KIKLKHLAICIPDSIISIPQSSELRSYFISELARTAAIFCVDEIIILKDNS--- 357

Query: 111 NYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAP 170
            Y   S         +N     ++VR LQYLETPQYLRK LF +H  L+  G++ P++  
Sbjct: 358 -YIAKS---------QNFQQGPYIVRNLQYLETPQYLRKQLFPIHPDLKNAGLMNPIECH 407

Query: 171 HHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLD 230
           HHL+  ++ P+REGV +      + G+ V++GL K   +   L    RVT+ +     LD
Sbjct: 408 HHLKSDQYCPYREGVVMNRPTKENEGSWVEIGLKKQAKIPYKLVTNTRVTLRINE-PTLD 466

Query: 231 ADSP---RQVVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG-DI 285
            +       VV   +PKE  G+YWGY+VR A  +S VFK      G D+L    +   + 
Sbjct: 467 IEQKYYSADVVSQKEPKEKMGLYWGYQVRLADTLSQVFK------GEDNLRDHQQFSEES 520

Query: 286 VNSSDLTLPTFRHLLIAFGGLAGLEESIEEDD--GLKRKNAREVFHSYF 332
           ++ ++L L   +++LI F G   LE  I+ D+   L  K+A   F + F
Sbjct: 521 ISETELDLED-KNVLIVFSGYNNLESIIDSDEYTKLSGKDALNKFDANF 568


>gi|148676494|gb|EDL08441.1| DNA segment, Chr 2, Wayne State University 81, expressed, isoform
           CRA_a [Mus musculus]
          Length = 241

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 77/173 (44%), Positives = 109/173 (63%), Gaps = 4/173 (2%)

Query: 196 GTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK-PK-ESGMYWGYK 253
           G+LV+ G+ K V +D+ LDPG+RVTV +   +  +  + +  V  S+ P+ ++G+YWGY 
Sbjct: 61  GSLVNCGMKKEVKIDKKLDPGLRVTVRLNQQQLPECKTYKGTVVSSQDPRTKAGLYWGYT 120

Query: 254 VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESI 313
           VR A  +S+VF    ++ GYD  IGTSE G  V S+ L  P+FRH L+ FGGL GLE ++
Sbjct: 121 VRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQL--PSFRHALVVFGGLQGLEAAV 178

Query: 314 EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
           + D  L+  +   +F  Y NTC  QGSRTIRTEEAI ISL   Q  +++   R
Sbjct: 179 DADPNLEVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPGLTQVGSR 231


>gi|145532104|ref|XP_001451813.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419479|emb|CAK84416.1| unnamed protein product [Paramecium tetraurelia]
          Length = 321

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 106/312 (33%), Positives = 157/312 (50%), Gaps = 35/312 (11%)

Query: 58  IEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSA 117
           +E   V++A+P SII N QS EL +    Q+AR   IF++DEV++  +          S 
Sbjct: 33  VEQRHVTIAIPDSIISNAQSSELRSYFMSQLARMFAIFQVDEVIILRD---------YSY 83

Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
             +S + D     A+++VR LQYLETPQYLRK LF +HS L+ VG++ P+++ HHL   +
Sbjct: 84  IPKSKQFD----VASYVVRNLQYLETPQYLRKYLFPIHSDLKNVGLMNPIESKHHLLTEQ 139

Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGT-NRNLDADSPRQ 236
             PFREGV + E       + V++GL + V+++  L PG RVT+ +   N      +P  
Sbjct: 140 VCPFREGVVV-ERPSKGDTSWVEIGLKRQVLINYPLQPGTRVTLRLDDPNSTQLTGTP-- 196

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGT-----SEHGDIVNSS 289
            V     KE G YWGY V     I S F     +  +D   LI T     ++     N S
Sbjct: 197 -VSSQDAKEEGYYWGYTV----TIESKFHRLLERDDFDMKILIETEKSDEAQQTTTFNPS 251

Query: 290 DLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVF----HSYFNTCPHQGSRTIRT 345
            L+      LL+ F GL  + +  E D+  K  N  +++    H Y       G + +R 
Sbjct: 252 QLSQQP-LKLLLLFSGLQKITDFTENDEKSKLSN-EDIYGKFDHVYRYGNNDFGVKQLRL 309

Query: 346 EEAIFISLQYFQ 357
           EE +F+ LQ  Q
Sbjct: 310 EEQMFMFLQSIQ 321


>gi|325958530|ref|YP_004289996.1| hypothetical protein Metbo_0773 [Methanobacterium sp. AL-21]
 gi|325329962|gb|ADZ09024.1| protein of unknown function DUF171 [Methanobacterium sp. AL-21]
          Length = 274

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 107/307 (34%), Positives = 150/307 (48%), Gaps = 58/307 (18%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ VP SI+  T+ L + T   G I R+  IFR D +VV++     DN           
Sbjct: 6   LSIFVPASILSETKDLRIKTYKIGLIGRSAAIFRADRIVVYN-----DN----------- 49

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
            SDE E    F+  +L Y+ TPQYLRK +F +   LR VG+LPPL  PHH     HE   
Sbjct: 50  -SDEKE--VKFISDVLTYMNTPQYLRKKVFPITRELRNVGILPPLRTPHHPTGELHE-GD 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVV------VDQVLDPGVRVTVAMGTNRNLDADSP 234
           +R+G+T+K       GT+VD+G +K  +      V++VL   V   + +G    LD D P
Sbjct: 106 YRQGLTIKRTKK---GTVVDIGADKFALCKEKLSVNKVLSFKV---IKLGKEIILDPDVP 159

Query: 235 RQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSD 290
                         YWGYKV  A N          K G D +IGTS +      +++   
Sbjct: 160 ------------DFYWGYKV-LATNKDLYNSILMMKPGPDLVIGTSRYAPSITSVLDEVK 206

Query: 291 LTLPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
             L   +H  I FGG  +GLEE I +      +N +E+     NT P QG++T+RTEEA+
Sbjct: 207 TKLKGSKHTAILFGGPYSGLEELISD------QNEKEIIDLEVNTVPEQGTKTVRTEEAV 260

Query: 350 FISLQYF 356
             +L  F
Sbjct: 261 LATLSVF 267


>gi|145473603|ref|XP_001462465.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430305|emb|CAK95092.1| unnamed protein product [Paramecium tetraurelia]
          Length = 334

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 104/313 (33%), Positives = 158/313 (50%), Gaps = 37/313 (11%)

Query: 58  IEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSA 117
           +E   V++A+P SII N QS EL +    Q+AR   IF++DEV++  +          S 
Sbjct: 46  VEQRHVTIAIPDSIISNAQSSELRSYFMSQLARMFAIFQVDEVIILRD---------YSY 96

Query: 118 ANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE 177
             +S + D     A+++VR LQYLETPQYLRK LF +HS L+ VG++ P+++ HHL   +
Sbjct: 97  IPKSKQFD----VASYVVRNLQYLETPQYLRKYLFPIHSDLKNVGLMNPIESKHHLLTEQ 152

Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGT-NRNLDADSPRQ 236
             PFREGV + E       + V++GL + V+++  L PG RVT+ +   N      +P  
Sbjct: 153 VCPFREGVVV-ERPSKGDTSWVEIGLKRQVLINYPLQPGTRVTLKLDDPNSTHLTGTP-- 209

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYD-HLIGTSEHGD------IVNSS 289
            V     K+ G YWGY V     I S F     +  +D  ++  +E  D        N S
Sbjct: 210 -VSSQDAKKEGYYWGYTV----TIESKFHRLLERDDFDMKILVETEKSDEAQQTTTFNPS 264

Query: 290 DLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTC-----PHQGSRTIR 344
            L+      LL+ F GL  + +  E D+  K K ++E  +  F+          G + +R
Sbjct: 265 QLSQQP-LKLLLLFSGLQKITDFTENDE--KSKLSKEDIYGKFDHVYRYGNSEYGVKQLR 321

Query: 345 TEEAIFISLQYFQ 357
            EE +F+ LQ  Q
Sbjct: 322 LEEQMFMFLQSIQ 334


>gi|400594356|gb|EJP62211.1| deoxyribose-phosphate aldolase [Beauveria bassiana ARSEF 2860]
          Length = 456

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 93/263 (35%), Positives = 136/263 (51%), Gaps = 41/263 (15%)

Query: 62  TVSVAVPGSI------------------IDNTQSLELAT---RLA--GQIARAVTIFRID 98
           T+S+AVPGSI                  I NT    L T   RL    ++AR+++IF +D
Sbjct: 58  TISIAVPGSILSKYVPHHTAPPFSVFKSITNTHICSLPTAEQRLVTVARLARSLSIFCVD 117

Query: 99  EVVVFDNKSSS--DNYSRSSAANRSNRSDENESGA---AFLVRLLQYLETPQYLRKALFS 153
           EV+V+D+ + S  +  +R   A+ S+ +      A    F+ ++L +LE P ++R+ LF 
Sbjct: 118 EVIVYDDDAPSARNTTTRFDMADGSSNTGIYTGDADPCHFVAQVLSFLECPPFMRRRLFP 177

Query: 154 MHSSLRFVGMLPPLDAPHHLR-KHEWAPFREGVTLKENAPN---------SVGTLVDVGL 203
           +H +LR   +LP LD PHH R K  W P+ EGVT+     +         S G+LVD G+
Sbjct: 178 LHPNLRHTALLPALDMPHHPRAKAPWLPYAEGVTVTAAGEDADTLLLPDGSGGSLVDTGI 237

Query: 204 NKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR--QVVPPSKPK-ESGMYWGYKVRYAPNI 260
              V V  V+ P  RVT+      +    SP     V P +P+ + G YWGY VR + ++
Sbjct: 238 GGLVRVADVIPPNTRVTLRFDGPNDAQQQSPPPPMCVHPDEPRTKGGYYWGYAVRRSASL 297

Query: 261 SSVFKNCSYKGGYDHLIGTSEHG 283
           S VF  C + GGYD  +GTSE G
Sbjct: 298 SGVFTECPHAGGYDLSVGTSERG 320



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 5/66 (7%)

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
           P+F+HLL+ FGG  G+E +   D  L     +    RE+F  + N  P QGSR I T+EA
Sbjct: 382 PSFQHLLVVFGGPRGIELAAAHDPHLAAMQIRGAKTRELFDHWVNVLPGQGSRDIATDEA 441

Query: 349 IFISLQ 354
           + I+L 
Sbjct: 442 VPIALM 447


>gi|51593245|gb|AAH78441.1| D2Wsu81e protein [Mus musculus]
          Length = 179

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 71/144 (49%), Positives = 89/144 (61%), Gaps = 4/144 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA TIF +DE+VVFD +         S     
Sbjct: 35  TLSVALPGSILDNAQSPELRTYLAGQIARACTIFCVDEIVVFDEEGQDTK----SVEGEF 90

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               +       L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 91  RGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEF 150

Query: 182 REGVTLKENAPNSVGTLVDVGLNK 205
           REGV +        G+LV+ G+ K
Sbjct: 151 REGVVVDRPTKAGHGSLVNCGMKK 174


>gi|426363303|ref|XP_004048782.1| PREDICTED: uncharacterized protein C9orf114 homolog [Gorilla
           gorilla gorilla]
          Length = 242

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 77/173 (44%), Positives = 102/173 (58%), Gaps = 4/173 (2%)

Query: 196 GTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR-QVVPPSKPK-ESGMYWGYK 253
           G L   G  + V +D+ L+PG+RVTV +   +  D  +   +VV    P+ ++G+YWGY 
Sbjct: 70  GELGSWGGPQEVKIDKNLEPGLRVTVRLNQQQLPDCKTFHGKVVSSQDPRTKAGLYWGYT 129

Query: 254 VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESI 313
           VR A  +S+VF    ++ GYD  IGTSE G  V S+ L  P FRH L+ FGGL GLE   
Sbjct: 130 VRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASAQL--PNFRHALVVFGGLQGLEAGA 187

Query: 314 EEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRALRR 366
           + D  L+      +F  Y NTCP QGSRTIRTEEAI ISL   Q  +++A  R
Sbjct: 188 DADPNLEVAEPSVLFDLYVNTCPGQGSRTIRTEEAILISLAALQPGLTQAGAR 240


>gi|47202262|emb|CAF88010.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 196

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 68/145 (46%), Positives = 87/145 (60%), Gaps = 4/145 (2%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
           PTVSVAVPGS++DN QS EL + LAGQIARA  +F +DE+VVFD     D     S    
Sbjct: 55  PTVSVAVPGSVLDNAQSPELRSYLAGQIARACVVFCVDEIVVFDE----DGEDLKSVEGE 110

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
                +  +    L R+LQYLE PQYLRK  F  H  L++ G+L PLD+PHH+R  +   
Sbjct: 111 FKGVGKKGNACIQLARILQYLECPQYLRKWFFPKHQDLQYAGLLNPLDSPHHMRTDDECE 170

Query: 181 FREGVTLKENAPNSVGTLVDVGLNK 205
           +REGV L        G+LV+ G+ K
Sbjct: 171 YREGVVLDRPTKAGQGSLVNCGMRK 195


>gi|6841442|gb|AAF29074.1|AF161459_1 HSPC109 [Homo sapiens]
          Length = 388

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 94/252 (37%), Positives = 131/252 (51%), Gaps = 21/252 (8%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+         T L+ +        R+ E V       S    R +   R 
Sbjct: 74  TLSVALPGSIL---------TMLSRRSFAPTWPVRLPEPVPSSVWMRSWCLMRRARMPRL 124

Query: 122 NR--SDENESGAAFLVRLL----QYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRK 175
            R  S E E GA   VR         E PQYLRKA F  H  L+F G+L PLD+PHH+R+
Sbjct: 125 WRGNSQELEEGAG--VRTAGPDPAVPECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQ 182

Query: 176 HEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
            E + FREG+ +        G+ V+ G+ K V +D+ L+PG+RVTV +   ++ D  +  
Sbjct: 183 DEESEFREGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQQHPDCKTYH 242

Query: 236 -QVVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
            +VV    P+ ++G+YWGY VR A  +S+VF    ++ GYD  IGTSE G  V S+  + 
Sbjct: 243 GKVVSSQDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGSDVASA--SF 300

Query: 294 PTFRHLLIAFGG 305
           PT   LL   G 
Sbjct: 301 PTSGMLLWCSGA 312


>gi|297302932|ref|XP_001119071.2| PREDICTED: uncharacterized protein C9orf114-like, partial [Macaca
           mulatta]
          Length = 147

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 67/144 (46%), Positives = 86/144 (59%), Gaps = 4/144 (2%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 4   TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 63

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
            +          L R+LQYLE PQYLRKA F  H  L+F G+L PLD+PHH+R+ E + F
Sbjct: 64  KKG----QACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNPLDSPHHMRQDEESEF 119

Query: 182 REGVTLKENAPNSVGTLVDVGLNK 205
           REG+ +        G+ V+ G+ K
Sbjct: 120 REGIVVDRPTRPGHGSFVNCGMKK 143


>gi|402581918|gb|EJW75865.1| D2Wsu81e protein [Wuchereria bancrofti]
          Length = 247

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 94/266 (35%), Positives = 129/266 (48%), Gaps = 45/266 (16%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+A+PGSI++N QS EL T LAGQIAR+  +F +DE+VV+D  +   +  R    N   
Sbjct: 2   LSIALPGSILNNAQSSELRTYLAGQIARSAAVFCVDEIVVYDETARMKSQQREDYCNGQW 61

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA-PF 181
             D                            +HS      +L PLD PHHLR  + + P+
Sbjct: 62  YPD--------------------------LPVHSG-NVECILNPLDCPHHLRASDLSVPY 94

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVV--DQVLDPGVRVTVAMGTNRNLDADSPRQ--- 236
           REGV L +      G + +VGL K V +  D  L  G RVTV +     LD  S R+   
Sbjct: 95  REGVVLNKPIKAGRGPICNVGLYKEVQINEDVTLKTGTRVTVKI-----LDIYSERKRYH 149

Query: 237 ---VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
              V      +E+G+YWGYKVR A ++            YD +IGTSE G  V+  ++  
Sbjct: 150 GCLVNSKQIKQEAGLYWGYKVRLALSLYDALN----AEEYDVIIGTSERGKPVSKFEMPF 205

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGL 319
                +LI FGGL GLE +IE D  +
Sbjct: 206 SGKNRILIVFGGLEGLETAIEADKNI 231


>gi|148642823|ref|YP_001273336.1| hypothetical protein Msm_0763 [Methanobrevibacter smithii ATCC
           35061]
 gi|148551840|gb|ABQ86968.1| conserved hypothetical protein Msm_0763 [Methanobrevibacter smithii
           ATCC 35061]
          Length = 275

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/303 (32%), Positives = 156/303 (51%), Gaps = 47/303 (15%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV +P S +  ++ L++ T   G + RA+ +F+ID VV+++     D+Y ++       
Sbjct: 6   LSVFIPNSFLSESKDLKVRTFKVGILGRALAVFQIDNVVIYN-----DDYIKN------- 53

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
             +  E    F+  +L Y+ TPQYLRK+ F +   L+ VG+LPPL APHH    + +   
Sbjct: 54  --ENGEEDGEFIAEVLNYMNTPQYLRKSAFPIRPELKHVGILPPLRAPHHPVNSQPDVGD 111

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV-VP 239
           +R+G T+K N     GT VD+G++K     +      ++TV    N  +   + ++V V 
Sbjct: 112 YRQGFTVKRNKK---GTFVDIGMDKLAFCKE------QLTVKKIFNFKITKIAKKEVIVT 162

Query: 240 PSKPKESGMYWGYKVRYAPNISS--VFKNCSYKGGYDHLIGTSEHGDIVNSS----DLTL 293
           P KP +  +YWGY V     ISS    KN       D ++ T+ +GD ++S      L +
Sbjct: 163 PDKPDD--IYWGYNV-----ISSNKSLKNSLKLIKPDFIVETTRYGDYIDSIFDELKLKV 215

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             F+++ I FGG      SI ED  +   N     H   NT P+QG+ T+RTEEA+  +L
Sbjct: 216 DEFKNIAILFGGPYS---SIPED--VSSGNWE---HIKLNTVPNQGTETVRTEEAVVATL 267

Query: 354 QYF 356
             F
Sbjct: 268 SLF 270


>gi|288560107|ref|YP_003423593.1| hypothetical protein mru_0850 [Methanobrevibacter ruminantium M1]
 gi|288542817|gb|ADC46701.1| hypothetical protein mru_0850 [Methanobrevibacter ruminantium M1]
          Length = 271

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 150/300 (50%), Gaps = 43/300 (14%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           VSV +P S +  T+ L+L T   G I RA+ +F +DEVV++ + S  D          S 
Sbjct: 6   VSVFIPNSFLAETKDLKLKTSKVGLIGRALAVFEVDEVVIYKDLSIPD----------SE 55

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
           ++D+ +    F+  +L Y++TPQYLRK    + + LR VG+LPPL  PHH   K E   +
Sbjct: 56  QTDDGD----FIAEVLNYMDTPQYLRKKAIPIKAELRHVGILPPLRVPHHPTGKPELGDY 111

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           R+G T+K N     GT VD+G++K     +      ++TV    +  +   +   +V P 
Sbjct: 112 RQGFTVKRNKK---GTFVDIGMDKLAFCKE------QLTVNKIFSFKITKFAKEVIVTPD 162

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT----FR 297
           +P +  +YWG+K     + +   KN       D ++ T+ + + ++S    L T      
Sbjct: 163 EPDD--IYWGFKTL---STNKSLKNSLKLVNPDLVVETTRYAETIDSIFNELKTKVESSN 217

Query: 298 HLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           H+ I FGG  + + E++E       K          NT P+QG+ T+RTEEA+  +L  F
Sbjct: 218 HIAIVFGGPYSSISENVESSKWETVK---------INTIPNQGTETVRTEEAVISTLAIF 268


>gi|222445054|ref|ZP_03607569.1| hypothetical protein METSMIALI_00671 [Methanobrevibacter smithii
           DSM 2375]
 gi|222434619|gb|EEE41784.1| hypothetical protein METSMIALI_00671 [Methanobrevibacter smithii
           DSM 2375]
          Length = 275

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 96/303 (31%), Positives = 154/303 (50%), Gaps = 47/303 (15%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV +P S +  ++ L++ T   G + RA+ +F+ID VV++++                 
Sbjct: 6   LSVFIPNSFLSESKDLKVRTFKVGILGRALAVFQIDNVVIYNDDHI-------------- 51

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
           +++  E    F+  +L Y+ TPQYLRK+ F +   L+ VG+LPPL APHH    + +   
Sbjct: 52  KNENGEEDGEFIAEVLNYMNTPQYLRKSAFPIRPELKHVGILPPLRAPHHPVNSQPDVGD 111

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV-VP 239
           +R+G T+K N     GT VD+G++K     +      ++TV    N  +   + ++V V 
Sbjct: 112 YRQGFTVKRNKK---GTFVDIGMDKLAFCKE------QLTVKKIFNFKITKIAKKEVIVT 162

Query: 240 PSKPKESGMYWGYKVRYAPNISS--VFKNCSYKGGYDHLIGTSEHGDIVNSS----DLTL 293
           P KP +  +YWGY V     ISS    KN       D ++ T+ +GD ++S      L +
Sbjct: 163 PDKPDD--IYWGYNV-----ISSNKSLKNSLKLIKPDFVVETTRYGDYIDSIFDELKLKV 215

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             F+++ I FGG      SI ED  +   N     H   NT P+QG+ T+RTEEA+  +L
Sbjct: 216 DEFKNIAILFGGPYS---SIPED--VSSGNWE---HIKLNTVPNQGTETVRTEEAVVATL 267

Query: 354 QYF 356
             F
Sbjct: 268 SLF 270


>gi|261350377|ref|ZP_05975794.1| conserved hypothetical protein [Methanobrevibacter smithii DSM
           2374]
 gi|288861160|gb|EFC93458.1| conserved hypothetical protein [Methanobrevibacter smithii DSM
           2374]
          Length = 275

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 96/303 (31%), Positives = 154/303 (50%), Gaps = 47/303 (15%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV +P S +  ++ L++ T   G + RA+ +F+ID VV++++                 
Sbjct: 6   LSVFIPNSFLSESKDLKVRTFKVGILGRALAVFQIDNVVIYNDDHI-------------- 51

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
           +++  E    F+  +L Y+ TPQYLRK+ F +   L+ VG+LPPL APHH    + +   
Sbjct: 52  KNENGEEDGEFIAEVLNYMNTPQYLRKSAFPIRPELKHVGILPPLRAPHHPVNSQPDVGD 111

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV-VP 239
           +R+G T+K N     GT VD+G++K     +      ++TV    N  +   + ++V V 
Sbjct: 112 YRQGFTVKRNKK---GTFVDIGMDKLAFCKE------QLTVKKIFNFKITKIAKKEVIVT 162

Query: 240 PSKPKESGMYWGYKVRYAPNISS--VFKNCSYKGGYDHLIGTSEHGDIVNSS----DLTL 293
           P KP +  +YWGY V     ISS    KN       D ++ T+ +GD ++S      L +
Sbjct: 163 PDKPDD--IYWGYNV-----ISSNKSLKNSLKLIKPDFVVETTRYGDYIDSIFDELKLKV 215

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             F+++ I FGG      SI ED  +   N     H   NT P+QG+ T+RTEEA+  +L
Sbjct: 216 DEFKNIAILFGGPYS---SILED--VSSGNWE---HIKLNTVPNQGTETVRTEEAVVATL 267

Query: 354 QYF 356
             F
Sbjct: 268 SLF 270


>gi|408381923|ref|ZP_11179470.1| hypothetical protein A994_05701 [Methanobacterium formicicum DSM
           3637]
 gi|407815371|gb|EKF85956.1| hypothetical protein A994_05701 [Methanobacterium formicicum DSM
           3637]
          Length = 273

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 90/305 (29%), Positives = 150/305 (49%), Gaps = 53/305 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P S +  T+ L+L T   G I R+  +FR+ ++VV+ +   S             
Sbjct: 6   ISIFIPSSFLRETKDLKLKTYKVGLIGRSAALFRVTKIVVYSDTEDS------------- 52

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
              E++ G  F+  +L Y+ TPQYLRK +F +   LR VG+LPPL  PHH   +     F
Sbjct: 53  ---EDQKGVKFISDILTYMNTPQYLRKKVFPITRELRNVGILPPLRTPHHPTGELHQGDF 109

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           R+G+TLK       GT+VD+G ++  +  +      +++V    +  +D      ++ P 
Sbjct: 110 RQGLTLKRTKK---GTIVDIGADRDALCKE------KLSVNRVLSFRVDKLGKEIIITPD 160

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCS-----YKGGYDHLIGTSEHGD----IVNSSDLT 292
           +P+   +YWGY      ++ S +K+        K   D ++GTS + +    ++N     
Sbjct: 161 EPQ---VYWGY------DVLSTYKSLDDSIDMLKPRPDLVVGTSRYAEPITSVLNEVREG 211

Query: 293 LPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
           L   +H  I FGG  +GL E +         N  +V     NT P+QG++T+RTEEA+  
Sbjct: 212 LRGSKHAAILFGGPYSGLHELM--------GNPGDVLDLEVNTIPNQGTKTVRTEEAVLA 263

Query: 352 SLQYF 356
           +L  F
Sbjct: 264 TLSVF 268


>gi|340506128|gb|EGR32344.1| hypothetical protein IMG5_087030 [Ichthyophthirius multifiliis]
          Length = 491

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 80/247 (32%), Positives = 130/247 (52%), Gaps = 34/247 (13%)

Query: 37  SSCDNKKKKKRKRDQLNDDAP-----------IEVPTVSVAVPGSIIDNTQSLELATRLA 85
           +   NKKK ++  ++     P           +++  +S+ +P SII   QS EL +   
Sbjct: 87  TEIQNKKKDQKNSNEKKQQIPQIINFSQNPTKVKLNQISICIPDSIISIPQSSELRSYFI 146

Query: 86  GQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQ 145
            ++AR   IF +DE+++       DN  +S + N        E G  ++ R LQYLETPQ
Sbjct: 147 SELARTAAIFCVDEIIIL-----KDNSYKSKSTNF-------EQGP-YIARNLQYLETPQ 193

Query: 146 YLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG---TLVDVG 202
           YLRK LF +H  L+  G++ P++  HHL++ ++  +REGV +  + P   G   + V++G
Sbjct: 194 YLRKQLFPIHPDLKNAGLMNPIECHHHLKQDQYCKYREGVVV--SRPIKGGEDSSWVEIG 251

Query: 203 LNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR---QVVPPSKPKES-GMYWGYKVRYAP 258
           L K   V+  L    R+T+ +  +  LD  +     +VV   +PKE  G++WGY VR A 
Sbjct: 252 LKKQAKVNYRLAANTRITLKLEDD-TLDIQNKYYNAEVVSQKEPKEKMGLFWGYSVRLAD 310

Query: 259 NISSVFK 265
           N++ VFK
Sbjct: 311 NLTQVFK 317


>gi|410722183|ref|ZP_11361492.1| hypothetical protein B655_1961 [Methanobacterium sp. Maddingley
           MBC34]
 gi|410597419|gb|EKQ52038.1| hypothetical protein B655_1961 [Methanobacterium sp. Maddingley
           MBC34]
          Length = 273

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 88/305 (28%), Positives = 148/305 (48%), Gaps = 53/305 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P S +  T+ L+L T   G I R+  +FR  ++V++ +                 
Sbjct: 6   ISIFIPSSFLRETKDLKLKTYKVGLIGRSAALFRATKIVIYSD----------------T 49

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
              E+  G  F+  +L Y+ TPQYLRK +F +   LR VG+LPPL  PHH   +     F
Sbjct: 50  EDPEDRKGVKFISDILTYMNTPQYLRKKVFPITRELRNVGILPPLRTPHHPTGELHQGDF 109

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           R+G+TLK       GT+VD+G ++  +  +      +++V    +  +D      ++ P 
Sbjct: 110 RQGLTLKRTKK---GTIVDIGADRDALCKE------KLSVNRVLSFRVDKLGKEIIITPD 160

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCS-----YKGGYDHLIGTSEHGD----IVNSSDLT 292
           +P+   +YWGY      ++ S +K+        K   D ++GTS + +    ++N     
Sbjct: 161 EPQ---VYWGY------DVLSTYKSLDDSIDLLKPQPDLVVGTSRYAEPITSVLNEVKEG 211

Query: 293 LPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFI 351
           L   +H+ I FGG  +GL E +         N  +V     NT P+QG++T+RTEEA+  
Sbjct: 212 LRGSKHVAILFGGPYSGLHELM--------GNPGDVLDLEVNTIPNQGTKTVRTEEAVLA 263

Query: 352 SLQYF 356
           +L  F
Sbjct: 264 TLSVF 268


>gi|154286758|ref|XP_001544174.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150407815|gb|EDN03356.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 206

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 76/192 (39%), Positives = 109/192 (56%), Gaps = 13/192 (6%)

Query: 182 REGVTLKENAPN---SVGTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQV 237
           ++G T K+N  N   +  TLV+ GL + V V  +  P   RVTV  G++  +D     +V
Sbjct: 13  KKGKTNKDNMGNDSPTSYTLVNTGLPEKVRVPYIPTPENTRVTVKFGSS--IDPSDGAEV 70

Query: 238 VPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT-LPT 295
           V P  P+E +G YWGY VR    +S+VF  C + GGYD   GTSE G  ++ +    +P 
Sbjct: 71  VSPFAPREETGYYWGYSVRRCATLSAVFTECPFDGGYDLSFGTSERGCPLSEAIAGEVPK 130

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRK-----NAREVFHSYFNTCPHQGSRTIRTEEAIF 350
           F HL++ FGG+AGLE ++  D  L+ K     +A  VF  + N  P QGSRTIRTEEA++
Sbjct: 131 FEHLIVVFGGVAGLEAAVTADKDLRDKGLGPSDAGRVFDYWVNVLPGQGSRTIRTEEAVW 190

Query: 351 ISLQYFQEPISR 362
           + L   +  + R
Sbjct: 191 LGLMGLRGVVER 202


>gi|383319752|ref|YP_005380593.1| hypothetical protein Mtc_1321 [Methanocella conradii HZ254]
 gi|379321122|gb|AFD00075.1| hypothetical protein Mtc_1321 [Methanocella conradii HZ254]
          Length = 271

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 146/300 (48%), Gaps = 55/300 (18%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P S+ + T    + T   GQ+ARA +IFR+DE+VV+      D+           
Sbjct: 10  LSLLIPASLTEETADPRIKTYKVGQVARAASIFRVDEIVVYKTPKRDDS----------- 58

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
                     F+  +L+Y ETPQYLRK +F M  +LR +G++PPL  P H  + E   +R
Sbjct: 59  ---------RFISTVLRYAETPQYLRKEIFPMQGALRHIGVIPPLRIPSHTSEEE---YR 106

Query: 183 EGVTLKENAPNSVGT----LVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDAD-SPRQ 236
           EG+  K      VGT     VDVG +   ++ D  ++ G RVTV + + R L  +   R 
Sbjct: 107 EGIVTK------VGTDGNAWVDVGSDSPAMLPDAKVEKGQRVTVRIYSRRPLTVELVKRS 160

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS---SDLTL 293
            VP        +YWGY+VR A  +    +    +      I TS  G  +     S++  
Sbjct: 161 DVP--------LYWGYEVRIADTLHDALETDGLR------IATSRLGHPLACEMLSEIKS 206

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
                + +AFG  +   E +  D+G K ++  +      N+ P+QG+ T+R EEAI+++L
Sbjct: 207 KIRDKVSVAFGSPSKGLEQLLHDEGHKLEDHSDCV---VNSIPNQGTATVRAEEAIYVTL 263


>gi|305664237|ref|YP_003860525.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
 gi|304378806|gb|ADM28645.1| Protein of unknown function DUF171 [Ignisphaera aggregans DSM
           17230]
          Length = 297

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/300 (30%), Positives = 144/300 (48%), Gaps = 41/300 (13%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
           P +S+A+P SI+  +       R  G I RA  IFR++E+ ++                 
Sbjct: 15  PYISIAIPSSIVSESPDPREKVRKIGYIGRASAIFRVEEIFIY----------------- 57

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW-A 179
               D++E    ++  LL YLE P YLR+ L     SLR+VG+LPPL  PHH+R+  +  
Sbjct: 58  ---MDDSEENLNYIYELLSYLEIPPYLRRKLVPYKPSLRYVGLLPPLKTPHHVRREIFNT 114

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKH-VVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
             REGV +     N + +++D+GL+K  V   + +  G RVTV +      D      +V
Sbjct: 115 NLREGVVIDS---NDIRSIIDIGLDKKGVAYGRYIPRGSRVTVKI-VREQRDV----YIV 166

Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF-- 296
                KE  +YWGYKV    ++  + +  S K  Y  +IG S+ G  +   +  +     
Sbjct: 167 DIVDEKEIDIYWGYKVYRFKSMKEMLE-YSIKNSY-IVIGASKKGQPLYCLESEIKDILS 224

Query: 297 --RHLLIAFGGLAGLEESIEEDDGLK-RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             + ++I FGG     + I  ++G+   + +R +     N  P QG  ++RTEEAIF  L
Sbjct: 225 GDKRIVIVFGGPRLDIDEIAVNEGIDIYRYSRYI----INFIPRQGVESVRTEEAIFAVL 280


>gi|238505443|ref|XP_002383948.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220690062|gb|EED46412.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 249

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 87/248 (35%), Positives = 121/248 (48%), Gaps = 49/248 (19%)

Query: 170 PHHLRKHEWAPFREGVTL---------------------KENAPNS----VGTLVDVGLN 204
           PHHLR +EW  +REG+ +                     + ++P+S      T++D GL 
Sbjct: 2   PHHLRANEWCEYREGIVVSSSDGGGQRRHSTQMSNYHNNRRHSPSSPTNFSATVIDTGLP 61

Query: 205 KHVVVDQVLDPG-VRVTVAMGTNRNLDADSPRQVVPPSKPK-ESGMYWGYKVRYAPNISS 262
           K VV+  +  P   RVTV      +   +   Q V PS P+ E+G YWGY VR   ++SS
Sbjct: 62  KKVVLPDIQLPEYARVTVRFP---DYGREHYAQPVHPSTPRSEAGYYWGYYVRRCRSLSS 118

Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNS-------------SDLTLPTFRHLLIAFGGLAGL 309
           VF  C + GGYD   GTSE G  V S                  P ++HLLI FGG+AG+
Sbjct: 119 VFTECPFDGGYDLSFGTSERGAPVYSVLEEDRQEHDNYDRRKIPPDYKHLLIVFGGVAGI 178

Query: 310 EESIEEDDGL-----KRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRAL 364
           E ++  D  L     +   A ++F  + N  P QGSRTIRTEEA+++ L   +  +    
Sbjct: 179 EAAVHNDPQLCDMDIRATEAGKLFDYWVNLLPGQGSRTIRTEEAVWLGLTSLRGLVEGTH 238

Query: 365 R-RVEYKS 371
           R R  YKS
Sbjct: 239 RPRPSYKS 246


>gi|312137074|ref|YP_004004411.1| hypothetical protein Mfer_0851 [Methanothermus fervidus DSM 2088]
 gi|311224793|gb|ADP77649.1| Protein of unknown function DUF171 [Methanothermus fervidus DSM
           2088]
          Length = 268

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 87/307 (28%), Positives = 145/307 (47%), Gaps = 57/307 (18%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           V  +S+ +P S+   ++  ++ T   G I R   IFR++E++++ ++             
Sbjct: 2   VSILSIFIPDSLTIESKDKKIKTYKVGIIGRTAAIFRVNEIIIYHDEFG----------- 50

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEW 178
                  N     F+  +L Y++TPQYLRK +F + ++LR+VG+LPPL  PHH   + + 
Sbjct: 51  -------NSKEGKFIKDILTYMDTPQYLRKDIFPLTNNLRYVGVLPPLRTPHHPTGQPKV 103

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTV----AMGTNRNLDADSP 234
              R+G+TLK       GT+V++G  +  +  + L P  +V       +G    ++ D P
Sbjct: 104 GELRQGLTLKSTKK---GTIVNIGAKRLALCKEKL-PIYKVYTFRIKKLGKRILVERDIP 159

Query: 235 RQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD----IVNSSD 290
             V           YWGYKV+Y   I+   +       YD +I TS  G     ++   +
Sbjct: 160 ENV-----------YWGYKVKY---INKPIEKAIKMKKYDSIIATSRKGVPITFVLEEIE 205

Query: 291 LTLPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
             +    HL I FGG   GL  +++ +  +K            NT P+QG+ T+RTEEA+
Sbjct: 206 NIIHNTHHLAILFGGPYKGLPTTLDREIDMK-----------INTIPNQGTETVRTEEAV 254

Query: 350 FISLQYF 356
             +L  F
Sbjct: 255 LATLSIF 261


>gi|281206162|gb|EFA80351.1| DUF171 family protein [Polysphondylium pallidum PN500]
          Length = 531

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 82/249 (32%), Positives = 133/249 (53%), Gaps = 37/249 (14%)

Query: 40  DNKKKKKRKRDQLNDDAP---IEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFR 96
           D K++KK  +  L   AP    +  + S+A+P +I+D+  S E+AT L   I+R V +F 
Sbjct: 153 DIKEQKKEVQSILESSAPKIFAKHASKSIAIPANILDDITSEEMATYLVEMISRMVVLFA 212

Query: 97  IDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHS 156
           +DE++V+          +S+ +N         S +  L++LL+Y+ETP+ +R  LF +  
Sbjct: 213 VDEIIVY----------QSALSN---------STSDKLMKLLEYIETPRNIRYHLFDLDD 253

Query: 157 -SLRFVGMLPPLDAPHHLRKHEWAPFREG-VTLKENAPNSVGTLVDVGL--NKHVVVDQV 212
              +FV  L  ++A HH   + W  +REG VT K  A  S   L+DVGL   K  + D+ 
Sbjct: 254 PDYKFVDKLKKMEAAHHNTTNRWTRYREGVVTDKVEAGTS---LIDVGLGHGKEALADKK 310

Query: 213 LDPGVRVTVAM---GTNRNLDADSPR----QVVPPSKPKESGMYWGYKVRYAPNISSVFK 265
           L PG+RVT+ M     +R  +  + +    ++V P + KE+G YWGY +R+  ++ S+  
Sbjct: 311 LQPGIRVTLEMEEDSKSREQNNSNSKYKKGKLVSPRQVKEAGHYWGYTIRHVKSLDSLDA 370

Query: 266 NCSYKGGYD 274
              Y G YD
Sbjct: 371 ESPY-GSYD 378


>gi|66807919|ref|XP_637682.1| DUF171 family protein [Dictyostelium discoideum AX4]
 gi|60466114|gb|EAL64179.1| DUF171 family protein [Dictyostelium discoideum AX4]
          Length = 467

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 158/317 (49%), Gaps = 44/317 (13%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+P + +   QS E+ T +   I+R VT+ +IDE++VF     +D+Y+   +    
Sbjct: 161 TVSIAIPSNALSGIQSEEMKTYIIEMISRQVTMNKIDEIIVF----KTDDYNEEDS---- 212

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF-VGMLPPLDAPHHLRKHEWAP 180
               + +S    L+++L+Y+ETP YLR  LF+      F V  L  +++    + ++   
Sbjct: 213 ----QQQSDLNLLLKVLEYIETPSYLRDELFNTLEKDYFNVDKLNSINSCQQQQPNDEQL 268

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNL----------- 229
           +REGV   ++  N   ++V+VGL K V++ + L+PG R+T+ +  + N            
Sbjct: 269 YREGVVTNQHWKNQ--SIVNVGLEKLVLIPKRLNPGTRLTIDIKNSSNGDDDGTGGGGDG 326

Query: 230 -----DADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKG--GYDHLIGTSEH 282
                D     +++ PS  K+ G YWG+ +R   +++ V   C Y+    YD++I  S++
Sbjct: 327 NRETNDKYINGKIISPSDIKKEGYYWGHHIRLVDSLAKVASTCQYEDTQNYDYIIMHSQY 386

Query: 283 GDIVNS-SDLTL---PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVF--HSYFNTCP 336
           G+   S +D+ +     + HLL+ F   A    ++  +  L      EV    ++ NT  
Sbjct: 387 GEQFPSYTDVVINNAKKYNHLLVIFAD-ANASSTVNPEFKL----GSEVLVPDTHINTLV 441

Query: 337 HQGSRTIRTEEAIFISL 353
               + +R EE + I+L
Sbjct: 442 DFNIKKLRFEETLSITL 458


>gi|11499510|ref|NP_070751.1| hypothetical protein AF1926 [Archaeoglobus fulgidus DSM 4304]
 gi|2648620|gb|AAB89330.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 256

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/293 (31%), Positives = 150/293 (51%), Gaps = 53/293 (18%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+A+P S + N +  ++ T   G IARA  +FR+DE++++ +                 
Sbjct: 2   LSIAIPSSALINERDEKIKTYKVGLIARAAAVFRVDEIIIYYDPIL-------------- 47

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH-EWAPF 181
             DE++    F+  +L+YLETPQYLRK +F +  SL++ G+LPPL  P H  KH +    
Sbjct: 48  --DESD----FIAEILEYLETPQYLRKHIFPLKKSLKYAGLLPPLAIPSHRSKHLKVGEL 101

Query: 182 REGVTLKENAPNSVGT-LVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
           REGV ++  AP+  GT  VDVG++    +    + G RVTV + +   L  +        
Sbjct: 102 REGV-VRHVAPD--GTRWVDVGIDALAPMKSDAEKGARVTVRICSKTPLRVEE------- 151

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
           ++P+E   YWGY+V+    +  + K         +L+ TS   ++V    L     R + 
Sbjct: 152 AEPQE---YWGYRVKKL-RLEELVKR-------KNLVVTSRKCEVVKPPALR----RDVT 196

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           + FG     EE + E   + ++   E+    +N  P+QG++T+R EEAIF +L
Sbjct: 197 LVFG---NPEEGVFE---IAKRLGVEMEAECWNVVPNQGTKTVRLEEAIFATL 243


>gi|333987680|ref|YP_004520287.1| hypothetical protein MSWAN_1472 [Methanobacterium sp. SWAN-1]
 gi|333825824|gb|AEG18486.1| protein of unknown function DUF171 [Methanobacterium sp. SWAN-1]
          Length = 275

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 144/301 (47%), Gaps = 45/301 (14%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P SII  T+ L + T   G I R+  I++ D +VV+ + S+ +            
Sbjct: 6   LSIFIPASIIAETKDLRIRTYKVGLIGRSAAIYKADRIVVYSDDSTKEE----------- 54

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
                     F+  +L Y+ TPQYLRK +F +   L+ VG+LPPL  PHH   +     +
Sbjct: 55  --------VKFISDVLTYMNTPQYLRKKVFPLTRELKNVGILPPLRTPHHPTGELSLGDY 106

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           R+G+T+K       GT+VD+G  +  +  + L     ++  +     +  D+   ++ P 
Sbjct: 107 RQGLTIKRTKK---GTMVDIGAERLALCKEKLSVNKILSFKV---EKIAKDN--ILIVPD 158

Query: 242 KPKESGMYWGYK-VRYAPNISSVFKNCSYKGGYDHLIGTSEHGD----IVNSSDLTLPTF 296
            P+    YWGYK +    NI    +    K   D +IGTS + +    +++     L   
Sbjct: 159 IPE---FYWGYKTISTYKNIYQSIRMLDPKP--DLVIGTSRYAEPITSVLDEVKERLKGS 213

Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           +H  I FGG  +GL + I +      +  REV     NT P QG++T+RTEEA+  +L  
Sbjct: 214 KHTAILFGGPYSGLHDLISD------QKEREVIDLEVNTVPSQGTKTVRTEEAVISTLSV 267

Query: 356 F 356
           F
Sbjct: 268 F 268


>gi|14591536|ref|NP_143618.1| hypothetical protein PH1779 [Pyrococcus horikoshii OT3]
 gi|3258214|dbj|BAA30897.1| 258aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 258

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 139/298 (46%), Gaps = 56/298 (18%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++ T  L+L     GQIARA  IF ++ + ++             A  R  R 
Sbjct: 5   VFIPDSLLEETSDLKLKAYKVGQIARACAIFGVEHIWIY------------KAGGRDGR- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP---- 180
                    +  LL+Y+ETPQYLRK +F +   LR+ G++PPL  P H  K + +P    
Sbjct: 52  --------LIKTLLEYVETPQYLRKRIFPIMPELRYAGVMPPLQIPSH--KEKTSPRIGE 101

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
            REG   +           D+GL++  +V  + + G       GT + +    P +VVP 
Sbjct: 102 IREGFAFRRGK----KIFADIGLDRPALVKGIAEEG------RGTFKIISV-KPLRVVPA 150

Query: 241 SKPKESGMYWGYKVRYA-PNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P     YWGYKV ++  +++   KN       D +I TS  G  V+  D  +P    +
Sbjct: 151 EPPG----YWGYKVHWSKKSLAKTLKNAD----LDVVIATSRRG--VDVRDAEVPLEGEV 200

Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
            I FG    G+ E +EE    K K     F    NT P+Q +RT+RTEEA+  +L  F
Sbjct: 201 GIVFGSPRKGVFEILEE---FKEKYE---FDLVLNTIPNQKTRTVRTEEAVLATLAIF 252


>gi|328871124|gb|EGG19495.1| DUF171 family protein [Dictyostelium fasciculatum]
          Length = 509

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 70/224 (31%), Positives = 117/224 (52%), Gaps = 29/224 (12%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T S+A+P  I+D+  S E+ T L   IAR + +F IDEV+V+                RS
Sbjct: 182 TRSIAIPNDILDDFASEEMKTYLVEMIARTLVLFSIDEVIVY----------------RS 225

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHS-SLRFVGMLPPLDAPHHLRKHEWAP 180
            +S    S    L +LL++++ P+ +R  L+ ++    +FV  L  +++  H   + W  
Sbjct: 226 TKS--TNSTTDKLCKLLEFIDLPKNIRHHLYDLNDIDYQFVDKLKRVESGQHATSNRWTK 283

Query: 181 FREGVTLKENAPNSVGTLVDVGL--NKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ-- 236
           +REG+ L  +  +   +LVDVGL   K + +D+ L  G+RVT+ M        ++  Q  
Sbjct: 284 YREGIVL--DKLSGKFSLVDVGLGDGKEIKIDKELQAGIRVTLEMEEEHGKQINNANQHR 341

Query: 237 ---VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
              VV P++ +  G YWGY+VR   +I +VF++ S+   YD+ I
Sbjct: 342 KGKVVSPAEVRHDGHYWGYQVRSVDSIDNVFQDSSF-NAYDYKI 384


>gi|413952712|gb|AFW85361.1| hypothetical protein ZEAMMB73_404324 [Zea mays]
          Length = 73

 Score =  114 bits (285), Expect = 7e-23,   Method: Composition-based stats.
 Identities = 55/71 (77%), Positives = 62/71 (87%)

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           RHLLIAFGGLAGLEESIEED  LK K A +VF SYFNTCP+QGSRTIRTEEA+ ISLQYF
Sbjct: 3   RHLLIAFGGLAGLEESIEEDTNLKGKRADDVFTSYFNTCPNQGSRTIRTEEALLISLQYF 62

Query: 357 QEPISRALRRV 367
           Q+PI RA +++
Sbjct: 63  QDPIRRAEQKL 73


>gi|119719149|ref|YP_919644.1| hypothetical protein Tpen_0231 [Thermofilum pendens Hrk 5]
 gi|119524269|gb|ABL77641.1| Protein of unknown function DUF171 [Thermofilum pendens Hrk 5]
          Length = 273

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 97/300 (32%), Positives = 141/300 (47%), Gaps = 56/300 (18%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           +VAVP S   N         +A  +ARAVTIFR++EV ++                    
Sbjct: 12  AVAVPSSTFSNLSEPARTFHVA-LLARAVTIFRVEEVAIY-------------------- 50

Query: 124 SDENESG--AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
               ESG   A L R+L  LE PQYLRK L       R++G++PPL +P HL + E + +
Sbjct: 51  ---QESGHPCAELKRILDALEVPQYLRKYLVPRSREYRYLGLVPPLRSPSHLLRGEESEY 107

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG  +      +   LVDVGL   V  +   D G RVT+        +    R V    
Sbjct: 108 REGYVVSRRGTKA---LVDVGLGNPVEAEVPTDAGRRVTL------RREGAGWRYV---- 154

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
            P+E  +YWGY+ +   ++    ++   +G    L+GTS     V+   L L   +    
Sbjct: 155 SPEEVKVYWGYRTKCYSSLREALEHYRSRGFL--LVGTSRKAAPVS---LVLEEIKREAT 209

Query: 302 AFGGLA--------GLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           A  G+A        GL+E I E++G + ++  +      NT P QG +TIRTEEA++ISL
Sbjct: 210 AKNGVAVFFGTWNKGLQE-ISEEEGFELESFLDFI---VNTAPIQGVKTIRTEEAVYISL 265


>gi|14520559|ref|NP_126034.1| hypothetical protein PAB2119 [Pyrococcus abyssi GE5]
 gi|5457775|emb|CAB49265.1| Hypothetical protein PAB2119 [Pyrococcus abyssi GE5]
 gi|380741086|tpe|CCE69720.1| TPA: hypothetical protein PAB2119 [Pyrococcus abyssi GE5]
          Length = 258

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 134/298 (44%), Gaps = 56/298 (18%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++ T   +L     GQIARA  IF ++ + ++             A  R  R 
Sbjct: 5   VFIPDSLLEETNDPKLKAYKVGQIARACAIFGVEHIWIY------------KAGGRDGR- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP---- 180
                    +  LL+Y ETPQYLRK LF +   LR+ G++PPL  P H  K + +P    
Sbjct: 52  --------LIKTLLEYAETPQYLRKRLFPIMPELRYAGVMPPLQIPSH--KEKGSPRIGE 101

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
            REG   +           D+GL++  +V  + D G       GT + +    P +VVP 
Sbjct: 102 IREGFAFRRGK----KLFADIGLDRPALVKGIADEG------RGTFKIISV-KPLRVVPA 150

Query: 241 SKPKESGMYWGYKVRYA-PNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P     YWGYKV ++  ++    KN       D +I TS  G  V+  D  +P    +
Sbjct: 151 EPPG----YWGYKVHWSRKSLVKTLKNAD----LDVVIATSRRG--VDVRDAEVPLEGEV 200

Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
            I FG    G+ E       LK  N R  F    NT P Q ++T+RTEEA+  +L  F
Sbjct: 201 GIVFGSPRKGVFEI------LKEFNERYEFDLVLNTIPGQKTKTVRTEEAVLATLAIF 252


>gi|282164889|ref|YP_003357274.1| hypothetical protein MCP_2219 [Methanocella paludicola SANAE]
 gi|282157203|dbj|BAI62291.1| conserved hypothetical protein [Methanocella paludicola SANAE]
          Length = 276

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 84/281 (29%), Positives = 133/281 (47%), Gaps = 44/281 (15%)

Query: 78  LELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRL 137
           L + T   GQ+ARA +IFR+DE+V++      D                    + F+  +
Sbjct: 28  LRIKTYKVGQVARAASIFRVDEIVIYKTPIRDD--------------------SKFISTV 67

Query: 138 LQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVGT 197
           L+Y ETPQYLRK +F M   LR +G++PPL  P H    E   +REG+  K  +  +   
Sbjct: 68  LRYAETPQYLRKEIFPMQDDLRHIGVIPPLRTPAHTVT-ENEEYREGIVTKVGSDGN--A 124

Query: 198 LVDVGLNKHVVVDQV--LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVR 255
            VDVG +   ++     +  G RVTV + + R L        V   K  +  +YWGY VR
Sbjct: 125 WVDVGSDSPAMLRDAKNVRKGQRVTVRIYSRRPL-------TVHLVKRSDVPLYWGYDVR 177

Query: 256 YAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS---SDLTLPTFRHLLIAFGGLAGLEES 312
               +    +    +      I TS  G+ +     S++   T   + +AFG  +   E 
Sbjct: 178 IVNTLHDALETEGLR------IATSRLGEPLACEKLSEIKSNTRDKVCVAFGSPSMGLEQ 231

Query: 313 IEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +  D+G K ++  +      N+ PHQG+ T+R+EEA++I+L
Sbjct: 232 LLHDEGHKLEDHSDCV---VNSIPHQGTATVRSEEAVYITL 269


>gi|330796272|ref|XP_003286192.1| hypothetical protein DICPUDRAFT_150135 [Dictyostelium purpureum]
 gi|325083862|gb|EGC37304.1| hypothetical protein DICPUDRAFT_150135 [Dictyostelium purpureum]
          Length = 471

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 150/312 (48%), Gaps = 40/312 (12%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+P   + + +S E+ T +   I+R VT+ +IDE++V+               +  
Sbjct: 171 TVSIAIPSDCLKDIESEEMKTYVIEMISRQVTMNKIDEIIVY------------KTDDYD 218

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALF-SMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
           +  +  +S    L+++L+Y+ETP YLR+ LF ++     +V  L  L++       +   
Sbjct: 219 DTDNTQQSNLNLLLKILEYIETPSYLREELFNTLDKDFFYVDKLNRLNSTQEEVNADDKL 278

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPR----- 235
           +REGV   +   N   +++ VGL K V++ + L+PG RVT+ M      + +  +     
Sbjct: 279 YREGVVTDQQFRNQ--SIISVGLEKQVLISKRLNPGTRVTIDMKQENFTEEELSKIINTT 336

Query: 236 -------QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKG--GYDHLIGTSEHGDIV 286
                  +V+ P   K+ G YWG+ VR   ++  +   C YK    YD+ I  S+ G+  
Sbjct: 337 DEQYTLGRVISPQNIKKEGYYWGHHVRLVNSMDEIASTCEYKDTKTYDYSILLSDFGEQF 396

Query: 287 NS-SDLTLPT---FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVF-HSYFNTCPHQGSR 341
            S +D  L     + HLLI F        + ++ +   R N+  +   ++ NT  +  + 
Sbjct: 397 PSPTDPMLNNAKRYNHLLIIFAN------NHKKFNPADRLNSEVLIPDTHINTLFNFNNV 450

Query: 342 TIRTEEAIFISL 353
            IR EE +F++L
Sbjct: 451 PIRFEENLFVTL 462


>gi|84489709|ref|YP_447941.1| hypothetical protein Msp_0910 [Methanosphaera stadtmanae DSM 3091]
 gi|84373028|gb|ABC57298.1| conserved hypothetical protein [Methanosphaera stadtmanae DSM 3091]
          Length = 279

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 87/295 (29%), Positives = 148/295 (50%), Gaps = 47/295 (15%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ VP S +  ++  ++ T   G I R   +FR + +V++++  +SD  SR  A     
Sbjct: 12  LSIFVPNSFLAESKDSKIRTYKVGLIGRYAALFRANNIVIYND--NSDGGSRDDAL---- 65

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAP 180
                     ++  +L+Y++TPQYLRK +F +   L+ VG+LPPL  PHH    E     
Sbjct: 66  ----------YMKTILEYMDTPQYLRKQVFPITPELKNVGILPPLRTPHHPATDEANVGD 115

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
           FR+G+T K       GT+VD+G+ +  +  +      +++V    +  ++      ++ P
Sbjct: 116 FRKGLTTKRVRK---GTMVDIGVGRLALCKE------KLSVNKVLSFRIEKLGKEILIEP 166

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCS-YKGGYDHLIGTSEHGDIVNS----SDLTLPT 295
            +P    +YWGY V  +   S+++ + +  K   D +IGTS++   +NS       ++  
Sbjct: 167 DEP--DSVYWGYDVITSD--SNLYDSITMMKNKPDLVIGTSKYAPNINSILDEVQTSIQQ 222

Query: 296 FRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
             H+ I FGG  +G+   I E     RK   EV     NT P QG+ T+RTEEA+
Sbjct: 223 ANHVAILFGGPYSGINSLINE-----RKLDYEV-----NTVPKQGTETVRTEEAV 267


>gi|213407844|ref|XP_002174693.1| DUF171 family protein [Schizosaccharomyces japonicus yFS275]
 gi|212002740|gb|EEB08400.1| DUF171 family protein [Schizosaccharomyces japonicus yFS275]
          Length = 326

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 78/228 (34%), Positives = 111/228 (48%), Gaps = 23/228 (10%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+A+    + +  +L L T L  +IA  +  F + EV++ + K  +            
Sbjct: 39  TVSLAIDTGCVLHETNLLLRTALVWRIAGLLLTFNVTEVILVEPKKEAAG---------- 88

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH---LRKHEW 178
             + E ++   F+  LL YLE P YLRKALF ++ +LRF    P L  PHH       + 
Sbjct: 89  --ALEVKNCTEFVETLLSYLEAPPYLRKALFPLNPALRFTSSFPLLAFPHHSTLTTSAQS 146

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM------GTNRNLDAD 232
             +REGV           TLV  GL+K V V      G RVTV++          +L++D
Sbjct: 147 VRYREGVVEGSEPYVRNKTLVYAGLDKPVRVPGKYPRGTRVTVSLKQVPKKTAKSSLNSD 206

Query: 233 SPR-QVVPPSKPKE-SGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIG 278
                VV  S P+E  G+YWGY +R A +      +CSYKGGY H+IG
Sbjct: 207 CFEGDVVSSSAPREKGGLYWGYTIRRAQSWKDAVSSCSYKGGYSHVIG 254


>gi|20093577|ref|NP_613424.1| hypothetical protein MK0137 [Methanopyrus kandleri AV19]
 gi|19886433|gb|AAM01354.1| Uncharacterized conserved protein [Methanopyrus kandleri AV19]
          Length = 255

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 85/292 (29%), Positives = 140/292 (47%), Gaps = 48/292 (16%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           V+VA P S+       ++     G +ARA+ I+R++EV ++ +              R N
Sbjct: 2   VAVAFPWSLFSEETDPKIYAYRVGTLARALAIYRVEEVYLYGD----------GVGTRRN 51

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
                   A  L +LL+Y E PQYLRK +F +   LR+ G++PPL APHH +   +    
Sbjct: 52  --------AERLRKLLEYQECPQYLRKRVFRLDRDLRYAGVMPPLRAPHHKVHSPKEGEV 103

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REG  ++ +     G LVDVG ++         P  RVTV + +   L+ +       P+
Sbjct: 104 REGYVVRRSRN---GALVDVGADRLARTRWRFKPHERVTVRIVSEDPLEVE-------PA 153

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
           +P+E   YWGY+VR    ++ V +   +K G   +I TS +G+ V   +   P      +
Sbjct: 154 EPEE---YWGYRVRIVNELNEVLRE--FKEG---IIVTSRYGEDVREVEFKSPV---KCL 202

Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            FG  +     ++ D G++ +      +   N  P+QG + +RTEEA+  +L
Sbjct: 203 VFG--SSEVSVLDVDPGVRDE------YPVINFVPNQGVQVVRTEEAVHTTL 246


>gi|19112370|ref|NP_595578.1| DUF171 family protein [Schizosaccharomyces pombe 972h-]
 gi|74626596|sp|O13641.1|YGWG_SCHPO RecName: Full=Uncharacterized protein C8D2.16c
 gi|2257535|dbj|BAA21429.1| pi049 [Schizosaccharomyces pombe]
 gi|2950503|emb|CAA17831.1| DUF171 family protein [Schizosaccharomyces pombe]
          Length = 315

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 80/271 (29%), Positives = 135/271 (49%), Gaps = 26/271 (9%)

Query: 48  KRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKS 107
           K  ++    P    T+S+A+P S ++   +L+L T    +I+R V+++ IDE+++ ++  
Sbjct: 3   KESEIQSLPPTRRYTISLALPISSLNVAYNLQLKTSFVWKISRIVSLYGIDEIILLEDPE 62

Query: 108 SSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLP-- 165
              N    + ++ +   D  +    FL  LL Y ETP ++RK LF ++  L++    P  
Sbjct: 63  YVQNTQVHTLSSDAYLKDPTK----FLTDLLCYFETPFFMRKELFPLNPHLKYTSCFPLL 118

Query: 166 PLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVD--QVLDPGVRVTVAM 223
           PL        +   P+REG+    +       +++ GL+ +V+V    VL P  RVTV  
Sbjct: 119 PLRNDKASTVNIEFPYREGIVTHPSPQAKNKYIINAGLSHNVIVSSPSVLAPRTRVTV-- 176

Query: 224 GTNRNLDADSPRQ-------VVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSYKGGYDH 275
                L A SP +       +V  S P+E  G YWGYKVR   + S + K+  YKGGYD 
Sbjct: 177 ----RLKAQSPNEEGQLQGDIVSFSAPREKGGHYWGYKVRSCLS-SQLCKSSPYKGGYDF 231

Query: 276 LIGTSEHGDIVNSSDL--TLP-TFRHLLIAF 303
           ++  +     + S +L  +LP +FRH ++  
Sbjct: 232 VVQINSQTSAITSKELEASLPSSFRHAVLVL 262


>gi|302841751|ref|XP_002952420.1| hypothetical protein VOLCADRAFT_93008 [Volvox carteri f.
           nagariensis]
 gi|300262356|gb|EFJ46563.1| hypothetical protein VOLCADRAFT_93008 [Volvox carteri f.
           nagariensis]
          Length = 224

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 76/224 (33%), Positives = 112/224 (50%), Gaps = 46/224 (20%)

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADS----PR 235
           P+REGV LK       G+ VDVGL++ V ++  L    R+TV +G   NL+ ++    P 
Sbjct: 2   PYREGVVLKSEP--GAGSYVDVGLDRMVWIEHELPQATRLTVHLG---NLEPETRFMTPY 56

Query: 236 -------QVVPPSKPKES-GMYWGYKVRYAPNISSVFKNCSY---KGGYDHLIGTSEHGD 284
                  +VVPP++P+E  G+YWGY VR A  +  VFK+ +     G YD  IG+S  G 
Sbjct: 57  SETMIVGKVVPPARPREQLGLYWGYSVRLALGLQRVFKDAAVTRGSGKYDLTIGSSPLGA 116

Query: 285 IVNSSDLTLPTFRHLLIAFGGLAG--------------------------LEESIEEDDG 318
             +   L LPTF+H L+ FGG AG                          L++ +     
Sbjct: 117 HTDPLSLVLPTFKHALLVFGGGAGGASGGAGGPGSTSSSSSSTRAVPEQDLDQLVPRVPD 176

Query: 319 LKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISR 362
              K  ++VF+ Y NT PH G+  +RTE+A+ I+L Y   P+ +
Sbjct: 177 WGHKQPQDVFNLYLNTAPHLGTLRLRTEDAVPIALSYLITPLLK 220


>gi|57641478|ref|YP_183956.1| hypothetical protein TK1543 [Thermococcus kodakarensis KOD1]
 gi|57159802|dbj|BAD85732.1| hypothetical protein, conserved, DUF171 family [Thermococcus
           kodakarensis KOD1]
          Length = 256

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 89/295 (30%), Positives = 140/295 (47%), Gaps = 52/295 (17%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++ T   ++ T   GQ+ARA +IF ++ + ++             A  R  + 
Sbjct: 5   VFIPDSLLEETDDPKIRTYKVGQVARACSIFGVEHIWIY------------RAGGRDGK- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWAPFR 182
                   F+  +L+Y ETPQYLRK LF +   L++VG++PPL  PHH  K        R
Sbjct: 52  --------FIKTVLEYAETPQYLRKRLFPLMPELKYVGVIPPLRTPHHKLKGRPRVGEIR 103

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           EG   +        T  D+GL++  VV+  ++   R T  + + R      P +VV P+K
Sbjct: 104 EGFAFRRGK----RTYADIGLDELAVVEGNVEG--RATFKIVSTR------PLRVV-PTK 150

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P++   YWGY+V       S+ K    K G D +I TS  G  +   ++  P    +   
Sbjct: 151 PED---YWGYRVHLTRK--SLAKTLK-KAGLDLVIATSRKGRDIREVEIP-PLEGEVGFV 203

Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           FG    G+ E + E+        +  FH   NT P+Q + T+RTEEA+  +L  F
Sbjct: 204 FGSPRKGVMELLGEE--------KFEFHLILNTIPNQRTETVRTEEALLATLAIF 250


>gi|389853118|ref|YP_006355352.1| hypothetical protein Py04_1708 [Pyrococcus sp. ST04]
 gi|388250424|gb|AFK23277.1| hypothetical protein Py04_1708 [Pyrococcus sp. ST04]
          Length = 258

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 85/295 (28%), Positives = 130/295 (44%), Gaps = 50/295 (16%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++ T   +L     GQIARA  IF ++ + ++                R+   
Sbjct: 5   VFIPDSLLEETNDPKLKAYKVGQIARACAIFGVEHIWIY----------------RAGGK 48

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWAPFR 182
           D        +  LL+Y ETPQYLRK LF +   L++ G++PPL  P H  K        R
Sbjct: 49  D-----GKLIKTLLEYAETPQYLRKRLFPLMPELKYAGVMPPLQIPSHKPKTTPRIGEIR 103

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           EG   ++          D+GL++  ++  + + G       GT + +    P +VVP   
Sbjct: 104 EGYAFRKGK----KLFADIGLDRPALIKGIAEEG------RGTFKIISV-KPLRVVPSDP 152

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P     YWGYKV ++            K   D +I TS  G  V+  D  +P    + I 
Sbjct: 153 PD----YWGYKVHWS---RKTLAKTLKKAELDVVIATSRRG--VDVRDAEVPLEGEVGIV 203

Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           FG    G+ E +E      + N +  F    NT P Q ++T+RTEEA+  +L  F
Sbjct: 204 FGSPRMGIMEILE------KYNEKFEFDLIVNTIPGQKTKTVRTEEAVLATLAIF 252


>gi|337283643|ref|YP_004623117.1| hypothetical protein PYCH_01360 [Pyrococcus yayanosii CH1]
 gi|334899577|gb|AEH23845.1| hypothetical protein PYCH_01360 [Pyrococcus yayanosii CH1]
          Length = 298

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 91/295 (30%), Positives = 128/295 (43%), Gaps = 50/295 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T  V +P S+++ T   +L     GQIARA  IF ++ + ++             A  R 
Sbjct: 42  TWHVFIPDSLLEETHDPKLRAYKVGQIARAAAIFGVEHIWIY------------RAGGRD 89

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWA 179
            R         F+  LL+Y ETPQYLRK LF +   LR+ G++PPL  P H  K      
Sbjct: 90  GR---------FIKTLLEYAETPQYLRKRLFPLMPELRYAGVMPPLQIPSHKPKESPRIG 140

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
             REG   ++          D+GL++  +V  V   G       GT + +     R V  
Sbjct: 141 EIREGFAYRKGKK----LFADIGLDRPALVKGVAQEG------RGTFKVISVKPLRVV-- 188

Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
           P+KP+    YWGYKV               K   D +I TS  G  V  +D+TL     +
Sbjct: 189 PTKPEG---YWGYKVHLT---RMTLAKTLKKANLDVVIATSRKGVDVRRADVTLEG--DV 240

Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            I FG    G+ E + E       N    F    NT P Q ++T+RTEEA+  +L
Sbjct: 241 GIVFGSPRRGVMEILREF------NEDYEFDVILNTIPGQKTKTVRTEEAVLATL 289


>gi|223477732|ref|YP_002582390.1| methylase [Thermococcus sp. AM4]
 gi|214032958|gb|EEB73786.1| methylase [Thermococcus sp. AM4]
          Length = 255

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 135/300 (45%), Gaps = 63/300 (21%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++ T   ++ T   GQ+ARA  IF ++ + ++             A  R  + 
Sbjct: 5   VFIPDSLLEETDDPKIRTYKVGQVARACAIFGVEHIWIY------------RAGGRDGK- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LR-KHEWAPFR 182
                   F+  +L+Y+ETPQYLRK LF +   LR+VG++PPL  PHH L+ K +    R
Sbjct: 52  --------FIKTVLEYMETPQYLRKRLFPLMPELRYVGVVPPLRTPHHKLKGKPKVGEIR 103

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           EG   ++          D+GL++  +V+  ++   R T  + + R L        V P+K
Sbjct: 104 EGYAFRKGR----KIYADIGLDELAIVEGNVEG--RATFRIVSTRPLK-------VVPTK 150

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P E   YWGYKV  +            K   D  I TS  G  V   +L  P    + IA
Sbjct: 151 PVE---YWGYKVHLS---GKPLAKTLKKAHLDLAIATSRKGRDVRKVNLP-PLEGEVGIA 203

Query: 303 FGGLAGLEESIEEDDGLKRKNAREV------FHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           FG                RK   E+      F    NT P+Q ++T+RTEEA+  +L  F
Sbjct: 204 FGS--------------PRKGVMELLGGEYEFDFVLNTIPNQKTKTVRTEEALLATLAIF 249


>gi|393796609|ref|ZP_10379973.1| hypothetical protein CNitlB_09876 [Candidatus Nitrosoarchaeum
           limnia BG20]
          Length = 270

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 94/299 (31%), Positives = 140/299 (46%), Gaps = 48/299 (16%)

Query: 63  VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           +S+A+P S + + +SL+L  TR    +ARA +IF+ID + V+       NY         
Sbjct: 3   ISIAIPESALSD-ESLKLDKTRKISVLARACSIFKIDTIYVYQ----EGNY--------- 48

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
            + D N      LV +L+YLETPQ+LR+ LF   + L+F G+L PL  P H      +K 
Sbjct: 49  -KEDGN-----LLVTILKYLETPQFLRRRLFPKMNELKFAGVLLPLRIPSHATPANSKKI 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
                REGV +           VDVG+N+ +      D G RVTV         +  P  
Sbjct: 103 NIGDVREGVVISIKGKR----FVDVGINEAIPFFGKTDVGKRVTV------QFKSGYPDF 152

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LP 294
            V   +  E+ +YWGY V+   N+ S+     ++G   + I TS  G       LT  L 
Sbjct: 153 SVKEIQRNETPLYWGYTVKERSNLFSLL--TEWRG---NTIITSRKGRTATKEQLTKYLS 207

Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +   +L+ FG     E+ + E  G K KN +       N  P+Q + T+R EEA+  +L
Sbjct: 208 SELPILVVFG---SPEKGVHEILGGKMKNIQNA--KTLNFFPNQATETVRLEEALLGTL 261


>gi|21064729|gb|AAM29594.1| RH40775p [Drosophila melanogaster]
          Length = 250

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 58/136 (42%), Positives = 80/136 (58%), Gaps = 1/136 (0%)

Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
           +V P +P+ E+G+YWGY+VR A ++S +F    Y  GYD  +GTS+ G  V+        
Sbjct: 15  LVSPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYN 74

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           F H+LI FGGL GLE ++  D+ L   +   +F  Y N  P QGSRTIRTEEA+ I+L  
Sbjct: 75  FNHMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAA 134

Query: 356 FQEPISRALRRVEYKS 371
            QE +   +  VE  S
Sbjct: 135 LQEKLQPQVADVEIDS 150


>gi|315229855|ref|YP_004070291.1| methylase [Thermococcus barophilus MP]
 gi|315182883|gb|ADT83068.1| methylase [Thermococcus barophilus MP]
          Length = 258

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 134/309 (43%), Gaps = 72/309 (23%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+ + +P S+++ T   ++ T   GQIARA  IF ++ + ++             A  + 
Sbjct: 2   TLHIFIPDSLLEETADPKIRTYKVGQIARAAAIFGVEHIWIY------------KAGGKD 49

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWA 179
            +         F+  +L+Y ETPQYLRK LF +   L++VG++PPL  PHH  K   +  
Sbjct: 50  GK---------FIKLILEYAETPQYLRKTLFPIRKELKYVGVIPPLRTPHHKLKGRPKLG 100

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNR---NLDADSPRQ 236
             REGV +K+          D+GL++  +V+             G  R    + + +P +
Sbjct: 101 EIREGVIIKKGK----RLYADIGLDELALVE-----------GSGEGRMTFKIVSVNPLK 145

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           VV P+KP E   YWGY+V      +        K   D  I TS  G+ V          
Sbjct: 146 VV-PAKPAE---YWGYRVHLT---NKPLAKTLKKARLDLAIATSRKGEDVRKVK------ 192

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSY---------FNTCPHQGSRTIRTEE 347
                    L  LE  I    G  RK   E+   +          NT P+Q ++T+RTEE
Sbjct: 193 ---------LPPLEGDIGFVFGSPRKGVMEILRDFNEDYPFDLILNTIPNQKTKTVRTEE 243

Query: 348 AIFISLQYF 356
           A+  +L  F
Sbjct: 244 AVLATLAVF 252


>gi|330508499|ref|YP_004384927.1| hypothetical protein MCON_2704 [Methanosaeta concilii GP6]
 gi|328929307|gb|AEB69109.1| conserved hypothetical protein [Methanosaeta concilii GP6]
          Length = 279

 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 80/299 (26%), Positives = 132/299 (44%), Gaps = 46/299 (15%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           ++ +P S+   ++   + T   GQIARA  +FR+  +V++ ++  +D             
Sbjct: 8   AILIPSSLTMESRDTRVNTLKVGQIARAAAVFRVSRIVIYRDQEYND------------- 54

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-----LRKHEW 178
                  + F+  +L+Y+ETPQYLRK L      LR VGMLPPL   HH         + 
Sbjct: 55  -------SRFISMILRYMETPQYLRKLLIPRKEELRHVGMLPPLRTSHHPINSKSESLKI 107

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
             FR G  + E+  +  G  V++G+N+ +     L  G +  V    N  + +  P    
Sbjct: 108 GEFRVGAVV-ESVGSDGGVWVEIGINRPI----PLRTGEKYPVGQRLNVRIFSQEPLAAE 162

Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF-- 296
           P  + K+   YWGY+     ++ S  ++         ++  S  G  +    L       
Sbjct: 163 PVDQ-KDIPHYWGYETEVVDSLESYLESMRDT----RIVLASRSGRAITPQALLELVQSG 217

Query: 297 --RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             R L +AFG  A   ++    + +KR       +   NT PHQG+ T+R EEA+F +L
Sbjct: 218 NKRDLAVAFGSPARGLDAFLSKESMKR-------YEMINTIPHQGTETVRVEEAVFATL 269


>gi|352682603|ref|YP_004893127.1| hypothetical protein TTX_1417 [Thermoproteus tenax Kra 1]
 gi|350275402|emb|CCC82049.1| conserved hypothetical protein [Thermoproteus tenax Kra 1]
          Length = 270

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 87/293 (29%), Positives = 130/293 (44%), Gaps = 44/293 (15%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           + VA+P  ++      E   R  G IARA  IFR+++++++                   
Sbjct: 1   MDVAIPHDVLSEAPDEESKVRKLGYIARAAAIFRVEKIIIY------------------T 42

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWAP 180
             D +  G   +  LL+Y  TP +LR+ +F +   LR  G+LPPL  P HL  R      
Sbjct: 43  YGDVDHEGVEEMRLLLEYAATPPHLRRKVFRLDKRLRLAGLLPPLKIPSHLAPRSPAVGD 102

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
             EG+  + +   S   +V +G  K   V +    G R+ V +   R  D D  R VV  
Sbjct: 103 VVEGIVERWDGYYS---MVYIGAGKWAKVPKPYPVGSRLIVKLEAQR--DKDVFRGVVLK 157

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
           + P     YWGYKV        V        GYD++I T + G  V    L  P  + L+
Sbjct: 158 NPPD----YWGYKV-------EVKGLRELADGYDYVILTGKEGRSV-CEILPRPEGKTLV 205

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +  G  AG++E       + ++   E+   + N  P QG+ TIRTEEA+FI L
Sbjct: 206 VFGGPRAGVDE-------IMKREGVELRGLFLNFAPKQGTETIRTEEAMFIVL 251


>gi|332158025|ref|YP_004423304.1| hypothetical protein PNA2_0383 [Pyrococcus sp. NA2]
 gi|331033488|gb|AEC51300.1| hypothetical protein PNA2_0383 [Pyrococcus sp. NA2]
          Length = 258

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 132/298 (44%), Gaps = 56/298 (18%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++ T   +L     GQIARA  IF ++ + ++             A  +  R 
Sbjct: 5   VFIPDSLLEETNDPKLKAYKVGQIARACAIFGVEHIWIY------------KAGGKDGR- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP---- 180
                   F+  LL+Y ETPQYLRK LF +   LR+ G++PPL  P H  K + +P    
Sbjct: 52  --------FIKTLLEYAETPQYLRKRLFPIMPELRYAGVMPPLQIPSH--KEKTSPRIGE 101

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
            REG   +           D+GL +  +V  + + G       GT + +    P +VVP 
Sbjct: 102 IREGFAFRRGK----KLFADIGLERPALVKGIAEEG------RGTFKIISV-KPLRVVPA 150

Query: 241 SKPKESGMYWGYKVRYA-PNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             P     YWGYKV  +   ++   KN       D +I TS  G  V+  +  +P    +
Sbjct: 151 EPP----TYWGYKVHLSRKTLAKTLKNAD----LDVVIATSRLG--VDVREAEVPLEGEV 200

Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
            I FG    G+ E       L+  N    F    NT P Q ++T+RTEEA+  +L  F
Sbjct: 201 GIVFGSPKKGVFEI------LRSFNEEFEFDLVLNTIPGQKTKTVRTEEAVLATLAIF 252


>gi|240104061|ref|YP_002960370.1| hypothetical protein TGAM_2004 [Thermococcus gammatolerans EJ3]
 gi|239911615|gb|ACS34506.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
          Length = 255

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 89/300 (29%), Positives = 135/300 (45%), Gaps = 63/300 (21%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++ T   ++ T   GQ+ARA  IF ++ + ++             A  R  + 
Sbjct: 5   VFIPDSLLEETDDPKIRTYKVGQVARACAIFGVEHIWIY------------RAGGRDGK- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LR-KHEWAPFR 182
                   F+  +L+Y+ETPQYLRK LF +   LR+VG++PPL  PHH L+ K +    R
Sbjct: 52  --------FIKTVLEYMETPQYLRKRLFPLMPELRYVGVVPPLRTPHHKLKGKPKVGEIR 103

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           EG   ++          D+GL++  +V+  ++   R T  + + R L        V P+K
Sbjct: 104 EGYAFRKGR----KIYADIGLDELAIVEGNIEG--RATFRVVSTRPLK-------VVPAK 150

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P E   YWGY+V  +            K   D  I TS  G  V   +L  P    + IA
Sbjct: 151 PVE---YWGYRVHLS---GKPLAKTLKKAHLDLAIATSRKGRDVRKVNLP-PLEGEVGIA 203

Query: 303 FGGLAGLEESIEEDDGLKRKNAREV------FHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           FG                RK   E+      F    NT P+Q ++T+RTEEA+  +L  F
Sbjct: 204 FGS--------------PRKGVMELLGGEYEFDFVLNTIPNQKTKTVRTEEALLATLAIF 249


>gi|386876304|ref|ZP_10118426.1| hypothetical protein BD31_I1465 [Candidatus Nitrosopumilus salaria
           BD31]
 gi|386805880|gb|EIJ65377.1| hypothetical protein BD31_I1465 [Candidatus Nitrosopumilus salaria
           BD31]
          Length = 270

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 89/299 (29%), Positives = 138/299 (46%), Gaps = 48/299 (16%)

Query: 63  VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           +SVA+P S + + +SL++  TR    +ARA  IF+ID + V+   ++S++          
Sbjct: 3   LSVAIPDSALSD-ESLKIDKTRKISVLARACAIFKIDTIYVYQEGNNSND---------- 51

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
                     + +V +L+YLETPQ+LR+ LF   + L+F G+L PL  P H+     +K 
Sbjct: 52  ---------GSLMVMILKYLETPQFLRRRLFPKMNDLKFAGVLQPLKIPSHITPTNSKKI 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
                REG+ +           VDVG+N+ +        G RVT+            P+ 
Sbjct: 103 NAGDVREGIVVSVKGKR----FVDVGINQLIPFFGDTSIGKRVTL------RFKEGYPKL 152

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
            V      ES  YWGY V+   N+ S+     +KGG   +I TS  G       +   T 
Sbjct: 153 SVKEIDRSESPDYWGYTVKERSNLYSIL--ADWKGG---IIITSRKGKTATKEQIAKYTK 207

Query: 297 --RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             + +LI FG     E+ + E  G K KN +       N  P+Q + T+R EEA+  +L
Sbjct: 208 SDQPVLIVFG---SPEKGVHEIIGGKMKNVQNA--KALNFFPNQATETVRLEEALLGTL 261


>gi|45552535|ref|NP_995790.1| CG12128, isoform B [Drosophila melanogaster]
 gi|45445612|gb|AAS64882.1| CG12128, isoform B [Drosophila melanogaster]
          Length = 250

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 57/133 (42%), Positives = 79/133 (59%), Gaps = 1/133 (0%)

Query: 237 VVPPSKPK-ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
           +V P +P+ E+G+YWGY+VR A ++S +F    Y  GYD  +GTS+ G  V+        
Sbjct: 15  LVSPDEPRRETGVYWGYQVRIAHSMSEIFTKSPYATGYDVTVGTSDRGTNVHEVPNRSYN 74

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           F H+LI FGGL GLE ++  D+ L   +   +F  Y N  P QGSRTIRTEEA+ I+L  
Sbjct: 75  FNHMLIVFGGLQGLESALANDEKLTVDDPELLFDHYVNVLPRQGSRTIRTEEALLIALAA 134

Query: 356 FQEPISRALRRVE 368
            QE +   +  VE
Sbjct: 135 LQEKLQPQVADVE 147


>gi|212223210|ref|YP_002306446.1| hypothetical protein TON_0065 [Thermococcus onnurineus NA1]
 gi|212008167|gb|ACJ15549.1| hypothetical protein, conserved, DUF171 family [Thermococcus
           onnurineus NA1]
          Length = 305

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 91/295 (30%), Positives = 134/295 (45%), Gaps = 52/295 (17%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           + +P S+++ T   ++ T   GQIARA  IF I+ + ++             A  +  R 
Sbjct: 54  IFIPDSLLEETDDPKIRTYKVGQIARAAAIFGIEHIWIY------------RAGGKDGR- 100

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LR-KHEWAPFR 182
                   F+  +L+Y ETPQYLRK LF +   L++VG++PPL  PHH L+ K +    R
Sbjct: 101 --------FIKLILEYAETPQYLRKRLFPLMPELKYVGVIPPLRTPHHKLKGKPKVGEIR 152

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           EG   +           D+GL++  +V    D   R T  + + R L        V P+K
Sbjct: 153 EGFAFRRGR----RVYADIGLDELALVGG--DVEGRATFRIVSTRPLR-------VIPAK 199

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P+E   YWGY+V       S+ K    K G D  I TS  G  +    L  P    +   
Sbjct: 200 PEE---YWGYRVHLTKK--SLAKTLK-KAGLDLAIATSRKGQDIREVKLP-PLEGEVGFI 252

Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           FG    G+ E + E+D          F    NT P+Q + T+RTEEA+  +L  F
Sbjct: 253 FGSPRKGVMELLGEED--------YSFDLILNTIPNQRTATVRTEEAVLATLAVF 299


>gi|386001186|ref|YP_005919485.1| hypothetical protein Mhar_0477 [Methanosaeta harundinacea 6Ac]
 gi|357209242|gb|AET63862.1| hypothetical protein Mhar_0477 [Methanosaeta harundinacea 6Ac]
          Length = 279

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 136/300 (45%), Gaps = 49/300 (16%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P S    ++   + T   G IARA  +FR+D +V++                    
Sbjct: 8   SILIPSSATMESRDERVRTLKVGMIARAAAVFRVDRIVIY-------------------- 47

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP--- 180
            DE    + F+ R+LQY ETPQYLRK LF    +LR+VG+LPPL  PHH +  + +    
Sbjct: 48  RDEEFDDSRFISRVLQYAETPQYLRKRLFPRERALRYVGILPPLRTPHHPKSSKVSTLKV 107

Query: 181 --FREGVTLKENAPNSVGTLVDVGLNKHVVVDQV--LDPGVRVTVAMGTNRNLDADSPRQ 236
             +R G+ + E   +  G  V++G+ + + +        G R+ V + + + L A+    
Sbjct: 108 GEYRVGLVVDEVGSDD-GAWVEIGVERPLPLRTAKRFSAGQRLNVRVFSLKPLAAE---- 162

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
              P    E   YWGY+    P++    ++          +GT    D++ +  +     
Sbjct: 163 ---PVDRSEIPHYWGYETEVIPDLDEHVRSSGRLVVATSRLGTPATPDLL-ARVVRQGQG 218

Query: 297 RHLLIAFGGLA-GLEE--SIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           R L + +G  A G++   S E  +G              NT PHQG+ T+R EEAIF +L
Sbjct: 219 RGLELVYGSPARGVDAFLSSETMEGC----------CVINTIPHQGTETVRLEEAIFATL 268


>gi|327400873|ref|YP_004341712.1| hypothetical protein Arcve_0986 [Archaeoglobus veneficus SNP6]
 gi|327316381|gb|AEA46997.1| protein of unknown function DUF171 [Archaeoglobus veneficus SNP6]
          Length = 261

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 90/299 (30%), Positives = 144/299 (48%), Gaps = 57/299 (19%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+A+P S + N +  ++     GQIARA  IFR+DE++++                R  +
Sbjct: 3   SIAIPSSALINERDEKIKAYKVGQIARAAAIFRVDEIIIY----------------RDPK 46

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH-EWAPFR 182
            DE++    F+  +LQYLETPQYLRK L  +  SL++ G+LPPL  P H  K  +    R
Sbjct: 47  LDESD----FIADVLQYLETPQYLRKYLIPIKPSLKYAGVLPPLRIPSHRPKDLKIGEVR 102

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQ-----VLDPGVRVTVAMGTNRNLDADSPRQV 237
           +GV ++   P+     VDVG+    ++ +      L  G RVTV + + + L  +     
Sbjct: 103 DGV-VRRVGPDGTA-WVDVGVKALALLKREKSRYKLKSGARVTVRVCSTKPLVVEE---- 156

Query: 238 VPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG---DIVNSSDLTLP 294
               +P+E   YWGYKVR A  +  V +        + ++ TS+      I   + L  P
Sbjct: 157 ---VEPREVEEYWGYKVRKA-ELDEVLRR-------EEVVLTSKRCHKPGIGEIASLKSP 205

Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           T     + FG     EE + E   +  +   ++    +N  P+QG+ T+R EEA+  +L
Sbjct: 206 T-----LVFGSP---EEGVHE---IAARLGVKLPSRCWNIVPNQGTETVRLEEAVIAAL 253


>gi|329765070|ref|ZP_08256654.1| hypothetical protein Nlim_0412 [Candidatus Nitrosoarchaeum limnia
           SFB1]
 gi|329138447|gb|EGG42699.1| hypothetical protein Nlim_0412 [Candidatus Nitrosoarchaeum limnia
           SFB1]
          Length = 270

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 94/299 (31%), Positives = 139/299 (46%), Gaps = 48/299 (16%)

Query: 63  VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           +S+A+P S + + +SL+L  TR    +ARA +IF+ID + V+       NY         
Sbjct: 3   ISIAIPESALSD-ESLKLDKTRKISVLARACSIFKIDTIYVYQ----EGNY--------- 48

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
            + D N      LV +L+YLETPQ+LR+ LF   + L+F G+L PL  P H      +K 
Sbjct: 49  -KEDGN-----LLVTILKYLETPQFLRRRLFPKMNELKFAGVLLPLRIPSHATPANSKKI 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
                REGV +           VDVG+N+ +      D G RVTV         +  P  
Sbjct: 103 NIGDVREGVVISIKGKR----FVDVGINEVIPFFGKTDVGKRVTV------QFKSGYPDF 152

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LP 294
            V   +  E+ +YWGY V+   N+ S+     ++G   + I TS  G       LT  L 
Sbjct: 153 SVKEIQRNETPLYWGYTVKERSNLFSLL--TEWRG---NTIITSRKGRTATKEQLTKYLR 207

Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +   +L+ FG     E+ + E  G K KN +       N  P Q + T+R EEA+  +L
Sbjct: 208 SDLPILVVFG---SPEKGVHEILGGKMKNIQNA--KTLNFFPSQATETVRLEEALLGTL 261


>gi|304314252|ref|YP_003849399.1| RNA-binding protein [Methanothermobacter marburgensis str. Marburg]
 gi|302587711|gb|ADL58086.1| predicted RNA-binding protein [Methanothermobacter marburgensis
           str. Marburg]
          Length = 266

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 138/301 (45%), Gaps = 56/301 (18%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P S+   T  L++ T   G I RA  IF ++ +V++                   
Sbjct: 6   LSIFIPDSLTAETGDLKIKTYKVGLIGRAAAIFGVNRIVIY------------------- 46

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
             D+ +  A F+  +L Y++TPQYLR+ +F +   L+ VG+LPPL  PHH   K     +
Sbjct: 47  -HDDADGEAGFIRDILNYMDTPQYLRRKVFPIMKELKHVGILPPLRTPHHPTGKPVAGEY 105

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           R+G+T+K       GTLVD+G +K  +  +      ++TV    +  +       ++ P 
Sbjct: 106 RQGLTVKRVKK---GTLVDIGADKLALCRE------KLTVNRVMSFRVIRLGKEILIEPD 156

Query: 242 KPKESGMYWGYKVRYAPN-ISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
           +P++   YWGY+V      +S   K        D ++ TS +      I++     +   
Sbjct: 157 EPEDR--YWGYEVLSTEQGLSKSLKTVD----ADVVVATSRYASPITSILDEVKSKVEGA 210

Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           R + I FGG   GL E I+ D              + NT P Q + T+RTEEA+  +L  
Sbjct: 211 RRVAILFGGPYKGLPE-IDAD-------------VWVNTIPGQRTETVRTEEAVLATLSV 256

Query: 356 F 356
           F
Sbjct: 257 F 257


>gi|325969276|ref|YP_004245468.1| hypothetical protein VMUT_1765 [Vulcanisaeta moutnovskia 768-28]
 gi|323708479|gb|ADY01966.1| hypothetical protein VMUT_1765 [Vulcanisaeta moutnovskia 768-28]
          Length = 292

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 78/298 (26%), Positives = 133/298 (44%), Gaps = 38/298 (12%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
           P + +A+P   +  +     A R  G I R   IF++  ++V+ ++ + D          
Sbjct: 14  PIIDIAIPSDFLAESPDNREAIRKIGYIGRGAAIFQVSRIIVYRHRLAGDR--------- 64

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR--KHEW 178
                     A F+V+ L+YL TP YLRK LF +   L++VG+LPPL  P+H    K E 
Sbjct: 65  --------DKAEFIVKNLRYLSTPPYLRKDLFKLDRDLKYVGLLPPLKTPNHAPEGKPER 116

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
             +REG+ +K +   S+     +G   +  V + +  G RV V +      D      VV
Sbjct: 117 GEYREGIVIKWDGYYSIAK---IGDGIYAKVPRPMPLGTRVVVQIDAPTTRDDTYRAHVV 173

Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR- 297
           P  +     +YWG++     ++S +F   SY      ++ T + G  + ++   +  F  
Sbjct: 174 PRDR---LSIYWGFESEVM-DVSKLFDEYSY------VVLTGKEGISIKNAIEQINDFLS 223

Query: 298 --HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
               L+ FG      + I   +G++    +   + + N  P QG  T+RTEEA+   L
Sbjct: 224 GDRFLVVFGSPYHGVDEILRAEGMENTLKK---YPFINFIPGQGVETVRTEEAVIAVL 278


>gi|375083295|ref|ZP_09730321.1| hypothetical protein OCC_12206 [Thermococcus litoralis DSM 5473]
 gi|374742026|gb|EHR78438.1| hypothetical protein OCC_12206 [Thermococcus litoralis DSM 5473]
          Length = 258

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/301 (28%), Positives = 136/301 (45%), Gaps = 62/301 (20%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           + +P S+++ T   ++ T   GQI RA  IF ++ V ++             A  +  + 
Sbjct: 5   IFIPDSLLEETSDPKIRTYKVGQIGRAAAIFGVEHVWIY------------KAGGKDGK- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH--EWAPFR 182
                   F+  +L+Y+ETPQYLRK+L  +   LR+VG+LPPL  PHH  K   +    R
Sbjct: 52  --------FIKLILEYMETPQYLRKSLIPLTKELRYVGVLPPLRTPHHKLKGRPKLGEIR 103

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNR---NLDADSPRQVVP 239
           EG+ +++          D+GL++  +V+             G  R    + +  P +VV 
Sbjct: 104 EGIVIRKGK----RLYADIGLDELALVE-----------GSGEGRMTFKIVSLKPLKVV- 147

Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
           PSKP+E   YWGY+V       S+ K    K   +  I TS  G+ V   +L  P    +
Sbjct: 148 PSKPEE---YWGYRVHLTRK--SLAKTLK-KAKLNLAIATSRKGEDVRKVNLP-PLEGEV 200

Query: 300 LIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
              FG    G+  +     ED           F    NT P+Q ++T+RTEEA+  +L  
Sbjct: 201 GFVFGSPRKGIMEILRDFNED---------YPFDLILNTIPNQKTKTVRTEEAVLATLAI 251

Query: 356 F 356
           F
Sbjct: 252 F 252


>gi|242398278|ref|YP_002993702.1| hypothetical protein TSIB_0287 [Thermococcus sibiricus MM 739]
 gi|242264671|gb|ACS89353.1| hypothetical protein TSIB_0287 [Thermococcus sibiricus MM 739]
          Length = 261

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/299 (28%), Positives = 133/299 (44%), Gaps = 58/299 (19%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           + +P S+++ T   ++ T   GQI RA  IF ++ + ++             A  R  + 
Sbjct: 8   IFIPDSLLEETSDPKIKTYKVGQIGRAAAIFGVEHIWIY------------KAGGREGK- 54

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPFR 182
                   F+  +L+Y+ETPQYLRK L  +   L++ G+LPPL  PHH  K E      R
Sbjct: 55  --------FIKLVLEYMETPQYLRKTLIPLTKELKYAGILPPLRTPHHKLKREPKVGEIR 106

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP--- 239
           EGV +K+          D+GL++  +V+             G+           V+P   
Sbjct: 107 EGVIIKKGK----RLYADIGLDELALVE-------------GSGEGRMTFEIISVIPLKV 149

Query: 240 -PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
            P+KP+E   YWGY+V       S+ K    K   +  I TS  G+ V   +L  P    
Sbjct: 150 APTKPRE---YWGYRVHLTR--MSLAKTLK-KAKLNLAIATSRMGEDVRKVNLP-PLEGE 202

Query: 299 LLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           +   FG    G+ E       L+  N    F    NT P+Q ++T+RTEEA+ ++L  F
Sbjct: 203 VGFVFGSPRKGIMEI------LRDFNEDYPFDLILNTIPNQKTKTVRTEEAVLVTLAIF 255


>gi|341582064|ref|YP_004762556.1| hypothetical protein GQS_04890 [Thermococcus sp. 4557]
 gi|340809722|gb|AEK72879.1| hypothetical protein GQS_04890 [Thermococcus sp. 4557]
          Length = 256

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 92/295 (31%), Positives = 132/295 (44%), Gaps = 52/295 (17%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           + +P S+++ T   ++ T   GQIARA  IF ++ V V+             A  R  R 
Sbjct: 5   IFIPDSLLEETDDPKIRTYKVGQIARAAAIFGVEHVWVY------------RAGGRDGR- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPFR 182
                   F+  +L+Y ETPQYLRK LF +   LR+VG++PPL  PHH  K +      R
Sbjct: 52  --------FIKTILEYAETPQYLRKRLFPLMPELRYVGVIPPLRTPHHKLKGKPRVGEIR 103

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           EG   ++       T  D+GL+    V+   D   R T  + + R L        V P+K
Sbjct: 104 EGFAFRKGR----RTYADIGLDDLAAVEG--DVEGRATFRIVSVRPLR-------VIPAK 150

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P+E   YWGY+V       S+ K    K   D  I TS  G  +    L  P    +   
Sbjct: 151 PEE---YWGYRVHLTRK--SLAKTLK-KARLDLAIATSRKGRDIREVKLP-PLEGEVGFV 203

Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           FG    G+ E + E++          F    NT P+Q + T+RTEEA+  +L  F
Sbjct: 204 FGSPRKGVMELLGEEE--------YDFDLILNTIPNQRTATVRTEEAVLATLAVF 250


>gi|15678032|ref|NP_275146.1| hypothetical protein MTH1 [Methanothermobacter thermautotrophicus
           str. Delta H]
 gi|2621037|gb|AAB84510.1| conserved protein [Methanothermobacter thermautotrophicus str.
           Delta H]
          Length = 268

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 85/301 (28%), Positives = 139/301 (46%), Gaps = 56/301 (18%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P S+   T  L++ T     IARA +IF +  +V++                   
Sbjct: 6   LSIFIPDSLTAETGDLKIKTYKVVLIARAASIFGVKRIVIY------------------- 46

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
             D+ +  A F+  +L Y++TPQYLR+ +F +   L+ VG+LPPL  PHH   K     +
Sbjct: 47  -HDDADGEARFIRDILTYMDTPQYLRRKVFPIMRELKHVGILPPLRTPHHPTGKPVTGEY 105

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           R+G+T+K       GTLVD+G +K  +  +      ++TV    +  +       ++ P 
Sbjct: 106 RQGLTVKRVKK---GTLVDIGADKLALCRE------KLTVNRIMSFRVVRLGKEILIEPD 156

Query: 242 KPKESGMYWGYKVR-YAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
           +P++   YWGY+V     N++   K      G D ++ TS +      I++     +   
Sbjct: 157 EPEDR--YWGYEVLDTRRNLAESLKTV----GADVVVATSRNASPITSILDEVKTRMRGA 210

Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           R   I FGG   GL E I+ D              + NT P Q + T+RTEEA+  +L  
Sbjct: 211 REAAILFGGPYKGLPE-IDAD-------------IWVNTLPGQCTETVRTEEAVLATLSV 256

Query: 356 F 356
           F
Sbjct: 257 F 257


>gi|288931509|ref|YP_003435569.1| hypothetical protein Ferp_1135 [Ferroglobus placidus DSM 10642]
 gi|288893757|gb|ADC65294.1| Protein of unknown function DUF171 [Ferroglobus placidus DSM 10642]
          Length = 251

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 88/297 (29%), Positives = 132/297 (44%), Gaps = 62/297 (20%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           + +A+P SI++N    ++ T   G I RA  IFR++++ ++ + S               
Sbjct: 1   MEIAIPSSILENESDDKIKTFKVGIIGRAAAIFRVEKIYIYKDPS--------------- 45

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKH-EWAPF 181
           R D N     F+  +L+Y+ETPQYLRK LF     LR+ G+L PL  P H  KH +    
Sbjct: 46  RDDSN-----FISEILRYMETPQYLRKYLFPRSEKLRYAGVLQPLQIPSHKPKHLKVGEI 100

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGV +K     +     D+G+    +       G RVTV + +   L       VV  +
Sbjct: 101 REGVIVKVADGTA---WADIGMKALALFRGKARKGARVTVRVCSTNPL-------VVEEA 150

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL-PTFRHL- 299
           KP+E   YWG+KV          + CS K          +  D V +S L   P+   + 
Sbjct: 151 KPEE---YWGFKV----------EKCSLK-------KLLKREDAVVTSKLGYHPSVDEIK 190

Query: 300 ---LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
              L+ FG  +   + I +  GL      E     +N  P QG R++R EEA+   L
Sbjct: 191 KVNLLIFGSPSKEVKEIADILGL------EFEAKMWNLIPKQGVRSVRVEEAVISCL 241


>gi|21465769|pdb|1K3R|A Chain A, Crystal Structure Of The Methyltransferase With A Knot
           From Methanobacterium Thermoautotrophicum
 gi|21465770|pdb|1K3R|B Chain B, Crystal Structure Of The Methyltransferase With A Knot
           From Methanobacterium Thermoautotrophicum
          Length = 268

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 85/301 (28%), Positives = 139/301 (46%), Gaps = 56/301 (18%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P S+   T  L++ T     IARA +IF +  +V++                   
Sbjct: 6   LSLFIPDSLTAETGDLKIKTYKVVLIARAASIFGVKRIVIY------------------- 46

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-LRKHEWAPF 181
             D+ +  A F+  +L Y++TPQYLR+ +F +   L+ VG+LPPL  PHH   K     +
Sbjct: 47  -HDDADGEARFIRDILTYMDTPQYLRRKVFPIMRELKHVGILPPLRTPHHPTGKPVTGEY 105

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           R+G+T+K       GTLVD+G +K  +  +      ++TV    +  +       ++ P 
Sbjct: 106 RQGLTVKRVKK---GTLVDIGADKLALCRE------KLTVNRIMSFRVVRLGKEILIEPD 156

Query: 242 KPKESGMYWGYKVR-YAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
           +P++   YWGY+V     N++   K      G D ++ TS +      I++     +   
Sbjct: 157 EPEDR--YWGYEVLDTRRNLAESLKTV----GADVVVATSRNASPITSILDEVKTRMRGA 210

Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           R   I FGG   GL E I+ D              + NT P Q + T+RTEEA+  +L  
Sbjct: 211 REAAILFGGPYKGLPE-IDAD-------------IWVNTLPGQCTETVRTEEAVLATLSV 256

Query: 356 F 356
           F
Sbjct: 257 F 257


>gi|390960795|ref|YP_006424629.1| hypothetical protein CL1_0629 [Thermococcus sp. CL1]
 gi|390519103|gb|AFL94835.1| hypothetical protein CL1_0629 [Thermococcus sp. CL1]
          Length = 256

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 91/295 (30%), Positives = 136/295 (46%), Gaps = 52/295 (17%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++ T   ++ T   GQIAR+  IF ++ V ++             A  R  R 
Sbjct: 5   VFIPDSLLEETDDPKIRTYKVGQIARSCAIFGVEHVWIY------------RAGGRDGR- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPFR 182
                   F+  +L+Y ETPQYLRK LF +   LR+VG++PPL  PHH  K +      R
Sbjct: 52  --------FIKTILEYAETPQYLRKRLFPLMPELRYVGVIPPLRTPHHKLKGKPRVGEIR 103

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           EG   ++          D+GL++  +V+   D   R T  + + R      P +VV P+K
Sbjct: 104 EGFAFRKGR----RVYADIGLDELALVEG--DVEGRTTFRIVSVR------PLRVV-PAK 150

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P++   YWGY+V       S+ K    K   D  I TS  G  V    L  P    +   
Sbjct: 151 PEK---YWGYRVHLTRK--SLAKTLK-KARLDLTIATSRRGRDVREVKLP-PLEGEVGFV 203

Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           FG    G+ E + E++          F    NT P+Q ++T+RTEEA+  +L  F
Sbjct: 204 FGSPRKGVMELLGEEE--------YDFDLILNTVPNQRTKTVRTEEALLATLAVF 250


>gi|294659916|ref|XP_462356.2| DEHA2G18788p [Debaryomyces hansenii CBS767]
 gi|199434333|emb|CAG90863.2| DEHA2G18788p [Debaryomyces hansenii CBS767]
          Length = 382

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 100/386 (25%), Positives = 159/386 (41%), Gaps = 90/386 (23%)

Query: 47  RKRDQLNDDAPIEVPT-------VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRI 97
           +++ +  ++ P++ P         S+ VP +II   N ++LE  T +A QIA+A TI+ +
Sbjct: 3   KRKTEAKEEKPVKKPVKAIPETQYSICVPSTIISAANAKNLEQITSIAYQIAKAATIYNV 62

Query: 98  DEVVVFDNKSSSD--------------------------NYS-----RSSAANRSNR--- 123
            EVVV D  +SS                           N+S     + +A  ++     
Sbjct: 63  AEVVVLDVPTSSKKQDLAEKEAAKVVELGSNKGGKKIKFNFSDEDILKPTAVEKTKEPET 122

Query: 124 -----SDENESGAAFLVRLLQYLETPQYLRKALFS---MHSSLRFVGMLPPLDA-PHHLR 174
                SD NE+       LLQY  TP YL K++F+    H+  ++   LP +   P    
Sbjct: 123 PVDLSSDLNENNTLLFATLLQYFMTPPYLVKSVFANNQFHTKFKYAQKLPKISTLPFMSN 182

Query: 175 KHEWAPFREGVTLKENAPN--------------SVGTLVDVGLNKHVVVDQVLDPGVRVT 220
            H    F+EG+T+ +  P               SV   V++G  +  V+D  +   VRVT
Sbjct: 183 NHVSKDFKEGLTIPKATPKVTKKNKKVSALKKLSVTKYVNIGDAEPFVLDHEVPVNVRVT 242

Query: 221 VAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI--- 277
           V +   +N    SP Q       K S   +GY VRY  N SS+F  CS+  GY   I   
Sbjct: 243 VDL---KNKKIVSPLQAYGLIGSKSS---FGYHVRYCANFSSIFTQCSFPEGYTSTIYVN 296

Query: 278 --------GTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDG--LKRKNAREV 327
                    T E  + +N +    P    +L+ FG    ++ S + D       +NA ++
Sbjct: 297 CDDYFNSNDTIEARNSINKNPDLSPNDGKILLLFGNYNDIDYSFQNDKANLPGVENASQM 356

Query: 328 FHSYFNTCPHQGSRTIRTEEAIFISL 353
           F            + ++ E+A  ISL
Sbjct: 357 FDGELPI-----PKGVKIEDATLISL 377


>gi|307595040|ref|YP_003901357.1| hypothetical protein Vdis_0916 [Vulcanisaeta distributa DSM 14429]
 gi|307550241|gb|ADN50306.1| Protein of unknown function DUF171 [Vulcanisaeta distributa DSM
           14429]
          Length = 286

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 139/303 (45%), Gaps = 48/303 (15%)

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
           PT+ +A+P   +  +     A R  G + R   IF++ +++V+ +K + +        +R
Sbjct: 8   PTIDIAIPSDFLAESPDDREAIRKIGYLGRGAAIFQVSKIIVYRHKLAGER-------DR 60

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAP 180
           +N          F+++ LQYL TP YLRK LF +   L++ G+LPPL  P+H    E  P
Sbjct: 61  TN----------FIIKNLQYLVTPPYLRKDLFKLDRDLKYAGLLPPLKTPNH--APEGMP 108

Query: 181 ----FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
               +REG+ +K +   S+   + +G   +  V + +  G RV V +             
Sbjct: 109 QRGEYREGIVVKWDGYFSI---IKIGEGVYAKVPKPMPLGTRVIVQIDAQTTRGDTYRAH 165

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           VVP  +     +YWG++      +S +F +      YD++I T + G  +N  D  +   
Sbjct: 166 VVPRDR---LSVYWGFEAEVV-ELSRLFSD------YDYVILTGKEG--MNIKD-AMEQL 212

Query: 297 R------HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
           R       +L+ FG      + I   +G++    +   + + N  P QG  T+RTEEA+ 
Sbjct: 213 RSALSRDRVLVVFGSPYHGVDEILRAEGMEEALRK---YPFINFIPGQGVETVRTEEAVI 269

Query: 351 ISL 353
             L
Sbjct: 270 AVL 272


>gi|340344792|ref|ZP_08667924.1| hypothetical protein COG2106 [Candidatus Nitrosoarchaeum koreensis
           MY1]
 gi|339519933|gb|EGP93656.1| hypothetical protein COG2106 [Candidatus Nitrosoarchaeum koreensis
           MY1]
          Length = 270

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/299 (30%), Positives = 139/299 (46%), Gaps = 48/299 (16%)

Query: 63  VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           +SVA+P S + + +SL+L  TR    +ARA  IF+ID + V+       NY         
Sbjct: 3   ISVAIPESALSD-ESLKLDKTRKISVLARACAIFKIDTIYVYQ----EGNY--------- 48

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-----LRKH 176
            R D N      LV +L+YLETPQ+LR+ LF   + L+F G+L PL  P H      +K 
Sbjct: 49  -REDGN-----LLVTILRYLETPQFLRRRLFPKMNELKFAGVLYPLKIPSHSTPANSKKI 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
                REGV +           +DVG+N+ +      + G R TV   T        P  
Sbjct: 103 NVGDVREGVIVSLKGKK----FIDVGINELIPFFGKENVGKRATVQFKTG------YPDF 152

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT--LP 294
            V   +  E+ +YWGY V+   ++ S+     ++G   ++I TS  G       LT  L 
Sbjct: 153 SVKEIQRNETSLYWGYAVKERASLFSLL--TEWQG---NIIITSRKGRTATKEQLTKYLS 207

Query: 295 TFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +   +L+ FG     E+ + E  G K KN +       N  P+Q + T+R +EA+  +L
Sbjct: 208 SDLPILVVFG---SPEKGVHEILGGKMKNVQNA--KTLNFFPNQATETVRLDEALLGTL 261


>gi|119608244|gb|EAW87838.1| chromosome 9 open reading frame 114, isoform CRA_a [Homo sapiens]
          Length = 236

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 51/105 (48%), Positives = 60/105 (57%), Gaps = 4/105 (3%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SVA+PGSI+DN QS EL T LAGQIARA  IF +DE+VVFD +               
Sbjct: 135 TLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDEIVVFDEEGQDAKTVEGEFTGVG 194

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP 166
            +          L R+LQYLE PQYLRKA F  H  L+F   L P
Sbjct: 195 KKGQ----ACVQLARILQYLECPQYLRKAFFPKHQDLQFAACLAP 235


>gi|18978198|ref|NP_579555.1| hypothetical protein PF1826 [Pyrococcus furiosus DSM 3638]
 gi|397652526|ref|YP_006493107.1| hypothetical protein PFC_09460 [Pyrococcus furiosus COM1]
 gi|18894007|gb|AAL81950.1| hypothetical protein PF1826 [Pyrococcus furiosus DSM 3638]
 gi|393190117|gb|AFN04815.1| hypothetical protein PFC_09460 [Pyrococcus furiosus COM1]
          Length = 260

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 85/295 (28%), Positives = 132/295 (44%), Gaps = 50/295 (16%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++     ++     GQ+ARA  IF ++ + ++             A  +  + 
Sbjct: 5   VFIPDSLLEEANDPKIKAYKVGQVARACAIFGVEHIWIY------------KAGGKDGK- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREG 184
                   F+  LL+Y ETPQYLRK +F +   L++ G++PPL  P H  K   AP R G
Sbjct: 52  --------FIKLLLEYAETPQYLRKRIFPITPELKYAGVIPPLQIPSHKPK---APPRIG 100

Query: 185 VTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
             ++E      G     D+GL++  ++  V   G       GT + +  +  R +  P++
Sbjct: 101 -EIREGYAYRKGKKLFADIGLDRPALIKGVAREG------RGTFKIISVNPLRVI--PAE 151

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P E   YWGYKV  +            K   D +I TS  G  V S++  +P    + I 
Sbjct: 152 PSE---YWGYKVHLS---RKTLAKTLKKANLDVVIATSRKGVDVRSAE--VPLEGEVGIV 203

Query: 303 FGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
           FG    G+ E       LK  N    F    NT P Q ++T+RTEEA+  +L  F
Sbjct: 204 FGSPKKGVIEI------LKEYNEDYEFDLILNTIPEQKTKTVRTEEALLATLAIF 252


>gi|407464764|ref|YP_006775646.1| hypothetical protein NSED_04490 [Candidatus Nitrosopumilus sp. AR2]
 gi|407047952|gb|AFS82704.1| hypothetical protein NSED_04490 [Candidatus Nitrosopumilus sp. AR2]
          Length = 268

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 139/306 (45%), Gaps = 62/306 (20%)

Query: 63  VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           +SVA+P S + + +SL++  TR    +ARA  IF++D + ++   S              
Sbjct: 3   LSVAIPQSALSD-ESLKIDKTRKISVLARACAIFKVDTIYIYQEGS-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
                N+S    +  +L+YLETPQ+LR+ LF   + L+F G+L PL  P H+     +K 
Sbjct: 48  -----NKSDGGLMAMILKYLETPQFLRRRLFPKMNDLKFAGVLHPLKIPSHITPVNSKKI 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
           +    REG+ +           VDVG+N+ V        G RVTV            P+ 
Sbjct: 103 KTGDIREGIVVSIKGKK----FVDVGINQLVQYYGSTSTGKRVTV------RFKEGYPKL 152

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL--- 293
            +      E   YWGY V+   N+ SV     +KG   ++I TS  G  +++  L     
Sbjct: 153 SIKDIDKSEVPTYWGYSVKERANLFSVL--SEWKG---NIILTSRKGKPISTEQLIKYAK 207

Query: 294 ---PTFRHLLIAFGGLAGLEESIEEDDGL---KRKNAREVFHSYFNTCPHQGSRTIRTEE 347
              PT    L+ FG     E+ I E  G    K +NA+ +     N  P+Q + T+R EE
Sbjct: 208 SDEPT----LVVFG---SPEKGIHEIIGGRMNKVQNAKSL-----NFFPNQATETVRLEE 255

Query: 348 AIFISL 353
           A+  +L
Sbjct: 256 ALLGTL 261


>gi|167043663|gb|ABZ08356.1| putative uncharacterized ACR, COG2106 [uncultured marine
           crenarchaeote HF4000_APKG2O16]
          Length = 248

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 77/267 (28%), Positives = 122/267 (45%), Gaps = 39/267 (14%)

Query: 88  IARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYL 147
           IARA  IF+I+++ ++ +                    +N++  A L   L+YLETPQY 
Sbjct: 2   IARACAIFKINQIFIYQD------------------GKQNKNDLALLSTSLKYLETPQYF 43

Query: 148 RKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKENAPNSVG-TLVDVGLNKH 206
           RK +F     L++ G L PL+   HL   +    + G T      N  G   +D+G+NK 
Sbjct: 44  RKDIFPKTQLLKYAGALQPLNISSHLTTSDQKMIKIGDTRDALIINYKGKKFLDIGINKL 103

Query: 207 VVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKN 266
           +     +  G R+ + + T +      P   V     K+   YWGY V+   N+ S+   
Sbjct: 104 IQYFGKMKSGTRIAIQIKTTQ------PELTVKEISRKDIKDYWGYSVKERANLLSIL-- 155

Query: 267 CSYKGGYDHLIGTSEHGD---IVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKN 323
            ++KG    +I TS+ G    I+++  +T P    +LI FG        I   D  K +N
Sbjct: 156 STWKG---KIILTSKKGKNFTILDAKKMTEPD-EPILIVFGTTNKGIYDILGTDIKKIQN 211

Query: 324 AREVFHSYFNTCPHQGSRTIRTEEAIF 350
           A+      FN  P+Q ++T+R EEAI 
Sbjct: 212 AK-----IFNFFPNQATQTVRLEEAIL 233


>gi|312079981|ref|XP_003142406.1| hypothetical protein LOAG_06822 [Loa loa]
          Length = 117

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 51/110 (46%), Positives = 65/110 (59%), Gaps = 4/110 (3%)

Query: 244 KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAF 303
           +E+G+YWGYKVR A ++          G YD +IGTSE G  V+  ++ L     +LI F
Sbjct: 1   RETGLYWGYKVRIALSLHDALN----AGEYDVIIGTSERGKPVSKFEMPLCERNRILIVF 56

Query: 304 GGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           GGL GLE +IE D  +      E+F  Y N  P QGSR IRTEEAI I+L
Sbjct: 57  GGLEGLEAAIEADKNISCSTPEELFEHYLNVVPGQGSRIIRTEEAIPITL 106


>gi|327311990|ref|YP_004338887.1| hypothetical protein TUZN_2118 [Thermoproteus uzoniensis 768-20]
 gi|326948469|gb|AEA13575.1| hypothetical protein TUZN_2118 [Thermoproteus uzoniensis 768-20]
          Length = 272

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 90/293 (30%), Positives = 127/293 (43%), Gaps = 42/293 (14%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           + +AVP  ++      E   R  G IAR   +FR + ++++                  N
Sbjct: 1   MDIAVPHDVLSEAPDEESKVRKLGYIARGAAVFRAENLIIY---------------TYGN 45

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAP 180
             D  E     L  LL+Y  TP +LRK LF +   LR  G+LPPL  P H    E     
Sbjct: 46  DVDWEEVERMRL--LLEYASTPPHLRKKLFKLDRRLRLAGLLPPLKIPSHTPPKEPSVGD 103

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
             EGV  + +   S   LV +G  ++  + +    G R+ V +    +   D  R VV  
Sbjct: 104 VIEGVVERWDGYYS---LVYIGGRRYAKIPKPYPIGSRLLVKIEAETD-RPDIYRAVVVK 159

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
             P     YWGYKV   P     FK  +   GYD +I T   G  V    L  P  R L+
Sbjct: 160 KPPD----YWGYKVEVKP-----FKQLA--DGYDTVIYTGREGRSV-CEGLPRPVGRTLV 207

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +  G  AG++E I + +G++ K        + N  P QG+ T+RTEEAIF  L
Sbjct: 208 VFGGPRAGVDE-IAKIEGVELKGL------FLNFIPRQGTETVRTEEAIFAVL 253


>gi|284162461|ref|YP_003401084.1| hypothetical protein Arcpr_1362 [Archaeoglobus profundus DSM 5631]
 gi|284012458|gb|ADB58411.1| Protein of unknown function DUF171 [Archaeoglobus profundus DSM
           5631]
          Length = 254

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 85/292 (29%), Positives = 133/292 (45%), Gaps = 50/292 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SVA+P S + N    ++ T   G IARA  IFR+ E++++          R    N S 
Sbjct: 2   ISVAIPSSALINENDPKIKTFKVGLIARACAIFRVGEIIIY----------RDPKLNESQ 51

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
                     F+  +L+Y ETPQYLRK +  +  +L++ G+LPPL  P H  K       
Sbjct: 52  ----------FIKDVLEYAETPQYLRKYI-PIKETLKYAGVLPPLKIPSHKPKRLKVGEV 100

Query: 183 EGVTLKENAPNSVGT-LVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
               +    P+  GT  VD+G+     +      G RVTV + +   L       VV  +
Sbjct: 101 REGVVVRVGPD--GTRWVDIGVKALAPLKSKAKRGARVTVRVCSTNPL-------VVEEA 151

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
           KP+E   YWGYKVR A  +  V          ++++ TS    +    ++   + + + +
Sbjct: 152 KPEE---YWGYKVRIA-ELKDVLSK-------ENVVLTSRKCKVPKIEEIK--SLKDVTL 198

Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            FG     EE + E   + R+   +     +NT P QG  T+R EEAI+ +L
Sbjct: 199 VFG---SPEEGVFE---IMRRLGIDTDARCWNTIPMQGVETVRLEEAIYATL 244


>gi|407462404|ref|YP_006773721.1| hypothetical protein NKOR_04455 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407046026|gb|AFS80779.1| hypothetical protein NKOR_04455 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 270

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 86/299 (28%), Positives = 137/299 (45%), Gaps = 48/299 (16%)

Query: 63  VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           +SVA+P S + + +SL++  TR    +ARA  IF+I+ + V+   +              
Sbjct: 3   LSVAIPESSLSD-ESLKIDKTRKISVLARACAIFKIETIYVYQEGN-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
           N+ D N      +V +L+YLETPQ+LR+ LF   + L+F G+L PL  P H+     +K 
Sbjct: 48  NKQDGN-----LMVMILKYLETPQFLRRRLFPKMNDLKFAGVLQPLRIPSHVTPANPKKI 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
                REG+ +           VDVG+N+ +        G RVT+            P+ 
Sbjct: 103 TRGDVREGIVVSVKGKR----FVDVGINQLIPFFGKTPIGKRVTI------QFKEGYPKF 152

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
            +      E+  YWGY V+   N+ S+     +KG   ++I TS  G       +   T 
Sbjct: 153 SIKEITRNEAPEYWGYAVKERANLFSLL--SEWKG---NIIITSRKGKTATKEQIAKYTK 207

Query: 297 --RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             +  L+ FG     E+ + E  G K KN +       N  P+Q ++T+R EEA+  +L
Sbjct: 208 SDQPTLVVFG---SPEKGVHEILGGKMKNVQNA--KSLNFFPNQATQTVRLEEALLGTL 261


>gi|409096185|ref|ZP_11216209.1| hypothetical protein TzilA_05992 [Thermococcus zilligii AN1]
          Length = 256

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 132/301 (43%), Gaps = 64/301 (21%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S+++     ++ T   GQIARA  IF ++ + ++             A  +  + 
Sbjct: 5   VFIPDSLLEERDDPKIRTYKVGQIARACAIFGVEHIWIY------------GAGGKDGK- 51

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLR--KHEWAPFR 182
                   F+  +L+Y ETPQYLRK LF +   L++ G++PPL  PHH    K E    R
Sbjct: 52  --------FIKTILEYAETPQYLRKKLFPLMPELKYAGVIPPLRTPHHKLEGKPEVGEIR 103

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           EG   ++          D+GL++  +V+   D   R T  + + R      P +VV P+K
Sbjct: 104 EGFAFRKGR----RVYADIGLDELAMVEG--DIEGRATFRIVSLR------PLRVV-PAK 150

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIA 302
           P E   YWGY+V       S+ K    K   D +I TS  G   +  ++ LP        
Sbjct: 151 PAE---YWGYRVHLTEE--SLAKTLK-KARLDLVIATSRRGR--DIREVKLPP------- 195

Query: 303 FGGLAGLEESIEEDDGLKRKNAREV-------FHSYFNTCPHQGSRTIRTEEAIFISLQY 355
                 LE  +    G  RK   E+       F    NT P Q + T+RTEEA+  +L  
Sbjct: 196 ------LEGEVGFVFGSPRKGVMELLGGEEFEFDLILNTIPDQRTETVRTEEALLATLAI 249

Query: 356 F 356
           F
Sbjct: 250 F 250


>gi|161528318|ref|YP_001582144.1| hypothetical protein Nmar_0810 [Nitrosopumilus maritimus SCM1]
 gi|160339619|gb|ABX12706.1| Protein of unknown function DUF171 [Nitrosopumilus maritimus SCM1]
          Length = 270

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 89/299 (29%), Positives = 136/299 (45%), Gaps = 48/299 (16%)

Query: 63  VSVAVPGSIIDNTQSLEL-ATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           +SVA+P S + + +SL++  TR    +ARA  IF+I+ + V+   +              
Sbjct: 3   LSVAIPESSLSD-ESLKIDKTRKISVLARACAIFKIETIYVYQEGN-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKH 176
           N+ D N      +V +L+YLETPQ+LR+ LFS  + L+F G+L PL  P H      +K 
Sbjct: 48  NKQDGN-----LMVMILKYLETPQFLRRRLFSKVNDLKFAGVLQPLRIPSHATPANPKKI 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQ 236
                REG+ +           VDVG+N+ +        G RVTV            P  
Sbjct: 103 NKGDVREGIVVSVKGKR----FVDVGINQLIPFFGKTLMGKRVTV------QFKEGHPNF 152

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
            +      E+  YWGY V+   N+ S+     +KG   ++I TS  G       +   T 
Sbjct: 153 SIKEINRNEAPDYWGYTVKERSNLFSLL--SEWKG---NIILTSRKGKTATKEQIAKYTK 207

Query: 297 --RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             +  L+ FG     E+ I E  G K KN +       N  P+Q ++T+R EEA+  +L
Sbjct: 208 SDQPTLVVFG---SPEKGIHEILGGKMKNVQNA--KSLNFFPNQATQTVRLEEALLGTL 261


>gi|347830143|emb|CCD45840.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 193

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/114 (46%), Positives = 69/114 (60%), Gaps = 10/114 (8%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSVA+PGSII N QS +L T +AGQIARA+ +F +DEV+VFD+ ++           +S
Sbjct: 61  TVSVALPGSIIANAQSHDLKTSMAGQIARALAVFCVDEVIVFDDGNARPAKKPKHPPTQS 120

Query: 122 NRSD-----ENESGAA-----FLVRLLQYLETPQYLRKALFSMHSSLRFVGMLP 165
           + S      E+E  A      FL  LL +LETP  LRK LF +H +LR  G LP
Sbjct: 121 HNSQQKPPLEDEYTAKSDPDHFLTHLLSFLETPPNLRKHLFPIHENLRTAGALP 174


>gi|116755017|ref|YP_844135.1| hypothetical protein Mthe_1729 [Methanosaeta thermophila PT]
 gi|116666468|gb|ABK15495.1| Protein of unknown function DUF171 [Methanosaeta thermophila PT]
          Length = 277

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 86/304 (28%), Positives = 131/304 (43%), Gaps = 52/304 (17%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P S    T  L L T   G IARA  +FRID +VV+                   R
Sbjct: 8   SILIPSSYTMETADLRLRTAKVGLIARAAAVFRIDRIVVY-------------------R 48

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFRE 183
            DE +  + F+  +L+Y ETPQYLRK LF     LR  G+LPPL   HH      +  + 
Sbjct: 49  DDEFDD-SRFISTVLRYAETPQYLRKLLFPRMRELRHAGVLPPLRTAHHPVGSRSSTLKV 107

Query: 184 GVT----LKENAPNSVGTLVDVGLNKHVVV--DQVLDPGVRVTVAMGTNRNLDADSPRQV 237
           G      + E+  +  G  V+VGL++ V +   +    G R+ V + +   L A+     
Sbjct: 108 GEIRVGMVVESVGSDGGAWVEVGLDRPVPLKSKRRFQKGQRLNVRIFSLSPLAAE----- 162

Query: 238 VPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG-----DIVNSSDLT 292
             P    E   YWGY+     +     ++       + ++ TS  G     D++N  + +
Sbjct: 163 --PVDRSEIPGYWGYETIVVDSAEEYLRSRD-----EFVVATSRKGTPVSCDLLNHIERS 215

Query: 293 LPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFIS 352
               R L + FG  A   ++    + L+R           NT PHQG+ T+R EEA+  +
Sbjct: 216 --GSRGLAVVFGSPARGVDAFLSREMLER-------CCVINTIPHQGTETVRVEEAVIST 266

Query: 353 LQYF 356
           L   
Sbjct: 267 LALL 270


>gi|345005656|ref|YP_004808509.1| hypothetical protein [halophilic archaeon DL31]
 gi|344321282|gb|AEN06136.1| protein of unknown function DUF171 [halophilic archaeon DL31]
          Length = 277

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 93/309 (30%), Positives = 132/309 (42%), Gaps = 62/309 (20%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           ++ VP S++   +    ATR  G +ARA  I+R++ +VVF                  +R
Sbjct: 6   TLVVPSSLVREAEDKREATRKIGYVARAAVIYRVERIVVF-----------------PDR 48

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLD-APHHLRKHEWAPFR 182
             E   G  F+  +L Y  TP YLRK ++     LR+ G+LPPL  +P      E    R
Sbjct: 49  EGERRWGGEFVRTVLAYAATPPYLRKEVWDQRDELRYAGVLPPLRVSPRTGSASERPESR 108

Query: 183 EGVTLKENAPNSVGTLVDVGLNK----HVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV 238
           EG+  +    + V   V+ GL      HV     +  G RVTV + +   + A   R V 
Sbjct: 109 EGIVTEVGPDDRV--RVNCGLQHPISLHVPSSMTVVEGERVTVRVSSREPVRA---RLVD 163

Query: 239 PPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
            P+         GY V  A +I      +N   +      IGTS HGD     DLT+   
Sbjct: 164 EPTA--------GYVVEAA-DIQETLARQNAGVR------IGTSRHGD-----DLTVSRL 203

Query: 297 RHL---------LIAFGGLA-GLEE--SIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIR 344
             L          I FG    G+ E   +  DD L  +     F  + NT P+QGS  +R
Sbjct: 204 AELGSQCRDDGATIVFGSPGRGIPEILGVSPDD-LPVEPDSPGFDLWLNTVPNQGSEVVR 262

Query: 345 TEEAIFISL 353
           TEEA+F +L
Sbjct: 263 TEEAVFATL 271


>gi|444318731|ref|XP_004180023.1| hypothetical protein TBLA_0C07130 [Tetrapisispora blattae CBS 6284]
 gi|387513064|emb|CCH60504.1| hypothetical protein TBLA_0C07130 [Tetrapisispora blattae CBS 6284]
          Length = 384

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 133/302 (44%), Gaps = 35/302 (11%)

Query: 1   MGNKKKRGGLEPELKEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEV 60
           M +K  R G +P   ++ TG+   +Q +       ++  + K+++K    +       + 
Sbjct: 1   MASKNTRKGEKPS--DSKTGKTETTQKKKQAKT--NAKINKKQREKPISKKKTLKVKSKT 56

Query: 61  PTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANR 120
              S+ +P +IID   +LE  T +  QIA++ T+F + E+V+ +  S +DN       N+
Sbjct: 57  LNYSLCIPNNIIDQCTNLEQITHIMYQIAKSATMFNVGEIVILETSSKTDN-------NK 109

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFV--GMLPPLDAPHHLR--KH 176
           S + D   S +  +  LLQY  TP YL K++F       F     LP L     +R  K 
Sbjct: 110 SKKKDGKLSNSMLMASLLQYFVTPPYLLKSVFKKEYWKYFTVASKLPRLSVLPFMRYLKE 169

Query: 177 EWAPFREGVTLKENAPNSVGT-------LVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
           +   +REG+ ++   PN+           +++G +  + +   L P  VRVTV     + 
Sbjct: 170 DEGRYREGLAIRMEKPNAKSNKEFKQTKYINIGKSDALELKSQLVPINVRVTVDTVERKV 229

Query: 229 LDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS 288
           +  D        +K       +GY VR A   + VF  C++ GGY   +        VNS
Sbjct: 230 VSPDEAYGDFVGAKAS-----YGYHVRVAKQFADVFTECAFPGGYSQAVW-------VNS 277

Query: 289 SD 290
            D
Sbjct: 278 GD 279


>gi|145592077|ref|YP_001154079.1| hypothetical protein Pars_1876 [Pyrobaculum arsenaticum DSM 13514]
 gi|145283845|gb|ABP51427.1| Protein of unknown function DUF171 [Pyrobaculum arsenaticum DSM
           13514]
          Length = 265

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 83/292 (28%), Positives = 129/292 (44%), Gaps = 42/292 (14%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+A+P   +  T       R  G +ARA ++F++  V+++   +               R
Sbjct: 3   SIAIPHDFLSETPDEPSKIRKLGYLARAASVFKVSTVIIYYYGAPL-------------R 49

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
            D + +       +L+YL TP YLRK ++ +   LR  G+LPPL  P H+   E      
Sbjct: 50  EDIDLAKT-----VLEYLVTPPYLRKRVYKIDKRLRLAGLLPPLKIPSHVVPKEPRIGEV 104

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGV  + +   S+   V +G  K+  + +    G R+ V +    +   D+ R  +   
Sbjct: 105 REGVVERWDGYYSI---VYIGGGKYAKIPKPYPVGARLLVRIEAPTS-RPDTYRAAIYKG 160

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
            P     YWGYKV   P I S+        G+D +I T   G  +      L   ++ L+
Sbjct: 161 PPPA---YWGYKVEVRP-IQSL------TDGFDAVILTGREGKPICEVSPKL--GKNTLV 208

Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            FGG     + I ++ GL      E+     N  P QG+ TIRTEEA+FI L
Sbjct: 209 VFGGPRRGVDEIFKESGL------ELPGDLINFAPGQGTETIRTEEAVFIVL 254


>gi|379003216|ref|YP_005258888.1| hypothetical protein Pogu_0254 [Pyrobaculum oguniense TE7]
 gi|375158669|gb|AFA38281.1| hypothetical protein Pogu_0254 [Pyrobaculum oguniense TE7]
          Length = 265

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 82/292 (28%), Positives = 129/292 (44%), Gaps = 42/292 (14%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+A+P   +  T       R  G +ARA ++F++  V+++   +               R
Sbjct: 3   SIAIPHDFLSETPDEPSKIRKLGYLARAASVFKVSTVIIYYYGAPL-------------R 49

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
            D + +       +L+YL TP YLRK ++ +   LR  G+LPPL  P H+   E      
Sbjct: 50  EDIDLAKT-----VLEYLVTPPYLRKKVYKIDKRLRLAGLLPPLKIPSHVVPKEPRIGEV 104

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           REGV  + +   S+   V +G  K+  + +    G R+ V +    +   D+ R  +   
Sbjct: 105 REGVVERWDGYYSI---VYIGGGKYAKIPKPYPVGARLLVRIEAPTS-RPDTYRAAIYKG 160

Query: 242 KPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLI 301
            P     YWGYKV   P I S+        G+D +I T + G  +      L   ++ L+
Sbjct: 161 PPPA---YWGYKVEVRP-IQSL------TDGFDAVILTGKEGKPICEVSPKL--GKNTLV 208

Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            FGG     + I ++ GL      E+     N  P QG+ TIRTEEA+ I L
Sbjct: 209 VFGGPRRGVDEIFKESGL------ELPGDLINFAPGQGTETIRTEEAVIIVL 254


>gi|448704115|ref|ZP_21700596.1| hypothetical protein C446_00624 [Halobiforma nitratireducens JCM
           10879]
 gi|445796504|gb|EMA47008.1| hypothetical protein C446_00624 [Halobiforma nitratireducens JCM
           10879]
          Length = 294

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 86/323 (26%), Positives = 130/323 (40%), Gaps = 68/323 (21%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR+D ++V+ +                
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRVDRLIVYPDSD-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
              D    G  F+  +L+Y  TP YLR  ++ M   L +VG+LPPL A        ++  
Sbjct: 48  --GDTGRFGDGFVSTVLRYAATPPYLRNEVWGMRDELEYVGVLPPLRAASRTGSESNDSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ G+   + +    D  +  G RVTV + + R + A    
Sbjct: 106 SLRQGIVTEVGPEGRV--RVNCGMQHPISLNTPPDMEVSEGERVTVRISSRRPVRAK--- 160

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH---LIGTSEHGDIVNSSDLT 292
                              R  P +S    + S   G +     I  S +G+ +    L 
Sbjct: 161 ----------------LVDRPLPGLSVERTDLSAALGREDAGVCIAASRYGETLTVGRLE 204

Query: 293 LPTFR----HLLIAFGG-------LAGLEESI-----------EEDDGLKRKNAREVFHS 330
               R     + +AFG        + G+E S              DDG++   A   F  
Sbjct: 205 TLAGRVEADGMTVAFGAPERGLPDILGIEASDVASAHAGDGPESADDGVEPTPADPGFDL 264

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + NT P QGS  +RTEEA+F +L
Sbjct: 265 WLNTVPDQGSDVVRTEEAVFATL 287


>gi|448737861|ref|ZP_21719894.1| hypothetical protein C451_10022 [Halococcus thailandensis JCM
           13552]
 gi|445802823|gb|EMA53124.1| hypothetical protein C451_10022 [Halococcus thailandensis JCM
           13552]
          Length = 276

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 133/311 (42%), Gaps = 50/311 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+         AT   G +ARA T+FR+D V+VF +      +         
Sbjct: 2   TRSVLVPSSLCREAADKREATHKVGLVARAATVFRVDRVIVFPDPDGDRRW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
                   G  F+  +L Y  T  YLRK  +     L + G+LPPL A        +   
Sbjct: 53  --------GGGFVSTVLAYAATAPYLRKEAWGTRDELEYAGVLPPLRAAARTGSESNGSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + +    +  +D G RVTV + +   + A    
Sbjct: 105 SLRQGI-VTEVGPDGR-VRVNCGLQHPISLVAPPEMAVDEGERVTVRISSREPVRA---- 158

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
           ++   S P       G+ V  A ++S+    ++  ++      + TS HG+++ +  L  
Sbjct: 159 RITDESPP-------GFAVERA-DLSAALGREDAGFR------VATSRHGEVLTTERLGT 204

Query: 294 PTFRH---LLIAFGGLA-GLEESIEEDDG-LKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
              R    L +AFG    GL   ++ D   +   +  E F  + NT P QGS  +RTEEA
Sbjct: 205 LAGRADDGLTVAFGAPERGLPAMLDIDAASVASDSPHEGFDCWLNTVPKQGSEVVRTEEA 264

Query: 349 IFISLQYFQEP 359
           +F +L     P
Sbjct: 265 VFATLAPLTLP 275


>gi|84999294|ref|XP_954368.1| hypothetical protein [Theileria annulata]
 gi|65305366|emb|CAI73691.1| hypothetical protein TA21040 [Theileria annulata]
          Length = 232

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 51/120 (42%), Positives = 69/120 (57%), Gaps = 12/120 (10%)

Query: 237 VVPPSKP-KESGMYWGYKVRYAPNISSVFKNCSY--KGGYDHLIGTSEHGDIVNSSDLTL 293
           +V P +P K +G+YWGY VR   +++   K C +   G YD  IGT E G++  S +  L
Sbjct: 113 LVHPEEPLKVAGLYWGYVVRECKSVAESLKGCPFNESGNYDLKIGTCERGELY-SLNTKL 171

Query: 294 PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           P F+++LI FG + GLE  +EE +          F  YFN C  Q SRTIRTEEA+ I L
Sbjct: 172 PKFKNMLIHFGPVLGLEHVMEEPEN--------KFDKYFNFCNQQKSRTIRTEEALLIVL 223


>gi|383625046|ref|ZP_09949452.1| hypothetical protein HlacAJ_17003 [Halobiforma lacisalsi AJ5]
 gi|448697502|ref|ZP_21698542.1| hypothetical protein C445_11277 [Halobiforma lacisalsi AJ5]
 gi|445781455|gb|EMA32311.1| hypothetical protein C445_11277 [Halobiforma lacisalsi AJ5]
          Length = 296

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 87/325 (26%), Positives = 133/325 (40%), Gaps = 70/325 (21%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TI+R+D ++V+ ++               
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATIYRVDRLIVYPDRDG------------- 48

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
              +    G  F+  +L+Y  TP YLR   + M   L +VG+LPPL A        ++  
Sbjct: 49  ---ETGRFGDGFVSTVLRYAATPPYLRNEAWGMRDELEYVGVLPPLRAVSRTGSESNDSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ G+   + +    D  ++ G RVTV + + R        
Sbjct: 106 SLRQGIVTEVGPEGRV--RVNCGMQHPISLNTPPDMEVEEGERVTVRISSRR-------- 155

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHL---IGTSEHGDIVNSSDLT 292
               P + K          R  P +S    + S   G +     I  S HG+ +    L 
Sbjct: 156 ----PVRAK-------LVDRPLPGLSVERTDLSAALGREDAGVRIAASRHGEELTVGRLE 204

Query: 293 LPTFR----HLLIAFGG-------LAGLEESI-------------EEDDGLKRKNAREVF 328
               R     L +AFG        + G+E S              + DDG++   A   F
Sbjct: 205 TLAGRIEDDGLTVAFGAPERGLPDILGIEASAVAVDPDPSDRGDSDADDGVEPTPADPGF 264

Query: 329 HSYFNTCPHQGSRTIRTEEAIFISL 353
             + NT P QGS  +RTEEA+F +L
Sbjct: 265 DLWLNTVPDQGSDVVRTEEAVFATL 289


>gi|374326111|ref|YP_005084311.1| hypothetical protein P186_0607 [Pyrobaculum sp. 1860]
 gi|356641380|gb|AET32059.1| hypothetical protein P186_0607 [Pyrobaculum sp. 1860]
          Length = 264

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 88/295 (29%), Positives = 125/295 (42%), Gaps = 48/295 (16%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+A+P   +          R  G +ARA  IFR++ VV++       +Y R  A +    
Sbjct: 3   SIAIPHDFLSEAPDEASKVRKLGYLARAAAIFRVEYVVIY-------HYGRPLAED---- 51

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
            D  ++       +L+YL TP YLRK +F +   LR  G+LPPL  P H    E      
Sbjct: 52  IDLAKT-------VLEYLVTPPYLRKKVFKIDRRLRLAGLLPPLKIPSHRAPSEPRVGEV 104

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTV---AMGTNRNLDADSPRQVV 238
           REG+  + +   S   LV +G  K+  V +    G R+ V   A  T R+    S  +  
Sbjct: 105 REGIVERWDGYYS---LVYIGAEKYAKVPKPYPIGTRLYVKIEAPTTRRDTYRASVYRGP 161

Query: 239 PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH 298
           PP  P       GY+V   P         S   G+D LI T   G   +  ++     + 
Sbjct: 162 PPYYP-------GYRVEVRP-------VQSLAEGFDTLILTGREGK--SICEVKSKIGKK 205

Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            L  FG      + I E+ G+K          + N  P QG  TIRTEEA+FI L
Sbjct: 206 TLAVFGSPRKGVDEIMEETGVKLAG------EFINFAPGQGVETIRTEEAVFIVL 254


>gi|18314060|ref|NP_560727.1| hypothetical protein PAE3431 [Pyrobaculum aerophilum str. IM2]
 gi|18161641|gb|AAL64909.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
          Length = 270

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 82/293 (27%), Positives = 127/293 (43%), Gaps = 44/293 (15%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+A+P   +          R  G +ARA  IF+ + ++++   +               R
Sbjct: 3   SIAIPHDFLSEIPDEAGKVRKLGYLARAAAIFKAEYIIIYHYGTP-------------RR 49

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
            D +     F   +L+YL TP YLRK ++ + + L+  G+LPPL  P H    E      
Sbjct: 50  EDID-----FAKTVLEYLVTPPYLRKKVYKIDNRLKLAGLLPPLKIPSHTVPVEPRIGEI 104

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM-GTNRNLDADSPRQVVPP 240
           REG+  + +   S   L+ +G  K+  V +    G R+ V + GT      D+ R  V  
Sbjct: 105 REGIVERWDGYYS---LIYIGGGKYAKVPKPYPIGTRLLVRIEGTTSR--PDTYRAAVYR 159

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLL 300
             P +   YWGYK+   P      +N S   G+D +I T + G  +  +   +   +  L
Sbjct: 160 GAPPD---YWGYKIDVRP-----LQNLS--EGFDTVILTGKEGKSICEAKPKIG--KKTL 207

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           + FGG     + I  + GL+            N  P QG  TIRTEEA+FI L
Sbjct: 208 VVFGGPRKGVDEIFREAGLEPPG------ELINFAPGQGVETIRTEEAVFIVL 254


>gi|118576185|ref|YP_875928.1| hypothetical protein CENSYa_0992 [Cenarchaeum symbiosum A]
 gi|118194706|gb|ABK77624.1| conserved hypothetical protein [Cenarchaeum symbiosum A]
          Length = 265

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/295 (27%), Positives = 125/295 (42%), Gaps = 49/295 (16%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SVA+P S + +  +    +R A  IARA ++F +  V V+ +                 
Sbjct: 3   LSVAIPDSCLADESTQMDKSRKASVIARACSVFGVSAVYVYKDGGGDGR----------- 51

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL-----RKHE 177
                      L  +L+YLETP +LRK LF   + L++ G+L PL  P H      +   
Sbjct: 52  ----------LLTTILRYLETPPFLRKRLFPRINELKYAGVLSPLRIPSHTASSDPKSVS 101

Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQV 237
               REGV L +         VD+GL + V       P  RVTV        +   P  V
Sbjct: 102 RGDVREGVVLSKGGRK----FVDIGLGRPVPYSGQTIPPKRVTV------RFERAPPDPV 151

Query: 238 VPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL-TLPTF 296
           +     +E+G YWGY+V+    +  +     +KG    +I TS  G  ++ S + +    
Sbjct: 152 IREIPREEAGAYWGYRVKERGRLGELLSG--WKG---RIILTSRKGSAISQSGIRSYAGT 206

Query: 297 RHLLIAFGGL-AGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
             +L+ FG    G+   +    G + +NAR       N  P Q S T+R EEA+ 
Sbjct: 207 GEVLVVFGSTDKGIHGMLGGGTG-RVQNAR-----VLNFFPGQSSETVRLEEAML 255


>gi|322372169|ref|ZP_08046710.1| hypothetical protein ZOD2009_21752 [Haladaptatus paucihalophilus
           DX253]
 gi|320548178|gb|EFW89851.1| hypothetical protein ZOD2009_21752 [Haladaptatus paucihalophilus
           DX253]
          Length = 280

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 127/312 (40%), Gaps = 61/312 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S++   +    ATR  G +ARA T+FR D +VVF +                
Sbjct: 2   TVSILVPSSLVREAEDKREATRKIGYVARAATVFRADRLVVFPDGEG------------- 48

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
               E + GA F+  +L+Y  TP YLRK +F     L + G+LPPL A            
Sbjct: 49  ----ERKWGAGFVETILKYAATPPYLRKEVFGRRDELEYAGILPPLRATSQTGSESEGSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLN---KHVVVDQV-LDPGVRVTVAMGTNRNLDA---D 232
             R+G+  +      V   V+ GL      VV  Q+ +  G RVTV + + R + A   D
Sbjct: 105 SLRQGIVTEVGPEGRV--RVNCGLQHPISLVVPSQMTVGEGERVTVRISSRRPVRAKLVD 162

Query: 233 SPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHL---IGTSEHGDIVNSS 289
           +P                       P ++    + S     D     I TS +G+ ++  
Sbjct: 163 AP----------------------VPGLTVTRMDLSAALDRDDAGVRIATSRYGEPLSVG 200

Query: 290 DLTLPTFRH----LLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSR 341
            L     R     + +AFG    GL  +     E            F  + NT P+QGS 
Sbjct: 201 RLEEVVTRTNRDGMTVAFGSPGRGLPEIFGVPPEGISAVESGVPARFDLWLNTVPNQGSE 260

Query: 342 TIRTEEAIFISL 353
            IRTEEA+F SL
Sbjct: 261 VIRTEEAMFASL 272


>gi|171185760|ref|YP_001794679.1| hypothetical protein Tneu_1306 [Pyrobaculum neutrophilum V24Sta]
 gi|170934972|gb|ACB40233.1| Protein of unknown function DUF171 [Pyrobaculum neutrophilum
           V24Sta]
          Length = 263

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 81/294 (27%), Positives = 125/294 (42%), Gaps = 42/294 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           + S+AVP   +          R  G +ARA  IFR++EVV++                  
Sbjct: 2   SFSIAVPHDFLLEAPDEAGKVRKLGYLARAAAIFRVEEVVIY------------------ 43

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA-- 179
           +  +       F   +L+Y+ TP YLR+ ++ +   LR  G+LPPL  P H    E A  
Sbjct: 44  HYGEPLREEIDFAKTILEYMVTPPYLRRRVYKLDRRLRLAGLLPPLKIPSHTVPREPAVG 103

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVP 239
             REGV  + +   S   LV +G  ++  + +    G R+ V +        D+ R  V 
Sbjct: 104 EIREGVVERWDGYYS---LVYIGGGRYAKIPRPYPIGTRLLVKIEAPTG-RPDTYRASVY 159

Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
              P     YWG++V     + S+ +      G+D ++ T   G  +      L   R  
Sbjct: 160 RGAPPN---YWGFRV----EVRSLRELAE---GFDAVVLTGREGRSICEVKPRLG--RRT 207

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           L+ FGG     + I  ++G+      E+     N  P QG  TIRTEEA+FI L
Sbjct: 208 LVVFGGPRKGVDEIYREEGV------EMPGELINFVPGQGVETIRTEEAVFIVL 255


>gi|448399309|ref|ZP_21570611.1| hypothetical protein C476_07478 [Haloterrigena limicola JCM 13563]
 gi|445669216|gb|ELZ21829.1| hypothetical protein C476_07478 [Haloterrigena limicola JCM 13563]
          Length = 291

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 82/303 (27%), Positives = 133/303 (43%), Gaps = 31/303 (10%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV VP S+    +    ATR  G +ARA TIFR D ++V+               +R 
Sbjct: 2   TISVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLIVY--------------PDRD 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
             + + + G  F+  +L+Y  TP YLR  ++ M   L +VG+LPPL A        +   
Sbjct: 48  GETGQFDGG--FVATVLRYAATPPYLRNEVWGMRDELEYVGVLPPLRAVSQTGSESNGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + +    D  +D G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNTPPDMEVDEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---NSSD 290
           + +P    +++ +    G +       +S F      G  + L G  E   +     + +
Sbjct: 164 EPLPGLSVEQTDLSAALGREDAGVCIAASRFGEELTVGRLETLAGRVEDDGMTVAFGAPE 223

Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
             LP    L I    +  ++E+  + D      A   F  + NT P+QGS  +RTEEA+F
Sbjct: 224 RGLPDI--LGIEASAVGAVQETATDGDDGVEPTADPGFDLWLNTVPNQGSEVVRTEEALF 281

Query: 351 ISL 353
            +L
Sbjct: 282 ATL 284


>gi|119872478|ref|YP_930485.1| hypothetical protein Pisl_0966 [Pyrobaculum islandicum DSM 4184]
 gi|119673886|gb|ABL88142.1| Protein of unknown function DUF171 [Pyrobaculum islandicum DSM
           4184]
          Length = 263

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 124/298 (41%), Gaps = 50/298 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           + S+AVP   +          R  G +ARA  IF ++ ++++        Y  S      
Sbjct: 2   SFSIAVPHDFLSEAPDEASKVRKLGYLARAAAIFNVETIIIY--------YYGSPLREEI 53

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WA 179
           +          F   +L+Y+ TP YLRK LF +   L+  G+LPPL  P H    E    
Sbjct: 54  D----------FAKTILEYVVTPPYLRKRLFKLDQRLKLAGLLPPLKIPSHTVSLEPRVG 103

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVV- 238
             REGV  + +   S   LV +G  K+  + +    G R+ V +        D+ R  V 
Sbjct: 104 EIREGVVERWDGYFS---LVYIGGGKYAKIPKPYPVGSRLLVKIEAPTE-RPDTYRASVY 159

Query: 239 ---PPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
              PP+       YWGYKV   P         S   G+D +I T + G  V  +   +  
Sbjct: 160 RGTPPA-------YWGYKVEVRP-------LQSLTEGFDAIILTGKEGRSVCEAKPKV-- 203

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            +  L+ FGG     + I ++ G++            N  P QG  +IRTEEA+FI L
Sbjct: 204 GKKTLVVFGGPRKGVDEIYKEAGIEPPG------ELINFVPRQGVESIRTEEAVFIVL 255


>gi|385805568|ref|YP_005841966.1| hypothetical protein FFONT_0524 [Fervidicoccus fontis Kam940]
 gi|383795431|gb|AFH42514.1| hypothetical protein FFONT_0524 [Fervidicoccus fontis Kam940]
          Length = 278

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 138/308 (44%), Gaps = 52/308 (16%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           + ++ + +P   ++N       T  A  IARA +IF+++ + ++ ++S           +
Sbjct: 1   MKSIEILLPYCSLENEPDQIRITLKANIIARAASIFKVERITLYRSRS-----------D 49

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHE 177
           + +  D+ E     L  LL+Y  TP YL+K +F     L++VGM  PL  P H   +K +
Sbjct: 50  KKSCKDQKE----ILGLLLRYFVTPPYLKKKIFGKRKELKYVGMAYPLQIPTHSLSKKAK 105

Query: 178 WAPFREGVT--LKENAPNSVGTLVDVGLNKHVVV---DQVLDPG--VRVTVAMGTNRNLD 230
               R G+   +KE+A        D+G+ K V +   +  L  G  + V V  G    L+
Sbjct: 106 EGDIRVGLVERVKESA-----IFADIGVGKQVKISNSNYELKRGDLIFVKVKKGDLSELE 160

Query: 231 ADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---- 286
                        K   +Y GY VR   N     K     G    +IGTS+ GD +    
Sbjct: 161 I-----------IKNPDIYVGYSVRECENPLDYLKEKKKDG---LVIGTSKFGDPIWENL 206

Query: 287 NSSDLTLPTFRHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
              +  L +   +++ FG    G+ E  ++  G    N  ++FH  +N    QG++TIR 
Sbjct: 207 KEVEEKLKSSDKVVVVFGEPYRGIFEIFKQLGG----NTDDLFHGIYNLVRDQGTKTIRM 262

Query: 346 EEAIFISL 353
           EEA+FI+L
Sbjct: 263 EEALFIAL 270


>gi|448415420|ref|ZP_21578220.1| hypothetical protein C474_05545 [Halosarcina pallida JCM 14848]
 gi|445681078|gb|ELZ33519.1| hypothetical protein C474_05545 [Halosarcina pallida JCM 14848]
          Length = 278

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 130/317 (41%), Gaps = 72/317 (22%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV VP S++   +    ATR  G +ARA T+FR + +VVF                  
Sbjct: 2   TLSVLVPSSVVREAEDKREATRKLGYVARAATVFRANRLVVF-----------------P 44

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL--DAPHHLRKHEWA 179
           +R  EN  G  F+  +L+Y  TP YLRK ++     L++VG+LPPL   +      ++  
Sbjct: 45  DREGENRWGGEFVETVLRYAVTPPYLRKEVWGHRDELQYVGVLPPLLVSSTTGSESNDSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             ++G+  +      V   V+ GL   V +       L+ G RV + + +   + A    
Sbjct: 105 SLQQGIVTEVGPEGRV--RVNCGLQHPVSLYAPSGTELEEGERVAIRISSREPVRARIVD 162

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
           + VP           G+ +       +  +    +      I TS HG+      LT+P 
Sbjct: 163 EPVP-----------GFSI-----ARTDLEEALGRSDAGVTIATSRHGE-----SLTVPK 201

Query: 296 FRHL---------LIAFG----------GLAGLEESIEEDDGLKRKNAREVFHSYFNTCP 336
              L          + FG          G+   E ++E  DG         F  + NT P
Sbjct: 202 LAELSPRIDREGATVVFGSPGRGLPDILGINAEEVTVEPSDGPG-------FDLWLNTIP 254

Query: 337 HQGSRTIRTEEAIFISL 353
            QGS  +RTEEA+F SL
Sbjct: 255 RQGSEVVRTEEAMFASL 271


>gi|257387875|ref|YP_003177648.1| hypothetical protein Hmuk_1827 [Halomicrobium mukohataei DSM 12286]
 gi|257170182|gb|ACV47941.1| Protein of unknown function DUF171 [Halomicrobium mukohataei DSM
           12286]
          Length = 282

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 86/316 (27%), Positives = 134/316 (42%), Gaps = 66/316 (20%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S++   +    ATR  G +ARA  +FR+D + +F + + +  +         
Sbjct: 2   TTSVLVPSSLVREAEDRREATRKLGYVARAAVVFRVDRLTIFPDPAGAGKW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                 E G  F+  +L+Y  TP YLRK ++     L +VG+LPPL     +R    +  
Sbjct: 53  ------EDG--FVETVLRYAATPPYLRKEVWGKRDELEYVGVLPPL----RVRSQTGSGS 100

Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVVD-----QVLDPGVRVTVAMGTNRNLDA- 231
               +L++     VG      V+ GL   + +      +V   G RVTV + + R + A 
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGLQHPISLPVPSAMEVPGEGERVTVRVSSRRPVRAK 160

Query: 232 --DSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
             D P                GY V  A ++ +  +      G+   I  S HG+ +   
Sbjct: 161 LVDEPLT--------------GYVVDTA-DVDTALERSD--AGF--AIAASRHGEELGVD 201

Query: 290 DLTLPTFR-----HLLIAFG-------GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPH 337
            L     R     H+ +AFG        + GL+      DG    +A   F  + NT P+
Sbjct: 202 RLGQLVERRDDAGHMTVAFGAPERGLPAIFGLDPDDAVPDGTGDDDAG--FDLWLNTVPN 259

Query: 338 QGSRTIRTEEAIFISL 353
           QGS  +RTEEA+F +L
Sbjct: 260 QGSEVVRTEEAMFATL 275


>gi|336252441|ref|YP_004595548.1| hypothetical protein Halxa_1033 [Halopiger xanaduensis SH-6]
 gi|335336430|gb|AEH35669.1| protein of unknown function DUF171 [Halopiger xanaduensis SH-6]
          Length = 292

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 89/320 (27%), Positives = 133/320 (41%), Gaps = 64/320 (20%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR D +VV+ ++              +
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRADRLVVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
            R D       F+  +L+Y  TP YLR   + M   L + G+LPPL A        ++  
Sbjct: 51  GRFD-----GGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAVSQTGSESNDSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + +    D  +D G RVTV + + R + A    
Sbjct: 106 SLRQGIVTEVGPEGRV--RVNCGLQHPISLNTPPDMEVDEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
           + +P           G  V    ++S+    ++   +      I  S HG+ +    L  
Sbjct: 164 EPLP-----------GLSVEQT-DLSAALSREDAGVR------IAASRHGERLTVGRLET 205

Query: 294 PTFR----HLLIAFG-------GLAGLEESI-------EEDDGLKRKNAR--EVFHSYFN 333
              R     L +AFG        + G+E S        + DDG     A     F  + N
Sbjct: 206 LAGRIDRDGLTVAFGAPERGLPAILGIEASDVAAAYGGDADDGNVEPTADPGPGFDLWLN 265

Query: 334 TCPHQGSRTIRTEEAIFISL 353
           T P QGS  +RTEEA+F +L
Sbjct: 266 TVPDQGSEVVRTEEALFATL 285


>gi|156844294|ref|XP_001645210.1| hypothetical protein Kpol_1060p5 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156115869|gb|EDO17352.1| hypothetical protein Kpol_1060p5 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 377

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 67/234 (28%), Positives = 107/234 (45%), Gaps = 32/234 (13%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P SI+DN  +LE  T    QIA+  TI+ + E+V+      SDN +  S  +++ +
Sbjct: 51  SICIPTSILDNCHNLEQITSTLYQIAKTATIYNVGEIVILQISKKSDNNNNDSGTSKNQK 110

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
                S +  +  LLQY  TP YL   +F    H   ++   LP L A   +R    +  
Sbjct: 111 K---LSDSVLIASLLQYFVTPPYLVNTVFKKDYHKYFKYASKLPRLSALPFMRYLDEDNG 167

Query: 180 PFREGVTLKENAPNSVGT-----------LVDVGLNKHVVVDQVLDP-GVRVTVAMGTNR 227
            +REG+T++ + P+S               V+VG  + + +   L P  VRVTV      
Sbjct: 168 RYREGLTIRMSKPSSNNNSNKGKEFKQTKYVNVGKPEALELKTQLVPVNVRVTVDT---- 223

Query: 228 NLDADSPRQVVPPSKP----KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
                  ++VV P +       +   +GY VR A +  +VF  CS+K GY   +
Sbjct: 224 -----IEKKVVSPQEAYGDFYGAQFSYGYHVRIAQSFGNVFTECSFKNGYSQAV 272


>gi|126459649|ref|YP_001055927.1| hypothetical protein Pcal_1036 [Pyrobaculum calidifontis JCM 11548]
 gi|126249370|gb|ABO08461.1| Protein of unknown function DUF171 [Pyrobaculum calidifontis JCM
           11548]
          Length = 273

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 81/294 (27%), Positives = 127/294 (43%), Gaps = 46/294 (15%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+A+P   +  +       R  G +ARA  IF+   ++++                +  R
Sbjct: 3   SIAIPHDFLSESPDEASKVRKIGFLARAAAIFKAPLIIIYHY-------------GKPLR 49

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WAPF 181
            + +      +  LL+YL TP YLRK ++ +   L+  G+LPPL  P H+   E      
Sbjct: 50  EEID-----LMKTLLEYLATPPYLRKKVYKLDKRLKLAGLLPPLKIPSHVAPEEPRVGEV 104

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAM--GTNRNLDADSPRQVVP 239
           REGV  + +   S   LV +G  ++  V +    G R+ V +   T R+   D+ R  V 
Sbjct: 105 REGVVERWDGYYS---LVYIGGGRYAKVPKPYPIGTRLLVKIEAPTQRS---DTFRASVF 158

Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
              P     YWGY+V   P + ++ +      G+D ++ T + G  +  +++ L      
Sbjct: 159 KGPPP---AYWGYRVEVRP-LETLTE------GFDTVVLTGKEGKSICETNIRL--GEKT 206

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           LI FG      + I  + GLK            N  P QG  TIRTEEA FI L
Sbjct: 207 LIVFGSPRKGVDEIYREAGLKPPG------DLVNFVPGQGVETIRTEEAAFIVL 254


>gi|303283876|ref|XP_003061229.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457580|gb|EEH54879.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 313

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 64/205 (31%), Positives = 102/205 (49%), Gaps = 21/205 (10%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVF----DNKSSSDNYSRSSAANR 120
           +AV  ++ ++  SL   T LA + A A    + +EVVV+    ++ + +D +     ++ 
Sbjct: 3   LAVDAAVANSATSLAEVTHLAWRAACAAIKVKANEVVVYGGVKESVAKNDGHYSGPPSHA 62

Query: 121 SNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP----LDAPHHLRKH 176
           +NR       A  + R+L+Y++TP YLR AL + H  L  VG LP       A HHLR H
Sbjct: 63  ANRP------AVMIARVLEYVDTPPYLRNALLARHPDLALVGSLPRGGVLKGARHHLRIH 116

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDAD---- 232
           + + +REG+ +      S  T  +VGL   + +D+ +  GVRVTV +  NR   +D    
Sbjct: 117 DVSEYREGIVVAAANGASKSTRANVGLLSDLELDKHVKVGVRVTVRL-DNRCRASDGSTT 175

Query: 233 --SPRQVVPPSKPKESGMYWGYKVR 255
             + R V P      +G  W + VR
Sbjct: 176 PLTGRVVSPEEAALSNGGSWAFNVR 200


>gi|448726795|ref|ZP_21709187.1| hypothetical protein C448_09064 [Halococcus morrhuae DSM 1307]
 gi|445793841|gb|EMA44412.1| hypothetical protein C448_09064 [Halococcus morrhuae DSM 1307]
          Length = 276

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/311 (26%), Positives = 130/311 (41%), Gaps = 50/311 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+         AT   G +ARA T+FR+D V VF +      +         
Sbjct: 2   TRSVLVPSSLCREAADKREATHKVGLVARAATVFRVDRVTVFPDLDGDRRW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
                   G  F+  +L Y  T  YLRK  +     L + G+LPPL A        +   
Sbjct: 53  --------GGGFVSTVLAYAATAPYLRKEAWGTRDELEYAGVLPPLRAAARTGSESNGSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + +    +  +D G RVTV + +   + A    
Sbjct: 105 SSRQGI-VTEVGPDG-RVRVNCGLQHPISLVAPPEMAVDEGERVTVRISSREPVRA---- 158

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
           ++   S P       G+ V  A ++S+    ++  ++      + TS HG+++ +  L  
Sbjct: 159 RITDESPP-------GFAVERA-DLSAALGREDAGFR------VATSRHGEVLTTERLGT 204

Query: 294 PTFRH---LLIAFGGLA-GLEESIEEDDG-LKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
              R    L +AFG    GL   ++ D   +        F  + NT P QGS  +RTEEA
Sbjct: 205 LAGRADDGLTVAFGAPERGLPAMLDIDAASVASDTPHAGFDCWLNTVPKQGSEVVRTEEA 264

Query: 349 IFISLQYFQEP 359
           +F +L     P
Sbjct: 265 VFATLAPLTLP 275


>gi|347524197|ref|YP_004781767.1| hypothetical protein Pyrfu_1659 [Pyrolobus fumarii 1A]
 gi|343461079|gb|AEM39515.1| protein of unknown function DUF171 [Pyrolobus fumarii 1A]
          Length = 286

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 85/301 (28%), Positives = 136/301 (45%), Gaps = 40/301 (13%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           + + VP S++    SL+L T  AG I RA  +FR   +V++ +++ +    R     R  
Sbjct: 14  LEIVVPASVLSVEPSLQLKTVKAGVIGRAAAVFRASRIVLYVDRAEA---WRDLETFR-- 68

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMH-SSLRFVGMLPPLDAPHH-LRKHEWAP 180
                        +LL+YL TP YLRK ++      LR+ G+LPPL  P H +   +   
Sbjct: 69  -------------KLLEYLATPPYLRKRVYPPGVPELRYAGLLPPLQIPTHGVGGPKEGE 115

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
            RE   L++   + V   VD GL + V VD  L  GV+V         + + +P  +   
Sbjct: 116 IREAYVLRKRGRSVV---VDAGLEEEVEVD--LPRGVQVNRGERILVKIVSLTPPILEYV 170

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG----DIVNSSDLTLPTF 296
             P    +Y GY+V    ++    ++         LI TS  G    ++V+     L   
Sbjct: 171 ENPP---VYTGYRVATFDSLGDYLRSARKD---TLLIATSRRGRRIDEVVDELRKKLGEK 224

Query: 297 RHLLIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQY 355
           + + + FG    GL E I E +G+K     E   +  N  P+QG+ T+RTEEA++ +L  
Sbjct: 225 KRVALLFGSPREGLYE-IAEREGIKLD---ESVDAVVNVVPNQGTLTVRTEEAVWAALSL 280

Query: 356 F 356
            
Sbjct: 281 L 281


>gi|408404335|ref|YP_006862318.1| hypothetical protein Ngar_c17280 [Candidatus Nitrososphaera
           gargensis Ga9.2]
 gi|408364931|gb|AFU58661.1| protein of unknown function DUF171 [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 280

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 131/315 (41%), Gaps = 48/315 (15%)

Query: 60  VPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           +P + VA+P S + + Q+    +    Q ARA  IF++  + ++ +  S           
Sbjct: 1   MPNLWVAIPDSSLSDEQTRRDKSIKIAQFARACAIFQVKRIYIYHDSLS----------- 49

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH-----LR 174
           +  R D N      L  +L+YL+TPQYLRKAL+     L + G+L P+ APHH     ++
Sbjct: 50  QFERDDPN-----LLKTILRYLDTPQYLRKALYPRMHQLEYAGILHPIKAPHHKPPEDIK 104

Query: 175 KHEWAPFREGVTLKENAPNSVGTLVDVGLNKHV-VVDQVLDPGVRVTVAMGTNRNLDADS 233
           +      R GV  K          V+ GL   V  V Q  +         G   N+   S
Sbjct: 105 RVRAGDVRTGVITKVKG----QLFVEAGLGSLVPFVGQGFE---------GKKVNVKFIS 151

Query: 234 PRQVVPPSKPKESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
           P   +   +  E  +  YWGY+V+  P++  +    +       ++ TS  G    + + 
Sbjct: 152 PYPNLKAIEASEEDIFEYWGYEVKEVPSLGKLLLASAEN---TEVVITSRKGSYFKNVEA 208

Query: 292 TL----PTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEE 347
            L       +++L+ FG        I   +G   K     +    N  P+QG+ T+R EE
Sbjct: 209 KLIERAKNAQNILVVFGAPKHGVYDILAKEGASIKP----YEFVVNMFPNQGTETVRLEE 264

Query: 348 AIFISLQYFQEPISR 362
           A+  +L      I +
Sbjct: 265 AVLGTLAILNHSIGK 279


>gi|448311981|ref|ZP_21501734.1| hypothetical protein C493_08771 [Natronolimnobius innermongolicus
           JCM 12255]
 gi|445603602|gb|ELY57564.1| hypothetical protein C493_08771 [Natronolimnobius innermongolicus
           JCM 12255]
          Length = 294

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 86/326 (26%), Positives = 133/326 (40%), Gaps = 74/326 (22%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TI+R+D ++V+ +++              
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATIYRVDRLIVYPDRA-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
              +    G  F+  +L+Y  TP YLR   + M   L +VG+LPPL A        +   
Sbjct: 48  --GETGRFGDGFVSTVLRYAATPPYLRNEAWGMRDELEYVGVLPPLRAVSQTGSESNGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDA---D 232
             R+G+  +      V   V+ GL   + +    D  +  G RVTV + + R + A   D
Sbjct: 106 SLRQGIVTEVGPEGRV--RVNCGLQHPISLNTPPDMEVSEGERVTVRISSRRPVRAKLVD 163

Query: 233 SPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH---LIGTSEHGDIVNSS 289
            P                       P ++    + S   G +     I +S +G  +   
Sbjct: 164 EP----------------------LPGLAVERTDLSTALGREDAGVCIASSRYGIELTVG 201

Query: 290 DLTLPTFR---HLLIAFGG-------LAGLEES-----------IEEDDGLK-RKNAREV 327
            L     R    L +AFG        + G+E S              DDG++  ++A   
Sbjct: 202 RLETLAGRVGDGLTVAFGAPERGLPDILGIEASAIDADPSGQLDTAADDGVEPTQSADPG 261

Query: 328 FHSYFNTCPHQGSRTIRTEEAIFISL 353
           F  + NT P QGS  +RTEEA+F +L
Sbjct: 262 FDLWLNTVPDQGSEVVRTEEALFATL 287


>gi|297527411|ref|YP_003669435.1| hypothetical protein Shell_1448 [Staphylothermus hellenicus DSM
           12710]
 gi|297256327|gb|ADI32536.1| Protein of unknown function DUF171 [Staphylothermus hellenicus DSM
           12710]
          Length = 279

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 87/295 (29%), Positives = 134/295 (45%), Gaps = 38/295 (12%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           + VA+P SI+    SL L T    Q+ R  +IF + E+V F      D ++  S  N+  
Sbjct: 7   ILVALPTSILSTESSLLLKTMKIYQVIRYSSIFGVLEIVFF-----RDPFTDFSQHNKY- 60

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWAP 180
                   +A + ++ +YL TP YLR+ L      L+FVG+LPP  L+     R      
Sbjct: 61  --------SALIEKIWRYLLTPPYLRRKLIPKDPDLKFVGLLPPLRLNIFDVSRNGYVGE 112

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQV-LDPGVRVTVAMGTNRNLDADSPRQVVP 239
            R G   KE        L D+GL K   ++    +PG      +G    +D ++ R +  
Sbjct: 113 KRLGFIYKEKN----KLLADIGLLKPYRIEAGNCEPG-----DIGYVEIVDMNTRRAICL 163

Query: 240 PSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHL 299
             +P     Y G  + +A ++  V +   Y    D +I TS +G I N  +L     +  
Sbjct: 164 DEEP-----YRGPVLAFADSLREVLE--EYGRIVDLIIATSRYGRIPNYKELAGVKGKTT 216

Query: 300 LIAFGG-LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +I FGG   GL E I E +G   +N  +     +NT P Q  +TIR+EEA+  +L
Sbjct: 217 IILFGGPHRGLYE-IAEKEGFALENKVD---KVWNTIPEQMVKTIRSEEALISTL 267


>gi|448630851|ref|ZP_21673306.1| nucleic acid methylase [Haloarcula vallismortis ATCC 29715]
 gi|445755225|gb|EMA06615.1| nucleic acid methylase [Haloarcula vallismortis ATCC 29715]
          Length = 281

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 81/310 (26%), Positives = 137/310 (44%), Gaps = 55/310 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+    +    ATR  G +ARA  ++R+D ++V+ +   +  +         
Sbjct: 2   TTSVLVPSSLTREAEDRREATRKLGYVARAAAVYRVDRLIVYPDPDGAGKW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                 E G  F+  +L+Y  TP +LRK ++     L +VG+LPPL     +R    +  
Sbjct: 53  ------EDG--FVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPL----RVRSQTGSGS 100

Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADS 233
               +L++     VG      V+ G+   + +    D  ++ G RVTV + + R + A  
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGMQHPISLPVPADMDVEQGERVTVRVSSRRPVRA-- 158

Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
            + V  P+         G+ V     +++       +      I +S +G+ V S+ L  
Sbjct: 159 -KLVDAPTT--------GFDV-----VAADLDAALSRDDAGLTIASSRYGEPVTSTRLGQ 204

Query: 294 PTFRH-----LLIAFG----GLAG-LEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTI 343
              R      + +AFG    GL   L+ + +   G +  +  E F  + NT P+QGS  +
Sbjct: 205 LAERRDAEGGMTVAFGAPERGLPSILDVAPDAVGGDQTSDEPEGFDLWLNTVPNQGSEVV 264

Query: 344 RTEEAIFISL 353
           RTEEA+F SL
Sbjct: 265 RTEEALFASL 274


>gi|448298464|ref|ZP_21488493.1| hypothetical protein C496_02927 [Natronorubrum tibetense GA33]
 gi|445591660|gb|ELY45861.1| hypothetical protein C496_02927 [Natronorubrum tibetense GA33]
          Length = 290

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 84/306 (27%), Positives = 130/306 (42%), Gaps = 38/306 (12%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S+    +    ATR  G +ARA TIFR D ++V+ ++              +
Sbjct: 2   TVSILVPSSLTREAEDKREATRKLGYVARAATIFRADRLIVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
            R D       F+  +L+Y  TP YLR   + M   L + G+LPPL A            
Sbjct: 51  GRFD-----GGFVSTVLRYAATPPYLRNEAWGMRDELEYAGILPPLRAMSQTGSESDGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      ++ G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMEVEEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
             +P    + + +          VR A   +S F      G  + L G  E   +     
Sbjct: 164 DPLPGLSVERTDLSAALGREDAGVRIA---ASRFGENLTVGRLETLAGRVERDGMTVAFG 220

Query: 288 SSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEE 347
           + +  LP    L I    +A  + + + DDG++   A   F  + NT P QGS  +RTEE
Sbjct: 221 APERGLPDI--LDIEASAVAPADGTGDADDGVE-PTADPGFDLWLNTVPDQGSEVVRTEE 277

Query: 348 AIFISL 353
           A+F SL
Sbjct: 278 ALFASL 283


>gi|15790631|ref|NP_280455.1| hypothetical protein VNG1688C [Halobacterium sp. NRC-1]
 gi|169236370|ref|YP_001689570.1| hypothetical protein OE3386F [Halobacterium salinarum R1]
 gi|68068034|sp|Q06847.2|Y1688_HALSA RecName: Full=Uncharacterized protein VNG_1688C
 gi|10581158|gb|AAG19935.1| conserved hypothetical protein [Halobacterium sp. NRC-1]
 gi|167727436|emb|CAP14224.1| conserved hypothetical protein [Halobacterium salinarum R1]
          Length = 283

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 78/312 (25%), Positives = 129/312 (41%), Gaps = 57/312 (18%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S++   +    ATR  G +ARA  +FRID VVVF ++               
Sbjct: 2   TRSVLVPSSLVREAEDKREATRKLGYVARAAAVFRIDRVVVFPDEDG------------- 48

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWA 179
               E + G  F+  +L+Y  TP YLRK  F     L + G+LPP  L +          
Sbjct: 49  ----ERQWGGGFVETVLRYAATPPYLRKEAFDTRDELAYAGVLPPLRLSSWTGSDSSGSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ----VLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +  +   V   V+ G+   + + +     +  G RVT+ + + R + A    
Sbjct: 105 SLRQGIVTQVGSEGRV--RVNCGMQHPISLHEPPGMAVSEGERVTIRVSSRRPVRAKLVD 162

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
             +P    + +G+                 +   +      I TS HG+ ++ +  +L  
Sbjct: 163 DPLPGFSVERTGL----------------GDALDRSDAGVRIATSRHGEPLSVA--SLGG 204

Query: 296 FRH------LLIAFGG--------LAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSR 341
           +R       + +AFG         L    +++ E       +A   F ++ NT P QGS 
Sbjct: 205 YRERIARDGVTVAFGAPERGLPPMLGVSADAVNESVTDSSADAPARFDAWLNTIPDQGSE 264

Query: 342 TIRTEEAIFISL 353
            +RTEEA+  +L
Sbjct: 265 VVRTEEAVLATL 276


>gi|344231046|gb|EGV62931.1| DUF171-domain-containing protein [Candida tenuis ATCC 10573]
 gi|344231047|gb|EGV62932.1| hypothetical protein CANTEDRAFT_115920 [Candida tenuis ATCC 10573]
          Length = 358

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 137/338 (40%), Gaps = 59/338 (17%)

Query: 64  SVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYS-RSSAANR 120
           S+ +P S+I   N  +LE  T +A Q+A+A TI+ + E+++ D  +    Y    S AN+
Sbjct: 26  SICLPSSLISYANASNLEQITNIAYQVAKAATIYNVPEIIILDVPTIDAQYEILESKANK 85

Query: 121 SNRSD--------------------------ENESGAAFLVRLLQYLETPQYLRKALF-- 152
           + + D                          EN +       LLQ+  TP YL KA+   
Sbjct: 86  AVKIDKKIKFNEILEPPVQQQIVPNQPQNVQENHNNHELFESLLQFFITPPYLVKAMLKD 145

Query: 153 -SMHSSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAPNS--------VGTLVDVG 202
                  ++   LP L   P          F+EG+++ + +P +        V   V++G
Sbjct: 146 NKYTKKFKYAQTLPKLSTLPFMGNNGTHNDFKEGLSIPKKSPGNKKQRNKLKVTKYVNIG 205

Query: 203 LNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISS 262
             +   + Q +   VRVTV +   +N    SP         K S   +GY  R A N + 
Sbjct: 206 HAQPFELTQEVPVNVRVTVDL---KNKTIVSPLHAYGVIGYKSS---FGYHTRIARNFNE 259

Query: 263 VFKNCSYKGGYDHLI-------GTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEE 315
           +F   ++  GY   I        +SE  D  N+  L       +L+ FG    LE S ++
Sbjct: 260 IFTKSAFPTGYTSSIYVNCANYFSSEPADTTNNIQLNSSENNRVLLVFGNPKDLEFSFKQ 319

Query: 316 DDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           D  L   +  ++F       P     TI+ E+A+ I+L
Sbjct: 320 DKSLSVPSFTDLFDQKLCIPP-----TIKVEDAVLIAL 352


>gi|210076192|ref|XP_504197.2| YALI0E20669p [Yarrowia lipolytica]
 gi|199426946|emb|CAG79792.2| YALI0E20669p [Yarrowia lipolytica CLIB122]
          Length = 384

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/353 (23%), Positives = 149/353 (42%), Gaps = 79/353 (22%)

Query: 62  TVSVAVPGSIIDNT--QSLELATRLAGQIARAVTIFRIDEVVVFD------------NKS 107
           +V++ VP + I  T  +S + A   A QIARA   + +DE+V+++             KS
Sbjct: 46  SVTLVVPSTCISETVVKSYQQAVHTAYQIARAAVAYNVDEIVIYEPSERPDIKQEKTKKS 105

Query: 108 SSDNYSRSSAANRSNRSDENE----SGAAFLVRLLQYLETPQYLRKALFSMHSS---LRF 160
            + ++ +      S+ S++++      A  L  LLQ+  TP YLRK++F+  ++     +
Sbjct: 106 ETSSFGKPKIVFNSDASNDHKPELSERAVTLASLLQFFVTPDYLRKSIFTPETTTTNFEY 165

Query: 161 VGMLPPLDAPHHLRKHEWAPFREGVTL---------------KENAPNSVGTLVDV---- 201
              LP L  P    +   A ++EG+++               K N   +    +++    
Sbjct: 166 AKKLPKLLLPFMESRD--ARYKEGMSIPMILNKNVKSKISKQKRNVKKAEAKGIELQPPT 223

Query: 202 --GLNKHVVV----------DQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMY 249
             G   ++ +           Q +  GVRVTV    N+ + A            +  G  
Sbjct: 224 VDGATPYINIGAAEPHELEYGQTVPVGVRVTVDTKENKVVSAT-----------EAYGQD 272

Query: 250 WGYKVRYAPNISSVFKNCSYKGGYDHLI----GTSEHGDIVNSSD----LTLP-TFRHLL 300
            GY VR A + S +F   S   GYD+ +    G     ++   +     L LP T +  L
Sbjct: 273 TGYSVRVATDFSHIFTMPSTPEGYDYTVWCPGGEFRQSELAPETQPKPILALPRTVKRPL 332

Query: 301 IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             FG    +E++++ D  L    A+E+F S  +       R +R E+ +F++L
Sbjct: 333 FVFGQWDHVEDAVKHDTALGVSGAKELFDSKVD-----FKRPVRVEDGVFVAL 380


>gi|55378385|ref|YP_136235.1| nucleic acid methylase [Haloarcula marismortui ATCC 43049]
 gi|344212414|ref|YP_004796734.1| nucleic acid methylase [Haloarcula hispanica ATCC 33960]
 gi|448636988|ref|ZP_21675436.1| nucleic acid methylase [Haloarcula sinaiiensis ATCC 33800]
 gi|448648022|ref|ZP_21679500.1| nucleic acid methylase [Haloarcula californiae ATCC 33799]
 gi|448671479|ref|ZP_21687418.1| nucleic acid methylase [Haloarcula amylolytica JCM 13557]
 gi|448681420|ref|ZP_21691553.1| nucleic acid methylase [Haloarcula argentinensis DSM 12282]
 gi|57015353|sp|P20571.2|Y1612_HALMA RecName: Full=Uncharacterized protein rrnAC1612
 gi|55231110|gb|AAV46529.1| nucleic acid methylase [Haloarcula marismortui ATCC 43049]
 gi|343783769|gb|AEM57746.1| nucleic acid methylase [Haloarcula hispanica ATCC 33960]
 gi|445765294|gb|EMA16433.1| nucleic acid methylase [Haloarcula sinaiiensis ATCC 33800]
 gi|445766082|gb|EMA17219.1| nucleic acid methylase [Haloarcula amylolytica JCM 13557]
 gi|445767953|gb|EMA19046.1| nucleic acid methylase [Haloarcula argentinensis DSM 12282]
 gi|445775892|gb|EMA26887.1| nucleic acid methylase [Haloarcula californiae ATCC 33799]
          Length = 281

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/310 (26%), Positives = 136/310 (43%), Gaps = 55/310 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+    +    ATR  G +ARA  ++R+D + V+ +   +  +         
Sbjct: 2   TTSVLVPSSLAREAEDRREATRKLGYVARAAAVYRVDRLTVYPDPDGAGKW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                 E G  F+  +L+Y  TP +LRK ++     L +VG+LPPL     +R    +  
Sbjct: 53  ------EDG--FVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPL----RVRSQTGSGS 100

Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADS 233
               +L++     VG      V+ G+   + +    D  ++ G RVTV + + R + A  
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGMQHPISLPVPADMDVEQGERVTVRVSSRRPVRA-- 158

Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
            + V  P+         G+ V     +++       +      I +S +G+ V S+ L  
Sbjct: 159 -KLVDAPTT--------GFDV-----VAADLDAALSRDDAGLTIASSRYGEPVTSTRLGQ 204

Query: 294 PTFRH-----LLIAFG----GLAG-LEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTI 343
              R      + +AFG    GL   L+ + +   G +  +  E F  + NT P+QGS  +
Sbjct: 205 LAERRDAEGGMTVAFGAPERGLPSILDVAPDAVGGDQTSDEPEGFDLWLNTVPNQGSEVV 264

Query: 344 RTEEAIFISL 353
           RTEEA+F SL
Sbjct: 265 RTEEALFASL 274


>gi|302348986|ref|YP_003816624.1| hypothetical protein ASAC_1188 [Acidilobus saccharovorans 345-15]
 gi|302329398|gb|ADL19593.1| hypothetical protein ASAC_1188 [Acidilobus saccharovorans 345-15]
          Length = 277

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/306 (27%), Positives = 131/306 (42%), Gaps = 65/306 (21%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           V +P S++   Q L   T  AG I+RA+ IFR+DEV +F ++ S+               
Sbjct: 17  VLLPASVLSVEQDLRDKTYKAGLISRALAIFRVDEVRIFLDEDST--------------H 62

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREG 184
           D+ E+    L  LL Y   P +L+K +  +   L++ G++PPL+ P+HL   +     EG
Sbjct: 63  DDQET----LSELLNYQVVPPHLKKRVVGLSEKLKYAGIMPPLNLPNHLPPRD---LNEG 115

Query: 185 VTLKENAPNSVGTLVDV-------GLNK--HVVVDQVLDPGVRVTVAMGTNRNLDADSPR 235
             +        G+   V       GL K  H  V+ ++    RV    G    L+  S  
Sbjct: 116 DLIDVLIITRKGSQCSVYMGEAGEGLLKPCHFDVNDIVT--ARVIRRSGRQFELEPSS-- 171

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
                       +Y GYKV  +  + +  +    +G    L+GTS++G    S+D  L  
Sbjct: 172 ---------WGNIYTGYKVLRSGKLLTELQELKEQG--VALVGTSKYG----STDYWL-- 214

Query: 296 FRHLL-----IAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
            R LL     +  GG            GL +  +R+ F    N  P QG+ T+R+EEA+ 
Sbjct: 215 LRGLLGRPIALVLGG---------PKSGLLQYTSRKAFDLLINAAPLQGTETLRSEEALL 265

Query: 351 ISLQYF 356
            SL   
Sbjct: 266 ASLTLL 271


>gi|448735136|ref|ZP_21717353.1| hypothetical protein C450_17783 [Halococcus salifodinae DSM 8989]
 gi|445798749|gb|EMA49140.1| hypothetical protein C450_17783 [Halococcus salifodinae DSM 8989]
          Length = 279

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/314 (27%), Positives = 130/314 (41%), Gaps = 53/314 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S+         AT   G +ARA T+FRID V VF +      +         
Sbjct: 2   TVSLLVPSSLCREAADKREATHKVGLVARAATVFRIDRVTVFPDPDGDRRW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
                   G  F+  +L+Y  T  YLRK  +     L   G+LPPL A            
Sbjct: 53  --------GGGFVSTVLRYAATAPYLRKEAWGTRDELEQAGVLPPLRAVSRTGSESQGSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + +    D  LD G RVTV + +   + A    
Sbjct: 105 SSRQGI-VTEVGPDGR-VRVNCGLQHPISLVDPTDVGLDEGERVTVRISSREPVRA---- 158

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
           ++V    P       G+ V  A ++S+    ++   +      I TS HG+ + +  L +
Sbjct: 159 RIVDEPAP-------GFAVERA-DLSAALGREDAGLR------IATSRHGEALTTDRLGV 204

Query: 294 PTFR---HLLIAFGGLA-GLEESIEEDDGLKRKNAR----EVFHSYFNTCPHQGSRTIRT 345
              R    + +AFG    GL   +  D G     +       F  + +T P+QGS  +RT
Sbjct: 205 LAGRVEDGMTVAFGAPERGLPAMLGIDTGSVASGSETDEPAGFDRWLDTVPNQGSEVVRT 264

Query: 346 EEAIFISLQYFQEP 359
           EEA+F +L     P
Sbjct: 265 EEAVFATLACLTLP 278


>gi|50295076|ref|XP_449949.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49529263|emb|CAG62929.1| unnamed protein product [Candida glabrata]
          Length = 367

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 68/231 (29%), Positives = 108/231 (46%), Gaps = 31/231 (13%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           +V +P S++DN Q+L   T +  Q+AR +T+F + EVVV D K    N       + + +
Sbjct: 43  AVCIPTSVLDNCQNLSQKTYVVYQLARTLTLFNVAEVVVLDLKQEKQN--SIIEKDDTKK 100

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF--VGMLPPLDAPHHLRKHEWAPF 181
             +  S A  L  LLQY  TP YL K++F       F     LP L A   +R  E   +
Sbjct: 101 EKKRMSDAMLLASLLQYFVTPPYLIKSVFKKEYLPYFKEASKLPRLSALPFMRYMEENRY 160

Query: 182 REGVTLKENAPNSVG-------TLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADS 233
           REG+ ++ + P+            +++G +  + +   L P  VRVTV +     +D   
Sbjct: 161 REGLAIRMSKPDPKSKKEFKQTKYINIGESDALELKSQLVPINVRVTVDI-----ID--- 212

Query: 234 PRQVVPPSK-------PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
            ++VV P +       PK S   +GY VR A + + +F   S+  GY   +
Sbjct: 213 -KKVVAPEEAYGDYVGPKAS---YGYHVRIAKSFADLFMGSSFPSGYTQTV 259


>gi|307102254|gb|EFN50592.1| hypothetical protein CHLNCDRAFT_135058 [Chlorella variabilis]
          Length = 86

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 42/101 (41%), Positives = 53/101 (52%), Gaps = 20/101 (19%)

Query: 263 VFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRK 322
           + + C + GGY   +GTSE G+   +  L L  FRHLL+ FGG  GLE            
Sbjct: 1   MLEQCPFYGGYSLTVGTSERGERTPACQLDLGAFRHLLVVFGGPQGLE------------ 48

Query: 323 NAREVFHSYFNTCPHQGSRTIRTEEAIFISLQYFQEPISRA 363
                   Y NTC  QGSRTIR+EEAI ISL + Q  ++ A
Sbjct: 49  --------YLNTCFDQGSRTIRSEEAILISLAFLQPAVTAA 81


>gi|335437514|ref|ZP_08560290.1| hypothetical protein HLRTI_10401 [Halorhabdus tiamatea SARL4B]
 gi|334895862|gb|EGM34025.1| hypothetical protein HLRTI_10401 [Halorhabdus tiamatea SARL4B]
          Length = 301

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 134/333 (40%), Gaps = 81/333 (24%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T S  VP S+    +    ATR  G +ARA T+FR+D ++V+ +   +  +         
Sbjct: 2   TQSALVPSSLCREAEDKREATRKIGYVARAATVFRVDRLIVYPDPDGAGRW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
                   G  F+  +L+Y  TP YLRK ++     L  VG+LPPL           +  
Sbjct: 53  --------GDGFVETVLRYAATPPYLRKEVWDRRDELECVGVLPPLRVATQTGSGSDDSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDA---D 232
             R+G+  +  +   V   V+ G+   + +    D+ +  G RVTV + + R + A   D
Sbjct: 105 SLRQGIVTEVGSDGRV--RVNCGMQHPISLAVPGDRTIREGERVTVRISSRRPVRAKIVD 162

Query: 233 SPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL- 291
            P    PP          GY+V  A N+ +       +      I  S HG  + +  L 
Sbjct: 163 QP----PP----------GYQVDRA-NLDAALS----RPDAGVRIAASRHGQSLTTERLG 203

Query: 292 ----TLPTFRHLLIAFGG-----------------------LAGLEESIEEDD----GLK 320
               T+ +   + +AFG                        LA   ++ E +D    G  
Sbjct: 204 RLVGTIASAGDMTVAFGAPERGLPAILGVEPGAVDPCTVDELAAAVDAAERNDETVTGSG 263

Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
               R  F  + NT P+QGS  +RTEEA+F +L
Sbjct: 264 SDGPR--FDLWLNTVPNQGSEVVRTEEAMFATL 294


>gi|406602263|emb|CCH46156.1| hypothetical protein BN7_5744 [Wickerhamomyces ciferrii]
          Length = 355

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 73/274 (26%), Positives = 121/274 (44%), Gaps = 51/274 (18%)

Query: 46  KRKRDQLNDDAPIEVPTV-----SVAVPGSIID--NTQSLELATRLAGQIARAVTIFRID 98
           K+KR Q +D   ++ P       S+ +P SII+  N ++L+  T +  QIA+  +I+ + 
Sbjct: 4   KQKRSQQDDLTSVKKPKTRSINYSLCIPTSIINRSNAKNLQQVTNILYQIAKVASIYNVG 63

Query: 99  EVVVFDNKSSSD-----------NYSRSSAANRSNRSDE---NE--SGAAFLVRLLQYLE 142
           E+++ D ++  D           N  +    +  N S E   NE    A  +  +LQY  
Sbjct: 64  EIIIMDIQTPPDEKVNLNDVQDPNSKKKIKFDDDNTSTEKAKNEVSEDAMTIATILQYFV 123

Query: 143 TPQYLRKALF--SMHSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTL------------K 188
           TP YL K LF     S+ ++    P L     +     + +REG+T+            K
Sbjct: 124 TPPYLTKTLFKKKFFSNFQYAKNFPKLTTLPFMNPEVSSKYREGLTVTMGKTTKSQNKSK 183

Query: 189 ENAPNSVGTLVDVGLNKHVVVD-QVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESG 247
           ++ P      V++G +K++ +  Q +   VRVTV   T         ++VV P +  E  
Sbjct: 184 KSQPLKNTKFVNIGYDKYLELQGQQVPVNVRVTVNTET---------KKVVSPIEAYEEQ 234

Query: 248 M----YWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
           +     +GY VR A   +S+F   SY  GY   I
Sbjct: 235 VGAKSSYGYHVRIAKKFTSIFTESSYPDGYSQSI 268


>gi|429190345|ref|YP_007176023.1| hypothetical protein Natgr_0316 [Natronobacterium gregoryi SP2]
 gi|448326257|ref|ZP_21515624.1| hypothetical protein C490_12702 [Natronobacterium gregoryi SP2]
 gi|429134563|gb|AFZ71574.1| hypothetical protein Natgr_0316 [Natronobacterium gregoryi SP2]
 gi|445612914|gb|ELY66631.1| hypothetical protein C490_12702 [Natronobacterium gregoryi SP2]
          Length = 288

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 132/316 (41%), Gaps = 60/316 (18%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIF +D +VV+ ++               
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATIFGVDRLVVYPDRD-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
              +    G  F+  +L+Y  TP YLR  ++ M   L +VG+LPPL A        +   
Sbjct: 48  --GETGRFGDGFVSTVLRYAATPPYLRNEVWGMRDELEYVGVLPPLRATSQTGSESNGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDA---D 232
             R+G+  +      V   V+ GL   + +    D  +  G RVT+ + + R + A   D
Sbjct: 106 STRQGIVTEVGPEGRV--RVNCGLQHPISLNTPPDVEVAEGERVTIRISSRRPVRAKLVD 163

Query: 233 SPRQVVPPSKPKESGMYW--GYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
            P   +   +   S         VR A   +S F      G  + L G  E GD      
Sbjct: 164 CPLPGLSIERTDLSAALGREDAGVRIA---ASRFGEHLTVGRLETLAGRIE-GD------ 213

Query: 291 LTLPTFRHLLIAFGG-------LAGLEESIEE------DDGLKRKNAREVFHSYFNTCPH 337
                   + +AFG        + G+E S         DDG++   A   F  + NT P+
Sbjct: 214 -------GMTVAFGAPERGLPDILGIEASAISSSDEGGDDGVE-PTADPGFDLWLNTVPN 265

Query: 338 QGSRTIRTEEAIFISL 353
           QGS  +RTEEA+F +L
Sbjct: 266 QGSDVVRTEEALFATL 281


>gi|448329358|ref|ZP_21518658.1| hypothetical protein C489_09466 [Natrinema versiforme JCM 10478]
 gi|445614097|gb|ELY67778.1| hypothetical protein C489_09466 [Natrinema versiforme JCM 10478]
          Length = 291

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 86/319 (26%), Positives = 134/319 (42%), Gaps = 63/319 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR D +VV+               +R 
Sbjct: 2   TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLVVY--------------PDRD 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
             + + + G  F+  +L+Y  TP YLR   + M   L + G+LPPL A            
Sbjct: 48  GETGQFDGG--FVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      ++ G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
           + +P    +++ +          VR A   +S F      G  + L G ++   +     
Sbjct: 164 EPLPGLSIEQTDLRAALGREDAGVRIA---ASRFGEHLTVGRLETLAGRTKRDGMT---- 216

Query: 291 LTLPTFRHLLIAFG-------GLAGLEESIEE---------DDGLKRKNAREVFHSYFNT 334
                     +AFG        + G+EES  E         D+G++   A   F  + NT
Sbjct: 217 ----------VAFGAPERGLPAILGIEESAIEASSGQGDAADNGVE-PTADPGFDLWLNT 265

Query: 335 CPHQGSRTIRTEEAIFISL 353
            P QGS  +RTEEA+F +L
Sbjct: 266 VPDQGSEVVRTEEALFATL 284


>gi|126465912|ref|YP_001041021.1| hypothetical protein Smar_1015 [Staphylothermus marinus F1]
 gi|126014735|gb|ABN70113.1| Protein of unknown function DUF171 [Staphylothermus marinus F1]
          Length = 279

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 135/297 (45%), Gaps = 42/297 (14%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           + VA+P SI+    SL L T    Q+ R  +IF + E+V F      D ++  S      
Sbjct: 7   ILVALPTSILSTESSLLLKTIKTYQVIRYSSIFGVSEIVFF-----RDPFTDFS------ 55

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL-----DAPHHLRKHE 177
              ++   +  + ++ +YL TP YLR+ L      L+FVG+LPPL     D   + R  E
Sbjct: 56  ---QHRKYSVLIEKIWRYLLTPPYLRRKLIPKDPDLKFVGLLPPLRLSTFDVSRNGRVGE 112

Query: 178 WAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQV-LDPGVRVTVAMGTNRNLDADSPRQ 236
               R G+  +E        L D+GL +   ++     PG      +G    +D ++ R 
Sbjct: 113 K---RLGLIYREKN----KLLADIGLPRPYRIEAGNCKPG-----DIGYVEIVDVNTRRA 160

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           +    +P     Y G  + +A ++  V +   Y+   D +I TS++G I +  +L     
Sbjct: 161 ICLDVEP-----YRGPILAFADSLQEVLEE--YRKTVDLIIATSKYGKIPSHKELAGVKG 213

Query: 297 RHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           + ++I FGG       I + +G   +N  +     +NT P Q  +TIR+EEA+  +L
Sbjct: 214 KTIIILFGGPHRGLYDIAKKEGFILENKVD---KVWNTIPEQMVKTIRSEEALISTL 267


>gi|320580950|gb|EFW95172.1| hypothetical protein HPODL_3544 [Ogataea parapolymorpha DL-1]
          Length = 310

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 71/261 (27%), Positives = 114/261 (43%), Gaps = 38/261 (14%)

Query: 62  TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           ++++ +P S I   + ++LE AT  A QIARA   + + E+V+ D   + +       A 
Sbjct: 4   SITLCIPSSCISRRSCKNLEQATFTAYQIARAACTYNVGEIVILDVPENIE-------AV 56

Query: 120 RSNRSDENESGAA------FLVRLLQYLETPQYLRKALF--SMHSSLRFVGMLPPLDAPH 171
            SN++   E G +       L  LLQ+  TP YL K +F  S+    +    LP +    
Sbjct: 57  ESNKTKFEEKGTSASDSSRLLAALLQFFTTPPYLVKTVFQASVIEYFKIAKKLPLISTLP 116

Query: 172 HLRKHEWAPFREGVTL------KENAPNSV--------GTLVDVGLNKHVVVDQVLDPGV 217
           +++    + FREG+++      K+NA   V           V++G  K + +++ +    
Sbjct: 117 YMQNDAASCFREGISIPKKSVVKKNAEGKVLKKKKPTTTKFVNIGERKMLELEKEIPINT 176

Query: 218 RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
           RVTV   T R +  D         K       +GY VR A   SS+F    Y  GYD  I
Sbjct: 177 RVTVNTKTKRVVSPDDAYGKAGALKT------FGYHVRVATKFSSLFTEPGYTEGYDRCI 230

Query: 278 GTSEHGDIVNSSDLT-LPTFR 297
             S      N    T LP+++
Sbjct: 231 YVSSGDYFSNDQPATGLPSYQ 251


>gi|315425962|dbj|BAJ47611.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
 gi|343484754|dbj|BAJ50408.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 278

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 129/298 (43%), Gaps = 43/298 (14%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           +V +P S      S   AT   G + R +  FR+DE+V++ ++  +              
Sbjct: 5   AVFLPHSFTAEVPSFREATMKIGFVGRVLATFRVDELVLYPDEPGAPRLR---------- 54

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFRE 183
                  A  +  LL YL T  YLR  ++ +  SLR+ G+LPPL+ P H         +E
Sbjct: 55  ------NALLIKELLDYLVTAPYLRSRVYPLKPSLRYAGLLPPLNIPTHPEADALK--KE 106

Query: 184 GVTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPS 241
           G   ++    + G  +L++ G+ K + + + L     VT+ +   R  +      VV   
Sbjct: 107 GTHYRQALVTASGQTSLLEAGMGKPLKIPRKLPKNTIVTLRV---RVKEGRVKYSVV--- 160

Query: 242 KPKESG-MYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT----LPTF 296
             K+SG  Y G++ R   ++    K  +++      + TS  G+ +N    T    L   
Sbjct: 161 -GKKSGETYTGFRTRVVKSLEEALKPYTFR------LATSRLGEPINQVYATVAQRLKES 213

Query: 297 RHLLIAFGGL-AGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
               + FG    GL E I    GL+      +F    NT P+QG +TIRTEEA+  +L
Sbjct: 214 TQTCVVFGAADKGLHE-IARAHGLEYNR---LFTHTVNTAPNQGVKTIRTEEALAYTL 267


>gi|313125789|ref|YP_004036059.1| hypothetical protein Hbor_10220 [Halogeometricum borinquense DSM
           11551]
 gi|448285628|ref|ZP_21476869.1| hypothetical protein C499_02614 [Halogeometricum borinquense DSM
           11551]
 gi|312292154|gb|ADQ66614.1| uncharacterized conserved protein [Halogeometricum borinquense DSM
           11551]
 gi|445576264|gb|ELY30721.1| hypothetical protein C499_02614 [Halogeometricum borinquense DSM
           11551]
          Length = 278

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 128/317 (40%), Gaps = 72/317 (22%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV VP S++   +    ATR  G +ARA T+FR D ++VF                  
Sbjct: 2   TLSVLVPSSLVREAEDKREATRKLGYVARAATVFRADRLIVF-----------------P 44

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL--DAPHHLRKHEWA 179
           +R  E   G  F+  +L+Y  TP YLRK ++     L +VG+LPPL   +       +  
Sbjct: 45  DREGERRWGRKFVETVLRYAVTPPYLRKEVWGHRDELEYVGVLPPLLVSSTTGSESDDSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             ++G+ + E  P+     V+ G+   + +    D ++  G RV + + +   + A    
Sbjct: 105 SLQQGI-VTEVGPDG-RVRVNCGMQHPISLYTPSDMMVKEGERVAIRISSREPVRARIVD 162

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPT 295
           + VP           G+ V          +    +      I TS  G+      LT+P 
Sbjct: 163 EPVP-----------GFDVSRMD-----LEEALGRADAGITIATSRFGE-----SLTVPK 201

Query: 296 FRHL---------LIAFG----------GLAGLEESIEEDDGLKRKNAREVFHSYFNTCP 336
              L          + FG          G+   E ++E  DG         F  + NT P
Sbjct: 202 LADLSPRIEREGATVVFGSPGRGLPDILGIDAEEVTVEPSDGPG-------FDLWLNTIP 254

Query: 337 HQGSRTIRTEEAIFISL 353
            QGS  +RTEEA+F SL
Sbjct: 255 RQGSEVVRTEEAMFASL 271


>gi|448358803|ref|ZP_21547477.1| hypothetical protein C482_12774 [Natrialba chahannaoensis JCM
           10990]
 gi|445644483|gb|ELY97496.1| hypothetical protein C482_12774 [Natrialba chahannaoensis JCM
           10990]
          Length = 314

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 84/325 (25%), Positives = 135/325 (41%), Gaps = 52/325 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR+D ++V+ +++              
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRVDRLIVYPDRA-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WA 179
              D    G  F+  +L+Y  TP YLR   + M   L +VG+LPPL A            
Sbjct: 48  --GDTGRFGDEFVSTVLRYAATPPYLRNEAWGMRDELEYVGVLPPLRATSQTGSESDGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + ++      +  G RVT+ + + R + A    
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMAVSVGERVTIRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPN-------------ISSVFKNCSYKGGYDHLI 277
           + +P    + + +          VR A +             ++   +  + + GY    
Sbjct: 164 EPLPGLTVERTDLQAALSREDAGVRIAASRFGAELTVGRLGTLAGHTERDTTRDGYTIAF 223

Query: 278 GTSEHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIE------EDDGLKRKNARE---VF 328
           G  E G +    +L   T   L      LA  E+ +E      +DD +  +        F
Sbjct: 224 GAPERG-LPAILELEDTTVESLSAWPSWLAEDEDGVEPTSAAGDDDTVAAETDAADPGGF 282

Query: 329 HSYFNTCPHQGSRTIRTEEAIFISL 353
             + NT P QGS  +RTEEA+F +L
Sbjct: 283 DLWLNTVPDQGSDVVRTEEALFATL 307


>gi|452206449|ref|YP_007486571.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
 gi|452082549|emb|CCQ35807.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
          Length = 277

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 82/308 (26%), Positives = 130/308 (42%), Gaps = 54/308 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+    +    ATR  G +ARA T+FR+D ++V+ ++   D           
Sbjct: 2   THSVLVPSSLTREAEDKREATRKLGYVARAATVFRVDRLIVYPDQGGDDGRFGDGFVG-- 59

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                          +L+Y  TP YLRK  +     L + G+LPPL     +R    +  
Sbjct: 60  --------------TVLEYAATPPYLRKEAWGKRDELEYAGVLPPL----RVRTQTGSGS 101

Query: 182 REGVTLKENAPNSVGT----LVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADS 233
               +L++     VG+     V+ GL   + +       +  G RVT+ + +   + A  
Sbjct: 102 DGSGSLRQGIVTEVGSDGRVRVNCGLQHPISLPVPPSMAVSAGERVTIRVSSREPVRA-- 159

Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS---SD 290
            + V  PS         G+ V  A        +C  +      I  S HG+++      +
Sbjct: 160 -KLVADPSP--------GFAVERA-----TIDDCLARPDAGVRIAASRHGEVLTVDRLGE 205

Query: 291 LTLPTFRHLLIAFGGLA-GLEESIEEDDGLK----RKNAREVFHSYFNTCPHQGSRTIRT 345
           L+  T   L +AFG    GL E ++ +   +      NAR  F  + NT P+QGS  +RT
Sbjct: 206 LSGRTADGLTVAFGAPERGLPEILDCEPRAEAIGDEPNAR--FDLWLNTVPNQGSEVVRT 263

Query: 346 EEAIFISL 353
           EEA+F +L
Sbjct: 264 EEALFATL 271


>gi|448345584|ref|ZP_21534473.1| hypothetical protein C485_07297 [Natrinema altunense JCM 12890]
 gi|445633517|gb|ELY86704.1| hypothetical protein C485_07297 [Natrinema altunense JCM 12890]
          Length = 301

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 88/328 (26%), Positives = 137/328 (41%), Gaps = 71/328 (21%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR D +VV+               +R 
Sbjct: 2   TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLVVY--------------PDRD 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
             + + + G  F+  +L+Y  TP YLR   + M   L + G+LPPL A            
Sbjct: 48  GETGQFDGG--FVQTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      ++ G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
           + +P    +++ +          VR A   +S F +    G  + L G  +         
Sbjct: 164 EPLPGLSIEQTDLQAALGREDAGVRIA---ASRFGDELTVGRLETLAGRVQRD------- 213

Query: 291 LTLPTFRHLLIAFG-------GLAGLEES-IE----EDDGLKRKNARE------------ 326
                   + +AFG        + G+EES IE    +DD ++  + R+            
Sbjct: 214 -------GMTVAFGAPERGLPAILGIEESAIEPSSGQDDAIEPSSDRDDAADNGVEPTAD 266

Query: 327 -VFHSYFNTCPHQGSRTIRTEEAIFISL 353
             F  + NT P QGS  IRTEEA+F +L
Sbjct: 267 PGFDLWLNTVPDQGSEVIRTEEALFATL 294


>gi|257053358|ref|YP_003131191.1| hypothetical protein Huta_2291 [Halorhabdus utahensis DSM 12940]
 gi|256692121|gb|ACV12458.1| Protein of unknown function DUF171 [Halorhabdus utahensis DSM
           12940]
          Length = 301

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 133/333 (39%), Gaps = 81/333 (24%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T++  VP S+    +    ATR  G +ARA T+FR+D ++V+ +   +  +         
Sbjct: 2   TLNALVPSSLCREAEDKREATRKIGYVARAATVFRVDRLIVYPDPDGAGRW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
                   G  F+  +L+Y  TP YLRK  +     L  VG+LPPL           +  
Sbjct: 53  --------GDGFVETVLRYAATPPYLRKEAWERRDELGCVGVLPPLRVATQTGSGSDDSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDA---D 232
             R+G+  +  +   V   V+ G+   + +    D+ +  G RVTV + + R + A   D
Sbjct: 105 SLRQGIVTEVGSDGRV--RVNCGMQHPISLAVPGDRTIKEGERVTVRISSRRPVRAKIVD 162

Query: 233 SPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL- 291
            P    PP          GY+V  A N+ +              I  S HG  + +  L 
Sbjct: 163 QP----PP----------GYQVDRA-NLDAALSRPDAG----VRIAASRHGQSLTTERLG 203

Query: 292 ----TLPTFRHLLIAFGG-----------------------LAGLEESIEEDD----GLK 320
               T+ +   + +AFG                        LA   ++ E +D    G  
Sbjct: 204 RLAGTIASAGDMTVAFGAPERGLPAILGVEPGAVDPSTVDELAAAVDAAERNDETVTGSG 263

Query: 321 RKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
               R  F  + NT P+QGS  +RTEEA+F +L
Sbjct: 264 SDGPR--FDLWLNTVPNQGSEVVRTEEAMFATL 294


>gi|448350180|ref|ZP_21538999.1| hypothetical protein C484_11446 [Natrialba taiwanensis DSM 12281]
 gi|445637687|gb|ELY90835.1| hypothetical protein C484_11446 [Natrialba taiwanensis DSM 12281]
          Length = 299

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 82/312 (26%), Positives = 132/312 (42%), Gaps = 41/312 (13%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA T+FR D +VV+ ++              +
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW--A 179
            R D       F+  +L+Y  TP YLR  ++ M + L + G+LPPL A            
Sbjct: 51  GRFD-----GGFVSTVLRYAATPPYLRNEVWGMRNELEYAGVLPPLRAMSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + ++      LD G RVT+ + + R + A    
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMALDEGERVTIRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
             +P    + + +          VR A   +S +      G  D L G  +   +     
Sbjct: 164 DPLPGLSVERTDLQAALSREDAGVRIA---ASRYGTELTVGRLDTLAGRVDRDGMTVAFG 220

Query: 288 SSDLTLPTFRHLLI--AFGGLAGLEESIEEDDGLKRKNAR----EVFHSYFNTCPHQGSR 341
           + +  LP         A   L+      ++ DG++  +        F  + NT P+QGS 
Sbjct: 221 APERGLPDILEFESEDAVDTLSSWPPRSDDADGVEPTSGDVDPGRGFDLWLNTVPYQGSD 280

Query: 342 TIRTEEAIFISL 353
            +RTEEA+F +L
Sbjct: 281 VVRTEEALFATL 292


>gi|433638934|ref|YP_007284694.1| hypothetical protein Halru_1963 [Halovivax ruber XH-70]
 gi|433290738|gb|AGB16561.1| hypothetical protein Halru_1963 [Halovivax ruber XH-70]
          Length = 291

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 89/319 (27%), Positives = 129/319 (40%), Gaps = 63/319 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP SI    +    ATR  G +ARA TIFR D ++VF +               +
Sbjct: 2   TVSVLVPSSIAREAEDKREATRKLGYVARAATIFRADRLIVFPDGD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
            R D       F+  +L+Y  TP YL+   +     L +VG+LPPL A        +   
Sbjct: 51  GRFD-----GGFVETVLRYAATPPYLKTEAWGKRDELEYVGVLPPLRASSRTGSESNGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDA---D 232
             R G+  +      V   V+ GL   + ++     V+D G RVTV + + R + A   D
Sbjct: 106 SLRRGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMVVDEGERVTVRISSRRPVRAKLVD 163

Query: 233 SPRQVVPPSKPKESGMYW--GYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
            P   +   +   S         VR A   SS +      G  + L G  E         
Sbjct: 164 EPLAGLSIERTDLSAALGREDAGVRIA---SSRYGEELTTGRLETLAGRVER-------- 212

Query: 291 LTLPTFRHLLIAFGG-------LAGLEESIE---------EDDGLKRKNAREVFHSYFNT 334
                   + +AFG        + G+EES             DG++  +    F  + NT
Sbjct: 213 ------DGMTVAFGAPERGLPDILGIEESRAGASSSGDGGTADGVE-PSGDPGFDLWLNT 265

Query: 335 CPHQGSRTIRTEEAIFISL 353
            P+QGS  +RTEEA+F +L
Sbjct: 266 VPNQGSEVVRTEEAMFATL 284


>gi|448352596|ref|ZP_21541378.1| hypothetical protein C483_01224 [Natrialba hulunbeirensis JCM
           10989]
 gi|445642158|gb|ELY95228.1| hypothetical protein C483_01224 [Natrialba hulunbeirensis JCM
           10989]
          Length = 314

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 84/328 (25%), Positives = 138/328 (42%), Gaps = 58/328 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV VP S+    +    ATR  G +ARA TIFR+D ++V+ +++              
Sbjct: 2   TISVLVPSSLTREAEDKREATRKLGYVARAATIFRVDRLIVYPDRA-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WA 179
              D    G  F+  +L+Y  TP YLR   + M   L + G+LPPL A            
Sbjct: 48  --GDTGRFGDGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRATSQTGSESDGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + ++      +  G RVT+ + + R + A    
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMAVSEGERVTIRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPN-------------ISSVFKNCSYKGGYDHLI 277
           + +P    + + +          VR A +             ++   +  + + GY    
Sbjct: 164 EPLPGLTVERTDLQAALSREDAGVRIAASRFGAELTVGRLGTLAGHTERDTTRDGYTIAF 223

Query: 278 GTSEHG--DIVNSSDLTLPTFRHLLIAF-GGLAGLEESIE------EDDGLKRKNARE-- 326
           G  E G   I+   D T+ +    L A+   LA  E+ +E      +DD +  +      
Sbjct: 224 GAPERGLPAILELEDTTVES----LSAWPSWLADDEDGVEPTSAAGDDDTVAAETDAADP 279

Query: 327 -VFHSYFNTCPHQGSRTIRTEEAIFISL 353
             F  + NT P QGS  +RTEEA+F +L
Sbjct: 280 GGFDLWLNTVPDQGSGVVRTEEALFATL 307


>gi|448338135|ref|ZP_21527187.1| hypothetical protein C487_10519 [Natrinema pallidum DSM 3751]
 gi|445623310|gb|ELY76732.1| hypothetical protein C487_10519 [Natrinema pallidum DSM 3751]
          Length = 301

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 87/328 (26%), Positives = 138/328 (42%), Gaps = 71/328 (21%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR D +VV+               +R 
Sbjct: 2   TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLVVY--------------PDRD 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
             + + + G  F+  +L+Y  TP YLR   + M + L + G+LPPL A            
Sbjct: 48  GETGQFDGG--FVQTVLRYAATPPYLRNEAWGMRNELEYAGVLPPLRAMSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      ++ G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
           + +P    +++ +          VR A   +S F +    G  + L G  +         
Sbjct: 164 EPLPGLSIEQTDLQAALGREDAGVRIA---ASRFGDELTVGRLETLAGRVQRD------- 213

Query: 291 LTLPTFRHLLIAFG-------GLAGLEES-IE----EDDGLKRKNARE------------ 326
                   + +AFG        + G+EES IE    +DD ++  + R+            
Sbjct: 214 -------GMTVAFGAPERGLPAILGIEESAIEASSGQDDAIEPSSNRDDAADNGVEPTAD 266

Query: 327 -VFHSYFNTCPHQGSRTIRTEEAIFISL 353
             F  + NT P QGS  +RTEEA+F +L
Sbjct: 267 PGFDLWLNTVPDQGSEVVRTEEALFATL 294


>gi|448685259|ref|ZP_21693251.1| nucleic acid methylase [Haloarcula japonica DSM 6131]
 gi|445781870|gb|EMA32721.1| nucleic acid methylase [Haloarcula japonica DSM 6131]
          Length = 281

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 82/310 (26%), Positives = 136/310 (43%), Gaps = 55/310 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+    +    ATR  G +ARA  ++R+D + V+ +   +  +         
Sbjct: 2   TTSVLVPSSLAREAEDRREATRKLGYVARAAAVYRVDRLTVYPDPDGAGKW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                 E G  F+  +L+Y  TP +LRK ++     L +VG+LPPL     +R    +  
Sbjct: 53  ------EDG--FVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPL----RVRSQTGSGS 100

Query: 182 REGVTLKENAPNSVGTLVDVGLN---KHVVVDQV-----LDPGVRVTVAMGTNRNLDADS 233
               +L++     VG    V +N   +H +   V     ++ G RVTV + + R + A  
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGMQHPISLPVPAAMDVEQGERVTVRVSSRRPVRA-- 158

Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
            + V  P+         G+ V     +++       +      I +S +G+ V S+ L  
Sbjct: 159 -KLVDAPTT--------GFDV-----VAADLDAALSRDDAGLTIASSRYGEPVTSTRLGQ 204

Query: 294 PTFRH-----LLIAFG----GLAG-LEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTI 343
              R      + +AFG    GL   L+ + +   G +  +  E F  + NT P+QGS  +
Sbjct: 205 LAERRDAEGGMTVAFGAPERGLPSILDVAPDAVGGDQTSDEPEGFDLWLNTVPNQGSEVV 264

Query: 344 RTEEAIFISL 353
           RTEEA+F SL
Sbjct: 265 RTEEALFASL 274


>gi|401625618|gb|EJS43618.1| YGR283C [Saccharomyces arboricola H-6]
          Length = 342

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 109/247 (44%), Gaps = 30/247 (12%)

Query: 35  DSSSCDNKKKKKRKRDQLNDDAPIEVPT----VSVAVPGSIIDNTQSLELATRLAGQIAR 90
           D+S  + K  KK KR+ LN    I + +     S+ +P +IIDN  +LE  +  A QIA+
Sbjct: 8   DNSKSNFKVTKKIKRNILNLKKIINIKSGVINYSICIPTTIIDNCCNLEQVSFTAYQIAK 67

Query: 91  AVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKA 150
              +F I E++V D        S+     R  RS E  S    L  LLQY  TP  L   
Sbjct: 68  TAVLFNIQEIIVLD-------MSKDKKHERQPRSKETISDCLLLATLLQYFVTPPNLVDV 120

Query: 151 LFSM--HSSLRFVGMLPPLDAPHHLRKHEWAPFREGVTLKEN------APNSVGTLVDVG 202
            F     S L++    PPL     + +     + EG+++ E       +  ++ +LV +G
Sbjct: 121 TFKKKHRSYLKYAFTFPPLGQLPFMNEPADETYIEGLSIAEQTSEKDISDENLTSLVYIG 180

Query: 203 LNKHV-VVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNIS 261
            N  + +  Q +   VRVTV        D    + V P    K++ +  GY VR A ++ 
Sbjct: 181 KNNTIKLSSQNIPKTVRVTV--------DTLRKKVVSPRDAYKDTQL--GYYVRMASSLD 230

Query: 262 SVFKNCS 268
            V   C+
Sbjct: 231 EVSGGCT 237


>gi|448362305|ref|ZP_21550916.1| hypothetical protein C481_09647 [Natrialba asiatica DSM 12278]
 gi|445648826|gb|ELZ01774.1| hypothetical protein C481_09647 [Natrialba asiatica DSM 12278]
          Length = 300

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/327 (26%), Positives = 137/327 (41%), Gaps = 70/327 (21%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA T+FR D ++V+ ++              +
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLIVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW--A 179
            R D       F+  +L+Y  TP YLR  ++ M + L + G+LPPL A            
Sbjct: 51  GRFD-----GGFVSTVLRYAATPPYLRNEVWGMRNELEYAGVLPPLRAVSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + ++      LD G RVT+ + + R + A    
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMALDEGERVTIRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
           + +P    + + +          VR A   +S +      G  D L G   H   V+   
Sbjct: 164 EPLPGLSVERTDLQAALSREDAGVRIA---ASRYGTELTVGRLDTLAG---H---VDRDG 214

Query: 291 LTLPTFRHLLIAFG----GLAGLEESIEED----------------DGLKRKNAR----E 326
           +T        +AFG    GL  + E   ED                DG++  +       
Sbjct: 215 MT--------VAFGAPERGLPDILEFESEDAVDTLSSWPPRPDDDTDGVEPTSGDVDPGR 266

Query: 327 VFHSYFNTCPHQGSRTIRTEEAIFISL 353
            F  + NT P+QGS  +RTEEA+F +L
Sbjct: 267 GFDLWLNTVPYQGSDVVRTEEALFATL 293


>gi|448309693|ref|ZP_21499549.1| hypothetical protein C494_18167 [Natronorubrum bangense JCM 10635]
 gi|445589550|gb|ELY43780.1| hypothetical protein C494_18167 [Natronorubrum bangense JCM 10635]
          Length = 300

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 89/336 (26%), Positives = 130/336 (38%), Gaps = 88/336 (26%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S+    +    ATR  G +ARA TIFR D +VV+ ++               
Sbjct: 2   TVSILVPSSLTREAEDKREATRKLGYVARAATIFRADRLVVYPDR--------------- 46

Query: 122 NRSDENESG---AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKH 176
               E E+G     F+  +L+Y  TP YLR   + M   L + G+LPPL A         
Sbjct: 47  ----EGETGRFDGGFVSTVLRYAATPPYLRNEAWGMRDELEYAGILPPLRAMSQTGSEST 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDA- 231
                R+G+  +      V   V+ GL   + ++      +D G RVTV + + R + A 
Sbjct: 103 GSGSSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMEVDEGERVTVRISSRRPVRAK 160

Query: 232 --DSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH---LIGTSEHGDIV 286
             D P                       P +S    + S   G +     I  S  G+ +
Sbjct: 161 LEDKP----------------------LPGLSVERTDLSAALGREDAGVCIAASRFGEEL 198

Query: 287 NSSDLTLPTFR----HLLIAFGG-------LAGLEESI------------------EEDD 317
               L     R     L +AFG        + G+E S                     DD
Sbjct: 199 TVGRLGTLAGRIDRDGLTVAFGAPERGLPDILGIEASAVNASSGQGDAAESSTGDDTADD 258

Query: 318 GLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           G++   A   F  + NT P+QGS  +RTEEA+F +L
Sbjct: 259 GVE-PTADPGFDLWLNTVPNQGSEVVRTEEALFATL 293


>gi|448377805|ref|ZP_21560501.1| hypothetical protein C479_14683 [Halovivax asiaticus JCM 14624]
 gi|445655749|gb|ELZ08594.1| hypothetical protein C479_14683 [Halovivax asiaticus JCM 14624]
          Length = 291

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 89/319 (27%), Positives = 129/319 (40%), Gaps = 63/319 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP SI    +    ATR  G +ARA TIFR D ++VF +               +
Sbjct: 2   TVSVLVPSSIAREAEDKREATRKLGYVARAATIFRADRLIVFPDGD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
            R D       F+  +L+Y  TP YL+   +     L +VG+LPPL A        +   
Sbjct: 51  GRFD-----GGFVETVLRYAATPPYLKTEAWGKRDELEYVGVLPPLRASSRTGSESNGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDA---D 232
             R G+  +      V   V+ GL   + ++     V+D G RVTV + + R + A   D
Sbjct: 106 SLRRGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMVVDEGERVTVRISSRRPVRAKLVD 163

Query: 233 SPRQVVPPSKPKESGMYW--GYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
            P   +   +   S         VR A   SS +      G  + L G  E         
Sbjct: 164 EPLAGLSIERTDLSAALGREDAGVRIA---SSRYGEELTAGRLETLAGRVER-------- 212

Query: 291 LTLPTFRHLLIAFGG-------LAGLEESIE---------EDDGLKRKNAREVFHSYFNT 334
                   + +AFG        + G+EES             DG++  +    F  + NT
Sbjct: 213 ------DGMTVAFGAPERGLPDILGIEESRAGASSSGDGGAADGVE-PSGDPGFDLWLNT 265

Query: 335 CPHQGSRTIRTEEAIFISL 353
            P+QGS  +RTEEA+F +L
Sbjct: 266 VPNQGSGVVRTEEAMFATL 284


>gi|448386088|ref|ZP_21564296.1| hypothetical protein C478_18131 [Haloterrigena thermotolerans DSM
           11522]
 gi|445655986|gb|ELZ08828.1| hypothetical protein C478_18131 [Haloterrigena thermotolerans DSM
           11522]
          Length = 291

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 82/306 (26%), Positives = 131/306 (42%), Gaps = 37/306 (12%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S+    +    ATR  G +ARA T+FR D ++V+ ++              +
Sbjct: 2   TVSILVPSSLSREAEDKREATRKLGYVARAATVFRADRLLVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
            R D       F+  +L+Y  TP YLR   + M   L + G+LPPL A        +   
Sbjct: 51  GRFD-----GGFVETVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAVSQTGSESNGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      ++ G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLED 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
             +P    + + +          VR A   +S F      G  + L G  E   +     
Sbjct: 164 VPLPGLAVERTDLSTALGREDAGVRIA---ASRFGEELTVGRLETLAGRIEGDGMTVAFG 220

Query: 288 SSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEE 347
           + +  LP    +  +  G A  + +  EDDG++   A   F  + NT P QGS  +RTEE
Sbjct: 221 APERGLPDILEIEASAVG-APKDAAAGEDDGVE-PTADPGFDLWLNTVPDQGSEVVRTEE 278

Query: 348 AIFISL 353
           A+F +L
Sbjct: 279 ALFATL 284


>gi|435848749|ref|YP_007310999.1| hypothetical protein Natoc_3480 [Natronococcus occultus SP4]
 gi|433675017|gb|AGB39209.1| hypothetical protein Natoc_3480 [Natronococcus occultus SP4]
          Length = 291

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/328 (27%), Positives = 135/328 (41%), Gaps = 81/328 (24%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+    +    ATR  G +ARA T+FR D +VVF ++               
Sbjct: 2   TASVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVFPDR--------------- 46

Query: 122 NRSDENESG---AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW 178
               E E G     F+  +L+Y  TP YLR  ++ M   L + G+LPPL A         
Sbjct: 47  ----EGERGRFDGEFVDTVLRYATTPPYLRNEVWGMRDELEYAGILPPLRAVS------- 95

Query: 179 APFREGVTLKENAPNSVGTLVDVGLNKHVVVD--------------QVLDPGVRVTVAMG 224
              R G    ++  +  G + +VG ++ V V+                L+ G RVTV + 
Sbjct: 96  ---RTGSESNDSGSSRQGIVTEVGPDQRVRVNCGLQHPISLNVPPSMELEEGERVTVRIS 152

Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD 284
           + R + A    ++V  S P       G  V +    +++ +  +        I  S  G 
Sbjct: 153 SRRPVRA----KLVDDSLP-------GIDVEWTDLEAALGREDA-----GVRIAASRFGA 196

Query: 285 IVNSSDL-TLPTFRH---LLIAFG----GLAGL----EESIEE-------DDGLKRKNAR 325
            +    L TL   R    + +AFG    GL  +    EE+++        D G     + 
Sbjct: 197 ELTVGRLGTLAGRRERDGMTVAFGAPERGLPAILGMDEETVDALSRWDGVDGGGVEPTSD 256

Query: 326 EVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             F  + NT P QGS  +RTEEA+F +L
Sbjct: 257 PGFDLWLNTVPDQGSDVVRTEEALFATL 284


>gi|448369940|ref|ZP_21556393.1| hypothetical protein C480_16265 [Natrialba aegyptia DSM 13077]
 gi|445650380|gb|ELZ03304.1| hypothetical protein C480_16265 [Natrialba aegyptia DSM 13077]
          Length = 299

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 84/320 (26%), Positives = 132/320 (41%), Gaps = 57/320 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA T+FR D +VV+ ++              +
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEW--A 179
            R D       F+  +L+Y  TP YLR  ++ M + L + G+LPPL A            
Sbjct: 51  GRFD-----GGFVSTVLRYAATPPYLRNEVWGMRNELEYAGVLPPLRAMSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDA---D 232
             R+G+ + E  P+     V+ GL   + ++      +D G RVT+ + + R + A   D
Sbjct: 106 SLRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPKMAVDEGERVTIRISSRRPVRAKLVD 163

Query: 233 SP-------RQVVPPSKPKESGMYWGYKVRYAPNIS----SVFKNCSYKGGYDHLIGTSE 281
            P       R  +  +  +E         RY   ++            + G     G  E
Sbjct: 164 DPLPGLSVERTDLQAALSREDAGVRIAASRYGTELTVGRLGTLAGRVERDGMTVAFGAPE 223

Query: 282 HG--DIV--NSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAR----EVFHSYFN 333
            G  DI+   S D           A   L+      ++ DG++  +        F  + N
Sbjct: 224 RGLPDILEFESED-----------AVDTLSSWPPRSDDADGVEPTSGDVDPGRGFDLWLN 272

Query: 334 TCPHQGSRTIRTEEAIFISL 353
           T P+QGS  +RTEEA+F +L
Sbjct: 273 TVPYQGSDVVRTEEALFATL 292


>gi|397773844|ref|YP_006541390.1| hypothetical protein NJ7G_2073 [Natrinema sp. J7-2]
 gi|448341391|ref|ZP_21530352.1| hypothetical protein C486_06988 [Natrinema gari JCM 14663]
 gi|397682937|gb|AFO57314.1| hypothetical protein NJ7G_2073 [Natrinema sp. J7-2]
 gi|445628073|gb|ELY81384.1| hypothetical protein C486_06988 [Natrinema gari JCM 14663]
          Length = 301

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/315 (26%), Positives = 136/315 (43%), Gaps = 45/315 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR D +VV+               +R 
Sbjct: 2   TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLVVY--------------PDRD 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
             + + + G  F+  +L+Y  TP YLR   + M   L + G+LPPL A            
Sbjct: 48  GETGQFDGG--FVQTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      ++ G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
           + +P    +++ +          VR A   +S F +    G  + L G  +   +     
Sbjct: 164 EPLPGLSIEQTDLQAALGREDAGVRIA---ASRFGDALTVGRLETLAGRVQRDGMTVAFG 220

Query: 288 SSDLTLPTFRHL-LIAFGGLAGLEESIE--------EDDGLKRKNAREVFHSYFNTCPHQ 338
           + +  LP    +   A    +G +++++         D+G++   A   F  + NT P Q
Sbjct: 221 APERGLPAILGIEESAIDASSGRDDAVDPSSNRDDAADNGVE-PTADPGFDLWLNTVPDQ 279

Query: 339 GSRTIRTEEAIFISL 353
           GS  +RTEEA+F +L
Sbjct: 280 GSEVVRTEEALFATL 294


>gi|448411358|ref|ZP_21575823.1| hypothetical protein C475_15889 [Halosimplex carlsbadense 2-9-1]
 gi|445670546|gb|ELZ23145.1| hypothetical protein C475_15889 [Halosimplex carlsbadense 2-9-1]
          Length = 304

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/325 (26%), Positives = 132/325 (40%), Gaps = 62/325 (19%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+    +    ATR  G +ARA  IFR+D + V+ +    +           
Sbjct: 2   TTSVLVPASLTREAEDKREATRKVGYVARAAAIFRVDRLTVYPDPDGVE----------- 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                 + GA F+  +L+Y  TP YLRK ++     L + G+LP       LR   W   
Sbjct: 51  ------KRGAGFVETVLRYAATPPYLRKEVWDRRGELEYAGVLP------ALRVRSWT-- 96

Query: 182 REGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTN--------------- 226
             G    E+     G + +VG ++ V V+  L   V + V  G                 
Sbjct: 97  --GSGSDESGSLRQGIVTEVGADERVRVNCGLQHPVSLPVPSGMEVGEGERVSVRVSSRR 154

Query: 227 ----RNLDADSPRQVVPPS-------KPKESGMYWGYKVRYAPNISSVFKNC---SYKGG 272
               + +D  +   VV  +       +P ++G+      RY   ++    +      +GG
Sbjct: 155 PVRAKLVDEPAAGFVVDSTDIDDALARP-DAGLTIAAS-RYGEELTVTKLDQLVERLRGG 212

Query: 273 YDHLIGTSEHG--DIVNSSDLTLPTFRHLLIAFGGLA-GLEESIEED-DGLKRKNAREVF 328
            D      E G  D  ++SD        L +AFG    GL   + ED D +  +   + F
Sbjct: 213 DDGNASGVETGGSDESDASDDEAAPLGDLTVAFGAPERGLPPILGEDPDTVAAETGDDRF 272

Query: 329 HSYFNTCPHQGSRTIRTEEAIFISL 353
             + NT P+QGS  +RTEEA+F +L
Sbjct: 273 DLWLNTVPNQGSEVVRTEEAVFATL 297


>gi|289579897|ref|YP_003478363.1| hypothetical protein Nmag_0209 [Natrialba magadii ATCC 43099]
 gi|448281137|ref|ZP_21472445.1| hypothetical protein C500_01488 [Natrialba magadii ATCC 43099]
 gi|289529450|gb|ADD03801.1| Protein of unknown function DUF171 [Natrialba magadii ATCC 43099]
 gi|445579461|gb|ELY33855.1| hypothetical protein C500_01488 [Natrialba magadii ATCC 43099]
          Length = 316

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/330 (25%), Positives = 139/330 (42%), Gaps = 60/330 (18%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR+D ++V+ ++               
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRVDRLIVYPDRD-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHE--WA 179
              D    G  F+  +L+Y  TP YLR   + M   L +VG+LPPL A            
Sbjct: 48  --GDTGRFGDEFVSTVLRYAATPPYLRNEAWGMRDELEYVGVLPPLRATSQTGSESDGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + ++      +  G RVT+ + + R + A    
Sbjct: 106 SLRQGI-VTEVGPDQ-RVRVNCGLQHPISLNVPPKMAVSVGERVTIRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGYK-----VRYAPN-------------ISSVFKNCSYKGGYDHLI 277
           + +P    + + +          VR A +             ++   +  + + G+    
Sbjct: 164 EPLPGLTVERTDLQAALSREDAGVRIAASRFGAELTVGRLGTLAGHTERDTTRDGFTIAF 223

Query: 278 GTSEHG--DIVNSSDLTLPTFRHLLIAF-GGLAGLEESIE------EDDGLKRKNAREV- 327
           G  E G   I+   D T+ +    L A+   LA  E+ +E      +DD +  +   +  
Sbjct: 224 GAPERGLPAILELEDTTVES----LSAWPSWLAEDEDGVEPTSAVGDDDTVAAETDADAA 279

Query: 328 ----FHSYFNTCPHQGSRTIRTEEAIFISL 353
               F  + NT P QGS  +RTEEA+F +L
Sbjct: 280 DPGGFDLWLNTVPDQGSDVVRTEEALFATL 309


>gi|448323164|ref|ZP_21512628.1| hypothetical protein C491_19399 [Natronococcus amylolyticus DSM
           10524]
 gi|445600350|gb|ELY54363.1| hypothetical protein C491_19399 [Natronococcus amylolyticus DSM
           10524]
          Length = 290

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 83/313 (26%), Positives = 130/313 (41%), Gaps = 52/313 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA T+FR D +VVF ++               
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVFPDRE-------------- 47

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
              +    G  F+  +L+Y  TP YLR   + M   L + G+LPPL A        +   
Sbjct: 48  --GERGRFGGEFVDTVLRYATTPPYLRNEAWGMRDELEYAGILPPLRAVSQTGSESNGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + ++      L+ G RVTV + + R + A    
Sbjct: 106 SSRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPGMELEEGERVTVRISSRRPVRAKLED 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGD---IVN 287
           + +P    + + +          VR A   +S F      G    L G  E      +  
Sbjct: 164 EPLPGLDVERTDLQAALGREDAGVRIA---ASRFGAELTVGRLGTLAGRRERDGMTVVFG 220

Query: 288 SSDLTLPTFRHLLIAFGGLAGLEE----SIEEDDGLKRKNAREV---FHSYFNTCPHQGS 340
           + +  LP           + G++E    ++   DG+       +   F  + NT P QGS
Sbjct: 221 APERGLPD----------ILGMDEEAVDALSNWDGVDGDGVEPIDPGFDLWLNTVPDQGS 270

Query: 341 RTIRTEEAIFISL 353
             +RTEEA+F +L
Sbjct: 271 DVVRTEEALFATL 283


>gi|448302865|ref|ZP_21492815.1| hypothetical protein C495_01155 [Natronorubrum sulfidifaciens JCM
           14089]
 gi|445593872|gb|ELY48039.1| hypothetical protein C495_01155 [Natronorubrum sulfidifaciens JCM
           14089]
          Length = 300

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 132/314 (42%), Gaps = 44/314 (14%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S+    +    ATR  G +ARA TIFR D ++V+ ++               
Sbjct: 2   TVSILVPSSLTREAEDKREATRKLGYVARAATIFRADRLIVYPDR--------------- 46

Query: 122 NRSDENESG---AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKH 176
               E E+G     F+  +L+Y  TP YLR   + M   L + G+LPPL A         
Sbjct: 47  ----EGETGRFDGGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSEST 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDAD 232
                R+G+  +      V   V+ GL   + ++      +D G RVTV + + R + A 
Sbjct: 103 GSGSSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMEVDEGERVTVRISSRRPVRAK 160

Query: 233 SPRQVVPPSKPKESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
              + +P    + + +    G +       +S F      G  + L G  +   +     
Sbjct: 161 LENKPLPGLAVERTDLSAALGREDAGVCIAASRFGEELTVGRLETLAGRIDRDGLTVAFG 220

Query: 288 SSDLTLPTFRHL-LIAFGGLAGLEESIEE-------DDGLKRKNAREVFHSYFNTCPHQG 339
           + +  LP    +   A    +G  ++ E        DDG++   A   F  + NT P+QG
Sbjct: 221 APERGLPDILGIEASAVNASSGQGDAAESSTGDDTADDGVE-PTADPGFDLWLNTVPNQG 279

Query: 340 SRTIRTEEAIFISL 353
           S  +RTEEA+F +L
Sbjct: 280 SEVVRTEEALFATL 293


>gi|433590193|ref|YP_007279689.1| hypothetical protein Natpe_0869 [Natrinema pellirubrum DSM 15624]
 gi|448332401|ref|ZP_21521645.1| hypothetical protein C488_03535 [Natrinema pellirubrum DSM 15624]
 gi|433304973|gb|AGB30785.1| hypothetical protein Natpe_0869 [Natrinema pellirubrum DSM 15624]
 gi|445627505|gb|ELY80829.1| hypothetical protein C488_03535 [Natrinema pellirubrum DSM 15624]
          Length = 291

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 130/303 (42%), Gaps = 31/303 (10%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S+    +    ATR  G +ARA T+FR D ++V+ ++              +
Sbjct: 2   TVSILVPSSLSREAEDKREATRKLGYVARAATVFRADRLLVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
            R D       F+  +L+Y  TP YLR   + M   L + G+LPPL A        +   
Sbjct: 51  GRFD-----GGFVETVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAVSQTGSESNGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      ++ G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLED 163

Query: 236 QVVPPSKPKESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---NSSD 290
             +P    + + +    G +       +S F      G  + L G  E   +     + +
Sbjct: 164 VPLPGLAVERTDLSTALGREDAGVCIAASRFGEELTVGRLETLAGRIEGDGMTVAFGAPE 223

Query: 291 LTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
             LP    +  +  G A  + +  EDDG++   A   F  + NT P QGS  +RTEEA+F
Sbjct: 224 RGLPDILEIEASAVG-APKDAAAGEDDGVE-PTADPGFDLWLNTVPDQGSEVVRTEEALF 281

Query: 351 ISL 353
            +L
Sbjct: 282 ATL 284


>gi|448315111|ref|ZP_21504764.1| hypothetical protein C492_01898 [Natronococcus jeotgali DSM 18795]
 gi|445612370|gb|ELY66096.1| hypothetical protein C492_01898 [Natronococcus jeotgali DSM 18795]
          Length = 290

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 88/322 (27%), Positives = 131/322 (40%), Gaps = 70/322 (21%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA T+FR D +VVF ++               
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATVFRADRLVVFPDR--------------- 46

Query: 122 NRSDENESG---AAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKH 176
               E E G     F+  +L+Y  TP YLR   + M   L + G+LPPL A         
Sbjct: 47  ----EGERGRFDGEFVATVLRYATTPPYLRNEAWGMRDELEYAGILPPLRAVSQTGSESD 102

Query: 177 EWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDAD 232
           +    R+G+ + E  P+     V+ GL   + ++      L+ G RVTV + + R + A 
Sbjct: 103 DSGSSRQGI-VTEVGPDQR-VRVNCGLQHPISLNVPPGMELEEGERVTVRVSSRRPVRAK 160

Query: 233 SPRQVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
                +P    + + +          VR A   +S F      G    L G  E      
Sbjct: 161 LVDGPLPGLSIERTDLEAALGREDAGVRIA---ASRFGAELTVGRLGTLAGRRERD---- 213

Query: 288 SSDLTLPTFRHLLIAFG-------GLAGLEESIEEDDGLKRKNARE---------VFHSY 331
                      + +AFG        + G++E  E  D L R++  +          F  +
Sbjct: 214 ----------GMTVAFGAPERGLPAILGMDE--EAVDALSRRDGVDGDGVEPTDPGFDLW 261

Query: 332 FNTCPHQGSRTIRTEEAIFISL 353
            NT P QGS  +RTEEA+F +L
Sbjct: 262 LNTVPDQGSDVVRTEEALFATL 283


>gi|374109570|gb|AEY98475.1| FAGL342Cp [Ashbya gossypii FDAG1]
          Length = 424

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 78/293 (26%), Positives = 125/293 (42%), Gaps = 58/293 (19%)

Query: 41  NKKKKKRKRDQLNDDAPIEVPTVS-------VAVPGSIIDNTQSLELATRLAGQIARAVT 93
           NKK +K  +       P +V  VS       + +P SI+DN  +LE  T    QIA+A T
Sbjct: 32  NKKAQKASKPAARPKRPKKVLKVSSKTLNYTLCIPTSILDNCANLEQVTHALYQIAKAAT 91

Query: 94  IFRIDEVVVFDNKSSSDNYSRSSAANR--------------------SNRSDENESGAAF 133
           +F + E+VV D  + +D  +   AA+                     + +S    SGA  
Sbjct: 92  LFNVGEIVVLDLGTDTDAAAPHLAASELEPRKRKIKFEDVTEKTETSNKKSASRLSGAML 151

Query: 134 LVRLLQYLETPQYLRKALFSMHSSLRF--VGMLPPLDAPHHLR--KHEWAPFREGVTL-- 187
           +  LLQY  TP+YL  ++F    S  F     LP L A   +R  + +   +REG+++  
Sbjct: 152 IATLLQYFVTPRYLVSSVFKKKYSSYFNCAAKLPRLSALPFMRHLQTDQGRYREGLSVNM 211

Query: 188 ----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
                   P++    +++G +  + +  Q +   VRVTV    N+N        VV P++
Sbjct: 212 AKPAAAKKPSTRTKYINIGKDTMLELRSQQVPVNVRVTVDT-VNKN--------VVSPAE 262

Query: 243 P----KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
                  +   +GY VR A   + +F    + GGY   +        VNS+D 
Sbjct: 263 AYGDYSGANAAYGYSVRVARTFADIFLGAGFAGGYTQSLW-------VNSADF 308


>gi|238881614|gb|EEQ45252.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 371

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 87/350 (24%), Positives = 144/350 (41%), Gaps = 66/350 (18%)

Query: 62  TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD----NKSSSDNYS-- 113
           T+S+ +P ++I   N  +L+  T +  QIA+A TI+++ E++VFD    N    D+ S  
Sbjct: 25  TISICIPSTVISSKNAYNLQQITNIIYQIAKACTIYKVAEIIVFDVPQSNNDKEDDKSTV 84

Query: 114 -------------RSSAANRSNRSDEN---ESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
                          + AN S +S+ N    S +  +  LLQ+  TP YL K +FS H +
Sbjct: 85  VVGSKVKFVEDEPEKTLANPSKKSNVNGDKVSPSLLVASLLQFFITPPYLVKTMFSSHLN 144

Query: 158 LRFVGMLPPLDAPHHLRKHEWAP----------FREGVTLKENAP--------------N 193
            +F  +LP       L K    P          F+EG+ +    P               
Sbjct: 145 SKFKNILPKFTYAFKLPKITTLPFMQNNQVYKDFKEGMIIPRETPLMKKKNKKTKSDHKI 204

Query: 194 SVGTLVDVGLNKHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWG 251
           +V   V++G  + + +D   +  +  RVTV +   +N    SP Q       K +   +G
Sbjct: 205 TVSKYVNIGEKEALKLDIKREIPIYSRVTVDL---KNKTVVSPLQAYGVVGHKAA---FG 258

Query: 252 YKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT----LPTFR----HLLIAF 303
           Y VR A   + +F       GY   I  +      N++ ++    L TF+    ++L+  
Sbjct: 259 YHVRMASEFNKIFTQSPISDGYSSTIYVNCDDYFGNNNKISELDNLSTFKEATGNVLMVL 318

Query: 304 GGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           G    L+   + D     +N   V H + +          R E+AI ISL
Sbjct: 319 GNYKDLQAGFKADTSNVFENIDSVAHLFDSKL--NIPDGCRIEDAILISL 366


>gi|68466013|ref|XP_722781.1| hypothetical protein CaO19.12032 [Candida albicans SC5314]
 gi|68466308|ref|XP_722636.1| hypothetical protein CaO19.4563 [Candida albicans SC5314]
 gi|46444625|gb|EAL03898.1| hypothetical protein CaO19.4563 [Candida albicans SC5314]
 gi|46444780|gb|EAL04052.1| hypothetical protein CaO19.12032 [Candida albicans SC5314]
          Length = 371

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 138/344 (40%), Gaps = 54/344 (15%)

Query: 62  TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD----NKSSSDNYS-- 113
           T+S+ +P ++I   N  +L+  T +  QIA+A TI+++ E++VFD    N    D+ S  
Sbjct: 25  TISICIPSTVISSKNAYNLQQITNIIYQIAKACTIYKVAEIIVFDVPQSNNDKEDDKSTV 84

Query: 114 -------------RSSAANRSNRSDEN---ESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
                          + AN S +S+ N    S +  +  LLQ+  TP YL K +FS H +
Sbjct: 85  VVGSKVKFVEDEPEKTLANPSKKSNVNGDKVSPSLLVASLLQFFITPPYLVKTMFSSHLN 144

Query: 158 LRFVGMLPPLDAPHHLRKHEWAP----------FREGVTLKENAPNSVGTLVDVGLNKHV 207
            +F  +LP       L K    P          F+EG+ +    P           +  +
Sbjct: 145 SKFKNILPKFTYAFKLPKITTLPFMQNNQVYKDFKEGMIIPRETPLMKKKNKKTKSDHKI 204

Query: 208 VVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMY----------WGYKVRYA 257
            V + ++ G +  + +   R +   S   V   +K   S +           +GY VR A
Sbjct: 205 TVSKYVNIGEKEALKLNIKREIPIYSRVTVDLKNKTVVSPLQAYGVVGHKAAFGYHVRMA 264

Query: 258 PNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT----LPTFR----HLLIAFGGLAGL 309
              + +F       GY   I  +      N++ ++    L TF+    ++L+  G    L
Sbjct: 265 SEFNKIFTQSPISDGYSSTIYVNCDDYFGNNNKISELDNLSTFKEATGNVLMVLGNYKDL 324

Query: 310 EESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +   + D     +N   V H + +          R E+AI ISL
Sbjct: 325 QAGFKADTSNVFENIDSVAHLFDSKL--NIPDGCRIEDAILISL 366


>gi|45200755|ref|NP_986325.1| AGL342Cp [Ashbya gossypii ATCC 10895]
 gi|44985453|gb|AAS54149.1| AGL342Cp [Ashbya gossypii ATCC 10895]
          Length = 424

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 78/293 (26%), Positives = 125/293 (42%), Gaps = 58/293 (19%)

Query: 41  NKKKKKRKRDQLNDDAPIEVPTVS-------VAVPGSIIDNTQSLELATRLAGQIARAVT 93
           NKK +K  +       P +V  VS       + +P SI+DN  +LE  T    QIA+A T
Sbjct: 32  NKKAQKASKPAARPKRPKKVLKVSSKTLNYTLCIPTSILDNCANLEQVTHALYQIAKAAT 91

Query: 94  IFRIDEVVVFDNKSSSDNYSRSSAANR--------------------SNRSDENESGAAF 133
           +F + E+VV D  + +D  +   AA+                     + +S    SGA  
Sbjct: 92  LFNVGEIVVLDLGTDTDAAAPHLAASELEPRKRKIKFEDVTEKTETSNKKSASRLSGAML 151

Query: 134 LVRLLQYLETPQYLRKALFSMHSSLRF--VGMLPPLDAPHHLR--KHEWAPFREGVTL-- 187
           +  LLQY  TP+YL  ++F    S  F     LP L A   +R  + +   +REG+++  
Sbjct: 152 IATLLQYFVTPRYLVSSVFKKKYSSYFNCAAKLPRLSALPFMRHLQTDQGRYREGLSVNM 211

Query: 188 ----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
                   P++    +++G +  + +  Q +   VRVTV    N+N        VV P++
Sbjct: 212 AKPAAAKKPSTRTKYINIGKDTMLELRSQQVPVNVRVTVDT-VNKN--------VVSPAE 262

Query: 243 P----KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL 291
                  +   +GY VR A   + +F    + GGY   +        VNS+D 
Sbjct: 263 AYGDYSGANAAYGYSVRVARTFADIFLGAGFAGGYTQSLW-------VNSADF 308


>gi|367016219|ref|XP_003682608.1| hypothetical protein TDEL_0G00300 [Torulaspora delbrueckii]
 gi|359750271|emb|CCE93397.1| hypothetical protein TDEL_0G00300 [Torulaspora delbrueckii]
          Length = 357

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 69/257 (26%), Positives = 109/257 (42%), Gaps = 38/257 (14%)

Query: 42  KKKKKRKRDQLNDDAPIEVPTV--SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDE 99
           +KKK +   +L     I   TV  S+ +P +++ N ++LE  T    QIA+  TIF + E
Sbjct: 12  QKKKVKPVAKLKKSIKILSKTVNYSLCIPTTVLANCRNLEQITFAIYQIAKTATIFNVGE 71

Query: 100 VVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLR 159
           +VV +    S+         +S + + + S A  +  LLQY  TP YL   +F    +  
Sbjct: 72  IVVLNLGDRSE---------KSKKHENSLSDAMLIASLLQYFVTPPYLVNTVFKKQYTRY 122

Query: 160 F--VGMLPPLDAPHHL--RKHEWAPFREGVTLKENAPNSVGT-----------LVDVGLN 204
           F     LP L A   +  R  +   +REG+ ++   P                 +++G  
Sbjct: 123 FQAAAKLPRLSALPFMRHRNEDKGRYREGLAIRMTKPGQASIKKKNKEFKQTKFINIGKA 182

Query: 205 KHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKE---SGMYWGYKVRYAPNI 260
           + + +   L P  VRVTV        D    R V P     +   +   +GY VR A N 
Sbjct: 183 EPLELKSQLVPVNVRVTV--------DTVEQRVVSPQEAYGDFVGANASFGYHVRVANNF 234

Query: 261 SSVFKNCSYKGGYDHLI 277
            S+F  C++  GY   I
Sbjct: 235 GSIFTECAFPNGYSQAI 251


>gi|409731025|ref|ZP_11272575.1| hypothetical protein Hham1_17426 [Halococcus hamelinensis 100A6]
 gi|448724783|ref|ZP_21707288.1| hypothetical protein C447_16559 [Halococcus hamelinensis 100A6]
 gi|445784992|gb|EMA35788.1| hypothetical protein C447_16559 [Halococcus hamelinensis 100A6]
          Length = 275

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 128/313 (40%), Gaps = 54/313 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+         AT   G +ARA T+FR+D V VF +   +  +         
Sbjct: 2   TRSVLVPSSLCREASDKREATHKVGLVARAATVFRVDRVTVFPDDGGNRRW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                   G  F+  +L Y  T  YLRK  +     L +VG+LPPL     + +    P 
Sbjct: 53  --------GGGFVATVLAYAATAPYLRKEAWGTREELEYVGVLPPL---RPVPRTGSGPQ 101

Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVVDQVLDPGV------RVTVAMGTNRNLDA 231
             G +L++     VG      V+ GL   V +  V+ PGV      RVTV + +   + A
Sbjct: 102 SSG-SLRQGIVTEVGPDGRVRVNCGLQHPVSL--VVPPGVEVGEGERVTVRISSREPVRA 158

Query: 232 DSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNS--- 288
               ++V    P       G+ V      +++ +  +        + TS HG+ + +   
Sbjct: 159 ----RIVDEPAP-------GFAVDRTDLPAALGREDA-----GLRVATSRHGEALTADGL 202

Query: 289 SDLTLPTFRHLLIAFGGLA-GLEESIEED-DGLKRKNAREVFHSYFNTCPHQGSRTIRTE 346
             L       L +AFG    GL   +  D + +        F  + NT P+QGS  +RTE
Sbjct: 203 GALAGGPGGDLTVAFGAPERGLPAMLGIDVESVTSAEPNGGFDRWLNTVPNQGSEVVRTE 262

Query: 347 EAIFISLQYFQEP 359
           EA+F +L     P
Sbjct: 263 EAVFATLACLTLP 275


>gi|366992812|ref|XP_003676171.1| hypothetical protein NCAS_0D02290 [Naumovozyma castellii CBS 4309]
 gi|342302037|emb|CCC69810.1| hypothetical protein NCAS_0D02290 [Naumovozyma castellii CBS 4309]
          Length = 378

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 72/269 (26%), Positives = 112/269 (41%), Gaps = 31/269 (11%)

Query: 41  NKKKKKRKRDQLNDDAPIEVPT----VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFR 96
           NKKK K           I+V +     S+ +P +I+ N  +LE  T +  QIA+A TIF 
Sbjct: 14  NKKKSKVLEPTKKSKTSIKVLSKKLNYSLCIPTTILANCTNLEQITHVVYQIAKAATIFN 73

Query: 97  IDEVVVFD-NKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMH 155
             E+V+ D         +      +  ++D   S +  +  LLQY  TP YL   +F   
Sbjct: 74  AGEIVILDMGDRKETKITEEKEEKKKKKTDARLSDSMLIASLLQYFVTPPYLVSTVFKKQ 133

Query: 156 SSLRFV--GMLPPLDAPHHLR--KHEWAPFREGVTLKENAPNSVGT---------LVDVG 202
               F     LP L A   +R    +   +REG+ ++   P S             +++G
Sbjct: 134 FMKYFTVASKLPRLSALPFMRYLNEDNGRYREGLAIRMTKPGSNSNSKKEFKQTKYINIG 193

Query: 203 LNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNIS 261
             K++ + + L P  VRVTV    NR +   SP +        ++   +GY VR A    
Sbjct: 194 KEKNLELKKQLVPVNVRVTVDTIENRVV---SPEEAYGDYVGAQAS--YGYHVRIAKTFG 248

Query: 262 SVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
            +F  C++  GY   +        VNS D
Sbjct: 249 DIFTQCAFPSGYSQTVW-------VNSGD 270


>gi|255720905|ref|XP_002545387.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240135876|gb|EER35429.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 334

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 75/307 (24%), Positives = 131/307 (42%), Gaps = 57/307 (18%)

Query: 59  EVPTVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVF-----------DN 105
           E  ++S+ +P S+I   N  +LE  T +  QIA+A TI++I E++VF           DN
Sbjct: 9   EETSISICIPSSVISSKNAYNLEQITNIIYQIAKAATIYKISEIIVFKAPKEEEEEEDDN 68

Query: 106 KSS-----------SDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSM 154
           KS             D  ++SS  N + +  E ++       LLQ+  TP YL K++FS 
Sbjct: 69  KSKEKSKKIVFEEDDDTNNKSSMNNNTQKEQEEDNSTLLAASLLQFFITPPYLVKSMFSP 128

Query: 155 HSSLRFVGMLPPLDAPHHLRKHEWAP----FREGVTLKENAP--------------NSVG 196
           H + +F  +LP       L K         ++EG+ +    P               +V 
Sbjct: 129 HLNKKFSNILPKFKYAFKLPKITTIDMDKNYKEGMIIPRETPLIKKKNKKVKADHKITVS 188

Query: 197 TLVDVGLNKHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKV 254
             V++G ++ + +D   +  +  RVTV +   +N    S ++    S  K +   +GY V
Sbjct: 189 KYVNIGESEAIKLDIKREIPIYSRVTVDL---KNKTIVSAKKAYGNSGHKNA---FGYHV 242

Query: 255 RYAP--NISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR----HLLIAFGGLAG 308
           R     + + VF   S   GY + I  +   D   ++ + LP       ++L+  G L  
Sbjct: 243 RMIDDGDFNKVFIQSSIPDGYSNTIYVN-CNDYFGNNKIDLPEINEIKGNILVVLGNLKD 301

Query: 309 LEESIEE 315
            +  + +
Sbjct: 302 YQPDVHQ 308


>gi|284165491|ref|YP_003403770.1| hypothetical protein Htur_2217 [Haloterrigena turkmenica DSM 5511]
 gi|284015146|gb|ADB61097.1| Protein of unknown function DUF171 [Haloterrigena turkmenica DSM
           5511]
          Length = 301

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 127/314 (40%), Gaps = 43/314 (13%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR D ++V+ ++              +
Sbjct: 2   TVSVLVPSSLTREAEDKREATRKLGYVARAATIFRADRLIVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
            R D       F+  +L+Y  TP YLR   + M   L + G+LPPL A            
Sbjct: 51  GRFD-----GGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      ++ G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVEEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIV---N 287
           + +P    + + +          VR A   SS +      G  + L G  E   +     
Sbjct: 164 EPLPGLSVERTDLQTALGREDAGVRIA---SSRYGEELTVGRLETLAGRIEGDGMTVAFG 220

Query: 288 SSDLTLPTFRHL-LIAFGGLAGLEESIEEDDGLKRKNAREV-------FHSYFNTCPHQG 339
           + +  LP    +   A    +G  ++IE            V       F  + NT P QG
Sbjct: 221 APERGLPAILGIEASAIEASSGQGDAIESSSDRDDAADDGVEPTADPGFDLWLNTVPDQG 280

Query: 340 SRTIRTEEAIFISL 353
           S  +RTEEA+F +L
Sbjct: 281 SEVVRTEEALFATL 294


>gi|398366411|ref|NP_011799.3| hypothetical protein YGR283C [Saccharomyces cerevisiae S288c]
 gi|1723785|sp|P53336.1|YG5X_YEAST RecName: Full=Uncharacterized protein YGR283C
 gi|1323517|emb|CAA97314.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|285812471|tpg|DAA08371.1| TPA: hypothetical protein YGR283C [Saccharomyces cerevisiae S288c]
          Length = 341

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 59/217 (27%), Positives = 94/217 (43%), Gaps = 25/217 (11%)

Query: 55  DAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
           D    V   S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+
Sbjct: 32  DIESNVVKYSICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SK 84

Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHH 172
                + +RS E  S    L  LLQY  TP  L    F   +   L+     PPL+    
Sbjct: 85  DKKHEKKSRSKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPPLNQLPF 144

Query: 173 LRKHEWAPFREGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTN 226
           +       ++EG+++     K  + +++  LV +G N+ + + +Q +    RVTV     
Sbjct: 145 MNASAEQHYKEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----- 199

Query: 227 RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
                D+ R+ V        G   GY VR A  ++ V
Sbjct: 200 -----DTERKEVVSPIDAYKGKPLGYHVRMASTLNEV 231


>gi|241954318|ref|XP_002419880.1| unnamed protein product [Candida dubliniensis CD36]
 gi|223643221|emb|CAX42095.1| unnamed protein product [Candida dubliniensis CD36]
          Length = 371

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 144/356 (40%), Gaps = 78/356 (21%)

Query: 62  TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD----NKSSSD----- 110
           TVS+ +P ++I   N  +L+  T +  QIA+A TI+++ E+VVFD    N    D     
Sbjct: 25  TVSICIPSTVISPKNAYNLQQITNIIYQIAKACTIYKVAEIVVFDVPELNNEQEDVKSTV 84

Query: 111 -------------NYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
                          S +S   +S+ + ++ S +  +  LLQ+  TP YL K +FS H +
Sbjct: 85  VVGSKVKFVEDEPEKSLASPNKKSSVNGDDVSPSLLVASLLQFFITPPYLVKTMFSSHLN 144

Query: 158 LRFVGMLPPLDAPHHLRKHEWAPF----------REGVTLKENAP--------------N 193
            +F  +LP       L K    PF          +EG+ +    P               
Sbjct: 145 SKFKNILPKFTYAFKLPKITTLPFMQNNQVYKVLKEGMIIPRETPLMRKKNKKTKSDHKI 204

Query: 194 SVGTLVDVGLNKHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWG 251
           +V   V++G  + + ++   +  +  RVTV +   +N    SP Q       K +   +G
Sbjct: 205 TVSKYVNIGEKEALKLEIKREIPIYSRVTVDL---KNKTVVSPLQAYGVIGHKAA---FG 258

Query: 252 YKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL----TLPTFR----HLLIAF 303
           Y VR A   + +F       GY   I  +      N++ +     L TF+    ++L+  
Sbjct: 259 YHVRMASEFNKIFTQSPISDGYSSTIYVNCDDYFGNNNKIKELDNLSTFKEATGNVLMVL 318

Query: 304 GGLAGLEESIEEDDG---LKRKNAREVFHSYFNT---CPHQGSRTIRTEEAIFISL 353
           G    L+   + D        +N  ++F S  N    C        R E+AI ISL
Sbjct: 319 GNYKDLQAGFKADSSNVFQNVENVAQLFDSKLNIPDGC--------RIEDAILISL 366


>gi|151943555|gb|EDN61866.1| conserved protein [Saccharomyces cerevisiae YJM789]
 gi|349578483|dbj|GAA23649.1| K7_Ygr283cp [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 341

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 59/217 (27%), Positives = 94/217 (43%), Gaps = 25/217 (11%)

Query: 55  DAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
           D    V   S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+
Sbjct: 32  DIESNVVKYSICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SK 84

Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHH 172
                + +RS E  S    L  LLQY  TP  L    F   +   L+     PPL+    
Sbjct: 85  DKKHEKKSRSKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPPLNQLPF 144

Query: 173 LRKHEWAPFREGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTN 226
           +       ++EG+++     K  + +++  LV +G N+ + + +Q +    RVTV     
Sbjct: 145 MNASAEQHYKEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----- 199

Query: 227 RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
                D+ R+ V        G   GY VR A  ++ V
Sbjct: 200 -----DTERKEVVSPIDAYKGKPLGYHVRMASTLNEV 231


>gi|448392362|ref|ZP_21567136.1| hypothetical protein C477_13040 [Haloterrigena salina JCM 13891]
 gi|445664452|gb|ELZ17161.1| hypothetical protein C477_13040 [Haloterrigena salina JCM 13891]
          Length = 301

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 88/328 (26%), Positives = 128/328 (39%), Gaps = 71/328 (21%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVSV VP S+    +    ATR  G +ARA TIFR D ++V+ ++              +
Sbjct: 2   TVSVLVPSSLSREAEDKREATRKLGYVARAATIFRADRLIVYPDRD-----------GET 50

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHH--LRKHEWA 179
            R D       F+  +L+Y  TP YLR   + M   L + G+LPPL A            
Sbjct: 51  GRFD-----GGFVSTVLRYAATPPYLRNEAWGMRDELEYAGVLPPLRAMSQTGSESTGSG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVD----QVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+  +      V   V+ GL   + ++      +D G RVTV + + R + A    
Sbjct: 106 SSRQGIVTEVGPEGRV--RVNCGLQHPISLNVPPKMAVDEGERVTVRISSRRPVRAKLVD 163

Query: 236 QVVPPSKPKESGMYWGY-----KVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
           + +P    + + +          VR A   +S F      G    L G  E GD      
Sbjct: 164 EPLPGLSVERTDLQTALGREDAGVRIA---ASRFGEELTVGRLGTLAGRIE-GD------ 213

Query: 291 LTLPTFRHLLIAFG-------GLAGLEES-IEEDDGLK-----------------RKNAR 325
                   + +AFG        + G+E S IE   G                      A 
Sbjct: 214 -------GMTVAFGAPERGLPAILGIEASAIEASSGQGDVIEPSSDRDDAADDGVEPTAD 266

Query: 326 EVFHSYFNTCPHQGSRTIRTEEAIFISL 353
             F  + NT P QGS  +RTEEA+F +L
Sbjct: 267 PGFDLWLNTVPDQGSEVVRTEEALFATL 294


>gi|354610538|ref|ZP_09028494.1| protein of unknown function DUF171 [Halobacterium sp. DL1]
 gi|353195358|gb|EHB60860.1| protein of unknown function DUF171 [Halobacterium sp. DL1]
          Length = 283

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 133/312 (42%), Gaps = 57/312 (18%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV VP S++   +    ATR  G +ARA  +FR D +VVF ++               
Sbjct: 2   TLSVLVPSSLVREAEDKREATRKLGYVARAAAVFRADRLVVFPDEDG------------- 48

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWA 179
               E + G  F+  +L+Y  TP YLRK  F     L +VG+LPP  L +          
Sbjct: 49  ----ERKWGGGFVETVLRYAATPPYLRKEAFDTRDELEYVGVLPPLRLSSWTGSDSSGSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGV------RVTVAMGTNRNLDADS 233
             R+G+  +  +   V   V+ G+   + + Q   PG+      RVT+ + + R + A  
Sbjct: 105 SLRQGIVTQVGSEGRV--RVNCGMQHPISLHQ--PPGMEVVEGERVTIRVSSRRPVRAKL 160

Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
             + +P           G+ V       +   +   +      I TS HG  ++++ L  
Sbjct: 161 VDEPLP-----------GFSVE-----RTTIGDALDRPDAGVRIATSRHGTPLSAASLGG 204

Query: 294 PTFR----HLLIAFGG-------LAGL-EESIEEDDGLKRKNAREVFHSYFNTCPHQGSR 341
              R     L +AFG        + GL E+ + E       +A   F ++ NT P QGS 
Sbjct: 205 YVERIARDGLTVAFGAPERGLPPMLGLTEDEVSEAVADPNSDAPARFDAWLNTIPSQGSE 264

Query: 342 TIRTEEAIFISL 353
            +RTEEA+F SL
Sbjct: 265 VVRTEEALFASL 276


>gi|300710374|ref|YP_003736188.1| hypothetical protein HacjB3_05025 [Halalkalicoccus jeotgali B3]
 gi|448294698|ref|ZP_21484777.1| hypothetical protein C497_03402 [Halalkalicoccus jeotgali B3]
 gi|299124057|gb|ADJ14396.1| hypothetical protein HacjB3_05025 [Halalkalicoccus jeotgali B3]
 gi|445586375|gb|ELY40657.1| hypothetical protein C497_03402 [Halalkalicoccus jeotgali B3]
          Length = 285

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 17/108 (15%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S++   +    ATR  G +ARA T+FR D ++VF                  
Sbjct: 2   TVSILVPSSVVREAEDKREATRKLGYVARAATVFRADRLIVF-----------------P 44

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDA 169
           +R  E   G  F+  +L+Y  TP YLR  ++ M   L +VG+LPPL A
Sbjct: 45  DRDGERRWGGGFVEAVLRYAATPPYLRTEVWGMRDELEYVGVLPPLRA 92



 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 323 NAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +  E F  + NT PHQGS T+RTEEA+F SL
Sbjct: 248 DGSERFDLWLNTIPHQGSETVRTEEAMFASL 278


>gi|159040854|ref|YP_001540106.1| hypothetical protein Cmaq_0268 [Caldivirga maquilingensis IC-167]
 gi|157919689|gb|ABW01116.1| Protein of unknown function DUF171 [Caldivirga maquilingensis
           IC-167]
          Length = 275

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 85/301 (28%), Positives = 135/301 (44%), Gaps = 52/301 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+A+P ++ + + + E A R  G I RA  IFR+  ++++  K                
Sbjct: 3   ISIAIPYNVTEESTTEEEAVRKIGYIGRAAAIFRVRSILIYTFKG--------------- 47

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAP---HHLRKHEWA 179
             DE    A+F+ + L+YL TP YLRK LF +   LR  G+L PL  P   H        
Sbjct: 48  --DEPLRKASFIKKNLEYLVTPPYLRKDLFGIDPDLRLAGLLQPLTLPVFGHRKSNPRTG 105

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQ--VLDPGVRVTV-AMGTNRNLDADSPRQ 236
             R G+ ++     S+   V VG + +V V +   +   + V++ +M +NR         
Sbjct: 106 DVRIGLVIRWEGYYSI---VKVGEDTYVKVPKPYPIKSMLLVSIDSMISNRFYRG----H 158

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTF 296
           VV     K S +Y G    Y+ NI  +      +    +LI T + G+ V +    L   
Sbjct: 159 VV-----KSSKVYAG----YSANIIELRDLVKLR----NLILTGKEGNSVINEINKLRAL 205

Query: 297 R--HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHS--YFNTCPHQGSRTIRTEEAIFIS 352
           +   +++ FG        +  DD LK +   +V +S  + N  P+QG  TIRTEEAI   
Sbjct: 206 KDEDVVVVFG-----SPRMGVDDILKGEGLGDVLNSSLFVNFVPNQGLVTIRTEEAIIAV 260

Query: 353 L 353
           L
Sbjct: 261 L 261


>gi|323304774|gb|EGA58534.1| YGR283C-like protein [Saccharomyces cerevisiae FostersB]
          Length = 297

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 93/217 (42%), Gaps = 25/217 (11%)

Query: 55  DAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
           D    V   S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+
Sbjct: 32  DIESNVVKYSICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SK 84

Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHH 172
                + +RS E  S    L  LLQY  TP  L    F   +   L+     PPL     
Sbjct: 85  DKKHEKKSRSKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPPLKQLPF 144

Query: 173 LRKHEWAPFREGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTN 226
           +       ++EG+++     K  + +++  LV +G N+ + + +Q +    R+TV     
Sbjct: 145 MNASAEQHYKEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARLTV----- 199

Query: 227 RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
                D+ R+ V        G   GY VR A  ++ V
Sbjct: 200 -----DTERKEVVSPIDSYKGKPLGYHVRMASTLNEV 231


>gi|256272682|gb|EEU07659.1| YGR283C-like protein [Saccharomyces cerevisiae JAY291]
          Length = 341

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 91/208 (43%), Gaps = 25/208 (12%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+     + +R
Sbjct: 41  SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 93

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHHLRKHEWAPF 181
           S E  S    L  LLQY  TP  L    F   +   L+     PPL     +       +
Sbjct: 94  SKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPPLKQLPFMNASAEQHY 153

Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPR 235
           +EG+++     K  + +++  LV +G N+ + + +Q +    RVTV          D+ R
Sbjct: 154 KEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----------DTER 203

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSV 263
           + V        G   GY VR A  ++ V
Sbjct: 204 KEVVSPIDSYKGKPLGYHVRMASTLNEV 231


>gi|366998671|ref|XP_003684072.1| hypothetical protein TPHA_0A05640 [Tetrapisispora phaffii CBS 4417]
 gi|357522367|emb|CCE61638.1| hypothetical protein TPHA_0A05640 [Tetrapisispora phaffii CBS 4417]
          Length = 397

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 67/277 (24%), Positives = 109/277 (39%), Gaps = 48/277 (17%)

Query: 38  SCDNKKKKKRKRDQLNDDAPIEVPT-----------VSVAVPGSIIDNTQSLELATRLAG 86
           S    K+KK++      +  ++ PT            S+ +P SI D+ ++LE  T +  
Sbjct: 19  SVTQTKQKKKQNHAAKKEVVLKKPTKSLKVSSKNLNYSICIPTSITDDCKNLEQTTNVIY 78

Query: 87  QIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQY 146
            IA+  TIF + EV++ D    +              + +  S    +  LLQY  TP Y
Sbjct: 79  HIAKTATIFNVGEVIILDLGKETPIEVTEDIKGTKGETKKRLSRPMLIATLLQYFVTPPY 138

Query: 147 LRKALFS--MHSSLRFVGMLPPLDAPHHLRKHE--WAPFREGVTLKENAPNSVGT----- 197
           L   +F    H   ++   LP L     +R +E     FREG+ ++ + P +  +     
Sbjct: 139 LVNTVFKKDYHKYFKYACKLPRLSVLPFMRYYEEDKGRFREGLAVRMSKPEATNSNQSNS 198

Query: 198 ----------LVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPKES 246
                      +++G ++ + +   L P  VRVTV        D    R V P    +  
Sbjct: 199 KKAKEFKQTKYINIGKSECLELKSQLVPVNVRVTV--------DTVEKRVVAP---QEAY 247

Query: 247 GMY------WGYKVRYAPNISSVFKNCSYKGGYDHLI 277
           G Y      +GY VR      SVF   ++  GY   I
Sbjct: 248 GDYVGAQSSFGYHVRVCKKFGSVFTESAFPEGYSQAI 284


>gi|190346247|gb|EDK38287.2| hypothetical protein PGUG_02385 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 355

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 140/329 (42%), Gaps = 68/329 (20%)

Query: 63  VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD---------------- 104
           V++ +P ++I   N ++L+ AT +A QIA+A T++ + E+VV                  
Sbjct: 25  VALCIPSTVISHSNARNLQQATFIAYQIAKAATLYNVSEIVVLSIPDAEAVAEISGPKVV 84

Query: 105 ------NKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALF---SMH 155
                  K   D  + S    + N  + N+  +  L  LLQY  TP YL KA+F      
Sbjct: 85  SAATGGQKIMFDEVANSPKVEQ-NHENRNDDDSLLLATLLQYFVTPPYLVKAVFKDSKFR 143

Query: 156 SSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAPNSV-----GTLVDVGLNKHVVV 209
              ++   LP + + P          FREG+++ +  P  V      +   + + K+V +
Sbjct: 144 KKFKYAEKLPKIPSLPFMANNDVHKDFREGLSVAKKTPKIVKKNKKVSGKKLSVTKYVNI 203

Query: 210 --DQVLD---PGVRVTVAMGTN-RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
             D++L+   P + V V +  + +N    SPR     S  K +   +GY VR + + +S+
Sbjct: 204 GGDELLELAGPDIPVNVRVTVDVKNKKVVSPRDAYGVSGNKST---FGYFVRVSRSFTSI 260

Query: 264 FKNCSYKGGY---------------DHLIGTSEHGDIVNSSD-LTLPTFRHL-----LIA 302
           F   ++  GY               D  I     GDI      L +  ++HL     + A
Sbjct: 261 FTESTFPSGYTKSYYISCDDFYGKPDTGIKKETFGDISEGKILLVMSNWKHLEASFSVEA 320

Query: 303 FGGLAGLEE----SIEEDDGLKRKNAREV 327
           F G+A + E     IE + G++ ++A  V
Sbjct: 321 FEGVAKVSELFDGEIEMEQGIRVEDAVMV 349


>gi|146417374|ref|XP_001484656.1| hypothetical protein PGUG_02385 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 355

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 83/332 (25%), Positives = 140/332 (42%), Gaps = 74/332 (22%)

Query: 63  VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD---------------- 104
           V++ +P ++I   N ++L+ AT +A QIA+A T++ + E+VV                  
Sbjct: 25  VALCIPSTVISHSNARNLQQATFIAYQIAKAATLYNVSEIVVLSIPDAEAVAEISGPKVV 84

Query: 105 ------NKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALF---SMH 155
                  K   D  + S    + N  + N+  +  L  LLQY  TP YL KA+F      
Sbjct: 85  SAATGGQKIMFDEVANSPKVEQ-NHENRNDDDSLLLATLLQYFVTPPYLVKAVFKDSKFR 143

Query: 156 SSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAPNSV-----GTLVDVGLNKHVVV 209
              ++   LP + + P          FREG+++ +  P  V      +   + + K+V +
Sbjct: 144 KKFKYAEKLPKIPSLPFMANNDVHKDFREGLSVAKKTPKIVKKNKKVSGKKLSVTKYVNI 203

Query: 210 --DQVLDPG-------VRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNI 260
             D++L+         VRVTV +   +N    SPR     S  K +   +GY VR + + 
Sbjct: 204 GGDELLELAGPDIPVNVRVTVDV---KNKKVVSPRDAYGVSGNKST---FGYFVRVSRSF 257

Query: 261 SSVFKNCSYKGGY---------------DHLIGTSEHGDIVNSSD-LTLPTFRH-----L 299
           +S+F   ++  GY               D  I     GDI      L +  ++H     L
Sbjct: 258 TSIFTESTFPSGYTKSYYISCDDFYGKPDTGIKKETFGDISEGKILLVMSNWKHLEASFL 317

Query: 300 LIAFGGLAGLEE----SIEEDDGLKRKNAREV 327
           + AF G+A + E     IE + G++ ++A  V
Sbjct: 318 VEAFEGVAKVSELFDGEIEMEQGIRVEDAVMV 349


>gi|118430942|ref|NP_147061.2| hypothetical protein APE_0226.1 [Aeropyrum pernix K1]
 gi|116062268|dbj|BAA79138.2| conserved hypothetical protein [Aeropyrum pernix K1]
          Length = 270

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 75/295 (25%), Positives = 123/295 (41%), Gaps = 45/295 (15%)

Query: 65  VAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRS 124
           +A+P SI+    +L L T  AGQI R   I+R+DE+ VF++  SS   SR      +   
Sbjct: 1   MALPSSILRVEPTLLLKTLKAGQIGRVAAIYRVDEIAVFEDPDSSPRLSRLLQLLLN--- 57

Query: 125 DENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFREG 184
                          Y  TP +L+K +F +   L++  ++PPL  P H    E    + G
Sbjct: 58  ---------------YQATPPHLKKRVFPLRRELKYASLMPPLKIPSHTVPAE---PKVG 99

Query: 185 VTLKENAPNSVG--TLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
             L     +  G    V +G   + V+     PG  VTV +G     +    R V+   K
Sbjct: 100 AILDGYVESCSGKSCRVFLGSMGYGVLRGRARPGDVVTVRIG-----EISGGRIVL--EK 152

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTL---PTFRHL 299
                +Y+G+ V     +     +   +G    ++ T ++G     S   L   P+ + L
Sbjct: 153 ASWGELYYGFSVSRFKRLEEAVTSFRRRG--YRVVATDKNGLCPPESWKILAEIPSAKTL 210

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
           ++ FGG            G+       ++    N  P QG+ T+RTEEA+  +L+
Sbjct: 211 VV-FGG---------PYKGVLEYTNPYLYDGVVNVIPRQGTETVRTEEALLATLE 255


>gi|296242582|ref|YP_003650069.1| hypothetical protein Tagg_0844 [Thermosphaera aggregans DSM 11486]
 gi|296095166|gb|ADG91117.1| Protein of unknown function DUF171 [Thermosphaera aggregans DSM
           11486]
          Length = 272

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 78/305 (25%), Positives = 129/305 (42%), Gaps = 62/305 (20%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           + VAVP SI+    +L L T    Q+AR + IF +  +  + +  +  +  R  A     
Sbjct: 4   IEVAVPTSILRVESTLLLKTLRIHQVARILGIFGVSRIFFYRDFETDPSTHREYAR---- 59

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL-----DAPHHLR--- 174
                      + +  +Y  TP YLR+ L   +  L+ VGMLPP+     D P  L+   
Sbjct: 60  ----------LIRKQWEYFFTPPYLRRRLVPRNPLLKHVGMLPPIRLEWFDVPRRLKPGD 109

Query: 175 KHEWAPFREGVTLK-----ENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNL 229
           +     F+EG   K         N+VG   + GL     V +++DPG ++   +G +   
Sbjct: 110 ERVGYVFKEGGGFKVYVDSSRVFNAVGECRE-GLG----VIRIVDPGEKMAECVGKD--- 161

Query: 230 DADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSS 289
                              Y G + R+A +    FK    +     ++ TS +G +    
Sbjct: 162 ------------------FYKGPEPRFADS----FKQLLEESRGARIVATSRYGRVPGFE 199

Query: 290 DLT-LPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEA 348
           DL+ L +   +LI FG  +     I   +G + +   +V    +NT P Q  +T+RTEEA
Sbjct: 200 DLSALSSSNRVLILFGSPSRGLHDIAGAEGFRLEELGDV----WNTVPGQRVKTVRTEEA 255

Query: 349 IFISL 353
           + I+L
Sbjct: 256 LIITL 260


>gi|254579457|ref|XP_002495714.1| ZYRO0C01276p [Zygosaccharomyces rouxii]
 gi|238938605|emb|CAR26781.1| ZYRO0C01276p [Zygosaccharomyces rouxii]
          Length = 351

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 105/228 (46%), Gaps = 30/228 (13%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P +++ + ++LE  T    Q+A+A T+F + E+VV D  ++S              
Sbjct: 40  SLCIPTTLLSHCKNLEQVTYTVYQVAKAATMFNVGEIVVLDLGTTS-----------GTT 88

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMH--SSLRFVGMLPPLDAPHHLR--KHEWA 179
            ++  S +  +  LLQY  TP YL   +F     S  R    LP L +   +R  + +  
Sbjct: 89  KEKKLSDSMLIASLLQYFVTPPYLVNTVFKQKYLSYFREACKLPRLPSLPFMRYLQADEG 148

Query: 180 PFREGVTLKENAPNSVG---------TLVDVGLNKHVVVDQVLDPG-VRVTVAMGTNRNL 229
            +REG+ ++ + P + G           V+VG ++ + +   L P  VRVTV +   + +
Sbjct: 149 RYREGLAIRMSKPGTQGKKTKEFKQTKYVNVGKSEALELKAQLVPANVRVTVDVVERKVV 208

Query: 230 DADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
              SP++        +S   +GY+VR A N   VF   ++  GY   +
Sbjct: 209 ---SPKEAYGDFVGAQSS--YGYQVRIAKNFGDVFMESAFADGYSQAL 251


>gi|365990481|ref|XP_003672070.1| hypothetical protein NDAI_0I02590 [Naumovozyma dairenensis CBS 421]
 gi|343770844|emb|CCD26827.1| hypothetical protein NDAI_0I02590 [Naumovozyma dairenensis CBS 421]
          Length = 395

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 58/233 (24%), Positives = 101/233 (43%), Gaps = 26/233 (11%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P S+I N  +LE  T    QIA+A T F   E+++ D  S  +         + + 
Sbjct: 60  SLCIPTSVISNCTNLEQITYTIYQIAKAATFFNAGEIIILDLTSPPNTDDTQLKTKKKST 119

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRF--VGMLPPLDAPHHLRKHEWAP- 180
           +  ++S    +  LLQY  TP YL  ++F       F     LP L A   +R  +    
Sbjct: 120 TKLSDS--MLIASLLQYFVTPPYLVNSVFKKQYRQYFHEASKLPRLSALPFMRYLDHGEG 177

Query: 181 -----FREGVTLKENAPNSVGT----------LVDVGLNKHVVVDQVLDP-GVRVTVAMG 224
                +REG+ ++   P  + T           +++G + ++ +   L P  VRVTV   
Sbjct: 178 NDNHRYREGLAIRMTKPGKISTSNKKEFKQTKFINIGKDINLELKSQLVPINVRVTVDTI 237

Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
             R +   SP +        ++   +GY VR A +   +F +C+++ GY   +
Sbjct: 238 EKRVV---SPEEAYGDFVGAQAS--YGYHVRIAKSFGDIFTHCAFRNGYSQSV 285


>gi|259146785|emb|CAY80042.1| EC1118_1G1_6238p [Saccharomyces cerevisiae EC1118]
          Length = 341

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 90/208 (43%), Gaps = 25/208 (12%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+     + +R
Sbjct: 41  SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 93

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHHLRKHEWAPF 181
           S E  S    L  LLQY  TP  L    F   +   L+     P L     +       +
Sbjct: 94  SKETISDCLLLATLLQYFVTPPNLLDTTFKKKTKLYLKCASTFPSLKQLPFMNASAEQHY 153

Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPR 235
           +EG+++     K  + +++  LV +G N+ + + +Q +    RVTV          D+ R
Sbjct: 154 KEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----------DTER 203

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSV 263
           + V        G   GY VR A  ++ V
Sbjct: 204 KEVVSPIDSYKGKPLGYHVRMASTLNEV 231


>gi|260941594|ref|XP_002614963.1| hypothetical protein CLUG_04978 [Clavispora lusitaniae ATCC 42720]
 gi|238851386|gb|EEQ40850.1| hypothetical protein CLUG_04978 [Clavispora lusitaniae ATCC 42720]
          Length = 371

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 133/337 (39%), Gaps = 69/337 (20%)

Query: 63  VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD---------------- 104
           V++ VP S+I   N ++LE  T +A QIA+A +++ + E+VV D                
Sbjct: 25  VALCVPSSVISKSNARNLEQITNIAYQIAKAASVYNVSEIVVLDIPTADEIQKKLEAEAN 84

Query: 105 ----------NKSSSDNYSRSS-----------AANRSNRSDENESGAAFLVRLLQYLET 143
                     NK    N S              AA +    +ENE+ A  L  LLQ+  T
Sbjct: 85  KAVQLEGDKGNKKIKFNISNEEIVAEAKSEEGPAAEKLVEKEENENNALLLATLLQFFVT 144

Query: 144 PQYLRKALF---SMHSSLRFVGMLPPL-DAPHHLRKHEWAPFREGVTLKENAP------N 193
           P YL K +F     +   ++   LP L + P          F+EG+T+ ++ P       
Sbjct: 145 PPYLVKGVFRASKYNKKFKYAEKLPKLSNLPFMNNNGVIEDFKEGLTVPKHTPKIQKKNK 204

Query: 194 SVGTLVDVGLNKHVVVDQV---------LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK 244
            V  L  + + K+V V Q          +   VRVTV +   +N    SP++    +  K
Sbjct: 205 KVSALKKLKVTKYVNVGQSSLLELNGTEVPVNVRVTVDV---KNKKIVSPQEAYGVTGSK 261

Query: 245 ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR-----HL 299
            S   +GY VRYA + SS+F   S+  GY   +          + D  L T        +
Sbjct: 262 AS---FGYFVRYAKSFSSIFTESSFPSGYTESVFVQADDFFGKAKDSKLDTVDKVGKGEV 318

Query: 300 LIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCP 336
           L     L  LE S  ++     +   E+F S     P
Sbjct: 319 LFVVSNLKDLEYSFSQEKIPGVEQVTEMFDSQVKVPP 355


>gi|76803056|ref|YP_331151.1| rpl operon protein [Natronomonas pharaonis DSM 2160]
 gi|76558921|emb|CAI50517.1| conserved hypothetical protein [Natronomonas pharaonis DSM 2160]
          Length = 278

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 130/310 (41%), Gaps = 57/310 (18%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+    +    ATR  G +ARA T+FR+D +VV+ ++   D        +  
Sbjct: 2   TRSVLVPSSLTREAEDKREATRKLGYVARAATVFRVDRLVVYPDRDGDDGRFGDGFVST- 60

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                          +L+Y  TP YLRK ++     L + G+LPPL     +R    +  
Sbjct: 61  ---------------VLEYAATPAYLRKEVWDRRGELEYAGILPPL----RVRSQTGSGS 101

Query: 182 REGVTLKENAPNSVG----TLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADS 233
               +L++     VG      V+ G+   + +       +  G RVT+ + +   + A  
Sbjct: 102 DGSGSLRQGIVTEVGPDGRVRVNCGMQHPISLPVPPSMEVGKGERVTIRVSSREPVRA-- 159

Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLT- 292
             ++V  S P       G+ V  A   +S+ +      G    I  S HG+ ++   L  
Sbjct: 160 --KLVDESPP-------GFAVDCATIEASLARP---DAGVR--IAASRHGETLSVDRLDG 205

Query: 293 ---LPTFRHLLIAFGGLA-GLEESI-----EEDDGLKRKNAREVFHSYFNTCPHQGSRTI 343
                T     +AFG    GL E +     EE  G     AR  F  + NT P+QGS  +
Sbjct: 206 IVGRTTANGFTVAFGAPERGLPEILGCEPREEAIG-DEPEAR--FDLWLNTVPNQGSEVV 262

Query: 344 RTEEAIFISL 353
           RTEEA+F +L
Sbjct: 263 RTEEALFATL 272


>gi|190406716|gb|EDV09983.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
          Length = 341

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 92/217 (42%), Gaps = 25/217 (11%)

Query: 55  DAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSR 114
           D    V   S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+
Sbjct: 32  DIESNVVKYSICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SK 84

Query: 115 SSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHH 172
                + +RS E  S    L  LLQY  TP  L    F   +   L+     P L     
Sbjct: 85  DKKHEKKSRSKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPSLKQLPF 144

Query: 173 LRKHEWAPFREGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTN 226
           +       ++EG+++     K  + +++  LV +G N+ + + +Q +    RVTV     
Sbjct: 145 MNASAEQHYKEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----- 199

Query: 227 RNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
                D+ R+ V        G   GY VR A  ++ V
Sbjct: 200 -----DTERKEVVSPIDSYKGKPLGYHVRMASTLNEV 231


>gi|323354916|gb|EGA86749.1| YGR283C-like protein [Saccharomyces cerevisiae VL3]
          Length = 311

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 90/208 (43%), Gaps = 25/208 (12%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+     + +R
Sbjct: 11  SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 63

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHHLRKHEWAPF 181
           S E  S    L  LLQY  TP  L    F   +   L+     P L     +       +
Sbjct: 64  SKETISDCLLLATLLQYFVTPPNLLDTTFKKKNKLYLKCASTFPSLKQLPFMNASAEQHY 123

Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPR 235
           +EG+++     K  + +++  LV +G N+ + + +Q +    RVTV          D+ R
Sbjct: 124 KEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----------DTER 173

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSV 263
           + V        G   GY VR A  ++ V
Sbjct: 174 KEVVSPIDSYKGKPLGYHVRMASTLNEV 201


>gi|344305174|gb|EGW35406.1| hypothetical protein SPAPADRAFT_58623 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 379

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 85/362 (23%), Positives = 139/362 (38%), Gaps = 82/362 (22%)

Query: 62  TVSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSS---------- 109
           ++S+ +P S+I   N  +L+  T +  QIA+A TI+ + E+VV D               
Sbjct: 25  SISICIPTSVISQKNAYNLQQITTIVYQIAKACTIYNVSEIVVLDVPEKKEEEPEPTVEV 84

Query: 110 ---------------DNYSRSSAANRSNRSDENESGAAFLV-RLLQYLETPQYLRKALF- 152
                          D+ S+   A +  + +  E   A L+  LLQ+  TP YL K +F 
Sbjct: 85  VSGDKGGKKLKFNFDDDLSKDVPAKKEIQEETTEDDKAILLASLLQFFITPPYLVKTMFS 144

Query: 153 --------SMHSSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAP--------NSV 195
                   S+ +  ++   LP +   P       +  F+EG+ + +  P          V
Sbjct: 145 PALNPNFKSILAKFKYAFKLPKITTLPFMNNNQTYKDFKEGIIIPKETPKVRRKTSVKKV 204

Query: 196 GTLVDVGLNKHVVVD--QVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESG--MYWG 251
                V ++K+V +   Q L+  ++  + + +   +D  +   V P       G    +G
Sbjct: 205 KAANKVTVSKYVNIGEAQALELNIKREIPINSRVTVDLKNKTIVSPLQAYGIVGHKAAFG 264

Query: 252 YKVRYAPNISSVFKNCSYKGGY--------DHLIGTSEHGDIVNSSDLT-LPTFR----- 297
           Y VR     S +F  C    GY        D   G++E       SDLT LP        
Sbjct: 265 YHVRVVKQFSKLFTECPIPEGYSSSVYVNCDVYFGSNEK-----ISDLTNLPQLEVNSAD 319

Query: 298 --HLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT----EEAIFI 351
             ++L+  G    L+ S  EDD    +    V   +       G  TI T    E+A+ I
Sbjct: 320 KNNVLVVVGNYKYLQTSF-EDDKTNLEGVESVSQMF------DGKLTIPTGVKIEDAVLI 372

Query: 352 SL 353
           SL
Sbjct: 373 SL 374


>gi|2425174|dbj|BAA22268.1| unnamed protein product [Halobacterium salinarum]
          Length = 226

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 52/106 (49%), Gaps = 17/106 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S++   +    ATR  G +ARA  +FRID V+VF ++               
Sbjct: 15  TRSVLVPSSLVREAEDKREATRKLGYVARAAAVFRIDRVLVFPDEDG------------- 61

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
               E + G  F+  +L+Y  TP YLRK  F     L + G+LPPL
Sbjct: 62  ----ERQWGGGFVETVLRYAATPPYLRKEAFDTRDELAYAGVLPPL 103


>gi|323337420|gb|EGA78671.1| YGR283C-like protein [Saccharomyces cerevisiae Vin13]
          Length = 311

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 90/208 (43%), Gaps = 25/208 (12%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+     + +R
Sbjct: 11  SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 63

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAPHHLRKHEWAPF 181
           S E  S    L  LLQY  TP  L    F   +   L+     P L     +       +
Sbjct: 64  SKETISDCLLLATLLQYFVTPPNLLDTTFXKKNKLYLKCASTFPSLKQLPFMNASAEQHY 123

Query: 182 REGVTL-----KENAPNSVGTLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNRNLDADSPR 235
           +EG+++     K  + +++  LV +G N+ + + +Q +    RVTV          D+ R
Sbjct: 124 KEGLSIARDSSKGKSDDALTNLVYIGKNQIITLSNQNIPNTARVTV----------DTER 173

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSV 263
           + V        G   GY VR A  ++ V
Sbjct: 174 KEVVSPIDSYKGKPLGYHVRMASTLNEV 201


>gi|448099375|ref|XP_004199133.1| Piso0_002543 [Millerozyma farinosa CBS 7064]
 gi|359380555|emb|CCE82796.1| Piso0_002543 [Millerozyma farinosa CBS 7064]
          Length = 380

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 73/296 (24%), Positives = 121/296 (40%), Gaps = 72/296 (24%)

Query: 48  KRDQLNDDAPIEVPT-------VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRID 98
           K+ +   + P +VP+        ++ +P SII   N ++LE  T +A Q+A+A TI+ + 
Sbjct: 3   KKGEAKKEVPTKVPSKTSPSRRFALCIPSSIISASNARNLEQITYVAYQVAKAATIYNVV 62

Query: 99  EVVVFD------------------NKSSSDNYSRSSAAN------------RSNRSDENE 128
           EVV+ D                    SSSD   +    N            + +++D++ 
Sbjct: 63  EVVILDIPSPEEQERLAEQEATTVVSSSSDKGGKKIKFNLNDKDAVETPGEKDSQADDHT 122

Query: 129 SGA----------AFLVRLLQYLETPQYLRKALFS---MHSSLRFVGMLPPLDA-PHHLR 174
           S +               LLQ+  TP YL KA+F      S L++   LP +   P    
Sbjct: 123 SSSNQHVADSNNGLLFATLLQFFMTPPYLVKAVFGGSEFSSKLKYARKLPKISTLPFMSN 182

Query: 175 KHEWAPFREGVTLKENAPNSVGT-------------LVDVGLNKHVVVDQVLDPGVRVTV 221
            + +A F+EG+T+ + A                    V++G +  +V+   +   VRVTV
Sbjct: 183 NNVFADFKEGLTVPKKALKIKKKNKKPVSRKFNSTKYVNIGSDAPLVLSSEVPVNVRVTV 242

Query: 222 AMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
            +   +N    SP Q         +   +G+ VR+    S+VF  CS   GY   I
Sbjct: 243 DI---KNKKVVSPAQAYGVVGAHSA---FGFHVRFCKKFSAVFTECSIPAGYTQSI 292


>gi|110668742|ref|YP_658553.1| rpl operon protein [Haloquadratum walsbyi DSM 16790]
 gi|109626489|emb|CAJ52950.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
          Length = 303

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 17/106 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV +P S++   +    ATR  G IAR+ T+FR+D ++VF                  
Sbjct: 2   TLSVLIPSSVVREAEDKREATRKLGYIARSATVFRVDHLIVF-----------------P 44

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
           +R  E   G  F+  +L+Y  TP +LRK ++     L +VG+LPPL
Sbjct: 45  DRGGERRLGGEFVETVLRYAATPPHLRKKIWGRRDELEYVGVLPPL 90


>gi|385804242|ref|YP_005840642.1| hypothetical protein Hqrw_3231 [Haloquadratum walsbyi C23]
 gi|339729734|emb|CCC41012.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
          Length = 303

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 17/106 (16%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV +P S++   +    ATR  G IAR+ T+FR+D ++VF                  
Sbjct: 2   TLSVLIPSSVVREAEDKREATRKLGYIARSATVFRVDHLIVF-----------------P 44

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
           +R  E   G  F+  +L+Y  TP +LRK ++     L +VG+LPPL
Sbjct: 45  DRGGERRLGGEFVETVLRYAATPPHLRKKIWGRRDELEYVGVLPPL 90


>gi|6323970|ref|NP_014041.1| hypothetical protein YMR310C [Saccharomyces cerevisiae S288c]
 gi|2497224|sp|Q04867.1|YM91_YEAST RecName: Full=Uncharacterized protein YMR310C
 gi|984683|emb|CAA90828.1| unknown [Saccharomyces cerevisiae]
 gi|285814316|tpg|DAA10211.1| TPA: hypothetical protein YMR310C [Saccharomyces cerevisiae S288c]
 gi|392297485|gb|EIW08585.1| hypothetical protein CENPK1137D_355 [Saccharomyces cerevisiae
           CEN.PK113-7D]
          Length = 317

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 129/302 (42%), Gaps = 47/302 (15%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P +++ + ++L   T    Q+A+  ++F + EVV+ ++ S  D             
Sbjct: 21  SLCIPTTLVSDCRNLSQITHKVYQVAKFASLFNVSEVVILEDNSQVDA------------ 68

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
           + +  S A  ++ LLQY  TP YLR  +F+      L     LP L      R  K +  
Sbjct: 69  TKKKISTAKLILALLQYFVTPPYLRNTVFNEKFRPYLTAASKLPRLSTLPFTRYQKQDHG 128

Query: 180 PFREGVTLKENAPN----SVGTL------VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
            +REG+T+K   P      +G +      +++G +K + +   L P   RVT+   T   
Sbjct: 129 RYREGLTIKMQKPTLARKKIGKVFKQTKYINIGKSKALALQSQLVPINARVTIDTIT--- 185

Query: 229 LDADSPRQVVPPSKP--KESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIG---TSE 281
                 R++V P +     +G+   +GY  R A + + +F     K GY   +    T+ 
Sbjct: 186 ------RKIVSPQEAYGDFTGLDSQYGYYTRIASSFTDLFMKGPLKEGYTQSVYVPLTTR 239

Query: 282 HGDIVNSSDL-TLPTFRHLLIAFGGLAGLEES--IEEDDGLKRKNAREVFHSYFNTCPHQ 338
              I   S L T  T  H+L+ F     L  +  +++D  +  +  +E F +    CP  
Sbjct: 240 DTSIPELSSLPTAETNPHILLVFSTWDTLARAFKLDQDQFVDCQGPQEFFDAQLP-CPVS 298

Query: 339 GS 340
            S
Sbjct: 299 NS 300


>gi|124028156|ref|YP_001013476.1| hypothetical protein Hbut_1301 [Hyperthermus butylicus DSM 5456]
 gi|123978850|gb|ABM81131.1| conserved archaeal protein [Hyperthermus butylicus DSM 5456]
          Length = 280

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 79/307 (25%), Positives = 134/307 (43%), Gaps = 52/307 (16%)

Query: 66  AVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSD 125
           AVP S       L   T   G + R   ++R+DE +V+ ++ +   Y R +   R     
Sbjct: 15  AVPASYTSTEGDLRHKTIKVGLLGRLFAVYRVDEAIVYIDRPA---YWRDAELMR----- 66

Query: 126 ENESGAAFLVRLLQYLETPQYLRKALFSMH-SSLRFVGMLPPLDAPHH-LRKHEWAPFRE 183
                     ++L+Y+ T  YLRK ++  +   LR+  +LPPL  P H +        R+
Sbjct: 67  ----------KILEYMVTAPYLRKKIYPPNVPELRYASLLPPLQLPTHGVGGPVDGEVRQ 116

Query: 184 GVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKP 243
            + L+ +   +   LV+ GL + V V      G R+ V +           R +V P + 
Sbjct: 117 ALVLRASGGRA---LVEAGLEEPVWVRGRFHRGQRILVRI-------VSVERGIVEPVE- 165

Query: 244 KESGMYWGYKVRYAPNIS---SVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRH-- 298
            +  +Y G+KV     +S   S ++N +        I TS  G +V++  L+    R   
Sbjct: 166 -DELVYNGFKVTIVEGLSRLLSKYRNYTR-------IATSRKGRLVDAELLSQIGSRSSS 217

Query: 299 ---LLIAFGGL-AGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISLQ 354
              +L+ FG    GL E + E + LK ++   +     NT P QG++T+R EEA+  +L 
Sbjct: 218 GPGVLVLFGAPDLGLYE-LAELENLKLED---LVDYVVNTIPQQGTKTVRVEEAVAATLA 273

Query: 355 YFQEPIS 361
                +S
Sbjct: 274 LLNLVLS 280


>gi|255711846|ref|XP_002552206.1| KLTH0B09636p [Lachancea thermotolerans]
 gi|238933584|emb|CAR21768.1| KLTH0B09636p [Lachancea thermotolerans CBS 6340]
          Length = 389

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 68/258 (26%), Positives = 110/258 (42%), Gaps = 44/258 (17%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P S+IDN ++LE AT +  QIA++  +F + E+VV D         R+    + +R
Sbjct: 62  SLCIPVSVIDNCKNLEQATHVVYQIAKSAVLFNVGEIVVLD-------LGRAEDVPKESR 114

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLRK-HE-WA 179
                S    +  LLQY  TP YL K++F        +    LP + A   +R  HE   
Sbjct: 115 HRFQMSPVLLVASLLQYFVTPPYLVKSVFKKEYQECFKVAAKLPRITALPFMRHLHEDNG 174

Query: 180 PFREGVTLKENAPNSVG-----------TLVDVGLNKHVVV-DQVLDPGVRVTVAMGTNR 227
            +REG+ ++   P                 V+VG +  + +  Q++   VRVTV +    
Sbjct: 175 RYREGLAVRMQRPAQGARPAKKKAYDQTKYVNVGKDSALELRGQLVPANVRVTVDL---- 230

Query: 228 NLDADSPRQVVPPSKPK----ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHG 283
            +D     +VV P++       +   +G+ VR A   + +F       GY   +      
Sbjct: 231 -IDC----KVVSPAEAYGDFVGAQASYGFHVRVAKTFADIFTQSPAPNGYTQTVW----- 280

Query: 284 DIVNSSDLTL-PTFRHLL 300
             VNS D    PT +  L
Sbjct: 281 --VNSGDFYFDPTTKKSL 296


>gi|256272178|gb|EEU07175.1| YMR310C-like protein [Saccharomyces cerevisiae JAY291]
 gi|349580602|dbj|GAA25762.1| K7_Ymr310cp [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 317

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 73/298 (24%), Positives = 127/298 (42%), Gaps = 39/298 (13%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P +++ + ++L   T    Q+A+  ++F + EVV+ ++ S  D             
Sbjct: 21  SLCIPTTLVSDCRNLSQITHKVYQVAKFASLFNVSEVVILEDNSQVDA------------ 68

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
           + +  S A  ++ LLQY  TP YLR  +F+      L     LP L      R  K +  
Sbjct: 69  TKKKISTAKLILALLQYFVTPPYLRNTVFNEKFRPYLTAASKLPRLSTLPFTRYQKQDHG 128

Query: 180 PFREGVTLKENAPN----SVGTL------VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
            +REG+T+K   P       G +      +++G +K + +   L P   RVT+   T + 
Sbjct: 129 RYREGLTIKMQKPTLARKKTGKVFKQTKYINIGKSKALALQSQLVPINARVTIDTITRKI 188

Query: 229 LDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIG---TSEHGDI 285
           +   SP++        ES   +GY  R A + + +F     K GY   +    T+    I
Sbjct: 189 V---SPQEAYGDFTGLESQ--YGYYTRIASSFTDLFMKGPLKEGYTQSVYVPLTTRDTSI 243

Query: 286 VNSSDL-TLPTFRHLLIAFGGLAGLEES--IEEDDGLKRKNAREVFHSYFNTCPHQGS 340
              S L T  T  H+L+ F     L  +  +++D  +  +  +E F +    CP   S
Sbjct: 244 PELSSLPTAETNPHILLVFSTWDTLARAFKLDQDQFVDCQGPQEFFDAQLP-CPVSNS 300


>gi|151946019|gb|EDN64251.1| conserved protein [Saccharomyces cerevisiae YJM789]
 gi|190408536|gb|EDV11801.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
 gi|259148902|emb|CAY82147.1| EC1118_1M3_5292p [Saccharomyces cerevisiae EC1118]
 gi|323332207|gb|EGA73618.1| YMR310C-like protein [Saccharomyces cerevisiae AWRI796]
 gi|323336010|gb|EGA77286.1| YMR310C-like protein [Saccharomyces cerevisiae Vin13]
 gi|323346996|gb|EGA81273.1| YMR310C-like protein [Saccharomyces cerevisiae Lalvin QA23]
 gi|323353230|gb|EGA85530.1| YMR310C-like protein [Saccharomyces cerevisiae VL3]
 gi|365764004|gb|EHN05530.1| YMR310C-like protein [Saccharomyces cerevisiae x Saccharomyces
           kudriavzevii VIN7]
          Length = 317

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 128/302 (42%), Gaps = 47/302 (15%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P +++ + ++L   T    Q+A+  ++F + EVV+ ++ S  D             
Sbjct: 21  SLCIPTTLVSDCRNLSQITHKVYQVAKFASLFNVSEVVILEDNSQVDA------------ 68

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
           + +  S A  ++ LLQY  TP YLR  +F+      L     LP L      R  K +  
Sbjct: 69  TKKKISTAKLILALLQYFVTPPYLRNTVFNEKFRPYLTAASKLPRLSTLPFTRYQKQDHG 128

Query: 180 PFREGVTLKENAPN----SVGTL------VDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
            +REG+T+K   P       G +      +++G +K + +   L P   RVT+   T   
Sbjct: 129 RYREGLTIKMQKPTLARKKTGKVFKQTKYINIGKSKALALQSQLVPINARVTIDTIT--- 185

Query: 229 LDADSPRQVVPPSKP--KESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLIG---TSE 281
                 R++V P +     +G+   +GY  R A + + +F     K GY   +    T+ 
Sbjct: 186 ------RKIVSPQEAYGDFTGLDSQYGYYTRIASSFTDLFMKGPLKEGYTQSVYVPLTTR 239

Query: 282 HGDIVNSSDL-TLPTFRHLLIAFGGLAGLEES--IEEDDGLKRKNAREVFHSYFNTCPHQ 338
              I   S L T  T  H+L+ F     L  +  +++D  +  +  +E F +    CP  
Sbjct: 240 DTSIPELSSLPTAETNPHILLVFSTWDTLARAFKLDQDQFVDCQGPQEFFDAQLP-CPVS 298

Query: 339 GS 340
            S
Sbjct: 299 NS 300


>gi|399577345|ref|ZP_10771098.1| hypothetical protein HSB1_31370 [Halogranum salarium B-1]
 gi|399237728|gb|EJN58659.1| hypothetical protein HSB1_31370 [Halogranum salarium B-1]
          Length = 276

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 128/304 (42%), Gaps = 52/304 (17%)

Query: 67  VPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNRSDE 126
           +P S++   +    ATR  G +ARA T+FR D +VVF                  +R  E
Sbjct: 1   MPSSLVREAEDKREATRKLGYVARAATVFRADRLVVF-----------------PDREGE 43

Query: 127 NESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLD-APHHLRKHEWA-PFREG 184
           +  G  F+  +L+Y  TP YLR+  + +   L +VG+LPPL  +P    + + +    +G
Sbjct: 44  SRWGGEFVTTVLEYAVTPPYLRQEAWDVRDELEYVGVLPPLRVSPQTGSESDGSGSLTQG 103

Query: 185 VTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
           +  +      V   V+ G+   + +       +  G RVTV + +   +     R  +  
Sbjct: 104 IVTEVGPEGRV--RVNCGMQHPISLITPDGMSVGEGERVTVRISSREPV-----RARIVD 156

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR--- 297
             P       GY V       ++ ++ +        I TS +G+ ++ S L+    R   
Sbjct: 157 EAPS------GYVVERMDLAEALTRDDA-----GVRIATSRYGEKLSVSRLSSLVGRLDD 205

Query: 298 HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFN----TCPHQGSRTIRTEEAI 349
            + + FG    GL  + +   ED       A     S F+    T P QGS  +RTEEA+
Sbjct: 206 GMTVVFGSPGRGLPDILDVAVEDLPTSTGGASVGPDSGFDLWLNTIPRQGSEVVRTEEAM 265

Query: 350 FISL 353
           F SL
Sbjct: 266 FASL 269


>gi|448578202|ref|ZP_21643637.1| hypothetical protein C455_11768 [Haloferax larsenii JCM 13917]
 gi|448592778|ref|ZP_21651825.1| hypothetical protein C453_14938 [Haloferax elongans ATCC BAA-1513]
 gi|445726743|gb|ELZ78359.1| hypothetical protein C455_11768 [Haloferax larsenii JCM 13917]
 gi|445730804|gb|ELZ82391.1| hypothetical protein C453_14938 [Haloferax elongans ATCC BAA-1513]
          Length = 280

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 130/311 (41%), Gaps = 58/311 (18%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T+SV VP S++   +    ATR  G +ARA  +FR DE+V+F ++               
Sbjct: 2   TLSVLVPSSLVREAEDQREATRKLGYVARAAAVFRADELVIFPDEDG------------- 48

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWA 179
               EN+ G  F+  +L+Y  TP YLRK ++     LR+ G+LPP  + +       E  
Sbjct: 49  ----ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVSSTTADDSDELP 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGV------RVTVAMGTNRNLDADS 233
             REG+  +    + V   V+ G+   + +   + PG+      RV V + +   + A  
Sbjct: 105 ALREGIVTEVGPDDRV--RVNCGMQHPISL--FVPPGMDATEGERVAVRISSREPVRARI 160

Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDL 291
             + +P           G+ V       S        G  D    I TS +G+ ++   L
Sbjct: 161 VDEPLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPRL 202

Query: 292 TLPTFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRT 342
              T R      + + FG    GL  +     ED      +    F  + NT P QGS  
Sbjct: 203 GELTARIADADGMTVVFGSPGRGLPDMLGMTPEDVADVEPSVGPGFDLWLNTIPRQGSDV 262

Query: 343 IRTEEAIFISL 353
           +RTEEA+F SL
Sbjct: 263 VRTEEAMFASL 273


>gi|126274487|ref|XP_001387555.1| hypothetical protein PICST_5340 [Scheffersomyces stipitis CBS 6054]
 gi|126213425|gb|EAZ63532.1| hypothetical protein PICST_5340, partial [Scheffersomyces stipitis
           CBS 6054]
          Length = 368

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 72/290 (24%), Positives = 116/290 (40%), Gaps = 69/290 (23%)

Query: 63  VSVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSD---------- 110
           VSV +P ++I   N ++LE  T +  QIARA TI+ + E+++ D   S+           
Sbjct: 12  VSVCIPSTVISSKNARNLEQITSIVNQIARAATIYNVVEIIILDIPESAAEDENQKVVEL 71

Query: 111 -----------NYS--------RSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKAL 151
                      N+S        + + A  S    ++ + A  L  LLQ+  TP YL K +
Sbjct: 72  GGSKGGKKLKFNFSDEEILKDKQETVAQTSEDVQDSSANAFLLAGLLQFFVTPPYLVKTI 131

Query: 152 FS-----------MHSSLRFVGMLPPLDAPHHLRKHE-WAPFREGVTLKENAPN------ 193
           FS           M    ++   LP +     +  +E +  F+EG T+ +  P       
Sbjct: 132 FSPAINPSPLNKDMLKKFKYAYKLPKITTLPFMSNNEVFRDFKEGFTVPKETPKVVSKKD 191

Query: 194 -----------SVGTLVDVGLNK--HVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
                      SV   V++G  +   + + + +    RVTV +   +N    SP Q    
Sbjct: 192 KSKKVKAEKKISVTKYVNIGEAELLELNIKREIPAYSRVTVDL---KNKTIVSPLQAYGV 248

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSD 290
              K S   +GY VR+    SS+F   S   GY   I   +  D  +S+D
Sbjct: 249 MGNKSS---FGYYVRFCKKFSSIFTESSAPEGYSSSICV-QSDDFYSSAD 294


>gi|389860646|ref|YP_006362886.1| hypothetical protein TCELL_0323 [Thermogladius cellulolyticus 1633]
 gi|388525550|gb|AFK50748.1| hypothetical protein TCELL_0323 [Thermogladius cellulolyticus 1633]
          Length = 274

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 74/296 (25%), Positives = 131/296 (44%), Gaps = 42/296 (14%)

Query: 63  VSVAVPGSIIDNTQSLELATRL-AGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           V V +P S++   +   L+  L A QI R  +IF +++ V F + S+S+           
Sbjct: 5   VHVVLPSSVL-TVEPPGLSRLLKAYQILRYTSIFGVEKAVFFKDFSTSER---------- 53

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDA-PHHLRKHEWAP 180
               ++ +  AF+ ++  Y +TP YLR+ L  +   LR VG+LPPL     H+ K  +  
Sbjct: 54  ----DHGTARAFIEKVWNYYKTPPYLRRKLIPIDRDLRGVGILPPLRLESFHVSKRPYTG 109

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
                 +K +A    G   D+G+ +   V+     GVR    +        D  ++VV  
Sbjct: 110 QVRPALVKRSAD---GCFADIGVGEDFAVEGECREGVRQVRVV--------DPVKKVV-- 156

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN-SSDLTLPTFRHL 299
            +  E+ +Y G +  +  ++  V +  +   G   L+ T   G + +  +D    + + +
Sbjct: 157 -ELTETKLYTGPETAFKNSLREVLEEYA---GKSFLLATDRRGRVPSLVTDAKRMSGKKV 212

Query: 300 LIAFGGLAG--LEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           ++  G  +    E S+ E       N R      +NT P Q   T+RTEEA+ I+L
Sbjct: 213 VVLLGSPSHDLFEMSVAEG-----FNLRMFVDDVWNTIPGQKVVTVRTEEALIITL 263


>gi|448452122|ref|ZP_21593142.1| hypothetical protein C470_10435 [Halorubrum litoreum JCM 13561]
 gi|445809766|gb|EMA59804.1| hypothetical protein C470_10435 [Halorubrum litoreum JCM 13561]
          Length = 288

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 128/323 (39%), Gaps = 80/323 (24%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF                  +
Sbjct: 7   LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 49

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
           R  +   G  ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 50  REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP+E     G++V  A ++S          G    + TS HG+ ++
Sbjct: 154 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGVAVATSRHGESLS 201

Query: 288 SS---DLTLPTFR--HLLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
            S   DL     +     +AFG     E  + E  GL     R              F  
Sbjct: 202 VSRLGDLVSDARKAGSYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 258

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + NT P Q S  +RTEEA+F +L
Sbjct: 259 WLNTIPAQASEVVRTEEALFATL 281


>gi|156937544|ref|YP_001435340.1| hypothetical protein Igni_0751 [Ignicoccus hospitalis KIN4/I]
 gi|156566528|gb|ABU81933.1| conserved hypothetical protein [Ignicoccus hospitalis KIN4/I]
          Length = 276

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/224 (28%), Positives = 95/224 (42%), Gaps = 21/224 (9%)

Query: 130 GAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAP---HHLRKHEWAPFREGVT 186
           G+  L+ +L+YL  P YLRK +  +   L++ G+LPPL         +  E   F +   
Sbjct: 54  GSDVLMSVLEYLSYPTYLRKLIVPLKPELKYAGVLPPLAVKALNEGFKDREEGLFFKFGL 113

Query: 187 LKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKES 246
           + E    S+ T +DVG     V   V      V V +G N   D     +V+P  +    
Sbjct: 114 IVECKRGSIAT-IDVG-EPERVTHNVKKCKKNVLVLVGFN---DKKRVVKVLPAKRGIWR 168

Query: 247 GMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFRHLLIAFGGL 306
           G Y G++V Y  NI  + +   YKG     +GTS  G         L +  ++ + FG  
Sbjct: 169 GEYLGFEVNYFDNIYELVE--FYKGAGFKALGTSRRGTWPGKLRDYLGS--NVGVLFG-- 222

Query: 307 AGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
                    D GL  K       +  N  P QG RT+R EEA++
Sbjct: 223 -------SPDKGLLDKYPDLELDALVNLFPCQGVRTVRLEEAVW 259


>gi|448426508|ref|ZP_21583357.1| hypothetical protein C473_10123 [Halorubrum terrestre JCM 10247]
 gi|448484333|ref|ZP_21606040.1| hypothetical protein C462_11693 [Halorubrum arcis JCM 13916]
 gi|445679388|gb|ELZ31855.1| hypothetical protein C473_10123 [Halorubrum terrestre JCM 10247]
 gi|445820108|gb|EMA69937.1| hypothetical protein C462_11693 [Halorubrum arcis JCM 13916]
          Length = 288

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 128/323 (39%), Gaps = 80/323 (24%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF                  +
Sbjct: 7   LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 49

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
           R  +   G  ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 50  REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP+E     G++V  A ++S          G    + TS HG+ ++
Sbjct: 154 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGVAVATSRHGESLS 201

Query: 288 SS---DLTLPTFRH--LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
            S   DL     +     +AFG     E  + E  GL     R              F  
Sbjct: 202 VSRLGDLVSDARKAGGYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 258

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + NT P Q S  +RTEEA+F +L
Sbjct: 259 WLNTIPAQASEVVRTEEALFATL 281


>gi|448489439|ref|ZP_21607662.1| hypothetical protein C463_03472 [Halorubrum californiensis DSM
           19288]
 gi|445694811|gb|ELZ46929.1| hypothetical protein C463_03472 [Halorubrum californiensis DSM
           19288]
          Length = 288

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 128/323 (39%), Gaps = 80/323 (24%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF                  +
Sbjct: 7   LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 49

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
           R  +   G  ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 50  REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPNGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP+E     G++V  A ++S          G    + TS HG+ ++
Sbjct: 154 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGLAVATSRHGESLS 201

Query: 288 SSDL--TLPTFRH---LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
            S L   +   R      +AFG     E  + E  GL     R              F  
Sbjct: 202 VSRLGDLVSDAREAGGYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 258

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + NT P Q S  +RTEEA+F +L
Sbjct: 259 WLNTIPAQASEVVRTEEALFATL 281


>gi|448606194|ref|ZP_21658773.1| hypothetical protein C441_12325 [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|445739611|gb|ELZ91118.1| hypothetical protein C441_12325 [Haloferax sulfurifontis ATCC
           BAA-897]
          Length = 280

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 83/304 (27%), Positives = 130/304 (42%), Gaps = 46/304 (15%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++                
Sbjct: 3   LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
              EN+ G  F+  +L+Y  TP YLRK ++     LR+ G+LPP L A       + +P 
Sbjct: 49  ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
            REG+  +    + V   V+ G+   + +   + PG+ +T        + +  P +    
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISL--FVPPGMDLTEGERVAVRISSREPVRARIV 161

Query: 241 SKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLPTFR- 297
            +P+      G+ V       S        G  D    I TS +G+ ++   L   T R 
Sbjct: 162 DEPRP-----GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDLTARI 209

Query: 298 ----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAI 349
                + + FG    GL  +     ED      +    F  + NT P QGS  +RTEEA+
Sbjct: 210 TDAGGMTVVFGSPGRGLPDMLGMPPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRTEEAM 269

Query: 350 FISL 353
           F SL
Sbjct: 270 FASL 273


>gi|410083202|ref|XP_003959179.1| hypothetical protein KAFR_0I02650 [Kazachstania africana CBS 2517]
 gi|372465769|emb|CCF60044.1| hypothetical protein KAFR_0I02650 [Kazachstania africana CBS 2517]
          Length = 351

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 113/257 (43%), Gaps = 43/257 (16%)

Query: 40  DNKKKKKRKRDQLNDDAPIEVPTVSV----AVPGSIIDNTQSLELATRLAGQIARAVTIF 95
           D+ K KK          PI+V + S+     +P SI+++  +L+  T    QIA+A T+F
Sbjct: 15  DDLKIKKPNLPLKKSKKPIKVLSKSINYSLCIPTSILNSCSNLDQITHTLYQIAKAATLF 74

Query: 96  RIDEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMH 155
            + E+V+ D  S +              S+   + +  +  LLQY  TP YL K++F   
Sbjct: 75  NVAEIVILDQGSRN--------------SESKITDSMLMASLLQYFVTPPYLIKSVFKKQ 120

Query: 156 --SSLRFVGMLP-PLDAPH--HLRKHEWAPFREGVTLKENAPNSVGTL-----VDVGLNK 205
                +    LP  L+ P   +L  +E   +REG++++   P++         +++G  +
Sbjct: 121 YLGYFKIASKLPRILNLPFMRYLEANE-GRYREGLSIRMTHPDAKKEFKQTKYINIGKAE 179

Query: 206 HVVVDQVLDP-GVRVTVAMGTNRNLDADSPRQVVPPSKPK----ESGMYWGYKVRYAPNI 260
            + +   L P  VRVTV +           ++VV P +       +   +GY VR A + 
Sbjct: 180 TLELKSQLVPTNVRVTVDI---------VEKKVVSPIEAYGDFVGANSSYGYHVRIAKSF 230

Query: 261 SSVFKNCSYKGGYDHLI 277
             +F   ++  GY   I
Sbjct: 231 GDIFTESAFSKGYSQAI 247


>gi|401839560|gb|EJT42732.1| YGR283C-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 341

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 92/219 (42%), Gaps = 25/219 (11%)

Query: 53  NDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNY 112
           N D    V   S+ +P S IDN  +LE  T  A QIA+   +F I E++V D        
Sbjct: 30  NMDIESAVAQYSICIPTSAIDNCFNLEQTTFTAYQIAKTAVLFNIQEIIVLD-------L 82

Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAP 170
           S+     +  RS E  S    L  LLQY  TP  L    F   +   L+     P L   
Sbjct: 83  SKDRRHGKRPRSKETISDCLLLATLLQYFVTPSNLLDITFKRKNMLYLKHAFAFPKLHQL 142

Query: 171 HHLRKHEWAPFREGVTL-KENAPNSV----GTLVDVGLNKHV-VVDQVLDPGVRVTVAMG 224
             + +     +REG+++ ++NA   +      LV +G  + + + +Q +   VRVTV + 
Sbjct: 143 AFMNESAEHIYREGLSIAQDNAEKEIERDQTNLVYIGKGEIIELSNQNIPRTVRVTVDL- 201

Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
             +  +  SPR V             GY VR A  +  V
Sbjct: 202 --QRKEVVSPRDVY-------KNQTLGYHVRVASTLDEV 231


>gi|389848018|ref|YP_006350257.1| rpl operon protein [Haloferax mediterranei ATCC 33500]
 gi|448618115|ref|ZP_21666460.1| rpl operon protein [Haloferax mediterranei ATCC 33500]
 gi|388245324|gb|AFK20270.1| rpl operon protein [Haloferax mediterranei ATCC 33500]
 gi|445747670|gb|ELZ99125.1| rpl operon protein [Haloferax mediterranei ATCC 33500]
          Length = 280

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 132/309 (42%), Gaps = 56/309 (18%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++                
Sbjct: 3   LSVLVPTSLVREAEDQREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWAP 180
              EN+ G  F+  +L+Y  TP YLRK ++     LR+ G+LPP  + +       E   
Sbjct: 49  ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVSSTTAGDSDETPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGV------RVTVAMGTNRNLDADSP 234
            REG+  +    + V   V+ G+   + +   + PG+      RV V + +   + A   
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISL--FIPPGMDIAKGERVAVRISSREPVRARIV 161

Query: 235 RQVVPPSKPKESGMYWGYKVRYAPNISSVFK-NCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
            + +P           G+ V       ++ + +   +      I TS +G+ ++   L  
Sbjct: 162 DEPLP-----------GFDVSRMDLTEALGRPDAGVR------IATSRYGESLSVPRLGD 204

Query: 294 PTFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIR 344
            T R      + +AFG    GL  +     ED      +    F  + NT P QGS  +R
Sbjct: 205 LTARITDAGGMTVAFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVR 264

Query: 345 TEEAIFISL 353
           TEEA+F SL
Sbjct: 265 TEEAMFASL 273


>gi|448103215|ref|XP_004199984.1| Piso0_002543 [Millerozyma farinosa CBS 7064]
 gi|359381406|emb|CCE81865.1| Piso0_002543 [Millerozyma farinosa CBS 7064]
          Length = 380

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 70/273 (25%), Positives = 111/273 (40%), Gaps = 65/273 (23%)

Query: 64  SVAVPGSIID--NTQSLELATRLAGQIARAVTIFRIDEVVVFD----------------- 104
           ++ +P SII   N ++LE  T +A Q+A+A TI+ + EVV+ D                 
Sbjct: 26  ALCIPSSIISASNAKNLEQITYVAYQVAKAATIYNVVEVVILDIPSPEEQERLEEREATK 85

Query: 105 -NKSSSDNYSR-------------------SSAANRSNRSDE---NESGAAFLVRLLQYL 141
              SSSD   +                   S A + ++RS +   + +       LLQ+ 
Sbjct: 86  VVSSSSDKGGKKIKFNLNDKEAIEPPREKDSQADDYTSRSSQHVADPNNGLLFATLLQFF 145

Query: 142 ETPQYLRKALFS---MHSSLRFVGMLPPLDA-PHHLRKHEWAPFREGVTLKENAPNSVGT 197
            TP YL KA+F      S L++   LP +   P     + ++ F+EG+T+ + A      
Sbjct: 146 MTPPYLVKAVFGSSEFLSKLKYARKLPKISTLPFMGNNNVFSDFKEGLTVPKKALKIKKK 205

Query: 198 -------------LVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSKPK 244
                         V++G +  +V+   +   VRVTV +   +N    SP Q        
Sbjct: 206 NKKPVSRKFNSTKYVNIGSDAPLVLSSEVPVNVRVTVDI---KNKKVVSPAQAYGVVGAH 262

Query: 245 ESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
            +   +GY VR     S+VF  CS   GY   I
Sbjct: 263 SA---FGYHVRLCKKFSAVFTECSIPAGYTQSI 292


>gi|292656686|ref|YP_003536583.1| hypothetical protein HVO_2565 [Haloferax volcanii DS2]
 gi|448290690|ref|ZP_21481836.1| hypothetical protein C498_08115 [Haloferax volcanii DS2]
 gi|291372324|gb|ADE04551.1| conserved protein [Haloferax volcanii DS2]
 gi|445578061|gb|ELY32476.1| hypothetical protein C498_08115 [Haloferax volcanii DS2]
          Length = 280

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 84/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++   +N+          
Sbjct: 3   LSVLVPSSLVREAEDQREATRKLGYVARAAAVFRANELVVFPDEDGENNW---------- 52

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
                  G  F+  +L+Y  TP YLRK ++     LR+ G+LPP L A       + +P 
Sbjct: 53  -------GGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
            REG+  +    + V   V+ G+   + +       L  G RV V + +   + A    +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
            +P           G+ V       S        G  D    I TS +G+ ++   L   
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205

Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
           T R      + + FG    GL  +     ED      +    F  + NT P QGS  +RT
Sbjct: 206 TARITDAGGMTVVFGSPGRGLPDMLGMPPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265

Query: 346 EEAIFISL 353
           EEA+F SL
Sbjct: 266 EEAMFASL 273


>gi|429217689|ref|YP_007175679.1| hypothetical protein Calag_1531 [Caldisphaera lagunensis DSM 15908]
 gi|429134218|gb|AFZ71230.1| hypothetical protein Calag_1531 [Caldisphaera lagunensis DSM 15908]
          Length = 276

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 83/330 (25%), Positives = 140/330 (42%), Gaps = 72/330 (21%)

Query: 38  SCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRI 97
           SC+    KKR +  L     + +P   ++V  S+ID T         AG I+R ++I+++
Sbjct: 2   SCNEFPPKKRNKKLL-----VLLPISVLSVEDSLIDKTYK-------AGFISRILSIYKV 49

Query: 98  DEVVVFDNKSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
           DEV+++ +             + SN  DE      FL  +L Y   P +++K +      
Sbjct: 50  DEVILYRD-------------DESNSFDE-----KFLQLILNYQIIPPHIKKKIIKKSKL 91

Query: 158 LRFVGMLPPLDAPHH--LRKHEWAPFREGVTLKENAPNSVGTLVDVG---LNKHVVVDQV 212
           L++ G+LPPL   +H   +K E     +G   K +    +  L  +G   L+K  V D++
Sbjct: 92  LKYAGILPPLRLVNHDPPKKAEIGKILDGYVEKCDENYCIVYLGMIGNGYLDKKEVDDKI 151

Query: 213 LDPGVRVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNIS-SVFKNCSYKG 271
           +   ++              S +  +   K     +Y GYKV    N+    FK+  Y  
Sbjct: 152 ITVKIK--------------SIKNKIMLEKASWGNLYTGYKVSKIKNLDIKKFKDEGY-- 195

Query: 272 GYDHLIGTSEHGDIVNSS-----DLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNARE 326
               ++GTS+ G+  N       D  L   + ++  FGG              +  N +E
Sbjct: 196 ---LIVGTSKLGECFNDKIKEEIDNKLRNSKGIVFVFGGPHA-----------ELYNEKE 241

Query: 327 VFHSYFNTCPHQGSRTIRTEEAIFISLQYF 356
            F    NT  +QG+ T+RTEEA+  +L  F
Sbjct: 242 -FDYILNTVLNQGTLTVRTEEALASTLAIF 270


>gi|448569069|ref|ZP_21638481.1| hypothetical protein C456_04136 [Haloferax lucentense DSM 14919]
 gi|448600512|ref|ZP_21656008.1| hypothetical protein C452_16704 [Haloferax alexandrinus JCM 10717]
 gi|445725219|gb|ELZ76844.1| hypothetical protein C456_04136 [Haloferax lucentense DSM 14919]
 gi|445735229|gb|ELZ86781.1| hypothetical protein C452_16704 [Haloferax alexandrinus JCM 10717]
          Length = 280

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 84/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++   +N+          
Sbjct: 3   LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDGENNW---------- 52

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
                  G  F+  +L+Y  TP YLRK ++     LR+ G+LPP L A       + +P 
Sbjct: 53  -------GGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
            REG+  +    + V   V+ G+   + +       L  G RV V + +   + A    +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
            +P           G+ V       S        G  D    I TS +G+ ++   L   
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205

Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
           T R      + + FG    GL  +     ED      +    F  + NT P QGS  +RT
Sbjct: 206 TARITDAGGMTVVFGSPGRGLPDMLGMPPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265

Query: 346 EEAIFISL 353
           EEA+F SL
Sbjct: 266 EEAMFASL 273


>gi|448517688|ref|ZP_21617262.1| hypothetical protein C466_01182 [Halorubrum distributum JCM 10118]
 gi|445705903|gb|ELZ57791.1| hypothetical protein C466_01182 [Halorubrum distributum JCM 10118]
          Length = 288

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 84/323 (26%), Positives = 127/323 (39%), Gaps = 80/323 (24%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF                  +
Sbjct: 7   LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 49

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
           R  +   G  ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 50  REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP+E     G++V  A ++S          G    + TS HG+ ++
Sbjct: 154 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGVAVATSRHGESLS 201

Query: 288 SS---DLTLPTFRH--LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
            S   DL     +     + FG     E  + E  GL     R              F  
Sbjct: 202 VSRLGDLVSDARKAGGYTVVFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 258

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + NT P Q S  +RTEEA+F +L
Sbjct: 259 WLNTIPAQASEVVRTEEALFATL 281


>gi|448508629|ref|ZP_21615580.1| hypothetical protein C465_08536 [Halorubrum distributum JCM 9100]
 gi|445697255|gb|ELZ49325.1| hypothetical protein C465_08536 [Halorubrum distributum JCM 9100]
          Length = 300

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 84/323 (26%), Positives = 127/323 (39%), Gaps = 80/323 (24%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF                  +
Sbjct: 19  LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 61

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
           R  +   G  ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 62  REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 111

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 112 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 165

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP+E     G++V  A ++S          G    + TS HG+ ++
Sbjct: 166 RVSSREPVRARIAGKPEE-----GFQV-VAADLSEALA------GDGVAVATSRHGESLS 213

Query: 288 SS---DLTLPTFRH--LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
            S   DL     +     + FG     E  + E  GL     R              F  
Sbjct: 214 VSRLGDLVSDARKAGGYTVVFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 270

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + NT P Q S  +RTEEA+F +L
Sbjct: 271 WLNTIPAQASEVVRTEEALFATL 293


>gi|448565514|ref|ZP_21636381.1| hypothetical protein C457_13294 [Haloferax prahovense DSM 18310]
 gi|445715258|gb|ELZ67014.1| hypothetical protein C457_13294 [Haloferax prahovense DSM 18310]
          Length = 280

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++                
Sbjct: 3   LSVLVPSSLVREAEDQREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
              EN+ G  F+  +L+Y  TP YLRK ++     LR+ G+LPP L A       + +P 
Sbjct: 49  ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
            REG+  +    + V   V+ G+   + +       L  G RV V + +   + A    +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
            +P           G+ V       S        G  D    I TS +G+ ++   L   
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205

Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
           T R      + + FG    GL  +     ED      +    F  + NT P QGS  +RT
Sbjct: 206 TARITDVGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265

Query: 346 EEAIFISL 353
           EEA+F SL
Sbjct: 266 EEAMFASL 273


>gi|448583469|ref|ZP_21646825.1| hypothetical protein C454_09616 [Haloferax gibbonsii ATCC 33959]
 gi|445729698|gb|ELZ81293.1| hypothetical protein C454_09616 [Haloferax gibbonsii ATCC 33959]
          Length = 280

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++                
Sbjct: 3   LSVLVPSSLVREAEDQREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
              EN+ G  F+  +L+Y  TP YLRK ++     LR+ G+LPP L A       + +P 
Sbjct: 49  ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
            REG+  +    + V   V+ G+   + +       L  G RV V + +   + A    +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
            +P           G+ V       S        G  D    I TS +G+ ++   L   
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205

Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
           T R      + + FG    GL  +     ED      +    F  + NT P QGS  +RT
Sbjct: 206 TARITDAGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265

Query: 346 EEAIFISL 353
           EEA+F SL
Sbjct: 266 EEAMFASL 273


>gi|401626168|gb|EJS44127.1| YMR310C [Saccharomyces arboricola H-6]
          Length = 319

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 71/298 (23%), Positives = 124/298 (41%), Gaps = 45/298 (15%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P +++ +  +L   T    Q+A+  ++F + E+V+ +N    D++ ++ A      
Sbjct: 21  SLCIPTTLVSDCCNLSQITHKIYQVAKFASLFNVSELVILEN----DSHVQTGA------ 70

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
           S +  S A  ++ LLQY  TP YL   +F+      L     LP L      R  K +  
Sbjct: 71  SKKKISTAKLILALLQYFVTPPYLCNTVFNEKFKPYLIAASKLPRLSTLPFTRYQKQDHG 130

Query: 180 PFREGVTLKENAPNSV----------GTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
            +REG+T+K   P                +++G  + + +   L P   RVT+   T   
Sbjct: 131 RYREGLTIKMQKPTLTRKKTGKEFKQTKYINIGKPEALALQSQLVPINARVTIDTIT--- 187

Query: 229 LDADSPRQVVPPSKP----KESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI----GTS 280
                 R++V P +       +   +GY  R A + + +F     K GY   I     T 
Sbjct: 188 ------RKIVSPQEAYGDFTGTDSLYGYYTRIASSFTDLFMKSPLKEGYTQSIYVPLATR 241

Query: 281 EHGDIVNSSDLTLPTFRHLLIAFGGLAGLEESIE--EDDGLKRKNAREVFHSYFNTCP 336
           E      SS   + T  ++L+ F     LE + E  +D  L+ +  +E F +    CP
Sbjct: 242 ETNIPALSSLPAIGTPSNILLVFSAWDTLERAFELDKDQFLECQGPQEFFDAQLP-CP 298


>gi|401838346|gb|EJT42026.1| YMR310C-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 317

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 58/233 (24%), Positives = 101/233 (43%), Gaps = 40/233 (17%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P +++ + ++L   T     +A+  ++F + E+V+ +     DN  RS+       
Sbjct: 21  SLCIPSTLVADCRNLSQITHKVYHVAKVASLFNVSELVILE-----DNPQRST------- 68

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
           S +  S A  ++ LLQY  TP YLR  +F+      L     LP L      R  K +  
Sbjct: 69  SKKKISNAKLILALLQYFVTPPYLRNTVFNEKFRPYLVAASKLPRLSTLPFTRYEKQDHG 128

Query: 180 PFREGVTLKENAPNSV----------GTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
            +REG+T+K   P S              +++G  + + +   L P   RVT+   T   
Sbjct: 129 RYREGLTIKMQKPTSARKKTGKEFKQTKYINIGKPEALALQNQLVPINARVTIDTIT--- 185

Query: 229 LDADSPRQVVPPSKP--KESGM--YWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
                 R++V P +     +G+  ++GY  R A + + +F     K GY   I
Sbjct: 186 ------RKIVSPQEAYGDFTGLDSHYGYHTRIASSFTDLFMKGPLKEGYTQSI 232


>gi|448623079|ref|ZP_21669728.1| hypothetical protein C438_11873 [Haloferax denitrificans ATCC
           35960]
 gi|445753587|gb|EMA05004.1| hypothetical protein C438_11873 [Haloferax denitrificans ATCC
           35960]
          Length = 280

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 129/308 (41%), Gaps = 54/308 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++                
Sbjct: 3   LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDG-------------- 48

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
              EN+ G  F+  +L+Y  TP YLRK ++     LR+ G+LPP L A       + +P 
Sbjct: 49  ---ENKWGGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPVLVASTTADDSDESPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
            REG+  +    + V   V+ G+   + +       L  G RV V + +   + A    +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
            +P           G+ V       S        G  D    I TS +G+ ++   L   
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205

Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
           T R      + + FG    GL  +     ED      +    F  + NT P QGS  +RT
Sbjct: 206 TARITDAGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265

Query: 346 EEAIFISL 353
           EEA+F SL
Sbjct: 266 EEAMFASL 273


>gi|365760478|gb|EHN02195.1| YGR283C-like protein [Saccharomyces cerevisiae x Saccharomyces
           kudriavzevii VIN7]
          Length = 343

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 91/219 (41%), Gaps = 25/219 (11%)

Query: 53  NDDAPIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNY 112
           N D    V   S+ +P S IDN  +LE  T  A QIA+   +F I E++V D        
Sbjct: 30  NMDIESAVAQYSICIPTSAIDNCFNLEQTTFTAYQIAKTAVLFNIQEIIVLD-------L 82

Query: 113 SRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSS--LRFVGMLPPLDAP 170
           S+     +  RS E  S    L  LLQY  TP  L    F   +   L+     P L   
Sbjct: 83  SKDRRHGKRPRSKETISDCLLLATLLQYFVTPSNLLDITFKRKNMLYLKHAFAFPKLHQL 142

Query: 171 HHLRKHEWAPFREGVTL-KENAPNSV----GTLVDVGLNKHV-VVDQVLDPGVRVTVAMG 224
             + +     +REG+++ ++NA   +      LV +G  + + + +Q +   VRVTV + 
Sbjct: 143 AFMNESAEHIYREGLSIAQDNAEKEIERDQTNLVYIGKGEIIELSNQNIPRTVRVTVDLQ 202

Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSV 263
                +  SPR V             GY VR A  +  V
Sbjct: 203 RK---EVVSPRDVY-------KNQTLGYHVRVASTLDEV 231


>gi|448545987|ref|ZP_21626314.1| hypothetical protein C460_16402 [Haloferax sp. ATCC BAA-646]
 gi|448548061|ref|ZP_21627405.1| hypothetical protein C459_03800 [Haloferax sp. ATCC BAA-645]
 gi|448557084|ref|ZP_21632519.1| hypothetical protein C458_11715 [Haloferax sp. ATCC BAA-644]
 gi|445703333|gb|ELZ55264.1| hypothetical protein C460_16402 [Haloferax sp. ATCC BAA-646]
 gi|445714763|gb|ELZ66521.1| hypothetical protein C459_03800 [Haloferax sp. ATCC BAA-645]
 gi|445714953|gb|ELZ66710.1| hypothetical protein C458_11715 [Haloferax sp. ATCC BAA-644]
          Length = 280

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 84/308 (27%), Positives = 128/308 (41%), Gaps = 54/308 (17%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++   +N+          
Sbjct: 3   LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDGENNW---------- 52

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP-LDAPHHLRKHEWAP- 180
                  G  F+  +L+Y  TP YLRK  +     LR+ G+LPP L A       + +P 
Sbjct: 53  -------GGEFVETVLRYAATPPYLRKEAWGKRDELRYAGILPPVLVASTTADDSDESPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPRQ 236
            REG+  +    + V   V+ G+   + +       L  G RV V + +   + A    +
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPISLFVPPGMDLTEGERVAVRISSREPVRARIVDE 163

Query: 237 VVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLP 294
            +P           G+ V       S        G  D    I TS +G+ ++   L   
Sbjct: 164 PLP-----------GFDV-------SRMDLTEALGRPDAGVRIATSRYGEPLSVPTLGDL 205

Query: 295 TFR-----HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 345
           T R      + + FG    GL  +     ED      +    F  + NT P QGS  +RT
Sbjct: 206 TARITEAGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSEVVRT 265

Query: 346 EEAIFISL 353
           EEA+F SL
Sbjct: 266 EEAMFASL 273


>gi|448611748|ref|ZP_21662178.1| rpl operon protein [Haloferax mucosum ATCC BAA-1512]
 gi|445742509|gb|ELZ94003.1| rpl operon protein [Haloferax mucosum ATCC BAA-1512]
          Length = 280

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 127/303 (41%), Gaps = 44/303 (14%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR DE+VVF ++                
Sbjct: 3   LSVLVPTSLVREAEDQREATRKLGYVARAAAVFRADELVVFPDEDG-------------- 48

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPP--LDAPHHLRKHEWAP 180
              EN+ G  F+  +L+Y  TP YLRK  +     LR+ G+LPP  + +       E   
Sbjct: 49  ---ENKWGGEFVETVLRYAATPPYLRKEAWGKRDELRYAGILPPVLVSSTTAGDSDETPA 105

Query: 181 FREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPP 240
            REG+  +    + V   V+ G+   V +     PG+ V+        + +  P +    
Sbjct: 106 LREGIVTEVGPDDRV--RVNCGMQHPVSL--FTPPGMNVSKGERVAVRISSREPVRARIV 161

Query: 241 SKPKESGMYWGYKVRYAPNISSVFK-NCSYKGGYDHLIGTSEHGDIVNSSDLTLPTFR-- 297
            +P       G+ V       ++ + +   +      I TS +G+ ++   L   T R  
Sbjct: 162 DEPLP-----GFDVSRMDLTEALGRPDAGVR------IATSRYGESLSVPRLGDLTARIT 210

Query: 298 ---HLLIAFG----GLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIF 350
               + + FG    GL  +     ED      +    F  + NT P QGS  +RTEEA+F
Sbjct: 211 DAGGMTVVFGSPGRGLPDMLGMSPEDVADVEPSVGPGFDLWLNTIPRQGSDVVRTEEAMF 270

Query: 351 ISL 353
            SL
Sbjct: 271 ASL 273


>gi|365758876|gb|EHN00699.1| YMR310C-like protein [Saccharomyces cerevisiae x Saccharomyces
           kudriavzevii VIN7]
          Length = 317

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/233 (24%), Positives = 99/233 (42%), Gaps = 40/233 (17%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P +++ + ++L   T     +A+  ++F + E+V+ +     DN  RS+       
Sbjct: 21  SLCIPSTLVADCRNLSQITHKVYHVAKVASLFNVSELVILE-----DNPPRST------- 68

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--KHEWA 179
           S +  S A  ++ LLQY  TP YLR  +F+      L     LP L      R  K +  
Sbjct: 69  SKKKISNAKLILALLQYFVTPPYLRNTVFNEKFRPYLVAASKLPRLSTLPFTRYEKQDHG 128

Query: 180 PFREGVTLKENAPNSV----------GTLVDVGLNKHVVVDQVLDP-GVRVTVAMGTNRN 228
            +REG+T+K   P S              +++G  + + +   L P   RVT+   T   
Sbjct: 129 RYREGLTIKMQKPTSARKKTGKEFKQTKYINIGKPEALALQNQLVPINARVTIDTIT--- 185

Query: 229 LDADSPRQVVPPSKPKESGM----YWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
                 R++V P +     +    ++GY  R A + + +F     K GY   I
Sbjct: 186 ------RKIVSPQEAYGDFIGLDSHYGYHTRIASSFTDLFMKGPLKEGYTQSI 232


>gi|402586177|gb|EJW80115.1| hypothetical protein WUBG_08978, partial [Wuchereria bancrofti]
          Length = 67

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 29/55 (52%), Positives = 35/55 (63%)

Query: 299 LLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
           +LI FGGL GLE +IE D  +      ++F  Y N  P QGSR IRTEEAI I+L
Sbjct: 2   ILIVFGGLEGLETAIEADKNINCLTPEKLFEHYLNIVPGQGSRIIRTEEAIPITL 56


>gi|222480858|ref|YP_002567095.1| hypothetical protein Hlac_2450 [Halorubrum lacusprofundi ATCC
           49239]
 gi|222453760|gb|ACM58025.1| Protein of unknown function DUF171 [Halorubrum lacusprofundi ATCC
           49239]
          Length = 288

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 86/320 (26%), Positives = 132/320 (41%), Gaps = 74/320 (23%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF ++                
Sbjct: 7   LTICVPSSVVREAEDAREATRKLGYVARAAAVFRADRLVVFPDREGERRRG--------- 57

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
                     ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 58  --------GEYVRTVLGYAATPPELRKDLWGERDELRYVGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP+E     G++V  A ++S          G    + TS HG+ ++
Sbjct: 154 RVSSREPVRARITGKPEE-----GFQVVEA-DLSEAL------AGDGLAVATSRHGEELS 201

Query: 288 SSDL--TLPTFRH---LLIAFGGLA-GLEESIE-EDDGLKRKNA--REV-----FHSYFN 333
            S L   +   R      +AFG    GL E +    DG++   A  R V     F  + N
Sbjct: 202 VSRLGDIVSDARKAGGYTVAFGSPERGLPEMLGLSPDGIRAAVADGRSVEPDPGFDLWLN 261

Query: 334 TCPHQGSRTIRTEEAIFISL 353
           T P Q S  +RTEEA+F++L
Sbjct: 262 TIPAQASEVVRTEEALFVTL 281


>gi|448460078|ref|ZP_21596998.1| hypothetical protein C469_14736 [Halorubrum lipolyticum DSM 21995]
 gi|445807796|gb|EMA57877.1| hypothetical protein C469_14736 [Halorubrum lipolyticum DSM 21995]
          Length = 288

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 86/321 (26%), Positives = 132/321 (41%), Gaps = 76/321 (23%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF ++                
Sbjct: 7   LTICVPSSVVREAEDAREATRKLGYVARAAAVFRADRLVVFPDREGERRRG--------- 57

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
                     ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 58  --------GEYVRTVLGYAATPPELRKDLWGERDELRYVGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHL-IGTSEHGDIV 286
            + +  P +     KP+E     G++V        V  + S     D L + TS HG+ +
Sbjct: 154 RVSSREPVRARITGKPEE-----GFQV--------VGADLSEALAGDGLAVATSRHGEEL 200

Query: 287 NSSDL--TLPTFRH---LLIAFGGLA-GLEESIE-EDDGLKR--KNAREV-----FHSYF 332
           + S L   +   R      +AFG    GL E +    DG++    + R V     F  + 
Sbjct: 201 SVSRLGGIVSDAREAGGYTVAFGSPERGLPEMLGLSPDGIRAAVADGRSVEPDPGFDLWL 260

Query: 333 NTCPHQGSRTIRTEEAIFISL 353
           NT P Q S  +RTEEA+F++L
Sbjct: 261 NTIPAQASEVVRTEEALFVTL 281


>gi|392299539|gb|EIW10633.1| hypothetical protein CENPK1137D_3251 [Saccharomyces cerevisiae
           CEN.PK113-7D]
          Length = 129

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 7/94 (7%)

Query: 64  SVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSNR 123
           S+ +P ++IDN  +LE  T  A QIAR   +F + E++V D        S+     + +R
Sbjct: 41  SICIPTTVIDNCNNLEQVTFTAYQIARTAVLFNVQEIIVLDQ-------SKDKKHEKKSR 93

Query: 124 SDENESGAAFLVRLLQYLETPQYLRKALFSMHSS 157
           S E  S    L  LLQY  TP  L    F   +S
Sbjct: 94  SKETISDCLLLATLLQYFVTPPNLLDTTFKKKTS 127


>gi|448475311|ref|ZP_21603029.1| hypothetical protein C461_11528 [Halorubrum aidingense JCM 13560]
 gi|445816782|gb|EMA66669.1| hypothetical protein C461_11528 [Halorubrum aidingense JCM 13560]
          Length = 288

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 87/321 (27%), Positives = 132/321 (41%), Gaps = 76/321 (23%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF ++                
Sbjct: 7   LTICVPSSVVREAEDAREATRKLGYVARAAAVFRADRLVVFPDREGERRRG--------- 57

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
                     ++  +L Y  TP  LRK L+     LR+ G+LPPL            P+R
Sbjct: 58  --------GEYVRTVLGYAATPPELRKDLWGERDELRYAGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHL-IGTSEHGDIV 286
            + +  P +     KP+E     G++V        V  + S     D L + TS HG+ +
Sbjct: 154 RVSSREPVRARITGKPEE-----GFQV--------VGADLSEALAGDGLAVATSRHGEEL 200

Query: 287 NSS---DLTLPTFRH--LLIAFGGLA-GLEESIE-EDDGLKRKNA--REV-----FHSYF 332
           + S   DL     +    +IAFG    GL E +    DG++   A  R V     F  + 
Sbjct: 201 SVSRLGDLVSDARKAGGYVIAFGAPERGLPEMLGLSPDGIRAAVADGRPVEPDPGFDLWL 260

Query: 333 NTCPHQGSRTIRTEEAIFISL 353
           NT P Q S  +RTEEA+F++L
Sbjct: 261 NTIPAQSSEVVRTEEALFVTL 281


>gi|1181552|gb|AAA86857.1| ribosomal protein [Haloarcula marismortui]
          Length = 214

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 83/184 (45%), Gaps = 32/184 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           T SV VP S+    +    ATR  G +ARA  ++R+D + V+ +   +  +         
Sbjct: 2   TTSVLVPSSLAREAEDRREATRKLGYVARAAAVYRVDRLTVYPDPDGAGKW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPF 181
                 E G  F+  +L+Y  TP +LRK ++     L +VG+LPPL     +R    +  
Sbjct: 53  ------EDG--FVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPL----RVRSQTGSGS 100

Query: 182 REGVTLKENAPNSVGTLVDVGLN---KH-----VVVDQVLDPGVRVTVAMGTNRNLDA-- 231
               +L++     VG    V +N   +H     V  D  ++ G RVTV + + R + A  
Sbjct: 101 EGSGSLRQGIVTEVGADGRVRVNCGMQHPISLPVPADMDVEQGERVTVRVSSRRPVRAKL 160

Query: 232 -DSP 234
            D+P
Sbjct: 161 VDAP 164


>gi|448436251|ref|ZP_21587052.1| hypothetical protein C472_12365 [Halorubrum tebenquichense DSM
           14210]
 gi|445682919|gb|ELZ35329.1| hypothetical protein C472_12365 [Halorubrum tebenquichense DSM
           14210]
          Length = 300

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 126/323 (39%), Gaps = 80/323 (24%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF +                 
Sbjct: 19  LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVFPDSEG-------------- 64

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
              +   G  ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 65  ---DGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 111

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 112 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLGEHPISLLVPSGMEVEQGERVTI------ 165

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP+      G++V  A ++S          G    + TS HG+ ++
Sbjct: 166 RVSSREPVRARITGKPEA-----GFQV-VAADLSEALA------GDGLAVATSRHGEELS 213

Query: 288 SSDL--TLPTFRH---LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
            S L   +   R      +AFG     E  + E  GL     R              F  
Sbjct: 214 VSRLGDLVSDAREAGGYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 270

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + NT P Q S  +RTEEA+F +L
Sbjct: 271 WLNTIPAQASEVVRTEEALFATL 293


>gi|448503713|ref|ZP_21613342.1| hypothetical protein C464_14720 [Halorubrum coriense DSM 10284]
 gi|445691914|gb|ELZ44097.1| hypothetical protein C464_14720 [Halorubrum coriense DSM 10284]
          Length = 300

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 83/317 (26%), Positives = 125/317 (39%), Gaps = 68/317 (21%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF                  +
Sbjct: 19  LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVF-----------------PD 61

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
           R  +   G  ++  +L Y  TP  LRK L+     LR+VG+LPPL            P+R
Sbjct: 62  REGDGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPLR----------VPWR 111

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVD--------QVLDP-GVRVTVAMGTNRNLDADS 233
            G T       + G + +VG    V V+         +L P G+ V         + +  
Sbjct: 112 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVVRGERVTIRVSSRE 171

Query: 234 PRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL-- 291
           P +     KP+      G++V  A ++S          G    + TS HG  ++ S L  
Sbjct: 172 PVRARITGKPEV-----GFQV-AAADLSEALA------GDGLAVATSRHGGELSVSRLGD 219

Query: 292 TLPTFRH---LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHSYFNTCP 336
            +   R      +AFG     E  + E  GL     R              F  + NT P
Sbjct: 220 LVSDAREAGGYTVAFG---APERGLPEMLGLSPDRIRSAVADGRPVEPDPGFDLWLNTIP 276

Query: 337 HQGSRTIRTEEAIFISL 353
            Q S  +RTEEA+F +L
Sbjct: 277 AQASEVVRTEEALFATL 293


>gi|448730376|ref|ZP_21712684.1| hypothetical protein C449_11338 [Halococcus saccharolyticus DSM
           5350]
 gi|445793544|gb|EMA44116.1| hypothetical protein C449_11338 [Halococcus saccharolyticus DSM
           5350]
          Length = 280

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 84/315 (26%), Positives = 130/315 (41%), Gaps = 54/315 (17%)

Query: 62  TVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRS 121
           TVS+ VP S+         AT   G +ARA T+FR+D VVVF +      +         
Sbjct: 2   TVSLLVPSSLCREAADKREATHKVGLVARAATVFRVDRVVVFPDPDGDRRW--------- 52

Query: 122 NRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHL--RKHEWA 179
                   G  F+  +++Y  T  YLRK  +     L  VG+LPPL A            
Sbjct: 53  --------GGGFVSTVMEYAATAPYLRKEAWGTRDELEQVGVLPPLRAVSRTGSESQGSG 104

Query: 180 PFREGVTLKENAPNSVGTLVDVGLNKHVVV----DQVLDPGVRVTVAMGTNRNLDADSPR 235
             R+G+ + E  P+     V+ GL   + +    D  LD G RVTV + +   + A    
Sbjct: 105 SSRQGI-VTEVGPDGR-VRVNCGLQHPISLVDPTDVGLDEGERVTVRISSREPVRARIVD 162

Query: 236 QVVPPSKPKESGMYWGYKVRYAPNISSVF--KNCSYKGGYDHLIGTSEHGDIVNSSDLTL 293
           + +P           G+ V  A ++S+    ++   +      I TS HG  + +  L +
Sbjct: 163 EPLP-----------GFVVERA-DLSAALGREDAGLR------IATSRHGQALTTERLGV 204

Query: 294 PTFR---HLLIAFGGLAGLEESIEEDDGLKRKNAREV------FHSYFNTCPHQGSRTIR 344
              R    + +AFG       ++   D     +A E       F  + +T P+QGS  +R
Sbjct: 205 LAGRVEDGMTVAFGAPERGLPAMLGIDTASVASADETGSDPGGFDRWLDTVPNQGSEVVR 264

Query: 345 TEEAIFISLQYFQEP 359
           TEEA+F +L     P
Sbjct: 265 TEEAVFATLACLTLP 279


>gi|50310397|ref|XP_455218.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49644354|emb|CAG97926.1| KLLA0F03058p [Kluyveromyces lactis]
          Length = 409

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 78/322 (24%), Positives = 129/322 (40%), Gaps = 76/322 (23%)

Query: 29  LSLANGDSSSCDNKKKKKR------KRDQL------NDDAPIEVPTV--SVAVPGSIIDN 74
           +S+      S  N KK KR      KRD L           +   TV  S+ +P +++D+
Sbjct: 1   MSVKRSKEQSKQNYKKVKRDTKVTEKRDHLAQFKRAKKVVKVSTKTVNHSICIPTTVLDS 60

Query: 75  TQSLELATRLAGQIARAVTIFRIDEVVVFD-----NKSSSD--NYSRSSA------ANRS 121
            ++LE  T    QIAR+  I+ + E+V+       NK  SD  N  +  A       N +
Sbjct: 61  CKNLEQITYTLYQIARSACIYNVAEIVILKAESETNKGESDRKNDKKQGAKIKFDDGNNT 120

Query: 122 NRS-------DENESGA------AFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPP 166
           N +        E E G         +  LLQ+  TP YL  ++F         +   LP 
Sbjct: 121 NNASTEYLEDSEKEDGKKRLSKPMLMASLLQFFITPPYLVNSVFKKDYMKYFTYAKQLPR 180

Query: 167 LDAPHHLRKHE--WAPFREGVTLKENAPNSVG---------TLVDVGLNKHVVVDQVLDP 215
           + +   +R ++     +REG+ ++ + P   G           +++G +K++ +   L P
Sbjct: 181 IPSLPFMRFYQDNNGRYREGLAIRMSKPGERGKSKKSFDQTKYINIGKDKNLELKGQLVP 240

Query: 216 -GVRVTVAMGTNRNLDADSPRQ-VVPPSKPK----ESGMYWGYKVRYAPNISSVFKNCSY 269
             VRVTV          D+  Q VV P +       +   +GY VR A + + +F    +
Sbjct: 241 INVRVTV----------DTIEQKVVSPEEAYGDFVGAKASFGYHVRVAKSFADLFTQSPF 290

Query: 270 KGGYDHLIGTSEHGDIVNSSDL 291
             GY   +        +NS D 
Sbjct: 291 PQGYTQTVW-------INSGDF 305


>gi|448535709|ref|ZP_21622229.1| hypothetical protein C467_10476 [Halorubrum hochstenium ATCC
           700873]
 gi|445703210|gb|ELZ55145.1| hypothetical protein C467_10476 [Halorubrum hochstenium ATCC
           700873]
          Length = 300

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/323 (26%), Positives = 128/323 (39%), Gaps = 80/323 (24%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF +                 
Sbjct: 19  LTICVPSSLVREAEDDREATRKLGYVARAAAVFRADRLVVFPDAEG-------------- 64

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
              +   G  ++  +L Y  TP  LRK L+     LR+VG+LPPL       +  W   R
Sbjct: 65  ---DGRRGGEYVRTVLGYAATPPELRKDLWGQRDELRYVGVLPPL-------RVLW---R 111

Query: 183 EGVTLKENAPNSVGTLVDVG----------LNKH-----VVVDQVLDPGVRVTVAMGTNR 227
            G T       + G + +VG          L +H     V  D  ++ G RVT+      
Sbjct: 112 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSDMEVEQGERVTI------ 165

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP+      G++V  A ++S          G    + TS HG+ ++
Sbjct: 166 RVSSREPVRARIAGKPEA-----GFQV-VAADLSEALA------GDGLAVATSRHGEELS 213

Query: 288 SSDL--TLPTFRH---LLIAFGGLAGLEESIEEDDGLKRKNAREV------------FHS 330
            S L   +   R      +AFG     E  + E  GL     R              F  
Sbjct: 214 VSRLGDLVSDAREAGGYTVAFG---APERGLPEMLGLSPDRIRAAVADGRSVEPDPGFDL 270

Query: 331 YFNTCPHQGSRTIRTEEAIFISL 353
           + NT P Q S  +RTEEA+F +L
Sbjct: 271 WLNTIPAQASEVVRTEEALFATL 293


>gi|448441450|ref|ZP_21589113.1| hypothetical protein C471_06403 [Halorubrum saccharovorum DSM 1137]
 gi|445688859|gb|ELZ41106.1| hypothetical protein C471_06403 [Halorubrum saccharovorum DSM 1137]
          Length = 288

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 83/320 (25%), Positives = 130/320 (40%), Gaps = 74/320 (23%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ +P S++   +    ATR  G +ARA  +FR D +VVF +                 
Sbjct: 7   LTICIPSSVVREAEDAREATRKLGYVARAAAVFRADRLVVFPDGEGERRRG--------- 57

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
                     ++  +L Y  TP  LRK L+     LR+ G+LPPL            P+R
Sbjct: 58  --------GEYVRTVLGYAATPPGLRKDLWGERDELRYAGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPSGEESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP++     G++V  A ++S          G    + TS HG+ ++
Sbjct: 154 RVSSREPVRARITGKPED-----GFQVVEA-DLSEALA------GDGLAVATSRHGEELS 201

Query: 288 SSDL--TLPTFRH---LLIAFGGLA-GLEESIE-EDDGLKRKNA--REV-----FHSYFN 333
            S L   +   R      +AFG    GL E +    DG++   A  R V     F  + N
Sbjct: 202 VSRLGTIVSDAREAGGYTVAFGSPERGLPEMLGLSPDGIRAAVADGRSVEPDPGFDLWLN 261

Query: 334 TCPHQGSRTIRTEEAIFISL 353
           T P Q S  +RTEEA+F++L
Sbjct: 262 TIPAQASEVVRTEEALFVTL 281


>gi|448463229|ref|ZP_21598007.1| hypothetical protein C468_03543 [Halorubrum kocurii JCM 14978]
 gi|445817224|gb|EMA67100.1| hypothetical protein C468_03543 [Halorubrum kocurii JCM 14978]
          Length = 288

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 83/320 (25%), Positives = 131/320 (40%), Gaps = 74/320 (23%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +++ VP S++   +    ATR  G +ARA  +FR D +VVF ++                
Sbjct: 7   LTICVPSSVVREAEDDREATRKLGYVARAAAVFRADRLVVFPDREGERRRG--------- 57

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
                     ++  +L Y  TP  LRK L+     LR+ G+LPPL            P+R
Sbjct: 58  --------GEYVRTVLGYAATPPGLRKDLWGERDELRYAGVLPPLR----------VPWR 99

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVL---------------DPGVRVTVAMGTNR 227
            G T       + G + +VG    V V+  L               + G RVT+      
Sbjct: 100 TGSTPSGGESKTQGLVTEVGPEGRVRVNSPLREHPISLLVPSGMEVEQGERVTI------ 153

Query: 228 NLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVN 287
            + +  P +     KP++     G++V  A ++S          G    + TS HG+ ++
Sbjct: 154 RVSSREPVRARITGKPED-----GFQVVEA-DLSEALA------GDGLAVATSRHGEELS 201

Query: 288 SSDL--TLPTFRH---LLIAFGGLA-GLEESIE-EDDGLKR--KNAREV-----FHSYFN 333
            S L   +   R      +AFG    GL E +    DG++    + R V     F  + N
Sbjct: 202 VSRLGTIVSDAREAGGYTVAFGSPERGLPEMLGLSPDGIRAAVADGRSVEPDPGFDLWLN 261

Query: 334 TCPHQGSRTIRTEEAIFISL 353
           T P Q S  +RTEEA+F++L
Sbjct: 262 TIPAQASEVVRTEEALFVTL 281


>gi|354547624|emb|CCE44359.1| hypothetical protein CPAR2_401610 [Candida parapsilosis]
          Length = 403

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 92/411 (22%), Positives = 155/411 (37%), Gaps = 109/411 (26%)

Query: 34  GDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIID--NTQSLELATRLAGQIARA 91
            +S++ + K  KK  R Q         P++S+ VP S+I   N  +L+  T +A QIA+A
Sbjct: 6   AESATSEAKSTKKPTRQQT-------APSISICVPSSVISSKNAYNLQQKTMIAYQIAKA 58

Query: 92  VTIFRIDEV-------------------------------VVFDNKSSSDNYSRSSAANR 120
             I+ + E+                               VVFD+   +D  S +   N 
Sbjct: 59  SLIYDVAEIIVLTESTTKQEGKGGQSEEEKKDISSSIGKKVVFDDNGENDIGSSAPVENP 118

Query: 121 SN---------RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPH 171
           ++         R ++++  A  L  LLQ+  TP YL K +FS   + +F  +LP      
Sbjct: 119 TDTELDENTDERVNDDKDDALLLASLLQFFITPPYLIKTMFSPRLNPKFGQVLPKFKYAF 178

Query: 172 HLRKHEWAP----------FREGVTLKENAPN-----------------SVGTLVDVGLN 204
            L K    P          F+EG+ +    P                  +V   V++G +
Sbjct: 179 KLPKIPSLPFMQNNNVHEHFKEGIIIPRETPKIKSKTDKQKLVPSPHKVTVSKYVNIGES 238

Query: 205 KHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPN--I 260
           + + +D   +  V  RVT+ +   RN    S +Q    +  K S   +GY VR   +   
Sbjct: 239 EAMKLDIAREVPVYSRVTIDV---RNKTIVSAKQAYGINGHKSS---FGYHVRMVNDNQF 292

Query: 261 SSVFKNCSYKGGYDH--LIGTSEHGDIVNS-SDLTLPTFR----------HLLIAFGGLA 307
           + VF N     GY     +   ++ D  +   D ++P F           +LLI  G   
Sbjct: 293 NKVFTNSPLSDGYSQTIFVNCDDYFDKWSKLVDESVPLFEKKGDANGNNDNLLIILGNYG 352

Query: 308 GLEESIEEDDGLKRKNAREVFHSYFNT-----CPHQGSRTIRTEEAIFISL 353
            ++ + + D+         +F    N      C     +  R E+A  I+L
Sbjct: 353 TIQRNFQSDEA-----KSSIFEGLDNVSKLFDCRLDIPKGCRIEDATLIAL 398


>gi|433431321|ref|ZP_20407623.1| hypothetical protein D320_16455, partial [Haloferax sp. BAB2207]
 gi|432194053|gb|ELK50713.1| hypothetical protein D320_16455, partial [Haloferax sp. BAB2207]
          Length = 228

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 17/105 (16%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV VP S++   +    ATR  G +ARA  +FR +E+VVF ++   +N+          
Sbjct: 3   LSVLVPSSLVREAEDKREATRKLGYVARAAAVFRANELVVFPDEDGENNW---------- 52

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
                  G  F+  +L+Y  TP YLRK ++     LR+ G+LPP+
Sbjct: 53  -------GGEFVETVLRYAATPPYLRKEVWGKRDELRYAGILPPV 90


>gi|403215892|emb|CCK70390.1| hypothetical protein KNAG_0E01240 [Kazachstania naganishii CBS
           8797]
          Length = 352

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/227 (26%), Positives = 104/227 (45%), Gaps = 32/227 (14%)

Query: 61  PTVSVAV-PGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAAN 119
           P VS+ V   +++ N Q+LE  T    ++ARA +++ + EVVV      S +  R++  +
Sbjct: 25  PRVSLCVCAAAVVGNCQNLEQITHKVYELARAASMYNVREVVVL-----SLDPPRTAGTS 79

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--- 174
           +     E  S +  +  LLQY  TP YL +++F    H   +    LP L     +R   
Sbjct: 80  KG----EKLSDSMLVATLLQYFVTPPYLVRSVFKKRFHKYFQHAEKLPRLAQLPFMRQDT 135

Query: 175 -KHEWAPFREGVTLKENAPNSVG--------TLVDVGLNKHVVVDQVLDP-GVRVTVAMG 224
            +   + +REGV++    P  +           V+VG ++ + +   L P  VRVTV + 
Sbjct: 136 PREGKSRYREGVSVAMERPAKMSKQKQFQQTKYVNVGASELLELRAQLVPLNVRVTVDVV 195

Query: 225 TNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKN--CSY 269
            ++ +   SP +        E+   +GY VR A  +  +F    C+Y
Sbjct: 196 EHKIV---SPAEAYGDHVGAEAA--FGYHVRVADTLVDLFLKSPCNY 237


>gi|390938994|ref|YP_006402732.1| hypothetical protein Desfe_1289 [Desulfurococcus fermentans DSM
           16532]
 gi|390192101|gb|AFL67157.1| protein of unknown function DUF171 [Desulfurococcus fermentans DSM
           16532]
          Length = 276

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/292 (23%), Positives = 121/292 (41%), Gaps = 32/292 (10%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P SI+    +L L +    Q+AR  +IF ++ VV +    +S             
Sbjct: 6   LSIVLPTSILSVEPTLFLKSLRIHQVARWSSIFGVNRVVFYREYETS------------- 52

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
             DE       +    +Y  TP YLR+ L   +  LR+VG LPP+         +     
Sbjct: 53  -RDEFREHREIISIHWRYFFTPPYLRRRLVPRNPLLRYVGALPPIRLSEFNVSGKPVDGE 111

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           E +         +   +D  + K+ +V    +PG       G +R    DS R++   ++
Sbjct: 112 ERIGFITLEEGKLTAYLD-NVEKYSLV----NPG---GCKQGFSRVRVVDSRRKL---AE 160

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDIVNSSDL-TLPTFRHLLI 301
             ++  Y G  + +  ++     +   K     +I T + G+  + S L +L   + L++
Sbjct: 161 CIDTEYYIGPSLVFRDSLREALDDAEEKV---FIIATDKTGEAPSLSKLVSLKERKELML 217

Query: 302 AFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
            FG        I   +G    N  E   + +NT P Q   ++RTEEA+ I+L
Sbjct: 218 LFGSPERDLFEISRGEGF---NLLEYVDAVWNTVPGQHVVSVRTEEAVIITL 266


>gi|218884489|ref|YP_002428871.1| hypothetical protein DKAM_1178 [Desulfurococcus kamchatkensis
           1221n]
 gi|218766105|gb|ACL11504.1| DUF171 domain containing protein [Desulfurococcus kamchatkensis
           1221n]
          Length = 276

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 73/298 (24%), Positives = 125/298 (41%), Gaps = 44/298 (14%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +S+ +P SI+    +L L +    Q+AR  +IF ++ VV +    +S             
Sbjct: 6   LSIVLPTSILSVEPTLFLKSLRIHQVARWSSIFGVNRVVFYREYETS------------- 52

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWAPFR 182
             DE       +    +Y  TP YLR+ L   +  LR+VG LPP+         +     
Sbjct: 53  -RDEFREHREIISIHWRYFFTPPYLRRRLVPRNPLLRYVGALPPIRLSEFNVSGKPVDGE 111

Query: 183 EGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGTNRNLDADSPRQVVPPSK 242
           E +         +   +D  + K+ +V    +PG       G +R    DS R++   ++
Sbjct: 112 ERIGFITLEEGKLTAYLD-NVEKYSLV----NPG---DCKQGFSRVKVVDSRRKL---AE 160

Query: 243 PKESGMYWGYKVRYAPNISSVFKNCSYKGGYDH------LIGTSEHGDIVNSSDL-TLPT 295
             ++  Y G      P++  VF+N S +   D       +I T + G+  + S+L +L  
Sbjct: 161 CIDTEYYIG------PSL--VFRN-SLREALDEAGEKVFIIATDKTGEAPSLSNLASLKG 211

Query: 296 FRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRTEEAIFISL 353
              L++ FG        I   +G    N  E   + +NT P Q   ++RTEEA+ I+L
Sbjct: 212 RMELMLLFGSPERDLFEISRGEGF---NLLEYVDAVWNTVPGQHVVSVRTEEAVIITL 266


>gi|448529670|ref|XP_003869886.1| hypothetical protein CORT_0E01660 [Candida orthopsilosis Co 90-125]
 gi|380354240|emb|CCG23753.1| hypothetical protein CORT_0E01660 [Candida orthopsilosis]
          Length = 411

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 82/379 (21%), Positives = 143/379 (37%), Gaps = 107/379 (28%)

Query: 34  GDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSIID--NTQSLELATRLAGQIARA 91
            +S+S + K  KK  + Q         P+VS+ +P S+I   N  +L+  T +A QIA+A
Sbjct: 6   AESASSEPKPTKKPTKHQ-------STPSVSICIPSSVISYKNAYNLQQKTMIAYQIAKA 58

Query: 92  VTIFRIDEVVVF-------------------------------------DNKS------- 107
             I+ + E++V                                      DN++       
Sbjct: 59  SLIYDVSEIIVLKESPIKHEKEEEKEKHGNGGGKKEAMSAVGKKVVFNEDNENGASSSLQ 118

Query: 108 ---SSDNYSRSSAANRSNRSDENESGAAFLVRLL-QYLETPQYLRKALF---------SM 154
              S+D   +    + +  +D NE+  A L+  L Q+  TP YL K +F         S+
Sbjct: 119 PVPSTDTKPKEETPSEAVNNDANENDDALLLASLLQFFITPPYLIKTMFLPRLNPKFTSI 178

Query: 155 HSSLRFVGMLPPL-DAPHHLRKHEWAPFREGVTLKENAPN-----------------SVG 196
            S  ++   LP + + P     + +  F+EG+ +    P                  +V 
Sbjct: 179 LSKFKYAFKLPKIPNLPFMQNNNVYEHFKEGIIIPRETPKIKSKTDKSKLVPSPHKVTVS 238

Query: 197 TLVDVGLNKHVVVDQVLDPGV--RVTVAMGTNRNLDADSPRQVVPPSKPKESGMYWGYKV 254
             V++G  + + +D   +  V  RVT+ +   RN    S R+       K S   +GY V
Sbjct: 239 KYVNIGEPEAMKLDIAREVPVYSRVTIDV---RNKTIVSAREAYGTHGHKSS---FGYHV 292

Query: 255 RYA--PNISSVFKNCSYKGGYDH--LIGTSEHGDIVNSSDLTLPTFR-----------HL 299
           R       S VF N     GY     +   ++ D  + +   +P+F            +L
Sbjct: 293 RMVNETQFSKVFTNSPLSDGYSQTIFVNCDDYFDKWSKTVDQVPSFEEKKSRGNDGNDNL 352

Query: 300 LIAFGGLAGLEESIEEDDG 318
           LI  G    ++++ E D+ 
Sbjct: 353 LIILGNCGTIQQNFENDEA 371


>gi|70606413|ref|YP_255283.1| hypothetical protein Saci_0598 [Sulfolobus acidocaldarius DSM 639]
 gi|449066625|ref|YP_007433707.1| hypothetical protein SacN8_02905 [Sulfolobus acidocaldarius N8]
 gi|449068899|ref|YP_007435980.1| hypothetical protein SacRon12I_02895 [Sulfolobus acidocaldarius
           Ron12/I]
 gi|68567061|gb|AAY79990.1| conserved Archaeal protein [Sulfolobus acidocaldarius DSM 639]
 gi|449035133|gb|AGE70559.1| hypothetical protein SacN8_02905 [Sulfolobus acidocaldarius N8]
 gi|449037407|gb|AGE72832.1| hypothetical protein SacRon12I_02895 [Sulfolobus acidocaldarius
           Ron12/I]
          Length = 250

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 54/117 (46%), Gaps = 19/117 (16%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           +SV +  SI D  + LE      G I R + IFR+ EV+  D+  +  N +R    + SN
Sbjct: 10  LSVVLFTSIFDTEKKLEEIIYKLGFIIRTLVIFRVSEVIWLDDLKNKKNITR-IIKDVSN 68

Query: 123 RSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLPPLDAPHHLRKHEWA 179
                            Y  TP Y +K  F +  +L  VG++PP++ P H+  +++ 
Sbjct: 69  -----------------YALTPPYGKKY-FPIKRTLSKVGLIPPINIPSHVVSNDYV 107


>gi|320101369|ref|YP_004176961.1| hypothetical protein [Desulfurococcus mucosus DSM 2162]
 gi|319753721|gb|ADV65479.1| protein of unknown function DUF171 [Desulfurococcus mucosus DSM
           2162]
          Length = 274

 Score = 45.8 bits (107), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 48/106 (45%), Gaps = 16/106 (15%)

Query: 63  VSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDEVVVFDNKSSSDNYSRSSAANRSN 122
           VS+ +P SI+    +L L +    Q+AR  +IF + EVV +        Y         +
Sbjct: 3   VSIVLPSSILSVESTLLLKSLRIHQVARWTSIFGVSEVVFYRE------YGTPHGEYMGH 56

Query: 123 RSDENESGAAFLVRL-LQYLETPQYLRKALFSMHSSLRFVGMLPPL 167
           R          L+ L  +Y  TP YLR+ L      LR+VG LPP+
Sbjct: 57  RQ---------LIELHWRYFFTPPYLRRRLVPRSPLLRYVGALPPI 93


>gi|323303496|gb|EGA57290.1| YMR310C-like protein [Saccharomyces cerevisiae FostersB]
          Length = 265

 Score = 45.1 bits (105), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 62/246 (25%), Positives = 100/246 (40%), Gaps = 35/246 (14%)

Query: 120 RSNRSDENESGAAFLVRLLQYLETPQYLRKALFS--MHSSLRFVGMLPPLDAPHHLR--K 175
           RS    +  S A  ++ LLQY  TP YLR  +F+      L     LP L      R  K
Sbjct: 13  RSMPPKKKISTAKLILALLQYFVTPPYLRNTVFNEKFRPYLTAASKLPRLSTLPFTRYQK 72

Query: 176 HEWAPFREGVTLKENAPN----SVGTL------VDVGLNKHVVVDQVLDP-GVRVTVAMG 224
            +   +REG+T+K   P       G +      +++G +K + +   L P   RVT+   
Sbjct: 73  QDHGRYREGLTIKMQKPTLARKKTGKVFKQTKYINIGKSKALALQSQLVPINARVTIDTI 132

Query: 225 TNRNLDADSPRQVVPPSKP--KESGMY--WGYKVRYAPNISSVFKNCSYKGGYDHLIG-- 278
           T         R++V P +     +G+   +GY  R   + + +F     K GY   +   
Sbjct: 133 T---------RKIVSPQEAYGDFTGLXSQYGYYTRIXSSFTDLFMKGPLKEGYTQSVYVP 183

Query: 279 -TSEHGDIVNSSDL-TLPTFRHLLIAFGGLAGLEES--IEEDDGLKRKNAREVFHSYFNT 334
            T+    I   S L T  T  H+L+ F     L  +  +++D  +  +  +E F +    
Sbjct: 184 LTTRDTSIPELSSLPTAETNPHILLVFSTWDTLARAFKLDQDQFVDCQGPQEFFDAQLP- 242

Query: 335 CPHQGS 340
           CP   S
Sbjct: 243 CPVSNS 248


>gi|385301686|gb|EIF45859.1| DEHA2G18788p-like protein [Dekkera bruxellensis AWRI1499]
          Length = 258

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 45/176 (25%), Positives = 75/176 (42%), Gaps = 35/176 (19%)

Query: 128 ESGAAFLVRLLQYLETPQYLRKALF----------SMHSSLRFVGMLPPLDAPHHLRKHE 177
           +S    L  LLQ+  TP YL +++F          +    L  + MLP + A      H 
Sbjct: 2   DSDTMLLAGLLQFFVTPPYLVESVFGNSPIDLKKFNYAKKLPTIPMLPFMQANDG---HG 58

Query: 178 WAPFREGVTL-------KENAPNSVGTL---------VDVGLNKHVVVDQVLDPGVRVTV 221
              ++EG+T+       K+ A   V  +         V+VGL K + + + +  G RVTV
Sbjct: 59  TKKYKEGMTIQNLSKSRKKTASGKVKKIKKADRMTKYVNVGLKKMLELKEAVPKGARVTV 118

Query: 222 AMGTNRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLI 277
            M  +  +   SP      S    S   +GY+VR A ++ ++     +  GYD ++
Sbjct: 119 DMKKHIVV---SPYVAYGTSSTNNS---YGYQVRVATSLEALLGQTPFPDGYDMIL 168


>gi|198423892|ref|XP_002121436.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 1860

 Score = 38.5 bits (88), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 32/58 (55%)

Query: 15  KEAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDAPIEVPTVSVAVPGSII 72
           +E     NH+S ++  L NGD +S D K   +R+R +   D   + P V +A+P S++
Sbjct: 404 EEKLVNGNHDSHDDTRLMNGDDASHDLKGIYERQRLKHEQDESAKTPIVDIAIPNSVV 461


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.132    0.379 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,016,339,570
Number of Sequences: 23463169
Number of extensions: 256169958
Number of successful extensions: 529151
Number of sequences better than 100.0: 512
Number of HSP's better than 100.0 without gapping: 445
Number of HSP's successfully gapped in prelim test: 67
Number of HSP's that attempted gapping in prelim test: 527268
Number of HSP's gapped (non-prelim): 652
length of query: 378
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 234
effective length of database: 8,980,499,031
effective search space: 2101436773254
effective search space used: 2101436773254
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 78 (34.7 bits)