BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011682
         (479 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255549968|ref|XP_002516035.1| conserved hypothetical protein [Ricinus communis]
 gi|223544940|gb|EEF46455.1| conserved hypothetical protein [Ricinus communis]
          Length = 483

 Score =  590 bits (1520), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 330/484 (68%), Positives = 395/484 (81%), Gaps = 12/484 (2%)

Query: 1   MSRVGSE--CFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA 58
           M+ +GSE      R + A+ I  + V S+++ +++  + E +S   ++ VE   +++V  
Sbjct: 1   MAMIGSEWKSLSLRYVKASLICFIGVVSMTIVSEARQQQEGSSDRNSEDVESAFLMRVVN 60

Query: 59  ---PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDA 115
               K + GY HVWPD+KFGW+IVVG+IIGFFG A GSVGGVGGGGIFVPML LI+GFDA
Sbjct: 61  FLWQKGQLGYTHVWPDMKFGWQIVVGSIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDA 120

Query: 116 KSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVI 175
           KSS A+SKCM+TG A S V YNL+ RHPTLD+P+IDYDLALLFQP+LVLGISIGVAFNVI
Sbjct: 121 KSSTAISKCMITGAAASTVYYNLKLRHPTLDMPIIDYDLALLFQPVLVLGISIGVAFNVI 180

Query: 176 FADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEEC--K 233
           FADWMITVLLI+LF+  ST+AFLKGVETWKKETI K+EAA++L      NG  +EE   K
Sbjct: 181 FADWMITVLLIILFIGTSTKAFLKGVETWKKETIIKKEAARRLA----SNGGASEEVEYK 236

Query: 234 CDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL 293
             P   +N T PE + EPK+ +VSIIEN+ WKE G+L AVW IILAL+I KNYTTTCSV+
Sbjct: 237 PLPGGPTNGTGPE-SNEPKREEVSIIENVRWKELGILFAVWLIILALEITKNYTTTCSVV 295

Query: 294 YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGG 353
           YWV NLLQIPVA GVS+Y+AV+LYKG+R+IASKG+ GT WR  QLV YCACG+ AGMVGG
Sbjct: 296 YWVCNLLQIPVALGVSSYQAVSLYKGKRRIASKGEAGTNWRVHQLVLYCACGVLAGMVGG 355

Query: 354 LLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFAL 413
           LLGLGGGFILGPLFLELG+PPQVSSATA FA+TFS+SMSV+EYYLLKRFPVPYALYF A+
Sbjct: 356 LLGLGGGFILGPLFLELGVPPQVSSATATFAMTFSASMSVIEYYLLKRFPVPYALYFVAV 415

Query: 414 SIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGF 473
           S IAAFVGQHV+++LI ILGRAS+IIF LSF IFVSA+SLGGVG+A MI++IE+ EYMGF
Sbjct: 416 STIAAFVGQHVVRRLISILGRASLIIFILSFTIFVSAISLGGVGIANMIEKIENHEYMGF 475

Query: 474 DNIC 477
           DNIC
Sbjct: 476 DNIC 479


>gi|225442593|ref|XP_002284423.1| PREDICTED: uncharacterized protein LOC100266912 [Vitis vinifera]
 gi|297743252|emb|CBI36119.3| unnamed protein product [Vitis vinifera]
          Length = 481

 Score =  587 bits (1514), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 327/480 (68%), Positives = 386/480 (80%), Gaps = 6/480 (1%)

Query: 1   MSRVGSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA-- 58
           M+ +GS+ +  R +A   +  L VASV VSA+  LK E++S + +  VEP ++ K+    
Sbjct: 1   MAGIGSKWWGLRRVALILVTFLGVASVLVSAERTLKQESSSHDGSREVEPGLMTKIVNFL 60

Query: 59  -PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKS 117
              + S Y+HVWPD+KFGW+I+VGTIIGFFG A GSVGGVGGGGIFVPML L++GFDAKS
Sbjct: 61  WDPNLSAYEHVWPDMKFGWQIIVGTIIGFFGAAFGSVGGVGGGGIFVPMLTLVIGFDAKS 120

Query: 118 SAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFA 177
           S A+SKCM+ G A S V YNL+ RHPTLD+P+IDYDLALLFQP+LVLGISIGVAFNVIFA
Sbjct: 121 STAISKCMIMGAAASTVYYNLKLRHPTLDMPIIDYDLALLFQPMLVLGISIGVAFNVIFA 180

Query: 178 DWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPE 237
           DWM+TVLLIVLF+  ST+AF KGVETWKKETI K+EAAK+LE    GNG +  E K  P 
Sbjct: 181 DWMVTVLLIVLFIGTSTKAFFKGVETWKKETIIKKEAAKRLE--SNGNGSEDGEYKALPA 238

Query: 238 YLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVL 297
              ND T  +T  PK  +VSIIEN+ WKE GLL AVW IIL LQI KNYT+TCS  YW L
Sbjct: 239 G-PNDGTQRDTNAPKDKEVSIIENVCWKELGLLFAVWVIILGLQIGKNYTSTCSAGYWTL 297

Query: 298 NLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGL 357
           NLLQ+PVA GVS YEAV LYKGRR IASKG+EGT W+   L+FYC  G+ AG+VGGLLGL
Sbjct: 298 NLLQVPVAVGVSGYEAVCLYKGRRVIASKGEEGTNWKVHNLIFYCGVGVLAGIVGGLLGL 357

Query: 358 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 417
           GGGFILGPLFLELGIPPQVSSATA FA+TFS+SMSVVEYYLLKRFPVPYALYF A++ +A
Sbjct: 358 GGGFILGPLFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFVAVATVA 417

Query: 418 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           AF+GQHV++KLI ILGRAS+IIF LSF IFVSA+SLGGVG+  MI++IE K+YMGF+N+C
Sbjct: 418 AFLGQHVVRKLINILGRASLIIFILSFTIFVSAISLGGVGIKDMIEKIEQKDYMGFENLC 477


>gi|224070901|ref|XP_002303288.1| predicted protein [Populus trichocarpa]
 gi|222840720|gb|EEE78267.1| predicted protein [Populus trichocarpa]
          Length = 426

 Score =  570 bits (1470), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 313/419 (74%), Positives = 354/419 (84%), Gaps = 5/419 (1%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
           K   GY HVWPD+KFGW+IVVGTIIGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS 
Sbjct: 10  KGLYGYTHVWPDMKFGWKIVVGTIIGFFGAALGSVGGVGGGGIFVPMLTLIIGFDAKSST 69

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+TG A S V YNL+ RHPTLD+PVIDYDLALLFQP+LVLGISIGVAFNVIFADW
Sbjct: 70  AISKCMITGAAASTVYYNLKLRHPTLDMPVIDYDLALLFQPMLVLGISIGVAFNVIFADW 129

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL 239
           MITVLLI+LF+  ST+AFLKGVETWKKETI K+EAA++LE     N    EE +  P   
Sbjct: 130 MITVLLIILFIGTSTKAFLKGVETWKKETILKQEAARRLE----SNDDDNEEVEYQPLPG 185

Query: 240 SNDTTPE-ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLN 298
                 E E +EPKK  VSIIEN+YWKE GLL AVW  ILAL+I KNYTTTCS+ YW LN
Sbjct: 186 GPSGGGEAENKEPKKEAVSIIENVYWKELGLLFAVWGAILALEIGKNYTTTCSMAYWALN 245

Query: 299 LLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLG 358
           LLQIPVA GVS+YEAV+LYKG RKIASKG+ GT WRA QLV YCACG+ AG+VGGLLGLG
Sbjct: 246 LLQIPVAVGVSSYEAVSLYKGTRKIASKGETGTNWRAHQLVLYCACGVLAGIVGGLLGLG 305

Query: 359 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 418
           GGFILGPLFLELGIPPQVSSATA FA+TFS+SMSVVEYYLLKRFPVPYALYF A++ ++A
Sbjct: 306 GGFILGPLFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFVAVATVSA 365

Query: 419 FVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           FVGQ V++KLI +LGRAS+IIF L+F IFVSA+SLGGVG+  MI++IEH +YMGF+NIC
Sbjct: 366 FVGQFVVRKLINLLGRASLIIFILAFTIFVSAISLGGVGIVNMIEKIEHHDYMGFENIC 424


>gi|337733644|gb|AEI72271.1| hypothetical protein [Citrus trifoliata]
          Length = 353

 Score =  552 bits (1422), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 294/353 (83%), Positives = 320/353 (90%), Gaps = 3/353 (0%)

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M+TG AV+ VVYNLRQRHPTLD+PVIDYDLALLFQP+LVLGISIGVAFNVIFADWMITVL
Sbjct: 1   MITGTAVATVVYNLRQRHPTLDMPVIDYDLALLFQPMLVLGISIGVAFNVIFADWMITVL 60

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT 244
           LI+LF+ +S +AFLKGVETWKKETITK EAAK+LEL    NG Q EE KC+PE LSN TT
Sbjct: 61  LIILFIGMSAKAFLKGVETWKKETITKVEAAKRLELD--RNGDQAEEFKCEPENLSN-TT 117

Query: 245 PEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPV 304
           P+ET+EP KS+ SI++NIYWKE GLLVAVW I+LALQIAKNYT TCSVLYWVLN LQIPV
Sbjct: 118 PKETKEPVKSEGSILKNIYWKELGLLVAVWVIVLALQIAKNYTATCSVLYWVLNFLQIPV 177

Query: 305 AGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 364
           AGGVSAYEAVALYKGRR IASKGD GT WR  QLVFYCACGITAG+VGGLLGLGGGFILG
Sbjct: 178 AGGVSAYEAVALYKGRRTIASKGDVGTNWRPHQLVFYCACGITAGVVGGLLGLGGGFILG 237

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           PLFLELGIPPQVSSATA FA+TFS+SMSVVEYYLLKRFPVPYALYFFA+S IAAFVGQHV
Sbjct: 238 PLFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFFAVSTIAAFVGQHV 297

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++KLI ILGRASIIIFTLS  IFVSALSL G+GLAKM+ RI+ KEYMGFD+IC
Sbjct: 298 VRKLINILGRASIIIFTLSLTIFVSALSLSGIGLAKMVPRIDRKEYMGFDSIC 350


>gi|225430021|ref|XP_002284175.1| PREDICTED: uncharacterized protein LOC100267889 [Vitis vinifera]
 gi|296081886|emb|CBI20891.3| unnamed protein product [Vitis vinifera]
          Length = 478

 Score =  549 bits (1415), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 309/461 (67%), Positives = 370/461 (80%), Gaps = 8/461 (1%)

Query: 22  LVVASVSVSAQSNLKPEATSKN--KNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGW 76
            ++AS  VS + ++K EA++ N  K      N + KV        RSGY+HVWP+++FGW
Sbjct: 17  FMLASAFVSGERSIKHEASTFNVTKEAGFNSNYLSKVVNFLWQSDRSGYQHVWPEMEFGW 76

Query: 77  RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVY 136
           +IVVG++IGFFG A GSVGGVGGGGIFVPML+L++GFD KS+ A+SKCM+ G A S V Y
Sbjct: 77  QIVVGSVIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATALSKCMIMGAAGSTVYY 136

Query: 137 NLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRA 196
           NL+ RHPTLD+P+IDYDLALLFQP+L++GISIGVAFNV+FADWM+TVLLIVLFL  ST+A
Sbjct: 137 NLKLRHPTLDMPIIDYDLALLFQPMLMMGISIGVAFNVLFADWMVTVLLIVLFLGTSTKA 196

Query: 197 FLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKV 256
           FLKGVETWKKETI KREAAK+L     GNG +  E K  P   SN T    T + K+ +V
Sbjct: 197 FLKGVETWKKETIMKREAAKRLG--TNGNGTEEVEYKPLPSGPSNGTQ-NATNKSKEPEV 253

Query: 257 SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL 316
           SIIEN+YWKE GLLV VW   LALQIAKN T TCS+ YWVLN +QIPV+ GVS YEAV+L
Sbjct: 254 SIIENVYWKELGLLVFVWVAFLALQIAKNNTATCSMAYWVLNFMQIPVSVGVSLYEAVSL 313

Query: 317 YKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQV 376
           YKGRR IASKGD GT +R  QL+ YC CG+ AG+VGGLLGLGGGFILGPLFLELG+PPQV
Sbjct: 314 YKGRRIIASKGDAGTNFRVHQLILYCFCGVLAGIVGGLLGLGGGFILGPLFLELGVPPQV 373

Query: 377 SSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRAS 436
           SSATA FA+TFSSSMSVVEYYLLKRFPVPYA+Y   ++ IAAF+GQHV+++LI ILGRAS
Sbjct: 374 SSATATFAMTFSSSMSVVEYYLLKRFPVPYAVYLVLVATIAAFIGQHVVRRLISILGRAS 433

Query: 437 IIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +IIF L+F IF+SA+SLGGVGL+ MI +IE  EYMGF+N+C
Sbjct: 434 LIIFILAFTIFISAISLGGVGLSNMIGKIERHEYMGFENLC 474


>gi|224092328|ref|XP_002309560.1| predicted protein [Populus trichocarpa]
 gi|222855536|gb|EEE93083.1| predicted protein [Populus trichocarpa]
          Length = 479

 Score =  542 bits (1396), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 303/476 (63%), Positives = 369/476 (77%), Gaps = 15/476 (3%)

Query: 10  RWRLLAAAFIGLL--VVASVSVSAQSNLKPEATSKNKN---DHVEPNIVIKVS---APKS 61
           +WR L +  + LL  ++A V VSA+  LK EATS   N   D V   ++  V+    P  
Sbjct: 7   KWRGLRSVLMVLLNFLLAFVFVSAERGLKHEATSTRMNETGDSVSSYVLKAVNFLWQPDH 66

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
           + GY+HVWP++KFGW+IV+G+IIGFFG A GSVGGVGGGGIFVPML+L++GFD KS+ A+
Sbjct: 67  K-GYQHVWPEMKFGWQIVLGSIIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATAI 125

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+ G A+S V YNL+ RHPTLD+P+IDYDLALL QP+L+LGISIGVAFNV+FADWM+
Sbjct: 126 SKCMIMGAAISTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVVFADWMV 185

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           TVLLI+LFL  ST+AFLKGVETWKKETI KREAAK+LE     +G    E +  P     
Sbjct: 186 TVLLIILFLGTSTKAFLKGVETWKKETIMKREAAKRLE----SDGASAGEVEYKPLPGGP 241

Query: 242 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
             +P+  +  K+ +VSI+EN+YWKE GLLV VW   L LQI K  T+TCS  YWVLNLLQ
Sbjct: 242 SNSPQ--KADKEEEVSILENVYWKELGLLVFVWVAFLVLQITKEGTSTCSTTYWVLNLLQ 299

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           IPV+ GVS YEAV+LYKG R IASKG EGT +   QLV YC  GI AG+VGGLLGLGGGF
Sbjct: 300 IPVSVGVSMYEAVSLYKGHRIIASKGTEGTNFTILQLVIYCLFGILAGVVGGLLGLGGGF 359

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           I+GPLFLELGIPPQVSSATA FA+TFSSSMSVVEYYLLKRFPV YA+YF A++  AAF+G
Sbjct: 360 IMGPLFLELGIPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVSYAVYFVAVATFAAFIG 419

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           QH++++LI + GRAS+IIF L+  IF+SA+SLGGVG+A MI +I   EYMGF+N+C
Sbjct: 420 QHIVRRLIIVFGRASLIIFILASTIFISAISLGGVGVANMIGKIHRHEYMGFENLC 475


>gi|356556244|ref|XP_003546436.1| PREDICTED: uncharacterized protein LOC100816528 [Glycine max]
          Length = 476

 Score =  536 bits (1381), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 304/472 (64%), Positives = 371/472 (78%), Gaps = 12/472 (2%)

Query: 10  RWRL-LAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHV 68
           RW + L   F  LL+ ASVS     N K    +  +   +    V+      S SGY+H 
Sbjct: 9   RWDVRLVLTFSLLLISASVSALEHVNHKKIDPTTAETSFLGK--VMNFLWSSSGSGYEHT 66

Query: 69  WPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTG 128
           WP+I+FGWRI+ GTIIGF G+A G+VGGVGGGGIFV ML+LI+GFDAKS+ A+SKCM+TG
Sbjct: 67  WPEIEFGWRIITGTIIGFVGSAFGTVGGVGGGGIFVTMLSLIIGFDAKSATAISKCMITG 126

Query: 129 GAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
           GA + V YNLRQ+HPTLD+PVIDYDLALLFQP+LVLGISIGVAFNVIFADWMITVLL+++
Sbjct: 127 GAAATVFYNLRQKHPTLDMPVIDYDLALLFQPVLVLGISIGVAFNVIFADWMITVLLLII 186

Query: 189 FLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQ-TEECKCDPEYLSNDTTPEE 247
           F+ ++T+AFLKGVETWKKETI K+E A+Q +     NG + T E   +P  L        
Sbjct: 187 FVGIATKAFLKGVETWKKETIIKKETARQSQF----NGTERTAEVAYEP--LPGGPNTSN 240

Query: 248 TREPKKSKV--SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVA 305
            +EPKKSK   SI+EN+ WK  G+L  VW +ILA +IAK++TTTCSV YWVLNLLQ+PVA
Sbjct: 241 HKEPKKSKETGSILENVRWKALGVLFTVWVLILASEIAKSHTTTCSVEYWVLNLLQVPVA 300

Query: 306 GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 365
            G ++Y+AV LY G+R IASKGD+ T+WRA QL+ YC+CGI AG+VGGLLGLGGGFILGP
Sbjct: 301 LGATSYQAVLLYTGKRVIASKGDQRTQWRAHQLIMYCSCGICAGIVGGLLGLGGGFILGP 360

Query: 366 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 425
           LFLELGIPPQVSSATA FA+TFS+SMSVVEYYLLKRFP+PY LYF A+S  AAFVGQ ++
Sbjct: 361 LFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPIPYTLYFVAVSTFAAFVGQVLV 420

Query: 426 KKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +KL+ ILGRAS+IIF LS  IFVSA+SLGGVG++ MI+RI +KEYMGF+N+C
Sbjct: 421 RKLVAILGRASLIIFILSSTIFVSAISLGGVGISNMIQRIANKEYMGFENLC 472


>gi|255551046|ref|XP_002516571.1| conserved hypothetical protein [Ricinus communis]
 gi|223544391|gb|EEF45912.1| conserved hypothetical protein [Ricinus communis]
          Length = 476

 Score =  535 bits (1379), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 296/474 (62%), Positives = 371/474 (78%), Gaps = 15/474 (3%)

Query: 10  RWRLLAAAFIGLL--VVASVSVSAQSNLKPE--ATSKNKNDHVEPNIVIKVSA--PKSRS 63
           +WR L +  +  L  ++A V VSA+  ++ E   T + + + +    V  V       ++
Sbjct: 8   KWRGLRSVLMVSLNFLLAFVLVSAERGMRREFTGTPQTEGEGLSTYFVKAVDFLWQPDQT 67

Query: 64  GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
           GY+HVWP+++FGW+IV+GTIIGFFG A GSVGGVGGGGIFVPML+LI+GFD KS+ A+SK
Sbjct: 68  GYRHVWPEMRFGWQIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISK 127

Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
           CM+ G A S V YNL+ RHPTLD+P+IDYDL LL QP+L+LGISIGVAFNVIFADWM+TV
Sbjct: 128 CMIMGAAASTVYYNLKLRHPTLDMPIIDYDLTLLIQPMLMLGISIGVAFNVIFADWMVTV 187

Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDT 243
           LLI+LF+  ST+AF KGVETWKKETI K+EAAK+LE     NG    E +  P      T
Sbjct: 188 LLIILFIGTSTKAFFKGVETWKKETIMKKEAAKRLE----ANG---AEVEYKPLPSGPST 240

Query: 244 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 303
            PE  +E K+ +V+++EN+YWKE GLLV VW   L L IAKN+   CS+ YWV+NLLQIP
Sbjct: 241 GPE--KEAKEPEVTLLENVYWKELGLLVFVWVAFLILHIAKNHLPACSIEYWVVNLLQIP 298

Query: 304 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 363
           V+ GVS YEAV+LYKGRR IASKG EGT ++  QLV YC+CG+ AG+VGGLLGLGGGFI+
Sbjct: 299 VSLGVSGYEAVSLYKGRRIIASKGKEGTNFKVHQLVLYCSCGVLAGVVGGLLGLGGGFIM 358

Query: 364 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 423
           GPLFLELGIPPQVSSATA FA+TFSSSMSVVEYYLLKRFPVPYALYF +++ IAA +GQH
Sbjct: 359 GPLFLELGIPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVSVATIAALIGQH 418

Query: 424 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +++++I + GRAS+IIF L+F IFVSA+SLGGVG++ MI +I+H EYMGF+N+C
Sbjct: 419 IVRRMIIMFGRASLIIFILAFTIFVSAISLGGVGISNMIWKIQHNEYMGFENLC 472


>gi|356547396|ref|XP_003542098.1| PREDICTED: uncharacterized protein LOC100789733 [Glycine max]
          Length = 487

 Score =  533 bits (1372), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 296/466 (63%), Positives = 368/466 (78%), Gaps = 20/466 (4%)

Query: 26  SVSVSA-----QSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVV 80
           SVSVSA       N+ P      +   +    V+      S SGY+H WPDI+FGWRI+ 
Sbjct: 24  SVSVSALQHVSHKNINPTTVETTQTSFLGK--VVNFLWSSSGSGYQHTWPDIEFGWRIIT 81

Query: 81  GTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQ 140
           GTIIGF G+A G+VGGVGGGGIFV ML+LI+GFDAKS+ A+SKCM+TGGA + V YNL+Q
Sbjct: 82  GTIIGFLGSAFGTVGGVGGGGIFVTMLSLIIGFDAKSATAISKCMITGGAAATVFYNLKQ 141

Query: 141 RHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKG 200
           +HPTLD+PVIDYDLALLFQP+LVLGISIGVAFNVIFADWMITVLL+++F+ ++T+AFLKG
Sbjct: 142 KHPTLDMPVIDYDLALLFQPVLVLGISIGVAFNVIFADWMITVLLLIIFVGIATKAFLKG 201

Query: 201 VETWKKETITKREAAKQLELIVLGNGYQ-TEECKCDP----EYLSNDTTPEETREP---- 251
           VETWKKETI K+E A+Q +     NG + +EE   +P       SN   P++++E     
Sbjct: 202 VETWKKETIIKKETARQSQF----NGTERSEEVAYEPLPGGPNTSNHNEPKKSKETTVRL 257

Query: 252 KKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAY 311
           +  K S++EN+ WK  G+L  VW +ILA +IAK++TTTCSV YW+LNLLQ+PVA G ++Y
Sbjct: 258 RHHKGSVLENVRWKALGVLFTVWVLILASEIAKSHTTTCSVEYWILNLLQVPVALGATSY 317

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           +AV LY G+R IASKGD+ T+WRA QLV YC+CGI AG+VGGLLGLGGGFILGPLFLELG
Sbjct: 318 QAVLLYTGKRVIASKGDQRTQWRAHQLVLYCSCGICAGIVGGLLGLGGGFILGPLFLELG 377

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           IPPQVSSATA FA+TFS+SMSVVEYYLLKRFP+PY LYF A+S  AAFVGQ +++KL+ I
Sbjct: 378 IPPQVSSATATFAMTFSASMSVVEYYLLKRFPIPYTLYFVAVSTFAAFVGQVLVRKLVAI 437

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           LGRAS+IIF LS  IFVSA+SLGGVG++ MI++I +KEYMGF+N+C
Sbjct: 438 LGRASLIIFILSGTIFVSAISLGGVGISNMIQKIANKEYMGFENLC 483


>gi|224059076|ref|XP_002299704.1| predicted protein [Populus trichocarpa]
 gi|222846962|gb|EEE84509.1| predicted protein [Populus trichocarpa]
          Length = 424

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 301/419 (71%), Positives = 356/419 (84%), Gaps = 5/419 (1%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
           K R GY HVWP+++FGW+IV+G+IIGFFG ACGSVGGVGGGGIFVPML+LI+GFD KS+ 
Sbjct: 10  KGRVGYTHVWPEMRFGWKIVLGSIIGFFGAACGSVGGVGGGGIFVPMLSLIIGFDTKSAT 69

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+TGGA + V YNL+ RHPTL++PVIDYDLALLFQP+LVLGISIGVA NVIFA+W
Sbjct: 70  AMSKCMITGGAAATVYYNLKLRHPTLELPVIDYDLALLFQPMLVLGISIGVALNVIFANW 129

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL 239
           MIT+LLIVLF+V ST+AF KGVETWKKET TK+EA + LE     N  + EE + +P  +
Sbjct: 130 MITILLIVLFIVTSTKAFFKGVETWKKETKTKKEATRSLE----SNDDRNEEVEGEPPPV 185

Query: 240 SNDTTPE-ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLN 298
                 + ET+E KK +VSII+N+YW E  LLVAVW IILALQI KNY+TTCS+ YW+LN
Sbjct: 186 GPTVGAQTETKEYKKEQVSIIKNVYWWELSLLVAVWLIILALQIGKNYSTTCSMAYWLLN 245

Query: 299 LLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLG 358
           +LQIPVA GVS+YEA  LYKGRRKIASKGD  T W+  QLV YC  G+ AG+VGG+LGLG
Sbjct: 246 ILQIPVAFGVSSYEAFNLYKGRRKIASKGDAVTNWKIRQLVLYCIIGLLAGVVGGMLGLG 305

Query: 359 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 418
           GGFILGPLFLE+GIPPQVSSATA FA+ FS+SMSVVEYYLLKRFPVPYALYFFA++ +AA
Sbjct: 306 GGFILGPLFLEMGIPPQVSSATATFAMMFSASMSVVEYYLLKRFPVPYALYFFAVATVAA 365

Query: 419 FVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            VGQHV++KLI ILGRAS+IIFTL+F IFVSA+ LGGVG+A+M+KRIE KEYMGF+NIC
Sbjct: 366 LVGQHVIRKLISILGRASLIIFTLAFTIFVSAILLGGVGIARMVKRIERKEYMGFENIC 424


>gi|224143013|ref|XP_002324818.1| predicted protein [Populus trichocarpa]
 gi|222866252|gb|EEF03383.1| predicted protein [Populus trichocarpa]
          Length = 448

 Score =  526 bits (1356), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 289/452 (63%), Positives = 352/452 (77%), Gaps = 13/452 (2%)

Query: 32  QSNLKPEATSKNKN---DHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIG 85
           +  L  EATS   +   D V   ++  VS    P  R GY+HVWP++KFGW+IV+G+IIG
Sbjct: 1   ERGLTREATSARTDETGDSVSSYVLKAVSCLWQPDQR-GYQHVWPEMKFGWQIVLGSIIG 59

Query: 86  FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL 145
           F G A GSVGGVGGGGIFVPML+LI+GFD KS+ A+SKCM+ G AVS+V YNL+ RHPT+
Sbjct: 60  FLGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSSVYYNLKLRHPTI 119

Query: 146 DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK 205
           D+P+IDYDLALL QP+L+LGIS+GVA NVIFADWM+TVLLIVLF V ST+AF KGVETWK
Sbjct: 120 DMPIIDYDLALLIQPMLMLGISLGVALNVIFADWMVTVLLIVLFTVTSTKAFFKGVETWK 179

Query: 206 KETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWK 265
           KETI KREAAK LE     NG    E +  P       +P+  +E K+ ++++ EN+YWK
Sbjct: 180 KETIMKREAAKHLE----SNGTDAGEVEYKPLRGGPSNSPQ--KETKELEITVFENVYWK 233

Query: 266 EFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS 325
           E GLLV VW   L LQIAK  T TCS+ YWVLNLLQIPV+ GV+ YE V+LYKGRR +AS
Sbjct: 234 ELGLLVFVWVAFLVLQIAKESTYTCSIGYWVLNLLQIPVSVGVTLYEVVSLYKGRRVVAS 293

Query: 326 KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 385
           KGDEGT +R  QL+ YCA G+ AG+VGGLLGLGGGFI+GPLFLELG+PPQVSSATA FA+
Sbjct: 294 KGDEGTNFRVLQLMTYCAFGVLAGVVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 353

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 445
           TFSSSMSVVEYYLLKRFPVPYA+YF  ++  AAF GQH++++LI + GRAS+IIF L+F 
Sbjct: 354 TFSSSMSVVEYYLLKRFPVPYAVYFVVVATFAAFTGQHIVRRLIIVFGRASLIIFILAFT 413

Query: 446 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           IFVSA+ LGG G++ MI +I  +EYMGF+N C
Sbjct: 414 IFVSAILLGGFGISNMIGKIHRQEYMGFENFC 445


>gi|297821995|ref|XP_002878880.1| hypothetical protein ARALYDRAFT_481426 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324719|gb|EFH55139.1| hypothetical protein ARALYDRAFT_481426 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 479

 Score =  526 bits (1355), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 280/414 (67%), Positives = 333/414 (80%), Gaps = 7/414 (1%)

Query: 64  GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
           GY+HVWP+ +F W+IV+GT++GFFG A GSVGGVGGGGIFVPML+L++GFD KS+ A+SK
Sbjct: 69  GYRHVWPEFEFNWQIVLGTLVGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATAISK 128

Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
           CM+ G +VS V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGVAFNVIF DW++TV
Sbjct: 129 CMIMGASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFPDWLVTV 188

Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDT 243
           LLI+LFL  ST+AFLKG ETW KETI K+EAAK+LE     NG    E +  P   +  T
Sbjct: 189 LLIILFLGTSTKAFLKGSETWNKETIEKKEAAKRLE----SNGVSGAEVEYVPLPAAPST 244

Query: 244 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 303
            P      KK +VSIIEN+YWKE GLLV VW + LALQI+K     CSV YWV+NLLQIP
Sbjct: 245 NPGNK---KKEEVSIIENVYWKELGLLVFVWVVFLALQISKQNLANCSVAYWVINLLQIP 301

Query: 304 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 363
           VA GVS YEAVALY+GRR IASKG   + +   QLV YC  GI AG+VGGLLGLGGGFI+
Sbjct: 302 VAVGVSGYEAVALYQGRRIIASKGQGDSNFTVGQLVMYCTFGILAGIVGGLLGLGGGFIM 361

Query: 364 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 423
           GPLFLELG+PPQVSSATA FA+TFSSSMSVVEYYLLKRFPVPYALY   ++ IAAFVGQH
Sbjct: 362 GPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLVGVATIAAFVGQH 421

Query: 424 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           V+++LI  LGRAS+IIF L+ +IF+SA+SLGGVG+  MI +I+  EYMGF+N+C
Sbjct: 422 VVRRLIAALGRASLIIFILASMIFISAISLGGVGIVNMIGKIQRHEYMGFENLC 475


>gi|30682879|ref|NP_850068.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|22531098|gb|AAM97053.1| unknown protein [Arabidopsis thaliana]
 gi|25083770|gb|AAN72117.1| unknown protein [Arabidopsis thaliana]
 gi|26450521|dbj|BAC42374.1| unknown protein [Arabidopsis thaliana]
 gi|28951043|gb|AAO63445.1| At2g25737 [Arabidopsis thaliana]
 gi|330252652|gb|AEC07746.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 476

 Score =  525 bits (1351), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 287/452 (63%), Positives = 346/452 (76%), Gaps = 10/452 (2%)

Query: 29  VSAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIG 85
           VSA+   K    S ++    E +  +K         + GY+HVWP+ +F W+IV+GT++G
Sbjct: 28  VSAERRGKSLRLSTDETRENESSFFLKAINFLWESDQIGYRHVWPEFEFNWQIVLGTLVG 87

Query: 86  FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL 145
           FFG A GSVGGVGGGGIFVPML+LI+GFD KS+ A+SKCM+ G +VS V YNLR RHPTL
Sbjct: 88  FFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGASVSTVYYNLRLRHPTL 147

Query: 146 DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK 205
           D+P+IDYDLALL QP+L+LGISIGVAFNVIF DW++TVLLIVLFL  ST+AFLKG ETW 
Sbjct: 148 DMPIIDYDLALLIQPMLMLGISIGVAFNVIFPDWLVTVLLIVLFLGTSTKAFLKGSETWN 207

Query: 206 KETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWK 265
           KETI K+EAAK+LE     NG    E +  P   +  T P      KK +VSIIEN+YWK
Sbjct: 208 KETIEKKEAAKRLE----SNGVSGTEVEYVPLPAAPSTNPGNK---KKEEVSIIENVYWK 260

Query: 266 EFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS 325
           E GLLV VW + LALQI+K     CSV YWV+NLLQIPVA GVS YEAVALY+GRR IAS
Sbjct: 261 ELGLLVFVWIVFLALQISKQNLANCSVAYWVINLLQIPVAVGVSGYEAVALYQGRRIIAS 320

Query: 326 KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 385
           KG   + +   QLV YC  GI AG+VGGLLGLGGGFI+GPLFLELG+PPQVSSATA FA+
Sbjct: 321 KGQGDSNFTVGQLVMYCTFGIIAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 380

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 445
           TFSSSMSVVEYYLLKRFPVPYALY   ++ IAA+VGQHV+++LI  +GRAS+IIF L+ +
Sbjct: 381 TFSSSMSVVEYYLLKRFPVPYALYLVGVATIAAWVGQHVVRRLIAAIGRASLIIFILASM 440

Query: 446 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           IF+SA+SLGGVG+  MI +I+  EYMGF+N+C
Sbjct: 441 IFISAISLGGVGIVNMIGKIQRHEYMGFENLC 472


>gi|255549970|ref|XP_002516036.1| conserved hypothetical protein [Ricinus communis]
 gi|223544941|gb|EEF46456.1| conserved hypothetical protein [Ricinus communis]
          Length = 466

 Score =  524 bits (1349), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 301/477 (63%), Positives = 367/477 (76%), Gaps = 15/477 (3%)

Query: 1   MSRVGSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPK 60
           M   G + +  RL     +GL+VVAS+ + A+S      T +    H            K
Sbjct: 2   MDWSGLKEWNLRLRKKYLVGLIVVASMVIVAESASSYNQTQQAGYLH---------HGNK 52

Query: 61  SRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAA 120
             S YKHVWP+++ GWRIVVG IIGFFG A GS GGVGGGGI+VPML LI+GFDAKSS A
Sbjct: 53  GHSDYKHVWPEMRIGWRIVVGAIIGFFGAAFGSAGGVGGGGIYVPMLTLIIGFDAKSSIA 112

Query: 121 VSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWM 180
           +SKCM+TG A S V YNL+QRHPTL++PVIDYDLALLFQP+LVLG+SIGV FNVIFADWM
Sbjct: 113 ISKCMITGAAASTVYYNLKQRHPTLEMPVIDYDLALLFQPMLVLGVSIGVTFNVIFADWM 172

Query: 181 ITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLS 240
           ITVLLI++F+ +ST+AF KG++TWKKETI K+ A      ++LG+  +       PE LS
Sbjct: 173 ITVLLIIIFIFMSTKAFSKGIQTWKKETIKKKIAP-----LLLGDNAEVVIPTPPPEILS 227

Query: 241 NDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 300
            D    ET+ PKK KVS+IEN+YWK  GLL+ +W +ILALQIAKNYT TC+V YW+L+  
Sbjct: 228 -DNAQIETKGPKKEKVSLIENVYWKALGLLLVIWFMILALQIAKNYTKTCTVPYWILDFA 286

Query: 301 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 360
           QIPVA  ++ Y+AV LYKGRRKIAS+G+  + WR  +L+ YC  G+ AG++GGLLGLGGG
Sbjct: 287 QIPVAASLTIYQAVRLYKGRRKIASRGEAESNWRVHKLLLYCFSGLLAGIIGGLLGLGGG 346

Query: 361 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 420
           FILGPLFLE+GIPPQVSSATA FA+TFS+SMSVVEYYLLKRFPVPYALY FA++ +A FV
Sbjct: 347 FILGPLFLEMGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYLFAVTTLAGFV 406

Query: 421 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           GQ+V+KK+I +LGRASIIIF LSF I VSA+SLGGVGLA MIK+IE KEYMGF++IC
Sbjct: 407 GQYVVKKVIDMLGRASIIIFILSFTILVSAISLGGVGLADMIKKIERKEYMGFEDIC 463


>gi|356518316|ref|XP_003527825.1| PREDICTED: uncharacterized protein LOC100810784 [Glycine max]
          Length = 477

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 284/460 (61%), Positives = 352/460 (76%), Gaps = 13/460 (2%)

Query: 24  VASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA-----PKSRSGYKHVWPDIKFGWRI 78
           V ++  S +S +K E    N     EP       A         SGY+HVWPD++FGW+I
Sbjct: 21  VVAIVASERSIMKMEVARFNVTTSREPQQSFLTKALNFLWQSGESGYQHVWPDMEFGWQI 80

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           VVGT +GF G A GSVGGVGGGGIFVPML+LIVGFD KSS A+SKCM+ G AVS V YNL
Sbjct: 81  VVGTFVGFCGAAFGSVGGVGGGGIFVPMLSLIVGFDQKSSTAISKCMIMGAAVSTVYYNL 140

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
           + RHPTL++P+IDYDLALL QP+L+LGISIGV FNV+F DW++T+LLIVLFL  ST+AF 
Sbjct: 141 KLRHPTLNMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWIVTILLIVLFLGTSTKAFF 200

Query: 199 KGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 258
           KG+ETWKKETI K+EAAK+ E     NG    E +  P  L +       ++ K+ +V+I
Sbjct: 201 KGIETWKKETIMKKEAAKRQE----SNG-SGAEVEYKP--LPSGPNGANEKDTKEQEVTI 253

Query: 259 IENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALY 317
           IEN+YWKEFGLLV VW   LALQIAK NYTTTCS  YWVLNLLQ+PV+ GV+AYEA AL+
Sbjct: 254 IENVYWKEFGLLVFVWVSFLALQIAKENYTTTCSTFYWVLNLLQVPVSVGVTAYEAAALF 313

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 377
            GRR IAS G++G  +   QL+ YC  G+ AG+VGG+LGLGGGF++GPLFLELG+PPQVS
Sbjct: 314 SGRRVIASTGEQGKDFTVLQLIIYCVFGVLAGVVGGMLGLGGGFVMGPLFLELGVPPQVS 373

Query: 378 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASI 437
           SATA FA+TFSSSMSV+EYYLLKRFP+PYALYF  ++ IAAFVGQH+++KLI + GRAS+
Sbjct: 374 SATATFAMTFSSSMSVIEYYLLKRFPIPYALYFVLVATIAAFVGQHIVRKLIILFGRASL 433

Query: 438 IIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           IIF L+  IFVSA+SLGGVG+  M+ +I++ EYMGF+++C
Sbjct: 434 IIFILASTIFVSAVSLGGVGIVNMVHKIDNHEYMGFEDLC 473


>gi|255549972|ref|XP_002516037.1| conserved hypothetical protein [Ricinus communis]
 gi|223544942|gb|EEF46457.1| conserved hypothetical protein [Ricinus communis]
          Length = 481

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 304/473 (64%), Positives = 369/473 (78%), Gaps = 8/473 (1%)

Query: 10  RWRL--LAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSG 64
           RW L  L A     LVV S+   +++ LK   +   +   VE + V + S     K R G
Sbjct: 8   RWSLIMLGAILSSFLVVGSLLALSETRLKLLGSDYKRPKEVEFHHVTRASNFFYQKGRLG 67

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           YKHVWP+++FGW+IV+G +IG FG ACGSVGGVGGGGIFVPML LIVGFDAKSS A+SKC
Sbjct: 68  YKHVWPEMRFGWKIVLGGVIGVFGAACGSVGGVGGGGIFVPMLTLIVGFDAKSSTALSKC 127

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M+TG A + V YNL+ RHPTL++P+IDYDLALLFQP+LVLGIS+GVA NVIFADW IT+L
Sbjct: 128 MITGAAAATVYYNLKLRHPTLELPIIDYDLALLFQPMLVLGISLGVALNVIFADWAITIL 187

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT 244
           LI+LF+V S++AF KGVETWK+ET  K+EAA+ LE+         E+    P   S    
Sbjct: 188 LIILFIVTSSKAFFKGVETWKQETKLKKEAARHLEIDEDTTEEMEEK---SPPTGSTIVA 244

Query: 245 PEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPV 304
             +  E K+ KVSI EN+ WKE GLLVAVW+IILALQI KNY+TTCSV YWVL+ LQIPV
Sbjct: 245 QAQFSESKREKVSIGENVCWKELGLLVAVWSIILALQIGKNYSTTCSVTYWVLSTLQIPV 304

Query: 305 AGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 364
           A GV+ YEA++LYKG+RKIAS+G+  T W+  QLV Y A GI AG+VGG+LGLGGGFILG
Sbjct: 305 AVGVTGYEAISLYKGQRKIASRGEVSTNWKIHQLVSYSAIGILAGVVGGMLGLGGGFILG 364

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           PLFLE+GIPPQVSSATA F + FS+SMSV+EYYLLKRFPVPYALY FA++IIAAFVGQHV
Sbjct: 365 PLFLEMGIPPQVSSATATFIMAFSASMSVIEYYLLKRFPVPYALYLFAVAIIAAFVGQHV 424

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++KLI +LGRAS+IIF L+F IF SA+SLGGVG+A+ IK+IE KEYMGF++IC
Sbjct: 425 VRKLISLLGRASLIIFILAFTIFASAISLGGVGIARAIKKIERKEYMGFEDIC 477


>gi|356510003|ref|XP_003523730.1| PREDICTED: uncharacterized protein LOC100809318 [Glycine max]
          Length = 472

 Score =  518 bits (1333), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 277/417 (66%), Positives = 339/417 (81%), Gaps = 12/417 (2%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
            SGY+HVWPD++FGW IV+GT +GF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+
Sbjct: 63  ESGYQHVWPDMEFGWEIVLGTFVGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAI 122

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+ G AVS V YNL+ RHPTL++P+IDYDLALL QP+L+LGISIGV FNV+F DW++
Sbjct: 123 SKCMIMGAAVSTVYYNLKLRHPTLNMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWIV 182

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           T+LLIVLFL  ST+AF KGVETWKKETI K+EAAK+ E     NG          EY   
Sbjct: 183 TILLIVLFLGTSTKAFFKGVETWKKETIMKKEAAKRQE----SNG-----SGAVVEYKPL 233

Query: 242 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLL 300
            + PE  ++ K+ ++SIIEN+YWKEFGLLV VW   LALQIAK NYTTTCS LYWVLNLL
Sbjct: 234 PSGPE--KDTKEQEMSIIENVYWKEFGLLVFVWVSFLALQIAKENYTTTCSTLYWVLNLL 291

Query: 301 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 360
           Q+PV+ GV+AYEA AL+ GRR IAS G++G  +   QL+ YC  G+ AG+VGG+LGLGGG
Sbjct: 292 QVPVSVGVTAYEAAALFSGRRVIASTGEQGKDFTVLQLMIYCVFGVLAGVVGGMLGLGGG 351

Query: 361 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 420
           F++GPLFLELG+PPQVSSATA FA+TFSSSMSV+EYYLLKRFPVPYALYF  ++ IAAFV
Sbjct: 352 FVMGPLFLELGVPPQVSSATATFAMTFSSSMSVIEYYLLKRFPVPYALYFILVATIAAFV 411

Query: 421 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           GQH+++KLI + GRAS+IIF L+  IFVSA+SLGGVG+  M+ +I++ EYMGF+++C
Sbjct: 412 GQHIVRKLIILFGRASLIIFILASTIFVSAVSLGGVGIVNMVHKIQNHEYMGFEDLC 468


>gi|358344801|ref|XP_003636475.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
 gi|355502410|gb|AES83613.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
          Length = 470

 Score =  516 bits (1330), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 275/416 (66%), Positives = 334/416 (80%), Gaps = 14/416 (3%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           SGY+HVWP+++FGW+IV+G+ IGF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+S
Sbjct: 64  SGYQHVWPEMEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAIS 123

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G A+S V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 124 KCMIMGAALSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVT 183

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           +LLIVLFL  ST+AF KG+ETW KETI K+EAA++            +E     EY +  
Sbjct: 184 ILLIVLFLGTSTKAFFKGLETWNKETIMKKEAARR------------QESNGSGEYKALP 231

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQ 301
           T P    E K++KVSIIEN+YWKEFGLL  VW   LALQIAK NYTTTCS  YW+LNLLQ
Sbjct: 232 TGPNVAIE-KENKVSIIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQ 290

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           IP+A GV+AYEA AL+ GRR IAS GD+G  +   QLV YC  G+ AG+VGGLLGLGGGF
Sbjct: 291 IPIAVGVTAYEATALFTGRRVIASTGDQGKAFTVGQLVIYCVFGVLAGIVGGLLGLGGGF 350

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           ++GPLFLELG+PPQVSSATA FA+TFSSSMSVVEYYLLKRFPVPYALY   ++ IAA VG
Sbjct: 351 VMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLSLVATIAALVG 410

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           QH++++LI + GRAS+IIF L+  IF+SA+SLGGVG++ M+ +I + EYMGF+NIC
Sbjct: 411 QHIVRRLIILFGRASLIIFILAGTIFISAISLGGVGISNMVHKIANHEYMGFENIC 466


>gi|115478679|ref|NP_001062933.1| Os09g0344800 [Oryza sativa Japonica Group]
 gi|50252864|dbj|BAD29095.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|113631166|dbj|BAF24847.1| Os09g0344800 [Oryza sativa Japonica Group]
 gi|215692830|dbj|BAG88213.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 481

 Score =  514 bits (1324), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 276/418 (66%), Positives = 331/418 (79%), Gaps = 14/418 (3%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y HVWP+++ GW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 71  NSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 130

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 131 KCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVLFPDWLVT 190

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+LFL  ST+AFLKGVETWKKETI KREAAK+LE        QT E   +PEY    
Sbjct: 191 VLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLE--------QTSE---EPEYAPLP 239

Query: 243 TTP---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 299
           T P    + + P     S+++NIYWKEFGLL  VW   L LQ+ KNYT TCS  YW+LNL
Sbjct: 240 TGPGAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYTATCSSWYWILNL 299

Query: 300 LQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           LQIPV+ GV+ YEA+ L  G+R ++SKG+E T  +  QL  YC  GITAG+VGGLLGLGG
Sbjct: 300 LQIPVSVGVTMYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGITAGVVGGLLGLGG 359

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           GFI+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYALYF  ++ IAA 
Sbjct: 360 GFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYALYFVIVAFIAAI 419

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +GQHV+++LI  LGRAS+IIF L+F+IFVSA+SLGGVG++ MI RI   EYMGF+N+C
Sbjct: 420 IGQHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHRINQHEYMGFENLC 477


>gi|222641386|gb|EEE69518.1| hypothetical protein OsJ_28979 [Oryza sativa Japonica Group]
          Length = 481

 Score =  514 bits (1324), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 276/418 (66%), Positives = 331/418 (79%), Gaps = 14/418 (3%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y HVWP+++ GW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 71  NSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 130

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 131 KCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVLFPDWLVT 190

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+LFL  ST+AFLKGVETWKKETI KREAAK+LE        QT E   +PEY    
Sbjct: 191 VLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLE--------QTSE---EPEYAPLP 239

Query: 243 TTP---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 299
           T P    + + P     S+++NIYWKEFGLL  VW   L LQ+ KNYT TCS  YW+LNL
Sbjct: 240 TGPGAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYTATCSSWYWILNL 299

Query: 300 LQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           LQIPV+ GV+ YEA+ L  G+R ++SKG+E T  +  QL  YC  GITAG+VGGLLGLGG
Sbjct: 300 LQIPVSVGVTMYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGITAGVVGGLLGLGG 359

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           GFI+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYALYF  ++ IAA 
Sbjct: 360 GFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYALYFVIVAFIAAI 419

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +GQHV+++LI  LGRAS+IIF L+F+IFVSA+SLGGVG++ MI RI   EYMGF+N+C
Sbjct: 420 IGQHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHRINQHEYMGFENLC 477


>gi|224143011|ref|XP_002324817.1| predicted protein [Populus trichocarpa]
 gi|222866251|gb|EEF03382.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 278/480 (57%), Positives = 357/480 (74%), Gaps = 14/480 (2%)

Query: 1   MSRVGSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA-- 58
           M+ +G++    R  A  F   ++ AS  +SA+ ++K   +  ++ + +EPN ++K+ +  
Sbjct: 1   MAFLGAKFLSLRPAAILFFTFML-ASAFISAERSIKHGVSRLSEAEELEPNYLVKIISFL 59

Query: 59  -PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKS 117
                SGY+HVWP++ F W+ V+GTIIGFFG A GSVGGVGGGGIFVPML LI+GFD KS
Sbjct: 60  WQSDESGYQHVWPELSFNWQAVIGTIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKS 119

Query: 118 SAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFA 177
           + A+SKCM+ G AVS V +NLR RHPTLD+PVIDYDL LL QP+L+LGISIGV FNVIF 
Sbjct: 120 ATAISKCMIMGAAVSTVYHNLRLRHPTLDLPVIDYDLVLLIQPMLMLGISIGVTFNVIFP 179

Query: 178 DWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPE 237
           DW+ITV LI+L +  S +AF KG++TWK ETI K  AAK+ E      G +  E K  P+
Sbjct: 180 DWVITVTLIILCIGTSVKAFFKGLDTWKIETILKEVAAKRFEST---GGSEEVEYKPLPD 236

Query: 238 YLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVL 297
              ND        PK SKV+I++NIYWKE GLLV VW   LA+QIAKNYT  CS  +WVL
Sbjct: 237 GQGND-------PPKASKVTILQNIYWKELGLLVFVWVSYLAVQIAKNYTAPCSTTFWVL 289

Query: 298 NLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGL 357
           NLLQIP++ GV  YEA+ LYKGRR+I+SKGDE   W+  +L+ +  CG+ AG+VGGLLG+
Sbjct: 290 NLLQIPISIGVFLYEAIGLYKGRRRISSKGDEIMDWQVHRLLMFSVCGVVAGIVGGLLGI 349

Query: 358 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 417
           GGGF++GPLFLE+GIPPQVSS TA F + FSSSMSVVEYYLL RFPVPYALYF A+++IA
Sbjct: 350 GGGFVMGPLFLEMGIPPQVSSGTATFGMLFSSSMSVVEYYLLDRFPVPYALYFIAVAVIA 409

Query: 418 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           AF+GQ+++ KLI I GRAS+IIF L+F IF+SA++LGGVG+ + I  +E  EYMGF+N+C
Sbjct: 410 AFIGQNIITKLITISGRASLIIFVLAFTIFISAIALGGVGITRTIGMVERHEYMGFENLC 469


>gi|326489334|dbj|BAK01650.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 275/418 (65%), Positives = 332/418 (79%), Gaps = 14/418 (3%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y+HVWP ++FGW++V+G++IGFFG + GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 61  NSYQHVWPPMEFGWKMVLGSLIGFFGASFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 120

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+TG AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW++T
Sbjct: 121 KCMITGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVIFPDWLVT 180

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLIVLFL  ST+AFLKGVETWKKETI KREAAK+LE        QT E   +PEY    
Sbjct: 181 VLLIVLFLGTSTKAFLKGVETWKKETIIKREAAKRLE--------QTSE---EPEYAPLP 229

Query: 243 TTP---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 299
           T P    + +       S+++NIYWKEFGLL  VW   LA+QI KNY  TCS  YWVLN 
Sbjct: 230 TGPGAVADVKPQSDEAPSLMKNIYWKEFGLLTFVWLAFLAIQITKNYAPTCSTWYWVLNF 289

Query: 300 LQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           LQIPV+ GV+ YEA+ L  G+R ++SKGDE T  +  QL  YC  G+TAG+VGGLLGLGG
Sbjct: 290 LQIPVSVGVTMYEALGLMNGKRVLSSKGDEQTTLKFHQLCIYCFFGVTAGLVGGLLGLGG 349

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           GFI+GPLFLELGIPPQVSSATA FA+ FS+SMSVVEYYLL RFPVPYA YF A++ +AA 
Sbjct: 350 GFIMGPLFLELGIPPQVSSATATFAMMFSASMSVVEYYLLNRFPVPYAAYFVAVAFVAAI 409

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +GQHV+++LI  LGRAS+IIF L+F+IFVSA+SLGGVG++ MI +IE  EYMGF+N+C
Sbjct: 410 IGQHVVRRLISWLGRASLIIFILAFMIFVSAISLGGVGISNMIHKIEQHEYMGFENLC 467


>gi|218201092|gb|EEC83519.1| hypothetical protein OsI_29108 [Oryza sativa Indica Group]
          Length = 466

 Score =  512 bits (1318), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 278/419 (66%), Positives = 328/419 (78%), Gaps = 14/419 (3%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
            + Y HVWP ++FGW+I++G +IGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+
Sbjct: 55  EASYHHVWPPMEFGWKIILGILIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAI 114

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW+I
Sbjct: 115 SKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLI 174

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           TVLLI+LFL  ST+AFLKGVETWKKETI KREAAK+LE I       TEE    PEY   
Sbjct: 175 TVLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLEQI-------TEE----PEYSPL 223

Query: 242 DTTPE---ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLN 298
            T P    E++ P     S+ +N+YWKEFGLL  VW   L LQ+ KNY  TCS  YWVLN
Sbjct: 224 PTGPNATAESKAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLN 283

Query: 299 LLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLG 358
            LQIPV+ GV+ YE + L +GRR I+SKGDE T  +  QL+ YC  G+ AG+VGGLLGLG
Sbjct: 284 FLQIPVSVGVTMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLG 343

Query: 359 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 418
           GGFI+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYAL+F  ++  AA
Sbjct: 344 GGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTTVAFFAA 403

Query: 419 FVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            +GQH+++KLI  LGRAS+IIF LSF+IF+SA+SLGGVG++ MI +I   EYMGFDNIC
Sbjct: 404 IIGQHIVRKLINWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNIC 462


>gi|218201973|gb|EEC84400.1| hypothetical protein OsI_30977 [Oryza sativa Indica Group]
          Length = 465

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 275/418 (65%), Positives = 331/418 (79%), Gaps = 14/418 (3%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y HVWP+++ GW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 55  NSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 114

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 115 KCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVLFPDWLVT 174

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+LFL  ST+AFLKGVETWKKETI KREAAK+LE        QT E   +PEY    
Sbjct: 175 VLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLE--------QTSE---EPEYAPLP 223

Query: 243 TTP---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 299
           T P    + + P     S+++NIYWKEFGLL  VW   L LQ+ KNYT TCS  YW+LNL
Sbjct: 224 TGPGAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYTATCSSWYWILNL 283

Query: 300 LQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           LQIPV+ GV+ YEA+ L  G+R ++SKG+E T  +  QL  YC  GITAG+VGGLLGLGG
Sbjct: 284 LQIPVSVGVTLYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGITAGVVGGLLGLGG 343

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           GFI+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYALYF  ++ IAA 
Sbjct: 344 GFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYALYFVIVAFIAAI 403

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +GQHV+++LI  LGRAS+IIF L+F+IFVSA+SLGGVG++ MI +I   EYMGF+N+C
Sbjct: 404 IGQHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHKINQHEYMGFENLC 461


>gi|357158060|ref|XP_003578003.1| PREDICTED: uncharacterized protein LOC100827421 [Brachypodium
           distachyon]
          Length = 469

 Score =  510 bits (1314), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 273/418 (65%), Positives = 334/418 (79%), Gaps = 14/418 (3%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y HVWP ++FGW++V+GT+IGFFG + GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 59  NSYHHVWPPMEFGWKVVLGTLIGFFGASFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 118

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+TG AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW++T
Sbjct: 119 KCMITGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVIFPDWLVT 178

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+LFL  ST+AFLKG+ETWKKETI KREAAK+LE        QT E   +PEY    
Sbjct: 179 VLLIILFLGTSTKAFLKGLETWKKETILKREAAKRLE--------QTSE---EPEYAPLP 227

Query: 243 TTP---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 299
           T P    + + P     S+++NIYW+EFGLL  VW   LA+QI KNYT+TCS  YW+LN 
Sbjct: 228 TGPGAAADVKIPSDEAPSLMKNIYWREFGLLTLVWLSFLAIQITKNYTSTCSTWYWILNF 287

Query: 300 LQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           LQIPV+ GV+ YEA+ L  G+R ++SKG E T  +  QL  YC  G+TAG+VGGLLGLGG
Sbjct: 288 LQIPVSVGVTLYEALGLMNGKRVLSSKGGEQTTLKFHQLCIYCFFGVTAGLVGGLLGLGG 347

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           GFI+GPLFLELGIPPQVSSATA FA+ FS+SMSVVEYYLL RFPVPYA+YF A++ IAA 
Sbjct: 348 GFIMGPLFLELGIPPQVSSATATFAMMFSASMSVVEYYLLNRFPVPYAVYFVAVAFIAAI 407

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +GQH ++KLI  LGRAS+IIF L+F+IFVSA+SLGGVG++ MI++I+  EYMGF+N+C
Sbjct: 408 IGQHAVRKLISWLGRASLIIFILAFMIFVSAISLGGVGVSNMIQKIQRHEYMGFENLC 465


>gi|449437030|ref|XP_004136295.1| PREDICTED: uncharacterized protein LOC101214783 [Cucumis sativus]
          Length = 471

 Score =  510 bits (1314), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 280/475 (58%), Positives = 353/475 (74%), Gaps = 15/475 (3%)

Query: 5   GSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSG 64
           G  C  +R +A       VV ++ VS +  +K E +S++ N        +       ++G
Sbjct: 6   GGICAGFRSMALTMFNFAVVLAL-VSGERKMKLEGSSEDLNLF---QGALNFLWQSDKTG 61

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           Y HVWP++ FGW+IVVG++IGF G A GSVGGVGGGGI+VPML+LI+GFD KSS A+SKC
Sbjct: 62  YHHVWPEMAFGWQIVVGSVIGFMGAAFGSVGGVGGGGIYVPMLSLIIGFDPKSSTAISKC 121

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M+ G A S V YNL+ RHPTL++P+IDYDL LL  P+L+LGIS+GV FNVIFADWM+TVL
Sbjct: 122 MIMGAAASTVYYNLKLRHPTLELPIIDYDLVLLIMPMLMLGISVGVVFNVIFADWMVTVL 181

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT 244
           LIVLFL  ST++FLKGVETWKKETI K EAA++ E         + + + D  Y    + 
Sbjct: 182 LIVLFLATSTKSFLKGVETWKKETIMKTEAARRAE---------SNDIQADTAYAPLPSG 232

Query: 245 PEETREPKKS--KVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
           P    E   +  +V I+EN+YWKE GLL+ VW   LA+QI K ++  CS  YW+LN LQ+
Sbjct: 233 PSHRPETNNTDREVPILENVYWKEMGLLLLVWFAFLAIQITKKHSPNCSWEYWLLNFLQV 292

Query: 303 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 362
           P+A GVS YEAV LYKG+RKIAS G++ T +R  QLV YC  GI AG+VGGLLGLGGGFI
Sbjct: 293 PIAFGVSGYEAVGLYKGKRKIASLGNQKTDFRIYQLVLYCFMGIIAGVVGGLLGLGGGFI 352

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           +GPLFLELGIPPQVSSA+A F +TFSSSMSV++YYLL RFPVPYALYF  ++ +AAFVGQ
Sbjct: 353 MGPLFLELGIPPQVSSASATFGMTFSSSMSVIQYYLLNRFPVPYALYFTIVAAVAAFVGQ 412

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           HV++KLI ++GRAS+IIF LSF IFVSALSLGGVG++KMI +I+  EYMGF+N+C
Sbjct: 413 HVIRKLILLIGRASLIIFILSFTIFVSALSLGGVGISKMIGQIQRHEYMGFENLC 467


>gi|115476246|ref|NP_001061719.1| Os08g0389700 [Oryza sativa Japonica Group]
 gi|40253412|dbj|BAD05341.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|40253788|dbj|BAD05726.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|113623688|dbj|BAF23633.1| Os08g0389700 [Oryza sativa Japonica Group]
          Length = 465

 Score =  510 bits (1314), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 278/419 (66%), Positives = 327/419 (78%), Gaps = 14/419 (3%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
            + Y HVWP ++FGW+IV+G +IGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+
Sbjct: 54  EASYHHVWPPMEFGWKIVLGILIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAI 113

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW+I
Sbjct: 114 SKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLI 173

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           TVLLI+LFL  ST+AFLKGVETWKKETI KREAAK+LE I        EE    PEY   
Sbjct: 174 TVLLIILFLGTSTKAFLKGVETWKKETILKREAAKRLEQI-------AEE----PEYSPL 222

Query: 242 DTTPE---ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLN 298
            T P    E++ P     S+ +N+YWKEFGLL  VW   L LQ+ KNY  TCS  YWVLN
Sbjct: 223 PTGPNATAESKAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLN 282

Query: 299 LLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLG 358
            LQIPV+ GV+ YE + L +GRR I+SKGDE T  +  QL+ YC  G+ AG+VGGLLGLG
Sbjct: 283 FLQIPVSVGVTMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLG 342

Query: 359 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 418
           GGFI+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYAL+F  ++  AA
Sbjct: 343 GGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTVVAFFAA 402

Query: 419 FVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            +GQH+++KLI  LGRAS+IIF LSF+IF+SA+SLGGVG++ MI +I   EYMGFDNIC
Sbjct: 403 IIGQHIVRKLINWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNIC 461


>gi|449523247|ref|XP_004168635.1| PREDICTED: uncharacterized LOC101214783 [Cucumis sativus]
          Length = 471

 Score =  509 bits (1312), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 280/475 (58%), Positives = 353/475 (74%), Gaps = 15/475 (3%)

Query: 5   GSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSG 64
           G  C  +R +A       VV ++ VS +  +K E +S++ N        +       ++G
Sbjct: 6   GGICAGFRSMALTMFNFAVVLAL-VSGERKMKLEGSSEDLNLF---QGALNFLWQSDKTG 61

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           Y HVWP++ FGW+IVVG++IGF G A GSVGGVGGGGI+VPML+LI+GFD KSS A+SKC
Sbjct: 62  YHHVWPEMAFGWQIVVGSVIGFMGAAFGSVGGVGGGGIYVPMLSLIIGFDPKSSTAISKC 121

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M+ G A S V YNL+ RHPTL++P+IDYDL LL  P+L+LGIS+GV FNVIFADWM+TVL
Sbjct: 122 MIMGAAASTVYYNLKLRHPTLELPIIDYDLVLLIMPMLMLGISVGVVFNVIFADWMVTVL 181

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT 244
           LIVLFL  ST++FLKGVETWKKETI K EAA++ E         + + + D  Y    + 
Sbjct: 182 LIVLFLATSTKSFLKGVETWKKETIMKTEAARRAE---------SNDIQADTAYAPLPSG 232

Query: 245 PEETREPKKS--KVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
           P    E   +  +V I+EN+YWKE GLL+ VW   LA+QI K ++  CS  YW+LN LQ+
Sbjct: 233 PSHRPETNYTDREVPILENVYWKEMGLLLLVWFAFLAIQITKKHSPNCSWEYWLLNFLQV 292

Query: 303 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 362
           P+A GVS YEAV LYKG+RKIAS G++ T +R  QLV YC  GI AG+VGGLLGLGGGFI
Sbjct: 293 PIAFGVSGYEAVGLYKGKRKIASLGNQKTDFRIYQLVLYCFMGIIAGVVGGLLGLGGGFI 352

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           +GPLFLELGIPPQVSSA+A F +TFSSSMSV++YYLL RFPVPYALYF  ++ +AAFVGQ
Sbjct: 353 MGPLFLELGIPPQVSSASATFGMTFSSSMSVIQYYLLNRFPVPYALYFTIVAAVAAFVGQ 412

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           HV++KLI ++GRAS+IIF LSF IFVSALSLGGVG++KMI +I+  EYMGF+N+C
Sbjct: 413 HVIRKLILLIGRASLIIFILSFTIFVSALSLGGVGISKMIGQIQRHEYMGFENLC 467


>gi|358344803|ref|XP_003636476.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
 gi|355502411|gb|AES83614.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
          Length = 480

 Score =  509 bits (1310), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 275/426 (64%), Positives = 334/426 (78%), Gaps = 24/426 (5%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           SGY+HVWP+++FGW+IV+G+ IGF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+S
Sbjct: 64  SGYQHVWPEMEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAIS 123

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G A+S V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 124 KCMIMGAALSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVT 183

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           +LLIVLFL  ST+AF KG+ETW KETI K+EAA++            +E     EY +  
Sbjct: 184 ILLIVLFLGTSTKAFFKGLETWNKETIMKKEAARR------------QESNGSGEYKALP 231

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQ 301
           T P    E K++KVSIIEN+YWKEFGLL  VW   LALQIAK NYTTTCS  YW+LNLLQ
Sbjct: 232 TGPNVAIE-KENKVSIIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQ 290

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           IP+A GV+AYEA AL+ GRR IAS GD+G  +   QLV YC  G+ AG+VGGLLGLGGGF
Sbjct: 291 IPIAVGVTAYEATALFTGRRVIASTGDQGKAFTVGQLVIYCVFGVLAGIVGGLLGLGGGF 350

Query: 362 ILGPLFLELGIPPQ----------VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFF 411
           ++GPLFLELG+PPQ          VSSATA FA+TFSSSMSVVEYYLLKRFPVPYALY  
Sbjct: 351 VMGPLFLELGVPPQVSFKLQYINHVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLS 410

Query: 412 ALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYM 471
            ++ IAA VGQH++++LI + GRAS+IIF L+  IF+SA+SLGGVG++ M+ +I + EYM
Sbjct: 411 LVATIAALVGQHIVRRLIILFGRASLIIFILAGTIFISAISLGGVGISNMVHKIANHEYM 470

Query: 472 GFDNIC 477
           GF+NIC
Sbjct: 471 GFENIC 476


>gi|356528375|ref|XP_003532779.1| PREDICTED: uncharacterized protein LOC100801788 [Glycine max]
          Length = 469

 Score =  508 bits (1309), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 285/477 (59%), Positives = 359/477 (75%), Gaps = 28/477 (5%)

Query: 10  RWRL---LAAAFIGLLVVASVS-VSAQSNLKPEATSKNKN----DHVEPNIVIKVSAPKS 61
           RW +   +  A I  L++ SVS  S+  N   + TS +      DH             +
Sbjct: 8   RWDVKWVVGKALIAFLLLVSVSATSSHQNTDNKTTSPSNGTIGVDH------------HA 55

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
           ++ YKH WP +KFGWRI+VG ++GF G+A G+VGGVGGGGIFVPML LIVGFD KS+ A+
Sbjct: 56  KAFYKHHWPSMKFGWRIIVGAVVGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDQKSATAI 115

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+TGGA + V YNLRQRHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNVIF +WM+
Sbjct: 116 SKCMITGGATATVFYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVIFPEWML 175

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL-S 240
           TVLLI+ F+ +S ++F KGV+TWKKETI K+EA K   +  +G+       + D  Y+ +
Sbjct: 176 TVLLIIFFVGISVKSFFKGVDTWKKETIMKKEAKKNSRIDDIGSP------EDDAHYIQT 229

Query: 241 NDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 300
            D T ++T + +K KVSIIENI WKE GLL A W +ILAL+I K +TTTCS LYW+ NLL
Sbjct: 230 GDPTKDDTNQSRK-KVSIIENIRWKELGLLFAGWIMILALEIGKKHTTTCSRLYWLSNLL 288

Query: 301 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 360
           Q+P+A G+S+YEAV LYKG+R IASKGD+ T W   QLV +CACG  AGM+ GLLGLGGG
Sbjct: 289 QVPIAVGMSSYEAVRLYKGKRIIASKGDKQTHWCVLQLVLFCACGTLAGMIAGLLGLGGG 348

Query: 361 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 420
           FILGPLFL LGIPPQV+SAT+   + FS+SM+VVEYYLLKRFPVPYALYF A++  AA V
Sbjct: 349 FILGPLFLGLGIPPQVASATSTLVMAFSASMAVVEYYLLKRFPVPYALYFVAIATAAALV 408

Query: 421 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           GQH+++K I ILGRAS+IIF L+  + VSA+ LGGVG+A MI++IE+KEYMGF ++C
Sbjct: 409 GQHLVRKAIAILGRASVIIFILTLTLSVSAVLLGGVGIAHMIQKIENKEYMGFGDLC 465


>gi|356511003|ref|XP_003524221.1| PREDICTED: uncharacterized protein LOC100818240 [Glycine max]
          Length = 473

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 268/440 (60%), Positives = 345/440 (78%), Gaps = 3/440 (0%)

Query: 40  TSKNKNDHVEPNIVIKV-SAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVG 98
           T   KN   E N +  V +   ++  Y+H WP +K+GW+I+VGTI+GF G+A G+VGGVG
Sbjct: 29  TVSAKNTTSESNEITGVGTGYHAKVFYQHTWPSMKYGWKIIVGTIVGFLGSAFGNVGGVG 88

Query: 99  GGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLF 158
           GGGIFVPML LI+GFDAKS+ A+SKCM+TGGA + V YNLRQRHPTLD+PVIDYDLALLF
Sbjct: 89  GGGIFVPMLTLIIGFDAKSAIAISKCMITGGATATVFYNLRQRHPTLDLPVIDYDLALLF 148

Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
           QP+L+LGISIGV+FNVIF DWM+T LLI+ F  +S ++F KGV+TWK+ET+  +EA+K  
Sbjct: 149 QPMLMLGISIGVSFNVIFPDWMLTTLLIISFTGISVKSFFKGVDTWKQETLMMKEASKNS 208

Query: 219 ELIVLGNGYQTEECKCDPEYLSN-DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAII 277
                G        + D  Y+   D   + T + +K KVS+I+N++WKE GLL AVW +I
Sbjct: 209 LTDGRGGSTYIGNPEDDAHYIKTIDPVKDNTNQSRK-KVSVIDNVHWKELGLLFAVWIMI 267

Query: 278 LALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQ 337
           LAL+I KNYTTTCS +YWV+NLLQ+P+A G+S+Y+A+ LYKG+R IASKGD+ T WR  Q
Sbjct: 268 LALEIGKNYTTTCSGVYWVINLLQVPIAVGMSSYQAMRLYKGQRIIASKGDQQTHWRVLQ 327

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           L+ +CACGI AG + GLLGLGGGFIL PLFL LGIPPQV+SAT++ A+ FS+S++VVEYY
Sbjct: 328 LIVFCACGILAGTIAGLLGLGGGFILAPLFLGLGIPPQVASATSILAMAFSASIAVVEYY 387

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVG 457
           LLKRFP+ YALYF A++  AA VGQH+++K+I +LGRAS+IIF L+  + VSA+ LGGVG
Sbjct: 388 LLKRFPISYALYFVAVATAAALVGQHLVRKVIAMLGRASVIIFILTLTLCVSAVLLGGVG 447

Query: 458 LAKMIKRIEHKEYMGFDNIC 477
           +A MIKRIE+KEYMGF N+C
Sbjct: 448 VANMIKRIENKEYMGFGNLC 467


>gi|356511005|ref|XP_003524222.1| PREDICTED: uncharacterized protein LOC100818778 [Glycine max]
          Length = 474

 Score =  507 bits (1305), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 279/466 (59%), Positives = 353/466 (75%), Gaps = 18/466 (3%)

Query: 17  AFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGW 76
           A I  L++ SVS ++        T+   N  +  +         +++ YKH WP +KFGW
Sbjct: 18  ALIAFLLLVSVSATSSHQNTDNKTTSPSNGTIGVDY-------HAKAFYKHHWPSMKFGW 70

Query: 77  RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVY 136
           RI+VG I+GF G+A G+VGGVGGGGIFVPML LIVGFD KS+ A+SKCM+TGGA + V Y
Sbjct: 71  RIIVGAIVGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDQKSATAISKCMITGGATATVFY 130

Query: 137 NLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRA 196
           NLRQRHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNVIF +WM+TVLLI+ F+ +S ++
Sbjct: 131 NLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVIFPEWMLTVLLIIFFVGISVKS 190

Query: 197 FLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL-SNDTTPEETREPKKSK 255
           F KGV+TWKKETI K+EA K   +  +G+       + D  Y+ + D   ++T + +K K
Sbjct: 191 FFKGVDTWKKETIMKKEAKKNSRIDDIGSP------EDDAHYIQTGDPAKDDTNQSRKKK 244

Query: 256 ----VSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAY 311
               VSIIENI WKE GLL A W +ILAL+I K +TTTCS L+W+LNLLQ+P+A G+S+Y
Sbjct: 245 TVMQVSIIENIRWKELGLLFAGWIMILALEIGKKHTTTCSRLFWLLNLLQVPIAVGMSSY 304

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           EAV LYKG+R IASKGD+ T W   QLV +CACG  AGM+ GLLGLGGGFILGPLFL LG
Sbjct: 305 EAVRLYKGKRIIASKGDQQTHWCVLQLVLFCACGTLAGMIAGLLGLGGGFILGPLFLGLG 364

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           IPPQV+SAT+   + FS+SM+VVEYYLLKRFPVPYALYF A++  AA VGQH+++K I I
Sbjct: 365 IPPQVASATSTLVMAFSASMAVVEYYLLKRFPVPYALYFVAIATAAALVGQHLVRKAIAI 424

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           LGRAS+IIF L+  + VSA+ LGGVG+A MI++IE+KEYMGF ++C
Sbjct: 425 LGRASVIIFILTLTLSVSAVLLGGVGIAHMIQKIENKEYMGFGDLC 470


>gi|356528373|ref|XP_003532778.1| PREDICTED: uncharacterized protein LOC100801244 [Glycine max]
          Length = 466

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 273/472 (57%), Positives = 355/472 (75%), Gaps = 24/472 (5%)

Query: 10  RWRLLAAA---FIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKV-SAPKSRSGY 65
           RW +   A   FI LL++ +VS               KN  +E N    V +   ++  Y
Sbjct: 9   RWDVKWVAGKVFIALLLLVTVSA--------------KNTTLESNETTGVGTGYYAKVFY 54

Query: 66  KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
           KH WP +++GW+I+VGT++GF G+A G+VGGVGGGGIFVPML LI+GFDAKS+ A+SKCM
Sbjct: 55  KHTWPSMEYGWKIIVGTMVGFLGSAFGNVGGVGGGGIFVPMLTLIIGFDAKSAIAISKCM 114

Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLL 185
           +TGGA + V YNLRQRHPTLD+PVIDYDLALLFQP+L+LGISIGV+FNVIF DWM+T LL
Sbjct: 115 ITGGATATVFYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVSFNVIFPDWMLTTLL 174

Query: 186 IVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTP 245
           I+ F  +S ++F KGV+TWK+ET+  +EA K  ++  +G+    E+      Y+      
Sbjct: 175 IIFFTGISVKSFFKGVDTWKQETLIVKEARKNSQIDDIGS---PEDA---AHYIQTGDPV 228

Query: 246 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVA 305
           ++     + KVS+IENI+W E GLL AVW +ILAL+I KNYTTTCS +YWV+NLLQ+P+A
Sbjct: 229 KDNTNQSRKKVSVIENIHWNELGLLFAVWIMILALEIGKNYTTTCSGVYWVINLLQVPIA 288

Query: 306 GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 365
            G+S+Y+A+ LYKG+R I SKGD+ T WR  QL+ +CACGI AG + GLLGLGGGFIL P
Sbjct: 289 VGMSSYQAMRLYKGQRIIGSKGDQQTNWRVLQLILFCACGILAGTIAGLLGLGGGFILAP 348

Query: 366 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 425
           LFL +GIPPQV+SAT++ A+ FS+SM+VVEYYLLKRFP+ YALYF A++  AA VGQH++
Sbjct: 349 LFLGIGIPPQVASATSILAMAFSASMAVVEYYLLKRFPISYALYFVAVATAAALVGQHLV 408

Query: 426 KKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +K+I ILGRAS+IIF L+  + VSA+ LGGVG+A MIKRIE+KEYMGF N+C
Sbjct: 409 RKVIAILGRASVIIFILTLTLCVSAVLLGGVGVANMIKRIENKEYMGFGNLC 460


>gi|357519217|ref|XP_003629897.1| hypothetical protein MTR_8g088080 [Medicago truncatula]
 gi|355523919|gb|AET04373.1| hypothetical protein MTR_8g088080 [Medicago truncatula]
          Length = 479

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 276/414 (66%), Positives = 341/414 (82%), Gaps = 6/414 (1%)

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           YKH WP +KFGWRI+VGTIIGF G++ G+VGGVGGGGIFVPML LIVGFDAKS+ A+SK 
Sbjct: 65  YKHTWPGMKFGWRIIVGTIIGFLGSSFGTVGGVGGGGIFVPMLTLIVGFDAKSATAISKF 124

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M+TGGA + V+YNL++RHPTLD+PVIDYDLALLFQP+L+LGIS+GVAFN+IF DWM+T L
Sbjct: 125 MITGGAGATVLYNLKKRHPTLDLPVIDYDLALLFQPMLMLGISLGVAFNLIFPDWMLTTL 184

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEY-LSNDT 243
           LI+ F  +S  AFLKGV TWKKET+TK+EA    +L    N  +TE+   D +   S + 
Sbjct: 185 LIIFFTGISVNAFLKGVNTWKKETLTKKEAKDNSQL----NDIRTEDATLDLQIGESVNE 240

Query: 244 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 303
               T  P+K KVS+IEN+YWKE GLL +VW +ILALQI KNYTTTCS++YW+LNLLQ+P
Sbjct: 241 NQTNTNLPRK-KVSVIENVYWKELGLLFSVWIMILALQIGKNYTTTCSLIYWILNLLQVP 299

Query: 304 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 363
           +A GVS YEA+ LYKG+R IAS GD+G  W   QL+ YC+CGI AG++GGLLGLGGGFIL
Sbjct: 300 IAIGVSGYEAILLYKGKRVIASNGDQGINWSVKQLILYCSCGIIAGIIGGLLGLGGGFIL 359

Query: 364 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 423
           GPLF+ LGIPPQV+SAT+ FA+TFS+SMSVVEYYLLKRFP+PYALYF  ++ +AA VGQH
Sbjct: 360 GPLFIGLGIPPQVASATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQH 419

Query: 424 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +++KLI +LGRASIIIF L+  +FVS +SLGG G+A +IKR+E+KEYMGF N+C
Sbjct: 420 LVRKLIALLGRASIIIFILALTVFVSGISLGGTGIANLIKRLENKEYMGFGNLC 473


>gi|217075809|gb|ACJ86264.1| unknown [Medicago truncatula]
          Length = 481

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 276/414 (66%), Positives = 341/414 (82%), Gaps = 6/414 (1%)

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           YKH WP +KFGWRI+VGTIIGF G++ G+VGGVGGGGIFVPML LIVGFDAKS+ A+SK 
Sbjct: 67  YKHTWPGMKFGWRIIVGTIIGFLGSSFGTVGGVGGGGIFVPMLTLIVGFDAKSATAISKF 126

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M+TGGA + V+YNL++RHPTLD+PVIDYDLALLFQP+L+LGIS+GVAFN+IF DWM+T L
Sbjct: 127 MITGGAGATVLYNLKKRHPTLDLPVIDYDLALLFQPMLMLGISLGVAFNLIFPDWMLTTL 186

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEY-LSNDT 243
           LI+ F  +S  AFLKGV TWKKET+TK+EA    +L    N  +TE+   D +   S + 
Sbjct: 187 LIIFFTGISVNAFLKGVNTWKKETLTKKEAKDNSQL----NDIRTEDATLDLQIGESVNE 242

Query: 244 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 303
               T  P+K KVS+IEN+YWKE GLL +VW +ILALQI KNYTTTCS++YW+LNLLQ+P
Sbjct: 243 NQTNTNLPRK-KVSVIENVYWKELGLLFSVWIMILALQIGKNYTTTCSLIYWILNLLQVP 301

Query: 304 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 363
           +A GVS YEA+ LYKG+R IAS GD+G  W   QL+ YC+CGI AG++GGLLGLGGGFIL
Sbjct: 302 IAIGVSGYEAILLYKGKRVIASNGDQGINWSVKQLILYCSCGIIAGIIGGLLGLGGGFIL 361

Query: 364 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 423
           GPLF+ LGIPPQV+SAT+ FA+TFS+SMSVVEYYLLKRFP+PYALYF  ++ +AA VGQH
Sbjct: 362 GPLFIGLGIPPQVASATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQH 421

Query: 424 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +++KLI +LGRASIIIF L+  +FVS +SLGG G+A +IKR+E+KEYMGF N+C
Sbjct: 422 LVRKLIALLGRASIIIFILALTVFVSGISLGGTGIANLIKRLENKEYMGFGNLC 475


>gi|125603285|gb|EAZ42610.1| hypothetical protein OsJ_27176 [Oryza sativa Japonica Group]
          Length = 442

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 273/409 (66%), Positives = 321/409 (78%), Gaps = 14/409 (3%)

Query: 72  IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
           ++FGW+IV+G +IGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+SKCM+ G AV
Sbjct: 41  MEFGWKIVLGKLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAISKCMIMGAAV 100

Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
           S V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW+ITVLLI+LFL 
Sbjct: 101 STVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLITVLLIILFLG 160

Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPE---ET 248
            ST+AFLKGVETWKKETI KREAAK+LE I        EE    PEY    T P    E+
Sbjct: 161 TSTKAFLKGVETWKKETILKREAAKRLEQI-------AEE----PEYSPLPTGPNATAES 209

Query: 249 REPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGV 308
           + P     S+ +N+YWKEFGLL  VW   L LQ+ KNY  TCS  YWVLN LQIPV+ GV
Sbjct: 210 KAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLNFLQIPVSVGV 269

Query: 309 SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
           + YE + L +GRR I+SKGDE T  +  QL+ YC  G+ AG+VGGLLGLGGGFI+GPLFL
Sbjct: 270 TMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLGGGFIMGPLFL 329

Query: 369 ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           ELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYAL+F  ++  AA +GQH+++KL
Sbjct: 330 ELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTVVAFFAAIIGQHIVRKL 389

Query: 429 IKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           I  LGRAS+IIF LSF+IF+SA+SLGGVG++ MI +I   EYMGFDNIC
Sbjct: 390 INWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNIC 438


>gi|356518101|ref|XP_003527721.1| PREDICTED: uncharacterized protein LOC100789748 [Glycine max]
 gi|83853827|gb|ABC47859.1| membrane protein-like protein [Glycine max]
          Length = 491

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 264/417 (63%), Positives = 327/417 (78%), Gaps = 16/417 (3%)

Query: 66  KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
             VWP++KFGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML L++GFDAKSS A+SKCM
Sbjct: 71  DRVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCM 130

Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLL 185
           + G AVS V YNLR RHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLL
Sbjct: 131 IMGAAVSTVYYNLRFRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLL 190

Query: 186 IVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND--- 242
           I+LF+  ST+A  KG++TWKKETI K+EAAK L            E    P Y+S +   
Sbjct: 191 IILFIATSTKALFKGIDTWKKETIMKKEAAKML------------ESDSSPGYVSEEDYK 238

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
           + P  + +P+  +V +++NIYWKE  +LV VW   L +QI K YT TCS+LYWVLN LQ+
Sbjct: 239 SLPAGSADPRDEEVPLLKNIYWKELLVLVYVWVAFLIVQIIKTYTKTCSILYWVLNSLQV 298

Query: 303 PVAGGVSAYEAVALYKGRRKIASKGDEGTKW-RASQLVFYCACGITAGMVGGLLGLGGGF 361
           P+A  V+ YEA+ L  G R I+SKG E T W +  ++  YC+CGI AG+V GLLGLGGGF
Sbjct: 299 PIAISVTLYEAICLCNGTRVISSKGKENTDWMKLHKICLYCSCGIIAGIVSGLLGLGGGF 358

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           ILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YYLL+RFPVPYA YF  ++ IAA  G
Sbjct: 359 ILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLERFPVPYASYFILVATIAALTG 418

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 478
           QHV++K+I I GRASIIIF L+F IF+SA+SLGGVG+  M++++E+ EYMGF NIC+
Sbjct: 419 QHVVRKIIAIFGRASIIIFVLAFTIFLSAISLGGVGIENMVEKMENNEYMGFANICH 475


>gi|356509743|ref|XP_003523605.1| PREDICTED: uncharacterized protein LOC100816633 [Glycine max]
          Length = 474

 Score =  493 bits (1269), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 273/446 (61%), Positives = 338/446 (75%), Gaps = 11/446 (2%)

Query: 34  NLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGS 93
           + KPE+    +   V  N +I             VWP++KFGWRIVVG+I+GFFG A GS
Sbjct: 39  DQKPESFVVKERQGVM-NSIIDFFWNDGEPTNDRVWPEMKFGWRIVVGSIVGFFGAALGS 97

Query: 94  VGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYD 153
           VGGVGGGGIF+PML L++GFDAKSS A+SKCM+ G AVS V YNLR RHPTLD+PVIDYD
Sbjct: 98  VGGVGGGGIFIPMLTLVIGFDAKSSTALSKCMIMGAAVSTVYYNLRFRHPTLDLPVIDYD 157

Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
           LALLFQP+L+LGISIGVAFNV+FADWM+TVLLI+LF+  ST+A  KG++TWKKETI K+E
Sbjct: 158 LALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIATSTKALFKGIDTWKKETIMKKE 217

Query: 214 AAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAV 273
           AAK LE      GY +EE     +Y S    P  + +P+  +V +++NIYWKE  +L  V
Sbjct: 218 AAKMLESDS-SPGYVSEE-----DYKS---LPAGSADPRDEEVPLLKNIYWKELLVLAYV 268

Query: 274 WAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKW 333
           W   L +QI K YT TCS+LYWVLN LQ+P+A  V+ YEA+ L  G R IASKG E T W
Sbjct: 269 WVAFLIVQIIKTYTKTCSILYWVLNSLQVPIAISVTLYEAICLCNGTRVIASKGKENTDW 328

Query: 334 -RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMS 392
            +  ++  YC+CGI AG+V GLLGLGGGFILGPLFLELGIPPQV+SAT+ FA+ FSSSMS
Sbjct: 329 MKLHKICLYCSCGIIAGIVSGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMS 388

Query: 393 VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALS 452
           VV+YYLL+RFPVPYA YF  ++ IAA  GQHV++K+I I GRASIIIF L+F IF+SA+S
Sbjct: 389 VVQYYLLERFPVPYASYFILVATIAALTGQHVVRKIIAIFGRASIIIFVLAFTIFLSAIS 448

Query: 453 LGGVGLAKMIKRIEHKEYMGFDNICY 478
           LGGVG+  M++++E+ EYMGF NIC+
Sbjct: 449 LGGVGIENMVEKMENNEYMGFANICH 474


>gi|224054356|ref|XP_002298219.1| predicted protein [Populus trichocarpa]
 gi|222845477|gb|EEE83024.1| predicted protein [Populus trichocarpa]
          Length = 421

 Score =  488 bits (1255), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 279/428 (65%), Positives = 336/428 (78%), Gaps = 19/428 (4%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
           +SGYKHVWPD+KFGW+ VVGTII F G ACGSVGGVGGGGIFVPML LIVGFD+KSSAA+
Sbjct: 1   QSGYKHVWPDMKFGWKTVVGTIITFLGAACGSVGGVGGGGIFVPMLTLIVGFDSKSSAAI 60

Query: 122 SKC-----------MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGV 170
           SKC           M+TG A + V YNLR RHPTL++P+IDYDLALLFQP+L+LGISIGV
Sbjct: 61  SKCNCCRLCLLLSGMITGAAATTVFYNLRLRHPTLELPIIDYDLALLFQPMLILGISIGV 120

Query: 171 AFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTE 230
             NV+FADWMI +LLI+ F+  STR+F KGVETWKKET +K+ +      +   N  +  
Sbjct: 121 TLNVLFADWMIIILLIIFFIATSTRSFFKGVETWKKETKSKQVS----HFLYPKNLIEAA 176

Query: 231 ECKCDPEYLSNDTTPEETR-EPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTT 289
           E K  P      T+ ++ + EP K KVSIIENI WK+ G+LV VW IILALQI KNYTTT
Sbjct: 177 EEKPLP---GGTTSHDQIKPEPNKEKVSIIENIKWKDLGILVIVWLIILALQIGKNYTTT 233

Query: 290 CSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAG 349
           CS +YW+LN+LQIPVA GV++YEAV+LYKGRRKI S G+    W   +L+ YC  G+ AG
Sbjct: 234 CSAIYWLLNILQIPVAAGVTSYEAVSLYKGRRKITSMGEASNNWPVHELILYCFIGVMAG 293

Query: 350 MVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALY 409
            VGG+LGLGGGFILGPLFLE+GIPPQVSSATA FA+ FS+SMSVVE+YLLKRFPVPYALY
Sbjct: 294 TVGGMLGLGGGFILGPLFLEMGIPPQVSSATATFAMMFSASMSVVEFYLLKRFPVPYALY 353

Query: 410 FFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 469
           FF ++ IAA VGQHV++KLI ILGRAS+IIF L+  IFVSA+ +GG G+A MI++ E KE
Sbjct: 354 FFTVATIAAVVGQHVVRKLISILGRASLIIFILASTIFVSAILVGGAGMASMIEKFERKE 413

Query: 470 YMGFDNIC 477
           YMGF+++C
Sbjct: 414 YMGFESMC 421


>gi|326510405|dbj|BAJ87419.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 474

 Score =  487 bits (1253), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 262/416 (62%), Positives = 318/416 (76%), Gaps = 1/416 (0%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y HVWP +K GW+IV+G+ IGFFG A GS+GGVGGGG+FVPML LI+GFD KSSAA+S
Sbjct: 55  TSYHHVWPPLKLGWKIVLGSFIGFFGAAFGSIGGVGGGGVFVPMLTLIIGFDPKSSAAMS 114

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G AVS V  N++ +HPTLD+PVIDYDLALL QP+L+LG+SIGV  NVIF DW++T
Sbjct: 115 KCMIMGAAVSTVYCNVKLKHPTLDMPVIDYDLALLIQPMLMLGVSIGVICNVIFPDWLVT 174

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+LF+V ST+AFLK VETWKKETIT   A K+ E       Y       D    S  
Sbjct: 175 VLLIILFIVTSTKAFLKAVETWKKETITIAVATKRSEETSEEQEYMQLSAGLDAASQSEI 234

Query: 243 TTPEETRE-PKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
            +  +  E P    VSI +N+YWKE GLL  VW   LALQ+AKNY  TCS+ YWVLNLLQ
Sbjct: 235 LSDHDQNETPPDEAVSIWKNVYWKEVGLLGFVWVAFLALQLAKNYMATCSISYWVLNLLQ 294

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           IPV+ GV+ YEAV L +GRR I+SK +E    RA QL+ YC  G+TAG+V GLLG+GGG 
Sbjct: 295 IPVSVGVAMYEAVGLMQGRRVISSKRNEENSLRAHQLLVYCFLGVTAGVVAGLLGVGGGS 354

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           I+GPLFLELG+PPQV+SATA FA+ FS+SMSVVEYYLL RFPVPYALY   L+ +AA +G
Sbjct: 355 IMGPLFLELGVPPQVASATATFAMMFSASMSVVEYYLLNRFPVPYALYLTILAFLAAIIG 414

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           Q V++KLI +LGRASIIIF LSF+IF+SALSLGGVG++  I +I  +EYMGF+NIC
Sbjct: 415 QRVVRKLIDLLGRASIIIFILSFMIFISALSLGGVGISNTIHKIARREYMGFENIC 470


>gi|357147621|ref|XP_003574413.1| PREDICTED: uncharacterized protein LOC100846525 [Brachypodium
           distachyon]
          Length = 461

 Score =  486 bits (1251), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 274/476 (57%), Positives = 343/476 (72%), Gaps = 30/476 (6%)

Query: 10  RWRLLAAAFIGLLVVASVSVSAQSNL------KPEATSKNKNDHVEPNIVIKVSAPKSRS 63
           +W  +AA  I   VVA+V+   + +L       PE  S  +N                R+
Sbjct: 4   KWHAVAALAIAYTVVATVAADDRGHLLAGTVTAPEGRSSLRN-----------VTSLGRT 52

Query: 64  GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
            Y HVWP +K GWR+++G+ I FFG A GS+GGVGGGGIFVPML LI+GFD KSSAA+SK
Sbjct: 53  TYHHVWPPMKVGWRVLLGSFIVFFGAAFGSIGGVGGGGIFVPMLTLIIGFDTKSSAAISK 112

Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
           CM+ G A+S V  NL+ +HP+LD+P+IDYDLALL QP+L+LG+SIGV  NVIF DW++TV
Sbjct: 113 CMIMGAALSTVYCNLKLKHPSLDMPLIDYDLALLIQPMLMLGVSIGVICNVIFPDWLVTV 172

Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDT 243
           LLI LFLV ST+AFLKGVETWKKET+TKREA KQLE        QT E     EY    +
Sbjct: 173 LLISLFLVTSTKAFLKGVETWKKETLTKREATKQLE--------QTSE---QWEYTLPPS 221

Query: 244 TPEE--TREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
             ++  ++ P    VSI +NIYWKEFGLL  VW   L LQI K+Y  TCS  YWVLN+LQ
Sbjct: 222 GADDAASKAPSDEAVSIWKNIYWKEFGLLAFVWIAFLVLQITKDYVATCSTWYWVLNILQ 281

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           IPV+ GV+ Y+AV L +G+R I+SK +  T  +A QL  YC+ G+TAG +GGLLG+GGG 
Sbjct: 282 IPVSVGVTMYQAVGLAQGKRVISSKANNQTSLKAYQLFVYCSLGLTAGSMGGLLGVGGGI 341

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           I+GPLFLELG+PPQVSSAT+ FA+ FS+SMSVVEYYLLKRFPVPYA +F  ++ +AA VG
Sbjct: 342 IMGPLFLELGVPPQVSSATSTFAMMFSASMSVVEYYLLKRFPVPYAAFFTIVAFLAAIVG 401

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           Q V++K+I  LGRAS+IIF LSF+IF+SALSLGGVG + MI +I   EY+GF+NIC
Sbjct: 402 QGVVRKMISWLGRASLIIFVLSFMIFISALSLGGVGTSNMIHKITQHEYLGFENIC 457


>gi|255551044|ref|XP_002516570.1| conserved hypothetical protein [Ricinus communis]
 gi|223544390|gb|EEF45911.1| conserved hypothetical protein [Ricinus communis]
          Length = 465

 Score =  485 bits (1249), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 258/415 (62%), Positives = 317/415 (76%), Gaps = 16/415 (3%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           SGY+HVWP ++ GW++VVG+IIGFFG A GSVGGVGGGGIFVPML LI+GFD KS+ A+S
Sbjct: 63  SGYQHVWPGMELGWQVVVGSIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSATAIS 122

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G A+S V YNL+ RHPTLD+P++DYDL LL QP ++LGIS GV  NV F DW +T
Sbjct: 123 KCMIMGAAMSTVYYNLKLRHPTLDLPIVDYDLVLLIQPAVLLGISTGVTLNVFFPDWAVT 182

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+L +  S +A+ KGV+TW KET  K EAAKQ +L+  G                ++
Sbjct: 183 VLLIILLIGTSVKAYFKGVDTWNKETTLKEEAAKQYKLLPGG---------------PSN 227

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
             P + +  ++ +V ++ENI+WKEF LLV  W   LALQIAKNYT TCS  YWVLNLLQ+
Sbjct: 228 VNPRQ-KHTREREVPVLENIHWKEFALLVFDWVAYLALQIAKNYTATCSTAYWVLNLLQV 286

Query: 303 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 362
           PV+ GV  Y AV LY+GRR+IASKGDEG   +A QL+ +CA G+ AG+VGGLLGLGGGFI
Sbjct: 287 PVSVGVFLYAAVGLYRGRRRIASKGDEGMNLKAYQLLAFCAYGVMAGIVGGLLGLGGGFI 346

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           +GPLFLELGIPPQV+SATA F + FSSSMSV EYYLL RFPVPYALYF A++ IAAF GQ
Sbjct: 347 MGPLFLELGIPPQVTSATATFGMAFSSSMSVAEYYLLSRFPVPYALYFIAVATIAAFAGQ 406

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           H++ KLI I GRAS+IIF L+F IFVSAL+LGGVG++ +I +IE  E M F N+C
Sbjct: 407 HIITKLIAITGRASLIIFVLAFTIFVSALTLGGVGISNIIGKIEKGESMEFQNLC 461


>gi|224094753|ref|XP_002310222.1| predicted protein [Populus trichocarpa]
 gi|222853125|gb|EEE90672.1| predicted protein [Populus trichocarpa]
          Length = 454

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 272/454 (59%), Positives = 338/454 (74%), Gaps = 18/454 (3%)

Query: 30  SAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIGF 86
           SA+  LK +         + P  ++K+        +S Y+ VWPD+KF WR+VVGTI+GF
Sbjct: 13  SAERLLKDKEAETLVKKEMSPGYIVKMIHFLFQGGKSSYEPVWPDMKFDWRVVVGTIVGF 72

Query: 87  FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
           FG A GSVGGVGGGGIFVPML LI+GFD KSS A+SKCM+ G A + V YNLR RHPTLD
Sbjct: 73  FGAALGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGAAGATVYYNLRLRHPTLD 132

Query: 147 IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
           +P+IDYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLLI+LF+  ST+A  KG++TWKK
Sbjct: 133 MPIIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIGTSTKALFKGIDTWKK 192

Query: 207 ETITKREAAKQLELIVL-GNGYQTE--ECKCDPEYLSNDTTPEETREPKKSKVSIIENIY 263
           E++ K+EAAKQLE     G+G + E       P  L +D  P            + ENI 
Sbjct: 193 ESVMKKEAAKQLESESKPGDGAEEEYQPLPSGPVVLHDDEVP------------LRENIR 240

Query: 264 WKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKI 323
           WKE  LLV VW   L +QI +    TCSV+YW+LN LQ+P+A  V+ +EA+ L KG R I
Sbjct: 241 WKELALLVYVWVAFLVVQIVQTNVPTCSVMYWILNSLQVPIAASVTLFEAICLCKGTRVI 300

Query: 324 ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
           ASKG E T W+  Q++ YC+CGI AG+VGGLLGLGGGFILGPLFLELGIPPQV+SAT+ F
Sbjct: 301 ASKGKEITNWKLHQVLLYCSCGIIAGIVGGLLGLGGGFILGPLFLELGIPPQVASATSTF 360

Query: 384 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 443
           A+ FSSSMSVVEYYLL RFPVPYA YF  ++ IAAF GQHV++++I ILGRASIIIF L+
Sbjct: 361 AMAFSSSMSVVEYYLLNRFPVPYAAYFVLVATIAAFAGQHVIRRIIAILGRASIIIFILA 420

Query: 444 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
             IFVSA+SLGGVG+A M++++E+ EYMGF+N+C
Sbjct: 421 LTIFVSAISLGGVGIANMVEKLENDEYMGFENLC 454


>gi|217073872|gb|ACJ85296.1| unknown [Medicago truncatula]
          Length = 474

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 250/418 (59%), Positives = 319/418 (76%), Gaps = 10/418 (2%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGG-GGIFVPMLNLIVGFDAKSSAA 120
            S Y+ VWP++KFGWRIVVG+I+GFFG A GSVGGVGG  GIF+PML LI+GFD KSS A
Sbjct: 59  ESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGWKGIFIPMLTLIIGFDPKSSTA 118

Query: 121 VSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWM 180
           +SKCM+TG A S V  NLR RHPTLD+P+IDYDLALLFQP+L+LGISIGVAFNV+FADWM
Sbjct: 119 LSKCMITGAAGSTVYCNLRLRHPTLDMPLIDYDLALLFQPMLMLGISIGVAFNVMFADWM 178

Query: 181 ITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLS 240
           +T+LLI+LF+  ST+A +KG++TWKKE I K+EA ++          Q  E    P+Y S
Sbjct: 179 VTILLIILFIGTSTKALVKGIDTWKKEAIMKKEAFEE--------AAQMLESGSTPDYAS 230

Query: 241 NDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 300
            +       + +  +V +++NI+WKE  +L+ VW   L +QI K Y+ TCS+ YW+LN L
Sbjct: 231 EEDYKSLPADLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQILKTYSKTCSIEYWLLNSL 290

Query: 301 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 360
           Q+P+A  V+ +EA+ L KG R IAS+G E T W+  ++  YC CGI AGMV GLLGLGGG
Sbjct: 291 QVPIAISVTLFEAICLCKGTRVIASRGKEIT-WKFHKICLYCFCGIIAGMVSGLLGLGGG 349

Query: 361 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 420
           FILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YY L RFP+PYA Y   ++ IAA  
Sbjct: 350 FILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRFPIPYASYLVLVATIAALT 409

Query: 421 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 478
           GQHV++K+I I GRASII+F L+F IFVSA+SLGGVG+  M++++E+ EYMGFDN C+
Sbjct: 410 GQHVVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMVEKMENAEYMGFDNFCH 467


>gi|148906357|gb|ABR16333.1| unknown [Picea sitchensis]
          Length = 505

 Score =  484 bits (1246), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 288/496 (58%), Positives = 359/496 (72%), Gaps = 29/496 (5%)

Query: 10  RWRLLAA--AFIGLLVVASVSVSAQSNLKPEATSKNKN-------DHVEPNIVIKVSAPK 60
           +W L+    A  GLL+  ++S + ++ L   + S           D VE N  +K    K
Sbjct: 7   KWLLVVVMVATFGLLLAVTLSSAERTFLDGNSASSETERSMEKIVDPVEGN-TLKYHLLK 65

Query: 61  SR--------SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVG 112
           +R        S Y+ VWP++KFGWRIV+G+I+GFFG A GSVGGVGGGGIFVPML LI+G
Sbjct: 66  ARNFLWRSNISDYQRVWPEMKFGWRIVIGSIVGFFGAAVGSVGGVGGGGIFVPMLTLIIG 125

Query: 113 FDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAF 172
           FDAKSS A+SKCM+ G A S V YNL+ +HPTLD+P+IDYDLA+LFQP+L+LGISIGVAF
Sbjct: 126 FDAKSSTAISKCMIMGAAGSTVYYNLKLKHPTLDLPIIDYDLAMLFQPMLMLGISIGVAF 185

Query: 173 NVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEEC 232
           NVIFADWM+TVLLI+LFL  ST+AFLKGVETWK+ETI K+EA +     V  +    E  
Sbjct: 186 NVIFADWMVTVLLIILFLGTSTKAFLKGVETWKQETILKKEAERVRVENVTADVEPEEHQ 245

Query: 233 KCDPEYLSNDTTPEET-----------REPKKSKVSIIENIYWKEFGLLVAVWAIILALQ 281
           + D + L ++  P E            R P     +I  N+ WKE G+LV VW I L LQ
Sbjct: 246 EVDFKPLPSEPVPNEESNGNNEGSNNERLPTDVTNTIWFNVRWKELGILVVVWIIFLVLQ 305

Query: 282 IAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFY 341
           + K  +TTCSV YWVLNLLQIPVA  VS YEA+ LYKG + +ASKG+ G  W+ SQLV Y
Sbjct: 306 VVKAKSTTCSVEYWVLNLLQIPVAMSVSLYEAIGLYKGTKAVASKGEAGINWKISQLVLY 365

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
             CGI AG+VGGLLGLGGGFILGPL LELG+PPQVSSATA F +TFSSSMSVVEYY LKR
Sbjct: 366 FFCGILAGVVGGLLGLGGGFILGPLLLELGVPPQVSSATATFVMTFSSSMSVVEYYFLKR 425

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
           FPVPYA Y F + +IAAF+GQHV+++L+ +LGRAS+IIF L+F+IF+SA++LGGVG+ KM
Sbjct: 426 FPVPYAAYLFGVCVIAAFMGQHVIRRLVILLGRASLIIFILAFVIFLSAITLGGVGIVKM 485

Query: 462 IKRIEHKEYMGFDNIC 477
           I + E+ +YMGF+N+C
Sbjct: 486 IHKFENGDYMGFENLC 501


>gi|242044426|ref|XP_002460084.1| hypothetical protein SORBIDRAFT_02g022510 [Sorghum bicolor]
 gi|241923461|gb|EER96605.1| hypothetical protein SORBIDRAFT_02g022510 [Sorghum bicolor]
          Length = 470

 Score =  484 bits (1246), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 263/415 (63%), Positives = 320/415 (77%), Gaps = 6/415 (1%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y+HVWP ++ GW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 58  NSYQHVWPTMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 117

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G +VS V YNL+ +HPTLD+PVIDYDLA+L QP+L+LGISIGV FNVIF DW++T
Sbjct: 118 KCMIMGASVSTVYYNLKLKHPTLDMPVIDYDLAVLIQPMLMLGISIGVIFNVIFPDWLVT 177

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+LFL  ST+AFLKG+ETWKKETI +REAAK+LE I  G   +       P   +N 
Sbjct: 178 VLLIILFLGTSTKAFLKGIETWKKETIIQREAAKRLEQIA-GEEAEYAPLPTGPGAAANK 236

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
            T      P     S+I+NI+WKE GLL  VW   L LQ+ KNYT TCS  YWVLNLLQ+
Sbjct: 237 KTLSSDEAP-----SLIKNIHWKEVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQV 291

Query: 303 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 362
           PV+ GV+ YE   L  G+R ++SKG E T  +  Q+V Y   GI AG+VGGLLGLGGGFI
Sbjct: 292 PVSVGVTLYEGFGLMSGKRVLSSKGSEQTTMKFHQVVVYGLFGIAAGLVGGLLGLGGGFI 351

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           +GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYA+YF  ++  AA +GQ
Sbjct: 352 MGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYAVYFTIVAFAAALIGQ 411

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           H ++KLI  LGRAS+IIF L+F+IFVSALSLGGVG++ M+ +I   EYMGF+N+C
Sbjct: 412 HAVRKLINWLGRASLIIFILAFMIFVSALSLGGVGISNMVHKIARHEYMGFENLC 466


>gi|225430784|ref|XP_002267318.1| PREDICTED: uncharacterized protein LOC100253061 [Vitis vinifera]
          Length = 476

 Score =  483 bits (1244), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 280/450 (62%), Positives = 341/450 (75%), Gaps = 18/450 (4%)

Query: 32  QSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIGFFG 88
           +++ +P     NKN   EP  + KV        +S Y+HVWPD+KFGW++VV +IIGFFG
Sbjct: 40  KADDRPAEHLVNKNK--EPGFLSKVVHFLWQSGKSSYQHVWPDMKFGWKLVVASIIGFFG 97

Query: 89  TACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP 148
            A GSVGGVGGGGIFVPML LIVGFD KSS A+SKCM+ G A S V YN+R RHPTLD+P
Sbjct: 98  AALGSVGGVGGGGIFVPMLTLIVGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPTLDMP 157

Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
           +IDYDLALLFQP+L+LGISIGVAFNVIFADWM+TVLLI+LFL  ST+A  KG+ETWKKET
Sbjct: 158 IIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIILFLGTSTKALFKGIETWKKET 217

Query: 209 ITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG 268
           I K+EAA+ LE           E K   +Y    + P   R+     V +I NI WKE  
Sbjct: 218 IMKKEAARLLE----------AETKPTGDYRPLPSGPATIRD---DHVPVIYNICWKELA 264

Query: 269 LLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGD 328
           LLV VW   LA+QI K YT TCS+ YWVLN LQ+P+A  VS YEAV LYKG R IASKG 
Sbjct: 265 LLVYVWVAFLAVQIVKTYTVTCSIEYWVLNCLQVPIAASVSIYEAVCLYKGTRVIASKGK 324

Query: 329 EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFS 388
           E T W+  Q+  YC+CGI AG+VGGLLGLGGGFILGPLFLELGIPPQV+SAT+ F++ FS
Sbjct: 325 EVTNWKIHQIFLYCSCGIVAGLVGGLLGLGGGFILGPLFLELGIPPQVASATSTFSMAFS 384

Query: 389 SSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
           SSMSVV+Y+LL+RFPVPYA YF  ++ IAA VGQHV++K+IK++GRASIIIF L+  IF+
Sbjct: 385 SSMSVVQYHLLRRFPVPYASYFVLVATIAALVGQHVVRKVIKLVGRASIIIFILALTIFI 444

Query: 449 SALSLGGVGLAKMIKRIEHKEYMGFDNICY 478
           SA+SLGGVG+  M++++ + EYMGFD++CY
Sbjct: 445 SAISLGGVGIMTMVEKLANNEYMGFDDLCY 474


>gi|255561190|ref|XP_002521607.1| conserved hypothetical protein [Ricinus communis]
 gi|223539285|gb|EEF40878.1| conserved hypothetical protein [Ricinus communis]
          Length = 478

 Score =  481 bits (1237), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 264/448 (58%), Positives = 339/448 (75%), Gaps = 13/448 (2%)

Query: 35  LKPEATSKNKNDHVEPNIVIKVSA----PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTA 90
            K +  +K   DH +    ++ +A       +S Y+ VWP+++F ++IVVGTI+GF G A
Sbjct: 40  FKEQNHTKVLVDHEKEKSFLQKAAHFLWQSGKSSYEPVWPEMEFNYKIVVGTIVGFLGAA 99

Query: 91  CGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVI 150
            GSVGGVGGGGIFVPML L++GFD KSS A+SKCM+ G A S V YN+R RHPTLD+P+I
Sbjct: 100 LGSVGGVGGGGIFVPMLTLLIGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPTLDMPLI 159

Query: 151 DYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT 210
           DYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLLI+LF+  ST+A  KG++TWKKE++ 
Sbjct: 160 DYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIGTSTKALFKGIDTWKKESMM 219

Query: 211 KREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLL 270
           K+EAA+QLE         +E    D E       P      +  ++S+ +NIYWKE  LL
Sbjct: 220 KKEAARQLE---------SESKPNDGEGQDYKPLPSGPVALEDEEISLFQNIYWKELALL 270

Query: 271 VAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEG 330
           V VW   LA+QI K+Y  TCSV YW+LN LQ+P+A  V+ +EA+ L KG R IAS+G E 
Sbjct: 271 VYVWVGFLAVQIVKSYVRTCSVAYWILNALQVPIAASVTLFEAICLRKGTRVIASRGKEI 330

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 390
           T W+  Q++ YC+CGI AGMVGGLLGLGGGFILGPLFLELGIPPQV+SAT+ FA+ FSSS
Sbjct: 331 TNWKYHQILLYCSCGIIAGMVGGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSS 390

Query: 391 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 450
           MSVV+YYLLKRFPVPYA +F  ++ +AA VGQHV++K+I ILGRASIIIF L+  IFVSA
Sbjct: 391 MSVVQYYLLKRFPVPYATFFVLVATVAALVGQHVVRKIIAILGRASIIIFILALTIFVSA 450

Query: 451 LSLGGVGLAKMIKRIEHKEYMGFDNICY 478
           +SLGGVG+  M++++ ++EYMGF+N+CY
Sbjct: 451 ISLGGVGIVDMVEKLTNEEYMGFENLCY 478


>gi|414885094|tpg|DAA61108.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
          Length = 470

 Score =  481 bits (1237), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 259/415 (62%), Positives = 321/415 (77%), Gaps = 6/415 (1%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y+HVWP ++FGW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 58  NSYQHVWPPMEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 117

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G +VS V YNL+ +HPTLD+PVIDYDLA+L QP+L+LGISIGV FNVIF DW++T
Sbjct: 118 KCMIMGASVSTVYYNLKLKHPTLDMPVIDYDLAVLMQPMLMLGISIGVIFNVIFPDWLVT 177

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+LFL  ST+AFLKG+ETWKKET+ +REAAK LE    G   +       P+  +N 
Sbjct: 178 VLLIILFLGTSTKAFLKGIETWKKETVIQREAAKLLEQTA-GEEAEYAALPTGPDVAANK 236

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
                   P     S+I+NI+WK+ GLL  VW   L LQ+ KNYT TCS  YWVLNLLQ+
Sbjct: 237 KALTSDEAP-----SLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQV 291

Query: 303 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 362
           PV+ GV+ YE   L  G+R ++SKG   T  +  Q+V Y   GI AG+VGGLLGLGGGFI
Sbjct: 292 PVSVGVTLYEGFGLMSGKRVLSSKGSGQTTMKFHQVVVYGLFGIAAGLVGGLLGLGGGFI 351

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           +GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYA+YF  ++ +AA +GQ
Sbjct: 352 MGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYAVYFIVVAFVAAIIGQ 411

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           HV++KLI  LGRAS+IIF L+F+IFVSA+SLGGVG++ M+ +IE  EYMGF+++C
Sbjct: 412 HVVRKLINWLGRASLIIFILAFMIFVSAISLGGVGVSNMVHKIERHEYMGFEDLC 466


>gi|224134334|ref|XP_002327812.1| predicted protein [Populus trichocarpa]
 gi|222836897|gb|EEE75290.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  480 bits (1236), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 270/454 (59%), Positives = 337/454 (74%), Gaps = 18/454 (3%)

Query: 30  SAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIGF 86
           SA+  LK +    +    + P  ++KV        +S Y+ VWPD+KF WRIVVGTI+GF
Sbjct: 33  SAERLLKDKEPEASAEKEMSPGYIVKVLHFLFQGGKSSYEPVWPDMKFDWRIVVGTIVGF 92

Query: 87  FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
            G A GSVGGVGGGGIFVPML LI+GFD KSS A+SKCM+ G A + V YNLR RHPTLD
Sbjct: 93  LGAALGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGAAGATVYYNLRLRHPTLD 152

Query: 147 IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
           +P+IDYDL LLFQP+L+LGISIGV+F+V+FADWM+TVLLI+LF+  ST+A  KG++TWKK
Sbjct: 153 MPLIDYDLTLLFQPMLMLGISIGVSFSVMFADWMVTVLLIILFIGTSTKALFKGIDTWKK 212

Query: 207 ETITKREAAKQLELIVLGNGYQTEECK---CDPEYLSNDTTPEETREPKKSKVSIIENIY 263
           ET+ +++AAKQLE          E+ K     P  L +D  P            + ENIY
Sbjct: 213 ETMMRKDAAKQLESESKPEDGAEEDYKPLPSGPVVLPDDEVP------------LRENIY 260

Query: 264 WKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKI 323
           WKE  LL  VWA  L +QI + Y  TCSV YW+LN LQ+P+A  V+ +EA+ L KG R I
Sbjct: 261 WKEVALLFYVWAGFLVVQIVQTYFPTCSVTYWILNSLQVPIAASVTLFEAICLCKGTRVI 320

Query: 324 ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
           ASKG E T W+  Q++ YC+CGI AGMVGGLLGLGGGFILGPLFLELGIPPQV+SAT+ F
Sbjct: 321 ASKGKEITNWKLHQILLYCSCGIIAGMVGGLLGLGGGFILGPLFLELGIPPQVASATSTF 380

Query: 384 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 443
           A+ FSSSMSVVEYYLL RFPVPYA YF  ++ I+AFVGQHV++K+I  LGRAS+IIF L+
Sbjct: 381 AMAFSSSMSVVEYYLLNRFPVPYAAYFALVATISAFVGQHVVRKIIAFLGRASLIIFILA 440

Query: 444 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
             IFVSA+SLGGVG+A M++++E++EYMGF+N+C
Sbjct: 441 LTIFVSAVSLGGVGIANMVEKLENEEYMGFENLC 474


>gi|297735172|emb|CBI17534.3| unnamed protein product [Vitis vinifera]
          Length = 481

 Score =  479 bits (1234), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 278/450 (61%), Positives = 341/450 (75%), Gaps = 13/450 (2%)

Query: 32  QSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIGFFG 88
           +++ +P     NKN   EP  + KV        +S Y+HVWPD+KFGW++VV +IIGFFG
Sbjct: 40  KADDRPAEHLVNKNK--EPGFLSKVVHFLWQSGKSSYQHVWPDMKFGWKLVVASIIGFFG 97

Query: 89  TACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP 148
            A GSVGGVGGGGIFVPML LIVGFD KSS A+SKCM+ G A S V YN+R RHPTLD+P
Sbjct: 98  AALGSVGGVGGGGIFVPMLTLIVGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPTLDMP 157

Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
           +IDYDLALLFQP+L+LGISIGVAFNVIFADWM+TVLLI+LFL  ST+A  KG+ETWKKET
Sbjct: 158 IIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIILFLGTSTKALFKGIETWKKET 217

Query: 209 ITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG 268
           I K+EAA+ LE        +  +     +Y    + P   R+     V +I NI WKE  
Sbjct: 218 IMKKEAARLLE-----AETKPTDNTGGGDYRPLPSGPATIRD---DHVPVIYNICWKELA 269

Query: 269 LLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGD 328
           LLV VW   LA+QI K YT TCS+ YWVLN LQ+P+A  VS YEAV LYKG R IASKG 
Sbjct: 270 LLVYVWVAFLAVQIVKTYTVTCSIEYWVLNCLQVPIAASVSIYEAVCLYKGTRVIASKGK 329

Query: 329 EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFS 388
           E T W+  Q+  YC+CGI AG+VGGLLGLGGGFILGPLFLELGIPPQV+SAT+ F++ FS
Sbjct: 330 EVTNWKIHQIFLYCSCGIVAGLVGGLLGLGGGFILGPLFLELGIPPQVASATSTFSMAFS 389

Query: 389 SSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
           SSMSVV+Y+LL+RFPVPYA YF  ++ IAA VGQHV++K+IK++GRASIIIF L+  IF+
Sbjct: 390 SSMSVVQYHLLRRFPVPYASYFVLVATIAALVGQHVVRKVIKLVGRASIIIFILALTIFI 449

Query: 449 SALSLGGVGLAKMIKRIEHKEYMGFDNICY 478
           SA+SLGGVG+  M++++ + EYMGFD++CY
Sbjct: 450 SAISLGGVGIMTMVEKLANNEYMGFDDLCY 479


>gi|125561413|gb|EAZ06861.1| hypothetical protein OsI_29096 [Oryza sativa Indica Group]
          Length = 469

 Score =  473 bits (1217), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 252/416 (60%), Positives = 314/416 (75%), Gaps = 13/416 (3%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
            + Y HVWP ++FGW+IV+G+ +GF G A GS+GGVGGGG FVPML LI+GFDAKSS A+
Sbjct: 63  ENSYHHVWPPMEFGWQIVLGSFVGFIGAAFGSIGGVGGGGFFVPMLTLIIGFDAKSSVAI 122

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+ G AVS V  NL+++HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW++
Sbjct: 123 SKCMIMGAAVSTVYCNLKRKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVIFPDWLV 182

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           TVLLI+LFL  ST+AFLKG+ETWKKETI KREA K+ E         +EE +  P     
Sbjct: 183 TVLLIILFLGTSTKAFLKGIETWKKETIIKREAEKRSE-------QTSEELEYRP----- 230

Query: 242 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
                E++ P    VSI+ N+YWKEFGLL  VW   LALQ+ KNY  TCS  YWVLNLLQ
Sbjct: 231 -VPASESKPPSDEAVSILHNVYWKEFGLLAFVWIAFLALQVTKNYMPTCSTWYWVLNLLQ 289

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           IPV+ GV+ YE + L +GRR I+S G+E T  +  QL+ YC  GITAG+V GLLG+GGG 
Sbjct: 290 IPVSVGVTMYEGLGLMQGRRVISSNGNEQTNLKFHQLLMYCFFGITAGIVAGLLGVGGGS 349

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           ILGP+FL+LG+PPQV+SATA F++ FS+SMS VEYY L RFPVPYALY   ++  +A VG
Sbjct: 350 ILGPMFLDLGVPPQVASATATFSMMFSASMSAVEYYFLDRFPVPYALYLTVVAFFSAIVG 409

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           Q +++K+I  LGRASIIIFTLS +IF+S + LGG+G+   I +IE  EYMGF++IC
Sbjct: 410 QRMVRKVINWLGRASIIIFTLSIMIFLSTIPLGGIGIVNWIGKIERHEYMGFEDIC 465


>gi|356511007|ref|XP_003524223.1| PREDICTED: uncharacterized protein LOC100819316 [Glycine max]
          Length = 454

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 273/466 (58%), Positives = 343/466 (73%), Gaps = 34/466 (7%)

Query: 12  RLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPD 71
           R+L  AF  L+ V   S    +  KP ++  N    V+ N         +   YKH WP+
Sbjct: 17  RVLLVAFFLLVSVYVASSHQNTEQKPVSSPSNDTKGVDTN------GHHANGFYKHTWPE 70

Query: 72  IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
           +KFGW+I+VG+IIGF G+A G+VGGVGGGGIFVPML LI+GFDA+S+ A+SKCM+TGGA 
Sbjct: 71  MKFGWKIIVGSIIGFLGSAFGTVGGVGGGGIFVPMLTLIIGFDARSATAISKCMITGGAG 130

Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
           + V YNL+QRHPTLD+PVIDYDLALLFQP+L+LGIS+GVAFNVIF DWM+T LLI++F+ 
Sbjct: 131 ATVFYNLKQRHPTLDMPVIDYDLALLFQPMLMLGISVGVAFNVIFPDWMLTALLIIVFIG 190

Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREP 251
           LS  AF KGV+TWKKETI K+    Q    ++                         +  
Sbjct: 191 LSVNAFFKGVKTWKKETIIKKCFHFQHLFFLM-------------------------KSK 225

Query: 252 KKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAY 311
              +VS+IENIYWKE G+LV+VW +ILALQI KNYTT CS LYW++NLLQ+P+  G + Y
Sbjct: 226 TVLQVSVIENIYWKELGILVSVWILILALQIGKNYTTNCSALYWIMNLLQVPITVGTTFY 285

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           EAV LYKG+R IASKGD+ T+WR  QL+ YC+CGI AG++GGLLGLGGGFILGPLF+ LG
Sbjct: 286 EAVLLYKGQRVIASKGDQQTRWRVQQLILYCSCGIIAGIIGGLLGLGGGFILGPLFIGLG 345

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           I PQVSSAT+ FA+TFS+SMSVVEYYLLKRFP+PYALYF A++  AA VGQH+++K+I I
Sbjct: 346 IHPQVSSATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVAVATAAALVGQHLVRKVIAI 405

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           LGR S+IIF L+  +FVS +SLGGVG+AK+   IE K  M F+N+C
Sbjct: 406 LGRTSLIIFILALTVFVSGISLGGVGIAKL---IEKKVQMKFENLC 448


>gi|357518359|ref|XP_003629468.1| hypothetical protein MTR_8g077830 [Medicago truncatula]
 gi|355523490|gb|AET03944.1| hypothetical protein MTR_8g077830 [Medicago truncatula]
          Length = 480

 Score =  470 bits (1210), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 253/424 (59%), Positives = 324/424 (76%), Gaps = 16/424 (3%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
            S Y+ VWP++KFGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML LI+GFD KSS A+
Sbjct: 59  ESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLIIGFDPKSSTAL 118

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+TG A S V YNLR RHPTLD+P+IDYDLALLFQP+L+LGISIGVAFNV+FADWM+
Sbjct: 119 SKCMITGAAGSTVYYNLRLRHPTLDMPLIDYDLALLFQPMLMLGISIGVAFNVMFADWMV 178

Query: 182 TVLLIVLFL-------VLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKC 234
           T+LLI+LF+         ST+A +KG++TWKKETI K+EA ++          Q  E   
Sbjct: 179 TILLIILFIGNNHYLFSTSTKALVKGIDTWKKETIMKKEAFEE--------AAQMLESGS 230

Query: 235 DPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLY 294
            P+Y S +       + +  +V +++NI+WKE  +L+ VW   L +QI K Y+ TCS+ Y
Sbjct: 231 TPDYASEEDYKSLPADLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQILKTYSKTCSIEY 290

Query: 295 WVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGL 354
           W+LN LQ+P+A  V+ +EA+ L KG R IAS+G E T W+  ++  YC CGI AGMV GL
Sbjct: 291 WLLNSLQVPIAISVTLFEAICLCKGTRVIASRGKEIT-WKFHKICLYCFCGIIAGMVSGL 349

Query: 355 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 414
           LGLGGGFILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YY L RFP+PYA Y   ++
Sbjct: 350 LGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRFPIPYASYLVLVA 409

Query: 415 IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFD 474
            IAA  GQHV++K+I I GRASII+F L+F IFVSA+SLGGVG+  M++++E++EYMGFD
Sbjct: 410 TIAALTGQHVVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMVEKMENEEYMGFD 469

Query: 475 NICY 478
           N+C+
Sbjct: 470 NLCH 473


>gi|338762845|gb|AEI98632.1| hypothetical protein 111018.19 [Coffea canephora]
          Length = 465

 Score =  470 bits (1209), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 258/417 (61%), Positives = 320/417 (76%), Gaps = 23/417 (5%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
           +S Y+HVWP+++FGW++VVGT++GF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+
Sbjct: 70  KSSYQHVWPEMEFGWKLVVGTVVGFLGAALGSVGGVGGGGIFVPMLSLIIGFDPKSSTAI 129

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+ G A S V YNLR RHPTLD+P+IDYDLALLFQP+L+LGISIGVAFNVIFADWM+
Sbjct: 130 SKCMIMGAAGSTVYYNLRLRHPTLDLPLIDYDLALLFQPMLMLGISIGVAFNVIFADWMV 189

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           TVLLI+LF+  S++AF KG+ETW KET  K           LG G + +     P   ++
Sbjct: 190 TVLLILLFIGTSSKAFFKGLETWNKETKMK-----------LGPGNEYKPLPGGPAAHAD 238

Query: 242 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
           +            KV ++ NIYWKE  LL+ VW   LA+QI K YT TCS  YW LN LQ
Sbjct: 239 E------------KVPLLYNIYWKELCLLLFVWIAFLAIQIMKTYTQTCSAKYWTLNFLQ 286

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           +P+A  VS YEA  LYKG R IASKG E T W+ + +  YC  GI AG+VGGLLGLGGGF
Sbjct: 287 VPIAASVSLYEATCLYKGTRTIASKGKEITVWKPNLIFLYCCLGIVAGVVGGLLGLGGGF 346

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           +LGPLFLELG+PPQV+SAT+ FA+TFSSSMSVV+YYLL RFPVPYA YF  ++ +AA VG
Sbjct: 347 VLGPLFLELGVPPQVASATSTFAMTFSSSMSVVQYYLLNRFPVPYATYFVLVATVAALVG 406

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 478
           QHV++++I ILGRAS+IIF L+  IFVSA+SLGGVG+A MI+++++ EYMGFDN+C+
Sbjct: 407 QHVVRRIIAILGRASLIIFILALTIFVSAISLGGVGIASMIEKLQNHEYMGFDNLCH 463


>gi|388493882|gb|AFK35007.1| unknown [Medicago truncatula]
          Length = 473

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 251/417 (60%), Positives = 321/417 (76%), Gaps = 9/417 (2%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
            S Y+ VWP++KFGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML LI+GFD KSS A+
Sbjct: 59  ESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLIIGFDPKSSTAL 118

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+TG A S V  NLR RHPTLD+P+IDYDLALLFQP+L+LGISIGVAFNV+FADWM+
Sbjct: 119 SKCMITGAAGSTVYCNLRLRHPTLDMPLIDYDLALLFQPMLMLGISIGVAFNVMFADWMV 178

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           T+LLI+LF+  ST+A +KG++TWKKE I K+EA ++          Q  E    P+Y S 
Sbjct: 179 TILLIILFIGTSTKALVKGIDTWKKEAIMKKEAFEE--------AAQMLESGSTPDYASE 230

Query: 242 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
           +       + +  +V +++NI+WKE  +L+ VW   L +QI K Y+ TCS+ YW+LN LQ
Sbjct: 231 EDYKSLPADLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQILKTYSKTCSIEYWLLNSLQ 290

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           +P+A  V+ +EA+ L KG R IAS+G E T W+  ++  YC CGI AGMV GLLGLGGGF
Sbjct: 291 VPIAISVTLFEAICLCKGTRVIASRGKEIT-WKFHKICLYCFCGIIAGMVSGLLGLGGGF 349

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           ILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YY L RFP+PYA Y   ++ IAA  G
Sbjct: 350 ILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRFPIPYASYLVLVATIAALTG 409

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 478
           QHV++K+I I GRASII+F L+F IFVSA+SLGGVG+  M++++E+ EYMGFDN+C+
Sbjct: 410 QHVVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMVEKMENAEYMGFDNLCH 466


>gi|115476232|ref|NP_001061712.1| Os08g0387200 [Oryza sativa Japonica Group]
 gi|40253486|dbj|BAD05436.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623681|dbj|BAF23626.1| Os08g0387200 [Oryza sativa Japonica Group]
 gi|125603275|gb|EAZ42600.1| hypothetical protein OsJ_27163 [Oryza sativa Japonica Group]
          Length = 469

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 251/416 (60%), Positives = 314/416 (75%), Gaps = 13/416 (3%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
            + Y HVWP ++FGW+IV+G+ +GF G A GS+GGVGGGG F+PML LI+GFDAKSS A+
Sbjct: 63  ENSYHHVWPPMEFGWQIVLGSFVGFIGAAFGSIGGVGGGGFFMPMLTLIIGFDAKSSVAI 122

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+ G AVS V  NL+++HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW++
Sbjct: 123 SKCMIMGAAVSTVYCNLKRKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVIFPDWLV 182

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           TVLLI+LFL  ST+AFLKG+ETWKKETI KREA K+ E         +EE +  P     
Sbjct: 183 TVLLIILFLGTSTKAFLKGIETWKKETIIKREAEKRSE-------QTSEELEYRP----- 230

Query: 242 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
                E++ P    VSI+ N+YWKEFGLL  VW   LALQ+ KNY  TCS  YWVLNLLQ
Sbjct: 231 -VPASESKPPSDEAVSILHNVYWKEFGLLAFVWIAFLALQVTKNYMPTCSTWYWVLNLLQ 289

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           IPV+ GV+ YE + L +GRR I+S G+E T  +  QL+ YC  GITAG+V GLLG+GGG 
Sbjct: 290 IPVSVGVTMYEGLGLMQGRRVISSNGNEQTNLKFHQLLMYCFFGITAGIVAGLLGVGGGS 349

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           ILGP+FL+LG+PPQV+SATA F++ FS+SMS VEYY L RFPVPYALY   ++  +A VG
Sbjct: 350 ILGPMFLDLGVPPQVASATATFSMMFSASMSAVEYYFLDRFPVPYALYLTVVAFFSAIVG 409

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           Q +++K+I  LGRASIIIFTLS +IF+S + LGG+G+   I +IE  EYMGF++IC
Sbjct: 410 QRMVRKVINWLGRASIIIFTLSIMIFLSTIPLGGIGIVNWIGKIERHEYMGFEDIC 465


>gi|356528377|ref|XP_003532780.1| PREDICTED: uncharacterized protein LOC100802321 [Glycine max]
          Length = 434

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 274/471 (58%), Positives = 338/471 (71%), Gaps = 53/471 (11%)

Query: 10  RWRLLAAA---FIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYK 66
           RW +   A    +  L+V+    S+Q N   +  S   N+       +   A  +   YK
Sbjct: 8   RWAVKWVAEKVLVAFLLVSVYVASSQQNTDQKTISSASNETKG----VDTKAHHAEGFYK 63

Query: 67  HVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMV 126
           H WP++KFGW+I+VG+IIGF G+A G+VGGVGGGGIFVPML LI+GFDA+S+ A+SKCM+
Sbjct: 64  HTWPEMKFGWKIIVGSIIGFLGSAFGTVGGVGGGGIFVPMLTLIIGFDARSATAISKCMI 123

Query: 127 TGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLI 186
           TGGA + + YNL+QRHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNVIF DWM+T LLI
Sbjct: 124 TGGAGATIFYNLKQRHPTLDMPVIDYDLALLFQPMLMLGISIGVAFNVIFPDWMLTALLI 183

Query: 187 VLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPE 246
           V+F+ LS  AF KGV+TWKKETI                                     
Sbjct: 184 VVFIGLSFNAFFKGVKTWKKETIV------------------------------------ 207

Query: 247 ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAG 306
                  +KVS+IENIYWKE G+LV+VW +ILALQI KNYTT CS LYW LNLLQ+P+  
Sbjct: 208 -------NKVSVIENIYWKELGILVSVWILILALQIGKNYTTNCSALYWALNLLQVPITV 260

Query: 307 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 366
           G + YEAV LYKG+RKIASKGD+ T+WR  QL+ YC CGI AG++GGLLGLGGGFILGPL
Sbjct: 261 GTTVYEAVLLYKGKRKIASKGDQQTRWRVHQLILYCTCGIIAGIIGGLLGLGGGFILGPL 320

Query: 367 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 426
           F+ LGI PQVSSAT+ FA+TFS+SMSVVEYYLLKRFP+PYALYF A++  AA VGQH+++
Sbjct: 321 FIGLGIHPQVSSATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVAVATAAALVGQHLVR 380

Query: 427 KLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           K+I ILGR S+IIF L+  +FVS +SLGGVG+AK+   IE K  M F+N+C
Sbjct: 381 KVIAILGRTSLIIFILALTVFVSGISLGGVGIAKL---IEKKVQMKFENLC 428


>gi|356496755|ref|XP_003517231.1| PREDICTED: uncharacterized protein LOC100801441 [Glycine max]
          Length = 470

 Score =  467 bits (1201), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 258/418 (61%), Positives = 323/418 (77%), Gaps = 13/418 (3%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
           + +S Y+ VWP++KFGWRI+VG+IIGF G A GSVGGVGGGGIFVPML LI+GFD KSS 
Sbjct: 63  EGKSSYEPVWPNMKFGWRIIVGSIIGFLGAALGSVGGVGGGGIFVPMLALIIGFDPKSST 122

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+ G ++S V YNLR RHPTLD+P+IDYDLAL+FQP+L+LGISIGV  NV+FADW
Sbjct: 123 AISKCMIMGASISTVYYNLRLRHPTLDMPLIDYDLALIFQPMLMLGISIGVICNVMFADW 182

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL 239
           M+TVLLI+LF+  ST+A  KG++TWKKETI K+EA+K LE           E K   +Y 
Sbjct: 183 MVTVLLIILFIATSTKATYKGIDTWKKETIAKKEASKLLE----------AEPKSGDDYK 232

Query: 240 SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 299
           S  + P E+      +  +++NIYWKE  LL  VW     +QI K YT  CS+ +W+LN 
Sbjct: 233 SLPSGPTESL---FEEAPLLKNIYWKELSLLAYVWVAFFIVQIVKEYTKPCSIQFWLLNF 289

Query: 300 LQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           LQ+PVA  V+ +EA+ LYKG R IASKG E T W+  Q+  YC+ GI AGMVGGLLGLGG
Sbjct: 290 LQVPVAVSVTLFEAIGLYKGTRVIASKGKEVTNWKIHQICLYCSTGIMAGMVGGLLGLGG 349

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           GFILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YYLL RFPVPYA YF  ++ IAAF
Sbjct: 350 GFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLDRFPVPYASYFALVATIAAF 409

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            GQHV++K+I +LGRASIIIF L+  IF+SA+SLGGVG+  +I++IE+ EYMGF+++C
Sbjct: 410 TGQHVVRKVIVVLGRASIIIFILALTIFISAISLGGVGIENIIEKIENHEYMGFEDLC 467


>gi|414885096|tpg|DAA61110.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
          Length = 404

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 254/406 (62%), Positives = 314/406 (77%), Gaps = 6/406 (1%)

Query: 72  IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
           ++FGW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+SKCM+ G +V
Sbjct: 1   MEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGASV 60

Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
           S V YNL+ +HPTLD+PVIDYDLA+L QP+L+LGISIGV FNVIF DW++TVLLI+LFL 
Sbjct: 61  STVYYNLKLKHPTLDMPVIDYDLAVLMQPMLMLGISIGVIFNVIFPDWLVTVLLIILFLG 120

Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREP 251
            ST+AFLKG+ETWKKET+ +REAAK LE    G   +       P+  +N         P
Sbjct: 121 TSTKAFLKGIETWKKETVIQREAAKLLEQTA-GEEAEYAALPTGPDVAANKKALTSDEAP 179

Query: 252 KKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAY 311
                S+I+NI+WK+ GLL  VW   L LQ+ KNYT TCS  YWVLNLLQ+PV+ GV+ Y
Sbjct: 180 -----SLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQVPVSVGVTLY 234

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           E   L  G+R ++SKG   T  +  Q+V Y   GI AG+VGGLLGLGGGFI+GPLFLELG
Sbjct: 235 EGFGLMSGKRVLSSKGSGQTTMKFHQVVVYGLFGIAAGLVGGLLGLGGGFIMGPLFLELG 294

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           IPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYA+YF  ++ +AA +GQHV++KLI  
Sbjct: 295 IPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYAVYFIVVAFVAAIIGQHVVRKLINW 354

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           LGRAS+IIF L+F+IFVSA+SLGGVG++ M+ +IE  EYMGF+++C
Sbjct: 355 LGRASLIIFILAFMIFVSAISLGGVGVSNMVHKIERHEYMGFEDLC 400


>gi|357147619|ref|XP_003574412.1| PREDICTED: uncharacterized protein LOC100846223 [Brachypodium
           distachyon]
          Length = 468

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 273/471 (57%), Positives = 344/471 (73%), Gaps = 13/471 (2%)

Query: 10  RWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYK 66
           +W  +AA  I   V+A+   + +  L     +  + + V  + + KV++    K  + Y 
Sbjct: 4   QWHAVAALCISCTVMAAAVGADRGRLFASTAAAPEPEDV--SFLRKVASYMWQKDGTTYH 61

Query: 67  HVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMV 126
           HVWP +KFGW+IV+G+  GF G A GS+GGVGGGGIFVPML LI+GFD KS+AA+SKCM+
Sbjct: 62  HVWPPMKFGWKIVLGSFFGFIGGAFGSIGGVGGGGIFVPMLTLIIGFDPKSAAAMSKCMI 121

Query: 127 TGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLI 186
            G AVS V  NL+ +HPTLD+PVIDYDL LL QP+L+LG+SIGV  NVIF +W++TVLLI
Sbjct: 122 MGAAVSTVYCNLKLKHPTLDMPVIDYDLVLLIQPMLMLGVSIGVICNVIFPNWLVTVLLI 181

Query: 187 VLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPE 246
           +LFLV ST+AFLKGVETWKKETI  +EA K+ E I       +EE +  P     D+ P 
Sbjct: 182 ILFLVTSTKAFLKGVETWKKETIIIKEAEKRSEQI-------SEETEYTPLPTGPDS-PA 233

Query: 247 ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAG 306
           ET  P    VSI +N++WKEFGLLV VW   LALQ+ KNY  TCS  YWVLNLLQIPV+ 
Sbjct: 234 ETITPSDEAVSIWKNVHWKEFGLLVFVWVAFLALQVTKNYIATCSTWYWVLNLLQIPVSV 293

Query: 307 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 366
           GV+ YEAV L +G+R I+SK ++ T  +  QL+ YC  G+ AG+V GLLG+GGG I+GPL
Sbjct: 294 GVAMYEAVGLLQGKRVISSKANDQTSLKVHQLLLYCFLGLMAGVVAGLLGVGGGSIMGPL 353

Query: 367 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 426
           FLELG+PPQVSSATA FA+ FSSSMSVVEY+LL RFPVPYAL+F  L+  AA VGQ V++
Sbjct: 354 FLELGVPPQVSSATATFAMMFSSSMSVVEYFLLNRFPVPYALFFTVLAFFAAIVGQRVVR 413

Query: 427 KLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           KLI +LGRAS+I+F LS +IFVSA SLGGVG++ MI +I   EYMGF+NIC
Sbjct: 414 KLIGLLGRASLIVFILSMMIFVSAFSLGGVGISNMIHKITRHEYMGFENIC 464


>gi|356554517|ref|XP_003545592.1| PREDICTED: uncharacterized protein LOC100781779 [Glycine max]
          Length = 462

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 245/456 (53%), Positives = 321/456 (70%), Gaps = 18/456 (3%)

Query: 27  VSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGF 86
           V +SA ++ K E +S N+      +I   +       GY+HVWP+++F WRIVVGT+IG 
Sbjct: 16  VFISAITSTKSEVSSTNEEQSFSYHIK-ALEFIWKHLGYQHVWPEMEFSWRIVVGTLIGI 74

Query: 87  FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
            G A GSVGGVGGGGIFVPML LI+GFD KS+ A+SKCMVTG A+SAV + ++QRHPTLD
Sbjct: 75  LGAAFGSVGGVGGGGIFVPMLILIIGFDPKSAVAISKCMVTGAAISAVFFCMKQRHPTLD 134

Query: 147 IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
            PVIDYDL LL QP L+LGISIGV  +VIFADWM+T+LLI+L +V S RAF  G +TWKK
Sbjct: 135 EPVIDYDLMLLIQPTLMLGISIGVILSVIFADWMVTILLIILCIVTSIRAFFMGADTWKK 194

Query: 207 ETITKREAAKQLELIVLGNGYQTEECKCDPEY-----LSNDTTPEETREPKKSKVSIIEN 261
           ET  K E+ + ++L        T  C  +  Y      S++   ++TR+P   +VSI+ N
Sbjct: 195 ETKMKEESQETIKL-----SESTATCSEEEGYKYLPGCSDEGYEKDTRKP---EVSILGN 246

Query: 262 IYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRR 321
           +YWKEF L+  VW   + LQIAKNYT +CSV YW+L L QIP+  G   Y+A ALY+GR 
Sbjct: 247 MYWKEFVLIFIVWLAFVVLQIAKNYTVSCSVTYWILILSQIPITVGFYLYQARALYQGR- 305

Query: 322 KIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 381
              + G + T W    L     C + AG+VGGLLG G GF++GPLFLE+GI PQV+SATA
Sbjct: 306 ---AAGSQHTHWPLHHLFLASICSLLAGIVGGLLGTGSGFVMGPLFLEVGIAPQVASATA 362

Query: 382 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 441
            F + +SSS+SV++YYLL RFPVPYAL+   ++ IAAF+GQ+++ KL+ I  RAS+IIF 
Sbjct: 363 TFGMMYSSSLSVIQYYLLNRFPVPYALFLTLVAAIAAFLGQYLIDKLVNIFQRASLIIFV 422

Query: 442 LSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           L+F IFVS+++LGGVG++ MI +I+  EYMGFDN C
Sbjct: 423 LAFTIFVSSIALGGVGISNMILKIQRNEYMGFDNFC 458


>gi|83853809|gb|ABC47842.1| membrane protein-like protein [Glycine max]
          Length = 469

 Score =  460 bits (1183), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 260/416 (62%), Positives = 320/416 (76%), Gaps = 19/416 (4%)

Query: 66  KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
             VWP++KFGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML L++GFDAKSS A+SKCM
Sbjct: 70  DRVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCM 129

Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLL 185
           + G AVS V YNLR RHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLL
Sbjct: 130 IMGAAVSTVYYNLRFRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLL 189

Query: 186 IVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTP 245
           I+LF+  ST+A  KG++TWKKETI K+EAAK LE      GY +EE     +Y S    P
Sbjct: 190 IILFIATSTKALFKGIDTWKKETIMKKEAAKMLESDS-SPGYVSEE-----DYKS---LP 240

Query: 246 EETREPKKSKV--SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 303
             + +P+  +V    I+ I +  F        + LAL +   YT TCS+LYWVLN LQ+P
Sbjct: 241 AGSADPRDEEVIPMAIQGIPYSNF------IEVFLALHVV-TYTKTCSILYWVLNSLQVP 293

Query: 304 VAGGVSAYEAVALYKGRRKIASKGDEGTKW-RASQLVFYCACGITAGMVGGLLGLGGGFI 362
           +A  V+ YEA+ L  G R IASKG E T W +  ++  YC+CGI AG+V GLLGLGGGFI
Sbjct: 294 IAISVTLYEAICLCNGTRVIASKGKENTDWMKLHKICLYCSCGIIAGIVSGLLGLGGGFI 353

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           LGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YYLL+RFPVPYA YF  ++ IAA  GQ
Sbjct: 354 LGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLERFPVPYASYFILVATIAALTGQ 413

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 478
           HV++K+I I GRASIIIF L+F IF+SA+SLGGVG+  M++++E+ EYMGF NIC+
Sbjct: 414 HVVRKIIAIFGRASIIIFVLAFTIFLSAISLGGVGIENMVEKMENNEYMGFANICH 469


>gi|357519215|ref|XP_003629896.1| hypothetical protein MTR_8g088070 [Medicago truncatula]
 gi|355523918|gb|AET04372.1| hypothetical protein MTR_8g088070 [Medicago truncatula]
          Length = 448

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 257/432 (59%), Positives = 325/432 (75%), Gaps = 19/432 (4%)

Query: 23  VVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGT 82
           V +S   + Q    P  +++ K DH             ++  YKH WP +KFGWRI+VGT
Sbjct: 33  VTSSHQYTDQKTTPP--SNETKGDH-------------AKVFYKHTWPGMKFGWRIIVGT 77

Query: 83  IIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH 142
           I+ F G++ G+VGGVGGG   VPML LI+GFDAKS+ A+SKCM+TGGA + V+YNLRQRH
Sbjct: 78  ILAFLGSSFGTVGGVGGG-GIVPMLTLIIGFDAKSATAISKCMITGGAGATVLYNLRQRH 136

Query: 143 PTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVE 202
           PTLD+PVIDYDLALLFQP+L+LGIS+GVAFNVIF DWMIT L+++ F  +S +AFLKGV 
Sbjct: 137 PTLDLPVIDYDLALLFQPMLMLGISLGVAFNVIFPDWMITSLILIFFTGISVKAFLKGVN 196

Query: 203 TWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENI 262
           TWK ET+ K+EA    +L    +  + E+ + D +   +          ++ KVS+IEN+
Sbjct: 197 TWKNETLAKKEARDNSQL---NDIRRIEDVELDLQAGESVNDNHTNTNIQRKKVSVIENV 253

Query: 263 YWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK 322
            WKE GLL  VW +ILALQI K+YT TCSV YW+LNLLQ+P+A GVS+YEAV LYKG++ 
Sbjct: 254 SWKEVGLLFFVWIMILALQIGKSYTITCSVAYWILNLLQVPIAVGVSSYEAVLLYKGKKV 313

Query: 323 IASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 382
           IASKGD+G  W   QL+ YC+CGI AG++GGLLGLGGGFIL PLFL LGIPPQVSSAT+ 
Sbjct: 314 IASKGDQGINWSVKQLILYCSCGIIAGLIGGLLGLGGGFILAPLFLGLGIPPQVSSATST 373

Query: 383 FAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           FA+TFS+SMSVVEYYLLKRFP+PYALYF  ++ +AA VGQH+++KLI +LGRASIIIF L
Sbjct: 374 FAMTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQHLVRKLIVLLGRASIIIFIL 433

Query: 443 SFIIFVSALSLG 454
           +  +FVS +SLG
Sbjct: 434 ALTVFVSGISLG 445


>gi|414589355|tpg|DAA39926.1| TPA: hypothetical protein ZEAMMB73_327447 [Zea mays]
          Length = 497

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 256/438 (58%), Positives = 318/438 (72%), Gaps = 26/438 (5%)

Query: 64  GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
            Y HVWP + FGWR+VVG++IGFFG ACGSVGGVGGGGIFVPML LIVGFD KSS A+SK
Sbjct: 60  AYHHVWPPMGFGWRVVVGSLIGFFGAACGSVGGVGGGGIFVPMLALIVGFDPKSSTAISK 119

Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
           CM+ GG+VS V YNL+++HP+LD+P+IDYDLALL QP+L+LG+SIGV FNVIF DW+IT 
Sbjct: 120 CMIMGGSVSTVYYNLKRKHPSLDMPLIDYDLALLMQPMLMLGVSIGVIFNVIFPDWLITA 179

Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKRE-------AAKQLELIVL---GN-------- 225
           LLI +FLV ST+A+LKG ETWKKET+ KR+       AA+Q    ++   GN        
Sbjct: 180 LLITIFLVTSTKAYLKGFETWKKETVKKRQMLLRLHRAARQETRQIMKRYGNLACIAVRT 239

Query: 226 -GYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK 284
             Y      C   Y+  D   E          S+++NIYWKE GLL  VW   L LQ+ +
Sbjct: 240 IAYIVRIRTCVHIYI--DELMETACLALFQTSSVLKNIYWKELGLLAFVWIAFLGLQLTR 297

Query: 285 -NYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS-KG---DEGTKWRASQLV 339
            NY  +CSV YWVLN LQ+PVA GV+ YEA  L  GRR ++S KG    + +  R  QL+
Sbjct: 298 TNYAASCSVWYWVLNSLQVPVAVGVTVYEACGLRTGRRALSSNKGGSQQQQSALRIRQLL 357

Query: 340 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 399
            YC  G+ AG+VGGLLG+GGGFI+GPLFLELGIPPQVSSATA F + FSSSMSVVEYYLL
Sbjct: 358 VYCLFGVLAGLVGGLLGMGGGFIMGPLFLELGIPPQVSSATATFTMMFSSSMSVVEYYLL 417

Query: 400 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 459
            RFPVPYA YF  ++ +AA  GQH ++KLI  LGRAS+IIF L+ +IFVSAL+LGGVG++
Sbjct: 418 HRFPVPYAAYFTGVAFVAAITGQHCVRKLIAWLGRASLIIFILASMIFVSALTLGGVGIS 477

Query: 460 KMIKRIEHKEYMGFDNIC 477
            ++ R+E  +YMGF+++C
Sbjct: 478 NIVHRVERHQYMGFESLC 495


>gi|242044424|ref|XP_002460083.1| hypothetical protein SORBIDRAFT_02g022500 [Sorghum bicolor]
 gi|241923460|gb|EER96604.1| hypothetical protein SORBIDRAFT_02g022500 [Sorghum bicolor]
          Length = 455

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 250/419 (59%), Positives = 312/419 (74%), Gaps = 26/419 (6%)

Query: 64  GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
            Y HVWP ++FGWR+VVG++IGFFG ACGSVGGVGGGGIFVPML LI+GFD KSS A+SK
Sbjct: 56  AYHHVWPPMEFGWRVVVGSLIGFFGAACGSVGGVGGGGIFVPMLALIIGFDPKSSTAISK 115

Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
                       YNL+ +HP+LD+P+IDYDLALL QP+L+LG+SIGV FNVIF +W+IT 
Sbjct: 116 ----------FYYNLKLKHPSLDMPLIDYDLALLMQPMLMLGVSIGVIFNVIFPNWLITA 165

Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEY---LS 240
           LLI +FLV ST+A+LKG ETWKKETI KRE A++ E I          C+ +PE+   + 
Sbjct: 166 LLITIFLVTSTKAYLKGFETWKKETIKKREDARRKEQI----------CQ-EPEHAIPIG 214

Query: 241 NDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 300
           +D   ++ + P     S+++NIYWKEFGLL  VW   L LQ+ KNY  +CSV YWVLN L
Sbjct: 215 SDAPADKAKTPTDEATSVLKNIYWKEFGLLAFVWVAFLGLQVTKNYVASCSVWYWVLNSL 274

Query: 301 QIPVAGGVSAYEAVALYKGRRKIASKGDEGT--KWRASQLVFYCACGITAGMVGGLLGLG 358
           QIPVA GV+ YEA  L  G+R ++SKG        R  QL+ YC  GI AG++GGLLG+G
Sbjct: 275 QIPVAVGVTVYEAYGLRTGKRVLSSKGSSQQQSTLRIRQLLVYCLFGILAGLIGGLLGMG 334

Query: 359 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 418
           GGFI+GPLFLELGIPPQVSSATA F + FSSSMSVVEYYLL RFPVPYA YF  ++ +AA
Sbjct: 335 GGFIMGPLFLELGIPPQVSSATATFTMMFSSSMSVVEYYLLHRFPVPYAAYFTGVAFVAA 394

Query: 419 FVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
             GQH ++KLI  LGRAS+IIF L+ +IFVSAL+LGGVG++ ++ R+E  EYMGF+++C
Sbjct: 395 ITGQHFVRKLIAWLGRASLIIFILASMIFVSALTLGGVGISNIVHRMERHEYMGFESLC 453


>gi|357483379|ref|XP_003611976.1| Membrane protein-like protein [Medicago truncatula]
 gi|358344383|ref|XP_003636269.1| Membrane protein-like protein [Medicago truncatula]
 gi|355502204|gb|AES83407.1| Membrane protein-like protein [Medicago truncatula]
 gi|355513311|gb|AES94934.1| Membrane protein-like protein [Medicago truncatula]
          Length = 470

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 254/416 (61%), Positives = 320/416 (76%), Gaps = 13/416 (3%)

Query: 62  RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
           +S Y+ VWP++KF W+I+VG+IIGF G A GSVGGVGGGGIFVPML LI+GFD KSS A+
Sbjct: 66  KSSYEPVWPEMKFDWKIIVGSIIGFLGAALGSVGGVGGGGIFVPMLALIIGFDPKSSTAI 125

Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
           SKCM+ G A+S V YN+R R+PTLD+P+IDYDLALLFQP+L+LGISIGV  NV+FADWM+
Sbjct: 126 SKCMIMGAALSTVYYNMRLRNPTLDMPLIDYDLALLFQPMLMLGISIGVICNVMFADWMV 185

Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSN 241
           TVLLI+LF+  ST+A +KG+ TWKKET+ K+E AKQLE          EE K   +Y   
Sbjct: 186 TVLLIILFIGTSTKALIKGINTWKKETMLKKETAKQLE----------EEPKTGEDY--- 232

Query: 242 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
              P+   E +   V +++NIYWKE  LLV VW   L +QI K YT TCS+ YW+LN LQ
Sbjct: 233 KPLPKGPGEIQDEVVPLLKNIYWKELSLLVYVWVAFLIVQIVKTYTKTCSIEYWILNFLQ 292

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           +P+A  V+ +EAV +YKG R I SKG E    +  Q++ YC+ G+ AGMVGGLLGLGGGF
Sbjct: 293 VPIAISVTLFEAVCIYKGTRVIKSKGKEVKNMKIYQILLYCSIGVIAGMVGGLLGLGGGF 352

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           ILGPLFLE+GIPPQV+SAT+ F++ FSSSMSVV+YY L RFPVPYA YF  ++ IAAF G
Sbjct: 353 ILGPLFLEMGIPPQVASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAG 412

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           QHV++++I ILGRASIIIF L+  IF+SA+SLGGVG+  MI ++E+ EYMGF+N+C
Sbjct: 413 QHVVRRIIAILGRASIIIFILASTIFISAISLGGVGIQNMIVKLENHEYMGFENLC 468


>gi|215694369|dbj|BAG89362.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 232/356 (65%), Positives = 273/356 (76%), Gaps = 14/356 (3%)

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW+ITVL
Sbjct: 1   MIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLITVL 60

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT 244
           LI+LFL  ST+AFLKGVETWKKETI KREAAK+LE I        EE    PEY    T 
Sbjct: 61  LIILFLGTSTKAFLKGVETWKKETILKREAAKRLEQI-------AEE----PEYSPLPTG 109

Query: 245 PE---ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 301
           P    E++ P     S+ +N+YWKEFGLL  VW   L LQ+ KNY  TCS  YWVLN LQ
Sbjct: 110 PNATAESKAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLNFLQ 169

Query: 302 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 361
           IPV+ GV+ YE + L +GRR I+SKGDE T  +  QL+ YC  G+ AG+VGGLLGLGGGF
Sbjct: 170 IPVSVGVTMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLGGGF 229

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           I+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYAL+F  ++  AA +G
Sbjct: 230 IMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTVVAFFAAIIG 289

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           QH+++KLI  LGRAS+IIF LSF+IF+SA+SLGGVG++ MI +I   EYMGFDNIC
Sbjct: 290 QHIVRKLINWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNIC 345


>gi|67043799|gb|AAY63998.1| unknown [Brassica napus]
          Length = 348

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 230/350 (65%), Positives = 277/350 (79%), Gaps = 7/350 (2%)

Query: 128 GGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV 187
           G +VS V YNLR RHPTLD+P+IDYDLALL QP+L+LGIS+GVAFNV+F DWM+TVLLI+
Sbjct: 2   GASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISVGVAFNVMFPDWMVTVLLII 61

Query: 188 LFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEE 247
           LFL  ST+AFLKG ETW KETI K EAAK+LE     +G    E +  P   + +T P  
Sbjct: 62  LFLGTSTKAFLKGRETWNKETIEKMEAAKRLE----SDGVSATEVEYMPLPAAPNTNPGN 117

Query: 248 TREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGG 307
               KK +VSIIEN+YWKE GLLV VW + LALQIAK    TCSV YWV+NLLQIPVA G
Sbjct: 118 N---KKREVSIIENVYWKELGLLVFVWIVFLALQIAKKNMPTCSVGYWVINLLQIPVAVG 174

Query: 308 VSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 367
           VS YEAVALY+GRR IAS G  G+ +   QL+ YC+ G+ AG+VGGLLGLGGGFI+GPLF
Sbjct: 175 VSGYEAVALYQGRRIIASNGQGGSNFTIGQLILYCSFGVLAGVVGGLLGLGGGFIMGPLF 234

Query: 368 LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           LELG+PPQVSSATA FA+TFSSSMSVVEYYLLKRFP+PYALY   ++ IAA VGQHV+++
Sbjct: 235 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPIPYALYLVGVATIAALVGQHVVRR 294

Query: 428 LIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           LI +LGRAS+IIF L+  IF+SA+SLGGVG+  M+ +++  EYMGF+N+C
Sbjct: 295 LIAVLGRASLIIFILASTIFISAISLGGVGIVNMMGKLQRHEYMGFENLC 344


>gi|3643608|gb|AAC42255.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197510|gb|AAM15102.1| hypothetical protein [Arabidopsis thaliana]
          Length = 902

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 225/350 (64%), Positives = 265/350 (75%), Gaps = 24/350 (6%)

Query: 128 GGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV 187
           G +VS V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGVAFNVIF DW++TVLLIV
Sbjct: 2   GASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFPDWLVTVLLIV 61

Query: 188 LFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEE 247
           LFL  ST+AFLKG ETW KETI K+EAAK+LE     NG    E +  P   +  T P  
Sbjct: 62  LFLGTSTKAFLKGSETWNKETIEKKEAAKRLE----SNGVSGTEVEYVPLPAAPSTNPGN 117

Query: 248 TREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGG 307
               KK +VSIIEN+YWKE GLLV VW + LALQI+K                 IPVA G
Sbjct: 118 K---KKEEVSIIENVYWKELGLLVFVWIVFLALQISK-----------------IPVAVG 157

Query: 308 VSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 367
           VS YEAVALY+GRR IASKG   + +   QLV YC  GI AG+VGGLLGLGGGFI+GPLF
Sbjct: 158 VSGYEAVALYQGRRIIASKGQGDSNFTVGQLVMYCTFGIIAGIVGGLLGLGGGFIMGPLF 217

Query: 368 LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           LELG+PPQVSSATA FA+TFSSSMSVVEYYLLKRFPVPYALY   ++ IAA+VGQHV+++
Sbjct: 218 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLVGVATIAAWVGQHVVRR 277

Query: 428 LIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           LI  +GRAS+IIF L+ +IF+SA+SLGGVG+  MI +I+  EYMGF+N+C
Sbjct: 278 LIAAIGRASLIIFILASMIFISAISLGGVGIVNMIGKIQRHEYMGFENLC 327


>gi|224068935|ref|XP_002326235.1| predicted protein [Populus trichocarpa]
 gi|222833428|gb|EEE71905.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 226/442 (51%), Positives = 294/442 (66%), Gaps = 15/442 (3%)

Query: 42  KNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGG 101
            + N+   PN    + A    S    VWP ++F WR V+ T+IG  G+ACG+VGGVGGGG
Sbjct: 25  SHPNEKASPNPNSDIFASPYLSTTDKVWPKLEFSWRTVLATVIGLLGSACGTVGGVGGGG 84

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           IFVPML LIVGFD KS+AA+SKCM+   + S+V YNLR  HPT ++P+IDYDLALLFQP+
Sbjct: 85  IFVPMLTLIVGFDTKSAAALSKCMIMAASASSVWYNLRVPHPTREVPIIDYDLALLFQPM 144

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
           L+LGI++GV+ +V+F  W+ITVL+I+LF+  S+R+F KG+E WK+ETI K+E   Q E I
Sbjct: 145 LLLGITLGVSLSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWKEETILKKEMVIQQETI 204

Query: 222 VLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQ 281
           V   G    E   D EY      P   RE K     +  N+ WK   +L  VW   L LQ
Sbjct: 205 VNSRG----ELLIDTEY-----EPLIPREEKSKMQILCFNLKWKRLLILFLVWTSFLLLQ 255

Query: 282 IAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT------KWRA 335
           + KN    CS  YWVL  LQ P+A GV  YEAV LY+  +K  S G+  T      +W  
Sbjct: 256 VIKNDVAVCSTWYWVLFCLQFPIAFGVFGYEAVKLYRENKKRISTGNTETICEASIEWTP 315

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
             ++F   CGI  G VGGLLG GGGF+LGPL LE+G+ P V+SAT+ F + FSSS+SVVE
Sbjct: 316 MHILFCALCGIIGGTVGGLLGSGGGFVLGPLLLEIGVSPHVASATSTFVMMFSSSLSVVE 375

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGG 455
           +YLLKRFP+P+ALY   +S++A F GQ  ++KL+KILGRAS+I+F LS +IFVSAL++GG
Sbjct: 376 FYLLKRFPIPFALYLMGVSVLAGFWGQFFVRKLVKILGRASLIVFILSGVIFVSALTMGG 435

Query: 456 VGLAKMIKRIEHKEYMGFDNIC 477
           VG+   I  I + E+MGF   C
Sbjct: 436 VGIDTSITMIRNHEFMGFLEFC 457


>gi|297827147|ref|XP_002881456.1| hypothetical protein ARALYDRAFT_902780 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327295|gb|EFH57715.1| hypothetical protein ARALYDRAFT_902780 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 224/438 (51%), Positives = 300/438 (68%), Gaps = 15/438 (3%)

Query: 46  DHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVP 105
           D   PN    +S+    S  + +WPD+KF W++V+ T+I F G+ACG+VGGVGGGGIFVP
Sbjct: 32  DTTNPNHHHHLSSSSPLSATEKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVP 91

Query: 106 MLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLG 165
           ML LI+GFD KS+AA+SKCM+ G + S+V YN+R RHPT ++P++DYDLALLFQP+L+LG
Sbjct: 92  MLTLILGFDTKSAAAISKCMIMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLG 151

Query: 166 ISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGN 225
           I++GV+ +V+F  W+ITVL+I+LF+  S+R+F KG+E WK+ET+ K E A+Q   +V   
Sbjct: 152 ITVGVSLSVVFPYWLITVLIIILFVGTSSRSFFKGIEMWKEETLLKNEMAQQRANMVNSR 211

Query: 226 GYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKN 285
           G    E   D EY      P   RE K     I  N+ WK   +LV VW   L +QI KN
Sbjct: 212 G----ELLIDTEY-----EPLYPREEKSELEIIRSNLKWKRLLVLVTVWLAFLLIQIIKN 262

Query: 286 YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGD-----EGT-KWRASQLV 339
               CS +YW+L +LQ PVA  V  +EA+ LY   +K  S G+     E T KW    L+
Sbjct: 263 EIKVCSTIYWILFILQFPVALAVFGFEAIKLYTENKKRLSSGNTECICEATIKWTPLSLI 322

Query: 340 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 399
           F   CG+  G+VGGLLG GGGF+LGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLL
Sbjct: 323 FCGLCGVIGGIVGGLLGSGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLL 382

Query: 400 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 459
           KRFP+PYA+Y  ++SI+A F GQ  ++KL+ IL RASII+F LS +I  SAL++G +G+ 
Sbjct: 383 KRFPIPYAMYLISVSILAGFWGQSFIRKLVAILKRASIIVFVLSGVICASALTMGVIGIE 442

Query: 460 KMIKRIEHKEYMGFDNIC 477
           K IK I + E+MGF   C
Sbjct: 443 KSIKMIHNHEFMGFLGFC 460


>gi|30686851|ref|NP_850267.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|18700129|gb|AAL77676.1| At2g36630/F1O11.26 [Arabidopsis thaliana]
 gi|22137236|gb|AAM91463.1| At2g36630/F1O11.26 [Arabidopsis thaliana]
 gi|330254181|gb|AEC09275.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 459

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 218/423 (51%), Positives = 294/423 (69%), Gaps = 15/423 (3%)

Query: 61  SRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAA 120
           S S  + +WPD+KF W++V+ T+I F G+ACG+VGGVGGGGIFVPML LI+GFD KS+AA
Sbjct: 43  SLSATEKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVPMLTLILGFDTKSAAA 102

Query: 121 VSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWM 180
           +SKCM+ G + S+V YN+R RHPT ++P++DYDLALLFQP+L+LGI++GV+ +V+F  W+
Sbjct: 103 ISKCMIMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLGITVGVSLSVVFPYWL 162

Query: 181 ITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLS 240
           ITVL+I+LF+  S+R+F KG+E WK+ET+ K E A+Q   +V   G    E   D EY  
Sbjct: 163 ITVLIIILFVGTSSRSFFKGIEMWKEETLLKNEMAQQRANMVNSRG----ELLIDTEY-- 216

Query: 241 NDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 300
               P   RE K     I  N+ WK   +LV VW   L +QI KN    CS +YW+L ++
Sbjct: 217 ---EPLYPREEKSELEIIRSNLKWKGLLILVTVWLTFLLIQIVKNEIKVCSTIYWILFIV 273

Query: 301 QIPVAGGVSAYEAVALYKGRRKIASKGD-----EGT-KWRASQLVFYCACGITAGMVGGL 354
           Q PVA  V  +EA  LY   +K  + G+     E T +W    L+F   CG+  G+VGGL
Sbjct: 274 QFPVALAVFGFEASKLYTANKKRLNSGNTECICEATIEWTPLSLIFCGLCGLIGGIVGGL 333

Query: 355 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 414
           LG GGGF+LGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLLKRFP+PYA+Y  ++S
Sbjct: 334 LGSGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPIPYAMYLISVS 393

Query: 415 IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFD 474
           I+A F GQ  ++KL+ IL RASII+F LS +I  SAL++G +G+ K IK I + E+MGF 
Sbjct: 394 ILAGFWGQSFIRKLVAILRRASIIVFVLSGVICASALTMGVIGIEKSIKMIHNHEFMGFL 453

Query: 475 NIC 477
             C
Sbjct: 454 GFC 456


>gi|356570810|ref|XP_003553577.1| PREDICTED: uncharacterized protein LOC100807336 [Glycine max]
          Length = 459

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 224/433 (51%), Positives = 299/433 (69%), Gaps = 23/433 (5%)

Query: 56  VSAPKSRSGY--KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGF 113
           +S P    G+    +WP++K GWR+ + T+IGF G+A G+VGGVGGGGIFVPML LIVGF
Sbjct: 36  ISPPSHDYGFGNDKLWPELKVGWRVALATVIGFLGSAFGTVGGVGGGGIFVPMLTLIVGF 95

Query: 114 DAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
           D KS+AA+SKCM+ G + ++V YNLR  HPT ++P+IDYDLALLFQP+L+LGI++GV  +
Sbjct: 96  DTKSAAALSKCMIMGASTASVWYNLRVPHPTKEVPIIDYDLALLFQPMLMLGITVGVVLS 155

Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECK 233
           V+F  W+ITVL+I+LF+  S+R+F KG + W++ET+ K+E A+Q   +V   G    E  
Sbjct: 156 VVFPYWLITVLIIILFIGSSSRSFFKGTQMWREETLLKKEMARQRATLVNFRG----ELL 211

Query: 234 CDPEYLSNDTTPEETREPKKSKVSI---IENIYWKEFGLLVAVWAIILALQIAKNYTTTC 290
            D EY        E   PK+ K S+     N+ WK   +L+ VW   L LQ+ KN    C
Sbjct: 212 IDTEY--------EQLFPKEEKSSMQIFCFNLKWKRILILMFVWVSFLLLQVIKNDVKIC 263

Query: 291 SVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT------KWRASQLVFYCAC 344
           SV YWVL  LQ P+A  V  YEAV LYKG ++  S G+  +      +W    ++F   C
Sbjct: 264 SVWYWVLFCLQFPIALLVFGYEAVKLYKGHKERVSTGNPESICEASIEWTVLHILFCALC 323

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           GI  G VGGLLG GGGFILGPL +E+G+ PQV+SATA F + FSSS+SVVE+YLLKRFP+
Sbjct: 324 GILGGTVGGLLGSGGGFILGPLLIEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPI 383

Query: 405 PYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKR 464
           PYALY  A+S++A F GQ+ ++KL+ IL RASII+F LS +IF SAL++G VG+ K IK 
Sbjct: 384 PYALYLTAVSVLAGFWGQYFVRKLMVILKRASIIVFILSGVIFASALTMGVVGIDKSIKM 443

Query: 465 IEHKEYMGFDNIC 477
           I+H E+MGF + C
Sbjct: 444 IQHHEFMGFLDFC 456


>gi|255575461|ref|XP_002528632.1| conserved hypothetical protein [Ricinus communis]
 gi|223531921|gb|EEF33735.1| conserved hypothetical protein [Ricinus communis]
          Length = 449

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 224/421 (53%), Positives = 288/421 (68%), Gaps = 23/421 (5%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           S  + VWP+++F WRIV+ T+IGF G+ACG+VGGVGGGGIFVPML LIVGFD KS+AA+S
Sbjct: 43  STTERVWPELEFSWRIVLATVIGFLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSAAAIS 102

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G + S+V YNLR  HPT ++P++DYDLALLFQP+L+LGI++GVA +V+F  W+IT
Sbjct: 103 KCMIMGASASSVWYNLRVPHPTKEVPILDYDLALLFQPMLMLGITVGVASSVVFPYWLIT 162

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VL+I+LF+  S+R+F KGVE WK+ETI K+E AKQ E +V   G    E   D EY    
Sbjct: 163 VLIIILFIGTSSRSFFKGVEMWKEETILKKELAKQQEAVVNSRG----ELLIDTEY---- 214

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
               E   PK+ K  ++E  +     L     ++ L LQ+ KN   TCS  YWVL  LQ 
Sbjct: 215 ----EPLVPKEEKSEMVEKAFCAAICL-----SVFLLLQVIKNDVATCSKWYWVLFCLQF 265

Query: 303 PVAGGVSAYEAVALYKGRRKIASKGDEGT------KWRASQLVFYCACGITAGMVGGLLG 356
           PVA  V  YEAV LYK  +K  S G+  +       W    + F   CGI  G VGGLLG
Sbjct: 266 PVALAVFGYEAVKLYKEHKKRISTGNTESICEASIAWTPMHISFCALCGILGGTVGGLLG 325

Query: 357 LGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII 416
            GGGFILGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLLKRFP+PYALY   +S++
Sbjct: 326 SGGGFILGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPMPYALYLTGVSVL 385

Query: 417 AAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNI 476
           A F GQ  ++KLI IL R S+I+F LS +IF SA+++G VG  K I+ I + E+MGF   
Sbjct: 386 AGFWGQFFVRKLITILKRGSLIVFILSGVIFASAITMGVVGTEKSIRMINNHEFMGFLGF 445

Query: 477 C 477
           C
Sbjct: 446 C 446


>gi|356533415|ref|XP_003535260.1| PREDICTED: uncharacterized protein LOC100788770 [Glycine max]
          Length = 464

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 226/455 (49%), Positives = 309/455 (67%), Gaps = 13/455 (2%)

Query: 29  VSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFG 88
           VSA S     A     + H + N  I + A +     K VWPD++  WR+V+ T+IGF G
Sbjct: 14  VSAFSCAVLSALFLTNHIHHDDNKPITLLASRVSETLK-VWPDLEPSWRLVLATVIGFLG 72

Query: 89  TACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP 148
           +ACG+VGGVGGGGIFVPMLNL++GFD KS+AA+SKCM+ G + S+V YN+R  HPT ++P
Sbjct: 73  SACGTVGGVGGGGIFVPMLNLLLGFDTKSAAALSKCMIMGASASSVWYNVRVPHPTKEVP 132

Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
           ++DYDLALLFQP+L+LGI++GVA +V+F  W+ITVL+I+LF+  S+R+F KG+E W++ET
Sbjct: 133 ILDYDLALLFQPMLMLGITVGVALSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWREET 192

Query: 209 ITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG 268
           I KRE  KQ   +V  +G + +    D +Y      P   +E K +   +  N+ WK   
Sbjct: 193 IFKREKTKQRATLVDSHG-EDKTGIIDTKY-----EPLIPKEEKSTIQILCLNLRWKRIL 246

Query: 269 LLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYK-GRRKIASKG 327
           +L+ VW   L +Q+ KN    CS  YWVL  LQ P+A  V  YEAV LYK  +R+++++ 
Sbjct: 247 VLIVVWVAFLLVQVIKNDVEACSPWYWVLFGLQFPIALLVFGYEAVKLYKEHKRRMSTRN 306

Query: 328 DE-----GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 382
            E       +W A  L F   CGI  G+VGGLLG GGGF+LGPL LE+G+ PQV+SATA 
Sbjct: 307 LECICEASIEWTAMNLAFCSLCGIVGGIVGGLLGSGGGFVLGPLLLEIGVIPQVASATAT 366

Query: 383 FAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           F + FSSS+SVVE+YLLKRFP+PYALY  ++S++A F GQ  +++LI  LGRASII+F L
Sbjct: 367 FVMMFSSSLSVVEFYLLKRFPIPYALYLTSVSVLAGFWGQFFVRRLIACLGRASIIVFIL 426

Query: 443 SFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           S +IF SAL++G VG+   I+ I + E+MGF   C
Sbjct: 427 SGVIFASALTMGVVGIENSIQMINNHEFMGFLGFC 461


>gi|356548305|ref|XP_003542543.1| PREDICTED: uncharacterized protein LOC100804918 [Glycine max]
          Length = 464

 Score =  369 bits (946), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 221/438 (50%), Positives = 302/438 (68%), Gaps = 15/438 (3%)

Query: 47  HVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPM 106
           H + N  + + A +     K VWPD++  WR+++ T+IGF G+ACG+VGGVGGGGIFVPM
Sbjct: 32  HDDDNKAVTLLASRVSETLK-VWPDLEPSWRLLLATVIGFLGSACGTVGGVGGGGIFVPM 90

Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
           LNLI+GFD KS+AA+SKCM+ G + S+V YN+R  HPT ++P++DYDLALLFQP+L+LGI
Sbjct: 91  LNLILGFDTKSAAALSKCMIMGASTSSVWYNVRVPHPTKEVPILDYDLALLFQPMLMLGI 150

Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNG 226
           ++GVA +V+F  W+ITVL+I+LF+  S+R+F KG+E W++ETI KRE   Q   +V   G
Sbjct: 151 TVGVALSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWREETIFKREKTMQRATLVDSQG 210

Query: 227 YQTEECKCDPEYLSNDTTPEETREPKKSKVSII-ENIYWKEFGLLVAVWAIILALQIAKN 285
            + +    D +Y      P+E    KKS + I+  N+ WK   +L+ VW   L +Q+ KN
Sbjct: 211 -EDKTGIIDTKY--EPLIPKE----KKSTMEILCLNLRWKRILVLIVVWVGFLLVQVIKN 263

Query: 286 YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE------GTKWRASQLV 339
               CS  YWVL  LQ+P+A  V  YEAV LYK  ++  + G+         +W A  L 
Sbjct: 264 DVEACSAWYWVLFGLQLPIALLVFGYEAVKLYKEHKRRMNTGNSECICEASIEWTAINLA 323

Query: 340 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 399
           F   CGI  G+VGGLLG GGGF+LGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLL
Sbjct: 324 FCALCGIVGGIVGGLLGSGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLL 383

Query: 400 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 459
           KRFP+PYALY  ++S++A F GQ  +++LI  LGRASII+F LS +IF SAL++G VG+ 
Sbjct: 384 KRFPIPYALYLTSVSVLAGFWGQFFVRRLITCLGRASIIVFILSGVIFASALTMGVVGIE 443

Query: 460 KMIKRIEHKEYMGFDNIC 477
             I+ I + E+MGF   C
Sbjct: 444 NSIQMINNHEFMGFLGFC 461


>gi|225441056|ref|XP_002283988.1| PREDICTED: uncharacterized protein LOC100240792 isoform 1 [Vitis
           vinifera]
 gi|297740033|emb|CBI30215.3| unnamed protein product [Vitis vinifera]
          Length = 469

 Score =  363 bits (933), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 233/452 (51%), Positives = 303/452 (67%), Gaps = 24/452 (5%)

Query: 40  TSKNKNDHVEPN-------IVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACG 92
           T  N N    P+        +++   P   S  + VWP++KF WRIV+ T+IGF G+A G
Sbjct: 25  TDPNGNHTYPPSNLSSLNTTLLQNPFPSVSSSSEKVWPELKFSWRIVLATVIGFLGSAFG 84

Query: 93  SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDY 152
           +VGGVGGGGIFVPML LIVGFD KS+AA+SKCM+ G + S+V YNLR  HPT + P+IDY
Sbjct: 85  TVGGVGGGGIFVPMLTLIVGFDTKSAAALSKCMIMGASTSSVWYNLRVNHPTKEAPIIDY 144

Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
           DLALLFQP+L+LGI++GVA +V+F  W+IT+L+I+LF+  STR+F K V+ W +ET+ K+
Sbjct: 145 DLALLFQPMLMLGITVGVALSVVFPYWLITILIIILFMGTSTRSFFKAVQMWNEETLFKK 204

Query: 213 EAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSII-ENIYWKEFGLLV 271
           E  +Q   +V  +G    E   D EY  +   P E    +KS + I+  N+ WK   +LV
Sbjct: 205 ELEEQRRTMVNSHG----ELLIDAEY--DPLIPNE----EKSWIQILCSNLRWKRILVLV 254

Query: 272 AVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKG-RRKIASKGDEG 330
            VW I L LQI KN +T CS  YWVL LLQ PVA  V  YEAV L+K  +++I S   E 
Sbjct: 255 TVWVIFLLLQIFKNDSTVCSAWYWVLFLLQFPVAVVVFGYEAVKLHKEYKKRIISGNTEA 314

Query: 331 T-----KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 385
                 +W    L F   CGI  G VGGLLG GGGFILGPL LE+G+ PQV+SATA F +
Sbjct: 315 VCEANIEWTGLNLAFCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASATATFVM 374

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 445
            FSSS+SVVE+YLLKRFP+PYALY  ++S++A F GQ  ++KL+  L RASIIIF LS +
Sbjct: 375 MFSSSLSVVEFYLLKRFPIPYALYLISVSVVAGFWGQLFIRKLVAFLRRASIIIFILSGV 434

Query: 446 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           IF SAL++G VG  + I  I + E+MGF + C
Sbjct: 435 IFASALTMGVVGTKESITMIHNHEFMGFLSFC 466


>gi|449451245|ref|XP_004143372.1| PREDICTED: uncharacterized protein LOC101206149 [Cucumis sativus]
 gi|449523213|ref|XP_004168618.1| PREDICTED: uncharacterized protein LOC101229265 [Cucumis sativus]
          Length = 456

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 217/415 (52%), Positives = 287/415 (69%), Gaps = 15/415 (3%)

Query: 69  WPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTG 128
           WPD++  WR+V  T+IGF G+ACG+VGGVGGGGIFVPML LI+GFD KS+AA+SKCM+ G
Sbjct: 48  WPDLEPSWRLVAATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMG 107

Query: 129 GAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
            + S+V YNLR  HPT D+P+ID+DLALLFQP+L+LGI++GV+ +V+F  W+IT+L+I+L
Sbjct: 108 ASTSSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITILIIIL 167

Query: 189 FLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEET 248
           F+  S+R+F KG+E WK+ETI K+E AK+ E +V  +G    E   D EY      P   
Sbjct: 168 FIGTSSRSFFKGIEMWKEETILKKEFAKRCETVVNSHG----ELLIDVEY-----DPLIP 218

Query: 249 REPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGV 308
           +E K     +  N+ WK   +L AVW   L LQ+ KN    CS+ YWV+  LQ P+A  V
Sbjct: 219 KEQKTELELMCFNLRWKRTSILFAVWISFLILQVVKNDVAACSIWYWVVFFLQFPIAIVV 278

Query: 309 SAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 362
             YEA  LYK  +K    G      +    W  S L F   CGI  G VGGLLG GGGF+
Sbjct: 279 FGYEARKLYKEHKKRMEAGNLEQICEASIGWTGSHLAFCALCGIVGGTVGGLLGSGGGFV 338

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           LGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLL RFP+PYALY  ++S++A F GQ
Sbjct: 339 LGPLLLEIGVVPQVASATATFVMMFSSSLSVVEFYLLNRFPIPYALYLTSVSVLAGFWGQ 398

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
             ++KLI IL RAS+I+F LS +IF SA+++G VG+ K I  I++ E+MGF + C
Sbjct: 399 FFVRKLITILRRASLIVFILSGVIFASAITMGIVGVTKSITMIQNHEFMGFLDFC 453


>gi|326524532|dbj|BAK00649.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 474

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 210/426 (49%), Positives = 285/426 (66%), Gaps = 7/426 (1%)

Query: 58  APKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKS 117
           AP+  S    VWPD++F WR+VV T++GF G+A G+VGGVGGGGIFVP+LNL++GFD KS
Sbjct: 49  APRLSSRSARVWPDLEFNWRLVVATVVGFLGSAFGTVGGVGGGGIFVPLLNLVLGFDTKS 108

Query: 118 SAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFA 177
           +AA+SKCM+ G + S+V YNL+  HPT + PV+DY LALLFQP+L+LGI+IGV  +V+F 
Sbjct: 109 AAALSKCMIMGASASSVWYNLQVSHPTKEAPVLDYKLALLFQPMLMLGITIGVELSVVFP 168

Query: 178 DWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPE 237
            W+ITVL+I+LF+  S+R+F KG+  W++ET    E  ++           T +   DP 
Sbjct: 169 YWLITVLIIILFVGTSSRSFYKGILMWREETRILMETRERETQSKSSGASATNDVVLDPS 228

Query: 238 YLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVL 297
           Y +    P+     K    +++ N+ WK   +L+AVW+  L LQI KN + TCS  YWV+
Sbjct: 229 Y-AEPLLPQSKPTQKSGLETLMFNLRWKHILVLLAVWSSFLVLQILKNNSKTCSTFYWVI 287

Query: 298 NLLQIPVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGMV 351
           N+ QIP A  V  ++AV L +  R     G      +   +W  +QL+F   CG+  G V
Sbjct: 288 NIFQIPAAVSVFVWKAVQLCRESRARRMNGNLECVCEANIEWSPTQLIFCAFCGLLGGTV 347

Query: 352 GGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFF 411
           GGLLG GGGFILGPL LELG  PQV+SATA F + FSSS+SVVE+Y L RFP+PYALY  
Sbjct: 348 GGLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLGRFPIPYALYLI 407

Query: 412 ALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYM 471
            +SI+A F GQ +++K++ +L RAS+I+F LS +IF SAL++G VG  K I  I   EYM
Sbjct: 408 FISILAGFWGQCLVRKIVLVLKRASLIVFILSSVIFASALTMGVVGTEKSISMINKHEYM 467

Query: 472 GFDNIC 477
           GF + C
Sbjct: 468 GFLSFC 473


>gi|115455071|ref|NP_001051136.1| Os03g0726500 [Oryza sativa Japonica Group]
 gi|41469272|gb|AAS07154.1| expressed protein [Oryza sativa Japonica Group]
 gi|50428734|gb|AAT77085.1| expressed protein [Oryza sativa Japonica Group]
 gi|108710852|gb|ABF98647.1| expressed protein [Oryza sativa Japonica Group]
 gi|113549607|dbj|BAF13050.1| Os03g0726500 [Oryza sativa Japonica Group]
 gi|215697621|dbj|BAG91615.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 475

 Score =  357 bits (916), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 215/430 (50%), Positives = 287/430 (66%), Gaps = 23/430 (5%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
            SRS    VWPD+ F WR+VV T++GF G+A G+VGGVGGGGIFVPMLNL+VGFD KS+A
Sbjct: 55  SSRSA--RVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLLVGFDTKSAA 112

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+ G + S+V YNL+  HPT + PVIDY LALLFQP+L+LGI+IGV  +VIF  W
Sbjct: 113 ALSKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYW 172

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL 239
           +ITVL+I+LF+  S+R+F KG+  WK ET  + E  ++ E           +  C    +
Sbjct: 173 LITVLIIILFIGTSSRSFYKGILMWKDETRIQMETREREE---------ESKSSCAARDV 223

Query: 240 SNDTTPEETR--EPKKSKVSIIE----NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL 293
             D + EE    +P+  + S +E    N+ WK   +L+ VW+  L LQI KN + +CS  
Sbjct: 224 VIDPSCEEPLLCQPQPKEKSALETFLFNLRWKNILVLMTVWSSFLVLQIFKNNSQSCSTF 283

Query: 294 YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGIT 347
           YWV+N+LQ+PVA  V  +E V L +  R     G      +   +W  +QL+F   CG+ 
Sbjct: 284 YWVINILQVPVALSVFLWEGVQLCRESRARRMDGNWECVCEASIEWSPAQLIFCAFCGLL 343

Query: 348 AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 407
            G VGGLLG GGGFILGPL LELG  PQV+SATA F + FSSS+SVVE+Y L RFP+P+A
Sbjct: 344 GGTVGGLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLNRFPIPFA 403

Query: 408 LYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEH 467
           +Y   +SI+A F GQ +++KL+ +L RAS+I+F LS +IF SAL++G VG  K I  I +
Sbjct: 404 VYLICISILAGFWGQSLVRKLVHVLKRASLIVFILSSVIFASALTMGVVGTQKSISMINN 463

Query: 468 KEYMGFDNIC 477
            EYMGF + C
Sbjct: 464 HEYMGFLDFC 473


>gi|125545587|gb|EAY91726.1| hypothetical protein OsI_13369 [Oryza sativa Indica Group]
          Length = 475

 Score =  356 bits (914), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 215/430 (50%), Positives = 287/430 (66%), Gaps = 23/430 (5%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
            SRS    VWPD+ F WR+VV T++GF G+A G+VGGVGGGGIFVPMLNL+VGFD KS+A
Sbjct: 55  SSRSA--RVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLLVGFDTKSAA 112

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+ G + S+V YNL+  HPT + PVIDY LALLFQP+L+LGI+IGV  +VIF  W
Sbjct: 113 ALSKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYW 172

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL 239
           +ITVL+I+LF+  S+R+F KG+  WK ET  + E  ++ E           +  C    +
Sbjct: 173 LITVLIIILFIGTSSRSFYKGILMWKDETRIQMETREREE---------ESKSSCAARDV 223

Query: 240 SNDTTPEETR--EPKKSKVSIIE----NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL 293
             D + EE    +P+  + S +E    N+ WK   +L+ VW+  L LQI KN + +CS  
Sbjct: 224 VIDPSCEEPLLCQPQPKEKSALETFLFNLRWKNILVLMTVWSSFLVLQIFKNNSQSCSTF 283

Query: 294 YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGIT 347
           YWV+N+LQ+PVA  V  +E V L +  R     G      +   +W  +QL+F   CG+ 
Sbjct: 284 YWVINILQVPVALSVFLWEGVQLCRESRARRMDGNWECVCEASIEWSPAQLIFCAFCGLL 343

Query: 348 AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 407
            G VGGLLG GGGFILGPL LELG  PQV+SATA F + FSSS+SVVE+Y L RFP+P+A
Sbjct: 344 GGTVGGLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLNRFPIPFA 403

Query: 408 LYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEH 467
           +Y   +SI+A F GQ +++KL+ +L RAS+I+F LS +IF SAL++G VG  K I  I +
Sbjct: 404 VYLICISILAGFWGQSLVRKLVHVLKRASLIVFILSSVIFASALTMGVVGTQKSISMINN 463

Query: 468 KEYMGFDNIC 477
            EYMGF + C
Sbjct: 464 HEYMGFLDFC 473


>gi|242033121|ref|XP_002463955.1| hypothetical protein SORBIDRAFT_01g009520 [Sorghum bicolor]
 gi|241917809|gb|EER90953.1| hypothetical protein SORBIDRAFT_01g009520 [Sorghum bicolor]
          Length = 482

 Score =  351 bits (900), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 219/458 (47%), Positives = 300/458 (65%), Gaps = 32/458 (6%)

Query: 33  SNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACG 92
           S   P AT+  +ND V   + +   +P+       VWPD+ F WR+VV T++GF G+A G
Sbjct: 43  SAADPPATAA-RND-VAARLRLSSRSPR-------VWPDLAFNWRVVVATVVGFLGSAFG 93

Query: 93  SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDY 152
           +VGGVGGGGIFVPMLNL+VGFD KS+AA+SKCM+ G + S+V YNL+  HPT + PVIDY
Sbjct: 94  TVGGVGGGGIFVPMLNLVVGFDTKSAAALSKCMIMGASASSVWYNLQVSHPTKEAPVIDY 153

Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET---I 209
            LALLFQP+L+LGI+IGV  +VIF  W+ITVL+I+LF+  S+R+F KG+  W++ET   +
Sbjct: 154 KLALLFQPMLMLGITIGVELSVIFPYWLITVLIIILFVGTSSRSFYKGILMWREETRILM 213

Query: 210 TKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIE----NIYWK 265
             RE  +Q +     +     +   DP Y     T     +P+  K S +E    N+ WK
Sbjct: 214 ETREREEQSK-----STCAAGDVIIDPSYEEPLLT-----QPQPKKKSALETFLFNLRWK 263

Query: 266 EFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS 325
              +L++VW+  L LQ+ KN   +CS  YWV+N+LQ+PVA  V  +EA+ L +  R    
Sbjct: 264 NILVLMSVWSSFLVLQVLKNNAKSCSTFYWVVNILQVPVAVSVFLWEAMQLCRESRARRM 323

Query: 326 KG------DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 379
            G      +   +W  +QL+F   CG+  G VGGLLG GGGFILGPL LELG  PQV+SA
Sbjct: 324 NGNLECVCEASIEWSPAQLIFCAFCGLLGGTVGGLLGSGGGFILGPLLLELGCIPQVASA 383

Query: 380 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 439
           TA F + FSSS+SVVE+Y L RFP+P+A Y   +SI+A F GQ +++K++ +L RAS+I+
Sbjct: 384 TATFVMMFSSSLSVVEFYFLHRFPLPFAGYLIFISILAGFWGQCLVRKIVHVLKRASVIV 443

Query: 440 FTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           F LS +IF SAL++G VG  K I  I++ EYMGF + C
Sbjct: 444 FILSSVIFASALTMGVVGTQKSISMIKNHEYMGFLDFC 481


>gi|293331891|ref|NP_001168709.1| uncharacterized protein LOC100382501 [Zea mays]
 gi|223950337|gb|ACN29252.1| unknown [Zea mays]
 gi|238009716|gb|ACR35893.1| unknown [Zea mays]
 gi|414872565|tpg|DAA51122.1| TPA: hypothetical protein ZEAMMB73_068409 [Zea mays]
          Length = 479

 Score =  348 bits (894), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 215/426 (50%), Positives = 291/426 (68%), Gaps = 16/426 (3%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
            SRS    VWPD+ F WR+VV T++GF G+A G+VGGVGGGGIFVPMLNL+VGFD KS+A
Sbjct: 61  SSRS--PRVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLVVGFDTKSAA 118

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+ G + S+V YNL+  HPT + PVIDY LALLFQP+L+LGI+IGV  +VIF  W
Sbjct: 119 ALSKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYW 178

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL 239
           +ITVL+I+LF+  S+R+F KG+  W++E+    E   + E     +     +   DP Y 
Sbjct: 179 LITVLIIILFVGTSSRSFYKGILMWREESRILMETRGREEQS--KSTCAASDVVIDPSY- 235

Query: 240 SNDTTPEETR-EPKKSK-VSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVL 297
                P  T+ +PKKS   + + N+ WK   +L++VW++ L LQ+ KN + +CS  YWV+
Sbjct: 236 ---GEPLLTQPQPKKSAWETFMFNLRWKNILVLMSVWSLFLVLQVLKNNSKSCSSFYWVI 292

Query: 298 NLLQIPVAGGVSAYEAVALYK---GRR---KIASKGDEGTKWRASQLVFYCACGITAGMV 351
           N+LQ+PVA  V  +EA+ L +    RR    +    +   +W  +QL+F   CG+  G V
Sbjct: 293 NVLQVPVAVSVFLWEAMQLCRESHARRMNGNLECVCEASIEWSPAQLIFCAFCGLLGGTV 352

Query: 352 GGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFF 411
           GGLLG GGGFILGPL LELG  PQV+SATA F + FSSS+SVVE+Y L RFP+P+A Y  
Sbjct: 353 GGLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLHRFPLPFAGYLI 412

Query: 412 ALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYM 471
            +SI+A F GQ +++K++ +L RAS+I+F LS +IFVSAL++G VG  K I  I + EYM
Sbjct: 413 FISILAGFWGQCLVRKIVHVLKRASLIVFILSSVIFVSALTMGVVGTQKSISMINNHEYM 472

Query: 472 GFDNIC 477
           GF N C
Sbjct: 473 GFLNFC 478


>gi|357117223|ref|XP_003560372.1| PREDICTED: uncharacterized protein LOC100844938 [Brachypodium
           distachyon]
          Length = 479

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 214/433 (49%), Positives = 289/433 (66%), Gaps = 29/433 (6%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
            SRS    VWPD++F WR+VV TI+GF G+A G+VGGVGGGGIFVP+LNL++GFD KS+A
Sbjct: 60  SSRSA--RVWPDLEFNWRLVVATIVGFLGSAFGTVGGVGGGGIFVPLLNLVLGFDTKSAA 117

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+ G + S+V YNL+  HPT + PV+DY LALLFQP+L+LGI+IGV  +V+F  W
Sbjct: 118 ALSKCMIMGASASSVWYNLQVSHPTKEAPVLDYKLALLFQPMLMLGITIGVELSVVFPYW 177

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKET-----ITKREAAKQ----LELIVLGNGYQTE 230
           +IT+L+I+LF+  S+R+F KG+  W++ET       +REA  +    +  +VLG  Y+  
Sbjct: 178 LITILIIILFVGTSSRSFYKGILMWREETRILMETQEREAQSKSSSAVSDVVLGTSYE-- 235

Query: 231 ECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTC 290
                 E L     P E    K    +++ N+ WK   +L+AVW+  L LQI KN + TC
Sbjct: 236 ------EPLLPQPKPTE----KSGLETLLFNLRWKNILVLMAVWSSFLFLQILKNNSKTC 285

Query: 291 SVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCAC 344
           S  YWV+N+LQ+P A  V  ++AV L +  R     G      +   +W  +QLVF   C
Sbjct: 286 STFYWVINILQVPAAVSVFLWKAVQLCRESRARRMNGNLECVCEASIEWSPTQLVFCAFC 345

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+  G VGGLLG GGGFILGPL LELG  PQV+SATA F + FSSS+SVVE+Y L RFP+
Sbjct: 346 GLLGGTVGGLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLHRFPI 405

Query: 405 PYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKR 464
           PYA Y   +SI+A F GQ +++K++ +L RAS+I+F LS +IF SA+++G VG  K I  
Sbjct: 406 PYAAYLIFISILAGFWGQCLVRKIVLVLKRASLIVFILSSVIFASAVTMGVVGTEKSISM 465

Query: 465 IEHKEYMGFDNIC 477
           I + EYMGF   C
Sbjct: 466 INNHEYMGFLGFC 478


>gi|242044420|ref|XP_002460081.1| hypothetical protein SORBIDRAFT_02g022480 [Sorghum bicolor]
 gi|241923458|gb|EER96602.1| hypothetical protein SORBIDRAFT_02g022480 [Sorghum bicolor]
          Length = 383

 Score =  309 bits (792), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 202/406 (49%), Positives = 256/406 (63%), Gaps = 59/406 (14%)

Query: 72  IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
           ++FGWRIV+G++IGFFG A GSVGGVGGGG+F+PML LI+GFD KSSAA+SKCM+ G +V
Sbjct: 33  MEFGWRIVLGSLIGFFGAAFGSVGGVGGGGVFLPMLALIIGFDPKSSAAMSKCMIMGASV 92

Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
           S V  NL+ +HP+LDIP+IDYDLA+L QP+L+LGISIGV FNV+F DW++T LLI+LFLV
Sbjct: 93  STVYCNLKLKHPSLDIPLIDYDLAMLIQPMLMLGISIGVIFNVVFPDWLVTALLIILFLV 152

Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREP 251
            ST+AFLKGVETWKKETI KR                                       
Sbjct: 153 TSTKAFLKGVETWKKETIKKR--------------------------------------- 173

Query: 252 KKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAY 311
              K S+++N+YWKEFGLL  VW   L LQIA  Y+  CS LYW LN LQIPVA GV+ Y
Sbjct: 174 ---KSSLLKNVYWKEFGLLAFVWLAFLGLQIANKYSAYCSALYWALNSLQIPVAVGVTMY 230

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           EA  L  G++ ++SKG +    ++ QL  YC  GI AG+VGGLLG+GGGFI+GPLFLELG
Sbjct: 231 EAHGLATGKKVLSSKGSQQNALKSCQLFVYCQFGIIAGLVGGLLGVGGGFIMGPLFLELG 290

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           IPPQ     AL      S+ +   ++L         L F  L     F   H+L KL +I
Sbjct: 291 IPPQ----PALCEEVDQSARACFTHHL--------HLGFHDLC-QRNFSWYHIL-KLAEI 336

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
                     L   I    L  GGVG++ ++ +IE  +YMGF+++C
Sbjct: 337 NQNP---YGALDSYINNHRLPAGGVGISNIVHKIECHQYMGFESLC 379


>gi|217075755|gb|ACJ86237.1| unknown [Medicago truncatula]
          Length = 238

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 158/251 (62%), Positives = 191/251 (76%), Gaps = 14/251 (5%)

Query: 72  IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
           ++FGW+IV+G+ IGF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+SKCM+ G A+
Sbjct: 1   MEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAISKCMIMGAAL 60

Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
           S V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGV FNV+F DW++T+LLIVLFL 
Sbjct: 61  STVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVTILLIVLFLG 120

Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREP 251
            ST+AF KG+ETW KETI K+EAA++            +E     EY +  T P    E 
Sbjct: 121 TSTKAFFKGLETWNKETIMKKEAARR------------QESNGSGEYKALPTGPNVAIE- 167

Query: 252 KKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQIPVAGGVSA 310
           K++KVSIIEN+YWKEFGLL  VW   LALQIAK NYTTTCS  YW+LNLLQIP+A GV+A
Sbjct: 168 KENKVSIIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQIPIAVGVTA 227

Query: 311 YEAVALYKGRR 321
           YE +   +G  
Sbjct: 228 YEQLHCSQGEE 238


>gi|226503253|ref|NP_001145740.1| uncharacterized protein LOC100279247 [Zea mays]
 gi|219884241|gb|ACL52495.1| unknown [Zea mays]
 gi|414885095|tpg|DAA61109.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
          Length = 291

 Score =  277 bits (709), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 146/240 (60%), Positives = 180/240 (75%), Gaps = 6/240 (2%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y+HVWP ++FGW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 58  NSYQHVWPPMEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 117

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G +VS V YNL+ +HPTLD+PVIDYDLA+L QP+L+LGISIGV FNVIF DW++T
Sbjct: 118 KCMIMGASVSTVYYNLKLKHPTLDMPVIDYDLAVLMQPMLMLGISIGVIFNVIFPDWLVT 177

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VLLI+LFL  ST+AFLKG+ETWKKET+ +REAAK LE    G   +       P+  +N 
Sbjct: 178 VLLIILFLGTSTKAFLKGIETWKKETVIQREAAKLLEQTA-GEEAEYAALPTGPDVAANK 236

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
                   P     S+I+NI+WK+ GLL  VW   L LQ+ KNYT TCS  YWVLNLLQ+
Sbjct: 237 KALTSDEAP-----SLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQV 291


>gi|388514887|gb|AFK45505.1| unknown [Lotus japonicus]
          Length = 290

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 153/287 (53%), Positives = 203/287 (70%), Gaps = 10/287 (3%)

Query: 193 STRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPE--ETRE 250
           S +AF KGVETWKKETI K E    LE         T  C  D +Y     +P+    ++
Sbjct: 8   SIKAFFKGVETWKKETIIKEETVMLLE--------STAPCSGDEQYNLLPGSPDGGPQKK 59

Query: 251 PKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSA 310
            +K +VSI+ NIYWK+FGLL  VW   L LQIAK YT TCS   W++ LLQIPV+ GV  
Sbjct: 60  GRKHEVSILGNIYWKQFGLLSFVWLAYLLLQIAKTYTATCSTTCWMIILLQIPVSVGVFL 119

Query: 311 YEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL 370
           Y+A+ LY+G   I++KGD+  +W+   L+   AC + AG++GGLLG+GGGF++GPLFLEL
Sbjct: 120 YKAIGLYQGWSVISAKGDQAIQWKLHYLILTSACALLAGIIGGLLGIGGGFVMGPLFLEL 179

Query: 371 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           GI PQV+SATA F +TFS+S+SVV+YYLL RFPVPYALY   ++ IAA+ GQ ++ KL+ 
Sbjct: 180 GIAPQVASATATFGMTFSASISVVQYYLLNRFPVPYALYLTLVAAIAAYRGQFIIDKLVN 239

Query: 431 ILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +  RAS+IIFTL+F IFVSA+ LGG G++ MI +I+  EYMGF+++C
Sbjct: 240 MFQRASLIIFTLAFTIFVSAIVLGGEGISDMIGQIQRSEYMGFEDLC 286


>gi|302816689|ref|XP_002990023.1| hypothetical protein SELMODRAFT_184956 [Selaginella moellendorffii]
 gi|300142334|gb|EFJ09036.1| hypothetical protein SELMODRAFT_184956 [Selaginella moellendorffii]
          Length = 356

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 171/380 (45%), Positives = 238/380 (62%), Gaps = 37/380 (9%)

Query: 109 LIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISI 168
           L+ GFDAK++ A+SKCM+     S V YNL  R    +  +IDYDLALLFQP+L+LGIS+
Sbjct: 2   LVCGFDAKTAPALSKCMIFAQCASTVAYNLGLRSSQTNSSLIDYDLALLFQPMLLLGISV 61

Query: 169 GVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQ 228
           GV FNV+F +W+IT+LLIV+ L +++RAF KG+ETWKKET  KR       LI+ G+   
Sbjct: 62  GVTFNVLFPNWLITLLLIVVSLAMASRAFSKGLETWKKETNEKR-------LILEGS--- 111

Query: 229 TEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTT 288
                  P   +N TT +          S+   + WK+  LL AVW +I  LQ+ K YT 
Sbjct: 112 ---LTPGP---ANFTTLD----------SLWTTVEWKKLSLLFAVWCLITGLQVLKAYTA 155

Query: 289 TCSVLYWVLNLLQIPVAGGVSAYEAVAL-----YKGRRKIASKGDEGTKW------RASQ 337
            CS  +W+ N+LQ PV   V+  +A+ +     YK R  +  +  E +        RA  
Sbjct: 156 NCSTAFWIYNILQAPVTLAVTVTQALRMREHSSYKLRESLLDESSESSSASLKAAGRALD 215

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           +  Y   G+ AG + GLLG+GGG  LGP+ LE G+PPQ +SATA  A+ FSSS+SVVE+Y
Sbjct: 216 VFRYVFFGVLAGSIAGLLGVGGGATLGPIMLEFGVPPQTASATATLAMLFSSSLSVVEFY 275

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVG 457
            L R  V YALYF A+ I++AF+ Q +++KL+ +LGR S+I FTL F+I VS +SLGGVG
Sbjct: 276 FLGRIKVSYALYFGAICIVSAFLSQKLVQKLLNLLGRTSVITFTLVFVIVVSVISLGGVG 335

Query: 458 LAKMIKRIEHKEYMGFDNIC 477
           +   I+ ++  +YMGF ++C
Sbjct: 336 IVDSIQELKRGKYMGFGSLC 355


>gi|357440251|ref|XP_003590403.1| hypothetical protein MTR_1g061710 [Medicago truncatula]
 gi|355479451|gb|AES60654.1| hypothetical protein MTR_1g061710 [Medicago truncatula]
          Length = 558

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 155/318 (48%), Positives = 213/318 (66%), Gaps = 13/318 (4%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           SG   +WP+++  W++V+ T+IGFFG+ACG+VGGVGGGGIFVPMLNL++GFD KS+AA+S
Sbjct: 49  SGTDRIWPELEPSWKLVLATVIGFFGSACGTVGGVGGGGIFVPMLNLLLGFDTKSAAALS 108

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           KCM+ G + S+V YNLR  HPT  +P++DYDLALLFQP+L+LGI++GV  +V+F  W+IT
Sbjct: 109 KCMIMGASASSVWYNLRVAHPTKQVPILDYDLALLFQPMLMLGITVGVTLSVVFPFWLIT 168

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND 242
           VL+I+LF+  S R+F KG+E W+ ET+ K+E A+Q  L V  NG + +    D ++    
Sbjct: 169 VLIIILFIGTSLRSFFKGIEMWRVETLLKKETAEQAAL-VNSNG-EFKAVLVDTKF---- 222

Query: 243 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 302
             P   +E K +   +  N+ WK   +L+++W   L +QI K +   CSV YWVL  LQ 
Sbjct: 223 -EPLIPKEQKSTMQILCFNLRWKRILVLLSIWFSFLLIQIIKTHVEDCSVWYWVLFGLQF 281

Query: 303 PVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGMVGGLLG 356
           P+A  V  +EAV LYK  R     G      +   +W A+ L F    GI  G+VGGLLG
Sbjct: 282 PIAFLVFGFEAVKLYKDHRMRMGTGSLEYICEASIEWSATSLAFCAMSGIVGGIVGGLLG 341

Query: 357 LGGGFILGPLFLELGIPP 374
            GGGFILGPL LE+G+ P
Sbjct: 342 SGGGFILGPLLLEIGVIP 359



 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 57/132 (43%), Positives = 77/132 (58%), Gaps = 28/132 (21%)

Query: 376 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 435
           V+SATA F + FSSS+SV E+Y+LKRFP+PYALY   +S++A F GQ  ++KL+  LGRA
Sbjct: 427 VASATATFVMMFSSSLSVFEFYILKRFPMPYALYLITVSVLAGFWGQFFVRKLVAFLGRA 486

Query: 436 SIIIFTLSFIIFVSALSLGG----------------------------VGLAKMIKRIEH 467
           S+I+F LS +IF SAL++G                             VG  K I+ I +
Sbjct: 487 SVIVFILSGVIFASALTMGNNGNLPQFADLYKMILAQISMLYIFISGIVGTEKSIEMIHN 546

Query: 468 KEYMGFDNICYT 479
            E+MGF   C T
Sbjct: 547 HEFMGFLGFCST 558


>gi|302808025|ref|XP_002985707.1| hypothetical protein SELMODRAFT_446339 [Selaginella moellendorffii]
 gi|300146616|gb|EFJ13285.1| hypothetical protein SELMODRAFT_446339 [Selaginella moellendorffii]
          Length = 392

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 185/407 (45%), Positives = 242/407 (59%), Gaps = 24/407 (5%)

Query: 78  IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
           +V+G+++ F G    +  GVGGGG+FVP+  L++GFD K+SAA+SK M+ GGAV++ +YN
Sbjct: 1   MVLGSVVAFLGAMFANAAGVGGGGLFVPLFRLLIGFDVKTSAALSKSMIMGGAVASALYN 60

Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
           L  +HP LD P+IDYDLALL QP+L+LGISIGV  NV+F DW++TVLL+ +   ++ + F
Sbjct: 61  LPSKHPVLDKPLIDYDLALLIQPMLLLGISIGVMCNVMFPDWILTVLLVAVLTGMAFKTF 120

Query: 198 LKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL--SNDTTPEETREPKKSK 255
            KG  TW  E+  K       E   + N         +   L  S    PE         
Sbjct: 121 NKGACTWSTESEQKAHCCDTGEEGFVSNSRTEPSSSLEQGLLAKSEQCAPE--------- 171

Query: 256 VSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVA 315
             +  +I W    LL  VW + L LQ+ K+   TC  LYW+L +LQ PV+ G +A     
Sbjct: 172 -GLPSSIKWANICLLCTVWTVYLILQLLKSGAATCGRLYWILTVLQAPVSIGATAI---- 226

Query: 316 LYKGRRKIASKGDEGTKWRAS-----QLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL 370
              G  +I  +GD G +  AS     QL  Y    + AG+VGGLLG+GGG IL PLFLEL
Sbjct: 227 ---GVWRIYRRGDFGKEKPASGPTCEQLFLYPLYAVLAGVVGGLLGIGGGMILAPLFLEL 283

Query: 371 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           GI PQV+SAT  F + FSSSMSVVE+YLL R PV +A YF  L  IAA VG H  K  I+
Sbjct: 284 GIIPQVTSATTTFIVVFSSSMSVVEFYLLGRLPVRFAAYFTTLCGIAALVGLHATKLFIR 343

Query: 431 ILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
             GR S IIF L+ II  SA+ LG +G    I+R    EYMGF ++C
Sbjct: 344 RYGRTSFIIFILAAIIGSSAIILGIIGGFADIRRYIAGEYMGFHSLC 390


>gi|16660290|gb|AAL27555.1|AF420410_1 hypothetical protein [Musa acuminata AAA Group]
          Length = 238

 Score =  241 bits (614), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 136/196 (69%), Positives = 172/196 (87%), Gaps = 1/196 (0%)

Query: 278 LALQIAK-NYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRAS 336
           L LQ+ K NYT+TCS+ YW+LNLLQ+PV+ GVS YEAV+LY+G+R I+SKG EGT +   
Sbjct: 2   LILQVLKQNYTSTCSLWYWILNLLQVPVSLGVSGYEAVSLYRGKRIISSKGLEGTDFTVI 61

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           QLVFYC  G+ AG+VGGLLGLGGGFILGP+FLELG+PPQVSSATA FA+TFSSSMSVVEY
Sbjct: 62  QLVFYCLIGVLAGVVGGLLGLGGGFILGPVFLELGVPPQVSSATATFAMTFSSSMSVVEY 121

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 456
           YLLKRFP+PYALYF +++++AAFVGQH++K+LI+ILGRAS+IIF L+  IF+SA+SLGGV
Sbjct: 122 YLLKRFPIPYALYFVSVALVAAFVGQHLVKRLIEILGRASLIIFILASTIFISAISLGGV 181

Query: 457 GLAKMIKRIEHKEYMG 472
           G++ M+++I+H E  G
Sbjct: 182 GISNMVQKIQHHESWG 197


>gi|302785269|ref|XP_002974406.1| hypothetical protein SELMODRAFT_414563 [Selaginella moellendorffii]
 gi|300158004|gb|EFJ24628.1| hypothetical protein SELMODRAFT_414563 [Selaginella moellendorffii]
          Length = 380

 Score =  241 bits (614), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 177/384 (46%), Positives = 230/384 (59%), Gaps = 26/384 (6%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           +FVP+  L++GFD K+SAA+SK M+ GGAV++ +YNL  +HP LD P+IDYDLALL QP+
Sbjct: 13  LFVPLFRLLIGFDVKTSAALSKSMIMGGAVASALYNLPSKHPVLDKPLIDYDLALLIQPM 72

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
           L+LGISIGV  NV+F DW++TVLL+ +   ++ + F KG  TW  E+  K       E  
Sbjct: 73  LLLGISIGVMCNVMFPDWILTVLLVAVLTAMAFKTFNKGACTWSTESEQKAHCCDTGEE- 131

Query: 222 VLGNGYQTEECKCDPEYL---SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL 278
             G+  +TE      + L   S    PE           +  +I W    LL  VW + L
Sbjct: 132 DFGSNSRTEPSSSLEQGLLAKSEQCAPE----------GLSSSIKWANICLLCTVWTVYL 181

Query: 279 ALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRAS-- 336
            LQ+ K+   TC  LYW+LN+LQ PV+ G +A        G  +I  +GD G +  A+  
Sbjct: 182 ILQLLKSGAATCGRLYWILNVLQAPVSIGATAI-------GVWRIYRRGDFGKEKPAAGP 234

Query: 337 ---QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
              QL  Y    + AG+VGGLLG+GGG IL PLFLELGI PQV+SAT  F + FSSSMSV
Sbjct: 235 TCEQLFLYPLYAVLAGVVGGLLGIGGGMILAPLFLELGIIPQVTSATTTFIVVFSSSMSV 294

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
           VE+YLL R PV +A YF  L  IAA VG H  K  I+  GR S IIF L+ II  SA+ L
Sbjct: 295 VEFYLLGRLPVRFAAYFTTLCGIAALVGLHATKLFIRRYGRTSFIIFILAAIIGSSAIIL 354

Query: 454 GGVGLAKMIKRIEHKEYMGFDNIC 477
           G +G    I+R    EYMGF ++C
Sbjct: 355 GIIGGFADIRRYIAGEYMGFHSLC 378


>gi|359482297|ref|XP_003632751.1| PREDICTED: uncharacterized protein LOC100240792 isoform 2 [Vitis
           vinifera]
          Length = 369

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 143/271 (52%), Positives = 186/271 (68%), Gaps = 11/271 (4%)

Query: 59  PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSS 118
           P   S  + VWP++KF WRIV+ T+IGF G+A G+VGGVGGGGIFVPML LIVGFD KS+
Sbjct: 51  PSVSSSSEKVWPELKFSWRIVLATVIGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDTKSA 110

Query: 119 AAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFAD 178
           AA+SKCM+ G + S+V YNLR  HPT + P+IDYDLALLFQP+L+LGI++GVA +V+F  
Sbjct: 111 AALSKCMIMGASTSSVWYNLRVNHPTKEAPIIDYDLALLFQPMLMLGITVGVALSVVFPY 170

Query: 179 WMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEY 238
           W+IT+L+I+LF+  STR+F K V+ W +ET+ K+E  +Q   +V  +G    E   D EY
Sbjct: 171 WLITILIIILFMGTSTRSFFKAVQMWNEETLFKKELEEQRRTMVNSHG----ELLIDAEY 226

Query: 239 LSNDTTPEETREPKKSKVSII-ENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVL 297
             +   P E    +KS + I+  N+ WK   +LV VW I L LQI KN +T CS  YWVL
Sbjct: 227 --DPLIPNE----EKSWIQILCSNLRWKRILVLVTVWVIFLLLQIFKNDSTVCSAWYWVL 280

Query: 298 NLLQIPVAGGVSAYEAVALYKGRRKIASKGD 328
            LLQ PVA  V  YEAV L+K  +K    G+
Sbjct: 281 FLLQFPVAVVVFGYEAVKLHKEYKKRIISGN 311


>gi|255645402|gb|ACU23197.1| unknown [Glycine max]
          Length = 261

 Score =  234 bits (598), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 125/194 (64%), Positives = 154/194 (79%), Gaps = 9/194 (4%)

Query: 66  KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
             VWP+++FGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML L++GFDAKSS A+SKCM
Sbjct: 71  DRVWPEMEFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCM 130

Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLL 185
           + G AVS V YNLR RHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLL
Sbjct: 131 IMGAAVSTVYYNLRFRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLL 190

Query: 186 IVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTP 245
           I+LF+  ST+A  KG++TWKKETI K+EAAK LE      GY +EE     +Y S    P
Sbjct: 191 IILFIATSTKALFKGIDTWKKETIMKKEAAKMLESDS-SPGYVSEE-----DYKS---LP 241

Query: 246 EETREPKKSKVSII 259
             + +P+  +V ++
Sbjct: 242 AGSADPRDEEVPLL 255


>gi|356538176|ref|XP_003537580.1| PREDICTED: uncharacterized protein LOC100817312 [Glycine max]
          Length = 198

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 125/194 (64%), Positives = 158/194 (81%)

Query: 284 KNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCA 343
           + Y+  CS+ +W+LN LQ+P+A  V+ +EA+ LYKG R IASKG E T W+  Q+  YC+
Sbjct: 2   QEYSKPCSIHFWLLNFLQVPIAVSVTLFEAIGLYKGTRVIASKGKEVTNWKIHQICLYCS 61

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
            GI AGMVGGLLGLGGGFILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YYLL RFP
Sbjct: 62  TGIMAGMVGGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLDRFP 121

Query: 404 VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK 463
           VPYA YF  ++ IAAF GQHV++K+I +LGRASIIIF L+  IF+SA+SLGGVG+  +I+
Sbjct: 122 VPYASYFALVATIAAFTGQHVVRKVIVVLGRASIIIFILALTIFISAISLGGVGIENIIE 181

Query: 464 RIEHKEYMGFDNIC 477
           +IE  EYMGF+++C
Sbjct: 182 KIESHEYMGFEDLC 195


>gi|147812176|emb|CAN61520.1| hypothetical protein VITISV_033969 [Vitis vinifera]
          Length = 382

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 139/306 (45%), Positives = 176/306 (57%), Gaps = 69/306 (22%)

Query: 22  LVVASVSVSAQSNLKPEATSKN--KNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGW 76
            ++AS  VS + ++K EA++ N  K      N + KV        RSGY+HVWP+++FGW
Sbjct: 80  FMLASAFVSGERSIKHEASTFNVTKEAGFNSNYLSKVVNFLWQSDRSGYQHVWPEMEFGW 139

Query: 77  RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVY 136
           +IVVG+IIGFFG A GSVGGVGGGGIFVPML+L++GFD KS+ A+SKCM+ G A S V Y
Sbjct: 140 QIVVGSIIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATALSKCMIMGAAGSTVYY 199

Query: 137 NLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRA 196
           NL+ RHPTLD+P+IDYDLALLFQP+L++GISIGVAFNV+FADWM                
Sbjct: 200 NLKLRHPTLDMPIIDYDLALLFQPMLMMGISIGVAFNVLFADWM---------------- 243

Query: 197 FLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKV 256
                           EAAK+L     GNG +  E K  P   SN T     +  +    
Sbjct: 244 ----------------EAAKRLG--TNGNGTEEVEYKPLPSGPSNGTQXATNKSKE---- 281

Query: 257 SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL 316
                                      +N T TCS+ YWVLN +QIPV+ GVS YEAV+L
Sbjct: 282 --------------------------XENNTATCSMAYWVLNFMQIPVSVGVSLYEAVSL 315

Query: 317 YKGRRK 322
            +G   
Sbjct: 316 VQGTEN 321


>gi|302771113|ref|XP_002968975.1| hypothetical protein SELMODRAFT_170159 [Selaginella moellendorffii]
 gi|300163480|gb|EFJ30091.1| hypothetical protein SELMODRAFT_170159 [Selaginella moellendorffii]
          Length = 309

 Score =  214 bits (545), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 145/331 (43%), Positives = 205/331 (61%), Gaps = 37/331 (11%)

Query: 158 FQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ 217
            QP+L+LGIS+GV FNV+F +W+IT+LLIV+ L +++RAF KG+ETWKKET  KR     
Sbjct: 4   LQPMLLLGISVGVTFNVLFPNWLITLLLIVVSLAMASRAFSKGLETWKKETNEKR----- 58

Query: 218 LELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAII 277
             LI+ G+          P   +N TT +          S+   + WK+  LL AVW +I
Sbjct: 59  --LILEGS------LTPGP---ANFTTLD----------SLWTTVEWKKLSLLFAVWCLI 97

Query: 278 LALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL-----YKGRRKIASKGDEGTK 332
             LQ+ K YT  CS  +W+ N+LQ PV   V+  +A+ +     YK R  +  +  E + 
Sbjct: 98  TGLQVLKAYTANCSTAFWIYNILQAPVTLAVTVTQALRMREHSSYKLRESLLDESSESSS 157

Query: 333 W------RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAIT 386
                  RA  +  Y   G+ AG + GLLG+GGG  LGP+ LE G+PPQ +SATA  A+ 
Sbjct: 158 ASLKAAGRALDVFRYVFFGVLAGSIAGLLGVGGGATLGPIMLEFGVPPQTASATATLAML 217

Query: 387 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 446
           FSSS+SVVE+Y L R  V YALYF A+ I++AF+ Q +++KL+ +LGR S+I FTL F+I
Sbjct: 218 FSSSLSVVEFYFLGRIKVSYALYFGAICIVSAFLSQKLVQKLLNLLGRTSVITFTLVFVI 277

Query: 447 FVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            VS +SLGGVG+   I+ ++  +YMGF ++C
Sbjct: 278 VVSVISLGGVGIVDSIQELKRGKYMGFGSLC 308


>gi|222625716|gb|EEE59848.1| hypothetical protein OsJ_12424 [Oryza sativa Japonica Group]
          Length = 465

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 138/344 (40%), Positives = 193/344 (56%), Gaps = 31/344 (9%)

Query: 146 DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK 205
             PV   + A+   P L     +G + N    +   T          S+R+F KG+  WK
Sbjct: 139 HCPVFCTNSAIASAPPLKFSTKVGSSRNACVTNSTGT----------SSRSFYKGILMWK 188

Query: 206 KETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETR--EPKKSKVSIIE--- 260
            ET  + E  ++ E           +  C    +  D + EE    +P+  + S +E   
Sbjct: 189 DETRIQMETREREE---------ESKSSCAARDVVIDPSCEEPLLCQPQPKEKSALETFL 239

Query: 261 -NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKG 319
            N+ WK   +L+ VW+  L LQI KN + +CS  YWV+N+LQ+PVA  V  +E V L + 
Sbjct: 240 FNLRWKNILVLMTVWSSFLVLQIFKNNSQSCSTFYWVINILQVPVALSVFLWEGVQLCRE 299

Query: 320 RRKIASKG------DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 373
            R     G      +   +W  +QL+F   CG+  G VGGLLG GGGFILGPL LELG  
Sbjct: 300 SRARRMDGNWECVCEASIEWSPAQLIFCAFCGLLGGTVGGLLGSGGGFILGPLLLELGCI 359

Query: 374 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 433
           PQV+SATA F + FSSS+SVVE+Y L RFP+P+A+Y   +SI+A F GQ +++KL+ +L 
Sbjct: 360 PQVASATATFVMMFSSSLSVVEFYFLNRFPIPFAVYLICISILAGFWGQSLVRKLVHVLK 419

Query: 434 RASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           RAS+I+F LS +IF SAL++G VG  K I  I + EYMGF + C
Sbjct: 420 RASLIVFILSSVIFASALTMGVVGTQKSISMINNHEYMGFLDFC 463


>gi|168019909|ref|XP_001762486.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686219|gb|EDQ72609.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 390

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 140/410 (34%), Positives = 232/410 (56%), Gaps = 36/410 (8%)

Query: 83  IIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH 142
           ++G  G +  S GGVGGGG+F+P+ NL++GFDAKSSAA+S  M+ GG+++ V +N+++ H
Sbjct: 2   VLGTLGASICSAGGVGGGGLFIPLFNLLIGFDAKSSAALSNFMIFGGSIANVWWNIQRDH 61

Query: 143 PTLD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGV 201
           P L   P+ID+D+ LL QP ++LGISIGV  NV F  W IT+  I+    ++ R+F  G+
Sbjct: 62  PFLPGHPLIDFDVVLLLQPNMLLGISIGVICNVAFPSWFITLEFIITLGYITARSFRSGL 121

Query: 202 ETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETRE---PKKSKVSI 258
             W+ ET  +R                      D +    +  P+E  E   P   +   
Sbjct: 122 VRWRNETPLER---------------------LDSKLSWQNLCPKEGDEAIVPLLGESKP 160

Query: 259 IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSA 310
             N  + +  +L  VW   LA+Q+ +   ++ S+L        YW+L  +Q+P+A  ++ 
Sbjct: 161 PRNFPYVKLLMLTLVWTAFLAVQLLRGGKSSDSILTLEPCGGAYWLLTTMQVPLAFLLTG 220

Query: 311 YEAVALYKGRRKIASKGDEGTKWR---ASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 367
           + A  L+    +  +   E  +W        V + +  + AG +GG+LG+GGG I+ P+ 
Sbjct: 221 WSAWHLHHALEEKNTPELELEEWDVIGPRAFVLFPSMALLAGFLGGMLGIGGGMIINPML 280

Query: 368 LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           +ELG+ PQ+++AT  F + FSSS+SVV+++LL R  + +AL F A+  +++ +G HV+++
Sbjct: 281 IELGMHPQLTAATTAFMVFFSSSLSVVQFWLLGRLQLDFALLFGAICFVSSLIGLHVVQQ 340

Query: 428 LIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            I   GR SII+F++S ++ +SA+S    G  ++  +    EYMGF   C
Sbjct: 341 AIAKFGRPSIIVFSVSIVLGISAVSTTICGGFEVWDQFTGGEYMGFHYPC 390


>gi|168044376|ref|XP_001774657.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673957|gb|EDQ60472.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 380

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 134/386 (34%), Positives = 218/386 (56%), Gaps = 29/386 (7%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQP 160
           +F+P+ NL++ FD K+SAA+S  ++ GG+V+ +++NL QRHP+L     IDYD+AL+ QP
Sbjct: 14  LFIPVFNLLLLFDPKTSAALSNFVILGGSVANLIWNLPQRHPSLPHKSAIDYDVALILQP 73

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
            ++LGISIGV  NV+F  W+I V L ++   ++TR++  G++ W+ E+        QL  
Sbjct: 74  NMLLGISIGVICNVMFPGWLIIVQLALILGFITTRSWKNGIKRWRIES--------QLA- 124

Query: 221 IVLGNGYQTEECKCDPE-YLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILA 279
                  +TEE   + E  L +D        P K+ +    +    +   L  VW     
Sbjct: 125 -----ALKTEEANAESEGPLESDNLHAPLLAPVKTPLE--PSCLGSKVMCLGLVWVAFFV 177

Query: 280 LQIAKNYTTT--------CSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT 331
           +Q+ +   TT        C V YW+L L QIP+A  V+ + A   ++  +  + K D+G 
Sbjct: 178 IQLLRGGKTTEGILPLKSCGVGYWLLTLTQIPLACFVTLWTA---FRHTQCSSDKQDQGE 234

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
             R   L  +    + AG+ GG+LG+GGG  + PL +E G+  QV++AT  F + FSSS+
Sbjct: 235 ITRHRALTVFPGMALVAGLWGGMLGIGGGMFMNPLLIEAGVHVQVTAATTAFMVFFSSSL 294

Query: 392 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
           SVVE++LL R PV +A+ F ++  +A+ +G  V+   I   GRASII+F+++  + +SA+
Sbjct: 295 SVVEFWLLGRIPVDFAIVFSSICFVASLIGLTVVHHAISKYGRASIIVFSVAIALGISAV 354

Query: 452 SLGGVGLAKMIKRIEHKEYMGFDNIC 477
            + G G   + ++ +   YMGF   C
Sbjct: 355 MMAGFGSLNVYRQYKDGAYMGFHTPC 380


>gi|290979999|ref|XP_002672720.1| hypothetical protein NAEGRDRAFT_81115 [Naegleria gruberi]
 gi|284086299|gb|EFC39976.1| hypothetical protein NAEGRDRAFT_81115 [Naegleria gruberi]
          Length = 610

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 129/460 (28%), Positives = 222/460 (48%), Gaps = 50/460 (10%)

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           +K ++P  +F +   + T+I F G+   ++ G+GGGG+ VP++     F  + +  +SK 
Sbjct: 154 HKDLFP--QFSYLDAIATVISFLGSVLSTMAGIGGGGLIVPLMETAGQFPPQMAVGISKT 211

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M+ G ++S  +    +RHP  D P+I+YD+ALL QP  ++G+ +GV  N +  +W+I +L
Sbjct: 212 MIFGASISNFIALSLKRHPHADRPLINYDVALLLQPTSLVGVLVGVLLNSMTPNWLIVLL 271

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITK----REAAKQLELIVLGNGYQTEECKCD----- 235
             ++  ++S   F++    W+ E+  K       +      +  NG   +    D     
Sbjct: 272 SAIILTIVSLTTFVRAGRMWRAESAAKLVGSNNGSSANYHQITDNGVNNDTIISDDEDET 331

Query: 236 ---PEYLSNDTTPE-----------------ETREPKKSKVSIIENIYWKEFG---LLVA 272
              P+  +N+   E                 E  E K+    ++ N     F    +LV 
Sbjct: 332 FDRPKNTNNENVLETSLLGEDELEQMEQENIEKEEKKQIARKVLHNEKSTPFTKLFVLVL 391

Query: 273 VWAIILALQIAKN--------YTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL---YKGRR 321
            W I+  L I +         +   CSV +WVL     P+   VS    V L   +K R 
Sbjct: 392 CWVIVFILTIVRGGHGAPSVFHIDHCSVWHWVLFAAMFPIMIAVSYMVGVYLVRKHKERS 451

Query: 322 KIASKGDE----GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 377
            + S G         W    ++ Y      +G++ G+LG+GGG I  P+ L LG+ P V+
Sbjct: 452 ALVSHGYRFVAGDPHWSVGTVILYPVFSALSGVIAGMLGVGGGMIKSPMLLYLGLDPLVA 511

Query: 378 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASI 437
           +ATA F + F+SS+S +++ +L   P  Y L++F   + A  VGQ +L   +   GR S+
Sbjct: 512 AATASFMMLFTSSISSIQFVILGTVPFDYGLWYFVFGLFAGMVGQ-ILMHFVFQKGRKSV 570

Query: 438 IIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++F ++FII VS L + G G+   +  +++  YMGF +IC
Sbjct: 571 LVFIVAFIIIVSTLCMTGFGIYNAVIMLQNNMYMGFHSIC 610


>gi|4415908|gb|AAD20139.1| unknown protein [Arabidopsis thaliana]
 gi|20197925|gb|AAM15316.1| unknown protein [Arabidopsis thaliana]
          Length = 288

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 92/156 (58%), Positives = 128/156 (82%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
            S S  + +WPD+KF W++V+ T+I F G+ACG+VGGVGGGGIFVPML LI+GFD KS+A
Sbjct: 42  SSLSATEKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVPMLTLILGFDTKSAA 101

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+ G + S+V YN+R RHPT ++P++DYDLALLFQP+L+LGI++GV+ +V+F  W
Sbjct: 102 AISKCMIMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLGITVGVSLSVVFPYW 161

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAA 215
           +ITVL+I+LF+  S+R+F KG+E WK+ET+ K   A
Sbjct: 162 LITVLIIILFVGTSSRSFFKGIEMWKEETLLKFPVA 197


>gi|356566531|ref|XP_003551484.1| PREDICTED: uncharacterized protein LOC100809802 [Glycine max]
          Length = 254

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 98/184 (53%), Positives = 125/184 (67%), Gaps = 14/184 (7%)

Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
           C++    VS V YNL+ RHPTL++P+IDYDLALL QP+L+LGI+IGV FNV+F+ W++T+
Sbjct: 3   CLLVIYVVSTVYYNLKLRHPTLNMPIIDYDLALLIQPMLMLGITIGVVFNVVFSYWIVTI 62

Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREA-AKQLELIVLGNG-YQTEECKCDPEYLSN 241
           LLIVLFL  ST++  KG+ETWKKETI K++   K+     + NG +     K  P+   +
Sbjct: 63  LLIVLFLGTSTKSLFKGIETWKKETIIKKDQFTKEAGKHPVSNGEFGPNHIKVLPDQFLS 122

Query: 242 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLL 300
           D +           V+IIEN+YWKEFGLLV VW    ALQI K NYTTTC   YW LNLL
Sbjct: 123 DVS-----------VAIIENVYWKEFGLLVFVWVSFPALQIGKENYTTTCPTFYWALNLL 171

Query: 301 QIPV 304
           QI V
Sbjct: 172 QIMV 175


>gi|290999413|ref|XP_002682274.1| predicted protein [Naegleria gruberi]
 gi|284095901|gb|EFC49530.1| predicted protein [Naegleria gruberi]
          Length = 399

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 127/422 (30%), Positives = 203/422 (48%), Gaps = 49/422 (11%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           ++ TI  F G    +  G GGGG+FVP+L++   F    +  +S  M+ G  +  +    
Sbjct: 3   ILATITCFIGGVLSAASGTGGGGVFVPLLHVAGQFPPTLAIPISTLMIFGAGIINIATLS 62

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
            +RHP  D P+IDYD+AL+ +P  +LG  IGV FN++F DW+I V +I+   + S   F 
Sbjct: 63  FKRHPHADRPLIDYDIALMMEPPTLLGTIIGVFFNMMFPDWLIVVFVILTLSITSFVMFK 122

Query: 199 KGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 258
            G +  K+E     E  K+LE                      ++T EE      +  ++
Sbjct: 123 NGYKRLKRE----NEERKKLE----------------------ESTTEENHNDNNNPTTV 156

Query: 259 ---IENIYWKE----FG---LLVAVWAIILALQIAKN---------YTTTCSVLYWVLNL 299
              ++ +Y  E    FG   +L   W  +  L + K            T CSV YW+L  
Sbjct: 157 YDELQQMYESEKRTPFGKVIVLFICWMTVFTLSLLKGGHGAPSIIPSVTQCSVGYWILTA 216

Query: 300 LQIPVAGGVS---AYEAVALYKGRRKIASKGDEG-TKWRASQLVFYCACGITAGMVGGLL 355
           L  P+ G ++    +  +  ++ + +I  +  EG   W    +  Y    + AG++  +L
Sbjct: 217 LSFPLLGAMTLGIVFYLLKKHERKVQIGYQFVEGDVHWNKYNVTLYPIACVGAGILASML 276

Query: 356 GLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSI 415
           G+GGG +  PL L LG  P  S AT  F I F+SS+S V+Y +    PV Y L+F A  I
Sbjct: 277 GIGGGMVKSPLLLILGSDPVSSQATTSFMILFTSSISTVQYLIAGLLPVDYGLWFLACGI 336

Query: 416 IAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDN 475
           +    GQ +L   +   GR SI+IF ++ +   +   +GG G+  ++K++E   YMGF +
Sbjct: 337 LCGVFGQLILDLWLDKSGRRSIMIFIVAIVTLAATFLMGGAGIYDVVKQVEKGVYMGFRS 396

Query: 476 IC 477
            C
Sbjct: 397 PC 398


>gi|384245840|gb|EIE19332.1| hypothetical protein COCSUDRAFT_44684 [Coccomyxa subellipsoidea
           C-169]
          Length = 495

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 133/469 (28%), Positives = 210/469 (44%), Gaps = 57/469 (12%)

Query: 57  SAPKSRSGYKHVW---PDIKFGW--RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIV 111
            A +S SG   +    P+ KF W  R +    +        +  G+GGG  +VP+LN+++
Sbjct: 35  EASRSDSGLFQILGTGPNTKFEWTPRTIFAAALACVCALLANSAGIGGGPFYVPLLNVVL 94

Query: 112 GFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT-LDIPVIDYDLALLFQPLLVLGISIGV 170
           GFD K++  +S  +V   AV++ +Y L Q HP     P++D+D+AL F P L+LG+S GV
Sbjct: 95  GFDLKAATGLSHTIVATSAVASSIYGLIQTHPNDPSRPLVDFDVALTFIPALLLGVSFGV 154

Query: 171 AFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT---KREAAKQLELIVLGNGY 227
             NV+  DW+ T LL VL L +  +   KG+  W++E      KR AA+Q     LG+  
Sbjct: 155 LLNVLVPDWLQTALLTVLLLFVINKTVRKGITQWRQEQKAIKQKRSAAQQ----DLGDED 210

Query: 228 QTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLL----------------- 270
             E    +  +  N   P +      S V  ++    + F  L                 
Sbjct: 211 DEEGVLHEERFERN---PSKRFSAPHSSVHQLQTTLSQIFHRLPLFKALRKVCETCVQMA 267

Query: 271 --VAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGD 328
             VA+WA+ LA Q  K     C+  Y+ +   Q+     V+A+     Y+ ++      D
Sbjct: 268 AVVALWAVFLAFQQLKARYPNCTWQYFTIFAAQVIFLLSVTAF--CIWYEAKKAAGPHAD 325

Query: 329 E--------------------GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
           E                    GT     +L         AG   GLLG+GG  I  P+ L
Sbjct: 326 EMDPELRTVILGEQSDSETPIGTADTYKRLAKVVGVMAFAGFTAGLLGIGGALIFNPVLL 385

Query: 369 ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           +LG+ PQV+++T++  I F+SS   + +Y        YAL    L  +A+ +G  V+ ++
Sbjct: 386 QLGVQPQVTASTSVLMILFTSSAIALSFYFQGLLNTSYALVLAPLCFVASLIGVTVVGRI 445

Query: 429 IKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           I+  GR SII+  LS +I    +     G  + +  I     +GF   C
Sbjct: 446 IRASGRVSIIVLLLSALIIAGTVLTAFFGGIRAVNDIRDGAPIGFKPFC 494


>gi|407038050|gb|EKE38923.1| hypothetical protein ENU1_147930 [Entamoeba nuttalli P19]
          Length = 459

 Score =  174 bits (441), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 128/446 (28%), Positives = 220/446 (49%), Gaps = 48/446 (10%)

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
           F W+++VG+I   F     +  G+GGG  ++ +  LI+  D   +  +SK    G A   
Sbjct: 7   FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66

Query: 134 VVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
            +    +RHP +   P+I Y  AL+ +PL + G  IGV FN+I   W+I ++L++L    
Sbjct: 67  FLILWMKRHPNVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVLLLGFT 126

Query: 193 STRAFLKGVETWKKETITKREAAKQLELI-------------------------VLGNGY 227
           S + F K ++ WK E   KR+AAK  EL+                         ++    
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDAAKATELVETSKPDIADNDNDDMKPSENGNNAVIVDERV 185

Query: 228 QTEE--CKCDPEYLSNDTTPEETREPKKS------KVSIIENIYWKEFGLLVAVWAIILA 279
           Q EE      P+ L  D + E  +E KK       K  II+ I     G+L+ VWA++  
Sbjct: 186 QEEEEGQGSGPKLLPQDESQEAQKEAKKIEEKTLLKREIIKAIL--SVGILIVVWAVMFF 243

Query: 280 LQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT 331
           + I K      S++        YWVL  +  P+   V+    + L+  +R    +G+   
Sbjct: 244 IVILKGGEKMDSIVGIECGTPWYWVLTAIGGPLMLAVTIVVGIFLWWRQRGEEVQGE--V 301

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
           +W     +        AG+    LG+GGG ++GP+ LE+G+ PQV++AT+ F I F++S 
Sbjct: 302 QWTVKNCLIIPIGAFFAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTASS 361

Query: 392 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
           S ++Y +  +  + Y L++FA+  I A  GQ    K+++ L R SII F L  +I +S L
Sbjct: 362 SSLQYIIDGKLDIFYGLWYFAIGFIGAAFGQFGFSKIVQKLNRQSIIGFFLGVLIVLSTL 421

Query: 452 SLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++  + + +++  +++ + +GF ++C
Sbjct: 422 AMIAITVVQLVSDVKN-DNLGFKHLC 446


>gi|290977704|ref|XP_002671577.1| predicted protein [Naegleria gruberi]
 gi|284085147|gb|EFC38833.1| predicted protein [Naegleria gruberi]
          Length = 537

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 127/431 (29%), Positives = 210/431 (48%), Gaps = 38/431 (8%)

Query: 83  IIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH 142
           +  F G+   S GG GGGG+FVP+L+++  F+A+ +  +SK M+ G AV+ V     + H
Sbjct: 107 LTNFIGSVLSSAGGTGGGGVFVPLLHVLGRFNAQEAVPLSKVMIFGAAVTNVFTLFFRSH 166

Query: 143 PTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVE 202
           P  D P+IDYD+AL+ +P  +LG  IGV  N+I  +W+I + +I++  + +   F K  +
Sbjct: 167 PYADRPLIDYDIALMMEPATLLGTIIGVFLNIICPEWVIVLSVIIVLTITTILTFRKFFQ 226

Query: 203 TWKKE-------TITKREAA---KQLELIVLGN-------GY----QTEEC--KCDPEYL 239
             + E            E     K+ E IV  N       GY    + EE      P ++
Sbjct: 227 RARVEFAFLWKKKKKDEETEPLNKEQEPIVSENKEEEQAQGYGSVNKEEESLVNTQPIFV 286

Query: 240 SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT---------TTC 290
           S     ++  +P    + + +  YWK F +L+  W II  L + +              C
Sbjct: 287 SQQVAEKDFVKPSAWTI-VKKTPYWKIF-VLIVCWIIIFTLSLLRGGEGAPSVIPGLEMC 344

Query: 291 SVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG----DEGTKWRASQLVFYCACGI 346
           S +YW L  L  P+ G +    A  L    R+   +G        KW    + FY    +
Sbjct: 345 SPVYWTLVGLSFPIIGAIMIIVAAYLLIDYRRKVKRGHVFVQGDVKWNWINVTFYPGACL 404

Query: 347 TAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 406
            AG++  +LG+GGG I  PL L LG  P V +ATA F I F+SS+S  ++ ++ R P  Y
Sbjct: 405 IAGILAAMLGIGGGMIKSPLLLLLGSDPAVGAATAAFMIFFTSSISSAQFAIVGRIPFDY 464

Query: 407 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 466
            + +     ++ FVG   +   ++  G+ S+II  +  ++  + + +GGVG+  ++  ++
Sbjct: 465 GMLYGLTGFVSGFVGYFFVTFGVERWGKRSLIILCVGLVLLFATMLMGGVGIYDVVIDLQ 524

Query: 467 HKEYMGFDNIC 477
              YMGF + C
Sbjct: 525 QGVYMGFHDPC 535


>gi|67466247|ref|XP_649271.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56465670|gb|EAL43888.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449701601|gb|EMD42390.1| Hypothetical protein EHI5A_207490 [Entamoeba histolytica KU27]
          Length = 460

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 125/447 (27%), Positives = 217/447 (48%), Gaps = 49/447 (10%)

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
           F W+++VG+I   F     +  G+GGG  ++ +  LI+  D   +  +SK    G A   
Sbjct: 7   FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66

Query: 134 VVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
            +    +RHP +   P+I Y  AL+ +PL + G  IGV FN+I   W+I ++L++L    
Sbjct: 67  FLILWMKRHPNVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVLLLGFT 126

Query: 193 STRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCD----------------- 235
           S + F K ++ WK E   KR+AAK  EL+        +    D                 
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDAAKATELVETSKPDIADNDNDDMKPSENGSNAVIVDEKV 185

Query: 236 -----------PEYLSNDTTPEETREPKKS------KVSIIENIYWKEFGLLVAVWAIIL 278
                      P+ L  D + E  +E KK       K  II+ I     G+L+ VWA++ 
Sbjct: 186 QEEEEEGQGTGPKLLPQDESQEAQKEAKKIEEKTLLKREIIKAIL--SVGILIIVWAVMF 243

Query: 279 ALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEG 330
            + I K      S++        YWVL  +  P+   V+    + L+  +R    +G+  
Sbjct: 244 FIVILKGGEKMDSIVGIECGTPWYWVLTAIGGPLMLAVTIVVGIFLWWRQRGEEVEGE-- 301

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 390
            +W     +        AG+    LG+GGG ++GP+ LE+G+ PQV++AT+ F I F++S
Sbjct: 302 VQWTVKNCLIIPIGAFFAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTAS 361

Query: 391 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 450
            S ++Y +  +  + Y +++FA+  I A  GQ    K+++ L R SII F L  +I +S 
Sbjct: 362 SSSLQYIIDGKLDIFYGIWYFAIGFIGAAFGQFGFSKIVQKLNRQSIIGFFLGVLIVLST 421

Query: 451 LSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           L++  + + +++  +++ + +GF ++C
Sbjct: 422 LAMIAITVVQLVSDVKN-DNLGFKHLC 447


>gi|168065030|ref|XP_001784459.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663984|gb|EDQ50721.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 405

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 136/415 (32%), Positives = 232/415 (55%), Gaps = 30/415 (7%)

Query: 77  RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVY 136
           R+V   I+G  G    + GGVGGGG+F+P+ NL++ FDAK+SAA+S  M+  G+++ + +
Sbjct: 4   RVVAAVILGSLGACICAAGGVGGGGLFIPIFNLLLLFDAKTSAALSNIMILAGSIAVLAW 63

Query: 137 NLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTR 195
           N+R+ HP +   P+IDYD+ALL  P ++LGISIGV  N+ F  W++  +L V+   ++ R
Sbjct: 64  NIRRTHPLSPGKPLIDYDVALLLNPNMLLGISIGVFCNITFPGWLLISVLTVILFYMTNR 123

Query: 196 AFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSK 255
           +   G   WKKE+    +A +++ ++   + +  E  +     L   + P    +    K
Sbjct: 124 SIQNGFTRWKKESAAAAKAKEKIAIV---SAHSVETGQSKYPLLGGQSEPSLFAQCPPQK 180

Query: 256 VSIIENIYWKEFGLLVAVWAIILALQIAKNYTTT--------CSVLYWVLNLLQIPVAGG 307
             +I+         LV +W +  A+QI +    T        C + YW+L+  Q+P+A G
Sbjct: 181 --LIK---------LVMMWLLFFAVQILRGGEGTEGILKVKPCGLAYWLLSASQLPLAIG 229

Query: 308 VSAYEAVALYKGRRKIASKGDEGTK-----WRASQLVFYCACGITAGMVGGLLGLGGGFI 362
           ++A+  +AL    +  A+K  E  +       +     +    + AGM+GG+LG+GGG I
Sbjct: 230 LTAW--IALQHSSKSHAAKPSESNEEVDVMLTSRAYTVFPLMALLAGMLGGMLGIGGGMI 287

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           + P+  E+G+ PQ ++ T+ F I F++SMSV++++LL R P+ +AL F A+ +  + VG 
Sbjct: 288 INPMLTEVGMHPQATAGTSSFMIFFATSMSVLQFWLLGRIPMDFALLFGAVCLFWSCVGI 347

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            +L+  I   GR S+I+F +S ++ VSAL +   G   +  +    +YMGF   C
Sbjct: 348 GLLQAAIVKHGRPSVIVFLVSSVMGVSALLMATFGGFNVWHQYRAGDYMGFHAAC 402


>gi|356549443|ref|XP_003543103.1| PREDICTED: uncharacterized protein LOC100790958 [Glycine max]
          Length = 466

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 128/399 (32%), Positives = 218/399 (54%), Gaps = 37/399 (9%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQP 160
           +FVP+L+++ G D K+++++S  MVTGG+++ V+ N+    P      +IDYD+AL  +P
Sbjct: 82  LFVPILSIVAGLDLKTASSLSAFMVTGGSIANVMCNMCITSPKFGGKSLIDYDIALSSEP 141

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK-GVETWKKETITKREAAKQLE 219
            ++LG+S+GV  N++F +W+ITVL  + FL  ST    K G+  WK E+   R+      
Sbjct: 142 CMLLGVSLGVICNLVFPEWLITVLFAI-FLAWSTSKTCKSGLLFWKAESEVIRK------ 194

Query: 220 LIVLGNGYQTEECKCDPEYLSNDTTPE-----ETREPKKSKVSIIE-------NIYWKEF 267
                NG   EE   +   L N+T  +     E  EPK  +VS++         I W + 
Sbjct: 195 -----NGLINEEL--EKGLLENETIEQRKVYIENNEPKSIEVSLLAPQGNSKVRIPWFKL 247

Query: 268 GLLVAVWAIILALQIAKNY--------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKG 319
            +L+ +W    ++ + +             C V YW+L+ +Q+P+A   +A+        
Sbjct: 248 AVLLLIWFSFFSVYLLRGNRYGEGIIPMEPCGVGYWILSSVQVPLAVVFTAWIVFRKESL 307

Query: 320 R-RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS 378
           R R +  K    TK R S ++ +    + AG++GG+ G+GGG ++ PL L++G+ P+V++
Sbjct: 308 RDRTLIPKVPGLTKKRPSNILVFPLMALLAGILGGVFGIGGGMLISPLLLQVGVTPEVTA 367

Query: 379 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASII 438
           AT  F + FS++MS ++Y LL    V  AL    +  +A+ +G  V+++ I+  GRASII
Sbjct: 368 ATCSFMVLFSATMSGLQYLLLGMEHVQAALVLAIMCFVASLLGLLVVQRAIRKYGRASII 427

Query: 439 IFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +F++S ++F+S + +   G  K+    E  EYMGF   C
Sbjct: 428 VFSVSIVMFISNVLMTSFGAIKVWTDYESGEYMGFKLPC 466


>gi|302758356|ref|XP_002962601.1| hypothetical protein SELMODRAFT_65204 [Selaginella moellendorffii]
 gi|300169462|gb|EFJ36064.1| hypothetical protein SELMODRAFT_65204 [Selaginella moellendorffii]
          Length = 385

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 126/394 (31%), Positives = 215/394 (54%), Gaps = 40/394 (10%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQP 160
           ++VP+ NL++ F+ +++A +S  M+ GG ++ ++++  QR P  D  P+ID+D ALL QP
Sbjct: 14  LYVPLFNLLLRFNTRTAAVLSNFMIFGGMIANMMWSAFQRDPFDDERPLIDFDAALLMQP 73

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
            ++LGIS+GV  N++F  W+IT+LL +    ++ R+F  G   WK E+ +          
Sbjct: 74  NMLLGISLGVLCNLMFPGWLITLLLTITLAFVTFRSFNCGFRLWKAESGSNSSD------ 127

Query: 221 IVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILAL 280
              G G   +    +   L +   P   R P             K  GLL+ VW    A+
Sbjct: 128 ---GEGKSAKYHDAEAPLLDSAEIPHR-RFPAL-----------KLAGLLL-VWLFFFAV 171

Query: 281 QIAKNYTTT--------CSVLYWVLNLLQIPVAGGVSAY---EAVALYKGRR--KIASKG 327
           Q+ +   T+        C + YW++   Q+P+    + +   EA      +   K     
Sbjct: 172 QLLRGSKTSEGYFHLDECGLGYWLITGSQLPLTLLFTVWTIREATVSTSCQSDVKFLVSS 231

Query: 328 DEGTKWRASQ----LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
            +  KW +S+     +      + AG++GGLLG+GGG ++ P+ LE+G+PPQV++AT+ F
Sbjct: 232 AKSFKWNSSRSNRAHLTLPLMALLAGILGGLLGIGGGMLISPILLEMGMPPQVTAATSAF 291

Query: 384 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 443
            + FSSS+SV +Y+L+ R PV +AL+F  +  + + VG  V+++ I+  GRASII+F ++
Sbjct: 292 MVFFSSSLSVAQYWLMGRIPVEFALWFSGICFVFSLVGLLVVQRAIQRYGRASIIVFLVT 351

Query: 444 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            ++ +SAL + G G   + K+ E  +YMGF + C
Sbjct: 352 SVMGLSALLMAGFGGMDVWKQYERGDYMGFRSPC 385


>gi|302797450|ref|XP_002980486.1| hypothetical protein SELMODRAFT_55219 [Selaginella moellendorffii]
 gi|300152102|gb|EFJ18746.1| hypothetical protein SELMODRAFT_55219 [Selaginella moellendorffii]
          Length = 385

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 124/394 (31%), Positives = 212/394 (53%), Gaps = 40/394 (10%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQP 160
           ++VP+ NL++ F+ +++A +S  M+ GG ++ ++++  QR P  D  P+ID+D ALL QP
Sbjct: 14  LYVPLFNLLLRFNTRTAAVLSNFMIFGGMIANMMWSAFQRDPFDDERPLIDFDAALLMQP 73

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
            ++LGIS+GV  N++F  W+IT+LL +    ++ R+F  G   WK E+ +          
Sbjct: 74  NMLLGISLGVLCNLMFPGWLITLLLTITLAFVTFRSFNCGFRLWKAESGSNSSD------ 127

Query: 221 IVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILAL 280
              G G   +    +   L +   P   R P             K  GLL+ VW    A+
Sbjct: 128 ---GEGKSAKYHDAEAPLLDSAEIPHR-RFPAL-----------KLAGLLL-VWLFFFAV 171

Query: 281 QIAKNYTTT--------CSVLYWVLNLLQIPVA-----GGVSAYEAVALYKGRRKIASKG 327
           Q+ +   T+        C + YW++   Q+P+        +         +   K     
Sbjct: 172 QLLRGSKTSEGYFHLDECGLGYWLITGSQLPLTLLFTVWTIRETTVSTSCQSDVKFLVSS 231

Query: 328 DEGTKWRASQ----LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
            +  KW +S+     +      + AG++GGLLG+GGG ++ P+ LE+G+PPQV++AT+ F
Sbjct: 232 AKSFKWNSSRSNRAHLTLPLMALLAGILGGLLGIGGGMLISPILLEMGMPPQVTAATSAF 291

Query: 384 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 443
            + FSSS+SV +Y+L+ R PV +AL+F  +  + + VG  V+++ I+  GRASII+F ++
Sbjct: 292 MVFFSSSLSVAQYWLMGRIPVEFALWFSGICFVFSLVGLLVVQRAIQRYGRASIIVFLVT 351

Query: 444 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            ++ +SAL + G G   + K+ E  +YMGF + C
Sbjct: 352 SVMGLSALLMAGFGGMDVWKQYERGDYMGFRSPC 385


>gi|255555059|ref|XP_002518567.1| conserved hypothetical protein [Ricinus communis]
 gi|223542412|gb|EEF43954.1| conserved hypothetical protein [Ricinus communis]
          Length = 463

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 140/421 (33%), Positives = 227/421 (53%), Gaps = 26/421 (6%)

Query: 70  PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
           P +K    +V+  ++ F   +  S GG+GGGG+FVP+L ++ G D K++++ S  MVTGG
Sbjct: 56  PMLKLASPMVLSGVLCFIAASISSAGGIGGGGLFVPILTIVAGLDLKTASSFSAFMVTGG 115

Query: 130 AVSAVVYNLRQRHPTLD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
           +++ V+ NL    P      +IDYD+ALL +P ++LG+S+GV  N+IF +W+ITVL  VL
Sbjct: 116 SIANVLCNLFS--PKFGGKALIDYDIALLSEPCMLLGVSVGVICNLIFPEWLITVLF-VL 172

Query: 189 FLVLSTRAFLKG-VETWKKETI-TKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPE 246
           FLV ST    K  V  W  E+   KR     LE     NG   +      E +     P 
Sbjct: 173 FLVWSTFKTCKNAVAHWNLESEEVKRNGHGNLE-----NGRVKDRSSIGNEEIKIIKEPL 227

Query: 247 ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT--------TTCSVLYWVLN 298
              E  ++++S      W++ G+LV +W     L + +             C V YWV++
Sbjct: 228 MGIE-MENRMSFT----WEKLGVLVLIWLSFSFLYLLRGNRYGEGIAPLKPCGVGYWVVS 282

Query: 299 LLQIPVAGGVSAYEAVA--LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLG 356
            LQIP+A   +A+  +    Y+ +       D+  + RA   + +    + AG++GG+ G
Sbjct: 283 SLQIPLAIIFTAWILLKKRHYQNQTANLQDIDDSMEGRAPNKLTFPIMALLAGILGGVFG 342

Query: 357 LGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII 416
           +GGG ++ PL L +GIPP+V++AT  F + FSS+MS  +Y L        AL F ++  +
Sbjct: 343 IGGGMLISPLLLHVGIPPEVTAATCSFMVFFSSTMSAFQYLLSGMEHTDTALMFASICFV 402

Query: 417 AAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNI 476
           A+ VG  V++++I+  GRASII+F++S ++ +S + +   G   + +  E    MGF   
Sbjct: 403 ASLVGLLVVQRIIQDYGRASIIVFSVSIVMALSTVLITSFGTIDVWRNYESGTNMGFKLP 462

Query: 477 C 477
           C
Sbjct: 463 C 463


>gi|414872566|tpg|DAA51123.1| TPA: hypothetical protein ZEAMMB73_068409 [Zea mays]
          Length = 199

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 90/141 (63%), Positives = 111/141 (78%), Gaps = 5/141 (3%)

Query: 60  KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
            SRS    VWPD+ F WR+VV T++GF G+A G+VGGVGGGGIFVPMLNL+VGFD KS+A
Sbjct: 61  SSRS--PRVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLVVGFDTKSAA 118

Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           A+SKCM+ G + S+V YNL+  HPT + PVIDY LALLFQP+L+LGI+IGV  +VIF  W
Sbjct: 119 ALSKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYW 178

Query: 180 MITVLLIVLFLVL---STRAF 197
           +ITVL+I+LF      STRAF
Sbjct: 179 LITVLIIILFAPRRDPSTRAF 199


>gi|225469193|ref|XP_002275818.1| PREDICTED: uncharacterized protein LOC100252710 [Vitis vinifera]
 gi|297741863|emb|CBI33227.3| unnamed protein product [Vitis vinifera]
          Length = 458

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 134/421 (31%), Positives = 227/421 (53%), Gaps = 33/421 (7%)

Query: 71  DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
            +KF   IV   ++ F   +  S GG+GGGG+FVP+L ++ G D K+++  S  MV GG+
Sbjct: 57  QLKFSPPIVRAAVLCFIAASISSAGGIGGGGLFVPILAIVGGLDLKTASTFSAFMVAGGS 116

Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
            + ++  +          VID+D+ALL +P L+LG+SIGV  N++F +W+IT+L +V   
Sbjct: 117 TANILCTMFINCIHGGKSVIDFDIALLSEPCLLLGVSIGVVCNIVFPEWLITILFVVFLS 176

Query: 191 VLSTRAFLKGVETWKKET-ITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETR 249
             +++   KGV +WK E+ + +R    +LE     NG + +E   + E + +       +
Sbjct: 177 WTTSKTCRKGVVSWKLESEVIRRNGFGELE-----NGVRRDESNGENEVIKS------LK 225

Query: 250 EPKKSKVSIIE-NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLL 300
           EP   +V   + +I W +FG LV +W     L I +      S++        YW+L+ L
Sbjct: 226 EPLMGEVENFKISIPWTKFGALVVIWLSFFLLYILRGDRDGQSIIPMEPCGEGYWILSSL 285

Query: 301 QIPVAGGVSAYEAVALYKGRRKIASK----GDEGTKWRASQLVFYCACGITAGMVGGLLG 356
           Q P+A   +A+        RR+ +++    G  G K     L+F     + AG++GG+ G
Sbjct: 286 QFPLAITFTAW-----ILHRRETSNQQEILGQTGEK--PPNLIFPIM-ALLAGILGGVFG 337

Query: 357 LGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII 416
           +GGG ++ PL L +GIPP+V++AT    + FSS+MS  +Y L+       AL F  +   
Sbjct: 338 IGGGMLISPLLLHIGIPPEVTAATCSVMVFFSSTMSSFQYLLIGMEHKEVALIFAIICFF 397

Query: 417 AAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNI 476
           A+ +G  V+++ I+  GRAS+I+F++S ++ +S + +   G   + +     EYMGF   
Sbjct: 398 ASILGVVVVQRAIEKYGRASLIVFSVSTVMALSTVLITSFGAIDVWRDYARGEYMGFKLP 457

Query: 477 C 477
           C
Sbjct: 458 C 458


>gi|115440389|ref|NP_001044474.1| Os01g0786800 [Oryza sativa Japonica Group]
 gi|113534005|dbj|BAF06388.1| Os01g0786800, partial [Oryza sativa Japonica Group]
          Length = 434

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 125/396 (31%), Positives = 192/396 (48%), Gaps = 42/396 (10%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR-----QRHPTLDIPVIDYDLAL 156
           +F+P+LNL+ G   K + A S  MVTGGA S V+YNL      +R       VIDYD+AL
Sbjct: 61  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAA----VIDYDIAL 116

Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAK 216
           LFQP L+LG+SIGV  NV+F +W+IT L  +     +T+    G+  W  E+      A 
Sbjct: 117 LFQPCLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSES----RGAT 172

Query: 217 QLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAI 276
                   +G +      +P  L     P  T               WK+  +LV VW  
Sbjct: 173 LAVAAATAHGRE------EPLLL-----PHGTDAGNGGGARGDAGFPWKDVSVLVMVWLC 221

Query: 277 ILALQI--------AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK--IASK 326
              L +               C V YW++ L Q+P A   +AY    +Y  R+K  + ++
Sbjct: 222 FFVLHVFIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY---IIYAKRKKQVLHNQ 278

Query: 327 GD-----EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 381
            D     E TK      + +       G + GL G+GGG +L P+ L++GIPPQ ++AT+
Sbjct: 279 EDGKANPESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATS 338

Query: 382 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 441
            F + F +SMS+V++ LL    +  A  +  +  +A+ VG  V+++ I+  GR S+I+F 
Sbjct: 339 SFMVLFCASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFL 398

Query: 442 LSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++ I+ VS + +   G   +  +     YMGF   C
Sbjct: 399 VTGIMAVSTVIITFFGALDVWAQYTSGAYMGFKLPC 434


>gi|53792419|dbj|BAD53257.1| membrane protein-like [Oryza sativa Japonica Group]
          Length = 461

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 125/396 (31%), Positives = 192/396 (48%), Gaps = 42/396 (10%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR-----QRHPTLDIPVIDYDLAL 156
           +F+P+LNL+ G   K + A S  MVTGGA S V+YNL      +R       VIDYD+AL
Sbjct: 88  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAA----VIDYDIAL 143

Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAK 216
           LFQP L+LG+SIGV  NV+F +W+IT L  +     +T+    G+  W  E+      A 
Sbjct: 144 LFQPCLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSES----RGAT 199

Query: 217 QLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAI 276
                   +G +      +P  L     P  T               WK+  +LV VW  
Sbjct: 200 LAVAAATAHGRE------EPLLL-----PHGTDAGNGGGARGDAGFPWKDVSVLVMVWLC 248

Query: 277 ILALQI--------AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK--IASK 326
              L +               C V YW++ L Q+P A   +AY    +Y  R+K  + ++
Sbjct: 249 FFVLHVFIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY---IIYAKRKKQVLHNQ 305

Query: 327 GD-----EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 381
            D     E TK      + +       G + GL G+GGG +L P+ L++GIPPQ ++AT+
Sbjct: 306 EDGKANPESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATS 365

Query: 382 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 441
            F + F +SMS+V++ LL    +  A  +  +  +A+ VG  V+++ I+  GR S+I+F 
Sbjct: 366 SFMVLFCASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFL 425

Query: 442 LSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++ I+ VS + +   G   +  +     YMGF   C
Sbjct: 426 VTGIMAVSTVIITFFGALDVWAQYTSGAYMGFKLPC 461


>gi|125527981|gb|EAY76095.1| hypothetical protein OsI_04019 [Oryza sativa Indica Group]
          Length = 461

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 125/396 (31%), Positives = 192/396 (48%), Gaps = 42/396 (10%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR-----QRHPTLDIPVIDYDLAL 156
           +F+P+LNL+ G   K + A S  MVTGGA S V+YNL      +R       VIDYD+AL
Sbjct: 88  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAA----VIDYDIAL 143

Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAK 216
           LFQP L+LG+SIGV  NV+F +W+IT L  +     +T+    G+  W  E+      A 
Sbjct: 144 LFQPCLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSES----RGAT 199

Query: 217 QLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAI 276
                   +G +      +P  L     P  T               WK+  +LV VW  
Sbjct: 200 LAVAAATAHGRE------EPLLL-----PHGTDAGNGGGARGDAGFPWKDVSVLVMVWLC 248

Query: 277 ILALQI--------AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK--IASK 326
              L +               C V YW++ L Q+P A   +AY    +Y  R+K  + ++
Sbjct: 249 FFVLHVFIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY---IIYAKRKKQVLHNQ 305

Query: 327 GD-----EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 381
            D     E TK      + +       G + GL G+GGG +L P+ L++GIPPQ ++AT+
Sbjct: 306 EDGKANPESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATS 365

Query: 382 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 441
            F + F +SMS+V++ LL    +  A  +  +  +A+ VG  V+++ I+  GR S+I+F 
Sbjct: 366 SFMVLFCASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFL 425

Query: 442 LSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++ I+ VS + +   G   +  +     YMGF   C
Sbjct: 426 VTGIMAVSTVIITFFGALDVWAQYTSGAYMGFKLPC 461


>gi|357125384|ref|XP_003564374.1| PREDICTED: uncharacterized protein LOC100827655 [Brachypodium
           distachyon]
          Length = 469

 Score =  154 bits (390), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 121/403 (30%), Positives = 197/403 (48%), Gaps = 55/403 (13%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
            +F+P+LNL+ G   K +A  S  MVTGGA S V+YNL      +   +IDYD+ALLFQP
Sbjct: 96  SLFLPILNLVAGLSLKRAATYSSFMVTGGAASNVLYNLLWNRGRV---LIDYDIALLFQP 152

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETI----------T 210
            L+LG+SIGV  NV+F +W+IT L  +     + +    GV+ W+ E+            
Sbjct: 153 CLLLGVSIGVVCNVMFPEWLITALFSLFLAFCTVKTCRAGVKIWRSESCAASAAVAVAAA 212

Query: 211 KREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLL 270
           +    K+  L++L  G                      ++  ++         WK+  +L
Sbjct: 213 RHNENKEPLLVLLPAG----------------------QDGDQAAAGNGAGFPWKDVSVL 250

Query: 271 VAVWAIILALQ--IAKNY------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK 322
           VAVW     L   I   +       T C + YW+  + Q+P +   +AY    +Y  R+K
Sbjct: 251 VAVWLCFFLLHAFIGDKHGKGMIRITPCGIAYWLFTISQVPFSVAFTAY---IIYAKRKK 307

Query: 323 IASKGDEGTKWRA--------SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPP 374
              +  E  K           S L+   A  +T G + GL G+GGG +L P+ L++GIPP
Sbjct: 308 QLLRNQEDGKANCVETKTETMSSLILPLAAFVT-GSLSGLFGIGGGLLLNPVLLQMGIPP 366

Query: 375 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 434
           Q ++AT+ F + F +SMS+V++ LL    +  A  +  +  +A+  G  +++K ++  GR
Sbjct: 367 QTAAATSSFMVLFCASMSMVQFILLGMDGIGEASVYAGICFVASVAGAVLIEKAVRKSGR 426

Query: 435 ASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            S+I+F ++ I+ +S + +   G   + K+     YMGF   C
Sbjct: 427 VSMIVFLVTAIMALSTVIVTCFGALDVWKQYNGGAYMGFKLPC 469


>gi|414880188|tpg|DAA57319.1| TPA: hypothetical protein ZEAMMB73_804983 [Zea mays]
          Length = 458

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 184/392 (46%), Gaps = 30/392 (7%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           +F+P+LNL+ G   K + A S  MVTGGA S V+YNL          +IDYD+ALLFQP 
Sbjct: 81  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
           L+LG+SIGV  NV+F +W+IT+L  +     + +    GV+ W+ E+             
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSESS------------ 188

Query: 222 VLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIY-WKEFGLLVAVWAIILAL 280
             G G      +    +      P  T +            + W +  LLV +W    AL
Sbjct: 189 --GCGSGARSSRGGHSHSKEPLLPRGTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFAL 246

Query: 281 QIAKN--------YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTK 332
            +               C V YW++   Q+P A   + Y   A  K R   +   ++G+K
Sbjct: 247 HVLIGDKHGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSK 306

Query: 333 W-------RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 385
                    A  L          G + GL G+GGG +L P+ L++GIPPQ ++AT+ F +
Sbjct: 307 ADLADAGVEALPLPTLPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMV 366

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 445
            F +SMS+V++ LL    V  A  +  +    +  G  V+++ +   GR S+I+F ++ I
Sbjct: 367 LFCASMSMVQFILLGVEGVGQACVYAGVCFAGSVAGVVVIERAVSKSGRVSLIVFLVTAI 426

Query: 446 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           + +SA+ +   G   +  +    EYMGF   C
Sbjct: 427 MALSAVIVTCFGALDVWMQYTTGEYMGFKLPC 458


>gi|224116342|ref|XP_002317274.1| predicted protein [Populus trichocarpa]
 gi|222860339|gb|EEE97886.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 135/466 (28%), Positives = 237/466 (50%), Gaps = 38/466 (8%)

Query: 39  ATSKNKNDHVEP---NIVIKVSAPK----SRSGYKHVWPDIKFGWRIVVGTIIGFFGTAC 91
           A S+++ +  +P   N+ I +   K    S    +     +K    +V+  ++ F   + 
Sbjct: 19  ALSQSRAEETQPLSNNLKIDLFLDKIGKWSHHQIQSQETGLKLAPSMVIAGVLCFIAASV 78

Query: 92  GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVI 150
            S GG+GGGG+++P+L ++   D K++++ S  MVTGG+V+ V+ N+  R        ++
Sbjct: 79  SSAGGIGGGGLYIPILTIVASLDLKTASSFSAFMVTGGSVANVMCNMFTRSAKFGGQTLV 138

Query: 151 DYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK--KET 208
           DYD+A+L +P ++LG+S+GV  N++F +W++T+L  V     + +    GV  WK   E 
Sbjct: 139 DYDIAILSEPCMLLGVSVGVICNLVFPEWLVTILFAVFLACSTFKTCQNGVFHWKLESEE 198

Query: 209 ITKREAAKQLELIVLGNG---YQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWK 265
           + + E+        L NG   Y+T   K   E +S+   P    E   S    +    W 
Sbjct: 199 VNRNESGN------LENGLVEYETST-KESEEVISSVKEPLLGVELTSS----VLRFPWM 247

Query: 266 EFGLLVAVWAI--ILALQIAKNY------TTTCSVLYWVLNLLQIPVAGGVSAY-----E 312
           + G+L  +W    IL L     Y        +C   YWV++ LQIP+A   +A+     E
Sbjct: 248 KLGILFIIWFSFSILYLLRGNRYGEGIIPMESCGFGYWVVSSLQIPLAIMFTAWILYRKE 307

Query: 313 AVALYKGRRKIASKGDEG-TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           +       ++++ KG E  T    S  + +    + AGM+GG+ G+GGG ++ PL L +G
Sbjct: 308 SCQHQTINQQLSVKGMEDLTGGGTSNKLIFPVMALLAGMLGGVFGIGGGMLISPLLLHVG 367

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           I P++++AT  F + FSSSMS ++Y LL    V  A+    +  +A+ +G  V+++ I  
Sbjct: 368 IAPEITAATCSFMVFFSSSMSALQYLLLGMEHVDTAIILSVICFVASLLGLLVVQRAIVK 427

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            GRAS+I+F++S ++ +S + +   G   + +       MGF   C
Sbjct: 428 YGRASMIVFSVSTVMALSTVLMTSFGALNVWRDYNSGRNMGFKLPC 473


>gi|242058941|ref|XP_002458616.1| hypothetical protein SORBIDRAFT_03g036740 [Sorghum bicolor]
 gi|241930591|gb|EES03736.1| hypothetical protein SORBIDRAFT_03g036740 [Sorghum bicolor]
          Length = 473

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 144/483 (29%), Positives = 232/483 (48%), Gaps = 43/483 (8%)

Query: 14  LAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIK 73
           +A +F+     ++ + S+  +L  E +   +    +P+     S+P   + +  V P+I 
Sbjct: 15  IAVSFLSAAAASNTTTSSLQSLLAEVSQWRETQLGDPS-----SSPGGGAAHAGVRPNIV 69

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
             W      ++ F   +  S GGVGGG +F+P+LNL+ G   K + A S  MVTGGA S 
Sbjct: 70  ASW------VLSFLAASVSSAGGVGGGSLFLPILNLVAGLGLKRATAYSSFMVTGGAASN 123

Query: 134 VVYNL---RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
           V+YNL             +IDYD+ALLFQP L+LG+SIGV  NV+F +W+IT+L  V   
Sbjct: 124 VLYNLASRSSTGTGGGGRLIDYDIALLFQPCLLLGVSIGVVCNVVFPEWLITLLFSVFLA 183

Query: 191 VLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETRE 250
             + +    GV+ W+ E+     A          +G   E         ++D   E    
Sbjct: 184 SCTAKTCRAGVKIWRSESGGAGTARGDHH-----HGIGKEPLLLRLPLGTSDGDAE---- 234

Query: 251 PKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKN--------YTTTCSVLYWVLNLLQI 302
                        W +  LLV VW    AL +               CSV YW++ L Q+
Sbjct: 235 --GGGRGNGAGFPWADVALLVMVWLCFFALHVLIGDKHGKGVIRIRPCSVAYWLITLSQL 292

Query: 303 PVAGGVSAYEAVALYKGRRKIASKG--------DEGTKWRASQLVFYCACGITAGMVGGL 354
           P A   + Y  +   + +R + S+         D G +     L    A  +T G + GL
Sbjct: 293 PAAVAFTGY-IIHSKRKKRVVPSQEDGKQEDLVDTGVETTLPSLTLPLAAFVT-GALSGL 350

Query: 355 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 414
            G+GGG +L P+ L++GIPPQ ++AT+ F + F +SMS+V++ LL    +  A  +  + 
Sbjct: 351 FGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFCASMSMVQFILLGVEGIGQASIYAGIC 410

Query: 415 IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFD 474
            +A+ VG  V+++ I+  GR S+I+F ++ I+ +S + +   G   +  +    EYMGF 
Sbjct: 411 FVASVVGVVVIQRAIRKSGRVSLIVFLVTAIMALSTVIVTCFGALDVWMQYTSGEYMGFK 470

Query: 475 NIC 477
            +C
Sbjct: 471 LLC 473


>gi|326504538|dbj|BAJ91101.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 465

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 118/398 (29%), Positives = 192/398 (48%), Gaps = 37/398 (9%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           +F+P+LNL+ G   K +   S  MVTGGA S V+YNL +        +IDYD+ALLFQP 
Sbjct: 83  LFLPILNLVAGLTLKHATTYSSFMVTGGAASNVLYNLWRARGRGRAALIDYDIALLFQPC 142

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET------ITKREAA 215
           L+LG+SIGV  NV+F +W+IT L  +     + +    G + W+ E+         R   
Sbjct: 143 LLLGVSIGVVCNVMFPEWLITALFSLFLAFCTAKTCRAGAKIWRCESAGAGAPAAARHGH 202

Query: 216 KQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWA 275
           K   L+ +G           P+   +D   +  R+      +      WK+  +LV VW 
Sbjct: 203 KVPLLLDVGG---------LPQPSQDDGGLQAARDGGSGGGAGFP---WKDVAVLVIVWL 250

Query: 276 IILALQ--IAKNY------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG 327
               L   I   +         C + YW+  + Q+P A   +AY    +Y  R+K A+  
Sbjct: 251 CFFLLHVFIGDKHGKGVIRIKPCGIAYWLATVSQVPFAVAFTAY---IIYAKRKKQATHH 307

Query: 328 DEG--------TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 379
            E         TK      +         G + GL G+GGG +L P+ L++GIPPQ ++A
Sbjct: 308 HEDGKAHSSVHTKSETLPALALPLAAFVTGSLSGLFGIGGGLLLNPVLLQIGIPPQTAAA 367

Query: 380 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 439
           T+ F + F +SMS+V++ LL    V  A  +  +  +A+  G  +++++++  GR S+I+
Sbjct: 368 TSSFMVLFCASMSMVQFILLGMDGVGEASVYAGICFVASIAGVVLIERVVRKSGRVSMIV 427

Query: 440 FTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           F ++ I+ +S + +   G   +  +     YMGF   C
Sbjct: 428 FLVTAIMALSTVIVTCFGALDVWTQYTGGAYMGFKLPC 465


>gi|3023070|gb|AAC12639.1| hypothetical protein [Mesembryanthemum crystallinum]
          Length = 132

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 81/146 (55%), Positives = 101/146 (69%), Gaps = 14/146 (9%)

Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
           +IDYDLALLFQP+L+LGISIGVAFNVIFADWM+TVLLIVLF+  ST+AF++G++TWKKET
Sbjct: 1   IIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIVLFVGTSTKAFMRGIDTWKKET 60

Query: 209 ITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG 268
           + ++EAAK+ E     +G + E     PE        +E RE       I+EN+YWKE G
Sbjct: 61  LMQKEAAKRAESNG-ADGVEYEPLPAGPE--------KEDRE-----APILENVYWKEVG 106

Query: 269 LLVAVWAIILALQIAKNYTTTCSVLY 294
           LL  VW   LA +I    T TCSV Y
Sbjct: 107 LLCFVWVAFLAFEIINENTATCSVAY 132


>gi|222640475|gb|EEE68607.1| hypothetical protein OsJ_27142 [Oryza sativa Japonica Group]
          Length = 192

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 87/164 (53%), Positives = 119/164 (72%)

Query: 56  VSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDA 115
           ++A  S  G +    D+ F WR+VV T++GF G+  G+VGGVGGGG F PM+NL+VGF+ 
Sbjct: 1   MAASASPRGRRASGRDLGFKWRVVVATVVGFPGSGFGTVGGVGGGGKFGPMVNLLVGFET 60

Query: 116 KSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVI 175
           KS+A VSK M+ G + S+V YNL+  HPT + PVIDY LALLFQP+L+LGI+IGV  +VI
Sbjct: 61  KSAAGVSKGMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVI 120

Query: 176 FADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
           F  W+ITVL+I+LF+  S+R+F KG+  WK ET  + E  ++ E
Sbjct: 121 FPYWLITVLIIILFIGTSSRSFYKGILMWKDETRIQMETREREE 164


>gi|226497914|ref|NP_001144009.1| uncharacterized protein LOC100276828 precursor [Zea mays]
 gi|195635291|gb|ACG37114.1| hypothetical protein [Zea mays]
          Length = 458

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 184/392 (46%), Gaps = 30/392 (7%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           +F+P+LNL+ G   K + A S  MVTGGA S V+YNL          +IDYD+ALLFQP 
Sbjct: 81  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
           L+LG+SIGV  NV+F +W+IT+L  +     + +    GV+ W+ E+             
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSESS------------ 188

Query: 222 VLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIY-WKEFGLLVAVWAIILAL 280
             G G      +    +      P  T +            + W +  LLV +W    AL
Sbjct: 189 --GCGSGARSSRGGHSHSKEPLLPRGTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFAL 246

Query: 281 QIAKN--------YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTK 332
            +               C V YW++   Q+P A   + Y   A  K R   +   ++G+K
Sbjct: 247 HVLIGDKHGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSK 306

Query: 333 W-------RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 385
                    A  L          G + GL G+GGG +L P+ L++GIPPQ ++AT+ F +
Sbjct: 307 ADLADAGVEALPLPTLPLAAFVTGALXGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMV 366

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 445
            F +SMS+V++ LL    V  A  +  +    +  G  V+++ +   GR S+I+F ++ I
Sbjct: 367 LFCASMSMVQFILLGVEGVGQACVYAGVCFAGSVAGVVVIERAVSKSGRVSLIVFLVTAI 426

Query: 446 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           + +SA+ +   G   +  +    EYMGF   C
Sbjct: 427 MALSAVIVTCFGALDVWMQYTTGEYMGFKLPC 458


>gi|328868613|gb|EGG16991.1| hypothetical protein DFA_07972 [Dictyostelium fasciculatum]
          Length = 521

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 114/400 (28%), Positives = 194/400 (48%), Gaps = 37/400 (9%)

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           +K ++ D  F +  +VG  + F G A  S GGVGGGGI++P+L L+  F  K++  +S C
Sbjct: 120 HKSLFHD-HFTFLDIVGLCLLFIGCALSSGGGVGGGGIYIPILILVSKFSPKTAIPLSNC 178

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           +V G A++  + N  +RHP  +  +IDY + LL +PL + G + GV  +     ++I +L
Sbjct: 179 LVAGCAMANFIQNFPRRHPFANKHLIDYSVVLLIEPLTLGGTTFGVYLHTFLPPFVILIL 238

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT 244
           L+V     +   F KG+  +KKE  TK  +                        + N + 
Sbjct: 239 LVVTLTATAITTFRKGLSIYKKENETKSYSQ-----------------------IKNTSI 275

Query: 245 PEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK----NYTTT----CSVLYWV 296
             +  E ++S     ++  W +   +V V A+     + K     Y+      CS  YW 
Sbjct: 276 NSDGSETQQSNP--FKDAEWGKISAIVLVLALSTVFSVLKGGDGEYSMVGIKLCSPTYWT 333

Query: 297 LNLLQIPVAGGVSAYEAVALYKGRRKIASKG---DEGTKWRASQLVFYCACGITAGMVGG 353
           L+    PV        A+ LY+  +K   +G   +    +    ++      + AG++  
Sbjct: 334 LSFAIWPVIIVTWILTALYLYRRWKKNQLQGIKVEGDINYSPQTIILLGFLSVIAGILAS 393

Query: 354 LLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFAL 413
           LLG+GGG I GP+ L++G+ P +++AT+ F I F+S+ S ++Y LL +  + Y L ++  
Sbjct: 394 LLGIGGGMIKGPVLLQMGLSPDITAATSSFMILFTSASSAIQYVLLGKLRLDYGLVYYFT 453

Query: 414 SIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
           + IA FVG   L   +K   R S  IF +  +I +S + L
Sbjct: 454 AFIACFVGTQSLLYAVKKSNRKSYFIFLICLVIVISTILL 493


>gi|357446189|ref|XP_003593372.1| hypothetical protein MTR_2g010740 [Medicago truncatula]
 gi|355482420|gb|AES63623.1| hypothetical protein MTR_2g010740 [Medicago truncatula]
          Length = 466

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 141/463 (30%), Positives = 245/463 (52%), Gaps = 31/463 (6%)

Query: 30  SAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGT 89
           S     KP +T  N N     +++ K+S  K+ +   H    ++    +VV  I+ F  +
Sbjct: 20  SNAKQTKPFSTILNIN-----HVLDKISQLKNVTQGFHE-SQLQISVPLVVAGILCFIAS 73

Query: 90  ACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVV-YNLRQRHPTLDIP 148
           +  S GG+GGGGIF+P+L ++ G D K ++++S  MVTGG+++ V+ Y            
Sbjct: 74  SISSAGGIGGGGIFIPILTIVAGLDLKVASSISAFMVTGGSIANVICYMFTTSTKFGGKS 133

Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK-GVETW--K 205
           +IDYD+AL  +P ++LG+S+GV  N++F +W+IT L+  +FL  ST    K GV  W  +
Sbjct: 134 LIDYDIALSSEPCMLLGVSVGVICNLVFPEWLIT-LMFAVFLAWSTSKTCKSGVMFWNIE 192

Query: 206 KETITKREAAKQLELIVLGNGYQTEECKCDPEYLSND--TTPEETREPKKSKVSIIENIY 263
            E I K    +++E  +L N     +         ND   T EE       + S    I 
Sbjct: 193 SEEIRKNIGVQEIEKGLLENEITMHK--------DNDGSKTVEENLVLVPQENSSKLCIP 244

Query: 264 WKEFGLLVAVWAIILALQIAKNY-------TTTCSVLYWVLNLLQIPVAGGVSAYEAV-- 314
           W + G+L+ +W    ++ + +            C V YW+++ +Q+P+A   +A+  +  
Sbjct: 245 WLKLGVLLLIWFSFFSIYLIRGNGYGQIIPMEPCGVGYWIISSVQVPLAVVFTAWMVLRK 304

Query: 315 ALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPP 374
              + +  I     +     +++LVF     + AGM+GG+ G+GGG ++ PL L++GI P
Sbjct: 305 ESIQDQTLIPQVQCQNRNCPSNKLVFPLM-ALLAGMLGGVFGIGGGMLISPLLLQVGIAP 363

Query: 375 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 434
           +V++AT  F + FSS+MS ++Y LL    V  AL    +  +A+ +G  V++K+I+  GR
Sbjct: 364 EVTAATCSFMVFFSSTMSSLQYLLLGMEHVETALILAIMCFVASLLGLLVVQKVIRKYGR 423

Query: 435 ASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            SII+F++S ++ +S + +   G  K+ +  +  +YMGF   C
Sbjct: 424 PSIIVFSVSIVMSLSIVLMTSFGTLKVWEDYKSGKYMGFKLPC 466


>gi|330818743|ref|XP_003291498.1| hypothetical protein DICPUDRAFT_89425 [Dictyostelium purpureum]
 gi|325078304|gb|EGC31962.1| hypothetical protein DICPUDRAFT_89425 [Dictyostelium purpureum]
          Length = 463

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 209/406 (51%), Gaps = 34/406 (8%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           ++G ++ F G A  S GGVGGGGI++P+L L+  +D KSS  +S C+V G +++  + N 
Sbjct: 77  ILGMVLLFIGCALSSGGGVGGGGIYIPILILVSKWDPKSSIPLSNCLVAGCSLANFIQNF 136

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
            +RHP  +  +ID+ +ALL +PL + G   GV  +  F   +I +LL++     S +   
Sbjct: 137 PRRHPFSNKHLIDFSVALLIEPLTLAGTIFGVYLHTYFPPLVILLLLVITLGFTSFKTIT 196

Query: 199 KGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 258
           KGVE ++KE   K        + +L N +             ND+    +  P     + 
Sbjct: 197 KGVEIYRKEIKAK--------VSLLNNDHHK----------INDSNGSGSSNPNGDGAN- 237

Query: 259 IENIYWKEFGLLVAVWAIILALQIAKNYT----TTCSVLYWVLNLLQIPVAGGVSAYEAV 314
             N+   ++ LL+      +     + Y+      CS LYWVL+ + +PV   +  + A 
Sbjct: 238 -SNV---KYNLLIFSTMFSIFKGGDEEYSLIGVKLCSPLYWVLSFVMVPVIIILWGFTAR 293

Query: 315 ALYKG---RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
            LY+    RR    + +   K+    ++      I AG++  LLG+GGG I GP+ L++G
Sbjct: 294 YLYREYEIRRDEGREIEGEIKYTHKNIIVLGILSIVAGILASLLGIGGGMIKGPVLLQMG 353

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           + P V++AT+ + I F+S+ S ++Y L+ +    Y + ++A+  I+ FVG   L  ++K 
Sbjct: 354 LSPDVTAATSSYMILFTSASSAIQYILVGKLRWDYGIVYYAIGFISCFVGTQTLIWIVKK 413

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
             R S I+F +  +I VS + L    + + I  ++++  + FD+IC
Sbjct: 414 YQRRSYIVFLIGAVISVSTILL---VVTESIDFVKYRN-LSFDSIC 455


>gi|223994523|ref|XP_002286945.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220978260|gb|EED96586.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 385

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 105/392 (26%), Positives = 183/392 (46%), Gaps = 35/392 (8%)

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
           F W  +VGT++ F         GVGGGGI++P+  +++ F  K S  +S+  + G  +  
Sbjct: 3   FLWTDLVGTVVWFLTAGIAVSCGVGGGGIYMPLGMILLRFAPKQSTGLSQACIFGAGLGG 62

Query: 134 VVYNLRQRHPTLDI---------PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           ++ N R+RHP   I         P+IDYDLAL   P+ + G  +GV    +  DW+   +
Sbjct: 63  LIINSRKRHPDRHIRDTKGFYTRPIIDYDLALFQAPMELAGAVVGVTVQRLLPDWLFLSI 122

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT 244
            +V+  +   + F K  E++KK+ + K+  A       L   +  E+   + + +    +
Sbjct: 123 AVVILGLTCFKTFQKFFESYKKDKMQKKHLA------FLAQRHLDEQ---EAQKIPGCPS 173

Query: 245 P-EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 303
           P   + E + + V +       +  +L  +W + L +    +   T  +  W LN +   
Sbjct: 174 PGYNSDESEHTTVELCAESVPDDVMILRCLWRLFLRIFRTSHQALTSMMTQWNLNFVDSS 233

Query: 304 VAGGVSAYEAV----ALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           +   ++++  +     LY G             W   Q+  +      AGM+GGL+G+  
Sbjct: 234 LQRMLTSFRGLRLLHCLYFGL------------WDYKQVRDFSLVSFGAGMIGGLVGISA 281

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           G+  GP  ++ G+ P+VS+AT    +  +SS   V + L    P  YALYFF + +  AF
Sbjct: 282 GYFTGPFMIQRGLHPRVSTATTATTMLLTSSSVAVMFVLSGLLPWEYALYFFLVCVTGAF 341

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
           VG+  +   +K  G AS++I  L+ II  S L
Sbjct: 342 VGKTRIDAYVKKTGMASVLIGALATIIGCSTL 373


>gi|125527980|gb|EAY76094.1| hypothetical protein OsI_04018 [Oryza sativa Indica Group]
          Length = 397

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 113/394 (28%), Positives = 196/394 (49%), Gaps = 43/394 (10%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL---RQRHPTLDIPVIDYDLALLF 158
           ++VP+LN++ G   K++ A S  MVTGG +S V+Y L   R        P+IDYD+A++ 
Sbjct: 29  LYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTLIVLRGHEKGGHQPLIDYDIAVVS 88

Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
           QP L+LG+S+GV  NV+F +W+IT L  V     + + +  G++ W+ ET   R   +  
Sbjct: 89  QPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFKTYGTGMKRWRAETAAARRMLEGG 148

Query: 219 ELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL 278
               LG+G          E L      +  R                +  +LV +W    
Sbjct: 149 S--SLGDG--------AGEALLGQKDGDGHRRQ------------CVDLMVLVTIWLCFF 186

Query: 279 ALQI------AKNY--TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE- 329
            + +      AK       C V YW++ + QIPVA    A+ A  +++ R+  A    E 
Sbjct: 187 VIHLFIGGEGAKGVFDIEPCGVTYWLITIAQIPVA---VAFTACIVHQKRKSHAQNSQEF 243

Query: 330 ------GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
                  +K  +  +  +    +  G++ GL G+GGG +L P+ L++G+PP+ +S+T +F
Sbjct: 244 DQAISVKSKLESLPVYVFPVAALLTGVMSGLFGIGGGLLLNPVLLQIGVPPKTASSTTMF 303

Query: 384 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 443
            + F +SMS+V++ +L    +  AL +     +A+ VG  V++  I+  GR S+I+F ++
Sbjct: 304 MVLFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGAIRKSGRVSLIVFMVA 363

Query: 444 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            I+ +S + +   G  ++  +    +YMGF   C
Sbjct: 364 AILALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 397


>gi|53792418|dbj|BAD53256.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|53792472|dbj|BAD53380.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|125572272|gb|EAZ13787.1| hypothetical protein OsJ_03712 [Oryza sativa Japonica Group]
          Length = 397

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 196/394 (49%), Gaps = 43/394 (10%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL---RQRHPTLDIPVIDYDLALLF 158
           ++VP+LN++ G   K++ A S  MVTGG +S V+Y L   R        P+IDYD+A++ 
Sbjct: 29  LYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTLIVLRGHEKGGHQPLIDYDIAVVS 88

Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
           QP L+LG+S+GV  NV+F +W+IT L  V     + + +  G++ W+ ET   R   +  
Sbjct: 89  QPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFKTYGTGMKRWRAETAAARRMLEGG 148

Query: 219 ELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL 278
               LG+G          E L      +  R                +  +LV +W    
Sbjct: 149 S--SLGDG--------AGEALLGQKDGDGHRRQ------------CVDLMVLVTIWLCFF 186

Query: 279 ALQI------AKNY--TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE- 329
            + +      AK       C V YW++ + QIP+A    A+ A  +++ R+  A    E 
Sbjct: 187 VIHLFIGGEGAKGVFDIEPCGVTYWLITIAQIPIA---VAFTACIVHQKRKSHAQNSQEF 243

Query: 330 ------GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
                  +K  +  +  +    +  G++ GL G+GGG +L P+ L++G+PP+ +S+T +F
Sbjct: 244 DQAISVKSKLESLPVYVFPVAALLTGVMSGLFGIGGGLLLNPVLLQIGVPPKTASSTTMF 303

Query: 384 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 443
            + F +SMS+V++ +L    +  AL +     +A+ VG  V++  I+  GR S+I+F ++
Sbjct: 304 MVLFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGTIRKSGRVSLIVFMVA 363

Query: 444 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            I+ +S + +   G  ++  +    +YMGF   C
Sbjct: 364 AILALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 397


>gi|356524768|ref|XP_003531000.1| PREDICTED: uncharacterized protein LOC100806202 [Glycine max]
          Length = 473

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 114/392 (29%), Positives = 211/392 (53%), Gaps = 20/392 (5%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQP 160
           +F+P+L ++   D K+++++S  MVTGG+++ V+ NLR  +P L    +IDYD+ALL +P
Sbjct: 86  LFIPILTIVASLDLKTASSLSAFMVTGGSIANVMCNLRATNPKLGGKSLIDYDIALLSEP 145

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR-----EAA 215
            ++LG+S+GV  N++F +W+IT+L  V     +++    GV  WK E+  +R     E  
Sbjct: 146 CMLLGVSVGVICNLVFPEWLITMLFAVFLTWSTSKTCNSGVVFWKIESEERRKNDGFEGL 205

Query: 216 KQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWA 275
           ++  L    +  + E  + + E     +  E+   P++   +I   I W +  +L+ VW 
Sbjct: 206 EKGLLEDESSEEREEGVQVEKEKEKVKSIEEQVMVPEE---NIRVRIPWLKLVVLLLVWF 262

Query: 276 IILALQIAKNY--------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG 327
              +L + +             C V YW+++  Q+P+A   +A+        + +   + 
Sbjct: 263 SFFSLYLLRGNKYGQSIIPMEPCGVGYWIISSAQVPLALFFTAWIVYRKESHQDQNLMQE 322

Query: 328 DE--GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 385
           D    +   +++L+F     + AG++GG+ G+GGG ++ PL L +GI P+V++AT  F +
Sbjct: 323 DSCLSSNGPSNKLIFP-MMALLAGILGGVFGIGGGMLISPLLLHVGIAPEVTAATCSFMV 381

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 445
            FSS+MS ++Y LL    +  AL    +  +A+ +G  V++K I+  GR S+I+F++S +
Sbjct: 382 FFSSTMSALQYLLLGMDHIETALILALICFVASLIGLLVVQKAIQSYGRPSLIVFSVSIV 441

Query: 446 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           + +S + +   G  +  K      YMGF   C
Sbjct: 442 MTLSIVLMTSFGAIRTWKDYTSGRYMGFKLPC 473


>gi|297737341|emb|CBI26542.3| unnamed protein product [Vitis vinifera]
          Length = 220

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 99/194 (51%), Positives = 120/194 (61%), Gaps = 25/194 (12%)

Query: 72  IKFGWRIVVGTIIGFFGTACGSVGG---VGGGGIFVPMLNLIVGFDAKSSAAVSK----C 124
           +KFGW+I   +IIGFFG   G  GG   V  GG FV           +SS  ++     C
Sbjct: 1   MKFGWQIDDCSIIGFFGATFGVGGGGSFVTCGGSFVTC--------GRSSEVINSICWSC 52

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           MV G   S V YNL+ RHPTLD+P+I YDLALLFQP+L++GISIGV FNV  ADWM+T+L
Sbjct: 53  MVMGAVGSTVYYNLKLRHPTLDMPIICYDLALLFQPMLMMGISIGVVFNVAVADWMVTIL 112

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT 244
           LIVLFL   T+AF+KGVETWKKETI KR            NG +  E K  P  LSN T 
Sbjct: 113 LIVLFLGTPTKAFIKGVETWKKETIMKRNKC---------NGTKEVEYKPLPSGLSNGTQ 163

Query: 245 PEETREPKKSKVSI 258
              TR+ ++ +VSI
Sbjct: 164 -NATRKFEELEVSI 176


>gi|440293005|gb|ELP86177.1| hypothetical protein EIN_328840 [Entamoeba invadens IP1]
          Length = 492

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 114/461 (24%), Positives = 214/461 (46%), Gaps = 57/461 (12%)

Query: 71  DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
           ++ F W++ VG+I   F     +  G+GGG  ++ +  LI+  DA  +  +SK    G A
Sbjct: 2   ELTFDWKLAVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDAHEAVPLSKITTFGVA 61

Query: 131 VSAVVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF 189
               +    + HPT+   P+I Y  AL+ +PL + G  +GV  N+    W+I V+L++L 
Sbjct: 62  CGGYLVLWMKMHPTIKYKPLISYATALMVEPLTIYGTMLGVILNITSPSWLIIVVLVLLL 121

Query: 190 LVLSTRAFLKGVETWKKETITKREA--------AKQLELIVLGNGYQTEECKCDPEYLS- 240
              S + F K  + +K E    ++A         KQ +     +   TE  K   E    
Sbjct: 122 GYTSYKTFTKAWKQYKAENEKMKQAKIAIEMKEEKQPDQDTANDDMTTESLKTGAESDDK 181

Query: 241 -------------NDTTPEETREPK--------KSKVSIIENIYWKEFGLL--------- 270
                        N++TP++ +  +        + +   +E+   KE  +L         
Sbjct: 182 VQSGVIVSDKIDLNESTPQDDKNEQGTGPQLLPQDENETVEDTKKKEEKILFKREVLKSI 241

Query: 271 ------VAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVAL 316
                 +AVWA++  + I +      SV+        YW+L  +  P+   V+      L
Sbjct: 242 LSIIILIAVWAVMFCIVILRGGGKMDSVVGVQCGTPWYWILTGVGAPLMLTVTLIVGGVL 301

Query: 317 YKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQV 376
           +   R    +G+   +W     +      + AG+    LG+GGG ++GP+ LE+G+ PQV
Sbjct: 302 WYKHRGEHIEGE--VQWSVKNCIIIPVGALFAGVSAAFLGIGGGMVIGPILLEIGVLPQV 359

Query: 377 SSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRAS 436
           ++AT+ F I F++S S ++Y +  +  +   L++F +  I A  GQ    K+++ + R S
Sbjct: 360 ATATSAFMIMFTASSSSLQYIIDGKLDLYCGLWYFGIGFIGAAFGQFGFSKIVQKMNRQS 419

Query: 437 IIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +I F L  +I +S L++  + + ++I+ +++   +GF ++C
Sbjct: 420 VIGFFLGALIVLSTLAMVAMTVIQLIQDVKNHN-LGFHHLC 459


>gi|297837261|ref|XP_002886512.1| hypothetical protein ARALYDRAFT_475155 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332353|gb|EFH62771.1| hypothetical protein ARALYDRAFT_475155 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 461

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 128/431 (29%), Positives = 221/431 (51%), Gaps = 46/431 (10%)

Query: 70  PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
           P I+     ++  ++ F  ++  S GG+GGGG++VP++ ++ G D K++++ S  MVTGG
Sbjct: 54  PKIELTTSTIIAGLLSFLASSISSAGGIGGGGLYVPIMTIVAGLDLKTASSFSAFMVTGG 113

Query: 130 AVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
           +++ V  NL  R+P +    +ID+DLALL +P ++LG+SIGV  N++F +W+IT L  V 
Sbjct: 114 SIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGVSIGVICNLVFPNWLITSLFAV- 172

Query: 189 FLVLST-RAFLKGVETWK--KETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTP 245
           FL  ST + F  G+  W+   E +  RE+ +         G   EE K +   L      
Sbjct: 173 FLAWSTLKTFGNGLYYWRLESEMVKIRESNRI--------GEDDEEDKIESLKLP---LL 221

Query: 246 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT--------TTCSVLYWVL 297
           E+   PK+          W + G+LV +W    A+ + +             C   YW++
Sbjct: 222 EDYERPKRFP--------WIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNAYWLI 273

Query: 298 NLLQIPV----------AGGVSAYEAVALYKGRRKIAS-KGDEGTKWRASQLVFYCACGI 346
           +  QIP+          +  V + +        + +   + ++G +   S    +    +
Sbjct: 274 SSSQIPLTLFFTLWICFSDNVQSQQPSDYNVSIKDVEDLRSNDGAR---SNKCMFPVMAL 330

Query: 347 TAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 406
            AG++GG+ G+GGG ++ PL L++GI P+V++AT  F + FSS+MS ++Y LL       
Sbjct: 331 LAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGMEHTGT 390

Query: 407 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 466
           A  F  +  +A+ VG  V++K+I   GRASII+F++  ++ +S + +   G   +     
Sbjct: 391 ASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVCIVMALSIVLMTSYGALDVWNDYV 450

Query: 467 HKEYMGFDNIC 477
              YMGF   C
Sbjct: 451 AGRYMGFKLPC 461


>gi|186512097|ref|NP_193857.3| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|332659032|gb|AEE84432.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 449

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 136/418 (32%), Positives = 220/418 (52%), Gaps = 30/418 (7%)

Query: 71  DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
           ++K    I++  ++ F      S GG+GGGG+F+P++ ++ G D K++++ S  MVTGG+
Sbjct: 51  ELKLSSAIIMAGVLCFLAALISSAGGIGGGGLFIPIMTIVAGVDLKTASSFSAFMVTGGS 110

Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
           ++ V+ NL           +DYDLALL +P ++LG+SIGV  N +  +W+ITVL  V   
Sbjct: 111 IANVISNLFGGKAL-----LDYDLALLLEPCMLLGVSIGVICNRVLPEWLITVLFAVFLA 165

Query: 191 VLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETRE 250
             S +    GV+ WK E+   RE+         G G   EE K       N   P    +
Sbjct: 166 WSSLKTCRSGVKFWKLESEIARESGHGRP--ERGQGQIEEETK-------NLKAPLLEAQ 216

Query: 251 PKKSKVSIIENIYWKEFGLLVAVWA---IILALQIAKN-----YTTTCSVLYWVLNLLQI 302
             K+K      I W + G+LV VWA   +I  L+  K+         C V YW+L  LQI
Sbjct: 217 ATKNK----SKIPWTKLGVLVIVWASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQI 272

Query: 303 PVAGGVSAYEAVALYKGRRKIA---SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           P+A  +    A++  + R++ +    K  EGT+   S  + + A    AG++GG+ G+GG
Sbjct: 273 PLAL-IFTKLALSRTESRQEQSPNDQKNQEGTRLDKSTRLKFPAMSFLAGLLGGIFGIGG 331

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           G ++ PL L+ GIPPQ+++AT  F + FS++MS V+Y LL       A  F  +  +A+ 
Sbjct: 332 GMLISPLLLQSGIPPQITAATTSFMVFFSATMSAVQYLLLGMQNTDTAYVFSFICFLASL 391

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +G  +++K +   GRASII+F++  ++ +S + +   G   +       + MGF   C
Sbjct: 392 LGLVLVQKAVAQFGRASIIVFSVGTVMSLSTVLMTSFGALDVWTDYVAGKDMGFKLPC 449


>gi|281207304|gb|EFA81487.1| hypothetical protein PPL_05475 [Polysphondylium pallidum PN500]
          Length = 500

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/410 (27%), Positives = 194/410 (47%), Gaps = 41/410 (10%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           +VG  + F G A  S GGVGGGGI++P+L L+  F  K++  +S C+V G + + ++ N 
Sbjct: 113 IVGMALLFLGCALSSGGGVGGGGIYIPILILVNKFSPKTAIPLSNCLVAGCSFANLIQNF 172

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
            +RHP  +  +IDY + LL +PL + G   G+  + +   ++I +LL+V     S   F 
Sbjct: 173 PRRHPHANKHLIDYSVVLLIEPLTLGGTVFGIYLHTVLPPYVILILLVVTLTATSATTFK 232

Query: 199 KGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 258
           KG++  KKE   K                         EYL  +   +    P+K KV+ 
Sbjct: 233 KGLDLRKKENTKK-------------------------EYLLINNNSDAYLTPEK-KVNP 266

Query: 259 IENIYWKEFGLLVAVWAIILALQIAKNYTTT--------CSVLYWVLNLLQIPVAGGVSA 310
             +  W +   ++++  +     + K   +         CS  YWVL+    P+      
Sbjct: 267 FLDADWVKIFAILSILILSTMFSVFKGGDSEVSLIGIKLCSPPYWVLSFAIWPIIIITWI 326

Query: 311 YEAVALYKGRRKIASKGD--EGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 367
           + A  LY    +  + G   EG  ++    ++      + AG++  LLG+GGG I GP+ 
Sbjct: 327 FTARYLYGQWLRNQADGTIIEGDIRYSRKTIILLGILSVVAGILASLLGIGGGMIKGPVL 386

Query: 368 LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           L +G+ P + +AT+ F I F+S+ S  +Y LL +  + Y L ++ +   A FVG   L  
Sbjct: 387 LAMGLSPDIVAATSSFMILFTSASSAFQYILLGKLRLDYGLVYYIIGFAACFVGTQTLIW 446

Query: 428 LIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++    + S IIF ++ II +S + L    +   +  +E  +   F +IC
Sbjct: 447 VVNKYKKRSYIIFLITAIIVISTILL----VVTEVLDLEKYKNQPFQSIC 492


>gi|6554197|gb|AAF16643.1|AC011661_21 T23J18.20 [Arabidopsis thaliana]
          Length = 491

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 128/414 (30%), Positives = 211/414 (50%), Gaps = 34/414 (8%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           ++  ++ FF  +  S GG+GGGG+F+ ++ +I G + K++++ S  MVTG + + V  NL
Sbjct: 97  IIAAVLSFFAASISSAGGIGGGGLFLSIMTIIAGLEMKTASSFSAFMVTGVSFANVGCNL 156

Query: 139 RQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLST-RA 196
             R+P + D  +ID+DLAL  QP L+LG+SIGV  N +F +W++ + L  +FL  ST + 
Sbjct: 157 FLRNPKSRDKTLIDFDLALTIQPCLLLGVSIGVICNRMFPNWLV-LFLFAVFLAWSTMKT 215

Query: 197 FLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKV 256
             KGV  W  E  ++R   K                  D   ++      E RE  + + 
Sbjct: 216 CKKGVSYWNLE--SERAKIKSPR-------------DVDGIEVARSPLLSEEREDVRQRG 260

Query: 257 SIIENIYWKEFGLLVAVWAIILALQIAKNYT--------TTCSVLYWVLNLLQIPVAGGV 308
            I     W + G+LV +W +  ++ + +             C  LYW L+ LQIP+    
Sbjct: 261 MI--RFPWMKLGVLVIIWLLFFSINLFRGNKYGQGIISIKPCGALYWFLSSLQIPLTIFF 318

Query: 309 S-----AYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 363
           +     +    + +       S+ + G   R ++L+      + AG++GGL G+GGG ++
Sbjct: 319 TLCIYFSDNVQSNHTSHSNQNSEQETGVGGRQNKLML-PVMALLAGVLGGLFGIGGGMLI 377

Query: 364 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 423
            PL L++GI P+V++AT  F + FSSSMS ++Y LL       A  F  +  +A+ VG  
Sbjct: 378 SPLLLQIGIAPEVTAATCSFMVLFSSSMSAIQYLLLGMEHAGTAAIFALVCFVASLVGLM 437

Query: 424 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           V+KK+I   GRASII+F +  ++ +S + +   G   +        YMGF   C
Sbjct: 438 VVKKVIAKYGRASIIVFAVGIVMALSTVLMTTHGAFNVWNDFVSGRYMGFKLPC 491


>gi|15220600|ref|NP_176367.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|13272465|gb|AAK17171.1|AF325103_1 unknown protein [Arabidopsis thaliana]
 gi|4508075|gb|AAD21419.1| Unknown protein [Arabidopsis thaliana]
 gi|111074464|gb|ABH04605.1| At1g61740 [Arabidopsis thaliana]
 gi|332195760|gb|AEE33881.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 458

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 125/431 (29%), Positives = 221/431 (51%), Gaps = 46/431 (10%)

Query: 70  PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
           P I+     ++  ++ F  ++  S GG+GGGG++VP++ ++ G D K++++ S  MVTGG
Sbjct: 51  PRIELTTSTIIAGLLSFLASSISSAGGIGGGGLYVPIMTIVAGLDLKTASSFSAFMVTGG 110

Query: 130 AVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
           +++ V  NL  R+P +    +ID+DLALL +P ++LG+SIGV  N++F +W+IT L  V 
Sbjct: 111 SIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGVSIGVICNLVFPNWLITSLFAV- 169

Query: 189 FLVLST-RAFLKGVETWK--KETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTP 245
           FL  ST + F  G+  W+   E +  RE+ +   +       + E  K            
Sbjct: 170 FLAWSTLKTFGNGLYYWRLESEMVKIRESNR---IEEDDEEDKIESLKLP--------LL 218

Query: 246 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT--------TTCSVLYWVL 297
           E+ + PK+          W + G+LV +W    A+ + +             C   YW++
Sbjct: 219 EDYQRPKRFP--------WIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNAYWLI 270

Query: 298 NLLQIPV----------AGGVSAYEAVALYKGRRKIAS-KGDEGTKWRASQLVFYCACGI 346
           +  QIP+          +  V + +    +   + +   + ++G +   S    +    +
Sbjct: 271 SSSQIPLTLFFTLWICFSDNVQSQQQSDYHVSVKDVEDLRSNDGAR---SNKCMFPVMAL 327

Query: 347 TAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 406
            AG++GG+ G+GGG ++ PL L++GI P+V++AT  F + FSS+MS ++Y LL       
Sbjct: 328 LAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGMEHTGT 387

Query: 407 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 466
           A  F  +  +A+ VG  V++K+I   GRASII+F++  ++ +S + +   G   +     
Sbjct: 388 ASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTSYGALDVWNDYV 447

Query: 467 HKEYMGFDNIC 477
              YMGF   C
Sbjct: 448 SGRYMGFKLPC 458


>gi|4079632|emb|CAA10487.1| hypothetical protein [Arabidopsis thaliana]
          Length = 389

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 115/399 (28%), Positives = 203/399 (50%), Gaps = 46/399 (11%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQP 160
           ++VP++ ++ G D K++++ S  MVTGG+++ V  NL  R+P +    +ID+DLALL +P
Sbjct: 14  LYVPIMTIVAGLDLKTASSFSAFMVTGGSIANVGCNLFVRNPKSGGKTLIDFDLALLLEP 73

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLST-RAFLKGVETWK--KETITKREAAKQ 217
            ++LG+SIGV  N++F +W+IT L  V FL  ST + F  G+  W+   E +  RE+ + 
Sbjct: 74  CMLLGVSIGVICNLVFPNWLITSLFAV-FLAWSTLKTFGNGLYYWRLESEMVKIRESNR- 131

Query: 218 LELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAII 277
             +       + E  K            E+ + PK+          W + G+LV +W   
Sbjct: 132 --IEEDDEEDKIESLKLP--------LLEDYQRPKRFP--------WIKLGVLVIIWLSY 173

Query: 278 LALQIAKNYT--------TTCSVLYWVLNLLQIPV----------AGGVSAYEAVALYKG 319
            A+ + +             C   YW+++  QIP+          +  V + +    +  
Sbjct: 174 FAVYLLRGNKYGEGIISIEPCGNAYWLISSSQIPLTLFFTLWICFSDNVQSQQQSDYHVS 233

Query: 320 RRKIAS-KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS 378
            + +   + ++G +   S    +    + AG++GG+ G+GGG ++ PL L++GI P+V++
Sbjct: 234 VKDVEDLRSNDGAR---SNKCMFPVMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTA 290

Query: 379 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASII 438
           AT  F + FSS+MS ++Y LL       A  F  +  +A+ VG  V++K+I   GRASII
Sbjct: 291 ATCSFMVLFSSTMSAIQYLLLGMEHTGTASIFAVICFVASLVGLKVVQKVITEYGRASII 350

Query: 439 IFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +F++  ++ +S + +   G   +        YMGF   C
Sbjct: 351 VFSVGIVMALSIVLMTSYGALDVWNDYVSGRYMGFKLPC 389


>gi|363807478|ref|NP_001242649.1| uncharacterized protein LOC100803518 precursor [Glycine max]
 gi|255636709|gb|ACU18690.1| unknown [Glycine max]
          Length = 473

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 122/422 (28%), Positives = 230/422 (54%), Gaps = 18/422 (4%)

Query: 71  DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
            I+    IVV  ++ F  ++  S GG+GGGG+F+P+L ++   D K+++++S  MVTGG+
Sbjct: 55  QIQISGPIVVAGVLCFIASSISSAGGIGGGGLFLPILTIVACLDLKTASSLSAFMVTGGS 114

Query: 131 VSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF 189
           ++ V+ NL    P      +IDYD+ALL +P ++LG+S+GV  N++F +W+IT+L  V  
Sbjct: 115 IANVLCNLCATSPKFGGKSLIDYDIALLSEPCMLLGVSVGVICNLVFPEWLITMLFAVFL 174

Query: 190 LVLSTRAFLKGVETWKKETITKR--EAAKQLELIVLGNG---YQTEECKCDPEYLSNDTT 244
              +++    GV  WK E+  +R  +  ++LE  +L +G    + E  + + E     + 
Sbjct: 175 TWSTSKTCNSGVLFWKIESEERRKNDGFERLEKGLLEDGSSEEREERVQVNNEKAGMKSI 234

Query: 245 PEETREPKKS-KVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTT------CSVLYWVL 297
            E+   P+++ ++ I          + ++ ++  L L     Y  +      C V YW+L
Sbjct: 235 EEQVMVPEENIRMRIPWLKLVVLLLVWLSFFS--LYLLRGNKYGQSIIPMEPCGVGYWIL 292

Query: 298 NLLQIPVAGGVSAYEAVALYKGRRKIASKGDE--GTKWRASQLVFYCACGITAGMVGGLL 355
           +  Q+P+A   +A+        + +   + D    +   +++L+F     + AG++GG+ 
Sbjct: 293 SSAQVPLALFFTAWIVYRKESHQDQNLMQEDPCLSSNGPSNKLIFP-MMALLAGILGGVF 351

Query: 356 GLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSI 415
           G+GGG ++ PL L +GI P+V++AT  F + FSS+MS ++Y LL    +  AL    +  
Sbjct: 352 GIGGGMLISPLLLHVGIAPEVTAATCSFMVFFSSTMSALQYLLLGMDHIETALILALICF 411

Query: 416 IAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDN 475
           +A+ +G  V+++ ++  GR S+I+F++S ++ +S + +   G+ +  K      YMGF  
Sbjct: 412 VASLIGLLVVQRAVQSYGRPSLIVFSVSIVMTLSIVLMTSFGVIRTWKDYTSGRYMGFKL 471

Query: 476 IC 477
            C
Sbjct: 472 PC 473


>gi|222619377|gb|EEE55509.1| hypothetical protein OsJ_03713 [Oryza sativa Japonica Group]
          Length = 351

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 116/373 (31%), Positives = 178/373 (47%), Gaps = 42/373 (11%)

Query: 125 MVTGGAVSAVVYNLR-----QRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           MVTGGA S V+YNL      +R       VIDYD+ALLFQP L+LG+SIGV  NV+F +W
Sbjct: 1   MVTGGAASNVLYNLLCTGCGRRAAA----VIDYDIALLFQPCLLLGVSIGVVCNVMFPEW 56

Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYL 239
           +IT L  +     +T+    G+  W  E+      A         +G +      +P  L
Sbjct: 57  LITALFALFLAFCTTKTLRAGLRIWSSES----RGATLAVAAATAHGRE------EPLLL 106

Query: 240 SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQI--------AKNYTTTCS 291
                P  T               WK+  +LV VW     L +               C 
Sbjct: 107 -----PHGTDAGNGGGARGDAGFPWKDVSVLVMVWLCFFVLHVFIGDKHGKGMIRIKPCG 161

Query: 292 VLYWVLNLLQIPVAGGVSAYEAVALYKGRRK--IASKGD-----EGTKWRASQLVFYCAC 344
           V YW++ L Q+P A    A+ A  +Y  R+K  + ++ D     E TK      + +   
Sbjct: 162 VAYWLITLSQVPFA---VAFTAYIIYAKRKKQVLHNQEDGKANPESTKMDTLPTLLFPLA 218

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
               G + GL G+GGG +L P+ L++GIPPQ ++AT+ F + F +SMS+V++ LL    +
Sbjct: 219 AFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFCASMSMVQFILLGMQGI 278

Query: 405 PYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKR 464
             A  +  +  +A+ VG  V+++ I+  GR S+I+F ++ I+ VS + +   G   +  +
Sbjct: 279 GEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFLVTGIMAVSTVIITFFGALDVWAQ 338

Query: 465 IEHKEYMGFDNIC 477
                YMGF   C
Sbjct: 339 YTSGAYMGFKLPC 351


>gi|348688213|gb|EGZ28027.1| hypothetical protein PHYSODRAFT_469591 [Phytophthora sojae]
          Length = 545

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 137/528 (25%), Positives = 234/528 (44%), Gaps = 93/528 (17%)

Query: 39  ATSKNKND----HVEPNIVIKVSAPKSRSGYKH------VWPD---IKFGWRIVVGTIIG 85
           A ++N ND     VEP +  + +      G +H      V  D          + G+I  
Sbjct: 24  ALAQNVNDSPVLDVEPEVSCEAATDCLAQGLEHSTCRAGVCVDDALFPLSREEIAGSISA 83

Query: 86  FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL 145
           F  +   +  G+GGGG+ VP+  + +   +  +  +SK  + GGA+++ + N+R+RHP +
Sbjct: 84  FLASVVAAGSGLGGGGLLVPLYIMTMSMSSHEAVPLSKATIFGGAIASFLLNVRKRHPLV 143

Query: 146 -DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETW 204
              P+IDY+  LL +P+ + G  IGV  N +F +W+IT+ ++ L    + R + KG   W
Sbjct: 144 RSRPLIDYETILLMEPMTLAGTIIGVNMNAVFPEWLITLCIVWLLTKTALRTYSKGKTIW 203

Query: 205 KKETIT-------------------------------------KREAAKQLELIVLGNGY 227
           K+E                                        K    ++L L +L    
Sbjct: 204 KEEADADTKIVSDIVAYWRLLPYESNFKQFRAVARAYLKWKSYKSPEKEELRLKILAGKA 263

Query: 228 QTEECKCDPEYLSNDTTPEETREPKKSK----------------VSIIE------NIYWK 265
            +EE           T  E + +  +S+                +S+ E       +   
Sbjct: 264 SSEEDHSSSNSTEASTEEEASSDENESESLMSWGLQDKKRPVKFLSVEEIAKARRTVPMA 323

Query: 266 EFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAV--- 314
           + G+L   W  ++   +AK    T SV+        YW L ++  P   GV+ Y  +   
Sbjct: 324 DMGVLFLTWVGLVLFSMAKGGHGTPSVIGLSCGSFGYWSLIVVSFPFFMGVTIYFGMKIS 383

Query: 315 ---ALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
              A+ +      +KGD    W    ++ Y A    AG+  GLLG+GGG + GPL LE+G
Sbjct: 384 RFHAMLQASDYTYAKGD--MVWTKHAVIKYPALCTAAGVAAGLLGIGGGMVKGPLLLEMG 441

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           + PQVSSAT+   I F+SS + +++ +L    V +AL+   +  IA  +GQ  +  LIK 
Sbjct: 442 LIPQVSSATSSSMILFTSSATTIQFIILGTLSVEHALWHGTVGFIAGLIGQLGMSYLIKK 501

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 479
             +++++IF ++  I VS   +G +G+A    RI    + GF ++C +
Sbjct: 502 YRKSALVIFLIAIFIGVSGGVMGVLGVA----RISEIGFGGFRSLCLS 545


>gi|297804026|ref|XP_002869897.1| hypothetical protein ARALYDRAFT_492757 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315733|gb|EFH46156.1| hypothetical protein ARALYDRAFT_492757 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 447

 Score =  138 bits (347), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 139/454 (30%), Positives = 233/454 (51%), Gaps = 37/454 (8%)

Query: 39  ATSKNKNDHVEP----NIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSV 94
           A + N+ +  +P    N++ KV   ++ S  +    ++K    I+V  ++ F      S 
Sbjct: 16  AINANQEEENQPQSHHNLLHKVQQWRT-SLKESSSAELKLSSAIIVAGVLCFLAALISSA 74

Query: 95  GGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           GG+GGGG+F+P++ ++ G D K++++ S  MVTGG+++ V+ NL           +DYDL
Sbjct: 75  GGIGGGGLFIPIMTIVAGLDLKTASSFSAFMVTGGSIANVISNLFGGKAL-----LDYDL 129

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
           ALL +P ++LG+SIGV  N +  +W+IT L  V     S +    GV+ WK E+   R  
Sbjct: 130 ALLLEPCMLLGVSIGVICNRVLPEWLITALFAVFLAWSSLKTCRSGVKFWKIESEIAR-- 187

Query: 215 AKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVW 274
                    G G++  E         N   P    +  ++K      I W + G+LV VW
Sbjct: 188 ---------GKGHERPEKGQGEIEEDNLKAPLLEAQVNRNK----SKIPWTKLGVLVIVW 234

Query: 275 A---IILALQIAKN-----YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIA-- 324
           A   +I  L+  K+         C V YW+L  LQIP+A  +    A++  + R++ +  
Sbjct: 235 ASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQIPLAL-IFTKLALSRTESRQEQSPN 293

Query: 325 -SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
             K  EGT+   S  + + A    AG++GG+ G+GGG ++ PL L+ GIPPQ+++AT  F
Sbjct: 294 NQKNQEGTRMDQSMRLKFPAMSFLAGLLGGIFGIGGGMLISPLLLQSGIPPQITAATTSF 353

Query: 384 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 443
            + FS++MS V+Y LL       A  F  +  +A+ +G  +++K +   GRASII+F++ 
Sbjct: 354 MVFFSATMSAVQYLLLGMQNTDTAYVFSFICFLASLLGLVLVQKAVAQFGRASIIVFSVG 413

Query: 444 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            ++ +S + +   G   +       + MGF   C
Sbjct: 414 TVMSLSTVLMTSFGALDVWTDYMAGKDMGFKLPC 447


>gi|301122683|ref|XP_002909068.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099830|gb|EEY57882.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 483

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 108/388 (27%), Positives = 185/388 (47%), Gaps = 44/388 (11%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
            I VP + LI+GFD K +  +S   + GGAV+   +N+R+RHP ++ P+ID +LAL   P
Sbjct: 84  VIMVPAMVLIMGFDIKRATPISNVAILGGAVANAWFNMRKRHPNVNRPLIDPELALGMIP 143

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
           +++ G  +G   N +   +++++L +V+ LV  +R   KG+   KKE   +REA      
Sbjct: 144 VVIGGTVLGALINKLIPSYVLSLLFVVVLLVGGSRTMKKGIRLHKKEVAKRREAEAATSE 203

Query: 221 IVLGNGYQTEECKCDPEYLSNDTTPEETREPKK-------------------------SK 255
           +       T +    P      +TP+ T   +K                         S 
Sbjct: 204 V-------TADIPVSPGAYVQVSTPQITGNDEKRLSLSVTGGDAAPVKSAAGDHAGSDSL 256

Query: 256 VSIIE---NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYE 312
           V I+E   +  W     ++  +  ++A  I         V YWV+ L++IP         
Sbjct: 257 VQILEKERHFAWGPHVAIMVCYLGVVAASIGDASVDCGGVAYWVILLIEIPWVAVFVVLT 316

Query: 313 AVALYKGR-RKIASK---GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
           +  L+K   RK A+     D   KW    +V++      AG+V G+ G+GGG I GP+ +
Sbjct: 317 SHYLHKVYLRKTAANYQYVDGDIKWTKKMVVYFPLGCAVAGIVAGMFGVGGGIITGPIMI 376

Query: 369 ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           ELGI P+V+S+T    I +SS+ +  ++ + K     +AL+  A++ +     Q V+   
Sbjct: 377 ELGIVPEVASSTTALMILYSSAAATAKFAVFKMVAWDWALFLCAVAFVVTSASQVVILGF 436

Query: 429 IKILGRASIIIFTLSFIIFVSALSLGGV 456
           ++  GR SII+  ++     +A+ +GGV
Sbjct: 437 VRRTGRQSIIVLCIA-----TAVLIGGV 459


>gi|42561921|ref|NP_172621.2| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|332190628|gb|AEE28749.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 367

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 118/386 (30%), Positives = 192/386 (49%), Gaps = 34/386 (8%)

Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLG 165
           + +I G + K++++ S  MVTG + + V  NL  R+P + D  +ID+DLAL  QP L+LG
Sbjct: 1   MTIIAGLEMKTASSFSAFMVTGVSFANVGCNLFLRNPKSRDKTLIDFDLALTIQPCLLLG 60

Query: 166 ISIGVAFNVIFADWMITVLLIVLFLVLST-RAFLKGVETWKKETITKREAAKQLELIVLG 224
           +SIGV  N +F +W++ + L  +FL  ST +   KGV  W  E  ++R   K        
Sbjct: 61  VSIGVICNRMFPNWLV-LFLFAVFLAWSTMKTCKKGVSYWNLE--SERAKIKSPR----- 112

Query: 225 NGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK 284
                     D   ++      E RE  + +  I     W + G+LV +W +  ++ + +
Sbjct: 113 --------DVDGIEVARSPLLSEEREDVRQRGMI--RFPWMKLGVLVIIWLLFFSINLFR 162

Query: 285 NYT--------TTCSVLYWVLNLLQIPVAGGVS-----AYEAVALYKGRRKIASKGDEGT 331
                        C  LYW L+ LQIP+    +     +    + +       S+ + G 
Sbjct: 163 GNKYGQGIISIKPCGALYWFLSSLQIPLTIFFTLCIYFSDNVQSNHTSHSNQNSEQETGV 222

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
             R ++L+      + AG++GGL G+GGG ++ PL L++GI P+V++AT  F + FSSSM
Sbjct: 223 GGRQNKLMLP-VMALLAGVLGGLFGIGGGMLISPLLLQIGIAPEVTAATCSFMVLFSSSM 281

Query: 392 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
           S ++Y LL       A  F  +  +A+ VG  V+KK+I   GRASII+F +  ++ +S +
Sbjct: 282 SAIQYLLLGMEHAGTAAIFALVCFVASLVGLMVVKKVIAKYGRASIIVFAVGIVMALSTV 341

Query: 452 SLGGVGLAKMIKRIEHKEYMGFDNIC 477
            +   G   +        YMGF   C
Sbjct: 342 LMTTHGAFNVWNDFVSGRYMGFKLPC 367


>gi|320170699|gb|EFW47598.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 642

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 123/447 (27%), Positives = 213/447 (47%), Gaps = 73/447 (16%)

Query: 86  FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL 145
           FFG++   +GGVGGGG+FVP+L +I  F A  +  +S  M+T  A+ ++++ +R + P  
Sbjct: 199 FFGSSLAVIGGVGGGGLFVPLLMIITHFKADQAVPISSTMITAAAIMSLLFEIRAKRPN- 257

Query: 146 DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK 205
             PVIDYD++ L QP+ + G ++GV  NV+   WMI + L+V+ +  +T+   KG+  +K
Sbjct: 258 GRPVIDYDVSFLLQPVCLAGTTLGVFLNVLLPGWMIILALLVILVYTTTKTMKKGITMYK 317

Query: 206 KETITKREAAK----------QLELIVLGNGYQTEECKCDPEYL--SNDTT--------- 244
           KE+  +R  A                 L    +      D E L  S DT+         
Sbjct: 318 KESQQRRALANGGTSANVAASAPAAAPLQKLNKRHHKHQDKESLVASTDTSAEQLSVNMD 377

Query: 245 -------------------PEETREPKKSKV----SIIENIYWKEFGLL--VAVWAIILA 279
                              P E   P K +V     + + + +    +L  +A+W I+LA
Sbjct: 378 DSDLSDSEDAGRIQLSQPVPSEAELPSKDQVLYQRQLDQELRFPTTQILGMIAMWLIVLA 437

Query: 280 LQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRR---KIASKGDEGTKWRAS 336
               K + + CS  +W++  L +P+A        V L+ GRR       K   G ++  +
Sbjct: 438 CSTIKRFVSKCSAEFWIVAFLPLPIA------ILVTLWYGRRLRDAFELKQRCGHQFEPT 491

Query: 337 QLVF-------YCACGITAGMVGGLLGLGGGFILGPLFLELGI---PPQVSSATALFAIT 386
             VF       Y       G+ GG++G+GG  ++GPL L + +    P V++A +   + 
Sbjct: 492 DFVFNRRNTIVYPLLSFGGGLAGGMVGVGGAMVIGPLLLNMAVQTPDPSVTTAISNLLVV 551

Query: 387 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 446
           F+++ +V+++ +L      YAL+  A +++A+ + + VLK      GR S ++F L    
Sbjct: 552 FTAASTVIQFVILNTLVYDYALFLSAFTLMASVLSKKVLKPWFDNKGRKSFVVFAL---- 607

Query: 447 FVSALSLGGVGLAKMIKRIEHKEYMGF 473
            V ++SL G+  A  I+ I    ++GF
Sbjct: 608 -VLSISLSGILTA--IEGILKVAHVGF 631


>gi|301117154|ref|XP_002906305.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107654|gb|EEY65706.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 543

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 132/526 (25%), Positives = 236/526 (44%), Gaps = 93/526 (17%)

Query: 41  SKNKND----HVEPNIVIKVSAPKSRSGYKH------VWPD---IKFGWRIVVGTIIGFF 87
           ++N ND     VE ++V + +      G KH      V  D          + G+I  F 
Sbjct: 23  AQNVNDSPVLDVESDVVCEAATDCLAQGLKHSTCRAGVCVDNALFPLSHEEIAGSIGAFV 82

Query: 88  GTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-D 146
            T   +  G+GGGG+ VP+  +++   +  +  +SK  + GGA+++ + N+R+RHP +  
Sbjct: 83  ATILAAGSGLGGGGLLVPLYIMLMSMSSHEAVPLSKTTIFGGAIASFLLNVRKRHPLVRS 142

Query: 147 IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
            P+IDY+  LL +P+ + G  IGV  N +F +W+IT+ ++ L    + R + KG + WK+
Sbjct: 143 RPLIDYETMLLMEPMTLAGTIIGVNMNAVFPEWLITICIVWLLTKTALRTYSKGKKIWKE 202

Query: 207 ETIT-------------------------------------KREAAKQLELIVLGNGYQT 229
           E                                        K    ++L L +L +   +
Sbjct: 203 EVDADNKIIMDIVAYWRLLPYESNFKQFQVVARAYLKWKAYKSPEKEELRLKILADEASS 262

Query: 230 EECKCDPEYLSNDTTPEETREPKKSKVSII----------------------ENIYWKEF 267
            E +     ++  +T EET   +    +++                        +   + 
Sbjct: 263 VEERKSSSNITEASTEEETSSDENESENLMSWGLQDKRPVKFLSVEEIAKTRRTVPMADM 322

Query: 268 GLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVALYKG 319
           G+L   W  ++   +AK    T SV+        YW+L ++  P    V+ Y  + + + 
Sbjct: 323 GVLFLTWIGLVLFSMAKGGHGTPSVIGLSCGSIGYWLLVIVSFPFFMSVTIYFGMKISRF 382

Query: 320 RRKIAS------KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 373
              + +      KGD    W    +V Y A    AG+  GLLG+GGG + GPL +E+G+ 
Sbjct: 383 HTMLQASDYTYAKGD--MIWTKFAVVKYPALCTAAGVAAGLLGIGGGMVKGPLLIEMGLL 440

Query: 374 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 433
           PQVSSAT+   I F+SS + +++ +L    V +AL+  A+  +A  +GQ  +  L K   
Sbjct: 441 PQVSSATSSSMILFTSSATTIQFIILGTLSVNHALWHGAVGFVAGLIGQLGMSYLFKKYR 500

Query: 434 RASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 479
           +++++IF ++  I VS + +G +G      RI    + GF ++C +
Sbjct: 501 KSALVIFLVAVFIGVSGIVMGVLGAV----RISEIGFGGFRSLCLS 542


>gi|428174477|gb|EKX43372.1| hypothetical protein GUITHDRAFT_95350 [Guillardia theta CCMP2712]
          Length = 572

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/383 (27%), Positives = 198/383 (51%), Gaps = 21/383 (5%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           I VPM   I  F       +SK  + GGAV+   YN+++RHP  + P++DY+  ++ +P+
Sbjct: 180 ILVPMYLSIGKFSPHYGIPLSKATIFGGAVTNNYYNVQRRHPYANRPLVDYNTCMMLEPV 239

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
           L+LG  IGV FN +   W+IT+LL++     + R  +K +ET+ KE    +E   +    
Sbjct: 240 LLLGTIIGVFFNAVSPGWLITILLVLSLTYTTYRTSVKALETYNKEEKAVKEEETKH--- 296

Query: 222 VLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQ 281
           +LG+    E+    P ++ +   PE+ RE  +++  ++  I W    +++AV +I+   +
Sbjct: 297 LLGSKAGPEQ---HPSFMLDANIPEDLREIYEAESRVL-TISW----IIIAVCSILKGGE 348

Query: 282 IAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL---YKGRRKIASKGDEG----TKWR 334
             +      S+ YW+L    +P+  G+  Y    L   Y+ + ++  +  EG    T+  
Sbjct: 349 GGQGIVACGSLGYWLLVAAPLPMVLGLVMYCGDILVRGYEDKLRLGYEFAEGDIHWTRKN 408

Query: 335 ASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVV 394
           AS    YC   I+AG   G LG+  G ILGP+ LELG+ P V +A++ F + F++S +  
Sbjct: 409 ASVYPLYC---ISAGFAAGALGIAAGTILGPILLELGMLPLVGTASSGFMVIFTASSTTF 465

Query: 395 EYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLG 454
           ++ ++ +  + YAL+F  + ++   +G  V+   +K   +   ++  LS ++  S + +G
Sbjct: 466 QFLIMGQLQIDYALFFCGIGLLGGAIGNTVVSFFVKKYKKTWFVVAILSAVLAASTVLMG 525

Query: 455 GVGLAKMIKRIEHKEYMGFDNIC 477
             G  +     +H + MG   +C
Sbjct: 526 YAGFERAELSYDHGKNMGIRRLC 548


>gi|116783346|gb|ABK22903.1| unknown [Picea sitchensis]
          Length = 95

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 62/93 (66%), Positives = 77/93 (82%)

Query: 385 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 444
           + FSSSMSVVEYY LKRFPVPYA YFF + IIAAF GQHV++KL+ +LGRASIIIF L+F
Sbjct: 1   MLFSSSMSVVEYYFLKRFPVPYAAYFFGVCIIAAFTGQHVIRKLVLLLGRASIIIFCLAF 60

Query: 445 IIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +IF+SA  +GGVG++KM+  I+   YMGF N+C
Sbjct: 61  MIFISAWIMGGVGISKMVHEIKDGAYMGFQNLC 93


>gi|301122685|ref|XP_002909069.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099831|gb|EEY57883.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 470

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 178/375 (47%), Gaps = 28/375 (7%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           I VP + LI+GFD K +  VS   + GGA++   +N+R+RHP +D P+ID DL+    PL
Sbjct: 75  ILVPAMVLIMGFDIKRATPVSNLAIVGGAIANAWFNIRKRHPAVDRPLIDADLSFSMIPL 134

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE-----TITKREAAK 216
           ++ G  +G     +   +++++L +V+ ++  TR   KG++ ++ E       T  E   
Sbjct: 135 VMGGAVVGTVLAKLLPSYLLSLLFVVVLVLGGTRTVSKGIKMYRAEMKSCKVQTTEEQQA 194

Query: 217 QLELIVLGNGYQTEECKCDPE-------------YLSNDTTPEETREPKKSKVSIIENIY 263
                V      TE+   D                LSN+       E     V    +  
Sbjct: 195 AAYAAVCSPSSCTEDKFADDGGDSTSHSLLKGTGSLSNECGGSTDEEVLTEIVERERHFS 254

Query: 264 WKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGR-RK 322
             + G ++  +  I+A  I     +   + YW+L ++++P   G     AV LY+   RK
Sbjct: 255 LTKHGAIMLCYMGIVAASIGGAAVSCGGITYWLLLIIELPWIAGFGVCTAVYLYRQHCRK 314

Query: 323 IA-----SKGDEGTKWRASQLV-FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQV 376
           ++     + GD    W    +V F  AC   AG++ GL G+GGG + GPL +E+GI P+V
Sbjct: 315 VSVNYEFAAGD--IHWTKKTVVRFPLACA-GAGLIAGLFGVGGGIVTGPLMIEMGIVPEV 371

Query: 377 SSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRAS 436
           +SAT    + +SS+ +  ++ +       +AL   A++ +   V Q ++   ++  GR S
Sbjct: 372 ASATTALMVLYSSAAATAKFAVFNMTAWDWALLLSAVAFVVTAVSQVIILGFVRRTGRQS 431

Query: 437 IIIFTLSFIIFVSAL 451
           +I+  +   I + A+
Sbjct: 432 VIVLCIGATICIGAV 446


>gi|224014684|ref|XP_002297004.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968384|gb|EED86732.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 522

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 117/461 (25%), Positives = 194/461 (42%), Gaps = 97/461 (21%)

Query: 80  VGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR 139
           VG I+ F      +  GVGGGGI+VP+  L++ F  KSS+ +S+  + G ++  ++ N+R
Sbjct: 11  VGIIVWFLTAGIATACGVGGGGIYVPLGILLLRFPPKSSSGLSQASIFGASLGGLIVNIR 70

Query: 140 QRHPTLDI-----------------------------------------------PVIDY 152
            RHP L +                                               PVIDY
Sbjct: 71  NRHPDLYMRDTRGKPLQEGSGKIVSYEKDKGPAAIEEDRKAYLAGGDGKQKFYTRPVIDY 130

Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE----- 207
           D+AL   P+ + G  +GV    +F DW+      V+    S + + K    +KK+     
Sbjct: 131 DMALFLAPMEMAGAVLGVIIQRLFPDWLFLSFAAVVLGFTSYKTYTKFFSAFKKDKEKRE 190

Query: 208 ---TITKREAAKQLELIVLG------------------NGYQTEECKCDPEYLSNDTTPE 246
               + + E+ + +   V+                   N Y+        +  + D   +
Sbjct: 191 KDRQLVQAESERNVSTSVVSDEDNKDEDATNGVRDANTNAYEGNAANTSTKSKNGDEEEQ 250

Query: 247 ETREPK---KSKVSIIENI--YWKE-FGLLVAVW---AIILALQIAKNYTT----TC-SV 292
           E  +PK   K ++ + ++   Y KE    L+ +W    +I  L+  K   +    TC   
Sbjct: 251 EQDDPKELEKRRLFLQDDSRQYPKEKIAFLILLWIGLTVITFLKGGKGIDSVIGITCEDP 310

Query: 293 LYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASK-------GDEGTKWRASQLVFYCACG 345
            Y VL   Q     G   + AV  +K  ++   +        +    W   +L FY    
Sbjct: 311 AYIVLVAAQFLWTFG---FAAVFGWKNTKRTQERLAVNYPFQEHDVLWNFKKLQFYSFFT 367

Query: 346 ITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVP 405
             AG+V GL+G+GGG +LGPL L +GI P VS+AT    I  +SS   V + +    P  
Sbjct: 368 FVAGIVAGLIGIGGGMVLGPLMLVMGIHPSVSTATTASMILLTSSSVAVMFVMSGLVPWQ 427

Query: 406 YALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 446
           YALYFF + ++ A++G+  +   ++  G  S+++ TL+ II
Sbjct: 428 YALYFFCVCLVGAYIGKSRIDSYVRKTGMTSVLVGTLATII 468


>gi|167393930|ref|XP_001733526.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165894999|gb|EDR22813.1| hypothetical protein EDI_269710 [Entamoeba dispar SAW760]
          Length = 404

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 102/367 (27%), Positives = 175/367 (47%), Gaps = 47/367 (12%)

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
           F W+++VG+I   F     +  G+GGG  ++ +  LI+  D   +  +SK    G A   
Sbjct: 7   FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66

Query: 134 VVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
            +    +RHP +   P+I Y  AL+ +PL + G  IGV FN+I   W+I ++L++L    
Sbjct: 67  FLILWMKRHPIVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVILLGFT 126

Query: 193 STRAFLKGVETWKKETITKREAAKQLELIVL------------------GNG-------- 226
           S + F K ++ WK E   KR+A K  EL+                    GN         
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDATKATELVETSKPDITDNDNDDMKPSENGNNAVIVDERI 185

Query: 227 -YQTEECKCDPEYLSNDTTPEETREPK----KS--KVSIIENIYWKEFGLLVAVWAIILA 279
             + +E    P+ L  D + E  ++ K    K+  K  II+ +     G+L+ VWA++  
Sbjct: 186 QEEDDEQGTGPKLLPQDESQEAQQQAKIVEEKTLLKREIIKAVL--SVGILIIVWAVMFF 243

Query: 280 LQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT 331
           + I K      S++        YW+L ++  P+   V+    + L+  +R    +G+   
Sbjct: 244 IVILKGGEKMDSIVGIECGTPWYWILTVIGGPLMLTVTIIVGIFLWWRQRGEEVEGE--V 301

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
           +W     +        AG+    LG+GGG ++GP+ LE+G+ PQV++AT+ F I F++S 
Sbjct: 302 QWTVKNCLIIPIGAFFAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTASS 361

Query: 392 SVVEYYL 398
           S ++Y +
Sbjct: 362 SSLQYII 368


>gi|123437805|ref|XP_001309695.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121891432|gb|EAX96765.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 448

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 116/413 (28%), Positives = 193/413 (46%), Gaps = 47/413 (11%)

Query: 73  KFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVS 132
           K+  R+++G I  F      S  G+GGGG+FVP++ L+V F    +   SK ++ GG+++
Sbjct: 75  KWDTRLIIGIICVFIAGIFVSGAGIGGGGLFVPIMMLLVNFPTSYAIPTSKAIIFGGSLA 134

Query: 133 AVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
             ++NL +RHP  + P+I+Y++A + +P+  LG  IGV FN I  +W++  +  VL    
Sbjct: 135 VTLFNLNKRHPYYERPLINYNVAAMIEPISWLGTVIGVIFNSIIPEWLLYSVQFVLLTYT 194

Query: 193 STRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPK 252
           +   F KG+   K +   K   +   EL+V G  Y        P Y              
Sbjct: 195 AWNTFKKGL---KDQRNAKLGISPNNELLVKGT-YD------GPTYSIGLLW-------- 236

Query: 253 KSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSV--------LYWVLNLLQIPV 304
                           LL+ ++ + LA+   +      S+        +YW L     P+
Sbjct: 237 ----------------LLLIIYVVFLAISFLRGGDGADSIIGIKFCSPIYWALTFGPFPI 280

Query: 305 AGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 364
             G++A+  V + K R  +    +E TK     +      G  AGM  G LG+GGG I G
Sbjct: 281 YLGITAW-MVHIAK-RYPVLGHKNELTK---KDIFLLMMSGFVAGMAAGFLGIGGGMIKG 335

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           P+ L L I  +  +AT+ F I  +SS + ++Y      P      F ++  ++  +G   
Sbjct: 336 PMMLALEIEAEEMAATSSFMILMTSSATSIQYIAEGLMPWLEFGVFTSMGFVSFLIGVIF 395

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           L+ L+K LG  SI ++ L+ II +SA+ +  VG+  +I  ++    MGF   C
Sbjct: 396 LRWLVKKLGNRSIFLYFLAAIIMISAILMSVVGIEIIILEVKEHASMGFRPFC 448


>gi|242043762|ref|XP_002459752.1| hypothetical protein SORBIDRAFT_02g009840 [Sorghum bicolor]
 gi|241923129|gb|EER96273.1| hypothetical protein SORBIDRAFT_02g009840 [Sorghum bicolor]
          Length = 432

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 118/397 (29%), Positives = 197/397 (49%), Gaps = 49/397 (12%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQR---------HPTLDIPVIDY 152
           ++VP+L+++ G   K++ A+S  MVTGG +S V+Y L  R               P+IDY
Sbjct: 64  LYVPILSIVAGLSLKTATALSTFMVTGGTLSNVLYTLFLRGGGSGSGQGQGQGQQPLIDY 123

Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLST-RAFLKGVETWKKETITK 211
           D+A++ QP L+LG+S+GV  NV+F +W+IT L   LFL  +T + +  GV  W+ E    
Sbjct: 124 DIAVVSQPCLLLGVSVGVVCNVVFPEWLITALF-SLFLAFATFKTYGAGVRRWRAE---- 178

Query: 212 REAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLV 271
                            T E    P+    +T   E     ++ VS      W +  +LV
Sbjct: 179 -----------------TAELGRIPDAAGAETAAAEEALLGRN-VSGGHRCQWVDLAVLV 220

Query: 272 AVWAIILALQI------AKNY--TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKI 323
            VW     + +      AK       C  +YW++ + Q+PVA   +     A    +RK 
Sbjct: 221 TVWLCFFVMHLFIGGEGAKGVFDIEPCGTVYWLITVAQVPVAVAFT-----ACIGQKRKS 275

Query: 324 ASKGD---EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 380
            + G       K  A     +    +  G++ GL G+GGG +L P+ L++G+PP  +SAT
Sbjct: 276 QAHGQVISAKRKLDALPAYVFPVAALLTGVMSGLFGIGGGLLLNPVLLQIGVPPTTASAT 335

Query: 381 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF 440
            +F + F +SMS+V++ +L    +  A+ + A   +A+ VG   ++  ++  GRAS+I+F
Sbjct: 336 TMFMVLFCASMSMVQFIILGVDGIASAVLYAATCFVASIVGLVGIQGAVRRSGRASLIVF 395

Query: 441 TLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            ++ ++ VSAL +   G A++ +     +YMGF   C
Sbjct: 396 MVAGVLAVSALVIACSGAARVWEEYMSGQYMGFKMPC 432


>gi|357483383|ref|XP_003611978.1| Membrane protein-like protein [Medicago truncatula]
 gi|355513313|gb|AES94936.1| Membrane protein-like protein [Medicago truncatula]
          Length = 109

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 67/106 (63%), Positives = 88/106 (83%)

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           +  +V+SAT+ F++ FSSSMSVV+YY L RFPVPYA YF  ++ IAAF GQHV++K+I I
Sbjct: 1   MKEKVASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAGQHVVRKIIAI 60

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           LGRASIIIF L+  IF+SA+SLGGVG+ KMI ++E+ EYMGF+N+C
Sbjct: 61  LGRASIIIFILASTIFISAISLGGVGIEKMIVKMENHEYMGFENLC 106


>gi|449519050|ref|XP_004166548.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101231924 [Cucumis sativus]
          Length = 455

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 123/415 (29%), Positives = 219/415 (52%), Gaps = 37/415 (8%)

Query: 78  IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
           IVV  ++ F   +  S GG+GGGG+++P+L ++ G D K++++++ CMVTGG+++ V+ N
Sbjct: 63  IVVAGVLCFIAASLSSAGGIGGGGLYLPILTIVAGVDLKTASSLTACMVTGGSIANVLSN 122

Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
              +  +    +I++D+ALL +P ++LG+SIGV  N+ F +W +T +L  +FL  ST   
Sbjct: 123 FFSK--SGGKCLINFDIALLTEPCMLLGVSIGVICNLSFPEW-VTTILFAIFLAWSTLKT 179

Query: 198 LK-GVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKV 256
            K G+  W++E+        +LE     +G Q E    + + +     P  T+E  +S+ 
Sbjct: 180 CKSGMVYWERESEGLMNNGCKLE-----DGLQNEN---EAKLVEEPLLP--TQENCRSRF 229

Query: 257 SIIENIYWKEFGLLVAVWAIILALQIAKNYTT--------TCSVLYWVLNLLQIPVAGGV 308
             +      + G LV VW     + + +            TC   YW+L+ +Q+PVA   
Sbjct: 230 PSM------KLGXLVLVWFCFYLIYLLRGDQDGRGLLPIETCGTGYWILSSVQVPVA--- 280

Query: 309 SAYEAVALYKGRRKIASKGDEGTKWRA------SQLVFYCACGITAGMVGGLLGLGGGFI 362
            A+    LYK +   +    +     A      S+ + +      AG++GG+ G+GGG +
Sbjct: 281 IAFTLWILYKQKSPQSEDSSQKELEEARPVVEPSKKLIFPVMAFLAGILGGMFGIGGGML 340

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           + P  L++GI P+ ++AT  F + FS++MS  +Y LL       A+ F  +  IA+ +G 
Sbjct: 341 ISPFLLQVGIVPEKTAATCSFMVFFSATMSAGQYLLLGMEHAEIAIIFAIICFIASVLGL 400

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            V++K I+  GRAS+IIF++S ++ +S + +   G   + +       MGF + C
Sbjct: 401 VVVQKAIRDHGRASVIIFSVSIVMALSTVLMTTFGALNVFRDYIAGNPMGFKSPC 455


>gi|449454696|ref|XP_004145090.1| PREDICTED: uncharacterized protein LOC101208650 [Cucumis sativus]
 gi|449470720|ref|XP_004153064.1| PREDICTED: uncharacterized protein LOC101220005 [Cucumis sativus]
          Length = 455

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 123/415 (29%), Positives = 219/415 (52%), Gaps = 37/415 (8%)

Query: 78  IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
           IVV  ++ F   +  S GG+GGGG+++P+L ++ G D K++++++ CMVTGG+++ V+ N
Sbjct: 63  IVVAGVLCFIAASLSSAGGIGGGGLYLPILTIVAGVDLKTASSLTACMVTGGSIANVLSN 122

Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
              +  +    +I++D+ALL +P ++LG+SIGV  N+ F +W +T +L  +FL  ST   
Sbjct: 123 FFSK--SGGKCLINFDIALLTEPCMLLGVSIGVICNLSFPEW-VTTILFAIFLAWSTLKT 179

Query: 198 LK-GVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKV 256
            K G+  W++E+        +LE     +G Q E    + + +     P  T+E  +S+ 
Sbjct: 180 CKSGMVYWERESEGLMNNGCKLE-----DGLQNEN---EAKLVEEPLLP--TQENCRSRF 229

Query: 257 SIIENIYWKEFGLLVAVWAIILALQIAKNYTT--------TCSVLYWVLNLLQIPVAGGV 308
             +      + G LV VW     + + +            TC   YW+L+ +Q+PVA   
Sbjct: 230 PSM------KLGALVLVWFCFYLIYLLRGDQDGRGLLPIETCGTGYWILSSVQVPVA--- 280

Query: 309 SAYEAVALYKGRRKIASKGDEGTKWRA------SQLVFYCACGITAGMVGGLLGLGGGFI 362
            A+    LYK +   +    +     A      S+ + +      AG++GG+ G+GGG +
Sbjct: 281 IAFTLWILYKQKSPQSEDSSQKELEEARPVVEPSKKLIFPVMAFLAGILGGMFGIGGGML 340

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           + P  L++GI P+ ++AT  F + FS++MS  +Y LL       A+ F  +  IA+ +G 
Sbjct: 341 ISPFLLQVGIVPEKTAATCSFMVFFSATMSAGQYLLLGMEHAEIAIIFAIICFIASVLGL 400

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            V++K I+  GRAS+IIF++S ++ +S + +   G   + +       MGF + C
Sbjct: 401 VVVQKAIRDHGRASVIIFSVSIVMALSTVLMTTFGALNVFRDYIAGNPMGFKSPC 455


>gi|348676056|gb|EGZ15874.1| hypothetical protein PHYSODRAFT_316057 [Phytophthora sojae]
          Length = 437

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 167/364 (45%), Gaps = 55/364 (15%)

Query: 100 GGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQ 159
           G I VP + LI+GFD K +  +S   + GGA++   +N+++RHP+ D P+ID DLAL   
Sbjct: 77  GVIMVPAMVLIMGFDIKRATPISNVGILGGALANAWFNMQKRHPSADRPLIDADLALGMI 136

Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
           P+L                +++++L +V+     TR  +KG++ ++ E+  K +A     
Sbjct: 137 PVL-------------LPSYIVSLLFVVVLAASGTRMMIKGIQLYRAESTKKAQAD---- 179

Query: 220 LIVLGNGYQTEECKCDPEYLSNDTTPEETREPK--------------KSKVSIIE---NI 262
                +   T +    P+  +   TP  + +                K    I+E   + 
Sbjct: 180 ----ADSKDTADAAMSPDAYAQAFTPNPSIDSDASAAKSASASAQAVKVLAEILEQERHF 235

Query: 263 YWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK 322
            W++ G ++  +  ++A  I     +   V  WV+ L +IP          VA       
Sbjct: 236 AWRKHGAILVCYLGVVATSIGDASVSCGGVADWVILLAEIP---------WVARKASVGY 286

Query: 323 IASKGDEGTKWRASQLVFY---CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 379
           +  +GD   +W    ++ +   CA G   G+V G+ G+GGG I GP+ +E+GI P+V+S+
Sbjct: 287 LYIEGD--IRWTQKAVICFPLGCALG---GIVAGMFGVGGGIITGPIRIEMGIVPEVASS 341

Query: 380 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 439
           T    I +SS+ +  +Y +       +A    A++       Q V+   ++  GR SI++
Sbjct: 342 TMALMILYSSAAATAKYTVFNMIAWDWAALLCAVTFAVTSAAQVVILAYVRRSGRQSIVV 401

Query: 440 FTLS 443
             +S
Sbjct: 402 LCIS 405


>gi|358344379|ref|XP_003636267.1| Membrane protein-like protein [Medicago truncatula]
 gi|355502202|gb|AES83405.1| Membrane protein-like protein [Medicago truncatula]
          Length = 110

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 67/102 (65%), Positives = 86/102 (84%)

Query: 376 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 435
           V+SAT+ F++ FSSSMSVV+YY L RFPVPYA YF  ++ IAAF GQHV++K+I ILGRA
Sbjct: 6   VASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAGQHVVRKIIAILGRA 65

Query: 436 SIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           SIIIF L+  IF+SA+SLGGVG+ KMI ++E+ EYMGF+N+C
Sbjct: 66  SIIIFILASTIFISAISLGGVGIEKMIVKMENHEYMGFENLC 107


>gi|2911082|emb|CAA17544.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268922|emb|CAB79125.1| hypothetical protein [Arabidopsis thaliana]
          Length = 431

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 131/418 (31%), Positives = 208/418 (49%), Gaps = 48/418 (11%)

Query: 71  DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
           ++K    I++  ++ F      S GG+GGGG+F+P++ ++ G D K++++ S  MVTGG+
Sbjct: 51  ELKLSSAIIMAGVLCFLAALISSAGGIGGGGLFIPIMTIVAGVDLKTASSFSAFMVTGGS 110

Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
           ++ V+ NL           +DYDLALL +P ++LG+SIGV  N +  +W+ITVL  V   
Sbjct: 111 IANVISNLFGGKAL-----LDYDLALLLEPCMLLGVSIGVICNRVLPEWLITVLFAVFLA 165

Query: 191 VLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETRE 250
               +    GV+ WK E+   RE+         G G   EE K       N   P    +
Sbjct: 166 WSILKTCRSGVKFWKLESEIARESGHGRP--ERGQGQIEEETK-------NLKAPLLEAQ 216

Query: 251 PKKSKVSIIENIYWKEFGLLVAVWA---IILALQIAKN-----YTTTCSVLYWVLNLLQI 302
             K+K      I W + G+LV VWA   +I  L+  K+         C V YW+L  LQI
Sbjct: 217 ATKNK----SKIPWTKLGVLVIVWASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQI 272

Query: 303 PVAGGVSAYEAVALYKGRRKIA---SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           P+A  +    A++  + R++ +    K  EGT+   S  + + A    AG++GG+ G+GG
Sbjct: 273 PLAL-IFTKLALSRTESRQEQSPNDQKNQEGTRLDKSTRLKFPAMSFLAGLLGGIFGIGG 331

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           G ++ PL L+ GIPPQ+++AT  F + FS++MS V+Y LL                    
Sbjct: 332 GMLISPLLLQSGIPPQITAATTSFMVFFSATMSAVQYLLL------------------GM 373

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
                  K +   GRASII+F++  ++ +S + +   G   +       + MGF   C
Sbjct: 374 QNTDTAYKAVAQFGRASIIVFSVGTVMSLSTVLMTSFGALDVWTDYVAGKDMGFKLPC 431


>gi|223998204|ref|XP_002288775.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975883|gb|EED94211.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 385

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 117/398 (29%), Positives = 185/398 (46%), Gaps = 49/398 (12%)

Query: 84  IGFFGTACGSV----GGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR 139
           +GFF  A G V    GG+GGGGI VP+  LI+ F  K +  +S   V GGAV+  ++N+R
Sbjct: 3   LGFFFAALGLVLAAGGGIGGGGILVPVYILILDFLPKHAIPLSNVTVFGGAVANTIFNVR 62

Query: 140 QRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK 199
           +RHP  D P+ID+DL L+ +P  +LG  +G   N I ++ +I V+L+VL    +     K
Sbjct: 63  KRHPLADRPLIDWDLILVMEPSTLLGALVGANLNKILSETLIAVMLVVLLSFTAYGTLKK 122

Query: 200 GVETWKKETITKREAAKQLELIVLGNGYQTEECKCD--PEYLSNDTTPEETREPKK---- 253
             + + KET                   + E    D   E+L ND +  +  E +K    
Sbjct: 123 AGKMYDKET----------------EDLKNEWSYSDGLREHLVNDYSHMDDEEGRKGAND 166

Query: 254 --SKVSIIENIYWKEFGLLVA------VWAIILALQIAKN-------YTTTC-SVLYWVL 297
              + ++ E   ++EFG+  A      ++ ++LA+ I K            C S  +W+ 
Sbjct: 167 NNKEDTVTE---YEEFGMHEANSLDRLMFVVVLAINILKGGGGFASPVGIKCGSAAFWIS 223

Query: 298 NLLQIPVAGGVSAYEAVALYKGRRKIASKG----DEGTKWRASQLVFYCACGITAGMVGG 353
             L +    G+S      L K        G     E   W     + Y      AG   G
Sbjct: 224 QALLLVWIIGISLVARRQLIKDTALKMGAGFRYLKEDMIWDDKSTIIYPLFSTVAGFCAG 283

Query: 354 LLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFAL 413
           + G+GGG + GPL + +G+ P V+SAT+   I F+S  +   + +       YA+    L
Sbjct: 284 MFGIGGGIVKGPLMIMMGVHPAVASATSACMILFTSFTATTTFAVYGLLVHDYAIACVIL 343

Query: 414 SIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
             +A  VGQ +  +L+K   R S I F++ F++ +SAL
Sbjct: 344 GFVATAVGQTITTRLLKKSRRNSYIAFSIGFVVLLSAL 381


>gi|219112381|ref|XP_002177942.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410827|gb|EEC50756.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 644

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 115/430 (26%), Positives = 201/430 (46%), Gaps = 46/430 (10%)

Query: 84  IGFFGTACG----SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR 139
           IGFF    G    + GG+GGGGI VP+  L++GF  K +  +S   V GGAV+  + N R
Sbjct: 207 IGFFLATLGLMVAAGGGIGGGGILVPVYILVMGFTPKHAIPLSNVTVLGGAVANTILNAR 266

Query: 140 QRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK 199
           +RHP  D P++D+DL L+ +PL + G  +G   N +  + ++TVLL++L  V +  +  K
Sbjct: 267 KRHPLADRPLVDWDLILVMEPLTIAGALLGAFLNKVLPELLLTVLLVLLLSVTAYTSLTK 326

Query: 200 GVETWKKET--------ITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTP------ 245
            ++ + +E+        + + +  K+ EL V+      ++     E L  +         
Sbjct: 327 ALKLYARESRAMAAAQGLVRVDGTKESELTVMARLEDQDDHDEAAEVLLENMERDDDDDE 386

Query: 246 -------EETREPKKSKVSIIENIYWKE-------FGLLVAVWAIILALQIAKNYTTTCS 291
                  +    P  S  + ++ +  +E         +LV ++ ++L + + K      S
Sbjct: 387 SSSDDDMKSVELPASSLQAELDQLLEEECTTPMANISILVTMFIVVLTINVLKGGGAFPS 446

Query: 292 VL--------YWVLNLLQIPVAG----GVSAYEAVALYKGRRKIASKGDEGT-KWRASQL 338
            L        +W+ NL+ +   G    G+ AY  V  ++ +R+++    EG  +W A   
Sbjct: 447 PLGIRCGSRAFWIANLVMLAWIGIISVGIRAY-LVRRFEQKRRLSFPYVEGDIRWDARAT 505

Query: 339 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYL 398
           + Y      AG   G+ G+GGG + GPL L +G+ P VSSA++   I F+S  +   + +
Sbjct: 506 IVYPVVCCMAGFFAGMFGVGGGIVKGPLMLAMGVHPAVSSASSACMILFTSFTATTSFVV 565

Query: 399 LKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 458
                  YA    A+  +A F GQ  L  L++   R S I F++  ++ +SA  +    L
Sbjct: 566 FGLLVWDYAYVCMAIGFVATFAGQVGLSYLMRRAQRNSYIAFSIGAVVLLSAFLMTIQSL 625

Query: 459 AKMIKRIEHK 468
             M    +H 
Sbjct: 626 LSMAAGEKHH 635


>gi|414880189|tpg|DAA57320.1| TPA: hypothetical protein ZEAMMB73_804983 [Zea mays]
          Length = 407

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 91/291 (31%), Positives = 132/291 (45%), Gaps = 30/291 (10%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           +F+P+LNL+ G   K + A S  MVTGGA S V+YNL          +IDYD+ALLFQP 
Sbjct: 81  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
           L+LG+SIGV  NV+F +W+IT+L  +     + +    GV+ W+ E+             
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSESS------------ 188

Query: 222 VLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIY-WKEFGLLVAVWAIILAL 280
             G G      +    +      P  T +            + W +  LLV +W    AL
Sbjct: 189 --GCGSGARSSRGGHSHSKEPLLPRGTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFAL 246

Query: 281 Q--IAKNY------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTK 332
              I   +         C V YW++   Q+P A   + Y   A  K R   +   ++G+K
Sbjct: 247 HVLIGDKHGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSK 306

Query: 333 W-------RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQV 376
                    A  L          G + GL G+GGG +L P+ L++GIPPQV
Sbjct: 307 ADLADAGVEALPLPTLPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQV 357


>gi|224014498|ref|XP_002296911.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968291|gb|EED86639.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 516

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 114/451 (25%), Positives = 189/451 (41%), Gaps = 97/451 (21%)

Query: 33  SNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFG------WRIVVGTIIGF 86
           ++ K E     K  HVE N   +   P +   Y  +W D+          R  VGT + F
Sbjct: 66  ADYKHELCYDRKLFHVEGN---EEGDPDNH--YPFLWNDVSQSSSLSVIHRNSVGTFLWF 120

Query: 87  FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
              A     GVGGGGI+VP+  L++ F  KSS+ +S+  + G A+  ++ NLR RHP   
Sbjct: 121 LTAALSITCGVGGGGIYVPVGILLLRFPPKSSSGLSQASIFGAALGGLIVNLRNRHPYTF 180

Query: 147 I-----------------------------------------------PVIDYDLALLFQ 159
           +                                               PVIDYD+AL   
Sbjct: 181 VRDTKGTPLEDHPGKIVSYEKDKGPAAIEKDREAYLAGGDGKRKFYTRPVIDYDMALFLA 240

Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
           P+ + G  +GV    +F +W+      V+    S + + K   ++KK+ +  RE A +L 
Sbjct: 241 PMEMAGAVLGVIIQKLFPNWLFLSFATVILGFTSYKTYKKFFSSYKKDKLN-RETAMRLS 299

Query: 220 LI-----------VLGNGYQTEECKCDPEY--LSNDTTPEETREPKK--SKVSIIENI-- 262
           +              GN   + +   + E    ++    ++  +PK+   +   +EN   
Sbjct: 300 MAESMNVSASAADATGNEEPSNDADANAEENGTADTAIKDDLDDPKELEKRRMFLENDSR 359

Query: 263 -YWKE-FGLLVAVW---AIILALQIAKNYTT----TC-SVLYWVLNLLQIPVAGGVSAYE 312
            + KE    L+ +W   A+I  L+  K   +    TC    Y+VL   Q     G +A+ 
Sbjct: 360 QFPKEKVAYLILLWIGLAVITFLKGGKGVDSLIGITCEDSAYYVLVAAQFLWTMGFAAFF 419

Query: 313 AVALYKGRRKIASK-------GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 365
               +K  ++   +        ++   W   +L FY      AG+V GL+G+GGG +LGP
Sbjct: 420 G---WKNTKRTQERLAVNYPFQEQDVLWDLKKLQFYSFFTFVAGIVAGLIGIGGGMVLGP 476

Query: 366 LFLELGIPPQVSSA-TALFAITFSSSMSVVE 395
           L + +G+ P V++A TA   +  SSS++V+ 
Sbjct: 477 LMMVMGVHPSVATATTATMVVLTSSSVAVIR 507


>gi|348675995|gb|EGZ15813.1| hypothetical protein PHYSODRAFT_263156 [Phytophthora sojae]
          Length = 421

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 166/363 (45%), Gaps = 34/363 (9%)

Query: 104 VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLV 163
           VP L+ +  +DA S A     M+ G AVS+          T D P+ID +LAL   P+++
Sbjct: 38  VPSLSEMDTYDALSIA----FMIIGLAVSSA-------GATADRPLIDPELALGLIPVVI 86

Query: 164 LGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVL 223
            G  +G   N +   +++++L +V+      R  L+G+  +K+E   KR  +   E    
Sbjct: 87  GGTVLGAVINKLIPSYIVSLLFVVVLTFSDARMTLRGIRLFKQEVAQKRAESSANETKAD 146

Query: 224 GNGYQTEECKCD-PEYLSNDTTPEETREP--KKSKVSIIE----------------NIYW 264
                +   K   P+  S+D   EE R    + S  S+++                +  W
Sbjct: 147 DPESPSVYIKASTPQPSSDDIMAEEHRLSISRSSLKSVLDEDEDGAIRSQILGKERHFAW 206

Query: 265 KEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYK--GRRK 322
                 +  +  ++A  I         V+YW+L L+++P       + +  L+K   R++
Sbjct: 207 GSHSATLVCFLGVVATSIGDASVDCGGVVYWILLLIEVPWVVAFVFFTSHYLHKIYLRKE 266

Query: 323 IASKG--DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 380
             S    D   +W    +V++      AG+V G+ G+GGG I GP+ +ELGI P+V+S+T
Sbjct: 267 AVSYQYVDGDIQWTKKTVVYFPLGCAVAGIVAGMFGVGGGIITGPIMIELGIVPEVASST 326

Query: 381 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF 440
               I +SS+ +  ++ + K     +AL   A++ +     Q ++   ++  GR SIII 
Sbjct: 327 TALMILYSSAAATAKFAVFKMIAWDWALLLCAVAFVVTSASQVMILGFVRRTGRQSIIIL 386

Query: 441 TLS 443
            +S
Sbjct: 387 CIS 389


>gi|223992571|ref|XP_002285969.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977284|gb|EED95610.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 564

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 109/421 (25%), Positives = 175/421 (41%), Gaps = 72/421 (17%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
           G+ VP+  LI+GF  K +  +S   V GGAV+  + N+R+RHP  D P++D+DL L+ +P
Sbjct: 119 GVLVPIYILIMGFSPKHAIPLSNITVFGGAVANTILNVRKRHPLADRPLVDWDLILVMEP 178

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
           L + G  IG   N +  + ++ + L+ L    S     K +  +K E  +K   A++   
Sbjct: 179 LTIAGALIGAFLNKLLPEAVLVLSLVALLSFTSYTTLKKAIRMYKAE--SKALEAQRGVR 236

Query: 221 IVLGNGYQTEE-------------------CKC----------DPEYLSNDTTPEETREP 251
            V G+G    E                   C            D +   +D   EE  +P
Sbjct: 237 KVRGDGTVESELTRLAREMEEDEEDEEEEGCTVGLLDAAITTDDQDETESDANDEENGQP 296

Query: 252 -------------KKSKVSIIENIYWKE----------------FGLLVAVWAIILALQI 282
                         +S  S   ++  KE                  +L+  + +IL + I
Sbjct: 297 TSSNTDTTNDNNDDESTFSTATDLKNKEELSKILSEERTVPKGNVQVLLVTFTVILFINI 356

Query: 283 AKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEA---VALYKGRRKIASKGDEGT 331
            K      S L        +WV N + I     VS +     V  Y+ + ++     EG 
Sbjct: 357 MKGGGAFPSPLGIRCGSPSFWVSNGIMIGWILLVSVFARSYLVRRYEIKERVGFPYVEGD 416

Query: 332 -KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 390
            +W     V Y      AG   G+ G+GGG + GPL L +G+ P+VSSA++   I F+S 
Sbjct: 417 IRWDGRATVVYPLVCTAAGFFAGMFGVGGGIVKGPLMLAMGVHPKVSSASSACMILFTSF 476

Query: 391 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 450
            +   + +     + YA     L  +A  VGQ  L  L++   R S I F++  I+ +SA
Sbjct: 477 TATTSFVVFGLLDMDYATVCMTLGFVATLVGQIGLFYLMEKFQRNSYIAFSIGGIVLLSA 536

Query: 451 L 451
            
Sbjct: 537 F 537


>gi|146183328|ref|XP_001025893.2| hypothetical protein TTHERM_00713420 [Tetrahymena thermophila]
 gi|146143638|gb|EAS05648.2| hypothetical protein TTHERM_00713420 [Tetrahymena thermophila
           SB210]
          Length = 505

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 95/407 (23%), Positives = 181/407 (44%), Gaps = 66/407 (16%)

Query: 68  VWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVT 127
           V  D+ FG   ++  ++        +VGG+GGG + VP+L +++ F  K +  +S  ++ 
Sbjct: 34  VHQDLTFGVVEIISYVLISIIVGLANVGGLGGGIVKVPILVILLNFSVKEATFLSYPILL 93

Query: 128 GGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV 187
           GG +S  +  + QRHP  D P+ID+DL L+  P ++LG  +G+  NVI ++ ++T + ++
Sbjct: 94  GGVLSNAILLISQRHPRKDKPIIDFDLVLILVPTVLLGTVVGILMNVIISEIILTSVFML 153

Query: 188 LFLVLSTRAFLKGVETWKKETITKREAAKQLE---------------LIVLGNGYQTE-- 230
              ++    F+K  +  +K+   K E    ++                +V  N Y +E  
Sbjct: 154 FMCLVCVYLFMKARDIQQKQQEDKEEQDSSIQDNNEQNQNKLSKSTIQLVKQNSYLSEVD 213

Query: 231 -------ECKCDPEY-------LSNDTTPEETREP------KKSKVSIIENIYWKEF--- 267
                  +  C  E        +S D+  + + E       K+ + + IEN    EF   
Sbjct: 214 QKKIEQIDENCQKELGIQYYQVVSEDSQEQNSEEDSKGVSNKEKQKNKIENQLLAEFLEQ 273

Query: 268 ----------GLLVAVWAIILALQIAKNYT--------TTCSVLYWVLNLLQIPVAGGVS 309
                       LV ++ +   + I+K             C  LY++L  LQ+ ++  + 
Sbjct: 274 EKKMLPLDKLFYLVLIFLVFTFIGISKGGKGFQSIFGIQKCDNLYYLLTALQL-ISSIIF 332

Query: 310 AY----EAVALYKGRRKIASKGDEGTKWRASQLVFYCA-CGITAGMVGGLLGLGGGFILG 364
            +    +   L++ +  I  + D    + ++   F  +  G+ AG + G+LG+G G I+ 
Sbjct: 333 MFFIYLQQKRLHEYKISINYQFDREDFYFSNYNFFILSLSGLAAGSITGMLGMGSGLIIL 392

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFF 411
           P+ L LG   +V S+T+ F   F    S++  Y+L    + Y +  F
Sbjct: 393 PVLLSLGCHTRVCSSTSGFMYLFIGGTSII--YVLTEGILSYKMILF 437


>gi|15233581|ref|NP_193858.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|2911083|emb|CAA17545.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268923|emb|CAB79126.1| hypothetical protein [Arabidopsis thaliana]
 gi|332659033|gb|AEE84433.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 393

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 123/463 (26%), Positives = 201/463 (43%), Gaps = 106/463 (22%)

Query: 12  RLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWP- 70
           + L   FI +L V +++ + Q  +  E+T +        N++ KV   K R+  K     
Sbjct: 2   KTLFVLFILILCVFAINANQQEEINQESTQQTHQ-----NLLYKVQ--KWRTSLKDSSDA 54

Query: 71  DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
           ++K    +VV  ++ F      S  G+                D K++++ S  MVTGG+
Sbjct: 55  ELKLSPALVVAGVLCFTAALISSASGI----------------DLKAASSFSAFMVTGGS 98

Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
           ++    NL   H      +IDYDLALL +P ++LG+S+GV  N +F +W+IT L +V  +
Sbjct: 99  IA----NLINNHFGCK-KLIDYDLALLLEPCMLLGVSVGVICNKVFPEWLITGLFVVFLM 153

Query: 191 VLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETRE 250
             S      G  +WK   I + +                                E+ R+
Sbjct: 154 WSSMETCENGHTSWKLSLILREK--------------------------------EDMRD 181

Query: 251 PKKSKVSIIENI-YWKEFGLLVAVWAIILALQIAKNYTTT--------------CSVLYW 295
            + ++V     I ++K            L L+I K  T                CSV YW
Sbjct: 182 SRLAEVKRRRTIIFFKH-----------LYLKIKKTETKQSFLGRNLGIISIKPCSVEYW 230

Query: 296 VLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLL 355
           +L  LQIP+A     +  +AL                 R   L         AG++GG+ 
Sbjct: 231 ILLSLQIPLA---LVFTILALS----------------RTESLQEQSISNQEAGLLGGIF 271

Query: 356 GLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSI 415
           G+GGG I+ PL L  GIPPQV++AT  F + FS++MS V+Y LL       A  F  +  
Sbjct: 272 GIGGGMIISPLLLRAGIPPQVTAATTSFMVFFSATMSGVQYLLLGMQNTEAAYVFSVICF 331

Query: 416 IAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 458
            A+ +G    +K++    RASII+F +  +++++ + +   G+
Sbjct: 332 FASTLGLVFAQKVVPHFRRASIIVFLVGTMMYLTTIVMASFGI 374


>gi|303279591|ref|XP_003059088.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458924|gb|EEH56220.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 461

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 60/142 (42%), Positives = 89/142 (62%), Gaps = 2/142 (1%)

Query: 78  IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
           I + ++  FF +   + GGVGGGG+FVP+L L+VG   K +  VS CM+  GA+ A  +N
Sbjct: 16  IAIASVACFFVSILANAGGVGGGGVFVPLLMLVVGLSGKWAIPVSNCMILAGAIPATFFN 75

Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
           L +RHPT D P++D + ALL  P  + G + GV  NV+F +W+++ +LI L    ST+ F
Sbjct: 76  LMKRHPTRDRPLLDTNAALLLIPATLAGTTPGVMLNVLFPEWLVSAMLICLLTYTSTQTF 135

Query: 198 LKGVETWKKETITKREAAKQLE 219
            KG   W+KE   KR+  K++E
Sbjct: 136 QKGKREWRKEGEIKRK--KRME 155



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 63/172 (36%), Positives = 90/172 (52%), Gaps = 14/172 (8%)

Query: 288 TTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGIT 347
           T C   YW   L+  P+A G+ A     LY      A +   G  + A  L      GI 
Sbjct: 267 TPCVNEYWA--LVAAPIALGLCASYLAGLYL-HADCAERLRVGYAYDAGDLRMETTDGIA 323

Query: 348 --------AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 399
                   AGM  GLLG+GGG ILGPL L+LG+ P+VS+    FA+ F+SSMSV+++ LL
Sbjct: 324 KWTLWAFGAGMGSGLLGIGGGMILGPLLLDLGMSPKVSAPITHFAVLFTSSMSVIQFALL 383

Query: 400 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR-ASIIIFTLSFIIFVSA 450
            +    +A     L++I + +   VLK+ +   G  ASII+  L+ I+ +SA
Sbjct: 384 GQLLPAHA--GCCLTLIGSVMSDKVLKREMAKRGYGASIIVLCLAGIMSLSA 433


>gi|294944275|ref|XP_002784174.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897208|gb|EER15970.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 393

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 92/363 (25%), Positives = 169/363 (46%), Gaps = 37/363 (10%)

Query: 93  SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDY 152
           S  G+GGG +F P+L+L+   D+K ++A S+ ++    +++ + N   +      P+I  
Sbjct: 24  SPAGIGGGILFTPVLHLVGELDSKEASATSQALIAASQLASCIINFWTQWHEPKKPLIIL 83

Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
              ++  P +V G  IGV  N+I    +I +L +++    + +  LKGV+ ++ E  TK+
Sbjct: 84  PYVIIMLPCIVAGAVIGVYLNMILPQLIILILYVLVAAFSTIKTTLKGVKQYRSENATKK 143

Query: 213 EAAKQLELIVLGNGYQTEECKCDPEYLSNDT--TPEETREPKKSKVSIIEN-----IYWK 265
            A+K+ E                P   S  T  T EE +E K     ++ +      YW 
Sbjct: 144 -ASKEHE---------------SPSSASQKTIVTLEEAKEKKVDPFLVMPSRKVLFFYWT 187

Query: 266 EFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS 325
              L   +W + L   + +  +T  S+         +P  GGV  + A         I+S
Sbjct: 188 TAFL---IWVLCLIFPLLRGSSTAKSIA-------PVPYCGGVYWFLAALEIALLLGISS 237

Query: 326 KGDEGTKWRASQLVFYCACGIT---AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 382
            G    K +  +LV  C  G+     G++  ++G+GGG ++ P+ L+ G+ PQ  +AT  
Sbjct: 238 -GFIFAKRKVCRLVQCCILGVEIVFTGIISSMVGIGGGILMNPIILDFGLNPQQGTATNA 296

Query: 383 FAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTL 442
             I   S+ + + Y +   FP    ++   L  +   +G+ VLK+++   GR S+++F L
Sbjct: 297 INIFAMSTSTALSYGMSGYFPGGSDMWIVVLPFVGGIIGKLVLKQIVAKTGRMSVLVFLL 356

Query: 443 SFI 445
           + I
Sbjct: 357 AGI 359


>gi|297737351|emb|CBI26552.3| unnamed protein product [Vitis vinifera]
          Length = 100

 Score =  103 bits (256), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 49/76 (64%), Positives = 62/76 (81%)

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           M  G A S +  NL+ +HPTLD+P+I+YDLALLFQP+L++GISI VAFNV+F D M+T+L
Sbjct: 24  MNMGAASSTIYCNLKLKHPTLDMPIIEYDLALLFQPMLMMGISIEVAFNVVFVDSMVTIL 83

Query: 185 LIVLFLVLSTRAFLKG 200
           LIVLFL  ST+ FLKG
Sbjct: 84  LIVLFLGTSTKTFLKG 99


>gi|268083363|gb|ACY95275.1| unknown [Zea mays]
          Length = 94

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 50/93 (53%), Positives = 69/93 (74%)

Query: 385 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 444
           + FSSS+SVVE+Y L RFP+P+A Y   +SI+A F GQ +++K++ +L RAS+I+F LS 
Sbjct: 1   MMFSSSLSVVEFYFLHRFPLPFAGYLIFISILAGFWGQCLVRKIVHVLKRASLIVFILSS 60

Query: 445 IIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           +IFVSAL++G VG  K I  I + EYMGF N C
Sbjct: 61  VIFVSALTMGVVGTQKSISMINNHEYMGFLNFC 93


>gi|358344827|ref|XP_003636488.1| hypothetical protein MTR_042s0030 [Medicago truncatula]
 gi|355502423|gb|AES83626.1| hypothetical protein MTR_042s0030 [Medicago truncatula]
          Length = 150

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 65/119 (54%), Positives = 77/119 (64%), Gaps = 22/119 (18%)

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           +A AL+  RR IAS GD+G  +    L+ YC  G+ AG+VGGLLGLGGGF++GPLFLELG
Sbjct: 27  KATALFTQRRVIASAGDQGKAFTVGLLIIYCVFGVLAGIVGGLLGLGGGFVMGPLFLELG 86

Query: 372 IPPQ----------------------VSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           +P Q                      V SAT  FA+TFSSSMSVVEYYLLKRFP+PY L
Sbjct: 87  VPSQSSESFKLQYINRHSLILIIKQLVPSATTTFAMTFSSSMSVVEYYLLKRFPIPYGL 145


>gi|297849536|ref|XP_002892649.1| hypothetical protein ARALYDRAFT_334446 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338491|gb|EFH68908.1| hypothetical protein ARALYDRAFT_334446 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 69/260 (26%), Positives = 122/260 (46%), Gaps = 34/260 (13%)

Query: 60  KSRSGYKHVWPD-----IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFD 114
            + S + H  P+     IKF    ++  ++ FF     S+   GGG +F+ ++  I G +
Sbjct: 29  NATSQWLHFPPNLNESTIKFSIPTIIAAVLSFFA---ASISSAGGGALFLSIMTTISGLE 85

Query: 115 AKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
            K++++ S  M+TG +++ V  NL  R+P + D  +ID+DL+L  QP L+LG+SIGV  N
Sbjct: 86  MKTASSFSAFMITGVSIANVGCNLFARNPKSRDKTLIDFDLSLTLQPCLLLGVSIGVICN 145

Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECK 233
            +F +W++  L  V     + +   KGV  W  E  ++RE  +                +
Sbjct: 146 RMFPNWLVLSLFAVFLAWSTMKTCKKGVSYWNLE--SEREKIRS---------------R 188

Query: 234 CDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT------ 287
            D + +    +P    E +      +    W + G+LV +W +  ++ + +         
Sbjct: 189 RDDDRIKVARSPLLANEGEAEVERGMIRFPWMKLGVLVIIWLVFFSINLFRGNKYGQGII 248

Query: 288 --TTCSVLYWVLNLLQIPVA 305
               C  LYW L+ LQIP+ 
Sbjct: 249 SIKPCGGLYWFLSSLQIPLT 268


>gi|401403532|ref|XP_003881498.1| Os03g0726500 protein, related [Neospora caninum Liverpool]
 gi|325115911|emb|CBZ51465.1| Os03g0726500 protein, related [Neospora caninum Liverpool]
          Length = 494

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 73/246 (29%), Positives = 118/246 (47%), Gaps = 27/246 (10%)

Query: 3   RVGSECFRWRLLAAAFIGLLVVASVSVSAQSNLKP----EATSKN--------KNDHVEP 50
           R+  E  R RL   A    L   ++ V  Q    P    + T +N          D  +P
Sbjct: 110 RLQQEAIRVRLQQEAIRVGLQQEAIRVGLQQEAIPVGLQQETPRNTVGLYRSLSTD--DP 167

Query: 51  NIVIKVSAP-KSRSG---YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPM 106
            +V+   +P +  SG    +    ++  G  IV   +I   G A     G GGG IFVP+
Sbjct: 168 LLVLNTGSPVRKLSGLVDERRTLSELNNGPTIVCIVLIAIVG-AVSVTAGTGGGAIFVPL 226

Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
           + LI+ F+   + A S+C++TG A++ +  N  +R+P +D+P+ID D+ LL  P+ + G 
Sbjct: 227 MQLIMHFNTFEATATSQCLMTGSALAGLCLNFVRRNPVVDMPLIDMDMVLLLGPMQMCGS 286

Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNG 226
           S+GV  N +   W+ITVLL+V  L  + R         ++    +REA K  +L    + 
Sbjct: 287 SVGVIVNRVLPAWLITVLLVVCLLYETVRLM-------RRLRDKQREAKKVTQLTASEHA 339

Query: 227 YQTEEC 232
           ++ E C
Sbjct: 340 HK-ETC 344


>gi|403339320|gb|EJY68918.1| hypothetical protein OXYTRI_10465 [Oxytricha trifallax]
          Length = 539

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 108/426 (25%), Positives = 193/426 (45%), Gaps = 66/426 (15%)

Query: 113 FDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP---VIDYDLALLFQPLLVLGISIG 169
           FD K++ A+S   +   +V+  +Y L ++HP  D     +I+Y+LA++  P +++G   G
Sbjct: 118 FDTKNAIAISNFAIFTCSVTRYIYTLDKKHPHPDKKQNVIIEYNLAIVMLPTVMMGSLTG 177

Query: 170 VAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE---------- 219
           V  N+IF    +  +L  L + LS ++ +KG + ++KETI  ++ A++L+          
Sbjct: 178 VFLNIIFPAIALQAILTALLIFLSLQSLMKGKDMYRKETIKFQQEAEKLKKKQDEERLMM 237

Query: 220 --LIVLGNGYQTE---ECKCDPEYLSND-TTPE-----ETREPKKSKVSIIENIYWKEFG 268
             ++ L    Q E   E   D E L  + T P+     E  E +     +I N+  +E  
Sbjct: 238 EKILKLKTDSQGEIIRESPIDEEQLKKEGTKPQQIQNFEIEEGQPLTKGLINNVNQEELQ 297

Query: 269 LLVAV---------WA------IILALQIAKNY------------TTTCSVLYWVLNLLQ 301
            L  +         W       +I +L ++ N                CS+L W++    
Sbjct: 298 QLERILEKEKGHKQWDKHLTCLLIFSLLVSTNLLRGNKSLDSIIGIERCSLLDWMVLFTF 357

Query: 302 IPVAGGVS-------AYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGL 354
             +   ++         E +   K +R +A      T     +L  +   G   G + G 
Sbjct: 358 YILCSSITFLGIRRVRKEQILKEKYKRGLADCDIRLTPRNTLRLQIFSFVG---GWISGA 414

Query: 355 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 414
           LGLGGG I  P+ + LG PP V++AT+++ I+FSS+ S   Y +     +P++++   + 
Sbjct: 415 LGLGGGAIFNPILIGLGTPPAVATATSMYMISFSSAGSTATYIIYGLINLPFSIWVGVIG 474

Query: 415 IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL---GGVGLAKMIKRIEHKEYM 471
              A  G  +   + K   R S I+F L+ ++  SAL +   GG+ L K+I R E  +  
Sbjct: 475 CFGATGGLALFNVVTKKYNRQSFIVFVLAGVMGASALLVPIFGGLDLMKLIDRGE--DIF 532

Query: 472 GFDNIC 477
             ++IC
Sbjct: 533 KMNSIC 538


>gi|242083980|ref|XP_002442415.1| hypothetical protein SORBIDRAFT_08g019640 [Sorghum bicolor]
 gi|241943108|gb|EES16253.1| hypothetical protein SORBIDRAFT_08g019640 [Sorghum bicolor]
          Length = 170

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 65/89 (73%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           I+   + + +  + K+       M+ GG+VS V YNL+ +HP+LD+P+I+YDLALL QP+
Sbjct: 43  IYSTAIYVYLNLNEKAQVCWFCSMIMGGSVSTVYYNLKLKHPSLDMPLIEYDLALLMQPM 102

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFL 190
           L+LG+SIGV FNVIF +W+IT LLI +FL
Sbjct: 103 LMLGVSIGVIFNVIFPNWLITALLITIFL 131


>gi|397643932|gb|EJK76166.1| hypothetical protein THAOC_02088 [Thalassiosira oceanica]
          Length = 534

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 106/424 (25%), Positives = 174/424 (41%), Gaps = 76/424 (17%)

Query: 104 VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLV 163
           VP+  LI+ F  K +  +S   V GG+++  + N R+RHP  D P+ID+DL ++ +P  +
Sbjct: 80  VPIYILILNFLPKHAIPLSNVTVFGGSIANTLLNWRKRHPVADRPLIDWDLIVVMEPPTL 139

Query: 164 LGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET--------------- 208
           LG  IG   N +  +  I +LL+VL +  S     K    ++KET               
Sbjct: 140 LGALIGANLNKLLPETAIAILLVVLLVYTSFNTLKKAHSMYQKETSEIKHRNINHRSVEP 199

Query: 209 ---ITKREAAKQLELIVLGNGYQTEECKCD------------------PEYLSNDTTPEE 247
              + K E+   LE   L N +  +    D                  P  +  D     
Sbjct: 200 TAGLFKTESENGLEEYFLLNDHLRDGRDSDSDTSDTGNIGLSLNEIRYPGDVERDGDLRN 259

Query: 248 TREPKKSKVSIIENIY-------WKEFG---------------------LLVAVWAIILA 279
            R  ++S   + E          + EFG                     L+  ++ ++L 
Sbjct: 260 VRLDEQSDEGVDEASQGSSTVESYTEFGHTLELQNILDEEKRPKKKNIALIATLFMVVLT 319

Query: 280 LQIAKN-------YTTTC-SVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG---- 327
           + I K            C S  +WV  +L +     VS      L K   K    G    
Sbjct: 320 INILKGGGAFESPLGIECGSASFWVAQILLLIWICVVSWIGRKMLLKSTAKKTDAGFAYL 379

Query: 328 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITF 387
           DE  +W   + + Y      AG+  GL G+GGG I GPL + LG+ P V+SAT+   I F
Sbjct: 380 DEDIRWNGKKTIIYPMISTLAGVAAGLFGIGGGIIKGPLMIALGVHPAVASATSACMILF 439

Query: 388 SSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIF 447
           +S  +   + +       YA+    L  ++  VGQ V+  +++   R+S I +++ F++ 
Sbjct: 440 TSFTATTTFSVYGLMVRDYAIACSILGFVSTLVGQKVMNSILRKTNRSSYIAYSIGFVVL 499

Query: 448 VSAL 451
           +SA+
Sbjct: 500 LSAI 503


>gi|325193669|emb|CCA27930.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 591

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 91/158 (57%), Gaps = 8/158 (5%)

Query: 325 SKGD-EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
           S+GD   TK + ++L FYC   + AG+  GLLG+GGG + GP+ LE+G+ P V  ATA F
Sbjct: 434 SEGDLHWTKKKTTRLPFYC---VIAGITAGLLGIGGGMVKGPIMLEMGVLPMVQVATANF 490

Query: 384 AITFSSSMSVVEYYLLKRFP----VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 439
            I F+S+ + +++ +  +FP      Y  +F     + A+ GQ V+  L+K   R S+++
Sbjct: 491 MILFTSASTTLQFAIAGQFPGSLQYDYVGWFACAGFVGAYCGQSVVAFLLKKYNRESMLV 550

Query: 440 FTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           + L+ +I VSA  +G VG   +   I  + ++GF   C
Sbjct: 551 YILAVMIGVSAFCMGIVGFQIVENEIALRMHLGFSGSC 588



 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 47/137 (34%), Positives = 81/137 (59%), Gaps = 2/137 (1%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           ++ TI+    TA  +  G+GGGG+ VP   L++G   K +  +SK  + G +V+   +N 
Sbjct: 123 LIATIMTLLITALSAGCGIGGGGLLVPSFILVIGLSPKHAIPLSKATILGTSVANFWFNY 182

Query: 139 RQRHPTLD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
            +RHP  + +P+I+Y +A + +P  ++G  IGV  N +  +W+I +LLI L   ++ R F
Sbjct: 183 HRRHPMKNNVPIINYAMAAIMEPPTLIGAVIGVMLNHVVPNWLIFLLLISLLTSITLRTF 242

Query: 198 LKGVETWKKETITKREA 214
           +KG    +KET  +R+A
Sbjct: 243 IKGNRLREKET-KRRQA 258


>gi|356533449|ref|XP_003535276.1| PREDICTED: uncharacterized protein LOC100806242 [Glycine max]
          Length = 408

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 82/204 (40%), Positives = 113/204 (55%), Gaps = 15/204 (7%)

Query: 279 ALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQL 338
           A+ + +NYTT CS LYW++NLLQ+P+  G + YE V LYKG+  IASKGD+ T+W   QL
Sbjct: 201 AVIVKQNYTTNCSALYWIMNLLQVPITIGTTFYEVVLLYKGQSVIASKGDQQTRWHVQQL 260

Query: 339 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE--Y 396
           + YC CGI +G++GGLLGLGGGFILGPLF+ LGI PQ+ + T       SS+ +      
Sbjct: 261 ILYCMCGIISGIIGGLLGLGGGFILGPLFIGLGIHPQLKTNTKPHPWLLSSTHTQPHPPK 320

Query: 397 YLLKRFPVPYALYFFAL-----SIIAAFVGQHVLKKLIKILGRASIIIFTLSF------- 444
           +LL + P P +     +       I A        K+ +ILG        L F       
Sbjct: 321 FLLYQSPNPKSSRLITVHAEVKHTIVADTTTSSNDKIREILGNHDYNKLGLPFNEQKHLA 380

Query: 445 IIFVSALSLGGVGLAKMIKRIEHK 468
            I V A+ L  V +A   ++ + K
Sbjct: 381 NIIVDAV-LDNVSIATTYRKTQEK 403


>gi|219112005|ref|XP_002177754.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410639|gb|EEC50568.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 629

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 91/385 (23%), Positives = 174/385 (45%), Gaps = 45/385 (11%)

Query: 109 LIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISI 168
           L++G     +  ++   V GGA+++ + N+++RHP  D P+ID+DL L+ +PL ++G  +
Sbjct: 220 LVMGLHPHYAIPIASVTVFGGALASTIVNMQRRHPLADRPIIDWDLVLMMEPLTLIGTLL 279

Query: 169 GVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLGN--- 225
           G  F+ I ++ ++ VLL++L  + +     K +  ++ E    R           GN   
Sbjct: 280 GTLFHRILSEKILIVLLVLLLSITAHSTLSKAMRMYEAEKRYIRHLIAAQADSPRGNPSL 339

Query: 226 -GY-----QTEECKCDPEYLSNDTTPEETREP--------KKSKVSIIENIYWKEFGLLV 271
            GY       ++ + D           E R+         +  K  ++E         ++
Sbjct: 340 GGYVLPFGDEDDSRADTGCKEEARMAAEERQRILILNPDFRTMKTDLLEQEKVTPRSKII 399

Query: 272 A---VWAIILALQI-------AKNYTTTC-SVLYWVLNLLQIPVAGGVSAYEAVALYKGR 320
           A   ++++++ L +          +   C S  +WV++++ I      SA+ A      R
Sbjct: 400 ALCCMFSVLIFLNLMVGGGSFDSPWDIKCGSTAFWVVHVVMIAFLMS-SAWMAQTYLIAR 458

Query: 321 RKIAS-------KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 373
            +I          GD   KW     + Y A    AG+  G+ G+GGG ++ PL L  G+ 
Sbjct: 459 HEIKDMVRFDYVHGD--IKWDTRTSIIYPAVFTIAGVFAGMFGIGGGVVIVPLLLHSGVH 516

Query: 374 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI-- 431
           P V+SAT+   I F+S  SV  Y++       +A+  F +  I++ +GQ +++++ +   
Sbjct: 517 PGVASATSSAMILFTSLASVSTYFVFGLIVADFAMAGFVIGFISSTLGQILMRRVRQAKS 576

Query: 432 -----LGRASIIIFTLSFIIFVSAL 451
                  R S + F +  ++ VSAL
Sbjct: 577 ASGRKFERNSYLAFVIGGVVLVSAL 601


>gi|440800424|gb|ELR21463.1| hypothetical protein ACA1_184090 [Acanthamoeba castellanii str.
           Neff]
          Length = 512

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 67/228 (29%), Positives = 118/228 (51%), Gaps = 23/228 (10%)

Query: 266 EFGLLVAVWAIILAL-------QIAKNY--TTTCSVLYWVLNLLQIPVAGG---VSAYEA 313
           +F  L  VW I+ A+       +  ++Y     CS+ +W+L  +  P+  G   +S   A
Sbjct: 275 KFLYLAIVWCIMFAIVFVRGGGESDESYLGIVKCSMYFWILWGVMFPIMLGFMVLSCIIA 334

Query: 314 VALYKGRRKIASKGDEG-TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGI 372
             +Y  R++      EG  +W    L      G   G   G LG+G G + GP+ LE+G+
Sbjct: 335 RLIYSYRKRNGWPFIEGDVQWSVKSLFLIPFAGTIGGTAAGFLGIGSGMVNGPVMLEIGM 394

Query: 373 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 432
            P+V++AT+ F I F++  +V +Y+++       AL+FF L ++AA VGQ+ ++ ++K  
Sbjct: 395 TPEVATATSSFIIVFTALSTVSQYFIIGALNWQPALWFFVLGVLAAVVGQYGVQYVVKRF 454

Query: 433 GRASIIIFTLSFIIFVSALSLGGVGLAKMIK---RIEHKEYMGFDNIC 477
            ++SII F L+F+I        G G+A ++    +I  +   GF ++C
Sbjct: 455 NKSSIISFLLAFVI-------AGSGVAMIVTGALQIADEGITGFADLC 495



 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 60/148 (40%), Positives = 96/148 (64%), Gaps = 2/148 (1%)

Query: 71  DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
           ++ FGWR+ +GT + F G A     G+GGGGI+VP+L LIVG+ AK +  +SK  V G A
Sbjct: 3   ELLFGWRLYLGTGLAFVGGALSGAVGIGGGGIYVPVLILIVGYTAKEAIPLSKVTVAGVA 62

Query: 131 VSAVVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF 189
           +S+ + N+ +RHP      ++DYD+A++  P  +LG ++GV   VI  +W+I +LLI++ 
Sbjct: 63  ISSFLVNVLRRHPRAPWRALVDYDVAMVMTPTTLLGTTVGVLVYVILPEWLILILLILVL 122

Query: 190 LVLSTRAFLKGVETWKKETITKREAAKQ 217
            ++  R F+  ++ WKKE + K EA +Q
Sbjct: 123 GLVDYRTFVAAIKLWKKEKVAK-EAERQ 149


>gi|340508148|gb|EGR33919.1| hypothetical protein IMG5_031030 [Ichthyophthirius multifiliis]
          Length = 434

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 88/394 (22%), Positives = 173/394 (43%), Gaps = 45/394 (11%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
           P+L +++ +  + +  +S C+V GG ++  +  LR++HP    P+IDY++ ++  P+++L
Sbjct: 55  PLLMILMNYQQRKATFISYCIVLGGCLANSLLILRKQHPLKKKPLIDYNIIMIINPMVIL 114

Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLG 224
           G +IG+  NVIF + +  VL I+    +S   F K  +  +   + K   +  +  + + 
Sbjct: 115 GTNIGIILNVIFPEIVSGVLFIIFLCTVSPYLFKKESQLSRDNDLEKVNNSYIISDVKVD 174

Query: 225 NGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK 284
           N  Q++    DP  L      EE + P  +K+ I+  ++       V++  +I  L+  K
Sbjct: 175 NIAQSQIKNNDPGELKCFLMQEERQYPL-NKLLILMFVF-------VSIQFLIF-LRGGK 225

Query: 285 NYTT-----TCSVLYWVLNLLQIPVAGGVSAYEAVA-----LYKGRRKIASK-------- 326
              +      CS  YW+L       + G+  Y  V      ++  R +I  K        
Sbjct: 226 GVGSFIGIKICSNSYWLL-------SAGILVYSLVVSYFIKIFISRNEIQKKMIFQKYGL 278

Query: 327 ----GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 382
                D+       +     A G+ +G + G  G G   +L P+F+   +PP + SA   
Sbjct: 279 EEYFKDDFDISDNKKYFIIWASGLLSGCLSGTFGSGAALLLMPVFISYQLPPIIGSAVCG 338

Query: 383 FAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           F   F +  S++  +  +       + +  L+ +  FV   +L  +++      I++F  
Sbjct: 339 FNYFFIACASIISVFSEQYLTAYEVIIYSFLAFLGGFVCARILYGIVERKKAQHIVVF-- 396

Query: 443 SFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNI 476
              I VS   L  +G   ++  I+ K   G D++
Sbjct: 397 ---IVVSLAVLNIIG--NIVYLIKKKNDYGIDSL 425


>gi|297597753|ref|NP_001044473.2| Os01g0786700 [Oryza sativa Japonica Group]
 gi|255673761|dbj|BAF06387.2| Os01g0786700 [Oryza sativa Japonica Group]
          Length = 300

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 45/115 (39%), Positives = 70/115 (60%), Gaps = 3/115 (2%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL---RQRHPTLDIPVIDYDLALLF 158
           ++VP+LN++ G   K++ A S  MVTGG +S V+Y L   R        P+IDYD+A++ 
Sbjct: 29  LYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTLIVLRGHEKGGHQPLIDYDIAVVS 88

Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
           QP L+LG+S+GV  NV+F +W+IT L  V     + + +  G++ W+ ET   R 
Sbjct: 89  QPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFKTYGTGMKRWRAETAAARR 143



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 59/107 (55%)

Query: 371 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           G   + +S+T +F + F +SMS+V++ +L    +  AL +     +A+ VG  V++  I+
Sbjct: 194 GEGAKTASSTTMFMVLFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGTIR 253

Query: 431 ILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
             GR S+I+F ++ I+ +S + +   G  ++  +    +YMGF   C
Sbjct: 254 KSGRVSLIVFMVAAILALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 300


>gi|340501283|gb|EGR28084.1| hypothetical protein IMG5_183410 [Ichthyophthirius multifiliis]
          Length = 470

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 89/377 (23%), Positives = 167/377 (44%), Gaps = 63/377 (16%)

Query: 65  YKHVWP--DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           +K++WP   I+    I++  +IG       +VGG GG  + VP+L L++ +   +S  +S
Sbjct: 24  HKNLWPPNTIEICAYILIPILIGI-----SNVGGQGGSIVRVPLLMLMLNYSQSTSVFIS 78

Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
             ++ G  +   +  L +RHP  DIP+I++DL L+  P L++G   G+   ++  +++  
Sbjct: 79  FIILFGSCLPNSLLLLTKRHPFKDIPLINFDLVLILLPNLIVGNIYGILLTLVVPEFITI 138

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL-IVLGNGYQTEECKCDPEYLSN 241
           +L I+    ++   + KG++ +K++   K +  K++ L I L    Q      +     +
Sbjct: 139 ILFILYLFAITPYFYRKGMKLYKEK---KHKDQKEVYLQINLNKTIQRHNINENVNTYQD 195

Query: 242 D--------TTPEETREPKKSKVSIIENIYWKEFGL-----------LVAVWAI---ILA 279
           D            E + P+  K    + IY ++  L           ++A + I   IL 
Sbjct: 196 DNNSNISNYNNNIEIQSPQSQKN---KQIYIRKKKLKSILPIKKILAIIATFLIIQTILM 252

Query: 280 LQIAKNYT----TTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS---------- 325
           L+ ++ +      T ++ Y+++NL    V   ++ Y     +K R K  S          
Sbjct: 253 LRCSQKFDYLGIKTYNLYYYLINLFLFIV--NIAMY---LFFKDRFKTKSLIIQIKKQQQ 307

Query: 326 --------KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 377
                   K +E +       +   + G  +G+  GL G+G G  + P  L L I P V+
Sbjct: 308 IDLNESQYKDNEFSLQSYKCFLQIISLGFISGVFAGLFGIGSGLTIVPALLYLKIEPTVA 367

Query: 378 SATALFAITFSSSMSVV 394
           +AT  F   F S  SV+
Sbjct: 368 AATNGFITFFLSLNSVI 384


>gi|85002083|gb|ABC68446.1| unknown [Tragopogon pratensis]
          Length = 134

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 68/132 (51%), Positives = 80/132 (60%), Gaps = 6/132 (4%)

Query: 269 LLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGD 328
           LL AVWA  L LQI KN    CS  YWVL + Q P    V  +E V LYK  +K    G+
Sbjct: 3   LLFAVWAAFLVLQIIKNELVVCSTWYWVLTVAQFPATLVVFGFECVKLYKESKKRKRAGN 62

Query: 329 EGT------KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 382
             +      +W    L+F   CGI  G VGGLLG GGGFILGPL LE+G+ PQV+SATA 
Sbjct: 63  TESVCEASIEWTIPSLIFCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASATAT 122

Query: 383 FAITFSSSMSVV 394
           F + FSSS+SVV
Sbjct: 123 FVMMFSSSLSVV 134


>gi|148906476|gb|ABR16391.1| unknown [Picea sitchensis]
          Length = 157

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 63/126 (50%), Positives = 80/126 (63%), Gaps = 16/126 (12%)

Query: 17  AFIGLLVVASVSVSAQSNLKPEATSKNKN-------DHVEPNIVIKVSAPKSR------- 62
           A  GLL+  ++S + ++ L   + S           D VE N  +K    K+R       
Sbjct: 16  ATFGLLLAVTLSSAERTFLDGNSASSETERSMEKIVDPVEGN-TLKYHVLKARNFLWRSN 74

Query: 63  -SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
            S Y+ VWP++KFGWRIV+G+I+GFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+
Sbjct: 75  ISDYQRVWPEMKFGWRIVIGSIVGFFGAAVGSVGGVGGGGIFVPMLTLIIGFDAKSSTAI 134

Query: 122 SKCMVT 127
           SKC  T
Sbjct: 135 SKCKQT 140


>gi|301093159|ref|XP_002997428.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110684|gb|EEY68736.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 591

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 58/144 (40%), Positives = 88/144 (61%), Gaps = 5/144 (3%)

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           LVF   C + AG+  GLLG+GGG + GP+ LE+GI P V SATA F I F+SS + +++ 
Sbjct: 446 LVFPALCSM-AGVAAGLLGIGGGMVKGPIMLEMGILPPVQSATANFMILFTSSSTTLQFA 504

Query: 398 LLKRFP----VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
           +  +FP      Y  +F  +  I  F GQ V+  L+K   R SI+++ L+  I +SAL++
Sbjct: 505 INGQFPGQLQYDYMAWFALMGCIGGFCGQKVVAYLVKKYRRESIMVYLLAVTIGLSALAM 564

Query: 454 GGVGLAKMIKRIEHKEYMGFDNIC 477
           G +GL   ++ IE   ++GF+ IC
Sbjct: 565 GIIGLKSTLRDIEKGVHLGFNGIC 588



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/146 (37%), Positives = 82/146 (56%), Gaps = 1/146 (0%)

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
           F W+ +V TII    TA GS  GVGGGG+ VPM     G   K +  +SK  + G AVSA
Sbjct: 109 FTWKDIVATIIALVSTALGSGCGVGGGGLLVPMYIFFYGLSPKHAIPLSKATIFGNAVSA 168

Query: 134 VVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
             +N  ++HP    +P+I+Y +A + +P  ++G   GV  N +F DW+I VLL+ L   +
Sbjct: 169 YFFNFNRKHPMNAKLPLINYQVAGVMEPTTLIGAIFGVMMNHMFPDWLILVLLVSLLSYI 228

Query: 193 STRAFLKGVETWKKETITKREAAKQL 218
           + +  LKG    +KE+  +R   K +
Sbjct: 229 TYKTVLKGNTIREKESRYQRAVVKSV 254


>gi|384249010|gb|EIE22493.1| hypothetical protein COCSUDRAFT_47913 [Coccomyxa subellipsoidea
           C-169]
          Length = 628

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/216 (26%), Positives = 105/216 (48%), Gaps = 4/216 (1%)

Query: 266 EFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS 325
           +  LL  ++A I+A    K+ T   +  YW++ L  +PV   ++ +    L +      +
Sbjct: 409 KLALLFLMFACIVASDSFKDRTVCGTWQYWLVVLSVLPVILIITLFVRAYLVRDFNAKQA 468

Query: 326 KG---DEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 381
            G    EG  +W     + + A    AG++ G+ G+GGG + GPL LE+G+ P V++AT+
Sbjct: 469 SGYVWTEGDVEWSRRNTLLFPALSSLAGLIAGMFGVGGGIVKGPLMLEMGVLPDVAAATS 528

Query: 382 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 441
              I F+++ + V Y      P  Y L  F + +I   VGQ     L+K L R S+++  
Sbjct: 529 ATMIMFTAASASVVYLSFGGIPFDYGLATFLVGLIFTMVGQVTCYWLMKALDRRSVVVIA 588

Query: 442 LSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           ++ ++ +S + +    +   I  ++    + F  IC
Sbjct: 589 MALLMVISMVIIYYEAVVSTIAAVQDHRLLDFGTIC 624



 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 104/194 (53%), Gaps = 11/194 (5%)

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           +K +WP     WR VVG I         + GG+GGGGI VP+  L++GF   ++ A+S  
Sbjct: 29  HKGLWP---LDWRDVVGYIFAAVSLFIAAGGGIGGGGILVPLYILVMGFPTNTAVALSNI 85

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
            + GGA+S  V+N+ +RH   + P+ID+++ L  +P  +LG  +G   N    +WM T+L
Sbjct: 86  TIVGGAISNFVFNVGRRHAFFNRPLIDWNIILAMEPATILGALLGGYLNKATPNWMTTIL 145

Query: 185 LIVLFLVLSTRAFLKGVETWKKETIT-KREAAKQLE------LIVLGNGYQTEECKCDPE 237
           L  L  +L+ +   + V TW+KE +  KR AA   +        +L  G Q E+ +   E
Sbjct: 146 LAALLTLLTYKLVDRAVVTWRKENLEFKRAAAGSSQDGSDPSEPLLRKGPQ-EQQEILNE 204

Query: 238 YLSNDTTPEETREP 251
             + + +P ++ +P
Sbjct: 205 AFAPEQSPRQSTQP 218


>gi|294944337|ref|XP_002784205.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897239|gb|EER16001.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 91/368 (24%), Positives = 162/368 (44%), Gaps = 57/368 (15%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL------RQRHPTLDIPV 149
           GVGGG +FVP+LNL+    +K++ A+S+ ++    V+ V+++L      R+R   +++P 
Sbjct: 27  GVGGGILFVPLLNLVGRLPSKNATAMSQGLIMSATVAKVLFSLHAQYTSRRRGRVINLPY 86

Query: 150 IDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWK 205
           +     +L  P +++G  IG+        W+  ++ ++L+++     S  + LKG   W+
Sbjct: 87  V-----VLMLPCMIVGGLIGIYIY----SWLPVLIQLILYVITALFGSAMSLLKGFHLWR 137

Query: 206 KETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSK-VSIIENIYW 264
            ET  K +A +  E+   G        K           P   R   + K V+ +  I+ 
Sbjct: 138 AETSEKEKAKRDAEVSAGGT------LKVPASSTVETVVPPLLRPITRRKAVACVITIF- 190

Query: 265 KEFGLLVAVWAIILALQIAKNYTTT--------CSVLYWV-LNLLQIPVAGGVSAYEAVA 315
                  AVW +++  ++    ++T        C  LYW    ++ + +     AY  + 
Sbjct: 191 -------AVWILVILSRLILGSSSTRSIVGISYCEGLYWALSVVVVVVLLMVPLAYALID 243

Query: 316 LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQ 375
              G  K A           S L      GI  G +  ++G+ GG I+ PL +  G+ P 
Sbjct: 244 RSPGSSKAA------LTLSGSLL------GI--GFLAAVVGISGGIIITPLVMFTGLTPP 289

Query: 376 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 435
            +S T    I  +SS   + + L    P   AL+  AL    A  G  +L ++++  GR 
Sbjct: 290 QASGTGSVVILVNSSSLALSFGLGGYLPDASALWIIALPFCGALTGDIILTRIMRRTGRT 349

Query: 436 SIIIFTLS 443
           SI+ F LS
Sbjct: 350 SILAFLLS 357


>gi|325181581|emb|CCA16031.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 421

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 81/332 (24%), Positives = 147/332 (44%), Gaps = 43/332 (12%)

Query: 81  GTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQ 140
           G I  F   A  S  G+GGGG+ VP+  L      + +  +SK ++ G ++S ++  LR+
Sbjct: 97  GIIGTFLSAAIASGCGLGGGGLLVPLYILTQHLSPQKAIPLSKAVIFGASISGLIVTLRR 156

Query: 141 RHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK 199
           +HP +    +I++D  LL +P+ + G  IGV    I  D+++T+LL              
Sbjct: 157 KHPFVSSRSIINFDAVLLMEPMTLAGTVIGVTMITILPDYIVTILL-------------- 202

Query: 200 GVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSII 259
                  +    R +  Q         + +++C+ D           +     + K+   
Sbjct: 203 ------GQAQILRYSHHQ---------FASDDCQ-DQSLTDQSLNLSDGGIMNRQKIQTS 246

Query: 260 ENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAY 311
           +     +  ++   W  ++ + + K      S++        YW +      V   ++AY
Sbjct: 247 QRNLLCDMAVISLTWLSLIVISLVKGGHGAASIVGIGCGSYSYWAVITFMPYVFCAITAY 306

Query: 312 EAVALYKGRRKIAS---KGDEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 367
            A  + K R  + S   +  EG  +W    +V Y      AG+  G+LG+GGG + GP+ 
Sbjct: 307 FAQRILKQRAFLESYDYRYAEGEMRWDRRGVVRYPIFCSLAGVAAGMLGIGGGMVKGPIL 366

Query: 368 LELGIPPQVSSATALFAITFSSSMSVVEYYLL 399
           LE+   PQV+SAT+   I F+SS +V++  L+
Sbjct: 367 LEIRFHPQVASATSSTMIFFTSSTTVIQEALI 398


>gi|452823720|gb|EME30728.1| hypothetical protein Gasu_19670 [Galdieria sulphuraria]
          Length = 711

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 8/180 (4%)

Query: 70  PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
           P   F WR  +  ++ F      + GGVGGG +FVP+L L  G+ A ++AA+S+ +VTG 
Sbjct: 140 PLTDFTWRDGLTFVLVFIIAGLSNAGGVGGGFLFVPVLVLATGYRASTAAAISQALVTGA 199

Query: 130 AVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF 189
           + +   Y L +RHP  + P IDY + + F P ++ G SIGV  N +F ++     L  L 
Sbjct: 200 SGANTFYGLIRRHPKRERPRIDYGVVIHFIPSVLCGTSIGVLLNELFPNFFTLFALSALV 259

Query: 190 LVLSTRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETR 249
           L +   +  KG+  WK+E     +A K+ E         ++  + + E   ND   +E +
Sbjct: 260 LYVFYVSLKKGISLWKQERKEAEDAKKKSE--------TSQSQQTNSELTQNDQVAQEQQ 311



 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/147 (22%), Positives = 71/147 (48%), Gaps = 1/147 (0%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSS 390
            W   +++++    I  G +G  +G GG F+  P+ +  +G+ P V  +TA F    S  
Sbjct: 562 HWTKLRVIYFSPLMIILGAIGAWVGAGGSFMSTPILVAGIGMDPVVVQSTAGFMNFTSGF 621

Query: 391 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 450
            S ++Y    +  + Y L   A +++ +F G ++L  L+      +I++  +S ++F + 
Sbjct: 622 SSALQYIFDHQMKIDYGLSLGATTLVGSFTGLYILNGLVARYNLQAILVIVMSIVMFGAF 681

Query: 451 LSLGGVGLAKMIKRIEHKEYMGFDNIC 477
                 G+ ++I  ++  E+    +IC
Sbjct: 682 AVDLYAGVQELIGVLDLNEHFPIHSIC 708


>gi|428163170|gb|EKX32257.1| hypothetical protein GUITHDRAFT_148761 [Guillardia theta CCMP2712]
          Length = 538

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 104/409 (25%), Positives = 180/409 (44%), Gaps = 37/409 (9%)

Query: 82  TIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQR 141
           T+I F   A  +  G GGGGI VPM  +I  F   S+  +SK  + GGA++  + N+++R
Sbjct: 136 TVITFVTIALAAPTGTGGGGILVPMYMIIGHFSPHSAIPLSKATILGGAIANNLINIQRR 195

Query: 142 HPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV----LFLVLSTRAF 197
           HP  + P++DYD   +  P L++G  +GV  N +   W++T+ L+V     F + + +A+
Sbjct: 196 HPFANRPLVDYDSLQILVPSLLIGTILGVFLNAVSPAWLVTLGLVVSLGYSFAIAAKKAW 255

Query: 198 LKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVS 257
              VE   K ++ +RE             Y  +E K +P+    +    E+R   K+   
Sbjct: 256 AIYVEEVLK-SLPEREPLLGERKEQPAQHYSFDEDKLEPQL--REIIKAESRHDFKAIGM 312

Query: 258 IIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL- 316
           I+  I W    +LVAV ++I        +    S  +W++ LL  P+   +S     +L 
Sbjct: 313 IV--ISW----ILVAVCSLIKGGSGPNQFVACGSWSFWMVALLPFPIVMILSWRVGTSLN 366

Query: 317 --YKGRRKIASKGDEGTK-WRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 373
             ++ ++    +  EG   W    +  +    I  G++ G   LGG    G    E G  
Sbjct: 367 EKFESKKACGYRFAEGDAVWDVQHVRIFPFVSIIVGILAG--ALGGVEPCG----ERGDD 420

Query: 374 PQVSSATALF---AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
               S   L    A++   S      +LL                +AA +G   +  + +
Sbjct: 421 GAHGSLHVLLDHHAVSLPRSAQARLRHLLH-----------PRRTVAASIGNTAIHHVSR 469

Query: 431 ILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 479
              +   ++  L+  I +SA+ LG VG  + I+     E MGF +IC++
Sbjct: 470 KYRKTWFVVAILAITIGLSAVLLGYVGYYRAIRSWLEGEDMGFRDICHS 518


>gi|348689886|gb|EGZ29700.1| hypothetical protein PHYSODRAFT_309925 [Phytophthora sojae]
          Length = 580

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/262 (31%), Positives = 128/262 (48%), Gaps = 30/262 (11%)

Query: 238 YLSNDTTPEETR---EPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL- 293
           + S+D    + R   E ++ +V   E I W     LV  W IIL   I +      S + 
Sbjct: 324 FPSDDDKKAQRRGTIERREGRVLPFEYI-WP----LVVSWFIILVQSILRGGHGAPSAIG 378

Query: 294 -------YWVLNLLQIPVAGGVSAYEAVALY-KGRRKIAS-----KGDEGTKW-RASQLV 339
                  YWVL L+ + +  G+S Y    L    R K+ S     +GD    W +   LV
Sbjct: 379 VGCNSSDYWVLTLMPLSILVGISLYVGYRLRLTNRLKVVSGYFFVEGD--MHWVKRRTLV 436

Query: 340 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 399
           F   C I AG+  GLLG+GGG + GP+ LE G+ P V SATA F I F++S + +++ + 
Sbjct: 437 FPAVCTI-AGVAAGLLGIGGGMVKGPIMLEAGVLPAVQSATASFMILFTASSTTLQFAIN 495

Query: 400 KRFPVPYALYFFA----LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGG 455
            +FP  +   F A    +  +  F G   +   +K   R SI+++TL+  I +SA+++  
Sbjct: 496 GQFPGEFQFDFMAWLAFVGFVGGFCGLKCVGYFVKKYRRESIMVYTLAATIGLSAVAMRF 555

Query: 456 VGLAKMIKRIEHKEYMGFDNIC 477
           +GL   +  IE   ++GF  IC
Sbjct: 556 IGLQSTLSDIESGVHLGFHGIC 577



 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 73/135 (54%), Gaps = 13/135 (9%)

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
           F +  ++ T+I F  TA G+ GG+GGGG+ VPM  +  G + K +  +SK  + G AV+ 
Sbjct: 116 FTFNDLLTTLIAFSCTALGAGGGIGGGGLLVPMY-IFAGLNPKHAIPLSKVTIFGSAVA- 173

Query: 134 VVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLS 193
                      ++ P+ID+ L  L +P+ ++G   GV  N I  +W+I VLL+ L   ++
Sbjct: 174 -----------MNRPLIDFALVALMEPMTLVGTVFGVMLNHISPNWLILVLLVTLMSFIT 222

Query: 194 TRAFLKGVETWKKET 208
               LKG +   KE+
Sbjct: 223 YNTVLKGNKIQDKES 237


>gi|298710140|emb|CBJ31852.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 378

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 66/107 (61%), Gaps = 2/107 (1%)

Query: 103 FVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLL 162
            VP+  L++GFD K +  +S   + GGA++  V NL +RHP  D P++D+DL L+ +PL 
Sbjct: 146 LVPLYILVLGFDPKHAIPLSNITIFGGAITNTVLNLSKRHPAADRPLVDWDLILVMEPLT 205

Query: 163 VLGISIGVAFNVIFADWMITVLLIVL-FLVLSTRAFLKGVETWKKET 208
           + G  +G   N +  DW++ ++LIVL     + R   KG++++ KET
Sbjct: 206 IGGALVGSFINKVLPDWILAIMLIVLL-AATANRTLRKGIKSYNKET 251


>gi|294944269|ref|XP_002784171.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897205|gb|EER15967.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 409

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 93/418 (22%), Positives = 176/418 (42%), Gaps = 71/418 (16%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP--VIDYD 153
           G+GGG +FVP+L LI G   K S A+S+ ++   +++A ++N  +++   + P  +I + 
Sbjct: 27  GIGGGVLFVPVLRLIGGLQLKESTALSQALIASASLAATLFNCFEQYSARNDPKALIVWP 86

Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWKKETI 209
             +L  P  V+G  IGV        W+ ++ +++L+       S  A+ KG+  WK E  
Sbjct: 87  FVILTLPCTVIGSLIGVYLY----SWLPSLFILILYFCFVCLGSFMAYRKGIRLWKAENG 142

Query: 210 TKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT---PEETREPKKSKVSIIENIYWKE 266
            KR A          +G  T+  +      S++ T   P   R P   K++   +I    
Sbjct: 143 AKRRAV---------DGDSTDMSR------SSEVTVEVPSLLRMPNMKKLAAYTSIA--- 184

Query: 267 FGLLVAVWAIILALQIAKNYTTTCSVLY----WVLNLLQIPVAGGVSAYEAVAL------ 316
                 +WA+ L   +    T    V++     + +  Q  VA   S    V        
Sbjct: 185 ----ALIWAVCLIFPLLTG-THPLQVVFDLSPSLCDEQQDDVAATPSGVIGVPFCEEAFW 239

Query: 317 ----------------YKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 360
                           Y   ++ A     G     S +V         G++  ++G+ GG
Sbjct: 240 GLAALQALILLLIPTGYVVAKRTAETARVGLVLMTSMIVI--------GLISSIVGISGG 291

Query: 361 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 420
             + P+ L LG+ P+ ++AT    I  +S+ + + + L   FP    L+   +  I A +
Sbjct: 292 LFMIPVVLSLGLDPKQATATTSIVIFATSTSTALSFALGGYFPPASDLWIVVMPFIGALL 351

Query: 421 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK-RIEHKEYMGFDNIC 477
           G+ ++ +LI   GR SI++  L  ++ +  ++    G+  ++   +   E + F + C
Sbjct: 352 GKTIVARLIAKTGRMSILVLLLGTVVIIGGITTISTGIVSVVNGALNGDEVVQFGSFC 409


>gi|326433498|gb|EGD79068.1| hypothetical protein PTSG_02036 [Salpingoeca sp. ATCC 50818]
          Length = 607

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 75/132 (56%)

Query: 346 ITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVP 405
           + AGM+ G LG+GGG +L PL L+  + P VSSAT  +   F+S+ S  ++ +L R P  
Sbjct: 475 LVAGMLAGFLGIGGGMVLAPLLLQFNMHPLVSSATTAYMTLFTSAGSFTQFVILNRVPYD 534

Query: 406 YALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRI 465
           Y +  F L+  A+ VGQ +L   ++  G +S+I F L F+I ++A+ L   G  ++    
Sbjct: 535 YGIALFLLAAAASVVGQILLHSYVRRTGNSSVIAFILGFVIGLAAIMLLVSGSLQLKAAH 594

Query: 466 EHKEYMGFDNIC 477
           +  E  GF  +C
Sbjct: 595 DRGESFGFKPLC 606



 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 52/93 (55%), Gaps = 9/93 (9%)

Query: 121 VSKCMVTGGAVSAVVYNLRQRHPTLDIP------VIDYDLALLFQPLLVLGISIGVAFNV 174
           +S   + G ++    + +R+RHP   +P      VID+D  L+  PL + G  +GV FN 
Sbjct: 126 LSSTAIVGASIVQFFFQIRRRHP---LPGAQHRRVIDFDTILMLLPLALAGTVVGVIFNT 182

Query: 175 IFADWMITVLLIVLFLVLSTRAFLKGVETWKKE 207
           +  DW++ V++I++ +  + +  +KG E  ++E
Sbjct: 183 VSPDWLVLVVVIIVLVFTTFKTLVKGRELRRQE 215


>gi|449018085|dbj|BAM81487.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 607

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 87/163 (53%), Gaps = 3/163 (1%)

Query: 51  NIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLI 110
           NI +   A  +R  +K   P  +F WR  +   + F      S GGVGGG +FVP+L L+
Sbjct: 108 NIQLDCDANSNRCVHK---PLSRFTWRDGLTFGLIFVIAGLSSTGGVGGGFLFVPVLVLL 164

Query: 111 VGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGV 170
            GF A+ +AA+S+ +VTGG+ +   Y L  RHP  + P IDY +  +F   ++ G S+GV
Sbjct: 165 TGFQARRAAALSQALVTGGSGANAFYGLITRHPFRERPRIDYYVVTVFMATILCGTSVGV 224

Query: 171 AFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
             N++F ++    +L VL   +   +  K ++ WK E    R 
Sbjct: 225 ILNILFPNFFTLFMLAVLVAYVFYISIKKAIQLWKDERAASRN 267



 Score = 46.6 bits (109), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 33/151 (21%), Positives = 70/151 (46%), Gaps = 1/151 (0%)

Query: 328 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAIT 386
           +   +W  +++++     I  G VG  +G GG F+  P+ +  +G+ P V  ATA F   
Sbjct: 456 ERDMEWTRNRILWIAPLMIFLGAVGAWVGAGGSFMSTPILIAGVGMDPLVVQATAGFMNF 515

Query: 387 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 446
            ++  S ++YY+    P+ Y L     + + +        +L+      +I++F ++ ++
Sbjct: 516 VAAFSSAIQYYVNHELPLDYGLALGGTAFLGSLSFVVFFNRLVYKFKLQAILVFIMAGVM 575

Query: 447 FVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
           F +A+     G  ++   +   +   F NIC
Sbjct: 576 FGAAVLNIYAGALELKTTLNQGKPFPFGNIC 606


>gi|348690321|gb|EGZ30135.1| hypothetical protein PHYSODRAFT_353731 [Phytophthora sojae]
          Length = 449

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 103/194 (53%), Gaps = 12/194 (6%)

Query: 294 YWVLNLLQIPVAGGVSAYEAVALYK-GRRKIAS-----KGDEGTKWRASQLVFYCACGIT 347
           YW+L  L + +  G++ +    L    R K+       +GD    W   +++ +      
Sbjct: 255 YWILTFLPLLILVGITLWVGYHLRLLNRSKVLCDYPFVEGD--IHWIKRRILVFPTLCTM 312

Query: 348 AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP---- 403
           AG+  GLLG+GGG + GP+ LE+GI P V SATA F I F+SS + +++ +  +FP    
Sbjct: 313 AGVAAGLLGIGGGMVKGPIMLEMGILPPVQSATANFMILFTSSSTTLQFAINGQFPGERQ 372

Query: 404 VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK 463
             Y  +F  +  I    GQ V+  L+K   R SI+++ L+  I +SAL++G +GL   + 
Sbjct: 373 YDYIAWFALIGCIGGLCGQKVVAYLVKKYKRESIMVYLLAVTIGLSALAMGFIGLKSTLS 432

Query: 464 RIEHKEYMGFDNIC 477
            IE   ++GF  IC
Sbjct: 433 DIEKGVHLGFHGIC 446



 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 64/111 (57%), Gaps = 7/111 (6%)

Query: 112 GFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT-LDIPVIDYDLALLFQPLLVLGISIGV 170
           G   K +  +SK  + G AVSA ++N  ++HPT   +P+I+Y +A + +P  ++G   GV
Sbjct: 7   GLSPKHAIPLSKATIFGNAVSAYLFNFNRKHPTNAKLPLINYQVAGIMEPTTLIGAIFGV 66

Query: 171 AFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
             N +F DW+I VLL+ L   ++ +  LKG        I+++E+  QL ++
Sbjct: 67  MMNHMFPDWLILVLLVSLLSYITYKTILKG------NKISEKESKHQLSVV 111


>gi|403353250|gb|EJY76161.1| hypothetical protein OXYTRI_02333 [Oxytricha trifallax]
          Length = 548

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 91/189 (48%), Gaps = 6/189 (3%)

Query: 84  IGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP 143
           I    T C +V G+GGGG  VP+L     F+AK + A+S   +   AV+  ++N RQRHP
Sbjct: 70  ISILMTLC-TVAGIGGGGDLVPLLMSFFEFEAKEAVAISGFAIFLCAVTRYIFNFRQRHP 128

Query: 144 TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVET 203
           + D   IDY LA +  P + +G  +GV  N++F    + + L +L   L      K V  
Sbjct: 129 SKDSVQIDYGLASVMMPTVFIGSFLGVLINIVFPVLYLQIALTLLLCFLFYECTKKAVVI 188

Query: 204 WKKETITKREAAKQLELIVLGNG----YQTEECKCDPEYLSNDTTPEETREPKKSKVSII 259
           ++KE   ++    Q++ I +       YQ  E   + + L+   T +  +  K  + S  
Sbjct: 189 FRKENQAQQPTLSQVQGIKVDENQITSYQNNE-NINEKLLTKSFTKDSKKNSKIKRSSQR 247

Query: 260 ENIYWKEFG 268
            +I  +++ 
Sbjct: 248 ASILAEDYA 256



 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 52/195 (26%), Positives = 93/195 (47%), Gaps = 7/195 (3%)

Query: 289 TCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEG-----TKWRASQLVFYCA 343
           TCSV  W+     I     +S   AV   K ++++  K   G      K+   ++     
Sbjct: 355 TCSVWDWLCFAFFIVFCFFMSMV-AVNNLKKQQQLKMKSGRGLHPSDIKFEGKKVTKLIL 413

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
                G+V G  G+GGG I  PL L +G PP V+S+T ++ + FS+  S + Y +     
Sbjct: 414 VSSIGGLVSGAFGIGGGTIYNPLLLSMGAPPSVASSTGMYMVMFSNFGSSITYIVYGTLN 473

Query: 404 VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK 463
           V +  +       ++ +G  +L K++K L R S I+  L+ ++ +SAL +   G   M+K
Sbjct: 474 VKFGFWIGGFCCASSILGLFLLNKIVKKLNRQSPIVIILAVVMGISALLVPIFGSIDMVK 533

Query: 464 RIEHKEYM-GFDNIC 477
           ++   + +  F ++C
Sbjct: 534 QVRDGQSITQFSSLC 548


>gi|118348628|ref|XP_001007789.1| hypothetical protein TTHERM_00069510 [Tetrahymena thermophila]
 gi|89289556|gb|EAR87544.1| hypothetical protein TTHERM_00069510 [Tetrahymena thermophila
           SB210]
          Length = 549

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 89/155 (57%), Gaps = 7/155 (4%)

Query: 65  YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
           Y  +WP   F  +++V  +I F      +VGG+GGG I VP++ +++ + +K +  +S C
Sbjct: 25  YNDLWPPSAF--QVIVYILIPFI-IGVSNVGGLGGGIIKVPLITVMLNYPSKVATFISYC 81

Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
           ++ G +V      + ++HP  + P+IDY++ L+  P+++LG + G+  NV+  + +  V+
Sbjct: 82  ILFGSSVVHSTIIIFKKHPLFNKPIIDYNIVLVINPMVLLGTNAGILLNVLMPEIVAGVI 141

Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
           + V   +++     K +  +K   ITK++  +Q+E
Sbjct: 142 ICVYLSLIAPYILFKAISLYK---ITKKQ-QQQIE 172



 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 67/251 (26%), Positives = 107/251 (42%), Gaps = 39/251 (15%)

Query: 208 TITKREAAKQLE---LIVLGNGY------QTEECKCDPEYLSNDTTPEETREPKKSKVSI 258
           +I K E+A QLE   L+   N Y        ++ K     +S ++  E   EPK+ ++  
Sbjct: 247 SIQKMESAAQLEDQLLMQRVNSYYVPQEPPKDKTKRGSIIVSQNSLKETPEEPKQEQIMT 306

Query: 259 IE--NIYWKEFG--------LLVAVWAIILALQIAKNYT--------TTCSVLYWVLN-- 298
            E    Y +E+         LLV ++ II  +   +            TCS  YWV N  
Sbjct: 307 EELKAFYDEEYQQFPKKKILLLVIIFCIIQYIVFQRGGKGLQSFVGIKTCSASYWVSNGA 366

Query: 299 LLQIPVAG-GVSAYEAVALYKGRRKIASK--------GDEGTKWRASQLVFYCACGITAG 349
           +L + VA   V  Y  +   K + +I  K        GD     +    V   A G+ AG
Sbjct: 367 ILVLCVAAIFVIRYYLLKWTKNKNEIIKKYNLQKEFEGDFNVLNKTHYFVVLLA-GLAAG 425

Query: 350 MVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALY 409
           +V G +G+G G  L PL L +G+ PQV +AT  F   F ++ ++++ +          + 
Sbjct: 426 LVAGTVGVGAGLTLVPLLLSIGVHPQVVAATCGFNYLFIATTTIIQVFTSHYLSYAQIVL 485

Query: 410 FFALSIIAAFV 420
           F  LS +  F+
Sbjct: 486 FSLLSFVGGFI 496


>gi|302834698|ref|XP_002948911.1| hypothetical protein VOLCADRAFT_89258 [Volvox carteri f.
           nagariensis]
 gi|300265656|gb|EFJ49846.1| hypothetical protein VOLCADRAFT_89258 [Volvox carteri f.
           nagariensis]
          Length = 1005

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 70/108 (64%), Gaps = 1/108 (0%)

Query: 93  SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT-LDIPVID 151
           S  GVGGG IF+P+ N +VGF+ KSS A+S+  +T GA + ++ NL   HP+   + ++D
Sbjct: 32  SSSGVGGGAIFIPLFNALVGFNLKSSTALSQACITFGAAATLLANLMHHHPSDPTVALVD 91

Query: 152 YDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK 199
           + L  +  P+L++G+ IGV  NV+   W++T LL++L + L  ++  K
Sbjct: 92  FTLVAVLTPMLLMGVGIGVLLNVMLPSWLLTTLLLLLLIALVAQSLSK 139


>gi|40253413|dbj|BAD05342.1| unknown protein [Oryza sativa Japonica Group]
 gi|40253789|dbj|BAD05727.1| unknown protein [Oryza sativa Japonica Group]
          Length = 132

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 46/65 (70%), Positives = 56/65 (86%)

Query: 63  SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
           + Y HVWP ++FGW+IV+G +IGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+S
Sbjct: 55  ASYHHVWPPMEFGWKIVLGILIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAIS 114

Query: 123 KCMVT 127
           K +V+
Sbjct: 115 KFIVS 119


>gi|307108116|gb|EFN56357.1| hypothetical protein CHLNCDRAFT_144844 [Chlorella variabilis]
          Length = 670

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 68/119 (57%)

Query: 103 FVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLL 162
            VP+  +++GF    + A+S   + GG ++ + +N+++RHP    P+ID+DL ++ +P  
Sbjct: 149 LVPLNLMVLGFGTAPAVALSNITIVGGTLANLAFNVQRRHPARHGPLIDWDLIMVMEPST 208

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
           +LG  +G   N I   W+ TV L VL  ++S + F KG E   +E  +   + ++L ++
Sbjct: 209 ILGALVGGYLNKILPGWLTTVSLSVLLALISYKTFNKGREIHARERDSLHHSLQRLSVV 267



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 97/214 (45%), Gaps = 15/214 (7%)

Query: 281 QIAKNYTTTCSVLYWVLNLLQIPVA-GGVSAYEAVALYKGRRKIASKGD--EGT-KWRAS 336
            + K+     S+ YW+     +PV    +SA     L KG  + AS  +  EG  +W   
Sbjct: 453 DVLKSQAACPSLPYWLAATAMVPVTLATLSAVRTYLLKKGAAQRASGHELLEGDVEWTPG 512

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFI-----------LGPLFLELGIPPQVSSATALFAI 385
             + Y A    AG+  G+ G+GGG I            GPL LE+G+ P+V++AT+   I
Sbjct: 513 STLLYPALCSFAGLAAGIFGVGGGIIKARWLAVRENPAGPLMLEMGVLPEVAAATSATMI 572

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 445
            F+S+ + V +         YA   F + +I    GQ ++  + + +   S+++F ++ +
Sbjct: 573 FFTSAFASVVFISFGAVQWDYAAMLFTMGLICTAAGQLLVLWVNQHIKSRSLLVFVMATV 632

Query: 446 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 479
           + VS+++L   G           +   F  IC T
Sbjct: 633 LSVSSVALAVQGAQSTAAAAAAHDLWHFHGICGT 666


>gi|307109812|gb|EFN58049.1| hypothetical protein CHLNCDRAFT_50737 [Chlorella variabilis]
          Length = 752

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 42/125 (33%), Positives = 77/125 (61%), Gaps = 1/125 (0%)

Query: 89  TACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDI 147
           +A  +  GVGGG +FVP+ ++++    K++ A+S+ ++TGGA+ +V ++L ++HP     
Sbjct: 339 SALSNAAGVGGGAVFVPLFSVLLSISIKAATALSQAVITGGAIGSVAFSLTRQHPLRPGA 398

Query: 148 PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE 207
           P+ID+ LAL   P L+LG++ GV  N+    W++T+LLI L +  + R    G+   + E
Sbjct: 399 PLIDFSLALTLLPPLLLGVATGVLLNLALPAWLVTILLIPLLITFAIRTAATGLSMRRAE 458

Query: 208 TITKR 212
              ++
Sbjct: 459 KQAQQ 463


>gi|326433239|gb|EGD78809.1| hypothetical protein PTSG_01784 [Salpingoeca sp. ATCC 50818]
          Length = 589

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 97/210 (46%), Gaps = 12/210 (5%)

Query: 260 ENIYWKEFGLLVAVW----AIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVA 315
             + W++  LL+ VW     I + L  A +    CS  + VL L  IP    ++ +    
Sbjct: 358 RTVPWEKLVLLILVWLGYTTITMLLYEANDVIKPCSAGWIVLLLCAIPYVIAITYFAGRM 417

Query: 316 LYKG--RRKIAS----KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE 369
           L +   R++  +     GD    W  + L  + A    AG+   ++G+GGG I  P+ L 
Sbjct: 418 LKRQTVRKRKCNYPFLPGD--VMWEGANLNKFPALAFFAGVAAAMMGIGGGMIKSPIMLA 475

Query: 370 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 429
           +G+ PQV + T+ F I F+SS + ++Y +L +          ++    A VGQ V+  +I
Sbjct: 476 MGLQPQVVTTTSSFMIIFTSSATTLQYLILGKLKPQQLGIVMSMGFAGAVVGQRVVNYII 535

Query: 430 KILGRASIIIFTLSFIIFVSALSLGGVGLA 459
               + S +IF L  +  VS + +    LA
Sbjct: 536 AKYKKQSFLIFLLGGLTIVSGIIIVATSLA 565



 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 92/171 (53%), Gaps = 7/171 (4%)

Query: 50  PNIVIKVSAPKSRSGYK----HVWPDIK--FGWRIVVGTIIGFFGTACGSVGGVGGGGIF 103
           PN     + P   + +     HV   IK    W+  V +I  F   +     G+GGG +F
Sbjct: 30  PNTSSTQTTPHRITSHHISSHHVTALIKMVLAWQWAVASICVFLCASLAVGAGIGGGALF 89

Query: 104 VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLL 162
           V +  +I+G DA ++  +SK  + G A++A   NL +RHP +   P+IDYD AL+ +P+ 
Sbjct: 90  VGIYMIILGMDAHAAVPLSKATIFGLAIAAYSVNLWKRHPHSQQRPLIDYDTALMLEPMT 149

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
           +LG  +GV  NV+F +W++ + L +L +V+S R   KG+    KE  T  +
Sbjct: 150 LLGAIVGVLLNVLFPNWLVLLPLCLLLMVVSYRTIRKGLRLRAKEKGTPHQ 200


>gi|428174009|gb|EKX42907.1| hypothetical protein GUITHDRAFT_110958 [Guillardia theta CCMP2712]
          Length = 197

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/127 (40%), Positives = 75/127 (59%), Gaps = 4/127 (3%)

Query: 86  FFGTACGSV----GGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQR 141
           FF T C S     GG+GGGGI VP+L  + GF    +  +++  V G ++  ++YN+R+R
Sbjct: 17  FFLTFCASALGAAGGIGGGGIMVPLLVSVGGFSVHHAIPLTQATVLGASIMNLIYNVRKR 76

Query: 142 HPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGV 201
           +P LD P+IDY+ AL+ +   +LG  IGV  N I   W+IT+LLIV     + R   KG+
Sbjct: 77  NPVLDRPLIDYNTALILEVTTLLGTVIGVDVNKISPVWLITILLIVTLGYTTYRTLRKGL 136

Query: 202 ETWKKET 208
           E    ET
Sbjct: 137 ELRAIET 143


>gi|118395416|ref|XP_001030058.1| hypothetical protein TTHERM_01164150 [Tetrahymena thermophila]
 gi|89284345|gb|EAR82395.1| hypothetical protein TTHERM_01164150 [Tetrahymena thermophila
           SB210]
          Length = 503

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 83/361 (22%), Positives = 157/361 (43%), Gaps = 57/361 (15%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           I VP+L    G++ K S A+    +   ++  ++  ++Q+      PVIDY + +L  P 
Sbjct: 135 IVVPVLMSFFGYETKKSIALVFITIFSASLGNLMSFMKQKSKDGG-PVIDYRIVVLSLPT 193

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
           +++G   GVA N  F   ++    +  F++ S        +T+K     KRE AK+++  
Sbjct: 194 IMVGSIYGVALNK-FIPQIVLAFALAFFILQSL------TKTYKS---YKREKAKEVQ-- 241

Query: 222 VLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEF---GLLVAVWAIIL 278
              N    +  +  P Y        E ++P ++ +  I     KE     LL  ++ I L
Sbjct: 242 --ENQNNNKSDQSSPLY--------ELKQPNENGLPPISQSSKKEQYPKSLLSKIFCITL 291

Query: 279 ALQI------AKNYTT-----TCSVLYWVLNLLQIPVAGGVSAYEAVALYKG-------R 320
              +         + +      C  LY + NL         SAY A  L KG       +
Sbjct: 292 GFAVFSLLRGGSKFDSLLGIPPCGFLYQISNL--------ASAYVAYLLVKGIIAGLVIQ 343

Query: 321 RKIASK-----GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQ 375
            KI  K       + T+  A ++  +      AG++G   GLGGG +L P +LE GIP  
Sbjct: 344 NKIEEKLVVNTSSDDTQLTAQEMQGFAVTAFLAGLIGSTFGLGGGMVLVPKWLEQGIPSY 403

Query: 376 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 435
            ++  ++  +  ++  S +++ L   +     +YF  ++++++F+    +++ +K   +A
Sbjct: 404 KTTPCSISLLFLTAFNSAIQFALGGVYETEEIIYFSVIALLSSFIVSSSIQQYVKKTNQA 463

Query: 436 S 436
           S
Sbjct: 464 S 464


>gi|167525260|ref|XP_001746965.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774745|gb|EDQ88372.1| predicted protein [Monosiga brevicollis MX1]
          Length = 499

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/127 (34%), Positives = 71/127 (55%), Gaps = 1/127 (0%)

Query: 86  FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-T 144
           F G A     G+GGG  FV +  L++G DA  +  +SK  + G +++A   N+ +RHP  
Sbjct: 15  FLGAALNVGAGIGGGATFVAIYFLVLGEDAHGAVPLSKATIFGLSLAAFAVNIWKRHPHD 74

Query: 145 LDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETW 204
              P+IDYD A++ +P+ +LG  +GV  NVIF +W++ + L +L   +S +   K     
Sbjct: 75  PKRPLIDYDTAMMLEPMTLLGGILGVILNVIFPNWLVLLPLCLLLGFISYKTLKKAWNMH 134

Query: 205 KKETITK 211
           KKE   +
Sbjct: 135 KKELAAR 141



 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 62/227 (27%), Positives = 98/227 (43%), Gaps = 37/227 (16%)

Query: 264 WKEFGLLVAVW----AIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKG 319
           W    +L AVW     +   L  A++    C   + VL L  IP   GV+   A  L+  
Sbjct: 259 WDRILMLGAVWIGYFVLTFLLYRAEDAVPRCEAGWVVLLLCSIPYVVGVTYLFARRLH-- 316

Query: 320 RRKIASK--------GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           R+ I  K        GD    W    L +Y      AG+   ++G+GGG I  PL L +G
Sbjct: 317 RQTIQKKAVGYVFHPGD--VMWEGRNLYYYPEMAFFAGLCASMMGIGGGLIKSPLMLSMG 374

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII--------------A 417
           + PQ ++ T+ F I F+SS S  + +L      P A ++  L  +               
Sbjct: 375 LNPQTTTTTSSFMIIFTSSASTFQVWLRALVSTPKAPHYLILGKLHGAELAAVMASGFAG 434

Query: 418 AFVGQHVLKKLIKILGRASIIIFTLS-------FIIFVSALSLGGVG 457
           A +GQ V+  L++   + S+++F L         I+F  A++ G +G
Sbjct: 435 ALMGQKVVNHLVQKYQKQSLLMFLLGGLTVLSVVILFSLAIADGKIG 481


>gi|301118274|ref|XP_002906865.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108214|gb|EEY66266.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 260

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 59/237 (24%), Positives = 115/237 (48%), Gaps = 35/237 (14%)

Query: 231 ECKCDPEYLS----NDTTPEETREPKKSKVSIIE--NIYWKEFGLLVAVWAIILALQI-- 282
           +C   PE  +      ++ +  +  +++ + ++   NI W++   L +++ ++  +++  
Sbjct: 35  QCGVSPEATTLLGIASSSADTLKAHREADLKVVSRANIPWRKLATLASLFVVVAGMRVLR 94

Query: 283 -AKNYTT-----TCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRAS 336
             KN+ +     + S LY VL       +GG   YE  A                KW  S
Sbjct: 95  GGKNFDSPVGIDSSSTLYPVLQ-----QSGG---YELEA-------------HEIKWTPS 133

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            + ++  C + AG V G+ G+GGG I GPL LE+G+ P  +SA     + FSS MS + Y
Sbjct: 134 SIRYFPMCSLAAGAVSGMFGIGGGIINGPLLLEVGVDPSAASAMTATTVLFSSGMSSLNY 193

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
             + +  +  A     + ++  ++G   L KL++     S+IIF+++ I+ +SA+++
Sbjct: 194 AAMGKMDLHLAQLMLPMGLVTTYIGHLCLLKLVRHYNCPSMIIFSMATIVLISAVAM 250


>gi|221481168|gb|EEE19573.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 299

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 61/100 (61%)

Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
           + L++ F+   + A S+C++TG +++ ++ N  +R+P +D+P+ID D+ LL  P+ + G 
Sbjct: 1   MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60

Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
           SIGV  N +   W+I VLL+V  L  + R   +  +  K+
Sbjct: 61  SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100


>gi|384252265|gb|EIE25741.1| hypothetical protein COCSUDRAFT_64812 [Coccomyxa subellipsoidea
           C-169]
          Length = 553

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 75/114 (65%), Gaps = 1/114 (0%)

Query: 103 FVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPL 161
           FVP+ N+++ F  K +AA+S+ ++ GGA++ V   L ++HP     P+ID+DLAL+  P+
Sbjct: 91  FVPLFNILLQFSVKGAAALSQAVIAGGALAGVGVTLHKKHPYDPSKPLIDFDLALMLLPV 150

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAA 215
           ++LG+S+GV  N +F +W+ITVLL++L + L+     K +   + E + K E A
Sbjct: 151 ILLGVSVGVLANQLFPNWLITVLLLLLLIFLTHMTVKKALSLHRAEVLYKAEQA 204



 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 65/263 (24%), Positives = 120/263 (45%), Gaps = 10/263 (3%)

Query: 221 IVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILAL 280
           +V G   +T+    +  ++S D    +    ++ +    +   W +  +L+  W I +  
Sbjct: 281 VVSGARARTQIVSTEVGHMSGDEGEHDEDRQQEHRSVRSQPATWLQVVVLLGCWGIFVTF 340

Query: 281 QIAKNYTTTCSVLYWVLNLLQ--IPVAGGVSAYEAVALYKGRRK--------IASKGDEG 330
           Q+  +    CS  YW +  +Q  + +   V+    V+  K R +        +AS   E 
Sbjct: 341 QLLLSRWPHCSGPYWAIFAVQAGLCLIAEVAFLRLVSARKQRSQESRLEQPMLASVYKEA 400

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 390
             W   +L+      + AG + GLLG+GGG I+ PL LE G  P V++AT+   + FSSS
Sbjct: 401 PAWTLPRLIRSAIITLLAGFIAGLLGIGGGMIVNPLLLEFGTHPHVAAATSTLMVLFSSS 460

Query: 391 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 450
            + + +         +AL F    + A+ +G  ++ ++++  G ASII+F L+ +I   A
Sbjct: 461 SAALSFGFSHLLNAQFALVFGLCCMAASLIGVLIVSRIVERSGNASIIVFLLALVIATGA 520

Query: 451 LSLGGVGLAKMIKRIEHKEYMGF 473
                 G    ++ + H   +GF
Sbjct: 521 TLTAAFGGRFAVQDLIHHRSIGF 543


>gi|237844563|ref|XP_002371579.1| hypothetical protein TGME49_020560 [Toxoplasma gondii ME49]
 gi|211969243|gb|EEB04439.1| hypothetical protein TGME49_020560 [Toxoplasma gondii ME49]
          Length = 299

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 61/100 (61%)

Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
           + L++ F+   + A S+C++TG +++ ++ N  +R+P +D+P+ID D+ LL  P+ + G 
Sbjct: 1   MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60

Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
           SIGV  N +   W+I VLL+V  L  + R   +  +  K+
Sbjct: 61  SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100


>gi|221501858|gb|EEE27611.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 299

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 61/100 (61%)

Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
           + L++ F+   + A S+C++TG +++ ++ N  +R+P +D+P+ID D+ LL  P+ + G 
Sbjct: 1   MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60

Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
           SIGV  N +   W+I VLL+V  L  + R   +  +  K+
Sbjct: 61  SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100


>gi|294944335|ref|XP_002784204.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897238|gb|EER16000.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 354

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 83/358 (23%), Positives = 155/358 (43%), Gaps = 57/358 (15%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL------RQRHPTLDIPV 149
           GVGGG +FVP+LNL+    +K++ A+S+ ++    V+ V+++L      R+R   +++P 
Sbjct: 27  GVGGGILFVPLLNLVGRLPSKNATAMSQGLIMSATVAKVLFSLHAQYTSRRRGRVINLPY 86

Query: 150 IDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWK 205
           +     +L  P +++G  IG+        W+  ++ ++L+++     S  + LKG   W+
Sbjct: 87  V-----VLMLPCMIVGGLIGIYIY----SWLPVLIQLILYVITALFGSAMSLLKGFHLWR 137

Query: 206 KETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSK-VSIIENIYW 264
            ET  K +A +  E+   G        K           P   R   + K V+ +  I+ 
Sbjct: 138 AETSEKEKAKRDAEVSAGGT------LKVPASSTVETVVPPLLRPITRRKAVACVITIF- 190

Query: 265 KEFGLLVAVWAIILALQIAKNYTTT--------CSVLYWV-LNLLQIPVAGGVSAYEAVA 315
                  AVW +++  ++    ++T        C  LYW    ++ + +     AY  + 
Sbjct: 191 -------AVWILVILSRLILGSSSTRSIVGISYCEGLYWALSVVVVVVLLMVPLAYALID 243

Query: 316 LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQ 375
              G  K            A+  +     GI  G +  ++G+ GG I+ PL +  G+ P 
Sbjct: 244 RSPGSSK------------AALTLSGSLLGI--GFLAAVVGISGGIIITPLVMFTGLTPP 289

Query: 376 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 433
            +S T    I  +SS   + + L    P   AL+  AL    A  G  +L ++++  G
Sbjct: 290 QASGTGSVVILVNSSSLALSFGLGGYLPDASALWIIALPFCGALTGDIILTRIMRRTG 347


>gi|299115466|emb|CBN75630.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 236

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 70/116 (60%), Gaps = 2/116 (1%)

Query: 103 FVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLL 162
            VP+  L++ F  K +  +S   + GGA++    N+++RHP  D P++D+DL L+ +PL 
Sbjct: 38  LVPIFILVLRFGPKYAVPLSNITIFGGAITNTFLNMKKRHPLADRPLVDWDLILVMEPLT 97

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
           + G  +G     +  + ++T+ +++L +  + R F KG++ +KKE+  ++  AK+L
Sbjct: 98  IGGALVGSFIQKVLPEVVLTLSMVLLLVATADRTFRKGLKAFKKESSLQQ--AKEL 151


>gi|66825573|ref|XP_646141.1| hypothetical protein DDB_G0269644 [Dictyostelium discoideum AX4]
 gi|60474235|gb|EAL72172.1| hypothetical protein DDB_G0269644 [Dictyostelium discoideum AX4]
          Length = 549

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 51/135 (37%), Positives = 79/135 (58%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           +VG I+ F G A  S GGVGGGGI++P+L L+  +D K++  +S C+V G A++  + N 
Sbjct: 137 IVGFILLFIGCALSSGGGVGGGGIYIPILILVSKWDPKTAIPLSNCLVAGCALANFIQNF 196

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
            +RHP  +  +IDY +ALL +PL + G   GV  +  F   +I +LL++     S +   
Sbjct: 197 PRRHPFSNKHLIDYSVALLIEPLTLAGTIFGVYLHTYFPPLVILLLLVITLGFTSYKTIS 256

Query: 199 KGVETWKKETITKRE 213
           KG++ WK E   K  
Sbjct: 257 KGIDIWKSEKKKKNS 271


>gi|301118272|ref|XP_002906864.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108213|gb|EEY66265.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 439

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 88/449 (19%), Positives = 182/449 (40%), Gaps = 57/449 (12%)

Query: 22  LVVASVSVSAQSNLK--PEATSKNKNDHVEPNIVIK-VSAPKSRSGYKHVWPDIKFGWRI 78
           LV+A   V   + +K  P  T   + +  EP  V++ V  P                   
Sbjct: 20  LVLAQRPVGEDNPVKFGPVPTFPPQGEQYEPRGVLRSVEKPHQ----------------- 62

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           ++G ++        S   +GGG +   +  L++      +  ++   V GGAV     NL
Sbjct: 63  LLGILLAALVVLISSGAKIGGGAVLDAVYILVLKLGPDEAIPLASITVFGGAVCDFFLNL 122

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
            ++    + P+I++D  L+ QP+L++G + G +    F+ W++T+ LIV  + +  +AF 
Sbjct: 123 WKKPINSNFPLINWDFMLIMQPMLLMGAAFGASMISWFSTWLLTIALIVYLVYVGKKAFK 182

Query: 199 KGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 258
           K      +E                  G++   C    E +S    P  + +        
Sbjct: 183 KARAVGHEE------------------GWR--WCSSS-ETMSLLGAPSMSFQDDDGSFQY 221

Query: 259 IENIYWKEFGLLVAVWA---IILALQIAKNYTT-----TCSVLYWVLNLLQIPVAGGVSA 310
              + W++ G+   ++    ++ ALQ  + + +       S  + ++++L       VS 
Sbjct: 222 KSGLSWRKLGINFGIFTATVLLTALQGGRYFPSPLGIPPTSFFFLIVSMLPFIFLSVVSH 281

Query: 311 Y---EAVALYKGR---RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 364
           Y   +AVA Y+ +   R I +  +   +W    +       +  G + G  G+GG     
Sbjct: 282 YQMKDAVATYQRQQNPRFILAPNE--VQWSPDAIRKIPLRLLGIGAIAGAFGVGGEGATS 339

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
            L   +   P   SA +  A+ F S M+  ++ L  +  +  A +   L  +   +G+  
Sbjct: 340 SLLRGVNFTPAAVSAMSATAVFFVSGMASFDFLLWGKLDLNLAKFLMPLGFMMTLLGRLC 399

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSL 453
           L ++++     ++++F ++  +F+S   L
Sbjct: 400 LIRIVRKAKSRTLLLFAIAAAMFISIFPL 428


>gi|118348626|ref|XP_001007788.1| hypothetical protein TTHERM_00069500 [Tetrahymena thermophila]
 gi|89289555|gb|EAR87543.1| hypothetical protein TTHERM_00069500 [Tetrahymena thermophila
           SB210]
          Length = 570

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 70/126 (55%), Gaps = 3/126 (2%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
           PM+ L++ ++ K +  +S C++ G  ++     + ++HP  D P+IDY++ L+  P+++L
Sbjct: 63  PMVMLMLNYETKIATFISYCVLFGSCLANSTLLIFKKHPLYDKPIIDYNIVLMINPMVLL 122

Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVLG 224
           G +IG+  N++  +    +L I   +++S   F KG+  +K   + K +   QL   +L 
Sbjct: 123 GTNIGIFLNILLPEIAAGILFIGFLILVSPYMFKKGLNLYK---LKKEQQKNQLSQSLLE 179

Query: 225 NGYQTE 230
           N  + E
Sbjct: 180 NDQENE 185



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 83/181 (45%), Gaps = 19/181 (10%)

Query: 288 TTCSVLYWVLN--LLQIPVAGGVSAYEAVALYKGRRKI-----ASKGDEGTKWRASQLVF 340
           TTCS+ YW+ N  +L + VA        +  ++  +KI       K +       +  + 
Sbjct: 369 TTCSISYWITNAGILVLAVAAVFVIRFFLLKWEKNKKILIEKYNLKEEFADDLDVTNSIM 428

Query: 341 YC---ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           Y      G+TAGM+ G  G+G G  L P+ L  G+ PQV++AT  F   F S+ ++++ +
Sbjct: 429 YAKISVAGLTAGMLAGTFGVGAGLALVPILLASGVNPQVAAATCGFNYFFISTTTIIQVF 488

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT---------LSFIIFV 448
                 +   + F  LS I  FV    +   ++    + +++F          +SFII++
Sbjct: 489 TNSYLDLSQIVLFSILSFIGGFVIAKFIYNFVERKKASYLVVFMVFGLAILNIISFIIYL 548

Query: 449 S 449
           +
Sbjct: 549 T 549


>gi|403372026|gb|EJY85900.1| hypothetical protein OXYTRI_16112 [Oxytricha trifallax]
          Length = 499

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 96/432 (22%), Positives = 173/432 (40%), Gaps = 60/432 (13%)

Query: 76  WRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVV 135
           W ++  T +    TA G  GGV    +  PM     GF  + + A+S   +    V+   
Sbjct: 43  WGLIAITFLMLLCTAGGIGGGV----VVAPMCIAFFGFGTREAVALSGFCILICQVTKWA 98

Query: 136 YNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTR 195
           Y+  Q+HP  D   IDY LA +  P ++LG  IG    ++    ++ +LL  L   L+ +
Sbjct: 99  YSWSQKHPEKDTVSIDYGLATVMLPTVLLGSFIGTFVTILVPPIVLQILLTALLTFLTVQ 158

Query: 196 AFLKGVETWKKET----------------------ITKREAAKQLE------LIVLGNGY 227
           + LK  E ++KE                          R++ + ++      L +     
Sbjct: 159 SGLKAKEIYEKENAKIKKLKEAEEAKAAAEADKMAKANRQSVQVVDTVDGKRLSINRGSL 218

Query: 228 QTEECKCDPEYLSNDTTPEETRE----------PKKSKVSII------ENIYW---KEFG 268
           + ++     +   NDTT + + +          P + ++  +      E  +W   K   
Sbjct: 219 REKKPTIASKVHDNDTTDDPSAKLLHGGDYSEGPTQEEIDKVDAMLRREKTHWQWDKHLT 278

Query: 269 LLVAVWA-----IILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVS--AYEAVALYKGRR 321
            LV + +     II   +  K+    C V  W+L  L   +   VS  A + +   +  +
Sbjct: 279 CLVVLVSQVIINIIRGSKSTKSVVEHCGVADWILVGLYAAICITVSVIAIKRIIAEQALK 338

Query: 322 KIASKG--DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 379
               KG      ++    +          G   G LGL GG I  PL L  G+PP V+S+
Sbjct: 339 TKVGKGLIPSDIRFNKQTVRSVVITAFVGGWASGCLGLSGGAIFNPLLLNQGVPPSVASS 398

Query: 380 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 439
           T ++ I FS+  + V Y +       +  +      +A+  G + L K++K  GR S ++
Sbjct: 399 TGMYMILFSTIGTCVVYSIQGSLNFAFGGWIGGWCCLASVGGMYALDKVVKKFGRQSPLV 458

Query: 440 FTLSFIIFVSAL 451
             L+ ++ +S +
Sbjct: 459 VVLTGVLALSTI 470


>gi|118348622|ref|XP_001007786.1| hypothetical protein TTHERM_00069480 [Tetrahymena thermophila]
 gi|89289553|gb|EAR87541.1| hypothetical protein TTHERM_00069480 [Tetrahymena thermophila
           SB210]
          Length = 575

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 68/115 (59%), Gaps = 1/115 (0%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
           PML L++ +  K +  +S C++ G  ++     + ++HP LD P+IDY++ L+  P+++L
Sbjct: 63  PMLMLMLNYQTKVATFISYCILFGSCLANSTLLIFKKHPLLDKPIIDYNIVLMINPMVLL 122

Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK-KETITKREAAKQL 218
           G +IG+  N++  +    +L I   +++S   F KG+  ++ K+   K + ++QL
Sbjct: 123 GTNIGIFLNILLPEIAAGILFICFLILISPYMFKKGLNLYRLKKEQQKCQLSEQL 177



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 85/185 (45%), Gaps = 16/185 (8%)

Query: 288 TTCSVLYWVLN---LLQIPVAGGVSAYEAVALYKGRRKIASK---GDE-GTKWRASQLVF 340
           TTCS  YWV N   +L   VA  V  +      K +R    K    DE  +    +  + 
Sbjct: 374 TTCSTAYWVTNGGIVLLAFVAAFVIRFFLQKWEKNKRIFIEKYHLEDEFASDLDVNNNMN 433

Query: 341 YCA---CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           Y      G+TAGM+ G  G+G G  L P+ L  G+ PQV++AT  F   F S+ ++++ +
Sbjct: 434 YIKISLAGLTAGMLAGTFGVGAGLALVPILLASGVHPQVAAATCGFNYFFISTTTIIQVF 493

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF------TLSFIIFVSAL 451
                 +   + F  LS I  F+   ++ + ++    + +I+F       L+ + F+  L
Sbjct: 494 TNDYLNISQIILFSILSFIGGFLCAKLIYRYVEKKKASYLIVFIVFGLAILNILAFIVYL 553

Query: 452 SLGGV 456
           +  GV
Sbjct: 554 TKKGV 558


>gi|154342935|ref|XP_001567413.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134064745|emb|CAM42850.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 510

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 99/400 (24%), Positives = 181/400 (45%), Gaps = 39/400 (9%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP--TLDIPVIDYDLALLFQPLL 162
           P+   I+    + +  +S+  + G +  ++   ++Q+HP  + D P+I+Y    L  PL 
Sbjct: 120 PLFCGIMKIPMQFAVGMSQATICGQSTVSMYLIVQQKHPDSSWDRPLINYQYLSLILPLG 179

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLST---RAFLKGVETWKKET--------ITK 211
           ++G  IG   + +F D    + L++LF++LST   R  +K V+ +K++T           
Sbjct: 180 LIGTLIGGILSKLFPD---VLRLLLLFVILSTALYRTLVKVVKQYKEDTNAWQATVEADG 236

Query: 212 REAAKQ------------LELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSII 259
             AA Q             ELI      ++E  +      ++  +      P +S+    
Sbjct: 237 ANAASQQGNYGNNDERQGYELINRAVDAKSESLQGTTVLATSAHSLPAFENPPQSQYP-- 294

Query: 260 ENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKG 319
                +E  +    + ++L   I + Y     +LYW+  L  IP+   +S    +   K 
Sbjct: 295 ----QQELAMNFTCFLVLLLFNILRMYLVCGGLLYWLCVL--IPLV-FLSTVFYLNYEKL 347

Query: 320 RRKIASKGDEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS 378
           R+ + S   + T  W     V Y    + AG    +LG+GGG +LG +  E+G+ PQ +S
Sbjct: 348 RKLVESDSAQVTFAWTQKNTVMYPMVAVIAGASAAMLGIGGGLVLGFVLNEVGLVPQQAS 407

Query: 379 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASII 438
           AT   A  F +  SV+E  +     V + + F  + + +  +GQ V  + IK  G   +I
Sbjct: 408 ATGGMATLFIAFSSVLELLVTGHLVVDFGIVFCIVGLCSGSLGQFVFMEYIKGHGLNYLI 467

Query: 439 IFTLSFIIFVSALSLGGVGLAKMIKRIEH-KEYMGFDNIC 477
           I +L+F++  S +SLGG G+   +  +   +  M F ++C
Sbjct: 468 IGSLAFVLGGSLVSLGGYGIYTTVISVHAGRSLMAFGHLC 507


>gi|146096258|ref|XP_001467747.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398020624|ref|XP_003863475.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134072113|emb|CAM70812.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501708|emb|CBZ36789.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 511

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 180/384 (46%), Gaps = 46/384 (11%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP--TLDIPVIDYDLALLFQPLL 162
           PM   ++    +S+  +S+  + G +   +   +RQ+HP  + D P+I+Y    L  P  
Sbjct: 120 PMFCGLMEIPMQSAVGMSQSTICGQSTLNMYLIIRQKHPDSSWDRPLINYQYLSLLLPPG 179

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET-------------- 208
           ++G  IG   + +  D +  +LL+VL  V+  R++    + ++++T              
Sbjct: 180 LIGTLIGGILSKLCPDVLRLILLLVLLSVVLYRSWETMKKQYRQDTDPMHVTVEANDANA 239

Query: 209 ITKREA------AKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENI 262
            + RE+      ++  EL     G +   C    E ++   TPE++  P +         
Sbjct: 240 TSHRESYDGNGKSQSRELTETAGGAKKGLC----EDIAVLPTPEQSPPPIERPPQ--SQY 293

Query: 263 YWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK 322
             +E  + +A + ++L   I + YT     LYW+  L  +PV     A+ +V  Y  R K
Sbjct: 294 LQQELSMNIACFLVLLLFNIFRTYTVCGGFLYWLCVL--VPV-----AFLSVVFYFNREK 346

Query: 323 IA--SKGDEGT---KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 377
           +   ++ D       W     V Y    + AG    +LG+GGG +LG +  E+GI PQ +
Sbjct: 347 LRKLTESDPAQMTFTWTQRNSVTYPMVAVLAGGSAAMLGIGGGLVLGFVLNEVGIVPQEA 406

Query: 378 SAT---ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 434
           SAT   A F I FSS++ ++   +     V +++ F  + + ++ +GQ V+   IK  G 
Sbjct: 407 SATGGMATFFIAFSSALHLL---ITGSLVVDFSVVFCIVGLCSSALGQLVIMNYIKRRGL 463

Query: 435 ASIIIFTLSFIIFVSALSLGGVGL 458
           + +II +L F++  S ++LGG G+
Sbjct: 464 SYLIIGSLVFVVGGSLVALGGYGI 487


>gi|159469083|ref|XP_001692697.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277950|gb|EDP03716.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1854

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 31/84 (36%), Positives = 56/84 (66%), Gaps = 2/84 (2%)

Query: 87   FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
            F +A  +  GVGGG I++P+ N ++GF+ K+S A+S+  +T G+++A+  NL +RHP   
Sbjct: 1663 FASAVATASGVGGGAIYIPLFNALIGFELKASTALSQACITAGSLAALGSNLHRRHPLRP 1722

Query: 147  IP--VIDYDLALLFQPLLVLGISI 168
                ++D+ L L+  P+L++G S+
Sbjct: 1723 EAWHLVDFRLMLVLTPVLLVGSSL 1746


>gi|221482011|gb|EEE20377.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 665

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 65/118 (55%), Gaps = 4/118 (3%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           I+VP++ L++GF    + A S+ ++ GG+++    NL +RHP  D P ID DL LL  P+
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
            + G + G+  N  +  ++I  LL+VL    + + F + +   K E    R   KQLE
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTF-RQMMRLKHEGAAAR---KQLE 258


>gi|221505087|gb|EEE30741.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 665

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 65/118 (55%), Gaps = 4/118 (3%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           I+VP++ L++GF    + A S+ ++ GG+++    NL +RHP  D P ID DL LL  P+
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
            + G + G+  N  +  ++I  LL+VL    + + F + +   K E    R   KQLE
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTF-RQMMRLKHEGAAAR---KQLE 258


>gi|237836929|ref|XP_002367762.1| hypothetical protein TGME49_005300 [Toxoplasma gondii ME49]
 gi|211965426|gb|EEB00622.1| hypothetical protein TGME49_005300 [Toxoplasma gondii ME49]
          Length = 665

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 65/118 (55%), Gaps = 4/118 (3%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           I+VP++ L++GF    + A S+ ++ GG+++    NL +RHP  D P ID DL LL  P+
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
            + G + G+  N  +  ++I  LL+VL    + + F + +   K E    R   KQLE
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTF-RQMMRLKHEGAAAR---KQLE 258


>gi|145528512|ref|XP_001450050.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417650|emb|CAK82653.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 104/384 (27%), Positives = 184/384 (47%), Gaps = 47/384 (12%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           ++G+++G         GG+GGG I  P++ +++G  +K +   +  M+ GG++       
Sbjct: 19  ILGSLVGL-----AQAGGIGGGPIVSPVMMVLLGCSSKQAIWNTYIMLFGGSIGNFARLG 73

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
           R++      P+I+Y L  +  PLL+ G  +GVA       W+  +++++    +    FL
Sbjct: 74  REKIQDGSSPLINYQLVQITLPLLLAGAILGVA----SGKWLPKLVIVIFLFAILLNVFL 129

Query: 199 KGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPK--KSKV 256
           K    +KK     RE  +   LI +    + +E   + +    +  P+  ++ K  +SK+
Sbjct: 130 KTKNVYKKV----REKERNDLLIQV----EMKEININDQ----NILPQNLQQLKDNESKL 177

Query: 257 SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGV 308
              EN+  KE  L V    I++AL + K   T  S+L        Y  +N     +  GV
Sbjct: 178 YPTENL--KEIALSV---FIVVALTLLKGAATIPSILGIGYCGYGYHFINF----IIFGV 228

Query: 309 SAYEAVALYKGRRK-----IASKGDEGTKWRASQLV-FYCACGITAGMVGGLLGLGGGFI 362
             Y  V  Y+ + K       S G + +  + S++        + AG +GGL+GLGGG +
Sbjct: 229 GFYN-VQRYRQQIKKDEVLKESIGYDFSGGKISEVYDITVKSSMKAGFLGGLVGLGGGVV 287

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
           L P++LE GI P  ++A+A F + F+S +SV    L   + +   L   ++S   +++  
Sbjct: 288 LTPVWLETGINPARAAASATFTVMFTSFISVFIIALSGGYQLSQFLILASVSGCGSYLVA 347

Query: 423 HVLKKLIKILGRASIIIFTLSFII 446
            +LKKL+K   R SIII  L  +I
Sbjct: 348 GILKKLVKKYKRESIIIQVLLVVI 371


>gi|110740663|dbj|BAE98434.1| hypothetical protein [Arabidopsis thaliana]
          Length = 210

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 57/199 (28%), Positives = 103/199 (51%), Gaps = 14/199 (7%)

Query: 290 CSVLYWVLNLLQIPV----------AGGVSAYEAVALYKGRRKIAS-KGDEGTKWRASQL 338
           C   YW+++  QIP+          +  V + +    +   + +   + ++G +   S  
Sbjct: 15  CGNAYWLISSSQIPLTLFFTLWICFSDNVQSQQQSDYHVSVKDVEDLRSNDGAR---SNK 71

Query: 339 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYL 398
             +    + AG++GG+ G+GGG ++ PL L++GI P+V++AT  F + FSS+MS ++Y L
Sbjct: 72  CMFPVMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLL 131

Query: 399 LKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 458
           L       A  F  +  +A+ VG  V++K+I   GRASII+F++  ++ +S + +   G 
Sbjct: 132 LGMEHTGTASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTSYGA 191

Query: 459 AKMIKRIEHKEYMGFDNIC 477
             +        YMGF   C
Sbjct: 192 LDVWNDYVSGRYMGFKLPC 210


>gi|145501808|ref|XP_001436884.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404029|emb|CAK69487.1| unnamed protein product [Paramecium tetraurelia]
          Length = 424

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 78/354 (22%), Positives = 152/354 (42%), Gaps = 45/354 (12%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLV 163
           P+L +++ ++   +  +S C++ GG V      +++ HP     P+++YDL ++    + 
Sbjct: 62  PILLMMLDYEQSYATQLSYCLMFGGCVINTFLLMKKSHPYDQKRPLVNYDLVVILNCSIA 121

Query: 164 LGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIVL 223
           LG  +G   NV  A  +I  +   +FL++     L      KKE + K     +L+L   
Sbjct: 122 LGSYLGSILNVFLAP-IIETMFQQIFLIIVIPFLLNKA---KKEKLRKIRCQSELDL--- 174

Query: 224 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIA 283
                        +YL N      T E +    +  +N +     L +A+   I++  I 
Sbjct: 175 ------------EKYLLNQKDSIYTEEQQLLLQNEFQN-FCPSKKLAIALSFFIVSQIIM 221

Query: 284 K-----------NYTTTCSVLYWV-LNLLQIPVAGGVSAYEAVALYKGRRKIASKGD--- 328
                       N       + W+ L ++ I ++  V  Y       G +K     D   
Sbjct: 222 TGGKYLKPFIPLNKCFDFRYMLWIMLFIVNIFMSRLVYTY-------GLKKEMIFDDYKI 274

Query: 329 --EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAIT 386
             +   ++ ++ +     G  AG++ GLL LG G I+ P+ LELG+ P++++AT+ F   
Sbjct: 275 YMQERYFQKNRFILIYVSGFFAGLISGLLALGAGLIMVPVLLELGLHPRIATATSAFNYF 334

Query: 387 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF 440
           F    ++V+     +  +    +FF L+++   +  H   KLI+ L    ++I+
Sbjct: 335 FIGLTNIVKLITDSQISIAEIAWFFGLALVFGTICCHFSLKLIEKLQLVHLVIY 388


>gi|403365874|gb|EJY82729.1| hypothetical protein OXYTRI_19655 [Oxytricha trifallax]
          Length = 683

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 100/196 (51%), Gaps = 11/196 (5%)

Query: 290 CSVLYWVLNLLQIPVAGGVSAYE---AVALYKGRRK----IASKGDEGTKWRASQLVFYC 342
           CS + W + ++ I     +SAY    AV   K + K    IAS   +  K    +LV   
Sbjct: 491 CSTMDWAILMIFITFCSMLSAYSIFIAVREQKLKTKYSMGIASSDIQFNKPAIFKLVISA 550

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
             G   G V G LGLGGG I  P+ L +G+PP+V+SAT ++ I FS+S S V Y + +  
Sbjct: 551 FIG---GTVSGALGLGGGAIFNPILLSMGVPPKVASATGMYMIMFSTSGSSVIYVMYRML 607

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMI 462
            + Y  +    S   + +G ++L K++K+  R S ++F L F++ +SA+ +   G   + 
Sbjct: 608 NIQYGFWLGFWSSSGSILGMYLLNKVVKMYNRQSPVVFCLVFVLALSAVLVPIFGYIDLK 667

Query: 463 KRI-EHKEYMGFDNIC 477
            RI E  +   F  IC
Sbjct: 668 ARISEGVDITKFSTIC 683



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 62/117 (52%), Gaps = 1/117 (0%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
           P       F+ K++ A+S   +   +++  + ++ +RHP  +  VIDY+LA +  P++++
Sbjct: 100 PFCMTFFSFETKNAIALSGLTILSCSITRYIVSINERHPERNSVVIDYNLAAIMLPIVLV 159

Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET-ITKREAAKQLEL 220
           G  IGV  NV F    + ++L ++ L L+     K    +K+ET + K    +Q EL
Sbjct: 160 GSMIGVLVNVAFPSLYLQIMLTLVLLSLALHTAYKARFIYKQETEMLKVRQQQQKEL 216


>gi|301093157|ref|XP_002997427.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110683|gb|EEY68735.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 432

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 71/266 (26%), Positives = 118/266 (44%), Gaps = 31/266 (11%)

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
           F ++ ++ T++ F  TA G+ GGVGGGG+ VPM  ++ G + K +  +SK  + G AV+ 
Sbjct: 116 FTYKDLITTLVAFSSTALGAGGGVGGGGLLVPMY-ILGGLNPKHAIPLSKVTIFGSAVAM 174

Query: 134 VVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
              N R++HP   + P+ID+ L  L +P  ++G   GV  N IF +W+I V L+ L   +
Sbjct: 175 YTVNFRRKHPLNKNRPLIDFGLVGLMEPTTLIGTVFGVMLNHIFPNWLILVCLVTLLTFI 234

Query: 193 STRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEET---- 248
           + +  LKG   +  E   +R   K      +    + +E         ND  P  +    
Sbjct: 235 TYKTILKGNTRFDVEVAARRWLNKTRRRRKVRQIREEDEADFQSLPPLNDRKPTSSDPLI 294

Query: 249 ------------REPKKSKVSIIENIYWKEFGL-----LVAVWAIILALQIAKNYTTTCS 291
                        + +  + ++IE    + F L     L   W +IL   + +      S
Sbjct: 295 RNKRQFGTFSSDEDTQTQRRNVIERREVRVFPLEYLWPLALSWLLILVQSMLRGGHGAPS 354

Query: 292 VL--------YWVLNLLQIPVAGGVS 309
            +        YW+L LL + V   VS
Sbjct: 355 AIGVKCNSSSYWLLTLLPLAVPISVS 380


>gi|401405669|ref|XP_003882284.1| hypothetical protein NCLIV_020390 [Neospora caninum Liverpool]
 gi|325116699|emb|CBZ52252.1| hypothetical protein NCLIV_020390 [Neospora caninum Liverpool]
          Length = 675

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 68/129 (52%), Gaps = 7/129 (5%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
           I+VP++ L++GF    + A S+ ++ GG+++    NL +RHP  D P ID DL LL  P+
Sbjct: 158 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTCLNLFRRHPFADRPAIDLDLVLLMGPM 217

Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
            + G + G+  N  +  ++I  +L+VL        F    +T ++    KRE A   +L+
Sbjct: 218 QIAGATYGLVINRCWPVYLIMAVLVVLL-------FATAYKTSRQMMRLKREGAAARKLL 270

Query: 222 VLGNGYQTE 230
              +G   E
Sbjct: 271 EQRSGSVCE 279


>gi|301093155|ref|XP_002997426.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110682|gb|EEY68734.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 124

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 65/118 (55%), Gaps = 4/118 (3%)

Query: 364 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP----VPYALYFFALSIIAAF 419
           GP+ LE G+ P V SATA F I F++S + +++ +  +FP      Y  +F  +  +  F
Sbjct: 4   GPIMLEAGVLPAVQSATASFMILFTASSTTLQFAINGQFPGEFQFDYMAWFALVGFVGGF 63

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            G   +   +K   R SI+++ L+  I +SAL++G +GL   +  IE   ++GF  IC
Sbjct: 64  CGLKCVGYFVKKYKRESIMVYMLATTIGLSALAMGFIGLKLTLADIESGVHLGFHGIC 121


>gi|449016482|dbj|BAM79884.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 649

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 71/143 (49%), Gaps = 8/143 (5%)

Query: 37  PEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGG 96
           P  +S N +     NI      PKS   +  VW D      I+V  +  F  +A  +  G
Sbjct: 121 PCTSSANCSSLQSCNINTGYCDPKSLF-HPFVWND-----GIMVPLM--FISSAVHNAAG 172

Query: 97  VGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLAL 156
            GGG  ++ +  L++GF   +++A S   +  G ++ V+ NL  RHP  D+P ID++L  
Sbjct: 173 TGGGTSYIALFVLVLGFRVATASANSHAFIFAGMIANVLVNLWARHPFRDMPRIDWNLVA 232

Query: 157 LFQPLLVLGISIGVAFNVIFADW 179
              PL + G SIGV  N +F ++
Sbjct: 233 TSVPLFLAGSSIGVFLNQLFPNY 255


>gi|397620930|gb|EJK66001.1| hypothetical protein THAOC_13100 [Thalassiosira oceanica]
          Length = 344

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 61/120 (50%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
           +W +   V Y A    AG   G+ G+GGG + GPL L +G+ P+VSSA++   I F+S  
Sbjct: 198 RWDSRATVVYPAVCTAAGFFAGMFGVGGGIVKGPLMLAMGVHPKVSSASSACMILFTSFT 257

Query: 392 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
           +   + +     + YA     L   A  VGQ  L  L++   R S I F++  I+ +SA 
Sbjct: 258 ATTSFVVFGLLDMEYAPVCLLLGFAATLVGQIGLIYLMERFQRNSYIAFSIGAIVALSAF 317


>gi|401426825|ref|XP_003877896.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494143|emb|CBZ29440.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 511

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 97/407 (23%), Positives = 178/407 (43%), Gaps = 50/407 (12%)

Query: 99  GGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP--TLDIPVIDYDLAL 156
           GG + +PML         S+  +S+  + G +   + + ++++HP  + D P+I+Y    
Sbjct: 123 GGLMEIPML---------SAVGMSQSTICGQSTLNMYFAIQEKHPDSSWDRPLINYQYLG 173

Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF------------------- 197
           L  P  ++G  +G   + +  D +  +LL+VL  V+  R++                   
Sbjct: 174 LLLPPGLIGTLVGGILSKLCPDVLRLILLLVLLSVVLYRSWGTMKKQYRQDTNPSHVTVE 233

Query: 198 -LKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKV 256
                ET  +E     + +++ EL     G + E  +      + + +P+  R P +S+ 
Sbjct: 234 TGNANETSHRENHDNNDESQRCELTEKAGGVKRELGENTAILSTPEQSPQSLRCPPQSQY 293

Query: 257 SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL 316
                   +E  L  A + ++L   I + Y      LYW+  L  +PV     A+ +V  
Sbjct: 294 P------QQELSLNFACFLVLLLFNIFRTYAVCGGFLYWLCVL--VPV-----AFLSVVF 340

Query: 317 YKGRRKI-----ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           +  R K+     ++       W     V Y    + AG    +LG+GGG +LG +  E+G
Sbjct: 341 FFNREKLRKLAGSNPAQMTFAWTQRNSVAYPMVAVLAGASAAMLGIGGGLVLGFVLNEVG 400

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           I PQ +S T   A  F +  S ++  +     V + + F  + + ++ +GQ V    IK 
Sbjct: 401 IVPQEASVTGGMATFFIAFSSALQLLITGSLVVDFGIVFSIVGLCSSALGQLVFMTYIKS 460

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEH-KEYMGFDNIC 477
            G + +II +L F++  S +SLGG G+   +  I+     MGF  +C
Sbjct: 461 HGLSYLIIGSLIFVVGGSLVSLGGFGIYNAVISIQAGGSVMGFGCLC 507


>gi|348690322|gb|EGZ30136.1| hypothetical protein PHYSODRAFT_475542 [Phytophthora sojae]
          Length = 124

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 65/118 (55%), Gaps = 4/118 (3%)

Query: 364 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP----VPYALYFFALSIIAAF 419
           GP+ LE GI P V SATA F I F++S + +++ +  +FP      Y  +F  +  +  F
Sbjct: 4   GPIMLEAGILPPVQSATASFMILFTASSTTLQFAINGQFPGEFQFDYMAWFAFVGFVGGF 63

Query: 420 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            G   +   +K   R SI+++ L+  I +SA+++G +GL   +  +E   ++GF  IC
Sbjct: 64  CGLKCVGYFVKKYKRESIMVYMLAATIGLSAVAMGFIGLQSTLGDLESGVHLGFHGIC 121


>gi|294948280|ref|XP_002785684.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239899707|gb|EER17480.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 140/309 (45%), Gaps = 45/309 (14%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR-QRHPTLDI-PVIDYD 153
           G+GGG +FVP+L++I    A+ +AA S+ +V    ++ V+ ++  Q H    +  +I+  
Sbjct: 27  GMGGGTLFVPVLHIIGMLSARDAAATSQVLVASATLAKVLSSVYIQLHGNSSVESIINLP 86

Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
            A+L  P +V+G  +GV       +    VL +   ++ S   F KG+  WK ET     
Sbjct: 87  YAVLMLPPMVVGGLLGVYLYTWLPEIFQLVLYVFTAILASCMGFKKGIGLWKSET-EGAS 145

Query: 214 AAKQLELIVLGNGYQTEECKCDPEYLSNDTT-PEETREPKKSKVSIIENIYWKEFGL--- 269
           +A QL ++              P  ++ DT  P  T   +    SI  ++ +K+  L   
Sbjct: 146 SAGQLGVV-------------PPPCVTEDTVLPSVTSRAR----SISLSLKYKKAILITT 188

Query: 270 LVAVWAIILALQIAKNYTTT--------CSVLYWVLNLLQIPVAGGVSAYEAVALYKGRR 321
           L+AVW  ++  ++    ++T        C+ LYW L+++   +   V     VA+     
Sbjct: 189 LLAVWIAVILSRLLLGSSSTPSIIGIPYCTGLYWALSVIVCIMLMAVPGLFVVAI----- 243

Query: 322 KIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 381
           K A+      K   + L          G +  L+G GGG ++ PL L + + PQ ++AT 
Sbjct: 244 KSAAMLKLAVKLSGAMLCI--------GFIAALVGQGGGSLITPLLLYMELNPQQAAATG 295

Query: 382 LFAITFSSS 390
              +  +SS
Sbjct: 296 SVVMLITSS 304


>gi|224111746|ref|XP_002332885.1| predicted protein [Populus trichocarpa]
 gi|222833707|gb|EEE72184.1| predicted protein [Populus trichocarpa]
          Length = 151

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 79/133 (59%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
            + AGM+GG+ G+GGG ++ PL L +GI P++++AT  F + FSSSMS ++Y LL    V
Sbjct: 19  ALLAGMLGGVFGIGGGMLISPLLLHVGIAPEITAATCSFMVFFSSSMSALQYLLLGMEHV 78

Query: 405 PYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKR 464
             A+    +  +A+ +G  V+++ I   GRAS+I+F++S ++ +S + +   G   + + 
Sbjct: 79  DTAIILSVICFVASLLGLLVVQRAIVKYGRASMIVFSVSTVMALSTVLMTSFGALNVWRD 138

Query: 465 IEHKEYMGFDNIC 477
                 MGF   C
Sbjct: 139 YNSGRNMGFKLPC 151


>gi|145495145|ref|XP_001433566.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400684|emb|CAK66169.1| unnamed protein product [Paramecium tetraurelia]
          Length = 399

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 175/376 (46%), Gaps = 52/376 (13%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           ++GT++G         GG+GGG I  P++ +++G  +K +   +  M+ GG++       
Sbjct: 19  ILGTLVGL-----AQAGGIGGGPIVSPVMMVLLGCSSKQAIWNTYIMLFGGSIGNYARLG 73

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
           R+R      P+I+Y L  +  PLL+ G  +GVA       W+  V++++    +    FL
Sbjct: 74  RERISNGSSPLINYQLVQITLPLLLAGAILGVA----TGKWLPKVVVVIFLFAILLNVFL 129

Query: 199 KGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETRE--PKKSKV 256
           K  + +KK  I ++E ++ L+ + +     T+              P++ ++    +SK+
Sbjct: 130 KTKKVYKK--IREKEQSELLQQVEMKELMLTD----------YSAVPQDLQQILENESKL 177

Query: 257 SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGV 308
              EN+  KE    V    I++AL + K   T  S+L        +  +N +   +    
Sbjct: 178 YPTENL--KEIAFSV---IIVVALTLLKGAATIPSILGISYCGMEFHFINFIIFGI---- 228

Query: 309 SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGIT------AGMVGGLLGLGGGFI 362
             Y  V  Y+   K   +  +   +  S        GIT      AG +GGL+GLGGG +
Sbjct: 229 -GYYNVQRYRKWIKKDEEFKQSLGYDFSGGKISEVFGITVKSSMKAGFLGGLVGLGGGVV 287

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA--FV 420
           L P++LE GI P  ++A+A F + F+S +SV    L   + +     F  L++I +  ++
Sbjct: 288 LTPIWLETGIHPPRAAASATFTVMFTSFISVFIIALSGGYHLS---QFLILAVINSLYYL 344

Query: 421 GQHVLKKLIKILGRAS 436
              +LKKL+K   R S
Sbjct: 345 IAGILKKLVKKYKRES 360


>gi|294948278|ref|XP_002785683.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|294954883|ref|XP_002788342.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239899706|gb|EER17479.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239903654|gb|EER20138.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 354

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 83/361 (22%), Positives = 155/361 (42%), Gaps = 75/361 (20%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD--IPVIDYD 153
           G+GGG +FVP+L LI     K + A+S+ ++   +++A+++N  +++   +    +I + 
Sbjct: 27  GIGGGVLFVPVLRLIGKVPLKEATALSQALIASASLAAILFNFFEQYRARNESKALIVWP 86

Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWKKETI 209
             +L  P  V+G  IGV    IF+ W+ ++ +++L+       S  A  KG++ WK ET 
Sbjct: 87  FVILIIPCTVIGSLIGV---YIFS-WLPSLFILILYFCYACLGSFMACKKGIKLWKAETR 142

Query: 210 TKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGL 269
            KR+       I +G+   +       + +     P     P + K              
Sbjct: 143 AKRQ-------IPVGDSTDSSRSSEVAQEI-----PPLLEMPNRKK-------------- 176

Query: 270 LVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE 329
           L+A  +I+               L W + L+  P+                     KG+ 
Sbjct: 177 LIAYTSIV--------------ALIWAVCLIFPPL---------------------KGNS 201

Query: 330 GTKWRASQLV----FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 385
            T+ R+   V          I  G++  ++G  G   +  + L LG+ P+ ++ATA   I
Sbjct: 202 ATQKRSPGAVKIGLVLMTSTIVIGLLSSIIGTAGALFIIRVVLSLGLDPKQATATATVVI 261

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 445
             +SS + + + L   FP    L+   L    A +G+ ++ KLI   GR SI++  L+ +
Sbjct: 262 FATSSRTALSFALGGYFPPASNLWIVVLPFAGALLGKTIVAKLIAKTGRLSILVLLLAAM 321

Query: 446 I 446
           +
Sbjct: 322 V 322


>gi|403340471|gb|EJY69522.1| hypothetical protein OXYTRI_09740 [Oxytricha trifallax]
          Length = 715

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 64/117 (54%), Gaps = 1/117 (0%)

Query: 362 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           +  P+ L +G+PP VSS+T ++ I FS+S S + Y L       +A++      IA+ VG
Sbjct: 599 VFNPVLLSMGVPPSVSSSTGMYMIMFSTSGSSIVYILYGMLNYQFAMWLGFWCSIASLVG 658

Query: 422 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHK-EYMGFDNIC 477
             +L K +K   R S I+F L  I+ +SAL +       + K+I++    M F+++C
Sbjct: 659 LQMLNKFVKKYNRQSPIVFLLGLILGLSALLVPIFAYFDLNKQIQNGVNIMKFNSMC 715



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 64/119 (53%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
           G+ +P       F  K++ A+S   +   ++   VY L + HP  D  +IDY LA +  P
Sbjct: 46  GVVIPFCMTFFTFQTKNAIAMSGFTILTCSIVKFVYGLNEHHPEKDAVIIDYSLATIMLP 105

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
           ++++G  IGV  N++F   ++  +L ++ L+L+ ++  K    +KKE +  ++  ++ E
Sbjct: 106 VVMMGSMIGVLLNIMFPSLLLQTILTIVLLLLTWQSLGKARTIYKKENLKLQQIKEKQE 164


>gi|348688914|gb|EGZ28728.1| hypothetical protein PHYSODRAFT_475263 [Phytophthora sojae]
          Length = 257

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 57/249 (22%), Positives = 112/249 (44%), Gaps = 32/249 (12%)

Query: 216 KQLELIVLGNGYQTEECKCDPE---YLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVA 272
           + L+  +     +  +C   PE    L  D+   +  +    +  +  ++ W++   L  
Sbjct: 20  RTLQKAIRARQKERWQCGVSPESTSLLGVDSASTDGTKAHAGEPQLRADVPWRKIATLFG 79

Query: 273 VWAIILALQIAK---NYTT-----TCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIA 324
           ++ ++  +++ +   N+ +     + S LY VL   Q P      AYE  A         
Sbjct: 80  LFVVVAGMRLVRGGQNFDSPIGLDSSSALYPVLQ--QSP------AYELEA--------- 122

Query: 325 SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFA 384
                  +W +S + F+      AG V G+ G+GGG I  PL LE+G+ P  +SA     
Sbjct: 123 ----HEIQWTSSSIKFFPVFSFAAGTVSGMFGIGGGIINAPLLLEVGVDPSAASAMTAAT 178

Query: 385 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 444
           + FSS MS   Y +L +  +  A     +  +  ++G   L K+++     S+I+F+++ 
Sbjct: 179 VLFSSGMSSFNYLMLGKLDLDLARLMLPMGFLTTYIGHICLLKVVRRFQCPSLIVFSMAA 238

Query: 445 IIFVSALSL 453
           I+ +SA+++
Sbjct: 239 IVLISAVAM 247


>gi|302828096|ref|XP_002945615.1| hypothetical protein VOLCADRAFT_85767 [Volvox carteri f. nagariensis]
 gi|300268430|gb|EFJ52610.1| hypothetical protein VOLCADRAFT_85767 [Volvox carteri f. nagariensis]
          Length = 1018

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 63/185 (34%), Positives = 99/185 (53%), Gaps = 11/185 (5%)

Query: 301  QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFY------CACGITAGMVGGL 354
            Q P   G++A  A     G   +A     GT   A+ LV        CA    AG+ GGL
Sbjct: 835  QRPDGEGMAAPAAAVAAAGAVAVAEPALGGT---AALLVLEAPVKTLCAT-FGAGLTGGL 890

Query: 355  LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 414
            LGLGGG ++GPL L++G+ PQV++A++   + FSSS +++++ LL R    YAL F A S
Sbjct: 891  LGLGGGMVMGPLLLQIGVHPQVTAASSGAMVLFSSSAALIQFVLLHRLNTDYALVFGAAS 950

Query: 415  IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFD 474
            ++A  VG   +   IK  GR SI++  L+ ++ +  + +   GL     ++   + +GF 
Sbjct: 951  LVAGLVGTQTVSGAIKRSGRPSIVVLALAGVMGIGTVCVAAFGLRNAAGQLRRGD-LGFA 1009

Query: 475  NICYT 479
             IC +
Sbjct: 1010 GICSS 1014



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 62/94 (65%), Gaps = 1/94 (1%)

Query: 116 KSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLVLGISIGVAFNV 174
           K S A+S+  +T  +++AVV NL + HP++ + P+ID+ L LL  P+L++G+ IGV  NV
Sbjct: 11  KPSTALSQACITASSLAAVVSNLPRTHPSVPEAPLIDFPLILLLTPVLLVGVGIGVLLNV 70

Query: 175 IFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
               W++ +LL+VL L+L  +A  KG   W +E+
Sbjct: 71  ALPSWLLNLLLLVLLLLLLAQAIAKGKALWAQES 104


>gi|407410866|gb|EKF33150.1| hypothetical protein MOQ_002989 [Trypanosoma cruzi marinkellei]
          Length = 407

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 145/332 (43%), Gaps = 34/332 (10%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT--LDIPVIDYDLALLFQPLL 162
           PM  L++G     +  +S+  + G ++  V   +R+R P      P+I+Y    L  PL 
Sbjct: 48  PMFCLLMGLPMDFAVGLSQSTICGQSILNVFIAIRKRFPCAGCSRPLINYQYLTLLVPLG 107

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIV 222
           V+G  IG   N +  D +  +LL +L   +  R+  K +  ++K+  ++R     +    
Sbjct: 108 VIGTLIGGVLNRLCPDLLRLILLFLLLTAVLYRSVRKMIAQYRKDQ-SERRGTNTVS--- 163

Query: 223 LGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQI 282
                  EE    P       +PEE     + +        W E   +   + + L+   
Sbjct: 164 -----SAEEVSGTPTL----NSPEEVFHVTQPQYP------WIEISCVFFSFIVNLSFGA 208

Query: 283 AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYC 342
            ++ T      Y V   L  PVA  ++ +     Y+ R     K      W  S  +FY 
Sbjct: 209 WRSRTKCGGGAYIVAYCL--PVALNIAIFFG---YRHRLSHMEKLSLVFHWSNSTTIFYP 263

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT----ALFAITFSSSMSVVEYYL 398
              + AG+   +LG+GGG +LG +  ++G+ P+ +S T     LF + FSS++S+V   +
Sbjct: 264 LVSVVAGIASAMLGIGGGLVLGFILYDVGLIPEEASVTGGVVTLF-LAFSSALSLV---I 319

Query: 399 LKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
              F + Y  + FA  I++   GQ VL +LIK
Sbjct: 320 EGHFLLDYGGFLFACGIVSTLFGQFVLMRLIK 351


>gi|118401229|ref|XP_001032935.1| hypothetical protein TTHERM_00470450 [Tetrahymena thermophila]
 gi|89287281|gb|EAR85272.1| hypothetical protein TTHERM_00470450 [Tetrahymena thermophila SB210]
          Length = 1325

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 10/140 (7%)

Query: 69   WPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTG 128
            +P+I FG  IV+ TI+GF     G   G+GGG    P+L  +  +D K S A+   +V  
Sbjct: 899  FPEI-FGS-IVIITILGF-----GQAAGIGGGTSITPILLALFLYDTKKSVALVILLVFS 951

Query: 129  GAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV 187
             ++   +   R+R  T D +PVI+Y L L+  P L++G   GVA N       + +LL++
Sbjct: 952  SSLGNTIQISRER--THDGVPVIEYRLILVTLPTLIVGTVYGVAVNKFLPSIAVCILLVI 1009

Query: 188  LFLVLSTRAFLKGVETWKKE 207
            L      +++L+     K E
Sbjct: 1010 LLAQQIQKSYLRYKNMRKNE 1029


>gi|255635532|gb|ACU18117.1| unknown [Glycine max]
          Length = 114

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 31/70 (44%), Positives = 39/70 (55%), Gaps = 7/70 (10%)

Query: 26 SVSVSA-----QSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVV 80
          SVSVSA       N+ P      +   +    V+      S SGY+H WPDI+FGWRI+ 
Sbjct: 24 SVSVSALQHVSHKNINPTTVETTQTSFL--GKVVNFLWSSSGSGYQHTWPDIEFGWRIIT 81

Query: 81 GTIIGFFGTA 90
          GTIIGF G+A
Sbjct: 82 GTIIGFLGSA 91


>gi|397638312|gb|EJK73008.1| hypothetical protein THAOC_05398, partial [Thalassiosira oceanica]
          Length = 318

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 10/76 (13%)

Query: 315 ALYKGRRKIASKG----------DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 364
           A Y+GRR +AS            D   +W A+ L  Y A  + AG+V GL+G+GGG +LG
Sbjct: 205 AAYQGRRIVASAAAKVRAGYPFRDRDVRWDAAALRLYGAWTLLAGVVAGLVGIGGGMVLG 264

Query: 365 PLFLELGIPPQVSSAT 380
           PL L + I P+VS+AT
Sbjct: 265 PLMLAMNIDPRVSTAT 280


>gi|157873729|ref|XP_001685369.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128441|emb|CAJ08542.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 511

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 101/407 (24%), Positives = 181/407 (44%), Gaps = 53/407 (13%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP--TLDIPVIDYDLALLFQPLL 162
           P+   ++    +S   +S+  + G +   +   ++QRHP  + D P+I+Y    L  PL 
Sbjct: 120 PLFCGLMEVPMQSGVGMSQSAICGQSTLNIYLAIQQRHPDSSWDRPLINYQYLSLLLPLG 179

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIV 222
           ++G  IG   + +  D +  +LL+VL  V+  R++    ET KK+    R+    + + V
Sbjct: 180 LIGTLIGGVLSKLCPDVLRLILLLVLLSVVLYRSW----ETMKKQ---YRQDTDPMHVTV 232

Query: 223 LGNGYQTEECKCDPEYLSNDTT-PEETRE---------------------PKKSKVSI-- 258
                +  +        S D+T   ++RE                     P++S  SI  
Sbjct: 233 -----EASDANATSHQKSYDSTGKSQSRELTETAGGAKKVLGEDIAVLPTPEQSPPSIER 287

Query: 259 -IENIY-WKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL 316
             ++ Y  +E  +  A + ++L   I + YT     LYW+  L  +PV     A+ +V  
Sbjct: 288 PPQSQYPQQELSMNFACFLVLLLFNILRTYTVCGGFLYWLCVL--VPV-----AFLSVVF 340

Query: 317 YKGRRKIASKGDEGT-----KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           Y  R K+    +         W     V Y    + AG+   +LG+GGG +LG +  E+G
Sbjct: 341 YFNREKLRKLAESDPAQMTFTWTERNSVTYPMVAVLAGVSAAMLGIGGGLVLGFVLNEVG 400

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           I PQ +S T+     F +  SV++  +     V + + F  + + ++ +GQ VL   IK 
Sbjct: 401 IAPQEASVTSGMTTFFIAFSSVLQLLITGSLVVDFGVVFCIVGLCSSALGQLVLMNYIKR 460

Query: 432 LGRASIIIFTLSFIIFVSALSLGGVGL-AKMIKRIEHKEYMGFDNIC 477
            G   +II +L F++  S ++LGG G+ + +I        + F  +C
Sbjct: 461 RGLNYLIIGSLVFVVGGSLVALGGYGIYSAVISTQTGGSVLAFGRLC 507


>gi|145531341|ref|XP_001451437.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419092|emb|CAK84040.1| unnamed protein product [Paramecium tetraurelia]
          Length = 430

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 36/154 (23%), Positives = 73/154 (47%), Gaps = 13/154 (8%)

Query: 93  SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDY 152
           +VGG+GGG   +P+L +++ F    +      +  G  +   +  + Q+HP  D  +IDY
Sbjct: 47  TVGGLGGGIEKIPILIVMLNFSQSKATLYVYVLTFGTNLVNFLMLIYQKHPLADKQIIDY 106

Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE----- 207
           +L+L+  P  + G + G   + I  D  +  +LIV F +   + ++K  +  ++E     
Sbjct: 107 ELSLILLPTALFGSAFGNILHQILPDIFLISILIVFFSIFVPKLYIKAKQNREQEMQVDN 166

Query: 208 ---TITKREAAKQLELIVLGNGYQTEECKCDPEY 238
              T+  +E        ++   Y++E+ +  P Y
Sbjct: 167 KQKTVINQEVTH-----LIAEQYKSEDQQIIPLY 195


>gi|294944271|ref|XP_002784172.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897206|gb|EER15968.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 180

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 79/153 (51%), Gaps = 17/153 (11%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP--VIDYD 153
           G+GGG +FVP+L LI G   K S+++S+ +V   A++A ++N   ++   + P  +I + 
Sbjct: 4   GIGGGILFVPVLRLIGGLSQKESSSLSQALVAASALAANLFNFYAQYRAKNEPKALIVWP 63

Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWKKETI 209
             +L  P  V+G  IG    +    W+ ++L ++L+ ++    S  A+ KG + WK ET 
Sbjct: 64  FVILMLPCAVVGSLIG----IYLYSWLPSLLQLILYFIVACFGSLAAYRKGYKLWKAETD 119

Query: 210 TKREAAKQLE---LIVLGNGYQTEE----CKCD 235
            K  A ++ +    +      + EE    C CD
Sbjct: 120 AKESAIREFDSTTTVTCSPSVEPEEVYHLCGCD 152


>gi|294944267|ref|XP_002784170.1| cgmp-dependent protein kinase, putative [Perkinsus marinus ATCC
           50983]
 gi|239897204|gb|EER15966.1| cgmp-dependent protein kinase, putative [Perkinsus marinus ATCC
           50983]
          Length = 311

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 58/230 (25%), Positives = 103/230 (44%), Gaps = 38/230 (16%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLA 155
           G+GGG +FVP+L LI G   K S A+S+ ++   +++A ++N  +++   + P      A
Sbjct: 4   GIGGGVLFVPVLRLIGGLQLKESTALSQALIASASLAATLFNCFEQYSARNDPK-----A 58

Query: 156 LLFQPLLVLGIS---IGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWKKET 208
           L+  P ++L +    IG    V    W+ ++ +++L+       S  A+ KG+  WK E 
Sbjct: 59  LIVWPFVILTLPCTVIGSLIGVYLYSWLPSLFILILYFCFVCLGSFMAYRKGIRLWKAEN 118

Query: 209 ITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTT---PEETREPKKSKVSIIENIYWK 265
             KR A          +G  T+  +      S++ T   P   R P   K++   +I   
Sbjct: 119 GAKRRAV---------DGDSTDMSR------SSEVTVEVPSLLRMPNMKKLAAYTSIA-- 161

Query: 266 EFGLLVAVWAIILALQIAKNYTTTC----SVLYWVLNLLQIPVAGGVSAY 311
              L+ AV+     +       TTC    +V +W L +L I +  G   +
Sbjct: 162 --ALVWAVYGHFPGIDHDARVATTCGYGYAVDFWTLGILLIEMLTGKPPF 209


>gi|71420985|ref|XP_811672.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876359|gb|EAN89821.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 473

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 91/349 (26%), Positives = 152/349 (43%), Gaps = 36/349 (10%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT--LDIPVIDYDLALLFQPLL 162
           PM  L++G     +  +S+  + G ++  V+  +R+R P      P+I+Y    L  PL 
Sbjct: 114 PMFCLLMGLPMDFAVGLSQTTICGQSILNVLIAIRKRFPCAGCSRPLINYQYLTLLVPLG 173

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIV 222
           V+G  IG   N +  D    VLL +L   +  R+  K +  ++K+  ++R     +    
Sbjct: 174 VIGTLIGGVLNRLCPDLFRLVLLFLLLTAVLYRSVRKMIAQYRKDQ-SERRGTNTVS--- 229

Query: 223 LGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQI 282
                 TEE    P       +PEE     + +        W E   +V  + + L+   
Sbjct: 230 -----STEEVSGTPTL----NSPEEIFHVTQPQYP------WIEISCVVFSFIVNLSFGA 274

Query: 283 AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYC 342
            ++ T      Y V   L  PV   +  +     Y+ R     K      W  S  + Y 
Sbjct: 275 WRSRTKCGGGAYIVAYCL--PVVLNIVIFFC---YRHRLSNMEKFRLVFHWNNSTTILYP 329

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT----ALFAITFSSSMS-VVEYY 397
              + AG+   +LG+GGG +LG +  ++G+ P+ +S T     LF + FSS++S ++E +
Sbjct: 330 LVSVVAGVASAMLGIGGGLVLGFILYDVGLIPEEASVTGGVVTLF-LAFSSALSLLIESH 388

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 446
           LL    + Y    FA  I++  +GQ VL +LIK      +II  L  II
Sbjct: 389 LL----IDYGGVLFACGIVSTILGQFVLMRLIKKYKLKFLIIAALVTII 433


>gi|403352629|gb|EJY75833.1| hypothetical protein OXYTRI_02776 [Oxytricha trifallax]
          Length = 366

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 61/110 (55%), Gaps = 5/110 (4%)

Query: 113 FDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP---VIDYDLALLFQPLLVLGISIG 169
           FD K++ A+S   +   +V+  +Y L ++HP  D     +I+Y+LA++  P +++G   G
Sbjct: 118 FDTKNAIAISNFAIFTCSVTRYIYTLDKKHPHPDKKQNVIIEYNLAIVMLPTVMMGSLTG 177

Query: 170 VAFNVIFADWMITVLLIVLFLVLSTRAFLKG--VETWKKETITKREAAKQ 217
           V  N+IF    +  +L  L + LS ++ +KG  ++ +K ++  K    KQ
Sbjct: 178 VFLNIIFPAIALQAILTALLIFLSLQSLMKGKDIQKFKGQSHNKSRILKQ 227


>gi|449016905|dbj|BAM80307.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 804

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 53/105 (50%)

Query: 113 FDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAF 172
           F    ++A S+ ++ G +++  VYN R+RH   D P ID++L +   P  + G + G   
Sbjct: 131 FPIPQASANSQSLILGASIANSVYNFRKRHVIRDAPRIDWNLVISTMPFFLCGTTPGYFL 190

Query: 173 NVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ 217
           N+    +    +L  +   L+ ++FL G    +++   +RE  +Q
Sbjct: 191 NISLPGYFTGFVLAAMLGALTIQSFLSGTRMTRRQWRMRREFLRQ 235


>gi|159115095|ref|XP_001707771.1| Hypothetical protein GL50803_4181 [Giardia lamblia ATCC 50803]
 gi|157435878|gb|EDO80097.1| hypothetical protein GL50803_4181 [Giardia lamblia ATCC 50803]
          Length = 520

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 69/159 (43%), Gaps = 14/159 (8%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH---PTLDIPVIDYDLALLF 158
           IFV ML L  G     +A +SK M+ GG+      N+ Q     PT   P I +DL  + 
Sbjct: 39  IFVSMLQLF-GVSPHVAAPLSKAMIFGGSCVLTCMNIFQHEDNEPTK--PSIIWDLVFII 95

Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETI--------T 210
           +P  V G  IG   NV+  +W++ VL +   L  + +     + T  KE I        T
Sbjct: 96  EPAAVSGALIGALINVVLPEWLLLVLEVAFLLYTTQKMLRSSLATLNKERIAAGKRLLCT 155

Query: 211 KREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETR 249
           ++  A  L +   G+ +Q      D    S +TT  E +
Sbjct: 156 RKSRAPALSIDERGSPHQPSTFIEDQSTRSGNTTSNEIQ 194



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 66/134 (49%), Gaps = 2/134 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG++G +LG+GGG +  P+ +  GI P+ +   +   I F+S  S++ Y ++     
Sbjct: 388 GLFAGILGAMLGIGGGLLKNPILISFGIDPERARTASTVMIAFTSMSSMISYVVIGGLHF 447

Query: 405 PYALYFFALSIIAAFVGQHVLKKL-IKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK 463
            YA +   L++ A FV  + L +L I+     S I F ++ +I V    +    +   I 
Sbjct: 448 EYA-WPLMLTVGAFFVSGYYLSELIIRCFKTKSFIPFLITALIVVCTCFIVANMIIVFID 506

Query: 464 RIEHKEYMGFDNIC 477
             +     GF ++C
Sbjct: 507 IAKTGHLPGFTSLC 520


>gi|302828094|ref|XP_002945614.1| hypothetical protein VOLCADRAFT_85766 [Volvox carteri f.
           nagariensis]
 gi|300268429|gb|EFJ52609.1| hypothetical protein VOLCADRAFT_85766 [Volvox carteri f.
           nagariensis]
          Length = 136

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 59/103 (57%), Gaps = 1/103 (0%)

Query: 375 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 434
           QV++A++   + FSSS +++++ LL R    YA+ F A S++A  +G   + + IK  GR
Sbjct: 33  QVTAASSSTMVLFSSSAALIQFILLHRLNTDYAIVFGAASLVAGLLGTQAVSRAIKRSGR 92

Query: 435 ASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 477
            S+++  L+ +I ++ L +   GL     ++   E +GF  IC
Sbjct: 93  PSVVVLALAGVIGIATLCVAIFGLRNAAVQLRAGE-LGFLGIC 134


>gi|452819360|gb|EME26420.1| hypothetical protein Gasu_59110 [Galdieria sulphuraria]
          Length = 579

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 60/112 (53%), Gaps = 5/112 (4%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
            IFVP+L +++ F    +  VS+ M+TG AV+       +RHP  D P + Y++     P
Sbjct: 107 PIFVPVLEVMLNFAVSQAGGVSQSMITGMAVAVSTIAFLKRHPERDRPAVYYEIVTSMVP 166

Query: 161 LLVLGISIGVAFNVI----FADWMITVLLIVLFLVLSTRAF-LKGVETWKKE 207
           + ++G  IG   N +    F  +++ ++LI L +  S +A  L+ +E  +++
Sbjct: 167 ICLMGTFIGTYVNQVLPGYFTAFVVVIILIYLVITTSMKAMSLRRIELHERD 218



 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 83/205 (40%), Gaps = 22/205 (10%)

Query: 264 WKEFGLLVAVWA---IILALQIAKNYTTT------CSVLYWVLNLLQIPVAGGVSAYEAV 314
           W +  LLV  W+   I +AL+  K    +      C  +YW L  +   V   +S+   +
Sbjct: 351 WGDLLLLVTTWSFVIIFVALRGGKPRIISPLGVHLCGWIYWFLLAILELVLLIISSVTML 410

Query: 315 ALYKGRRKIASKG----DEGTKWRASQLVFY---C-ACGITAGMVG-GLLGLGGGFILGP 365
            LY   +     G        +W    L+ Y   C  CG+ A  VG  +  L  GF+L  
Sbjct: 411 RLYSLHQHRVRLGYLFCRSDVRWTPKTLILYGIFCFVCGLVASWVGISVETLAAGFLL-- 468

Query: 366 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 425
             + LG+ P V   T      F+SS    E          YAL++   + + A VG  V+
Sbjct: 469 --VVLGVDPLVVQLTGGVINLFTSSAIAAESAANGSLAWRYALFYAGFTFLGALVGVLVV 526

Query: 426 KKLIKILGRASIIIFTLSFIIFVSA 450
              +K     SII+F L F + V+ 
Sbjct: 527 GHFVKKYHLKSIIVFCLVFFLIVAT 551


>gi|145493138|ref|XP_001432565.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124399678|emb|CAK65168.1| unnamed protein product [Paramecium tetraurelia]
          Length = 441

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 81/348 (23%), Positives = 152/348 (43%), Gaps = 42/348 (12%)

Query: 94  VGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT-LDIPVIDY 152
           +GG+G G I  P+LNL++ + +  S  V+ C +   +    ++   ++HP   ++P+I++
Sbjct: 55  MGGLGSGMIKRPILNLLLNYPSSISTQVADCFLFTTSTLNSLFLFFEKHPDHHNLPLINF 114

Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK--ETIT 210
           D++++    + L  S G         ++I +  I L  ++    FL     WK     + 
Sbjct: 115 DISIILNQTIPLAWSAGAFLQTRSPKFVIYI--IQLCFLMGAIPFL-----WKYMLAYLQ 167

Query: 211 KR-EAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI-------IENI 262
           KR E  K+  +I++    +T+E       ++++T  +E +  +  K  I       I+N+
Sbjct: 168 KRIEQDKRENVILVNQKIKTQED------MASETNFDEKQLTQYQKFYINDHSKFQIKNL 221

Query: 263 YWKEFGLLVAVWAIILALQIAKNYT----TTCSVLYWVLNLLQIPVAGGVSAYEAVALYK 318
            +  FG  +    IIL      N +      C++     NL+ + + G    Y     + 
Sbjct: 222 CFI-FGSFLVNQTIILMRSNKYNNSIIGLDKCTLEN---NLILLAILGINLTYTFFIYWS 277

Query: 319 GRRKIASKGDEGTKWRASQ--------LVFYCACGITAGMVGGLLGLGGGFILGPLFLEL 370
            R +   K  +  ++R +Q          FY   G  AG + G LG+GGG I+    L  
Sbjct: 278 KRNEEYYK--DIVQYRPNQRFFKDKKQFFFYYMAGFLAGFITGFLGMGGGLIMVTFLLSQ 335

Query: 371 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 418
            I  + ++ATA F     S  S++++ L K       + FF L I+  
Sbjct: 336 KIIAREAAATAAFGSFMISLNSLIQFILQKTISDEQMIVFFILGILGV 383


>gi|323456207|gb|EGB12074.1| hypothetical protein AURANDRAFT_9279, partial [Aureococcus
           anophagefferens]
          Length = 89

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/90 (36%), Positives = 52/90 (57%), Gaps = 4/90 (4%)

Query: 126 VTGGAVSAVVYNLRQRH---PTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
           ++GG++ A +Y   QR+   PTL  P+IDYD +LLF P L+ G   G  F+V+F  W++ 
Sbjct: 1   ISGGSI-ANLYTYTQRYHPNPTLRRPLIDYDASLLFCPALLAGTMFGGLFSVMFPPWLVV 59

Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKR 212
           + L+VL      R   KG+  W  E+  ++
Sbjct: 60  ICLVVLLGYSGKRTVKKGIAKWNAESAKQK 89


>gi|348677068|gb|EGZ16885.1| hypothetical protein PHYSODRAFT_503702 [Phytophthora sojae]
          Length = 254

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 65/272 (23%), Positives = 117/272 (43%), Gaps = 56/272 (20%)

Query: 194 TRAFLKGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPE---YLSNDTTPEETRE 250
           TR   K +   +KE                       +C   PE    L  D++ + +  
Sbjct: 19  TRTLQKAISARRKERW---------------------QCCVSPEATALLGIDSSEKVSST 57

Query: 251 PKKSKVSIIENIYWKEFGLLVAVWAIILALQI---AKNYTT-----TCSVLYWVLNLLQI 302
           P+K   +   ++ W++   L +++ +I  ++I    K++ +     + S LY +L   Q 
Sbjct: 58  PEKKPQA---DVPWRKLATLSSLFVVIAGMRILRGGKDFDSPLGIDSSSALYPMLQ--QT 112

Query: 303 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 362
           P       YE  A                KW    + F+    + AG V G+ G+GGG I
Sbjct: 113 P------GYELEA-------------HEIKWTPLSIRFFPFFSLAAGAVSGMFGIGGGII 153

Query: 363 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 422
            GPL LE+GI    +SA     + FSS MS   Y ++ +  +  A     +  +  ++GQ
Sbjct: 154 NGPLLLEVGIDASAASAMTATTVLFSSGMSAFNYTVMGKTDIHLAQVLLPMGFLMTYIGQ 213

Query: 423 HVLKKLIKILGRASIIIFTLSFIIFVSALSLG 454
             L K+++     S+IIF+++ I+ +SA+++ 
Sbjct: 214 LCLLKVVRRFQCPSLIIFSMAVIVLISAIAMS 245


>gi|167521960|ref|XP_001745318.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776276|gb|EDQ89896.1| predicted protein [Monosiga brevicollis MX1]
          Length = 512

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 69/134 (51%), Gaps = 15/134 (11%)

Query: 104 VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP---VIDYDLALLFQP 160
           VPML ++  F    +  +S   + GG++   +  ++++HP   +    +I+Y + LL  P
Sbjct: 45  VPMLLVVGDFPTYYAIPLSVTAIVGGSIVRFIMQVQRKHPNPKVAHRQLINYPMVLLLLP 104

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ--- 217
           + + G  IGV  N +  +W+I   L  +FLVL+  +F K ++  K+    ++EA  Q   
Sbjct: 105 MALAGTVIGVLLNSVAPNWLI---LATIFLVLTYTSF-KTLKKGKELRAKEKEAHAQMAA 160

Query: 218 -----LELIVLGNG 226
                +ELIV  NG
Sbjct: 161 TELHTMELIVDDNG 174



 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 44/78 (56%)

Query: 376 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 435
           VSSAT  +   F+S  S  ++ +L R PV Y +  F L+ +A+ +GQ  L   ++  G+ 
Sbjct: 371 VSSATTSYMTLFTSISSFTQFLVLNRVPVDYGILLFFLAAVASVLGQLALNSYVRKTGKN 430

Query: 436 SIIIFTLSFIIFVSALSL 453
           SII + L  II ++ + L
Sbjct: 431 SIIAYILGVIISLATVLL 448


>gi|253746884|gb|EET01870.1| Hypothetical protein GL50581_856 [Giardia intestinalis ATCC 50581]
          Length = 511

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 7/113 (6%)

Query: 70  PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
           P ++    +VV  +   F     S  G+GGG IFV ML L  G     +A +SK M+ GG
Sbjct: 8   PGLEVWMEVVVAIVCALFAML-ASAAGIGGGVIFVSMLQLF-GVSPHVAAPLSKAMIFGG 65

Query: 130 A---VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
           +       ++      PT   P I +DL  + +P  V G  IG   NV+  +W
Sbjct: 66  SCVLTCMTIFQHEDNDPTK--PAIIWDLVFIIEPAAVSGALIGALINVVLPEW 116



 Score = 46.6 bits (109), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 2/105 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG++G +LG+GGG +  P+ +  GI P+ +   +   I F+S  S++ Y ++     
Sbjct: 379 GLFAGILGAMLGIGGGLLKNPILISFGIEPERARTASTVMIAFTSMSSMISYVVIGGLHF 438

Query: 405 PYALYFFALSIIAAFVGQHVLKKLI-KILGRASIIIFTLSFIIFV 448
            YA +   L I   FV  + L  LI +     S I F ++ +I V
Sbjct: 439 EYA-WPLMLVIGVFFVSGYYLSGLIMRCFKTKSFIPFIITALIIV 482


>gi|145549159|ref|XP_001460259.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428088|emb|CAK92862.1| unnamed protein product [Paramecium tetraurelia]
          Length = 482

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 80/390 (20%), Positives = 147/390 (37%), Gaps = 90/390 (23%)

Query: 92  GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVI 150
           G++G +G G +  P L   + FD++S+  ++ C+     ++ ++    QRHP   + P+I
Sbjct: 77  GALGALGAGVVKRPFLEATLNFDSESAGNMTMCLQFASQLANIIIIFFQRHPEDPERPII 136

Query: 151 DYDLALLFQPLLVLGISIGVAF-------NVIFADWMITVLLIVLFLVLSTRAFLKGVET 203
           +Y++AL++   + L  S+G  F       ++++   +  V +  +  + +T+      + 
Sbjct: 137 NYEIALIYCLAIPLSQSLGTEFANYLPLGSLLWIQNIFFVAVCPILFLFATKELTLQTQK 196

Query: 204 WKKETITKREAAKQLELI-----------VLGNGYQTEECKCDPEY-------------- 238
            +   I  + A   LE +           VL   +  ++C   P +              
Sbjct: 197 EQDNDILLKSALVPLEGLKDQQQLQQQEAVLYKQFYNDQCNKFPMFPIVVTVGTFAINEA 256

Query: 239 --LSND----TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSV 292
             LS      T+P  T E KK +              + A W   + L +    T    V
Sbjct: 257 IILSRTTPGLTSPYYTNETKKERT-------------ICAAWNFYMLLLLFAVNTIITLV 303

Query: 293 LYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTK-WRASQLVFYCACGITAGMV 351
           +Y++    +          +AV  +   R        GT+ W+        A G   G+V
Sbjct: 304 VYYIKRKEEY-------TKDAVKFFNCERFYTP----GTRFWKIY------AAGWVTGIV 346

Query: 352 GGLLGLGGGFILGPLFLELGIPPQVSSATA----------LFAITFSSSMSVVEYYLLKR 401
            G  G+  G I+     E G+   V+  TA          LFA   S+ +          
Sbjct: 347 AGFFGMAAGLIMVTTMSEFGLISAVAGGTANFCYFIICCQLFATVLSNQIEE------NN 400

Query: 402 FPVPYALYFFALSIIAAF----VGQHVLKK 427
           F V    +F+ L II+      +G + LKK
Sbjct: 401 FGVGDQFFFYGLGIISVLIFTNIGYYYLKK 430


>gi|145538953|ref|XP_001455171.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422970|emb|CAK87774.1| unnamed protein product [Paramecium tetraurelia]
          Length = 406

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 153/372 (41%), Gaps = 69/372 (18%)

Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIF-ADWMITVL 184
           + G A+   +   +Q  P    PV+    A++  P +++G  +G+  N I  A ++I++L
Sbjct: 84  IFGAALGNFLNLAQQLDPQTKTPVVKIRSAIVACPAMIIGSMVGLILNKILPAFFLISIL 143

Query: 185 LIVLFLVLSTRAFLK-GVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDT 243
               FL LS + F K  ++ WK E   ++     +E+ +L                 N  
Sbjct: 144 --QYFLFLSCQKFYKTALKEWKNEN-QRKLTQNSVEIPIL-----------------NKL 183

Query: 244 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILA------LQIAKNYTTT-----CSV 292
             E+T +      S+ ENI      L+VA+  I++       L+ + N  +      C  
Sbjct: 184 QEEQTLQ----GASLKENI-----DLIVAIGVIVVTVLLGFFLRGSPNVESIIGISYCGF 234

Query: 293 LYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVG 352
            YW++ L  + V      Y    ++   +K   K  +    R S           AG+  
Sbjct: 235 FYWIITLGLVVVL----YYYFEYIFDRFQKEELKPLKKECLRDS---------FKAGIFN 281

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G  GLGGG IL P++L +G     S+ T+ F +  SS     +  +L       A   F 
Sbjct: 282 G-FGLGGGMILIPMYLGMGFTTIQSTGTSSFNVLLSSFQVAAQVIILGYMGTLQAFSLFT 340

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTL-SFIIFVSALSLGGVGLA-KMIKRIEHKEY 470
           ++++  ++    +   +K   R S+I++ L  F+IF        V LA + I++  H  Y
Sbjct: 341 MTVVGCYLSSSYIFNNLKKRDRLSLIMWGLVGFVIFAM------VNLAIQFIQKWSHNGY 394

Query: 471 -----MGFDNIC 477
                  F +IC
Sbjct: 395 QISELFAFQSIC 406


>gi|408404324|ref|YP_006862307.1| sulfonate transporter [Candidatus Nitrososphaera gargensis Ga9.2]
 gi|408364920|gb|AFU58650.1| putative sulfonate transporter [Candidatus Nitrososphaera gargensis
           Ga9.2]
          Length = 256

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 49/86 (56%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           + +V   A G+ AG +G ++G+GGG I+ P    LG+PP  +++T+L A+T +S  S +E
Sbjct: 6   TTIVVLIAVGLGAGTLGSMIGVGGGIIMVPALTFLGLPPAQAASTSLIAVTSTSVSSTIE 65

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVG 421
           Y   KR      L   A ++  A +G
Sbjct: 66  YSRQKRIDYRLGLTMSAFAVPGAVLG 91


>gi|401412944|ref|XP_003885919.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325120339|emb|CBZ55893.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 486

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/105 (24%), Positives = 58/105 (55%)

Query: 114 DAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
           D   +  +SK  + G  + +V++NL ++ P+ ++ +I Y+LA + +P  ++G  +GV  N
Sbjct: 41  DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLMGGVLGVLLN 100

Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
           ++ +D  I   L+V+    + +   +G+  ++ E+    E A +L
Sbjct: 101 IVMSDIQIICCLVVVLSFTTYKTTRRGIIQYQTESRLLAERAARL 145


>gi|405984262|ref|ZP_11042565.1| hypothetical protein HMPREF9451_01696 [Slackia piriformis YIT
           12062]
 gi|404388094|gb|EJZ83178.1| hypothetical protein HMPREF9451_01696 [Slackia piriformis YIT
           12062]
          Length = 274

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 5/93 (5%)

Query: 341 YCA----CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVE 395
           YCA     G+ AG++ G +G+GGGFI+ PLF+  L IP +++S T+L A+   S    +E
Sbjct: 153 YCAKVALIGVVAGLLSGYVGVGGGFIMVPLFMTLLSIPMRLASGTSLIAVCILSVPGTIE 212

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
             LL        +   A SI  AFVG  ++KK+
Sbjct: 213 QALLGNIHFLLGIAMAAGSIPGAFVGASMVKKV 245


>gi|403374048|gb|EJY86957.1| hypothetical protein OXYTRI_08652 [Oxytricha trifallax]
          Length = 426

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/379 (23%), Positives = 157/379 (41%), Gaps = 64/379 (16%)

Query: 129 GAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLVLGISIGVAFNVIFAD-WMITVLLI 186
            ++S  + N  QRHP   +   IDYD+  L  P++ LG   GV       +  + T    
Sbjct: 42  SSLSRFIINYTQRHPKQPERLAIDYDIIELSMPIMYLGTLFGVQIGTRLTEVQLATTFAC 101

Query: 187 VLFLVLSTRAFLKGVETWKKET---------------------------------ITKRE 213
           VLF V S +   K +   K+E                                  I  RE
Sbjct: 102 VLFFV-SYKTTTKAISMIKEENAKKKMKYLSYTDKNEYKELLLPKSNRTSKQNSPIKDRE 160

Query: 214 AAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWK-------- 265
           +   L+ +   N       +     L +DT  EE  + +K++  +IE  Y +        
Sbjct: 161 SDSALQKLSHQNDQNLSTVQVQ---LFSDTPFEEHNQQQKNR-QMIEKQYQQVLKDQSQH 216

Query: 266 -------EFGLLVAVWAIILALQIAKNYTTT-CSVLYWVLNLLQIPVAGGVSAYEAVA-- 315
                   FGL +    I   L   ++ +    S +Y   +LL   +    S  +     
Sbjct: 217 FTLRRCLTFGLTLGFLMITSTLLNQRDDSQYYVSPMYGYASLLLFLIYAVFSTIKKAQYL 276

Query: 316 --LYKGRRKIASKGDEGT-KWRASQLVF--YCACGITAGMVGGLLGLGGGFILGPLFLEL 370
             ++  + ++  K D+    +   +++F     C I +G++GG++G+ GG IL PLFL+L
Sbjct: 277 QEMHSVKLQMGYKFDQNDLTYENYRVIFNIILLCHI-SGILGGVVGIAGGIILAPLFLQL 335

Query: 371 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           G+ P + ++T  +    S+     +Y+ L      Y   F  L II +++G  V+ K+++
Sbjct: 336 GMLPVIVASTNQYLALISTIAVSSQYWYLGMMNWQYCSVFGFLGIIGSYIGIKVVMKIVQ 395

Query: 431 ILGRASIIIFTLSFIIFVS 449
             GR SI++  L+F++F S
Sbjct: 396 KSGRQSIMVVILAFVLFTS 414


>gi|308160584|gb|EFO63064.1| Hypothetical protein GLP15_1793 [Giardia lamblia P15]
          Length = 520

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 6/111 (5%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQ---RHPTLDIPVIDYDLALLF 158
           IFV ML L  G     +A +SK M+ GG+      N+ Q     PT   P I +DL  + 
Sbjct: 39  IFVSMLQLF-GVSPHVAAPLSKAMIFGGSCVLTCINIFQYEDDEPTK--PSIIWDLVFII 95

Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETI 209
           +P  V G  IG   NV+  +W++ VL ++  L  + +     + T  KE I
Sbjct: 96  EPAAVSGALIGALINVVLPEWLLLVLEVIFLLYTTQKMLRNSLTTLNKERI 146



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 57/105 (54%), Gaps = 2/105 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG++G +LG+GGG +  P+ +  GI P+ +   +   I F+S  S++ Y ++     
Sbjct: 388 GLFAGILGAMLGIGGGLLKNPILISFGIDPERARTASTVMIAFTSMSSMISYTVIGGLHF 447

Query: 405 PYALYFFALSIIAAFVGQHVLKKL-IKILGRASIIIFTLSFIIFV 448
            YA +   L++   FVG + L +L I+     S I F ++ +I V
Sbjct: 448 EYA-WPLMLTVGTFFVGGYYLSELMIRCFRTKSFIPFIITALIVV 491


>gi|297526402|ref|YP_003668426.1| hypothetical protein Shell_0395 [Staphylothermus hellenicus DSM
           12710]
 gi|297255318|gb|ADI31527.1| protein of unknown function DUF81 [Staphylothermus hellenicus DSM
           12710]
          Length = 250

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 6/111 (5%)

Query: 320 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSS 378
           R++I +K    +K R    V        AG+  G+LG+GGG +  PL  + LG+P + + 
Sbjct: 122 RKRIDAKNHMVSKQRKGLGVLIA---FLAGLTSGMLGIGGGVLKVPLMTIVLGLPIRTAI 178

Query: 379 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII-AAFVGQHVLKKL 428
           AT+ F +  ++S   +  YLLK +  PYA+   AL II  A +G H+LKK+
Sbjct: 179 ATSSFMVGLTASAGSL-VYLLKGYVNPYAVAALALGIIPGATLGAHMLKKI 228


>gi|294891216|ref|XP_002773478.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239878631|gb|EER05294.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 385

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 62/107 (57%), Gaps = 1/107 (0%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
            +++P    ++G DA  +  +SK  + G AVSA ++NLRQRHPT D P+ID D+ LL +P
Sbjct: 34  ALYMPAFVAVMG-DAHWAVPLSKVAINGVAVSATIFNLRQRHPTYDQPLIDLDIGLLLEP 92

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE 207
           L +LG  +GV  NV      I   L+++  + +   F K ++  + E
Sbjct: 93  LTLLGSMLGVYLNVAMTSVEIFSCLVLVLSITAALTFRKAIQRRRLE 139


>gi|449017188|dbj|BAM80590.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 606

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 58/106 (54%)

Query: 74  FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
           F W   V  ++ F  T   + GG+GGG + VP+L+L  GF  K+++A ++ ++ G +++ 
Sbjct: 85  FLWSDGVLIVLVFLATGLSNAGGIGGGILLVPLLSLFAGFTLKNASANAQPLIFGASIAN 144

Query: 134 VVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
            + N+ +RH   D P ID++L     P  ++G + G   N  F  +
Sbjct: 145 AMVNVPRRHAARDTPRIDWNLVACTVPFFLMGTTPGTFMNQAFPSY 190



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/237 (25%), Positives = 108/237 (45%), Gaps = 29/237 (12%)

Query: 264 WKEFGLLVAVWAIILALQI----AKNYTTT----CSVLYWVLNLLQIPV---AGGVSAYE 312
           W+    L+A W ++  +QI    A+ ++      C  +YW+L  +Q  V    G  + + 
Sbjct: 372 WRYMLFLLACWLVLFIIQILSGSAEKHSPVGAPLCGAVYWILFAIQESVLFAMGCFTIWR 431

Query: 313 AVALYKGRRKI-----ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 367
            ++L + R ++     +  G    +W  + +  Y      AG+ G  +G+GG  +L P F
Sbjct: 432 NLSLNRLRDRLGYPWYSKNGLGDIRWTRALIFGYPVWAFIAGVFGSWVGIGGSSLLIP-F 490

Query: 368 LEL---GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           L L     P  V S+ +L  +  S+S + V Y    R  + Y L+F   +++ ++ G  +
Sbjct: 491 LNLVGRADPATVQSSMSLSNLLGSASGAFV-YLAQGRLNISYGLFFGLFALLGSYTGVWM 549

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMG----FDNIC 477
           +  L++     ++ +F L+ I FV+AL   G  L   I+     E  G    F NIC
Sbjct: 550 VYFLVERYQIRALFVFALT-ICFVTAL---GAILYIAIRNTLGVEAAGVGWVFTNIC 602


>gi|159479540|ref|XP_001697848.1| hypothetical protein CHLREDRAFT_151478 [Chlamydomonas reinhardtii]
 gi|158273946|gb|EDO99731.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 929

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 65/115 (56%), Gaps = 1/115 (0%)

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           PL L LG+ PQV++AT+   + FSSS +++++ L       YA  F A S +AA  G  V
Sbjct: 816 PLLLHLGVHPQVTAATSGAMVLFSSSTALLQFALAGELNAQYAAVFAAASAVAALAGTLV 875

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 479
           +  L++  GR SI++  L+ ++ +  +S+   GL +  K +   + +GF  +C T
Sbjct: 876 VAGLVRRSGRPSIVVLALAGVMGLGLVSVAVFGLQRAAKDLGAGD-IGFSQLCAT 929


>gi|294925749|ref|XP_002778995.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239887841|gb|EER10790.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 397

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 62/107 (57%), Gaps = 1/107 (0%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
            +++P    ++G DA  +  +SK  + G AVSA ++NLRQRHPT D P+ID D+ LL +P
Sbjct: 34  ALYMPAFVAVMG-DAHWAVPLSKVAINGVAVSATIFNLRQRHPTYDQPLIDLDIGLLLEP 92

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE 207
           L +LG  +GV  NV      I   L+++  + +   F K ++  + E
Sbjct: 93  LTLLGSMLGVYLNVAMTSVEIFSCLVLVLSITAALTFRKAIQRRRLE 139


>gi|126465319|ref|YP_001040428.1| hypothetical protein Smar_0408 [Staphylothermus marinus F1]
 gi|126014142|gb|ABN69520.1| protein of unknown function DUF81 [Staphylothermus marinus F1]
          Length = 250

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/83 (40%), Positives = 52/83 (62%), Gaps = 3/83 (3%)

Query: 348 AGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 406
           AG+  G+LG+GGG +  PL  + LG+P + + AT+ F +  ++S   +  YLLK +  PY
Sbjct: 147 AGLTSGMLGIGGGVLKVPLMTMVLGLPIRTAIATSSFMVGLTASAGSL-VYLLKGYVNPY 205

Query: 407 ALYFFALSII-AAFVGQHVLKKL 428
           A+   AL II  A +G H+LKK+
Sbjct: 206 AVASLALGIIPGATLGAHMLKKI 228


>gi|403359664|gb|EJY79493.1| hypothetical protein OXYTRI_23231 [Oxytricha trifallax]
          Length = 420

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 85/376 (22%), Positives = 157/376 (41%), Gaps = 64/376 (17%)

Query: 129 GAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLVLG----ISIGVAFNVIFADWMITV 183
            ++S  + N  QRHP   +   IDYD+  L  P+L LG    + IG   N +    + T 
Sbjct: 42  SSLSRFIINYAQRHPKQPERLAIDYDIIELSMPILYLGTLFGVQIGTRLNEV---QLATT 98

Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKR------EAAKQLELIVLGNGYQTEECKCDPE 237
              VLF V S +   K +   ++E   K+          + + ++L    +T +     +
Sbjct: 99  FACVLFFV-SYKTTTKAISMIREENAKKKMKYLSYTDKNEYKDLLLPKSNRTSKQNSPIK 157

Query: 238 YLSNDT-----------------------TPEETREPKKSKVSIIENIYWK--------- 265
              +D+                       TP E +  ++ K  ++E  Y +         
Sbjct: 158 DRESDSALQKVSHQVNQNLSSVQVQLFSDTPFEKQNQQQEKRQMLEKKYQQVLKDQSQHF 217

Query: 266 ------EFGL---LVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ-IPVAGGVSAYEAVA 315
                  FGL    + + + +L   +   Y     +L+ +  +   I  A  +    +V 
Sbjct: 218 TLKRSLTFGLTLGFLMITSTLLNYYVNPMYGYASLLLFLIYAVFSTIKKAQYLQETHSVK 277

Query: 316 LYKGRRKIASKGDEGTKWRASQLVF--YCACGITAGMVGGLLGLGGGFILGPLFLELGIP 373
           L  G +           +   +++F     C I +G++GG++G+ GG IL PLFL+LG+ 
Sbjct: 278 LQMGYQF----DQNDLNYENYKVIFNIILLCHI-SGILGGVVGIAGGIILAPLFLQLGML 332

Query: 374 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 433
           P + ++T  +    S+     +Y+ L      Y   F  L II +++G  V+ K+++  G
Sbjct: 333 PVIVASTNQYLALISTIAVSSQYWYLGMMNWQYCSVFGFLGIIGSYIGIKVVMKIVQKSG 392

Query: 434 RASIIIFTLSFIIFVS 449
           R SI++  L+F++F S
Sbjct: 393 RQSIMVVILAFVLFTS 408


>gi|145539706|ref|XP_001455543.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423351|emb|CAK88146.1| unnamed protein product [Paramecium tetraurelia]
          Length = 484

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 46/79 (58%), Gaps = 1/79 (1%)

Query: 92  GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVI 150
            S+G +GGG +  P L  I+ FDA  +  ++ C++    +  +V+   Q HP + + PVI
Sbjct: 72  ASLGALGGGIVKRPFLQSILNFDASIAGDITACLMISAQIVNMVFIFLQNHPDVPERPVI 131

Query: 151 DYDLALLFQPLLVLGISIG 169
           +YD+A+++   + + + +G
Sbjct: 132 NYDIAIIYTLAIPVSLCLG 150


>gi|407465090|ref|YP_006775972.1| hypothetical protein NSED_06140 [Candidatus Nitrosopumilus sp. AR2]
 gi|407048278|gb|AFS83030.1| hypothetical protein NSED_06140 [Candidatus Nitrosopumilus sp. AR2]
          Length = 255

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 48/93 (51%), Gaps = 4/93 (4%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
            G  AG++G ++GLGGG I+ P+   LG PP  +++ +LFA   ++  S + Y   KR  
Sbjct: 10  LGFAAGILGSMIGLGGGVIVVPVLTFLGFPPTAAASNSLFAALSNAIASTISYSKQKRIE 69

Query: 404 VPYALYFFALSI----IAAFVGQHVLKKLIKIL 432
            P  L    LSI    + A +   V   + KIL
Sbjct: 70  YPLGLKLGLLSIPGTVLGALISSDVAPDVFKIL 102


>gi|387196905|gb|AFJ68782.1| hypothetical protein NGATSA_3034300 [Nannochloropsis gaditana
           CCMP526]
          Length = 278

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 50/181 (27%), Positives = 99/181 (54%), Gaps = 8/181 (4%)

Query: 293 LYWVLNLLQIPVAGGVSAY-EAVALYKGRRKIAS-----KGDEGTKWRASQLVFYCACGI 346
           +YW + LL +P AG    Y   + L +   K+ +     +GD   +W  +  + +    +
Sbjct: 91  MYWFVTLLTLPWAGAFFLYIRRLILKEQEAKVLAHYPFVEGD--IRWTPTSTLKFPLLCV 148

Query: 347 TAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 406
            AG++ G+ G+GGG + GPL LE+G+ P V++A+A   I ++S+ + V +Y+    P  Y
Sbjct: 149 MAGVLAGVFGVGGGIVKGPLMLEMGVLPPVAAASAATMILYTSASATVAFYVFGLIPGDY 208

Query: 407 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 466
            L+FF    +   VGQ +L +L++   + S+I+ ++  +I +SA+ +    +A  ++  +
Sbjct: 209 GLFFFFWGFLCTGVGQILLLRLLQHSHKQSLIVLSIGLVITLSAVMMSFQAVADYLENPD 268

Query: 467 H 467
           +
Sbjct: 269 N 269


>gi|294887822|ref|XP_002772241.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239876300|gb|EER04057.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 116

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/114 (23%), Positives = 59/114 (51%), Gaps = 1/114 (0%)

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           P+ L LG+ P+ ++AT    I  +S+ + + + L   FP    L+   +  I A +G+ +
Sbjct: 3   PVVLSLGLDPKQATATTSIVIFATSTSTALSFALGGYFPPASDLWIVVMPFIGALLGKTI 62

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK-RIEHKEYMGFDNIC 477
           + +LI   GR S+++  L  ++ +  ++    G+  ++K  +  +E + F + C
Sbjct: 63  VARLIAKTGRMSMLVLLLGTVVIIGGITTISTGIVSVVKGALNGEEVVQFGSFC 116


>gi|145515812|ref|XP_001443800.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411200|emb|CAK76403.1| unnamed protein product [Paramecium tetraurelia]
          Length = 439

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 72/334 (21%), Positives = 136/334 (40%), Gaps = 29/334 (8%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLV 163
           P+LNL++ + A  +  V  C +        +  + ++HP   ++P+++++++++F   + 
Sbjct: 66  PILNLMLNYPASIATQVGDCFLFVTTTLNSLLLIFEKHPDHPELPLVNFEISIIFNQTIP 125

Query: 164 LGISIGVAFNVIFADWMITVLLIVLF---LVLSTRAFL-KGVETWKKETITKREAAKQLE 219
           L  S+G      F    I    I LF    +L    FL K   + K+  I KR+     +
Sbjct: 126 LAWSVGA-----FLQQRIPQFAIYLFQLCFMLGAIPFLWKFTHSQKQLEIDKRDK----K 176

Query: 220 LIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILA 279
           ++V+      E+   +    + +    E       K   I+N+ +  FG  +    I+L 
Sbjct: 177 VLVIEKIKTKEDMANETSLNAKELQQYEIFYILNHKKIQIKNLCFI-FGSFIVNQTIVLM 235

Query: 280 LQIAKNYT----TTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK------IASKGDE 329
                N +      C++     NL+ I +      Y  +  +  R +      +  + D 
Sbjct: 236 RSNKFNNSIIGIDACTLEN---NLILILILCANCIYTILVYWNKRNEEFYKDLVQYRPDY 292

Query: 330 GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSS 389
                     FY   G  AG   G +G+GGG I+  + L   I  + ++ATA F     +
Sbjct: 293 RYFTPKKTFWFYYLGGCLAGFSTGFIGMGGGLIMVSILLHKKIIAREAAATAGFGTFMIA 352

Query: 390 SMSVVEYYLLKRFPVPYALYFFALSIIA-AFVGQ 422
             S+++ +L K         FF L II  AF+ +
Sbjct: 353 LNSLIQLFLQKTITTGQMFTFFGLGIIGLAFITK 386


>gi|221485444|gb|EEE23725.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 482

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 89/354 (25%), Positives = 154/354 (43%), Gaps = 65/354 (18%)

Query: 114 DAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
           D   +  +SK  + G  + +V++NL ++ P+ ++ +I Y+LA + +P  +LG  +GV  N
Sbjct: 41  DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLLGGVLGVLLN 100

Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET--ITKREAAKQLELIVLGNGYQTEE 231
           ++  D  I   L+++    + +   +G+  ++ E+  + +R A    E   L    + EE
Sbjct: 101 IVMTDIQIISCLVLVLSFTTYKTTRRGLLQYQTESRLLAERAAVLSREHHSLSTRDREEE 160

Query: 232 CK---------------CDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG-----LL- 270
                            C   +L+  +  E    P+     + E     + G     LL 
Sbjct: 161 TSSLLAGEEPRASSAPACGENFLAVSSLAEMMETPQVLLDPLPEGCEADDLGAKGEALLP 220

Query: 271 --VAVWAIILALQI----AKNYTTTCSVLYWV----LNLLQIPVAGGV------SAYE-- 312
              A  A+IL  +     AK++ +  S++Y +    +N L +  AGG        A++  
Sbjct: 221 SSSAESAVILKERREILEAKHHPSWISLVYLIFDVAVNTLCLLAAGGPVAVVCGEAWQQA 280

Query: 313 -----------AVALY-------KGRRKIASKGDE-----GTKWRASQ-LVFYCACGITA 348
                      A ALY       K RR+     DE     G  W + + +V Y    + A
Sbjct: 281 LILFLISFHVFATALYARWLLKSKRRREQLGIQDEDVEHGGLGWVSPRTVVLYPFLSLLA 340

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           G+  G LG+GGG I GPL LE+G+    +  TA F I F+SS + ++Y  L R 
Sbjct: 341 GVAAGSLGIGGGLIKGPLLLEIGLNSLSAVTTANFMIFFTSSANALQYATLGRL 394


>gi|237835457|ref|XP_002367026.1| hypothetical protein TGME49_046180 [Toxoplasma gondii ME49]
 gi|211964690|gb|EEA99885.1| hypothetical protein TGME49_046180 [Toxoplasma gondii ME49]
 gi|221506300|gb|EEE31935.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 482

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 89/354 (25%), Positives = 154/354 (43%), Gaps = 65/354 (18%)

Query: 114 DAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
           D   +  +SK  + G  + +V++NL ++ P+ ++ +I Y+LA + +P  +LG  +GV  N
Sbjct: 41  DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLLGGVLGVLLN 100

Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET--ITKREAAKQLELIVLGNGYQTEE 231
           ++  D  I   L+++    + +   +G+  ++ E+  + +R A    E   L    + EE
Sbjct: 101 IVMTDIQIISCLVLVLSFTTYKTTRRGLLQYQTESRLLAERAAVLSREHHSLSTRDREEE 160

Query: 232 CK---------------CDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG-----LL- 270
                            C   +L+  +  E    P+     + E     + G     LL 
Sbjct: 161 TSSLLAEEEPRASSAPACGENFLAVSSLAEMMETPQVLLDPLPEGCEADDLGAKGEALLP 220

Query: 271 --VAVWAIILALQI----AKNYTTTCSVLYWV----LNLLQIPVAGGV------SAYE-- 312
              A  A+IL  +     AK++ +  S++Y +    +N L +  AGG        A++  
Sbjct: 221 SSSAESAVILKERREILEAKHHPSWISLVYLIFDVAVNTLCLLAAGGPVAVVCGEAWQQA 280

Query: 313 -----------AVALY-------KGRRKIASKGDE-----GTKWRASQ-LVFYCACGITA 348
                      A ALY       K RR+     DE     G  W + + +V Y    + A
Sbjct: 281 LILFLISFHVFATALYARWLLKSKRRREQLGIQDEDVEHGGLGWVSPRTVVLYPFLSLLA 340

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           G+  G LG+GGG I GPL LE+G+    +  TA F I F+SS + ++Y  L R 
Sbjct: 341 GVAAGSLGIGGGLIKGPLLLEIGLNSLSAVTTANFMIFFTSSANALQYATLGRL 394


>gi|83950218|ref|ZP_00958951.1| membrane protein [Roseovarius nubinhibens ISM]
 gi|83838117|gb|EAP77413.1| membrane protein [Roseovarius nubinhibens ISM]
          Length = 307

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 61/106 (57%), Gaps = 2/106 (1%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           + G+  G++ G+ G+GGGF++ PL   +GIPP V+ AT+   I  +S  +++ +   +  
Sbjct: 19  SIGLMVGLLSGIFGVGGGFLITPLLFFIGIPPAVAVATSANQIVAASFSAILAHLRRRTV 78

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
            +P  L      ++ + VG ++   L++ LG+  +++  L +++F+
Sbjct: 79  DIPMGLALMIGGLLGSTVGIYIF-NLLRSLGQVDLLV-NLFYVVFL 122


>gi|403360891|gb|EJY80141.1| hypothetical protein OXYTRI_22577 [Oxytricha trifallax]
          Length = 388

 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 1/114 (0%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP-VIDYDLALLFQ 159
           GI +P L + +    +    ++       +V+  VYN  Q+HP      +IDY++  L  
Sbjct: 66  GIVIPFLMIFMKMPIQECIPLANIFAMISSVTRFVYNFNQKHPYRPFRMIIDYEIVTLTM 125

Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
           P++  G  IGV    +     + +LL ++      + F K ++T+ KET  +R+
Sbjct: 126 PMVYFGSLIGVYAGSLMNQLTLVILLQIVLAFTLYKTFQKALQTYIKETNRRRQ 179



 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 53/93 (56%), Gaps = 6/93 (6%)

Query: 364 GPLFLELGIPPQVSSATALFA---ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 420
            P+FL LG+ P V++AT  +     TFS S+   ++    +    YA    A+ +  A +
Sbjct: 292 SPVFLSLGLLPSVTAATNQYIGMISTFSVSL---QFIYKGQLNYSYAYVIGAVVLFTAII 348

Query: 421 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
           G  V+++++K  GR SII+F +SF++ +S L L
Sbjct: 349 GLSVVERVVKKSGRQSIIVFIISFVLLISFLVL 381


>gi|78042783|ref|YP_361264.1| hypothetical protein CHY_2470 [Carboxydothermus hydrogenoformans
           Z-2901]
 gi|77994898|gb|ABB13797.1| putative membrane protein [Carboxydothermus hydrogenoformans
           Z-2901]
          Length = 266

 Score = 47.4 bits (111), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 1/82 (1%)

Query: 344 CGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
            GI  GM+G L+G GGGFIL P L L     PQ ++ T+LF + F++    + Y   KR 
Sbjct: 9   IGIFVGMIGTLIGAGGGFILIPYLILVAKFSPQTAAGTSLFMVFFNALSGSIAYIRQKRV 68

Query: 403 PVPYALYFFALSIIAAFVGQHV 424
               A YF   +I  A  G ++
Sbjct: 69  DFRTAFYFALATIPGAIFGAYL 90



 Score = 41.6 bits (96), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 61/148 (41%), Gaps = 7/148 (4%)

Query: 83  IIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH 142
           IIG F    G++ G GGG I +P L L+  F  +++A  S  MV   A+S  +  +RQ+ 
Sbjct: 8   IIGIFVGMIGTLIGAGGGFILIPYLILVAKFSPQTAAGTSLFMVFFNALSGSIAYIRQKR 67

Query: 143 PTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVE 202
                  +D+  A  F    + G   G   N      +  VL  +L   L+ R F     
Sbjct: 68  -------VDFRTAFYFALATIPGAIFGAYLNSFLHSRLFNVLFALLLFFLAVRTFFHKTS 120

Query: 203 TWKKETITKREAAKQLELIVLGNGYQTE 230
             KKE    +    +   I+   G + E
Sbjct: 121 NAKKENKETKIPGHETREIIDAEGNKYE 148



 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 39/70 (55%), Gaps = 3/70 (4%)

Query: 329 EGTKWRASQLVFYCACGIT--AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAIT 386
           EG K+  S    +   GI+   G++  LLG+GGG I  PL   LG P  +++AT+ F + 
Sbjct: 143 EGNKYEYS-FNLWLGIGISLIVGILSSLLGIGGGIIHVPLMGFLGFPMHIATATSHFILV 201

Query: 387 FSSSMSVVEY 396
            +S + V+ +
Sbjct: 202 ITSLIGVISH 211


>gi|402849112|ref|ZP_10897353.1| membrane protein, putative [Rhodovulum sp. PH10]
 gi|402500640|gb|EJW12311.1| membrane protein, putative [Rhodovulum sp. PH10]
          Length = 310

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 68/129 (52%), Gaps = 12/129 (9%)

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           LVF  A G   G + G+ G+GGGF++ PL + +G+ P VS AT    IT SS    + Y+
Sbjct: 16  LVF--AMGTAVGFISGMFGIGGGFLMTPLLIFIGVSPAVSVATVAAHITASSFSGALTYW 73

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHV---LKKLIKILGRASIIIFTLSFIIFVSALSLG 454
             +   +P A       +I    G  +   L  +++ LG+  + I  L++++ +   S+G
Sbjct: 74  RRRAIDLPLAFVL----LIGGLTGTAIGVWLFGVLRALGQLDLFI-ALAYVVLLC--SVG 126

Query: 455 GVGLAKMIK 463
           G+ +A+ ++
Sbjct: 127 GLMVAESVR 135


>gi|119873364|ref|YP_931371.1| hypothetical protein Pisl_1881 [Pyrobaculum islandicum DSM 4184]
 gi|119674772|gb|ABL89028.1| protein of unknown function DUF81 [Pyrobaculum islandicum DSM 4184]
          Length = 243

 Score = 46.6 bits (109), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 40/68 (58%), Gaps = 3/68 (4%)

Query: 66  KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
           K+  P     W   +G ++ F G    S+ GVGGG IFVP L LI GFD KS+AA+S  +
Sbjct: 112 KNARPRRNIPW---LGLLLIFIGGFASSLFGVGGGTIFVPALILISGFDPKSAAAMSMGI 168

Query: 126 VTGGAVSA 133
           +   AVS+
Sbjct: 169 IFPTAVSS 176


>gi|444375855|ref|ZP_21175107.1| hypothetical protein D515_1881 [Enterovibrio sp. AK16]
 gi|443680083|gb|ELT86731.1| hypothetical protein D515_1881 [Enterovibrio sp. AK16]
          Length = 257

 Score = 46.6 bits (109), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 1/91 (1%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           + L+      + AG +  + G GGG ++ P  L +G+PP ++  T   + TF+SS +   
Sbjct: 8   NTLIILGMVALAAGFIDAVAG-GGGMLMVPALLSIGLPPHIALGTNKLSATFASSTAAFT 66

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLK 426
           YY  K F   + ++ F  + I A +G  V+ 
Sbjct: 67  YYRKKLFSPSFWIHAFVATFIGAILGTLVVD 97


>gi|75674383|ref|YP_316804.1| hypothetical protein Nwi_0184 [Nitrobacter winogradskyi Nb-255]
 gi|74419253|gb|ABA03452.1| Protein of unknown function DUF81 [Nitrobacter winogradskyi Nb-255]
          Length = 343

 Score = 46.6 bits (109), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 68/134 (50%), Gaps = 4/134 (2%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            ++   A G   G V G+ G+GGGF++ PL + +GI P V+ AT    I  SS    + Y
Sbjct: 48  NVLLILAMGAAVGFVSGMFGIGGGFLMTPLLIFVGIAPAVAVATVTSHIAASSFSGALSY 107

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL--SLG 454
           +  +      AL   +  +I   +G     +L + LG+  ++I  LS+++ +S +  ++ 
Sbjct: 108 WRKRAVDPMLALVLLSGGVIGTILGVLTFTRL-RSLGQLDLLI-ALSYLVLLSGVGAAMS 165

Query: 455 GVGLAKMIKRIEHK 468
             GL  M++  +++
Sbjct: 166 WEGLRAMLRARQNR 179


>gi|145483119|ref|XP_001427582.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394664|emb|CAK60184.1| unnamed protein product [Paramecium tetraurelia]
          Length = 491

 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 69/140 (49%), Gaps = 18/140 (12%)

Query: 92  GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQR-------HPT 144
           GS+G +GGG +  P L  I+ +DA  +  ++ C++    +  +++   Q+       HP 
Sbjct: 75  GSLGALGGGTVKRPFLQAILNYDASIAGDITSCLMFSAQLVNMIFIFFQKYIMNQFSHPD 134

Query: 145 LD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLI---VLFLVLSTRAFLKG 200
           +D  P+I+Y++ L++     LGI I +      A+++  + L+   VLF V+ +   L  
Sbjct: 135 VDERPIINYEIGLIY----TLGIPISMCMGSELANFLPLLPLLSFQVLFFVVISPVLLYY 190

Query: 201 VETWKKETITKREAAKQLEL 220
               KK+ I +R      +L
Sbjct: 191 A---KKQDIIERSKETNNDL 207


>gi|85705724|ref|ZP_01036821.1| membrane protein [Roseovarius sp. 217]
 gi|85669714|gb|EAQ24578.1| membrane protein [Roseovarius sp. 217]
          Length = 304

 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 61/117 (52%), Gaps = 4/117 (3%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGFIL PL   +GIPP V+ AT    I  +S  +V+ ++  K   +        
Sbjct: 29  GMFGVGGGFILTPLLFFIGIPPAVAVATGANQIMAASFSAVLAHFRRKTVDLEMGTVLLI 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 469
             +I A +G  V   L K LG+  +++ TL +++F+    +GG+   + ++ I +  
Sbjct: 89  GGLIGAAMGVFVFNYL-KSLGQVDLLV-TLCYVVFLGV--IGGMMFIESLRSIRNTR 141


>gi|85713974|ref|ZP_01044963.1| hypothetical protein NB311A_07458 [Nitrobacter sp. Nb-311A]
 gi|85699100|gb|EAQ36968.1| hypothetical protein NB311A_07458 [Nitrobacter sp. Nb-311A]
          Length = 371

 Score = 46.2 bits (108), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 58/117 (49%), Gaps = 7/117 (5%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           A G   G V G+ G+GGGF++ PL + +GI P V+ AT    I  SS    + Y+  +  
Sbjct: 82  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGIAPAVAVATVTSHIAASSFSGALSYWRKRAV 141

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 459
               A+   +  II    G     +L + LG+  ++I  LS+++ +S     GVG A
Sbjct: 142 DPMLAVILLSGGIIGTVFGVSTFTRL-RSLGQLDLLI-ALSYLVLLS-----GVGAA 191


>gi|407798212|ref|ZP_11145120.1| hypothetical protein OCGS_0193 [Oceaniovalibus guishaninsula
           JLT2003]
 gi|407059648|gb|EKE45576.1| hypothetical protein OCGS_0193 [Oceaniovalibus guishaninsula
           JLT2003]
          Length = 303

 Score = 46.2 bits (108), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 57/103 (55%), Gaps = 2/103 (1%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G++ G+ G+GGGF++ PL   +GIPP V+ AT+   I  SS   V+ ++  +   +   L
Sbjct: 25  GILSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIVASSVSGVLAHFRRRTVDLRMGL 84

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
              A  ++ +  G  +   L + +G+  +++  L +++F+ A+
Sbjct: 85  VLLAGGLVGSAAGVQIFNVL-RQMGQVDLMV-RLCYVVFLGAI 125


>gi|421872468|ref|ZP_16304086.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
 gi|372458441|emb|CCF13635.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
          Length = 275

 Score = 46.2 bits (108), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 7/98 (7%)

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSS 390
           ++ +   G+ AG++G L+G+GGG    P  L  G       I PQ++S T+L  I  ++ 
Sbjct: 7   ILLFMFIGLFAGVMGSLVGIGGGMFFVPALLYFGNMYEPGSITPQIASGTSLLVIAITAL 66

Query: 391 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
            S + ++  K+  V  A+ FF  S   A VG +V K L
Sbjct: 67  SSSISFWKQKKVEVQAAVLFFIGSAPGAIVGVYVNKWL 104


>gi|339007279|ref|ZP_08639854.1| hypothetical membrane protein [Brevibacillus laterosporus LMG
           15441]
 gi|338776488|gb|EGP36016.1| hypothetical membrane protein [Brevibacillus laterosporus LMG
           15441]
          Length = 275

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 7/98 (7%)

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSS 390
           ++ +   G+ AG++G L+G+GGG    P  L  G       I PQ++S T+L  I  ++ 
Sbjct: 7   ILLFMFIGLFAGVMGSLVGIGGGMFFVPALLYFGNMYEPGSITPQIASGTSLLVIAITAL 66

Query: 391 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
            S + ++  K+  V  A+ FF  S   A VG +V K L
Sbjct: 67  SSSISFWKQKKVEVQAAVLFFIGSAPGAIVGVYVNKWL 104


>gi|338211621|ref|YP_004655674.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336305440|gb|AEI48542.1| protein of unknown function DUF81 [Runella slithyformis DSM 19594]
          Length = 301

 Score = 46.2 bits (108), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 44/95 (46%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           S LVFY   GI A +V G LG+G G       L LG+PP VSS +   +  F++  S + 
Sbjct: 57  SDLVFYLLVGIGAQLVDGALGMGYGATATSFLLTLGVPPAVSSTSVHLSEMFTTGASAIS 116

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           ++  K              +  + +G ++L  +I 
Sbjct: 117 HFKFKNINKKLFKTLLIPGVTGSIIGAYLLSDVID 151


>gi|294954881|ref|XP_002788341.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239903653|gb|EER20137.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 318

 Score = 45.8 bits (107), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 68/276 (24%), Positives = 119/276 (43%), Gaps = 37/276 (13%)

Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
           P +V+G  +GV       +    VL +   ++ S   F KG+  WK ET     +A QL 
Sbjct: 3   PPMVVGGLLGVYLYTWLPEIFQLVLYVFTAILASCMGFKKGIGLWKSET-EGASSAGQLG 61

Query: 220 LIVLGNGYQTEECKCDPEYLSNDTT-PEETREPKKSKVSIIENIYWKEFGL---LVAVWA 275
           ++              P  ++ DT  P  T   +    SI  ++ +K+  L   L+AVW 
Sbjct: 62  VV-------------PPPCVTEDTVLPSVTSRAR----SISLSLKYKKAILITTLLAVWI 104

Query: 276 IILALQIAKNYTTT--------CSVLYWVLN----LLQIPVAG-GVSAYEAVALYKGRRK 322
            ++  ++    ++T        C+ LYW L+    ++ + V G  V A ++ A+ K   K
Sbjct: 105 AVILSRLLLGSSSTPSIIGIPYCTGLYWALSVIVCIMLMAVPGLFVVAIKSAAMLKLAVK 164

Query: 323 IASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 382
           ++         ++S LV     GI  G +  L+G GGG ++ PL L + + PQ ++AT  
Sbjct: 165 LSGAMLCNATAKSSALVLGNIIGI--GFIAALVGQGGGSLITPLLLYMELNPQQAAATGS 222

Query: 383 FAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 418
             +  +SS   + + L    P    ++   L    A
Sbjct: 223 VVMLITSSSLALSFGLGGFLPAASDMWIAVLPFFGA 258


>gi|149202975|ref|ZP_01879946.1| membrane protein [Roseovarius sp. TM1035]
 gi|149143521|gb|EDM31557.1| membrane protein [Roseovarius sp. TM1035]
          Length = 304

 Score = 45.8 bits (107), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 61/117 (52%), Gaps = 4/117 (3%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGFIL PL   +GIPP V+ AT    I  +S  +V+ ++  K   +        
Sbjct: 29  GMFGVGGGFILTPLLFFIGIPPAVAVATGANQIMAASFSAVLAHFRRKTVDLEMGTVLLI 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 469
             +I A +G  V   L K LG+  +++ TL +++F+    +GG+   + ++ I +  
Sbjct: 89  GGLIGAALGVMVFNYL-KSLGQVDLLV-TLCYVVFLGV--IGGMMFIESLRAIRNTR 141


>gi|212704214|ref|ZP_03312342.1| hypothetical protein DESPIG_02269 [Desulfovibrio piger ATCC 29098]
 gi|212672418|gb|EEB32901.1| hypothetical protein DESPIG_02269 [Desulfovibrio piger ATCC 29098]
          Length = 420

 Score = 45.4 bits (106), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 5/122 (4%)

Query: 307 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 366
           G  AYEA  +  G+   A      T + AS      A G+ AG++ G +G GGGFI+ P 
Sbjct: 51  GAEAYEAHEMLGGKSAYAPAFYTTTIFLAS-----IAVGLAAGLITGCIGAGGGFIITPA 105

Query: 367 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 426
            +  G+   ++  T LF I   + M    +  L    V  A+ F   S    FVG  + K
Sbjct: 106 LMAAGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSVKLAIAFLVGSGAGTFVGGAINK 165

Query: 427 KL 428
            L
Sbjct: 166 GL 167


>gi|145514580|ref|XP_001443195.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410573|emb|CAK75798.1| unnamed protein product [Paramecium tetraurelia]
          Length = 472

 Score = 45.1 bits (105), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 63/307 (20%), Positives = 130/307 (42%), Gaps = 25/307 (8%)

Query: 99  GGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDI-PVIDYDLALL 157
           GG +  P+L +I+ +    +  ++ C + GG +      + +++P  +  P+I+Y +A++
Sbjct: 75  GGVLKGPLLEMILNYSQSEATHIAYCFMFGGTLLNTALLMFEKNPEDERRPIINYRIAII 134

Query: 158 FQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV-----------LSTRAFLKGVETWKK 206
           F   +    ++G +    F   + T++L  LFL             + +A LK  E  KK
Sbjct: 135 FNLAVPFATNLGSSL-ASFLPQLYTLILQELFLFGVAPILWQKAKDAKKAELKISEGSKK 193

Query: 207 ET---ITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIY 263
           E    +       ++EL  +   ++ +E K   +  SN++     ++  ++ + ++  ++
Sbjct: 194 ENTQNLNDSNLNPKIELQTIEGIHEQQEYKIS-DANSNESLYSHFKQESENILPLMPILF 252

Query: 264 -WKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK 322
               FGL   ++  + +    K          W  + +   +      Y+ +    G ++
Sbjct: 253 ILGSFGL-NQIFIQMRSTNPNKPSYVGIEDCTWQNDFMIFILILANVLYDYLIWQFGTKQ 311

Query: 323 ------IASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQV 376
                 I     E      S+     A G  AG V G LG+G GF++ P  L  G+ P+ 
Sbjct: 312 EKQFNQINFLPKERYFTPISKFFKIYAGGFLAGFVSGFLGMGAGFVMVPTLLFSGLIPRC 371

Query: 377 SSATALF 383
           +SAT+ F
Sbjct: 372 ASATSAF 378


>gi|145499407|ref|XP_001435689.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402823|emb|CAK68292.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 45.1 bits (105), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 91/375 (24%), Positives = 169/375 (45%), Gaps = 43/375 (11%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           ++G ++G         G +       P++ +++G  +K +   +  M+ GG++   +   
Sbjct: 19  ILGILVGLSQAGGIGGGPIIS-----PVMMVLLGCPSKKAIWNTYIMLLGGSLGNFLRLG 73

Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
           ++R    + P+I+Y L  +  PLL+ G  +GVA       W+  +++++    +    FL
Sbjct: 74  KERTANGNAPLINYQLVQITLPLLLAGAILGVA----TGKWLPKLIIVIFLFGILMTVFL 129

Query: 199 KGVETWKKETITKREAAKQLELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 258
           K    + K     +E  +QL  + L +    +E        SN +      + K +++  
Sbjct: 130 KTKSLYAKT--RSKEMNEQLIPVELKDLTVQKE--------SNHSKELNILKEKDARLYP 179

Query: 259 IENIYWKEFGLLVAVWAIILALQIAKNYTTT-----CSVLYWVLNLLQIPVAGGVSAYEA 313
           IE +   E  L + +  ++  L+ +    +      C   Y  LN     V  G++ Y  
Sbjct: 180 IEPL--TEISLTILIIIVVTLLKGSGAVPSLLGVDFCGYGYHFLNF----VIFGIAFYNV 233

Query: 314 VALYKGRRKIASKGDE-----GTKWRASQL--VF--YCACGITAGMVGGLLGLGGGFILG 364
               +  RK  SK +E     G  +   ++  VF       + AG +GGL+GLGGG +L 
Sbjct: 234 ----QRYRKQISKDEEYRESIGYDFADGKMSSVFDITVKSSLYAGFLGGLVGLGGGVVLT 289

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           PL+LE GI P  ++A+A F + F+SS+SV    L   +     +    +S + +++    
Sbjct: 290 PLWLETGINPPRAAASATFTVLFTSSISVFIIALSGGYQFSEFIILGLVSSLGSYLVAGF 349

Query: 425 LKKLIKILGRASIII 439
           LKKL+K   R SI+I
Sbjct: 350 LKKLVKKYKRESILI 364


>gi|145479595|ref|XP_001425820.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392892|emb|CAK58422.1| unnamed protein product [Paramecium tetraurelia]
          Length = 494

 Score = 45.1 bits (105), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 46/86 (53%), Gaps = 8/86 (9%)

Query: 92  GSVGGVG-------GGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT 144
           G++GG         GG +  P L  I+ FDA  S  ++ C++    +  +V+   Q HP 
Sbjct: 75  GALGGTSLQIIFLKGGIVKRPFLQTILNFDASISGDITACLMISAQIVNMVFIFLQNHPD 134

Query: 145 L-DIPVIDYDLALLFQPLLVLGISIG 169
           + + PVI+YD+A+++   + + + +G
Sbjct: 135 VQERPVINYDIAIIYTLAIPVSLCLG 160


>gi|452824321|gb|EME31325.1| hypothetical protein Gasu_15590 [Galdieria sulphuraria]
          Length = 322

 Score = 44.7 bits (104), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 50/91 (54%), Gaps = 1/91 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
           G +AG +  L+G GGG +L PLF+   G+  + +  T+L A++F+S ++   Y+L  R  
Sbjct: 75  GFSAGCIASLVGAGGGVVLTPLFVTFYGLTQKQAQGTSLVAVSFNSLVASFIYFLGGRVL 134

Query: 404 VPYALYFFALSIIAAFVGQHVLKKLIKILGR 434
           +  A +    ++I A VG  V  KL  +  R
Sbjct: 135 LLPAFFLTLTAVICARVGAKVTSKLDNLFLR 165


>gi|298290659|ref|YP_003692598.1| hypothetical protein Snov_0646 [Starkeya novella DSM 506]
 gi|296927170|gb|ADH87979.1| protein of unknown function DUF81 [Starkeya novella DSM 506]
          Length = 308

 Score = 44.7 bits (104), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 3/54 (5%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA---TALFAITFSSSMS 392
            A GI  G + G+ G+GGGF++ PL + LG+PP V+ A   T + A +FS +++
Sbjct: 18  LALGIAVGFISGMFGVGGGFLMTPLLIFLGVPPAVAVASVSTHMAASSFSGTLT 71


>gi|315230493|ref|YP_004070929.1| hypothetical protein TERMP_00729 [Thermococcus barophilus MP]
 gi|315183521|gb|ADT83706.1| hypothetical protein TERMP_00729 [Thermococcus barophilus MP]
          Length = 254

 Score = 44.7 bits (104), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 42/81 (51%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y A G+  G++  L GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y   K
Sbjct: 7   YFAVGVVIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSALAYSRQK 66

Query: 401 RFPVPYALYFFALSIIAAFVG 421
           R      L   + +II A++G
Sbjct: 67  RVHYKVGLLLASTAIIGAYIG 87


>gi|303325610|ref|ZP_07356053.1| putative membrane protein [Desulfovibrio sp. 3_1_syn3]
 gi|345892217|ref|ZP_08843040.1| hypothetical protein HMPREF1022_01700 [Desulfovibrio sp.
           6_1_46AFAA]
 gi|302863526|gb|EFL86457.1| putative membrane protein [Desulfovibrio sp. 3_1_syn3]
 gi|345047356|gb|EGW51221.1| hypothetical protein HMPREF1022_01700 [Desulfovibrio sp.
           6_1_46AFAA]
          Length = 421

 Score = 44.7 bits (104), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 58/129 (44%), Gaps = 7/129 (5%)

Query: 302 IPVAGG--VSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 359
           +P+  G    AYE   +  G+   A      T + AS      A G+ AG++ G +G GG
Sbjct: 44  VPILAGCLAEAYEYHEMLGGKSAYAPAFYTTTIFLAS-----IAVGLAAGLITGCIGAGG 98

Query: 360 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 419
           GFI+ P  + +G+   ++  T LF I   + M    +  L       A+ F   SI+  F
Sbjct: 99  GFIITPALMAVGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSAKLAIAFLVGSIVGTF 158

Query: 420 VGQHVLKKL 428
           +G  + K L
Sbjct: 159 IGGAINKGL 167


>gi|317130121|ref|YP_004096403.1| hypothetical protein Bcell_3430 [Bacillus cellulosilyticus DSM
           2522]
 gi|315475069|gb|ADU31672.1| protein of unknown function DUF81 [Bacillus cellulosilyticus DSM
           2522]
          Length = 274

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 61/132 (46%), Gaps = 7/132 (5%)

Query: 71  DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
           +I +G+  ++G +IGF    C  + G+GGG + VP + L+ GF    + A S  M+   A
Sbjct: 148 EITYGYHPIIGILIGFVVGMCSGLFGIGGGSLMVPAMILLFGFPPHIAVATSMLMIFLSA 207

Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
           + + V ++   +       ID+   L   P   +G  +G A N       I  LL V  +
Sbjct: 208 ILSSVSHMALGN-------IDWLYVLALLPGAWIGGQLGAAINRKMQSDAIVNLLRVFLI 260

Query: 191 VLSTRAFLKGVE 202
           ++  R   +G+ 
Sbjct: 261 IIGIRLIYQGIT 272



 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 1/85 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
           G   GM  GL G+GGG ++ P + L  G PP ++ AT++  I  S+ +S V +  L    
Sbjct: 162 GFVVGMCSGLFGIGGGSLMVPAMILLFGFPPHIAVATSMLMIFLSAILSSVSHMALGNID 221

Query: 404 VPYALYFFALSIIAAFVGQHVLKKL 428
             Y L     + I   +G  + +K+
Sbjct: 222 WLYVLALLPGAWIGGQLGAAINRKM 246


>gi|256419285|ref|YP_003119938.1| hypothetical protein Cpin_0237 [Chitinophaga pinensis DSM 2588]
 gi|256034193|gb|ACU57737.1| protein of unknown function DUF81 [Chitinophaga pinensis DSM 2588]
          Length = 335

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 43/92 (46%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           +L+FY   G+ A  V G LG+  G     L L LGIPP V+SA+   A  F++  S + +
Sbjct: 79  ELLFYILVGLAAQTVDGALGMAYGATSSSLLLGLGIPPSVASASVHVAEVFTTGASGISH 138

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           +          LY     II +  G  ++  +
Sbjct: 139 FRFGNVNKKLFLYLLVPGIIGSIAGSFLISNM 170


>gi|452853551|ref|YP_007495235.1| conserved membrane protein of unknown function [Desulfovibrio
           piezophilus]
 gi|451897205|emb|CCH50084.1| conserved membrane protein of unknown function [Desulfovibrio
           piezophilus]
          Length = 309

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 66/129 (51%), Gaps = 9/129 (6%)

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           LVF  A G   G++ G+ G+GGGF++ PL +  GIPP V++A+    I  +S+   + +Y
Sbjct: 17  LVF--ALGGFVGLLSGIFGVGGGFLMTPLLIMFGIPPTVAAASDSNQIVGASTSGCLAHY 74

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVG 457
            L               ++  F G  V+K L K +G A  +I  ++++     L LGGVG
Sbjct: 75  RLGNVDFKMGFLLLIGGVLGGFAGVQVIKVL-KAMGNADFLI-NITYV-----LMLGGVG 127

Query: 458 LAKMIKRIE 466
               I+ ++
Sbjct: 128 SYMFIESVQ 136


>gi|242279660|ref|YP_002991789.1| hypothetical protein Desal_2192 [Desulfovibrio salexigens DSM 2638]
 gi|242122554|gb|ACS80250.1| protein of unknown function DUF81 [Desulfovibrio salexigens DSM
           2638]
          Length = 426

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 11/130 (8%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           A G+ AG++ G +G GGGFI+ P  +  GI   ++  T LF I   + M  V +  +   
Sbjct: 86  AIGLGAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKMGNV 145

Query: 403 PVPYALYFFALSIIAAFVG---QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 459
            VP A+ F   +I+ A  G     VL ++  +L  A        FI  V +L LG +G  
Sbjct: 146 SVPLAVVFLIGAIVGATAGGIINRVLYEINPVLSDA--------FITTVYSLMLGFLGFY 197

Query: 460 KMIKRIEHKE 469
            M   ++ ++
Sbjct: 198 AMFDYLKARK 207


>gi|399056281|ref|ZP_10743692.1| putative permease [Brevibacillus sp. CF112]
 gi|433543570|ref|ZP_20499974.1| hypothetical protein D478_07713 [Brevibacillus agri BAB-2500]
 gi|398046272|gb|EJL38897.1| putative permease [Brevibacillus sp. CF112]
 gi|432185127|gb|ELK42624.1| hypothetical protein D478_07713 [Brevibacillus agri BAB-2500]
          Length = 274

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 7/91 (7%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSV 393
           +   G+ AG++G + GLGGG    P  +          +PPQV++AT+L  I  ++  S 
Sbjct: 9   FLLIGVVAGVIGSIAGLGGGIFFVPALMFFANWYEPGSMPPQVAAATSLLVIAVTALSSS 68

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           + Y   K+  +  AL FF  S   A VG ++
Sbjct: 69  ISYIKQKKVDMQSALLFFIGSAPGAIVGVYL 99


>gi|268591933|ref|ZP_06126154.1| putative membrane protein [Providencia rettgeri DSM 1131]
 gi|291312322|gb|EFE52775.1| putative membrane protein [Providencia rettgeri DSM 1131]
          Length = 264

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 39/156 (25%), Positives = 71/156 (45%), Gaps = 23/156 (14%)

Query: 311 YEAVALYKGRRKIASKGDEGTK-------------WRASQLVFYCACGITAGMVGGLLGL 357
           +  V LY G R   ++GDE                W     + +   G+  G++ G+LG+
Sbjct: 101 FSLVMLYVGGRIFFNRGDEQNDAHCLINTETGKFIWNTKTALIFGGIGVITGLLTGMLGV 160

Query: 358 GGGFILGPL---FLELGIPPQVSSATALFAITFSSSMSV-VEYYLLKRFPVPYALYFFAL 413
           GGGFI+ P    F  L +   +  AT+L  I    S+S+ +  +   ++P   ++ F A 
Sbjct: 161 GGGFIIVPALRKFTNLNMKSII--ATSLMVIFCIGSISIFINVWNGFQYPKEISIVFIAS 218

Query: 414 SIIAAFVGQHVL----KKLIKILGRASIIIFTLSFI 445
            ++   +G+ ++     K I+IL  +++I   L  I
Sbjct: 219 CVVGLLIGRLIMHYISNKAIQILFSSAVIFVALGLI 254


>gi|254467336|ref|ZP_05080747.1| permease [Rhodobacterales bacterium Y4I]
 gi|206688244|gb|EDZ48726.1| permease [Rhodobacterales bacterium Y4I]
          Length = 285

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 55/101 (54%), Gaps = 2/101 (1%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  +   +   L
Sbjct: 3   GVLSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDIKMGL 62

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 449
              A  ++ A +G  V   L K LG+  +++  L +++F+ 
Sbjct: 63  VLQAGGLMGAALGVVVFNYL-KALGQVDLLV-KLCYVVFLG 101


>gi|170741286|ref|YP_001769941.1| hypothetical protein M446_3097 [Methylobacterium sp. 4-46]
 gi|168195560|gb|ACA17507.1| protein of unknown function DUF81 [Methylobacterium sp. 4-46]
          Length = 307

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 32/51 (62%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVV 394
            G   G V G+ G+GGGF++ P+ + LGIPP ++ AT    I  SS+ SV+
Sbjct: 20  LGAAVGFVSGVFGIGGGFLMTPILIVLGIPPAIAVATQTAPIVASSTTSVL 70


>gi|294500093|ref|YP_003563793.1| hypothetical protein BMQ_3337 [Bacillus megaterium QM B1551]
 gi|294350030|gb|ADE70359.1| putative membrane protein [Bacillus megaterium QM B1551]
          Length = 258

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 51/82 (62%), Gaps = 3/82 (3%)

Query: 349 GMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPY 406
           G V G+LG GG FIL P+ L  L IP +++ AT+L A+TF SS+ S V   ++ + P   
Sbjct: 153 GAVAGVLGAGGAFILVPVMLVVLKIPTRITIATSL-AVTFISSIGSTVGKLIMHQVPFIP 211

Query: 407 ALYFFALSIIAAFVGQHVLKKL 428
           AL   A S+IA+ +G  V +K+
Sbjct: 212 ALILVAASLIASPIGAKVGQKM 233


>gi|114048031|ref|YP_738581.1| hypothetical protein Shewmr7_2539 [Shewanella sp. MR-7]
 gi|113889473|gb|ABI43524.1| protein of unknown function DUF81 [Shewanella sp. MR-7]
          Length = 257

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 11/91 (12%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY---LLKR 401
           G+ AG +  ++G GGG +  P  L LGIPP  +  T  FA +F SSM+   YY   LLK 
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKIL 432
                   F+ ++ IA F+G  +   L+ +L
Sbjct: 76  -------SFWYMAFIATFIGAVLGSVLVYLL 99


>gi|113970806|ref|YP_734599.1| hypothetical protein Shewmr4_2471 [Shewanella sp. MR-4]
 gi|113885490|gb|ABI39542.1| protein of unknown function DUF81 [Shewanella sp. MR-4]
          Length = 257

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 11/91 (12%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY---LLKR 401
           G+ AG +  ++G GGG +  P  L LGIPP  +  T  FA +F SSM+   YY   LLK 
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKIL 432
                   F+ ++ IA F+G  +   L+ +L
Sbjct: 76  -------SFWYMAFIATFIGAVLGSVLVYLL 99


>gi|332295235|ref|YP_004437158.1| hypothetical protein Thena_0382 [Thermodesulfobium narugense DSM
           14796]
 gi|332178338|gb|AEE14027.1| protein of unknown function DUF81 [Thermodesulfobium narugense DSM
           14796]
          Length = 253

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 40/70 (57%)

Query: 320 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 379
           ++K  + G++    +   L      G  AG V GLLG+GGG ++ P+ + +G+P + ++A
Sbjct: 117 KQKKNNSGEDFQINKVQMLTIGVPIGAFAGFVAGLLGIGGGNMIIPILIFIGVPARFAAA 176

Query: 380 TALFAITFSS 389
           T  F + FSS
Sbjct: 177 TTSFIVLFSS 186


>gi|259415120|ref|ZP_05739042.1| membrane protein [Silicibacter sp. TrichCH4B]
 gi|259349030|gb|EEW60784.1| membrane protein [Silicibacter sp. TrichCH4B]
          Length = 305

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 5/104 (4%)

Query: 348 AGMVG---GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
            GMVG   G+ G+GGGF+L PL   +GIPP V+ AT    I  SS   V+ +   +   +
Sbjct: 21  GGMVGVLSGMFGVGGGFLLTPLLFFIGIPPAVAVATGANQIVASSVSGVLAHLRRRTVDL 80

Query: 405 PYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
                  A  +I A +G  +   L K LG+  + + TL +++F+
Sbjct: 81  KMGAVLQAGGLIGAALGVAIFNYL-KSLGQIDLFV-TLCYVVFL 122


>gi|254475159|ref|ZP_05088545.1| membrane protein [Ruegeria sp. R11]
 gi|214029402|gb|EEB70237.1| membrane protein [Ruegeria sp. R11]
          Length = 307

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 54/100 (54%), Gaps = 2/100 (2%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  +   +   L
Sbjct: 25  GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDIKMGL 84

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
                 +I A +G  V   L K LG+  +++  L +++F+
Sbjct: 85  VLQVGGLIGAALGVVVFNYL-KALGQVDLLV-KLCYVVFL 122


>gi|358344799|ref|XP_003636474.1| hypothetical protein MTR_042s0017 [Medicago truncatula]
 gi|355502409|gb|AES83612.1| hypothetical protein MTR_042s0017 [Medicago truncatula]
          Length = 107

 Score = 44.3 bits (103), Expect = 0.14,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 3/63 (4%)

Query: 228 QTEECKCDPEY--LSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK- 284
           +T +   + EY  LS  +     +  K  KV+IIEN+  KEFGLL  VW  ++A QIAK 
Sbjct: 8   ETIDSGAEVEYKPLSTGSNGALEKGAKMHKVTIIENVCRKEFGLLAFVWVSLIAAQIAKQ 67

Query: 285 NYT 287
           NYT
Sbjct: 68  NYT 70


>gi|452965232|gb|EME70258.1| permease [Magnetospirillum sp. SO-1]
          Length = 301

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 67/124 (54%), Gaps = 9/124 (7%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           A G   G++ G+ G+GGGF+L PL + LGIPP V+ A+    +  SS   V  ++  +R 
Sbjct: 20  AVGGGIGVLSGIFGVGGGFLLTPLLIMLGIPPAVAVASGANQVLGSSVSGVFAHW--RRR 77

Query: 403 PVPYALYFFALSIIAAFVGQHV-LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
            V   +  F L+   A     V L  L+K LG+  ++I +LS++ F     LG VGL  +
Sbjct: 78  NVDIRMAVFLLAGGFAGSAAGVWLFALLKRLGQIDLVI-SLSYVGF-----LGSVGLLML 131

Query: 462 IKRI 465
           ++ +
Sbjct: 132 VETL 135


>gi|84685127|ref|ZP_01013026.1| membrane protein [Maritimibacter alkaliphilus HTCC2654]
 gi|84666859|gb|EAQ13330.1| membrane protein [Rhodobacterales bacterium HTCC2654]
          Length = 304

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 51/96 (53%), Gaps = 2/96 (2%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ +   K   +       +
Sbjct: 31  GMFGVGGGFLMTPLLFMIGIPPAVAVATGTNQIVASSVSGVLAHLKRKTVDLKMGCVLLS 90

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
             ++ A  G  +   L + +G+  +++ TLS+++F+
Sbjct: 91  GGLVGAAAGVQLFAYL-RSIGQVDLLV-TLSYVVFL 124


>gi|117921072|ref|YP_870264.1| hypothetical protein Shewana3_2631 [Shewanella sp. ANA-3]
 gi|117613404|gb|ABK48858.1| protein of unknown function DUF81 [Shewanella sp. ANA-3]
          Length = 257

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 11/80 (13%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY---LLKR 401
           G+ AG +  ++G GGG +  P  L LGIPP  +  T  FA +F SSM+   YY   LLK 
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75

Query: 402 FPVPYALYFFALSIIAAFVG 421
                   F+ ++ IA F+G
Sbjct: 76  -------SFWYMAFIATFIG 88


>gi|57640914|ref|YP_183392.1| hypothetical protein TK0979 [Thermococcus kodakarensis KOD1]
 gi|57159238|dbj|BAD85168.1| hypothetical membrane protein, conserved, DUF81 family
           [Thermococcus kodakarensis KOD1]
          Length = 254

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 42/81 (51%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y   GI  G++  + GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y+  +
Sbjct: 7   YLGVGIVIGILAAMFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSAIAYHRQR 66

Query: 401 RFPVPYALYFFALSIIAAFVG 421
           R      L   + ++I A++G
Sbjct: 67  RIHYKAGLLLASTAVIGAYIG 87


>gi|24373383|ref|NP_717426.1| 4-toluene sulfonate uptake permease family protein [Shewanella
           oneidensis MR-1]
 gi|24347649|gb|AAN54870.1| 4-toluene sulfonate uptake permease family protein [Shewanella
           oneidensis MR-1]
          Length = 257

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 50/100 (50%), Gaps = 11/100 (11%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           S      A G+ AG +  ++G GGG +  P  L LGIPP  +  T  FA +F SSM+   
Sbjct: 8   SNWALLAAIGLVAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWT 66

Query: 396 YY---LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 432
           +Y   LLK         F+ ++ +A F+G  +   L+ +L
Sbjct: 67  FYRQHLLKP-------AFWYMAFVATFIGAVLGSILVYLL 99


>gi|23016247|ref|ZP_00056005.1| COG0730: Predicted permeases [Magnetospirillum magnetotacticum
           MS-1]
          Length = 301

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 60/112 (53%), Gaps = 13/112 (11%)

Query: 357 LGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII 416
           +GGGF+L PL + LGIPP V+ A+    +  SS   V  ++  +   V  A++     ++
Sbjct: 34  VGGGFLLTPLLIMLGIPPAVAVASGANQVLGSSVSGVFAHWRRRNVDVKMAVFL----LV 89

Query: 417 AAFVGQHV---LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRI 465
             F G  V   L  L+K LG+  ++I +LS++ F     LG VGL  +I+ +
Sbjct: 90  GGFAGSGVGVWLFALLKRLGQIDLVI-SLSYVGF-----LGSVGLLMLIETL 135


>gi|23099829|ref|NP_693295.1| hypothetical protein OB2374 [Oceanobacillus iheyensis HTE831]
 gi|22778060|dbj|BAC14330.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
          Length = 285

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 48/98 (48%), Gaps = 7/98 (7%)

Query: 316 LYKGRRKIASKGDEGTKWRASQLVFYCACGIT--AGMVGGLLGLGGGFILGPLFLEL-GI 372
           L  G R    KG+        +L F  A  I    GM+ GL G+GGG I+ P+ + L G 
Sbjct: 148 LEPGARSFELKGEV----YIYKLSFISAISIALFVGMLSGLFGIGGGAIMVPVMMLLFGF 203

Query: 373 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYF 410
           P  +++AT++F I F S M  + +  L      Y L+F
Sbjct: 204 PAHLAAATSMFMIIFVSFMGSITHIYLGNVVWEYVLFF 241



 Score = 38.1 bits (87), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 54/106 (50%), Gaps = 7/106 (6%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLA 155
           G+GGG I VP++ L+ GF A  +AA S  M+       V +     H  L   V +Y   
Sbjct: 186 GIGGGAIMVPVMMLLFGFPAHLAAATSMFMII-----FVSFMGSITHIYLGNVVWEY--V 238

Query: 156 LLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGV 201
           L F P   +G  +G   N + ++ ++ +LL V+ +++  R  ++G+
Sbjct: 239 LFFIPGAWIGGKLGAKVNHLLSNNVLELLLKVMLILVGIRMIMQGI 284


>gi|86606127|ref|YP_474890.1| hypothetical protein CYA_1459 [Synechococcus sp. JA-3-3Ab]
 gi|86554669|gb|ABC99627.1| putative membrane protein [Synechococcus sp. JA-3-3Ab]
          Length = 317

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 35/53 (66%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           G+ AG++ G LG+GGG IL PL + LGIPP  + AT+ F+I  +S    ++ +
Sbjct: 31  GLAAGILAGFLGIGGGTILVPLQVSLGIPPLQAVATSNFSILLTSLAGSIQNW 83


>gi|436841460|ref|YP_007325838.1| conserved membrane protein of unknown function [Desulfovibrio
           hydrothermalis AM13 = DSM 14728]
 gi|432170366|emb|CCO23737.1| conserved membrane protein of unknown function [Desulfovibrio
           hydrothermalis AM13 = DSM 14728]
          Length = 426

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 12/178 (6%)

Query: 296 VLNLLQIPVA-GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGL 354
           VL L+ IPV  GG++    +A          K      +     +   A G+ AG++ G 
Sbjct: 38  VLGLMLIPVILGGIAFASDIAPVLPDLLGGKKAYSPAFYSLGIFIVSIAIGMGAGLITGC 97

Query: 355 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 414
           +G GGGFI+ P  +  GI   ++  T LF I   + M  V +  +    VP A  F   +
Sbjct: 98  IGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAMMGSVIHRKMGNVSVPLAGVFLIGA 157

Query: 415 IIAAFVG---QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 469
           I+ A  G     VL ++  +L  A        FI  V +L LG +G   M+  ++ ++
Sbjct: 158 IVGATAGGVINRVLYEINPVLSDA--------FITTVYSLMLGFLGTYAMMDYLKARK 207


>gi|452973225|gb|EME73047.1| hypothetical protein BSONL12_14964 [Bacillus sonorensis L12]
          Length = 272

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 45/91 (49%), Gaps = 8/91 (8%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLEL-GIP-------PQVSSATALFAITFSSSMSVVE 395
            G+ AG +G L+GLGGG ++ P  L L GIP       PQ++  T+L  I F+   S + 
Sbjct: 9   LGLIAGTIGSLVGLGGGIVIVPSLLFLSGIPGVFDHVTPQMAVGTSLLVIIFTGLSSTIA 68

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLK 426
           Y   K       L FF  S   + +G HV K
Sbjct: 69  YMKYKTVDYKSGLIFFIGSGPGSIIGAHVSK 99


>gi|242398129|ref|YP_002993553.1| permease [Thermococcus sibiricus MM 739]
 gi|242264522|gb|ACS89204.1| Predicted permease [Thermococcus sibiricus MM 739]
          Length = 254

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 43/81 (53%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y + GI  G++  L GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y+  +
Sbjct: 7   YFSIGIFVGILAALFGLGGGFLVVPTLNLLGVEIHHAVGTSSAAVVFTSLSSAIAYHRQR 66

Query: 401 RFPVPYALYFFALSIIAAFVG 421
           R      L   + ++I A++G
Sbjct: 67  RIHYKAGLLLASTAVIGAYIG 87


>gi|163744253|ref|ZP_02151613.1| integral membrane protein, putative [Oceanibulbus indolifex HEL-45]
 gi|161381071|gb|EDQ05480.1| integral membrane protein, putative [Oceanibulbus indolifex HEL-45]
          Length = 304

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 57/98 (58%), Gaps = 6/98 (6%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL--YF 410
           G+ G+GGGF++ PL   +GIPP V+ AT+   I  SS  +++ +  LKR  V + +    
Sbjct: 29  GMFGVGGGFLITPLLFFVGIPPAVAVATSTNQIVASSFSALLAH--LKRRTVDFRMGWVL 86

Query: 411 FALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
            A  +I + +G  +   L K LG+  +++ TL ++IF+
Sbjct: 87  LAGGLIGSGIGMFIFNYL-KSLGQVDLLV-TLCYVIFL 122


>gi|340027139|ref|ZP_08663202.1| hypothetical protein PaTRP_00390 [Paracoccus sp. TRP]
          Length = 307

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 64/132 (48%), Gaps = 18/132 (13%)

Query: 315 ALYKGRR----KIASKGDEG--------TKWRASQL----VFYCACGITAGMVGGLLGLG 358
           AL++ RR    K   +   G         K+RAS L    +     G   G++G ++G+G
Sbjct: 136 ALHRARRSGPIKPMRRHQHGWVHKWPLKMKFRASGLYISAIPVLMVGAAVGVLGAIMGVG 195

Query: 359 GGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYLLKR-FPVPYALYFFALSII 416
           GGFI+ P  +  LG+P +V   T+LF ITF S+ + + + +      V  A+      +I
Sbjct: 196 GGFIMVPAMIYLLGMPTKVVIGTSLFQITFLSAYTTLMHAVSSNTVDVMLAVLLIVGGVI 255

Query: 417 AAFVGQHVLKKL 428
            A VG H+  +L
Sbjct: 256 GAQVGTHLGARL 267


>gi|303246503|ref|ZP_07332782.1| protein of unknown function DUF81 [Desulfovibrio fructosovorans JJ]
 gi|302492213|gb|EFL52088.1| protein of unknown function DUF81 [Desulfovibrio fructosovorans JJ]
          Length = 420

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 53/119 (44%), Gaps = 5/119 (4%)

Query: 308 VSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 367
           VS  EA     G+   A      T + AS      A G+ AG++ G +G GGGFI+ P  
Sbjct: 49  VSFAEAGGYLGGKHAYAPAFYTSTIFLAS-----IAVGLAAGLITGCIGAGGGFIITPAL 103

Query: 368 LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 426
           + +G+   ++  T LF I   + M    +  L    V  A+ F   SI   FVG  + K
Sbjct: 104 MSVGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSVKLAIAFLVGSISGTFVGGAINK 162


>gi|301102374|ref|XP_002900274.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262102015|gb|EEY60067.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 340

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 61/133 (45%), Gaps = 6/133 (4%)

Query: 319 GRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVS 377
           GRR + S   E T   A  L    A G  AGMV  L G+GGG IL P   +   +  Q  
Sbjct: 21  GRRVLCSAPKERTSSAAIPLGVGVATGGVAGMVSALTGVGGGIILIPAMAKFTSLSQQAI 80

Query: 378 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG---QHVL--KKLIKIL 432
           + T+L A+  +SS+    Y       VP AL     +I+A   G    H L  KKL  ++
Sbjct: 81  NGTSLGAVAIASSVGAWNYLQSGLCNVPLALTTTIPAILATKYGVRTAHRLTSKKLSLVV 140

Query: 433 GRASIIIFTLSFI 445
           G A ++   L F+
Sbjct: 141 GSAMLLCSPLIFL 153


>gi|407787737|ref|ZP_11134876.1| hypothetical protein B30_16868 [Celeribacter baekdonensis B30]
 gi|407199016|gb|EKE69040.1| hypothetical protein B30_16868 [Celeribacter baekdonensis B30]
          Length = 302

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 53/103 (51%), Gaps = 6/103 (5%)

Query: 332 KWRASQLVFYC----ACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAIT 386
           K+RAS L        A G+  GM+G ++G+GGGFI+ P +   LG+P +V   T+LF I 
Sbjct: 164 KFRASGLYISAIPPIAVGMMVGMLGAVMGVGGGFIMVPAMIYLLGMPTKVVIGTSLFQII 223

Query: 387 FSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           F++   +++  Y      V  A       +I A +G  +  K+
Sbjct: 224 FTAGFTTLLHAYTNHTVDVLLAFLLLVGGVIGAQIGTQIGLKM 266



 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 64/118 (54%), Gaps = 7/118 (5%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G++ G+ G+GGGF++ PL   +GIPP V+ AT+   I  SS   V+ ++  K   +   L
Sbjct: 25  GVLSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIVASSVSGVLAHFKRKTVDLKMGL 84

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 466
                 ++ + +G  +   ++K +G+  +++ TL +++F     LG +G   +I+ I 
Sbjct: 85  VLLGGGLVGSAIGIQIF-NMLKAMGQVDLLV-TLCYVVF-----LGIIGFLMLIESIR 135


>gi|89099437|ref|ZP_01172313.1| hypothetical protein B14911_19705 [Bacillus sp. NRRL B-14911]
 gi|89085823|gb|EAR64948.1| hypothetical protein B14911_19705 [Bacillus sp. NRRL B-14911]
          Length = 285

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 68/140 (48%), Gaps = 10/140 (7%)

Query: 259 IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVS-AYEAVALY 317
           ++NI WKE G ++ V  + LA Q AK      S     L+   IP    V  +Y A  ++
Sbjct: 64  LKNILWKE-GAILGVSGM-LATQAAKPLVLFLS--EKGLDATVIPACYIVLLSYFAFTMF 119

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQV 376
           +  +K   +  EG    A  L+     G + G V   LG+GGGFI+ PL +  LG+ P+ 
Sbjct: 120 RQGKKTGEQSREGRPSLAGMLL----IGFSGGFVSAALGVGGGFIMVPLSIAFLGLQPRK 175

Query: 377 SSATALFAITFSSSMSVVEY 396
           +  T+LFA+    S   + Y
Sbjct: 176 AVGTSLFAVLLIVSTGFLSY 195


>gi|154250913|ref|YP_001411737.1| hypothetical protein Plav_0457 [Parvibaculum lavamentivorans DS-1]
 gi|154154863|gb|ABS62080.1| protein of unknown function DUF81 [Parvibaculum lavamentivorans
           DS-1]
          Length = 307

 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 33/59 (55%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
            A G T G + G+ G+GGGF++ PL + LGIPP V+  T    +  SS    + ++  K
Sbjct: 18  LAMGATVGFLSGMFGIGGGFLMTPLLIFLGIPPAVAVGTQTTQVVASSVTGALAHFTRK 76


>gi|323144141|ref|ZP_08078778.1| hypothetical protein HMPREF9444_01429 [Succinatimonas hippei YIT
           12066]
 gi|322416088|gb|EFY06785.1| hypothetical protein HMPREF9444_01429 [Succinatimonas hippei YIT
           12066]
          Length = 251

 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 61/120 (50%), Gaps = 7/120 (5%)

Query: 96  GVGGGGIFVPMLNLIV-GFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           G+GGG + VP+L  +   F  +  AA S  +V G A+  + Y +RQ+        I Y  
Sbjct: 6   GIGGGVLMVPLLMTVFPQFSMQMVAATSLSIVIGSALINLTYFIRQKIA------ISYKG 59

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
            +L+   +++G+  G   + IF   +I  + IV  L+L+ + FLK  ++++ +   K  A
Sbjct: 60  LVLWSAGMIIGVQGGFELSFIFHPNVIVGIFIVTMLILAVKTFLKLKKSYQVKEDVKVNA 119


>gi|407462768|ref|YP_006774085.1| hypothetical protein NKOR_06310 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407046390|gb|AFS81143.1| hypothetical protein NKOR_06310 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 120

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 35/57 (61%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           G  AG++G ++GLGGG I+ P+   LG PP  +++ +LFA   ++  S V Y   +R
Sbjct: 11  GFAAGVLGSMIGLGGGIIVVPVLTFLGFPPIAAASNSLFAALSNAVASTVSYSRQRR 67


>gi|138896571|ref|YP_001127024.1| hypothetical protein GTNG_2934 [Geobacillus thermodenitrificans
           NG80-2]
 gi|196249274|ref|ZP_03147972.1| protein of unknown function DUF81 [Geobacillus sp. G11MC16]
 gi|134268084|gb|ABO68279.1| Hypothetical membrane spanning protein [Geobacillus
           thermodenitrificans NG80-2]
 gi|196211031|gb|EDY05792.1| protein of unknown function DUF81 [Geobacillus sp. G11MC16]
          Length = 300

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 14/116 (12%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 397
           G  AG VG L GLGGG I+ P  L  G       + PQV+  T+L  I F+   S + Y 
Sbjct: 10  GFVAGTVGSLAGLGGGVIIVPALLFFGSLGWLSAVTPQVAVGTSLVVIIFNGLSSTLSYM 69

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
             K       L F   SI    +G  V   L       S+  F+L F +F+ A+SL
Sbjct: 70  KDKMVDYQSGLLFCLGSIPGTVIGAWVNNTL-------SVDRFSLYFGLFLMAMSL 118


>gi|442317193|ref|YP_007357214.1| hypothetical protein MYSTI_00171 [Myxococcus stipitatus DSM 14675]
 gi|441484835|gb|AGC41530.1| hypothetical protein MYSTI_00171 [Myxococcus stipitatus DSM 14675]
          Length = 300

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 53/98 (54%), Gaps = 4/98 (4%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+  G++ GL+G GGGF++ P    +G+P  V++AT+L  I    +  +V +  L    +
Sbjct: 145 GVGVGILSGLVGAGGGFLIVPALAMVGLPTPVATATSLVVIALQCAAGLVGH--LGHLDL 202

Query: 405 PYAL--YFFALSIIAAFVGQHVLKKLIKILGRASIIIF 440
           P+ L     A++++ +FVG  +  ++   L R    +F
Sbjct: 203 PWLLTAQVIAVALVGSFVGGRLAGRVAPGLLRKGFAVF 240


>gi|158423157|ref|YP_001524449.1| permease [Azorhizobium caulinodans ORS 571]
 gi|158330046|dbj|BAF87531.1| putative permease [Azorhizobium caulinodans ORS 571]
          Length = 317

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 34/53 (64%), Gaps = 3/53 (5%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA---TALFAITFSSSMS 392
           A G   G V G+ G+GGGF++ PL L +G+PP V+ A   T + A +FS ++S
Sbjct: 19  ALGAAVGFVSGMFGVGGGFLMTPLLLFIGVPPAVAVASVSTHMAASSFSGALS 71


>gi|449017054|dbj|BAM80456.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 383

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 39/63 (61%), Gaps = 1/63 (1%)

Query: 335 ASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSV 393
           A+++V     G+TAG +G L+G+GGG I+ PL   + G+    +  T+L A++FSS +  
Sbjct: 105 AARIVLSGLIGLTAGFLGSLIGVGGGIIMTPLLTSVAGLSQHEAHGTSLVAVSFSSCVGA 164

Query: 394 VEY 396
           + Y
Sbjct: 165 LAY 167


>gi|256828130|ref|YP_003156858.1| hypothetical protein Dbac_0315 [Desulfomicrobium baculatum DSM
           4028]
 gi|256577306|gb|ACU88442.1| protein of unknown function DUF81 [Desulfomicrobium baculatum DSM
           4028]
          Length = 415

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 2/98 (2%)

Query: 336 SQLVFYCA--CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
           S  +F+ +   G+ AG++ G +G GGGFI+ P  +  G+   ++  T LF I   + M  
Sbjct: 67  SLYIFFVSIFVGVGAGLISGCIGAGGGFIIAPALMSAGVKGILAVGTDLFHIFAKAIMGS 126

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           V +  +    VP A  F   +II   VG  + + L  I
Sbjct: 127 VLHRKMGNVSVPLAFVFLIGAIIGTTVGAGINRALYNI 164



 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMS 392
           R    +F    GI  GM  G++G+GGGF+  P+F+  LG+    +  T +F I F++   
Sbjct: 240 RKISWIFLVLSGILVGMAAGIMGVGGGFLTFPIFVYILGVSSLTTVGTDIFQIVFTAGYG 299

Query: 393 VVEYYLLKRF 402
            +  Y +  F
Sbjct: 300 AITQYAIYGF 309


>gi|108757215|ref|YP_628460.1| hypothetical protein MXAN_0177 [Myxococcus xanthus DK 1622]
 gi|108461095|gb|ABF86280.1| membrane protein [Myxococcus xanthus DK 1622]
          Length = 293

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 77/163 (47%), Gaps = 14/163 (8%)

Query: 306 GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 365
           G V    AVA+ + RR+        +   A++++   A GI  G + GL+G GGGF++ P
Sbjct: 104 GSVMVAAAVAMLR-RREAGPAATSVSPLPAARVL---AQGIAVGALSGLVGAGGGFLIVP 159

Query: 366 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 425
                G+P  V++AT+L  I    +  +V +  L    +P+ L    L  +AA  G  V 
Sbjct: 160 ALSLAGLPMPVATATSLVVIALQCAAGLVGH--LGHLDLPWVLTGEVL--LAAMTGSLVG 215

Query: 426 KKLIKILGRASIIIFTLSFIIFV---SALSLGGVGLAKMIKRI 465
            +L    GR S  +    F +FV   +A  LG    A + +R+
Sbjct: 216 GRLA---GRVSPAMLRKGFAVFVLTTAAFLLGAQSPAPLRERV 255


>gi|406672837|ref|ZP_11080062.1| hypothetical protein HMPREF9700_00604 [Bergeyella zoohelcum CCUG
           30536]
 gi|405587381|gb|EKB61109.1| hypothetical protein HMPREF9700_00604 [Bergeyella zoohelcum CCUG
           30536]
          Length = 263

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 1/96 (1%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMS 392
           + +      + G+  G++ G +G GGGF++ P L +  G+  + + AT+LF I  +S++ 
Sbjct: 140 KENNYTLLISQGLLVGIIMGFIGAGGGFLIVPALVMLFGMNMREAVATSLFIIMINSAVG 199

Query: 393 VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
            +          P+ L F  LSII  F+G  + KK+
Sbjct: 200 FISSLDKIAIDWPFLLVFSGLSIIGIFIGMLIAKKM 235


>gi|254410195|ref|ZP_05023975.1| conserved domain protein, putative [Coleofasciculus chthonoplastes
           PCC 7420]
 gi|196183231|gb|EDX78215.1| conserved domain protein, putative [Coleofasciculus chthonoplastes
           PCC 7420]
          Length = 267

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 39/64 (60%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           +QL+ + + G+ AG++ G LG+GGG +L PL + LG   Q + AT+  +I  ++    V+
Sbjct: 4   TQLLIFASAGLFAGILAGFLGIGGGTVLVPLLVTLGYDYQQAVATSTLSIVITAISGTVQ 63

Query: 396 YYLL 399
            + L
Sbjct: 64  NWRL 67



 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 61/135 (45%), Gaps = 15/135 (11%)

Query: 85  GFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT 144
           G F        G+GGG + VP+L + +G+D + + A S   +   A+S  V N R  +  
Sbjct: 13  GLFAGILAGFLGIGGGTVLVPLL-VTLGYDYQQAVATSTLSIVITAISGTVQNWRLGN-- 69

Query: 145 LDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT-----VLLIVLFLVLSTRAFLK 199
                ID+   +      ++   IG     +FAD+ +      +LLI ++LV   R  LK
Sbjct: 70  -----IDFKRIIAIGFPAIITAPIGAYLTELFADYWLKAAFGLLLLINIYLV-RLRKRLK 123

Query: 200 GVETWKK-ETITKRE 213
             E     ET+T R+
Sbjct: 124 AREKQDSLETVTPRQ 138


>gi|423315709|ref|ZP_17293614.1| hypothetical protein HMPREF9699_00185 [Bergeyella zoohelcum ATCC
           43767]
 gi|405585813|gb|EKB59616.1| hypothetical protein HMPREF9699_00185 [Bergeyella zoohelcum ATCC
           43767]
          Length = 263

 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 1/96 (1%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMS 392
           + +      + G+  G++ G +G GGGF++ P L +  G+  + + AT+LF I  +S++ 
Sbjct: 140 KENNYTLLISQGLLVGIIMGFIGAGGGFLIVPALVMLFGMNMREAVATSLFIIMINSAVG 199

Query: 393 VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
            +          P+ L F  LSII  F+G  + KK+
Sbjct: 200 FISSLDKIAIDWPFLLVFSGLSIIGIFIGMLIAKKM 235


>gi|417646396|ref|ZP_12296255.1| hypothetical protein SEVCU144_0935 [Staphylococcus epidermidis
           VCU144]
 gi|329727892|gb|EGG64342.1| hypothetical protein SEVCU144_0935 [Staphylococcus epidermidis
           VCU144]
          Length = 125

 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 61/119 (51%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFA---LSIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++       LK       L F A    +IIA ++  ++LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIVGKMLPLIFVASGGAAIIATYIPANILKPLI-IIALSLVLIYTV 118


>gi|255534255|ref|YP_003094626.1| hypothetical protein FIC_00088 [Flavobacteriaceae bacterium
           3519-10]
 gi|255340451|gb|ACU06564.1| conserved hypothetical transmembrane protein [Flavobacteriaceae
           bacterium 3519-10]
          Length = 266

 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 58/107 (54%), Gaps = 4/107 (3%)

Query: 326 KGDEGTKWRASQLVFY---CACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATA 381
           + +E  + R    V Y    + G+  G+V GL+G GGGF++ P L + LG+  + + AT+
Sbjct: 129 RKNERPRLRKYDEVNYTILVSQGLLVGIVTGLIGAGGGFLIVPALVMLLGLNMKKAVATS 188

Query: 382 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           LF I  +S +  +    + +    + L F +LS+I  F+G  + K++
Sbjct: 189 LFIIAMNSLIGFLSTMKIVKHDWVFLLSFTSLSVIGIFIGLALSKRM 235


>gi|260431503|ref|ZP_05785474.1| putative membrane protein [Silicibacter lacuscaerulensis ITI-1157]
 gi|260415331|gb|EEX08590.1| putative membrane protein [Silicibacter lacuscaerulensis ITI-1157]
          Length = 306

 Score = 43.1 bits (100), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 53/101 (52%), Gaps = 2/101 (1%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  K   +    
Sbjct: 25  GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRKTVDLKMGC 84

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 449
                 ++ A +G  V   L K LG+  +++  L +++F+ 
Sbjct: 85  VLLVGGLLGAALGVVVFNYL-KSLGQVDLLV-KLCYVVFLG 123


>gi|16079702|ref|NP_390526.1| hypothetical protein BSU26490 [Bacillus subtilis subsp. subtilis
           str. 168]
 gi|221310580|ref|ZP_03592427.1| hypothetical protein Bsubs1_14486 [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|221314904|ref|ZP_03596709.1| hypothetical protein BsubsN3_14407 [Bacillus subtilis subsp.
           subtilis str. NCIB 3610]
 gi|221319826|ref|ZP_03601120.1| hypothetical protein BsubsJ_14323 [Bacillus subtilis subsp.
           subtilis str. JH642]
 gi|221324105|ref|ZP_03605399.1| hypothetical protein BsubsS_14457 [Bacillus subtilis subsp.
           subtilis str. SMY]
 gi|402776802|ref|YP_006630746.1| integral inner membrane protein [Bacillus subtilis QB928]
 gi|418032177|ref|ZP_12670660.1| hypothetical protein BSSC8_16040 [Bacillus subtilis subsp. subtilis
           str. SC-8]
 gi|452915804|ref|ZP_21964430.1| sulfite exporter TauE/SafE family protein [Bacillus subtilis
           MB73/2]
 gi|1731129|sp|P54437.1|YRKJ_BACSU RecName: Full=UPF0721 transmembrane protein YrkJ
 gi|1303709|dbj|BAA12365.1| YrkJ [Bacillus subtilis]
 gi|2635094|emb|CAB14590.1| putative integral inner membrane protein [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|351471040|gb|EHA31161.1| hypothetical protein BSSC8_16040 [Bacillus subtilis subsp. subtilis
           str. SC-8]
 gi|402481982|gb|AFQ58491.1| Putative integral inner membrane protein [Bacillus subtilis QB928]
 gi|407959832|dbj|BAM53072.1| hypothetical protein BEST7613_4141 [Bacillus subtilis BEST7613]
 gi|407965475|dbj|BAM58714.1| hypothetical protein BEST7003_2513 [Bacillus subtilis BEST7003]
 gi|452116152|gb|EME06548.1| sulfite exporter TauE/SafE family protein [Bacillus subtilis
           MB73/2]
          Length = 261

 Score = 43.1 bits (100), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 60/114 (52%), Gaps = 11/114 (9%)

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQV 376
           KG+     K     KW AS L F        G V G+LG GG FIL P+ L  L IP +V
Sbjct: 131 KGQEHSEDKEVIFNKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRV 184

Query: 377 SSATALFAITFSSSM--SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           + A++L AITF SS+  +V +    +   VP AL     S+IA+ +G  V +K+
Sbjct: 185 TVASSL-AITFLSSIGATVGKVITGQVLFVP-ALVLMITSLIASPIGASVGQKV 236


>gi|392410298|ref|YP_006446905.1| putative permease [Desulfomonile tiedjei DSM 6799]
 gi|390623434|gb|AFM24641.1| putative permease [Desulfomonile tiedjei DSM 6799]
          Length = 279

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 6/98 (6%)

Query: 328 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL---GIPPQVSSATALFA 384
            E T + A + V Y A G   G+ G L+G GGGF+L P+ L L     P Q++S +   A
Sbjct: 2   TETTLFSAERFVGYVALGFGTGLYGTLIGAGGGFVLMPILLLLFPKESPEQLTSIS--LA 59

Query: 385 ITFSSSMSVVE-YYLLKRFPVPYALYFFALSIIAAFVG 421
           + F +++   E Y L++R      L F A +I  A +G
Sbjct: 60  VVFFNALGGSEAYALMRRIDYRSGLMFAAATIPGAILG 97


>gi|240102618|ref|YP_002958927.1| putative permease [Thermococcus gammatolerans EJ3]
 gi|239910172|gb|ACS33063.1| Predicted permease [Thermococcus gammatolerans EJ3]
          Length = 254

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 42/81 (51%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y A G+  G++  L GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y   K
Sbjct: 7   YFAVGVFIGILAALFGLGGGFLVVPTLNLLGVEMHHAVGTSSAAVVFTSLSSTIAYSRQK 66

Query: 401 RFPVPYALYFFALSIIAAFVG 421
           R      L   + ++I A++G
Sbjct: 67  RIHYRVGLLLASTAVIGAYIG 87


>gi|333373756|ref|ZP_08465660.1| protein of hypothetical function DUF81 [Desmospora sp. 8437]
 gi|332969423|gb|EGK08447.1| protein of hypothetical function DUF81 [Desmospora sp. 8437]
          Length = 273

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 326 KGDEGTKWR-ASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALF 383
           K  EG  +R          CG   G++ GL G+GGG +L PL + L   PP V++AT++F
Sbjct: 143 KDAEGNIYRYGYHRTIALTCGFVVGLLSGLFGIGGGALLVPLMVLLFRFPPHVATATSMF 202

Query: 384 AITFSS 389
            I  SS
Sbjct: 203 VIFLSS 208


>gi|384921152|ref|ZP_10021141.1| hypothetical protein C357_18457 [Citreicella sp. 357]
 gi|384464952|gb|EIE49508.1| hypothetical protein C357_18457 [Citreicella sp. 357]
          Length = 302

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 59/107 (55%), Gaps = 6/107 (5%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G   G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ +  LKR  V
Sbjct: 21  GALVGILSGMFGVGGGFLITPLLFFMGIPPAVAVATGANQIVASSVSGVLAH--LKRRTV 78

Query: 405 PYALYFFALS--IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 449
              +    LS  I+ A VG  V   L+K +G+  +++  L +++F+ 
Sbjct: 79  DLRMGTVLLSGGIVGAAVGIWVF-NLLKGMGQVDLLV-RLCYVVFLG 123


>gi|260431501|ref|ZP_05785472.1| membrane protein [Silicibacter lacuscaerulensis ITI-1157]
 gi|260415329|gb|EEX08588.1| membrane protein [Silicibacter lacuscaerulensis ITI-1157]
          Length = 307

 Score = 42.7 bits (99), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 1/57 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           GI  G + G+ G+GGGF++ PL   +GIPP V+ AT+   +  +SS S V  +L +R
Sbjct: 21  GIAVGFLSGMFGVGGGFLITPLLFLIGIPPAVAVATSANQVV-ASSFSGVLAHLKRR 76


>gi|308270305|emb|CBX26917.1| hypothetical protein N47_A09460 [uncultured Desulfobacterium sp.]
          Length = 260

 Score = 42.7 bits (99), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 52/112 (46%), Gaps = 3/112 (2%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           GI AG++ G +G GGGFI+ P  +  GI   ++  T LF I   + M  V +  L    V
Sbjct: 88  GICAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVLHRKLGNISV 147

Query: 405 PYALYFFALSIIAAFVG---QHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
              L F   SII A  G     V+  +  IL  A I +  +  +  + A SL
Sbjct: 148 ALGLIFILGSIIGATTGGVINRVIYDINPILSDAFITVIYVFMLGILGAYSL 199


>gi|317152720|ref|YP_004120768.1| hypothetical protein Daes_1006 [Desulfovibrio aespoeensis Aspo-2]
 gi|316942971|gb|ADU62022.1| protein of unknown function DUF81 [Desulfovibrio aespoeensis
           Aspo-2]
          Length = 427

 Score = 42.7 bits (99), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 53/111 (47%), Gaps = 6/111 (5%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG++ G +G GGGFI+ P  +  GI   ++  T LF I   + M  V +  L    V
Sbjct: 88  GMGAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKLGNVSV 147

Query: 405 PYALYFFALSIIAAFVG---QHVLKKLIKILGRASIII---FTLSFIIFVS 449
           P A  F   +II A  G     VL ++  IL  A I     F L F+ F S
Sbjct: 148 PLAAVFLIGAIIGASAGGIINRVLYEINPILSDAFITTVYSFMLGFLGFYS 198



 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMS 392
           R    VF    GI  G+  G++G+GGGF+  P+F+  LG+    +  T +F I F++  +
Sbjct: 251 RQISWVFLVLSGILVGLAAGIMGVGGGFLTFPIFVYVLGVSSMTTVGTDIFQIVFTAGFA 310

Query: 393 VVEYYLLKRF 402
            +  Y +  F
Sbjct: 311 SISQYAIYGF 320


>gi|83312652|ref|YP_422916.1| permease [Magnetospirillum magneticum AMB-1]
 gi|82947493|dbj|BAE52357.1| Predicted permease [Magnetospirillum magneticum AMB-1]
          Length = 301

 Score = 42.7 bits (99), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 58/108 (53%), Gaps = 7/108 (6%)

Query: 358 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 417
           GGGF+L PL + LGIPP V+ A+    +  SS   V  ++  +   +  AL+  A   I 
Sbjct: 35  GGGFLLTPLLIMLGIPPAVAVASGANQVLGSSVSGVFAHWRRRNVDIKMALFLLAGGFIG 94

Query: 418 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRI 465
           +  G   L  ++K LG+  ++I +LS++ F     LG VGL  +++ +
Sbjct: 95  SAAGVW-LFAVLKRLGQIDLVI-SLSYVGF-----LGSVGLLMLVETL 135


>gi|422350303|ref|ZP_16431189.1| hypothetical protein HMPREF9465_02079 [Sutterella wadsworthensis
           2_1_59BFAA]
 gi|404657451|gb|EKB30342.1| hypothetical protein HMPREF9465_02079 [Sutterella wadsworthensis
           2_1_59BFAA]
          Length = 266

 Score = 42.7 bits (99), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 8/88 (9%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS--------ATALFAITFSSSMSVVEY 396
           GI  G  G L+G+GGG I+ PLF+   +PP  S+         T+LF +  ++      Y
Sbjct: 11  GIGVGSFGALVGIGGGLIMVPLFMYFMMPPSGSTFANVQEVVGTSLFGVLLNALSGTWAY 70

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHV 424
           +  KR  +  A+ F   ++  AF+G +V
Sbjct: 71  FRQKRIILSVAMPFALATVPGAFLGSYV 98


>gi|212557467|gb|ACJ29921.1| Conserved hypothetical protein [Shewanella piezotolerans WP3]
          Length = 257

 Score = 42.7 bits (99), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 41/82 (50%), Gaps = 1/82 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG +  + G GGG +  P  L +G+PP ++ AT   A  F SSM+   YY  + F  
Sbjct: 17  GLIAGFIDAVAG-GGGLLSIPALLTMGVPPHMALATNKLAACFGSSMAAYTYYKQRLFSP 75

Query: 405 PYALYFFALSIIAAFVGQHVLK 426
               + F  + I A +G  ++ 
Sbjct: 76  SLWYHTFIATFIGAVIGTFIVS 97


>gi|90422261|ref|YP_530631.1| hypothetical protein RPC_0741 [Rhodopseudomonas palustris BisB18]
 gi|90104275|gb|ABD86312.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
           BisB18]
          Length = 308

 Score = 42.7 bits (99), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 58/115 (50%), Gaps = 2/115 (1%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            ++   A G   G V G+ G+GGGF++ PL + +GI P V+ A+    +  SS    + Y
Sbjct: 13  NVLLILAMGAAVGFVSGMFGIGGGFLMTPLLIFIGIAPAVAVASVTSHMAASSLSGALSY 72

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
           +  +      AL      +I   +G      L++ LG+  ++I  LS+++ ++A+
Sbjct: 73  WRRRAIDPALALVLLCGGLIGTALGVWFF-TLMRSLGQLDLVI-ALSYVVLLTAV 125


>gi|339446247|ref|YP_004712251.1| hypothetical protein EGYY_28770 [Eggerthella sp. YY7918]
 gi|338905999|dbj|BAK45850.1| hypothetical protein EGYY_28770 [Eggerthella sp. YY7918]
          Length = 285

 Score = 42.7 bits (99), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 3/108 (2%)

Query: 322 KIASKGDEGTK--WRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSS 378
           K   + D GT   +   QL+     G+ AG+  G +G+GGGF++ PL L L GI  + +S
Sbjct: 147 KKTPEADTGTPATFSRRQLLMGAGIGLLAGVASGYVGVGGGFLMVPLMLSLVGIGMRQAS 206

Query: 379 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 426
            T+L A+   +   VVE  LL        +     +I  A +G  +++
Sbjct: 207 GTSLIAVMILAVPGVVEQALLGNIDYMAGIAISIGTIPGALIGAQLVR 254


>gi|171185687|ref|YP_001794606.1| hypothetical protein Tneu_1233 [Pyrobaculum neutrophilum V24Sta]
 gi|170934899|gb|ACB40160.1| protein of unknown function DUF81 [Pyrobaculum neutrophilum V24Sta]
          Length = 242

 Score = 42.7 bits (99), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 39/69 (56%), Gaps = 3/69 (4%)

Query: 66  KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
           K+V P     W   +G  + F G    S+ GVGGG +FVP L L  G DAK +AA+S  +
Sbjct: 111 KNVEPRRNAPW---LGPPLVFIGGFASSLFGVGGGTVFVPTLMLTSGLDAKRAAAMSMGI 167

Query: 126 VTGGAVSAV 134
           +   AVS+V
Sbjct: 168 IFPTAVSSV 176


>gi|154686411|ref|YP_001421572.1| hypothetical protein RBAM_019790 [Bacillus amyloliquefaciens FZB42]
 gi|154352262|gb|ABS74341.1| YrkJ1 [Bacillus amyloliquefaciens FZB42]
          Length = 175

 Score = 42.7 bits (99), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 63/107 (58%), Gaps = 8/107 (7%)

Query: 349 GMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPY 406
           G+  G++G GG F+L PL L  L IP +++ AT+L A+TF SS+ + V   ++ + P   
Sbjct: 69  GLASGIVGAGGAFLLFPLMLVVLKIPTKITIATSL-AVTFISSIGTTVTKLVIGQVPFMP 127

Query: 407 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
           AL   A S+IA+ +G  + KK+      A+I+   L F+I ++ + +
Sbjct: 128 ALVIVAASLIASPIGVLIGKKM-----NANILQGILIFVILITTVKI 169


>gi|295705457|ref|YP_003598532.1| hypothetical protein BMD_3342 [Bacillus megaterium DSM 319]
 gi|294803116|gb|ADF40182.1| putative membrane protein [Bacillus megaterium DSM 319]
          Length = 258

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 3/82 (3%)

Query: 349 GMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPY 406
           G V G+LG GG FIL P+ L  L IP +++ AT+L A+TF SS+ S +   +  + P   
Sbjct: 153 GAVAGVLGAGGAFILVPVMLVVLKIPTRITIATSL-AVTFISSIGSTIGKLITHQVPFIP 211

Query: 407 ALYFFALSIIAAFVGQHVLKKL 428
           AL   A S+IA+ +G  V +K+
Sbjct: 212 ALILVAASLIASPIGAKVGQKM 233


>gi|375082782|ref|ZP_09729829.1| permease [Thermococcus litoralis DSM 5473]
 gi|374742630|gb|EHR79021.1| permease [Thermococcus litoralis DSM 5473]
          Length = 254

 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 42/81 (51%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y   G+  G++  L GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y+  +
Sbjct: 7   YFGVGVFIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSAIAYHRQR 66

Query: 401 RFPVPYALYFFALSIIAAFVG 421
           R      L   + ++I A++G
Sbjct: 67  RIHYKAGLLLASTAVIGAYIG 87


>gi|456352291|dbj|BAM86736.1| permease [Agromonas oligotrophica S58]
          Length = 308

 Score = 42.7 bits (99), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 63/126 (50%), Gaps = 8/126 (6%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+  + 
Sbjct: 18  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV----G 457
                AL       +   +G      L++ LG+  ++I  LS+++ +S  S+GGV    G
Sbjct: 78  IDPSLALVLMIGGTLGTALGVSTF-TLLRSLGQLDLMI-ALSYVVLLS--SVGGVMFWEG 133

Query: 458 LAKMIK 463
           L  M++
Sbjct: 134 LRAMMR 139


>gi|215740460|dbj|BAG97116.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 109

 Score = 42.4 bits (98), Expect = 0.47,   Method: Composition-based stats.
 Identities = 20/27 (74%), Positives = 24/27 (88%)

Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVT 127
           GIFVPML LI+GFDAKSS A+SK +V+
Sbjct: 70  GIFVPMLTLIIGFDAKSSTAISKFIVS 96


>gi|408381578|ref|ZP_11179127.1| permease [Methanobacterium formicicum DSM 3637]
 gi|407816045|gb|EKF86608.1| permease [Methanobacterium formicicum DSM 3637]
          Length = 267

 Score = 42.4 bits (98), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 27/109 (24%), Positives = 52/109 (47%), Gaps = 1/109 (0%)

Query: 320 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 379
           +++ +SK D     R  Q +     G+ +G++ G+ G+ G   +      LG+P      
Sbjct: 133 KKETSSKNDPVVLTRPRQFL-ASLFGVASGLLAGIFGISGTPPVSAGLYSLGLPAMTVVG 191

Query: 380 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           T +F + F+S   +  Y+LL R  +   +     + + AF+G  +LKK+
Sbjct: 192 TTVFVLIFNSLTGIGGYFLLGRLDITLIILLAGGAAVGAFIGPKLLKKI 240


>gi|126458964|ref|YP_001055242.1| hypothetical protein Pcal_0341 [Pyrobaculum calidifontis JCM 11548]
 gi|126248685|gb|ABO07776.1| protein of unknown function DUF81 [Pyrobaculum calidifontis JCM
           11548]
          Length = 244

 Score = 42.4 bits (98), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 38/57 (66%)

Query: 79  VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVV 135
           V+G ++ F G    ++ GVGGG +FVP L L +GFDAK++AA S  ++   AV++ +
Sbjct: 121 VLGYLLIFAGGLASALFGVGGGTVFVPALVLAMGFDAKAAAASSMGIILPTAVASTL 177


>gi|308159430|gb|EFO61961.1| Hypothetical protein GLP15_502 [Giardia lamblia P15]
          Length = 748

 Score = 42.4 bits (98), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 3/54 (5%)

Query: 141 RHPTLDI--PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
           RH   D   P++ +DL +LFQP  +LG  +G   N IF  W++++  + LFL+L
Sbjct: 75  RHKKYDALSPLLQWDLLILFQPFSLLGALVGSICNAIFPSWVLSI-FVCLFLIL 127


>gi|218667447|ref|YP_002426656.1| hypothetical protein AFE_2258 [Acidithiobacillus ferrooxidans ATCC
           23270]
 gi|415968072|ref|ZP_11558253.1| hypothetical protein GGI1_04804 [Acidithiobacillus sp. GGI-221]
 gi|218519660|gb|ACK80246.1| membrane protein, putative [Acidithiobacillus ferrooxidans ATCC
           23270]
 gi|339833297|gb|EGQ61154.1| hypothetical protein GGI1_04804 [Acidithiobacillus sp. GGI-221]
          Length = 296

 Score = 42.4 bits (98), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 44/84 (52%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+  G + GL G+GGGF++ PL + +G+PP V+  T    I  +S++    ++ +    +
Sbjct: 12  GLIVGFLSGLTGVGGGFLITPLLIFVGVPPLVAVGTGAAQIVGASAVGSYAHWRMGNVDM 71

Query: 405 PYALYFFALSIIAAFVGQHVLKKL 428
             A    A S +   +G HV + L
Sbjct: 72  RMAFILLAGSWMGGLLGVHVARIL 95


>gi|99078076|ref|YP_611334.1| hypothetical protein TM1040_3098 [Ruegeria sp. TM1040]
 gi|99035214|gb|ABF62072.1| protein of unknown function DUF81 [Ruegeria sp. TM1040]
          Length = 330

 Score = 42.4 bits (98), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 27/56 (48%), Positives = 36/56 (64%), Gaps = 4/56 (7%)

Query: 349 GMVG---GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           GMVG   G+ G+GGGF+L PL   +GIPP V+ AT    I  +SS+S V  +L +R
Sbjct: 47  GMVGVLSGMFGVGGGFLLTPLLFFIGIPPAVAVATGANQIV-ASSVSGVLAHLRRR 101


>gi|223478524|ref|YP_002582988.1| hypothetical protein [Thermococcus sp. AM4]
 gi|214033750|gb|EEB74576.1| conserved hypothetical protein [Thermococcus sp. AM4]
          Length = 254

 Score = 42.4 bits (98), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 42/81 (51%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y A G+  G++  L GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y   +
Sbjct: 7   YFAVGVFIGILAALFGLGGGFLIVPTLNLLGVEMHHAVGTSSAAVVFTSLSSAIAYSRQR 66

Query: 401 RFPVPYALYFFALSIIAAFVG 421
           R      L   + ++I A++G
Sbjct: 67  RIHYKVGLLLASTAVIGAYIG 87


>gi|11497744|ref|NP_068966.1| hypothetical protein AF0127 [Archaeoglobus fulgidus DSM 4304]
 gi|2650522|gb|AAB91106.1| predicted coding region AF_0127 [Archaeoglobus fulgidus DSM 4304]
          Length = 475

 Score = 42.4 bits (98), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 29/118 (24%), Positives = 58/118 (49%), Gaps = 1/118 (0%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           + +F  A G   GM+ G +G GG F+L P  + +G P  ++ A+ +      + +     
Sbjct: 24  EAIFLLALGFFGGMLSGFIGSGGAFVLTPGMMSIGTPGPIAVASNMCHKFPKAMIGAYRR 83

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF-TLSFIIFVSALSL 453
           Y L++     A+     +I    VG  V K++ + LG     ++ +++F++ + A+SL
Sbjct: 84  YKLRQLDPKLAVIMAVSAIAGVQVGIQVQKQIYEALGETGTNLYVSIAFLVVLPAVSL 141


>gi|119385456|ref|YP_916512.1| hypothetical protein Pden_2732 [Paracoccus denitrificans PD1222]
 gi|119375223|gb|ABL70816.1| protein of unknown function DUF81 [Paracoccus denitrificans PD1222]
          Length = 307

 Score = 42.4 bits (98), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 51/99 (51%), Gaps = 17/99 (17%)

Query: 315 ALYKGRR----KIASKGDEG--------TKWRASQL----VFYCACGITAGMVGGLLGLG 358
           AL++ RR    K   +   G         K+RAS L    +     G   G++G L+G+G
Sbjct: 136 ALHRARRSGPIKPMRRHQHGWVHKWPLKMKFRASGLYISSIPVLLVGAAVGVLGALMGVG 195

Query: 359 GGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           GGFI+ P +   LG+P +V   T+LF ITF S+ + + +
Sbjct: 196 GGFIMVPAMIYLLGMPTKVVIGTSLFQITFLSAYTTLMH 234


>gi|25026873|ref|NP_736927.1| hypothetical protein CE0317 [Corynebacterium efficiens YS-314]
 gi|259506064|ref|ZP_05748966.1| integral membrane protein [Corynebacterium efficiens YS-314]
 gi|23492153|dbj|BAC17127.1| conserved hypothetical protein [Corynebacterium efficiens YS-314]
 gi|259166352|gb|EEW50906.1| integral membrane protein [Corynebacterium efficiens YS-314]
          Length = 284

 Score = 42.4 bits (98), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 64/137 (46%), Gaps = 5/137 (3%)

Query: 313 AVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELG 371
           A A+ +GR++ A  G++     A  L    A G+  G+V GL+G GGGF++ P L L  G
Sbjct: 113 ATAMLRGRKQRADTGEK----TALPLGKILAEGLAVGLVTGLVGAGGGFLVVPALALLGG 168

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           +P  V+  T+L  I+  S   +  Y        P  L     +++ A +G  +   + + 
Sbjct: 169 LPMPVAVGTSLLVISMKSFAGLAGYMTSVSLDWPLVLAVTGAALVGALIGARLTSVVPEQ 228

Query: 432 LGRASIIIFTLSFIIFV 448
             R    +F L   +F+
Sbjct: 229 ALRKGFGVFVLVMGVFI 245


>gi|206598109|gb|ACI15919.1| hypothetical protein [Bodo saltans]
          Length = 526

 Score = 42.4 bits (98), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 90/412 (21%), Positives = 168/412 (40%), Gaps = 39/412 (9%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDI----PVIDYDLALLFQP 160
           PM   +      S+  +S+  + G +   ++Y + +  PT       P+I+Y    +  P
Sbjct: 112 PMYAALTEMPLISAVGLSQAAICGQSAFNMIYQIPKTMPTQTPNEVRPLINYQYLAILLP 171

Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL-E 219
           L ++G  +G     +  DW+   LL +L   +  R   K    +K++   K E A  L +
Sbjct: 172 LSLIGTLLGSLGGRVVPDWLRLALLFLLLTSVLHRVIEKAKRQYKQDAAEKNEEAAALTK 231

Query: 220 LIVLGNGYQTEECKCDPEYLSN----DTTPEETREPKKSKVSIIEN-------------- 261
            +V     QT+    D   +S      TTP   R    + V + E               
Sbjct: 232 KVVQKPQQQTDANATDTGAVSMKPVLSTTPAAPRGQTTTTVVVAEEAVGSEGESHFDDND 291

Query: 262 ---------IYWKEFGLLVAVWAIILALQIAKNYTTTC-SVLYWVLNLLQIPVAGGV--- 308
                    +  +E  L+   + ++L   I ++  T C S  + +L ++ I V  G+   
Sbjct: 292 GPRSPPSLRMPRREISLMGFSFLVLLISNIVRSNFTVCGSTAHVLLFIIPIVVLAGLWWL 351

Query: 309 -SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 367
                AV L        S      +W    ++++    + AG    LLG+GGG +L  + 
Sbjct: 352 AKHMAAVTLANVASGFLSPDHMTFQWNQKTMIYFPVAAVVAGAGAALLGIGGGLVLSFVL 411

Query: 368 LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
            E G+ P+ +SAT+ FA    +S + +      +    YA  +F   +++  +GQ     
Sbjct: 412 FEAGLAPEEASATSGFATLLIASEAALLMLFQGQLVPDYATMYFVCGVVSTVLGQSGFMA 471

Query: 428 LIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE--YMGFDNIC 477
            I+   +  +I+  L+ II  S L L   GL +++++ +  +   + F +IC
Sbjct: 472 YIRWSKKRFLIVTALACIIGGSLLMLTSYGLYEVVEQYKRHDGSLLRFGHIC 523


>gi|145493226|ref|XP_001432609.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124399722|emb|CAK65212.1| unnamed protein product [Paramecium tetraurelia]
          Length = 454

 Score = 42.4 bits (98), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 33/151 (21%), Positives = 71/151 (47%), Gaps = 8/151 (5%)

Query: 78  IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
           +V+   IG F +      G+G  G  + +L L+  +  K ++ +++ M         + N
Sbjct: 33  LVIPISIGVFNSI-----GLGAQGFQIFILLLVFNYGKKDASLMNQPMQLAQYFLTSIEN 87

Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
           L++R    + P ++ +L LL  P+++ G +IG+        +   V+ ++  L+L    +
Sbjct: 88  LKKRKQP-NQPAVNENLVLLLGPMMISGCTIGLHSKDYIPTFFTIVITLISLLILMMTTY 146

Query: 198 LKGVETWKKETITKREAAKQLE--LIVLGNG 226
            K  +    E++  +E  K+ E   I +G+G
Sbjct: 147 KKTKQVRYLESLATKEQLKEFEDDFINVGDG 177


>gi|448239247|ref|YP_007403305.1| DUF81 family protein [Geobacillus sp. GHH01]
 gi|445208089|gb|AGE23554.1| DUF81 family protein [Geobacillus sp. GHH01]
          Length = 295

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 14/116 (12%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 397
           G  AG VG L GLGGG I+ P  L  G       + PQV+  T+L  I F+   S + Y 
Sbjct: 10  GFIAGTVGSLAGLGGGVIIVPALLFFGALGWLSAVTPQVAVGTSLVVIIFNGLSSTLSYM 69

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
             K       L F   S+  A +G  V   L       S   F+L F +F+ A+SL
Sbjct: 70  KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118


>gi|408379853|ref|ZP_11177444.1| permease [Agrobacterium albertimagni AOL15]
 gi|407746230|gb|EKF57755.1| permease [Agrobacterium albertimagni AOL15]
          Length = 305

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 63/126 (50%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSVSGAISHFRRGT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             +          ++ A +G ++   L + LG+  +II +L ++IF+   ++GG+ LA+ 
Sbjct: 78  LDMKLGTVLLIGGLVGATIGIYIFSWL-RRLGQLDLII-SLLYVIFLG--TVGGLMLAES 133

Query: 462 IKRIEH 467
           ++ +  
Sbjct: 134 LRALRR 139


>gi|337285350|ref|YP_004624824.1| membrane protein, conserved [Pyrococcus yayanosii CH1]
 gi|334901284|gb|AEH25552.1| hypothetical membrane protein, conserved [Pyrococcus yayanosii CH1]
          Length = 254

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 40/81 (49%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y A GI  G +  L GLGGGF++ P    LG+    +  T+  AI F+S  S   Y+   
Sbjct: 7   YFAVGIVIGTLAALFGLGGGFLIVPTLNLLGVEIHKAVGTSSAAIVFTSLSSAYAYHRQG 66

Query: 401 RFPVPYALYFFALSIIAAFVG 421
           R      L   + +I+ A++G
Sbjct: 67  RIHYKAGLLLASTAIVGAYIG 87


>gi|198284012|ref|YP_002220333.1| hypothetical protein Lferr_1905 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|198248533|gb|ACH84126.1| protein of unknown function DUF81 [Acidithiobacillus ferrooxidans
           ATCC 53993]
          Length = 305

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 44/84 (52%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+  G + GL G+GGGF++ PL + +G+PP V+  T    I  +S++    ++ +    +
Sbjct: 21  GLIVGFLSGLTGVGGGFLITPLLIFVGVPPLVAVGTGAAQIVGASAVGSYAHWRMGNVDM 80

Query: 405 PYALYFFALSIIAAFVGQHVLKKL 428
             A    A S +   +G HV + L
Sbjct: 81  RMAFILLAGSWMGGLLGVHVARIL 104


>gi|392955479|ref|ZP_10321010.1| putative permease with tauE sulfite export domain [Bacillus
           macauensis ZFHKF-1]
 gi|391878406|gb|EIT86995.1| putative permease with tauE sulfite export domain [Bacillus
           macauensis ZFHKF-1]
          Length = 290

 Score = 42.4 bits (98), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 62/131 (47%), Gaps = 12/131 (9%)

Query: 259 IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYW---VLNLLQIPVAGGVSAYEAVA 315
           ++NIYWK   + + +  + LA Q A       S  +W   V+ +L I + G    Y  V 
Sbjct: 63  LKNIYWKT-AITLGISGV-LATQAANPLVVYMSNHHWDKIVIPVLYIVLMG----YFCVQ 116

Query: 316 LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPP 374
           L   RRK   + D G    A  L      G  +G +   LG+GGGF++ PL +  L I P
Sbjct: 117 LLVKRRK--KEQDRGATRHAFSLPKTLLIGFVSGFLSSTLGVGGGFVMVPLMISILKIEP 174

Query: 375 QVSSATALFAI 385
           + +  T+L ++
Sbjct: 175 RKAVGTSLVSV 185


>gi|84500291|ref|ZP_00998557.1| membrane protein [Oceanicola batsensis HTCC2597]
 gi|84392225|gb|EAQ04493.1| membrane protein [Oceanicola batsensis HTCC2597]
          Length = 306

 Score = 42.0 bits (97), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 36/53 (67%), Gaps = 1/53 (1%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           G++ G+ G+GGGF++ PL   +GIPP V+ AT+   I  +SS+S V  +L +R
Sbjct: 25  GVLSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIV-ASSVSAVLAHLRRR 76


>gi|420205978|ref|ZP_14711489.1| hypothetical protein HMPREF9977_00722 [Staphylococcus epidermidis
           NIHLM008]
 gi|394278651|gb|EJE22965.1| hypothetical protein HMPREF9977_00722 [Staphylococcus epidermidis
           NIHLM008]
          Length = 253

 Score = 42.0 bits (97), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 12/119 (10%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYY---------LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 441
           S +++          + K FP+ +     A +IIA ++  ++LK LI I+  + ++I+T
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGA-AIIATYIPANILKPLI-IIALSLVLIYT 117


>gi|134299284|ref|YP_001112780.1| hypothetical protein Dred_1425 [Desulfotomaculum reducens MI-1]
 gi|134051984|gb|ABO49955.1| protein of unknown function DUF81 [Desulfotomaculum reducens MI-1]
          Length = 426

 Score = 42.0 bits (97), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 7/139 (5%)

Query: 336 SQLVFY--CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
           + ++FY   A GI AG++ G +G GGGF++ P  + LG+   ++  T  F I   + M  
Sbjct: 70  NPMMFYGSMAVGICAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMGT 129

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
           V +  L    V  A+ F   S I    G  + + L  +    S  I +L +++      L
Sbjct: 130 VIHKKLGNVNVALAIAFLVGSGIGVTAGGTLNRALFNMNPVLSDFIISLVYVVM-----L 184

Query: 454 GGVGLAKMIKRIEHKEYMG 472
           G +G   M   I++K   G
Sbjct: 185 GFLGFYSMYDFIKNKNTSG 203


>gi|242242978|ref|ZP_04797423.1| membrane protein [Staphylococcus epidermidis W23144]
 gi|242233579|gb|EES35891.1| membrane protein [Staphylococcus epidermidis W23144]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118


>gi|127513412|ref|YP_001094609.1| hypothetical protein Shew_2484 [Shewanella loihica PV-4]
 gi|126638707|gb|ABO24350.1| protein of unknown function DUF81 [Shewanella loihica PV-4]
          Length = 257

 Score = 42.0 bits (97), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 41/80 (51%), Gaps = 5/80 (6%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G  AG +  + G GGG +  P  L +GIPP  +  T   A +F SSM+   YY  K+
Sbjct: 14  AATGFIAGFIDAIAG-GGGLLSIPALLTIGIPPHTALGTNKLAASFGSSMAAFTYY-RKQ 71

Query: 402 FPVPYALYFFALSIIAAFVG 421
           F  P    F+  + IA F+G
Sbjct: 72  FFTP---RFWYHASIATFIG 88


>gi|386759198|ref|YP_006232414.1| Sulfite exporter TauE/SafE [Bacillus sp. JS]
 gi|384932480|gb|AFI29158.1| Sulfite exporter TauE/SafE [Bacillus sp. JS]
          Length = 258

 Score = 42.0 bits (97), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 11/114 (9%)

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQV 376
           KG+ +   K     KW AS L F        G V G+LG GG FIL P  L  L IP ++
Sbjct: 128 KGQEQSEDKEVIFNKWLASSLAFII------GGVSGVLGAGGAFILVPFMLSILNIPVRI 181

Query: 377 SSATALFAITFSSSM--SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           + A++L AITF SS+  +V +    +   VP AL     S+IA+ +G  V +K+
Sbjct: 182 TVASSL-AITFLSSIGATVGKVITGQVLFVP-ALVLMIASLIASPIGASVGQKV 233


>gi|418614254|ref|ZP_13177232.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
 gi|374820914|gb|EHR84988.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118


>gi|375010109|ref|YP_004983742.1| hypothetical protein [Geobacillus thermoleovorans CCB_US3_UF5]
 gi|359288958|gb|AEV20642.1| hypothetical protein GTCCBUS3UF5_33410 [Geobacillus thermoleovorans
           CCB_US3_UF5]
          Length = 300

 Score = 42.0 bits (97), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 14/116 (12%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 397
           G  AG VG L GLGGG I+ P  L  G       + PQV+  T+L  I F+   S + Y 
Sbjct: 10  GFIAGTVGSLAGLGGGVIIVPSLLFFGALGWLSAVTPQVAVGTSLVVIIFNGLSSTLSYM 69

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
             K       L F   S+  A +G  V   L       S   F+L F +F+ A+SL
Sbjct: 70  KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118


>gi|56421519|ref|YP_148837.1| hypothetical protein GK2984 [Geobacillus kaustophilus HTA426]
 gi|56381361|dbj|BAD77269.1| hypothetical conserved protein [Geobacillus kaustophilus HTA426]
          Length = 300

 Score = 42.0 bits (97), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 14/116 (12%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 397
           G  AG VG L GLGGG I+ P  L  G       + PQV+  T+L  I F+   S + Y 
Sbjct: 10  GFIAGTVGSLAGLGGGVIIVPSLLFFGALGWLSAVTPQVAVGTSLVVIIFNGLSSTLSYM 69

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
             K       L F   S+  A +G  V   L       S   F+L F +F+ A+SL
Sbjct: 70  KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118


>gi|418634984|ref|ZP_13197372.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
 gi|420190314|ref|ZP_14696257.1| hypothetical protein HMPREF9984_06342 [Staphylococcus epidermidis
           NIHLM037]
 gi|420192642|ref|ZP_14698500.1| hypothetical protein HMPREF9983_06044 [Staphylococcus epidermidis
           NIHLM023]
 gi|420204616|ref|ZP_14710174.1| hypothetical protein HMPREF9978_05542 [Staphylococcus epidermidis
           NIHLM015]
 gi|374835742|gb|EHR99339.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
 gi|394258759|gb|EJE03633.1| hypothetical protein HMPREF9984_06342 [Staphylococcus epidermidis
           NIHLM037]
 gi|394260815|gb|EJE05619.1| hypothetical protein HMPREF9983_06044 [Staphylococcus epidermidis
           NIHLM023]
 gi|394273626|gb|EJE18057.1| hypothetical protein HMPREF9978_05542 [Staphylococcus epidermidis
           NIHLM015]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118


>gi|283778256|ref|YP_003369011.1| hypothetical protein Psta_0463 [Pirellula staleyi DSM 6068]
 gi|283436709|gb|ADB15151.1| protein of unknown function DUF81 [Pirellula staleyi DSM 6068]
          Length = 275

 Score = 42.0 bits (97), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 73/159 (45%), Gaps = 26/159 (16%)

Query: 93  SVGGVGGGG-IF-VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVI 150
           S+G  GGGG IF VP+L   +GFD++ +A +S   V   ++  ++    QR        +
Sbjct: 16  SLGLTGGGGAIFAVPLLVYGLGFDSRQAATMSLVTVGITSLIGLLQKAAQRQ-------L 68

Query: 151 DYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT 210
           ++ + LLF     LG  IGV       + +   L   L LV++        + W      
Sbjct: 69  EFKVGLLFAIAGALGSPIGVMLAAQLHETLRLALFAGLMLVIA-------FQMW------ 115

Query: 211 KREAAKQLELIVLGNGYQTEE----CKCDPEYLSNDTTP 245
           +R +  QLEL +  +   TE+    C+ DPE +   T+P
Sbjct: 116 RRASTVQLELPLAWHRTPTEQRAVTCQRDPEGILRITSP 154


>gi|255036228|ref|YP_003086849.1| hypothetical protein Dfer_2466 [Dyadobacter fermentans DSM 18053]
 gi|254948984|gb|ACT93684.1| protein of unknown function DUF81 [Dyadobacter fermentans DSM
           18053]
          Length = 293

 Score = 42.0 bits (97), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 26/108 (24%), Positives = 47/108 (43%)

Query: 323 IASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 382
           I+   ++ T++       Y   G+ A +V G LG+  G       L +G+ P VSSA+  
Sbjct: 37  ISVSSEQVTEFLTGDFALYVLVGLAAQLVDGALGMAYGVTSNSFLLSVGVTPAVSSASVH 96

Query: 383 FAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           FA  F++  S + ++  K              ++ A  G ++L  +I 
Sbjct: 97  FAEMFTTGASAISHFRFKNINKKLFKSLLIPGVLGAIAGAYLLSDVID 144


>gi|390960289|ref|YP_006424123.1| hypothetical protein CL1_0114 [Thermococcus sp. CL1]
 gi|390518597|gb|AFL94329.1| hypothetical protein CL1_0114 [Thermococcus sp. CL1]
          Length = 238

 Score = 42.0 bits (97), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 46/85 (54%), Gaps = 1/85 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
           G  +G+  GLLG+ GG I  PLF EL GIP + +  T+  A+ F+S  +   +Y L +  
Sbjct: 132 GAFSGLASGLLGVSGGIINVPLFHELAGIPVKYAVGTSSLALFFTSLTAAYAHYTLGQVD 191

Query: 404 VPYALYFFALSIIAAFVGQHVLKKL 428
           +  A       +  +FVG H++ ++
Sbjct: 192 ITTAAMIVPGLMAGSFVGAHLVSRI 216


>gi|350266812|ref|YP_004878119.1| hypothetical protein GYO_2879 [Bacillus subtilis subsp. spizizenii
           TU-B-10]
 gi|349599699|gb|AEP87487.1| YrkJ [Bacillus subtilis subsp. spizizenii TU-B-10]
          Length = 259

 Score = 42.0 bits (97), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 62/115 (53%), Gaps = 8/115 (6%)

Query: 322 KIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSAT 380
           K  ++G++    R  + +         G+  G++G GG F+L PL L  L IP +++ AT
Sbjct: 127 KTGTEGNQSDHIRFPK-ILTSIMTFVVGLASGIVGAGGAFLLIPLMLVILKIPARITIAT 185

Query: 381 ALFAITFSSSMSVVEYYL-LKRFPVPYALYFFALSIIAA----FVGQHVLKKLIK 430
           +L A+TF SS+      L + + P+  AL   A S+IAA    F+G+ V  K+++
Sbjct: 186 SL-AVTFISSIGTTATKLVIGQVPMMPALVVMAASLIAAPLGVFIGKKVNAKVLQ 239


>gi|145494802|ref|XP_001433395.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400512|emb|CAK65998.1| unnamed protein product [Paramecium tetraurelia]
          Length = 466

 Score = 42.0 bits (97), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 77/378 (20%), Positives = 156/378 (41%), Gaps = 42/378 (11%)

Query: 109 LIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGIS 167
           +I+ +    +  ++ C++ GG +   +  + +++P     P+I+Y ++++F   +    +
Sbjct: 72  MILNYTQSEATHIAYCLMFGGTLLNTILLMFEKNPEDQRRPIINYRISIIFNLAVPFATN 131

Query: 168 IGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT----KREAAKQLELIVL 223
           +G +    F   + T++L  LFL        K  +  K E ++    ++  ++ L L   
Sbjct: 132 LGSSL-ASFLPQLYTLILQELFLFAVAPILWKKAQKAKSEELSTPDKQKNESQALNLDGS 190

Query: 224 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL----- 278
               + E  K + +  S   T       K++  ++      +   +L  + A+ +     
Sbjct: 191 NTQQKIELQKIEEQQYSVSLTSFSVENSKQNSNTLYYQFKQETENILPFMPALFILGSFG 250

Query: 279 --ALQIAKNYTTTCSVLY-------WVLNLLQIPVAGGVSAYEAVALYKGRRK------I 323
              + I    T      Y       W  + + + +      ++ VA   G R+      +
Sbjct: 251 LNQIFIQMRSTNPSKPSYVGIYDCTWQNDFMILILIIANVLFDYVAWNFGSRQEKYFDQL 310

Query: 324 ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
               +E      S+     A G  AG V G LG+G GF++ P  L  G+ P+ +SAT+ F
Sbjct: 311 NYLPNERYFTPISRFFKIYAGGFGAGFVSGFLGMGAGFVMVPTLLYSGLIPRCASATSAF 370

Query: 384 AITFSSSMS-----VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASII 438
            I F  S++     +  +YL ++      L F  L++I    G  V+ K+  IL     I
Sbjct: 371 -IYFMISLNNLITLLTNHYLDQQM----ILLFTGLAVI----GGSVITKIGYILLSKYKI 421

Query: 439 IFTLSFIIFVSALSLGGV 456
            +T+  I+F  AL +  +
Sbjct: 422 GYTVILIVF--ALDIANI 437


>gi|420202259|ref|ZP_14707852.1| hypothetical protein HMPREF9979_05842 [Staphylococcus epidermidis
           NIHLM018]
 gi|394269667|gb|EJE14197.1| hypothetical protein HMPREF9979_05842 [Staphylococcus epidermidis
           NIHLM018]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSVMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 441
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117


>gi|86138958|ref|ZP_01057529.1| membrane protein [Roseobacter sp. MED193]
 gi|85824189|gb|EAQ44393.1| membrane protein [Roseobacter sp. MED193]
          Length = 306

 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 52/96 (54%), Gaps = 2/96 (2%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  +   +   +    
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPVVAVATEANQIVASSFSGVLAHFRRRTVDIKMGVVLQV 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
             +I A +G  V   L K LG+  +++ TL +++F+
Sbjct: 89  GGLIGAGLGVVVFNYL-KSLGQVDLLV-TLCYVVFL 122


>gi|420174532|ref|ZP_14680982.1| hypothetical protein HMPREF9990_02314 [Staphylococcus epidermidis
           NIHLM061]
 gi|394245037|gb|EJD90364.1| hypothetical protein HMPREF9990_02314 [Staphylococcus epidermidis
           NIHLM061]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118


>gi|319789078|ref|YP_004150711.1| hypothetical protein Theam_0096 [Thermovibrio ammonificans HB-1]
 gi|317113580|gb|ADU96070.1| protein of unknown function DUF81 [Thermovibrio ammonificans HB-1]
          Length = 262

 Score = 42.0 bits (97), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 8/71 (11%)

Query: 345 GITAGMVGGLLGLGGGFILGPL----FLELGIPPQVS----SATALFAITFSSSMSVVEY 396
           GI AG V GL G+GGG IL PL    F ++G+P ++S     AT+L  I  S+  S V +
Sbjct: 12  GIAAGFVAGLFGIGGGVILVPLFWFFFQKIGVPQELSFKLAVATSLSVIAVSTLFSTVSH 71

Query: 397 YLLKRFPVPYA 407
            L   +P+  A
Sbjct: 72  ILKGSYPLKEA 82


>gi|345020875|ref|ZP_08784488.1| hypothetical protein OTW25_06042 [Ornithinibacillus scapharcae
           TW25]
          Length = 271

 Score = 42.0 bits (97), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 5/88 (5%)

Query: 346 ITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           +  G + GL G+GGG I+ P + L  G P  +++AT++F I F S M    +  L     
Sbjct: 161 LVVGTLSGLFGIGGGSIMVPAMILLFGFPVHIATATSMFMIFFVSMMGASSHIFLGHIAW 220

Query: 405 PYALYFFALSIIAAFVGQHVLKKLIKIL 432
            Y L+F    I  A++G  V  K+ ++L
Sbjct: 221 EYVLFF----IPGAWIGGKVGAKISQLL 244


>gi|89053334|ref|YP_508785.1| hypothetical protein Jann_0843 [Jannaschia sp. CCS1]
 gi|88862883|gb|ABD53760.1| protein of unknown function DUF81 [Jannaschia sp. CCS1]
          Length = 307

 Score = 42.0 bits (97), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 49/97 (50%), Gaps = 2/97 (2%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ +   K   +   L    
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHLKRKTVDLKMGLVLLI 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 449
             +I A +G  V   L  I G+  +++  L +++F+ 
Sbjct: 89  GGLIGAAIGVQVFAALTAI-GQVDLLV-RLCYVVFLG 123


>gi|408375545|ref|ZP_11173209.1| hypothetical protein A11A3_15572 [Alcanivorax hongdengensis A-11-3]
 gi|407764566|gb|EKF73039.1| hypothetical protein A11A3_15572 [Alcanivorax hongdengensis A-11-3]
          Length = 264

 Score = 41.6 bits (96), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF--AITFSSSM 391
            ++ Y A G   G+  GL G+GGG ++ PL L  G PPQ +  T L   AIT SS M
Sbjct: 7   DMLLYIAAGGLVGLAIGLTGVGGGSLMTPLLLLFGFPPQTAIGTDLLYAAITKSSGM 63


>gi|261313116|ref|ZP_05952313.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10]
 gi|261302142|gb|EEY05639.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10]
          Length = 306

 Score = 41.6 bits (96), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 30/133 (22%), Positives = 66/133 (49%), Gaps = 4/133 (3%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            ++     G   G + GL G+GG F++ PL +   IPP ++ AT    +  SS    + +
Sbjct: 13  NMLVLLGMGAAVGFLSGLFGVGGSFLITPLLIFYNIPPAIAVATGANQVIASSVSGALAH 72

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 456
           +  +   +   L+  A  I+ + +G  V   L + LG+  +I+ ++ ++ F+   ++GG+
Sbjct: 73  FKRRTLDIKLGLFLVAGGILGSLIGIFVFSWL-RDLGQLDLIV-SILYVFFLG--TIGGL 128

Query: 457 GLAKMIKRIEHKE 469
            L + ++ +   +
Sbjct: 129 MLVESVQALRRAK 141


>gi|212223719|ref|YP_002306955.1| membrane protein [Thermococcus onnurineus NA1]
 gi|212008676|gb|ACJ16058.1| hypothetical membrane protein, conserved [Thermococcus onnurineus
           NA1]
          Length = 254

 Score = 41.6 bits (96), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 52/103 (50%), Gaps = 4/103 (3%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y   G+  G++  L GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y   K
Sbjct: 7   YFTVGVFIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSALAYSRQK 66

Query: 401 RFPVPYALYFFALSIIAAFVG----QHVLKKLIKILGRASIII 439
           R      L   + ++I A++G      +   ++K++  A++II
Sbjct: 67  RIHYKIGLLLASTAVIGAYIGAWMTSFISAGMLKVIFGATLII 109


>gi|404416492|ref|ZP_10998312.1| hypothetical protein SARL_01356 [Staphylococcus arlettae CVD059]
 gi|403491149|gb|EJY96674.1| hypothetical protein SARL_01356 [Staphylococcus arlettae CVD059]
          Length = 297

 Score = 41.6 bits (96), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 53/112 (47%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           +L+ +   G  A M+ G LG+G G     + L  G  P + SAT  F+   +++ S V +
Sbjct: 3   KLLIFALAGFFAQMIDGSLGMGFGATSSSILLAAGAAPAIVSATIHFSEIATTAASGVSH 62

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
           Y  K      A+       IAAF+G  VL  +   L +  I +F L+  +F+
Sbjct: 63  YKFKNVDKRMAIKLAIPGAIAAFIGSAVLSNIHSDLIKPFISLFLLTMGLFI 114


>gi|27468308|ref|NP_764945.1| hypothetical protein SE1390 [Staphylococcus epidermidis ATCC 12228]
 gi|57867226|ref|YP_188850.1| hypothetical protein SERP1278 [Staphylococcus epidermidis RP62A]
 gi|251811104|ref|ZP_04825577.1| membrane protein [Staphylococcus epidermidis BCM-HMP0060]
 gi|282875864|ref|ZP_06284731.1| putative membrane protein [Staphylococcus epidermidis SK135]
 gi|293366339|ref|ZP_06613019.1| conserved hypothetical protein [Staphylococcus epidermidis
           M23864:W2(grey)]
 gi|417656164|ref|ZP_12305855.1| putative membrane protein [Staphylococcus epidermidis VCU028]
 gi|417658659|ref|ZP_12308280.1| putative membrane protein [Staphylococcus epidermidis VCU045]
 gi|417908781|ref|ZP_12552538.1| putative membrane protein [Staphylococcus epidermidis VCU037]
 gi|417911376|ref|ZP_12555083.1| putative membrane protein [Staphylococcus epidermidis VCU105]
 gi|417914293|ref|ZP_12557945.1| putative membrane protein [Staphylococcus epidermidis VCU109]
 gi|418603691|ref|ZP_13167072.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
 gi|418606180|ref|ZP_13169474.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
 gi|418610089|ref|ZP_13173215.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
 gi|418612531|ref|ZP_13175566.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
 gi|418618135|ref|ZP_13181014.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
 gi|418623251|ref|ZP_13185969.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
 gi|418625217|ref|ZP_13187873.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
 gi|418627260|ref|ZP_13189839.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
 gi|418629290|ref|ZP_13191802.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU127]
 gi|418664063|ref|ZP_13225560.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
 gi|419769208|ref|ZP_14295304.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-250]
 gi|419771352|ref|ZP_14297406.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-K]
 gi|420166604|ref|ZP_14673287.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM088]
 gi|420170403|ref|ZP_14676964.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM070]
 gi|420187092|ref|ZP_14693115.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM039]
 gi|420195366|ref|ZP_14701159.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM021]
 gi|420196947|ref|ZP_14702681.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM020]
 gi|420209206|ref|ZP_14714644.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM003]
 gi|420215110|ref|ZP_14720382.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH05005]
 gi|420216860|ref|ZP_14722054.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH05001]
 gi|420220645|ref|ZP_14725604.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH04008]
 gi|420221508|ref|ZP_14726438.1| hypothetical protein HMPREF1390_00623 [Staphylococcus epidermidis
           NIH08001]
 gi|420225897|ref|ZP_14730724.1| hypothetical protein HMPREF1389_10265 [Staphylococcus epidermidis
           NIH06004]
 gi|420227493|ref|ZP_14732261.1| hypothetical protein HMPREF1388_05408 [Staphylococcus epidermidis
           NIH05003]
 gi|420229811|ref|ZP_14734513.1| hypothetical protein HMPREF1387_05447 [Staphylococcus epidermidis
           NIH04003]
 gi|420232217|ref|ZP_14736858.1| hypothetical protein HMPREF1386_05683 [Staphylococcus epidermidis
           NIH051668]
 gi|420234863|ref|ZP_14739423.1| hypothetical protein HMPREF1385_05672 [Staphylococcus epidermidis
           NIH051475]
 gi|421606803|ref|ZP_16048057.1| hypothetical protein B440_00608 [Staphylococcus epidermidis
           AU12-03]
 gi|27315854|gb|AAO04989.1|AE016748_223 conserved hypothetical protein [Staphylococcus epidermidis ATCC
           12228]
 gi|57637884|gb|AAW54672.1| conserved hypothetical protein [Staphylococcus epidermidis RP62A]
 gi|251805324|gb|EES57981.1| membrane protein [Staphylococcus epidermidis BCM-HMP0060]
 gi|281294889|gb|EFA87416.1| putative membrane protein [Staphylococcus epidermidis SK135]
 gi|291319577|gb|EFE59943.1| conserved hypothetical protein [Staphylococcus epidermidis
           M23864:W2(grey)]
 gi|329737414|gb|EGG73668.1| putative membrane protein [Staphylococcus epidermidis VCU028]
 gi|329737501|gb|EGG73754.1| putative membrane protein [Staphylococcus epidermidis VCU045]
 gi|341652991|gb|EGS76765.1| putative membrane protein [Staphylococcus epidermidis VCU109]
 gi|341653699|gb|EGS77466.1| putative membrane protein [Staphylococcus epidermidis VCU105]
 gi|341656142|gb|EGS79865.1| putative membrane protein [Staphylococcus epidermidis VCU037]
 gi|374405505|gb|EHQ76437.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
 gi|374407009|gb|EHQ77878.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
 gi|374409263|gb|EHQ80062.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
 gi|374410953|gb|EHQ81682.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
 gi|374816377|gb|EHR80581.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
 gi|374818856|gb|EHR82999.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
 gi|374824144|gb|EHR88117.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
 gi|374825718|gb|EHR89642.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
 gi|374829935|gb|EHR93727.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
 gi|374834371|gb|EHR98017.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU127]
 gi|383358277|gb|EID35736.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-250]
 gi|383361578|gb|EID38948.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-K]
 gi|394233474|gb|EJD79078.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM088]
 gi|394240741|gb|EJD86164.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM070]
 gi|394256839|gb|EJE01765.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM039]
 gi|394263320|gb|EJE08056.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM021]
 gi|394266921|gb|EJE11539.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM020]
 gi|394279434|gb|EJE23742.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM003]
 gi|394282559|gb|EJE26749.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH05005]
 gi|394285998|gb|EJE30064.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH04008]
 gi|394290606|gb|EJE34460.1| hypothetical protein HMPREF1390_00623 [Staphylococcus epidermidis
           NIH08001]
 gi|394291222|gb|EJE35046.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH05001]
 gi|394293331|gb|EJE37054.1| hypothetical protein HMPREF1389_10265 [Staphylococcus epidermidis
           NIH06004]
 gi|394297117|gb|EJE40729.1| hypothetical protein HMPREF1388_05408 [Staphylococcus epidermidis
           NIH05003]
 gi|394298610|gb|EJE42175.1| hypothetical protein HMPREF1387_05447 [Staphylococcus epidermidis
           NIH04003]
 gi|394301540|gb|EJE44996.1| hypothetical protein HMPREF1386_05683 [Staphylococcus epidermidis
           NIH051668]
 gi|394304106|gb|EJE47516.1| hypothetical protein HMPREF1385_05672 [Staphylococcus epidermidis
           NIH051475]
 gi|406657581|gb|EKC83966.1| hypothetical protein B440_00608 [Staphylococcus epidermidis
           AU12-03]
          Length = 254

 Score = 41.6 bits (96), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 441
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117


>gi|20093583|ref|NP_613430.1| permease [Methanopyrus kandleri AV19]
 gi|19886439|gb|AAM01360.1| Predicted permease [Methanopyrus kandleri AV19]
          Length = 252

 Score = 41.6 bits (96), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 44/98 (44%), Gaps = 8/98 (8%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            ++ Y   G   G+V G+ G+GGGF+L PL    G+P  ++  T L AI+         +
Sbjct: 6   SMMLYLGIGGLVGLVSGMFGVGGGFLLVPLLNSTGMPMHLALGTTLLAISLGGFTGAYRH 65

Query: 397 YLLKRFPVPYALYFFALSIIAAFVG--------QHVLK 426
                  V  A  F   +I+ A VG        +HVLK
Sbjct: 66  LQEGNVHVDAAPIFGLSAIVGAQVGSYLACLTPEHVLK 103


>gi|254451969|ref|ZP_05065406.1| membrane protein [Octadecabacter arcticus 238]
 gi|198266375|gb|EDY90645.1| membrane protein [Octadecabacter arcticus 238]
          Length = 319

 Score = 41.6 bits (96), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 49/95 (51%), Gaps = 2/95 (2%)

Query: 354 LLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFAL 413
           + G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ +   K   +         
Sbjct: 44  MFGVGGGFLMTPLLFFIGIPPAVAVATGAVQIVASSFSGVLAHLRRKTVDLKMGTVLLVG 103

Query: 414 SIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
            +I A VG  VL   +K LG+  +++  L +++F+
Sbjct: 104 GLIGAAVGI-VLFNYLKFLGQVDLLV-NLCYVVFL 136


>gi|224476019|ref|YP_002633625.1| hypothetical protein Sca_0526 [Staphylococcus carnosus subsp.
           carnosus TM300]
 gi|222420626|emb|CAL27440.1| putative membrane protein [Staphylococcus carnosus subsp. carnosus
           TM300]
          Length = 274

 Score = 41.6 bits (96), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 320 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSS 378
           R  IA+ G            F+   GI  G++ GL G+GGG ++ PL L     PP V+ 
Sbjct: 141 RTYIANDGTTYHYHVPPVFAFFATFGI--GILTGLFGIGGGALMTPLMLIVFRFPPHVAV 198

Query: 379 ATALFAITFSSSMS 392
            T++  I FSS MS
Sbjct: 199 GTSMMMIFFSSVMS 212


>gi|11499103|ref|NP_070337.1| hypothetical protein AF1508 [Archaeoglobus fulgidus DSM 4304]
 gi|2649059|gb|AAB89742.1| predicted coding region AF_1508 [Archaeoglobus fulgidus DSM 4304]
          Length = 480

 Score = 41.6 bits (96), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 32/141 (22%), Positives = 65/141 (46%), Gaps = 7/141 (4%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           + +F  A G   GM+ G +G GG F+L P  + +G P  ++ A+ +      + +     
Sbjct: 24  EALFLLALGFFGGMLSGFIGTGGAFVLTPGMMSIGTPGPIAVASNMCHKFPKAMIGAYRR 83

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF-TLSFIIFVSALSLGG 455
           Y L++     AL     +I    VG  V K + ++LG     ++ +++F+I +  ++   
Sbjct: 84  YKLRQLDPKLALLMATSAIFGVQVGIQVQKHIYELLGPTGTNLYVSVAFLIVLPTVA--- 140

Query: 456 VGLAKMIKRIEHKEYMGFDNI 476
              A +I+ +     MG  ++
Sbjct: 141 ---AVLIRDVVKARRMGISDL 158


>gi|416125430|ref|ZP_11596028.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
 gi|319401027|gb|EFV89246.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
          Length = 254

 Score = 41.6 bits (96), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  +  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118


>gi|205375197|ref|ZP_03227988.1| hypothetical protein Bcoam_19517 [Bacillus coahuilensis m4-4]
          Length = 273

 Score = 41.6 bits (96), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 43/92 (46%), Gaps = 1/92 (1%)

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQV 376
           KG+++     D  T     Q V   A     G+  GL G+GGG ++ P + L    PP V
Sbjct: 135 KGKKRSFQDADGNTYEYHIQPVIAIASAFIVGLTSGLFGIGGGSLMVPAMILLFLFPPHV 194

Query: 377 SSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           + AT++F I  S+ +S V +         YAL
Sbjct: 195 AVATSMFMIFLSAIVSSVTHIAFGNVDWLYAL 226


>gi|149909081|ref|ZP_01897739.1| hypothetical protein PE36_17470 [Moritella sp. PE36]
 gi|149807832|gb|EDM67777.1| hypothetical protein PE36_17470 [Moritella sp. PE36]
          Length = 259

 Score = 41.6 bits (96), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 53/120 (44%), Gaps = 14/120 (11%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           S  +  C+  + AG +  + G GGG +  P  L  G+PP ++  T   A +F S  + + 
Sbjct: 9   SVFIILCSVALVAGFIDAIAG-GGGLLTVPALLSAGLPPHLALGTNKLAASFGSCAAAIT 67

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHV--------LKKLIKILGRASIIIFTLSFIIF 447
           +Y  + F   +    F  + I A  G  V        L KL+ +     IIIFT  + IF
Sbjct: 68  FYKKRLFNPKFWWRSFVFTAIGAICGTLVVNVISTAWLDKLLPV-----IIIFTAIYSIF 122


>gi|420173231|ref|ZP_14679726.1| hypothetical protein HMPREF9991_07281 [Staphylococcus epidermidis
           NIHLM067]
 gi|420183366|ref|ZP_14689497.1| hypothetical protein HMPREF9987_08245 [Staphylococcus epidermidis
           NIHLM049]
 gi|394240409|gb|EJD85833.1| hypothetical protein HMPREF9991_07281 [Staphylococcus epidermidis
           NIHLM067]
 gi|394249078|gb|EJD94299.1| hypothetical protein HMPREF9987_08245 [Staphylococcus epidermidis
           NIHLM049]
          Length = 254

 Score = 41.6 bits (96), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 441
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117


>gi|328952971|ref|YP_004370305.1| hypothetical protein Desac_1265 [Desulfobacca acetoxidans DSM
           11109]
 gi|328453295|gb|AEB09124.1| protein of unknown function DUF81 [Desulfobacca acetoxidans DSM
           11109]
          Length = 311

 Score = 41.6 bits (96), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 27/39 (69%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 380
              G+  G + G+ G+GGGF+L PL + +GIPP V++A+
Sbjct: 18  IGLGVLVGFLSGMFGVGGGFLLTPLMMMVGIPPAVAAAS 56


>gi|367478122|ref|ZP_09477444.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 285]
 gi|365269682|emb|CCD89912.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 285]
          Length = 331

 Score = 41.6 bits (96), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 58/115 (50%), Gaps = 4/115 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+  + 
Sbjct: 41  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 100

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 456
                AL       +   +G      L++ LG+  ++I  LS+++ ++  S+GGV
Sbjct: 101 IDPSLALVLMIGGTLGTALGVATF-TLLRSLGQLDLMI-ALSYVVLLT--SVGGV 151


>gi|398828136|ref|ZP_10586338.1| putative permease [Phyllobacterium sp. YR531]
 gi|398218854|gb|EJN05356.1| putative permease [Phyllobacterium sp. YR531]
          Length = 316

 Score = 41.6 bits (96), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 64/127 (50%), Gaps = 4/127 (3%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
             G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++     
Sbjct: 25  GMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPAVAVATGANQVIASSFSGALTHFKRGTL 84

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMI 462
            V          I  A +G +V   L++ LG+  +I+ +L +++F+S  ++GG+ L + +
Sbjct: 85  DVKLGSLLVIGGIFGAAIGIYVF-VLLRELGQLDLIV-SLLYVVFLS--TIGGLMLYESV 140

Query: 463 KRIEHKE 469
           K +   +
Sbjct: 141 KAMRRTK 147


>gi|161528556|ref|YP_001582382.1| hypothetical protein Nmar_1048 [Nitrosopumilus maritimus SCM1]
 gi|160339857|gb|ABX12944.1| protein of unknown function DUF81 [Nitrosopumilus maritimus SCM1]
          Length = 257

 Score = 41.6 bits (96), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 35/58 (60%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            G  AG++G ++GLGGG I+ P+   LG PP  +++ +LFA   ++  S + Y   KR
Sbjct: 10  LGFAAGVLGSMIGLGGGVIVVPVLTFLGFPPTAAASNSLFAALSNAVASTMSYSKQKR 67


>gi|348027031|ref|YP_004766836.1| hypothetical protein MELS_1790 [Megasphaera elsdenii DSM 20460]
 gi|341823085|emb|CCC74009.1| putative membrane protein [Megasphaera elsdenii DSM 20460]
          Length = 262

 Score = 41.6 bits (96), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 41/79 (51%), Gaps = 9/79 (11%)

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 390
           T   AS L+F  A G  AG V  + G GGG I  P+ L  G+PP ++  T  F+ TF + 
Sbjct: 8   TTLSASLLLFIMAGGFLAGFVDSIAG-GGGLISLPVLLAAGLPPHLAIGTNKFSATFGAV 66

Query: 391 MSVVEYY--------LLKR 401
           MS  +++        LLKR
Sbjct: 67  MSAWQFWRAGKVDMGLLKR 85


>gi|376296158|ref|YP_005167388.1| hypothetical protein DND132_1375 [Desulfovibrio desulfuricans
           ND132]
 gi|323458719|gb|EGB14584.1| protein of unknown function DUF81 [Desulfovibrio desulfuricans
           ND132]
          Length = 309

 Score = 41.6 bits (96), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 38/62 (61%), Gaps = 2/62 (3%)

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           LVF  A G   G++ G+ G+GGGF++ PL +  GIPP V++A+    I  +S+   + +Y
Sbjct: 17  LVF--ALGGFVGLLSGIFGVGGGFLMTPLLIMFGIPPTVAAASDSNQIVGASTSGCLAHY 74

Query: 398 LL 399
            L
Sbjct: 75  RL 76


>gi|443327177|ref|ZP_21055809.1| putative permease [Xenococcus sp. PCC 7305]
 gi|442793208|gb|ELS02663.1| putative permease [Xenococcus sp. PCC 7305]
          Length = 270

 Score = 41.6 bits (96), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 47/89 (52%)

Query: 339 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYL 398
           VF    G  +G++ G LG+GGG +L PL + LG  P  + AT+  AIT ++    ++ + 
Sbjct: 7   VFIGISGFFSGILAGFLGIGGGTVLVPLLVALGYEPIQAVATSALAITITALSGTLQNWR 66

Query: 399 LKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           +    +   LY    +++AA VG +   K
Sbjct: 67  MGYIKLQSILYLGLPALLAAQVGVYFADK 95


>gi|420178011|ref|ZP_14684345.1| hypothetical protein HMPREF9989_07051 [Staphylococcus epidermidis
           NIHLM057]
 gi|420181101|ref|ZP_14687307.1| hypothetical protein HMPREF9988_09900 [Staphylococcus epidermidis
           NIHLM053]
 gi|394247198|gb|EJD92446.1| hypothetical protein HMPREF9989_07051 [Staphylococcus epidermidis
           NIHLM057]
 gi|394247337|gb|EJD92583.1| hypothetical protein HMPREF9988_09900 [Staphylococcus epidermidis
           NIHLM053]
          Length = 254

 Score = 41.6 bits (96), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  +  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118


>gi|390449540|ref|ZP_10235145.1| hypothetical protein A33O_08366 [Nitratireductor aquibiodomus RA22]
 gi|389664037|gb|EIM75548.1| hypothetical protein A33O_08366 [Nitratireductor aquibiodomus RA22]
          Length = 306

 Score = 41.6 bits (96), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 62/128 (48%), Gaps = 4/128 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G + G+ G+GGGF++ PL +   IPP ++ AT    +  SS    + ++  K 
Sbjct: 18  LAMGGAVGFLSGMFGVGGGFLITPLLIFYNIPPAIAVATGANQVIASSFSGALAHFKRKS 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             +       A  I+ A +G  V   L + LG+  +I+ +L ++ F+    +GG+ L + 
Sbjct: 78  LDLKLGTVLLAGGIVGASLGVSVFAYL-RRLGQLDLIV-SLLYVAFLGV--VGGLMLMES 133

Query: 462 IKRIEHKE 469
           ++ I    
Sbjct: 134 VRAIRRSR 141


>gi|357419022|ref|YP_004932014.1| hypothetical protein Tlie_0171 [Thermovirga lienii DSM 17291]
 gi|355396488|gb|AER65917.1| protein of unknown function DUF81 [Thermovirga lienii DSM 17291]
          Length = 274

 Score = 41.6 bits (96), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 76/176 (43%), Gaps = 22/176 (12%)

Query: 32  QSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTAC 91
           +S+   E T   K +H +  ++      K  S  + +    K   R+++G      G   
Sbjct: 113 RSDDIQEKTPHQKANHTKNTLLDCSGKFKDPSSGETIAYTPK---RLLLGMAFSTLGGII 169

Query: 92  GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMV-TGGAVSAVVYNLR-QRHPTLDIPV 149
            ++ G+GGG I VP++ L++G   K++ A S  MV    +VSAV+Y       P L IP 
Sbjct: 170 SALLGIGGGPIKVPIMRLVMGLPLKAATATSTFMVGITASVSAVIYLFEGMVKPDLAIPA 229

Query: 150 IDYDLALLFQPLLVLGISIGVAFNVIFADWM----ITVLLIVLFLVLSTRAFLKGV 201
                        VLGI IG     + A  M    I     +L ++++    LKG+
Sbjct: 230 -------------VLGIFIGARTGALVAGKMKSHHIRTAFSILLVLIAIEMLLKGL 272



 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 2/79 (2%)

Query: 319 GRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVS 377
           G+ K  S G E   +   +L+   A     G++  LLG+GGG I  P+  L +G+P + +
Sbjct: 138 GKFKDPSSG-ETIAYTPKRLLLGMAFSTLGGIISALLGIGGGPIKVPIMRLVMGLPLKAA 196

Query: 378 SATALFAITFSSSMSVVEY 396
           +AT+ F +  ++S+S V Y
Sbjct: 197 TATSTFMVGITASVSAVIY 215


>gi|347529542|ref|YP_004836290.1| hypothetical protein SLG_31580 [Sphingobium sp. SYK-6]
 gi|345138224|dbj|BAK67833.1| conserved hypothetical membrane protein [Sphingobium sp. SYK-6]
          Length = 304

 Score = 41.6 bits (96), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 55/106 (51%), Gaps = 7/106 (6%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF+  PL +  GIPP V++A+A   +  +S  SV+ +   K   +P       
Sbjct: 29  GMFGVGGGFLTTPLLIFYGIPPTVAAASAATQVMGASVSSVMSHLGRKTVDLPMGGVMVV 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 458
             II A +G  V+ +++  LG+   +I  L  ++      LG +GL
Sbjct: 89  GGIIGAAIGT-VIFRILDRLGQIDTVIGLLYVVM------LGSIGL 127


>gi|374299305|ref|YP_005050944.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
 gi|332552241|gb|EGJ49285.1| protein of unknown function DUF81 [Desulfovibrio africanus str.
           Walvis Bay]
          Length = 423

 Score = 41.6 bits (96), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 42/84 (50%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG++ G +G GGGFI+ P  +  GI   ++  T LF I   + M    +  L    V
Sbjct: 81  GLIAGLITGCIGAGGGFIITPALMAAGIKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSV 140

Query: 405 PYALYFFALSIIAAFVGQHVLKKL 428
             AL F A + I  F+G  + K L
Sbjct: 141 KLALGFLAGASIGTFIGGAINKGL 164


>gi|430756311|ref|YP_007208850.1| hypothetical protein A7A1_0819 [Bacillus subtilis subsp. subtilis
           str. BSP1]
 gi|430020831|gb|AGA21437.1| Hypothetical protein YrkJ [Bacillus subtilis subsp. subtilis str.
           BSP1]
          Length = 262

 Score = 41.6 bits (96), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 9/100 (9%)

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 389
            KW AS L F        G V G+LG GG FIL P+ L  L IP +++ A++L AITF S
Sbjct: 145 NKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 197

Query: 390 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           S+ + V   +  +     AL     S+IA+ +G  V +K+
Sbjct: 198 SIGATVGKVITDQVLFVPALVLIMASLIASPIGASVGQKV 237


>gi|392955880|ref|ZP_10321410.1| hypothetical protein A374_04039 [Bacillus macauensis ZFHKF-1]
 gi|391878122|gb|EIT86712.1| hypothetical protein A374_04039 [Bacillus macauensis ZFHKF-1]
          Length = 254

 Score = 41.6 bits (96), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 65/124 (52%), Gaps = 7/124 (5%)

Query: 307 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 366
           G+ A  AV L    ++ A +  + T  R   + F    G+ AG    +LG GG FIL P+
Sbjct: 111 GILALIAVMLMFTPKREALESGKVTFHRGGAIFFSFVIGLVAG----VLGAGGAFILVPV 166

Query: 367 FLE-LGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
            L  L IP + + AT+L AITF SS+ S +   +  + P   A+    +S+IA+ +G  V
Sbjct: 167 MLAVLKIPTKTTIATSL-AITFLSSIGSTIGKLVTHQVPYIPAIVLMIVSLIASPIGAKV 225

Query: 425 LKKL 428
            ++L
Sbjct: 226 GQRL 229


>gi|260430369|ref|ZP_05784342.1| membrane protein [Citreicella sp. SE45]
 gi|260418398|gb|EEX11655.1| membrane protein [Citreicella sp. SE45]
          Length = 301

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 32/61 (52%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            L      G   G++ GL G+GGGFIL PL   +GIPP V+ +T    I  SS    + +
Sbjct: 13  NLFLIVGLGGVVGIMSGLFGVGGGFILTPLLFFIGIPPAVAVSTQAVQIVASSCSGALAH 72

Query: 397 Y 397
           +
Sbjct: 73  W 73


>gi|220903637|ref|YP_002478949.1| hypothetical protein Ddes_0357 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
 gi|219867936|gb|ACL48271.1| protein of unknown function DUF81 [Desulfovibrio desulfuricans
           subsp. desulfuricans str. ATCC 27774]
          Length = 420

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 43/86 (50%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           A G+ AG++ G +G GGGFI+ P  + +G+   ++  T LF I   + M    +  L   
Sbjct: 82  AVGLAAGLITGCIGAGGGFIITPALMAVGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNV 141

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKL 428
               A+ F   S++  F+G  + K L
Sbjct: 142 SGKLAVAFLCGSVVGTFIGGAINKGL 167


>gi|406888182|gb|EKD34745.1| hypothetical protein ACD_75C02211G0002 [uncultured bacterium]
          Length = 381

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 60/120 (50%), Gaps = 2/120 (1%)

Query: 336 SQLVFYCAC--GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
           + L+F+ +   G+ AG++ G +G GGGFI+ P  +  G+   ++  T LF I   + M  
Sbjct: 14  TNLIFFASVFVGVCAGLITGCIGAGGGFIIAPALMSAGVKGILAVGTDLFHIFAKAIMGS 73

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
           V +  +    +  A++F   S+I A +G  + + L  +    S +  T+ ++  +  LS 
Sbjct: 74  VLHRKMGNISIGLAVHFLIGSLIGATLGGMLNRYLYDLNPVLSDLFITVVYVFILGFLSF 133


>gi|225017187|ref|ZP_03706379.1| hypothetical protein CLOSTMETH_01113 [Clostridium methylpentosum
           DSM 5476]
 gi|224950106|gb|EEG31315.1| hypothetical protein CLOSTMETH_01113 [Clostridium methylpentosum
           DSM 5476]
          Length = 117

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 43/92 (46%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           Q   +   G  AG++ GL G GGG  + PL    GI  + S AT++  I   S +S   Y
Sbjct: 3   QKTHFIVTGAVAGLLNGLFGAGGGVAVVPLLEHAGIEARKSHATSIAIIAALSVVSAAFY 62

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
               +F    AL +    ++ A VG  +L K+
Sbjct: 63  LFNGKFDFWQALPYLPFGLVGAVVGAKLLPKI 94


>gi|262277629|ref|ZP_06055422.1| membrane protein [alpha proteobacterium HIMB114]
 gi|262224732|gb|EEY75191.1| membrane protein [alpha proteobacterium HIMB114]
          Length = 351

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 58/104 (55%), Gaps = 4/104 (3%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
           G+  G++  LLG+GGGFIL P+ +  +G+P ++   T+LFA+ F   +  + + +     
Sbjct: 182 GVLIGLISSLLGVGGGFILVPILIYIIGMPAKLVPGTSLFAMIFVMIIVTLLHAIANYTI 241

Query: 404 VPYALYFFAL-SIIAAFVGQHVLKKLI--KILGRASIIIFTLSF 444
             Y ++  AL S++ A +G  +  KL   ++ G  SI+I T  F
Sbjct: 242 DIYLVFILALGSVVGAQLGSIISSKLAGEELRGLLSILILTFGF 285


>gi|52081145|ref|YP_079936.1| hypothetical protein BL02055 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|319644890|ref|ZP_07999123.1| YrkJ protein [Bacillus sp. BT1B_CT2]
 gi|404490024|ref|YP_006714130.1| hypothetical protein BLi02790 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|423683115|ref|ZP_17657954.1| hypothetical protein MUY_02968 [Bacillus licheniformis WX-02]
 gi|52004356|gb|AAU24298.1| conserved membrane protein YrkJ [Bacillus licheniformis DSM 13 =
           ATCC 14580]
 gi|52349027|gb|AAU41661.1| DUF81 transmembrane protein YrkJ [Bacillus licheniformis DSM 13 =
           ATCC 14580]
 gi|317392699|gb|EFV73493.1| YrkJ protein [Bacillus sp. BT1B_CT2]
 gi|383439889|gb|EID47664.1| hypothetical protein MUY_02968 [Bacillus licheniformis WX-02]
          Length = 260

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 71/134 (52%), Gaps = 9/134 (6%)

Query: 322 KIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSAT 380
           K  ++G++    R  + +         G+  G++G GG F+L PL L  + IP +++ AT
Sbjct: 127 KTGTEGNQSDHIRFPK-ILTSIMTFVVGLASGIVGAGGAFLLVPLMLVIIKIPARITIAT 185

Query: 381 ALFAITFSSSMSVVEYYL-LKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 439
           +L A+TF SS+      L + + P+  AL   A S+IA+ +G  + KK+      A ++ 
Sbjct: 186 SL-AVTFISSIGTTATKLVIGQVPIMPALVVMAASLIASPLGVFIGKKV-----NAKVLQ 239

Query: 440 FTLSFIIFVSALSL 453
             L F+IF++ + +
Sbjct: 240 GILIFVIFLTTVKI 253


>gi|341581528|ref|YP_004762020.1| putative permease [Thermococcus sp. 4557]
 gi|340809186|gb|AEK72343.1| putative permease [Thermococcus sp. 4557]
          Length = 254

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 42/81 (51%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           Y A G+  G++  + GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y   +
Sbjct: 7   YFAVGVFIGILAAMFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSAIAYSRQR 66

Query: 401 RFPVPYALYFFALSIIAAFVG 421
           R      L   + ++I A++G
Sbjct: 67  RIHYKAGLLLASTAVIGAYIG 87


>gi|262276252|ref|ZP_06054061.1| hypothetical protein VHA_003237 [Grimontia hollisae CIP 101886]
 gi|262220060|gb|EEY71376.1| hypothetical protein VHA_003237 [Grimontia hollisae CIP 101886]
          Length = 257

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 41/79 (51%), Gaps = 1/79 (1%)

Query: 348 AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 407
           AG +  + G GGG +  P  L +G+PP ++  T   + TF+SS + + YY  K F   + 
Sbjct: 20  AGFIDAVAG-GGGMLTVPALLSIGLPPHIALGTNKLSATFASSTAALTYYRKKLFSPRFW 78

Query: 408 LYFFALSIIAAFVGQHVLK 426
            + F  ++I A  G  V+ 
Sbjct: 79  SHAFLATLIGAVFGTLVVD 97


>gi|254510226|ref|ZP_05122293.1| membrane protein [Rhodobacteraceae bacterium KLH11]
 gi|221533937|gb|EEE36925.1| membrane protein [Rhodobacteraceae bacterium KLH11]
          Length = 321

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 54/103 (52%), Gaps = 2/103 (1%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  +   +    
Sbjct: 40  GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDLKMGT 99

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
                 +  A +G  +L   +K LG+  +++  L +++F+  +
Sbjct: 100 VLLVGGLAGAALGV-ILFNYLKSLGQVDLLV-KLCYVVFLGVI 140


>gi|410463419|ref|ZP_11316934.1| putative permease [Desulfovibrio magneticus str. Maddingley MBC34]
 gi|409983446|gb|EKO39820.1| putative permease [Desulfovibrio magneticus str. Maddingley MBC34]
          Length = 313

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 27/36 (75%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 380
           G   G++ G+ G+GGGF++ PL + +GIPP V++AT
Sbjct: 25  GGAVGLLSGIFGVGGGFLMTPLLIMMGIPPTVAAAT 60


>gi|381397639|ref|ZP_09923049.1| protein of unknown function DUF81 [Microbacterium laevaniformans
           OR221]
 gi|380775207|gb|EIC08501.1| protein of unknown function DUF81 [Microbacterium laevaniformans
           OR221]
          Length = 269

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 56/108 (51%), Gaps = 4/108 (3%)

Query: 325 SKGDEGTKWRASQLVFYCAC---GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSAT 380
           S     T  R   + F  +C   G+ AG++ GL G+GGG ++ P L L L    ++++ T
Sbjct: 2   SPAASATPARRRDIRFVLSCIGVGLAAGLLSGLFGVGGGTVIVPMLVLLLRFDQRLAAGT 61

Query: 381 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           +L AI  ++S+ VV Y +        AL   A +++ A +G  +L ++
Sbjct: 62  SLAAIVPTASVGVVSYAIHGSVAWIPALILAAGAVVGAQIGSWLLARI 109


>gi|321312144|ref|YP_004204431.1| YrkJ [Bacillus subtilis BSn5]
 gi|320018418|gb|ADV93404.1| YrkJ [Bacillus subtilis BSn5]
          Length = 258

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/75 (42%), Positives = 43/75 (57%), Gaps = 10/75 (13%)

Query: 320 RRKIASKGDE--GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQV 376
           R +   +G+E    KW AS L F        G V G+LG GG FIL P+ L  L IP ++
Sbjct: 128 RGQAHREGEEVIFNKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRI 181

Query: 377 SSATALFAITFSSSM 391
           + A++L AITF SS+
Sbjct: 182 TVASSL-AITFLSSI 195


>gi|27376265|ref|NP_767794.1| hypothetical protein bll1154 [Bradyrhizobium japonicum USDA 110]
 gi|27349405|dbj|BAC46419.1| bll1154 [Bradyrhizobium japonicum USDA 110]
          Length = 380

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+
Sbjct: 94  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGAISYW 148


>gi|433446826|ref|ZP_20410718.1| sulfite exporter, TauE/SafE family [Anoxybacillus flavithermus
           TNO-09.006]
 gi|432000333|gb|ELK21233.1| sulfite exporter, TauE/SafE family [Anoxybacillus flavithermus
           TNO-09.006]
          Length = 258

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 70/138 (50%), Gaps = 14/138 (10%)

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQV 376
           KG   I+       KW A+ L F    G++AG+VG      G FIL P+ L  L IP ++
Sbjct: 128 KGIDNISLDQVTFNKWLAAALAFIV--GVSAGIVGA----AGAFILVPIMLVVLKIPTRM 181

Query: 377 SSATALFAITFSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 435
           + AT+L AITF SS+ + V      +F    A      S+IA+ +G +V KK+       
Sbjct: 182 TIATSL-AITFISSIGATVGKITTGQFDFWTAFIMVVASLIASPIGANVGKKV-----NT 235

Query: 436 SIIIFTLSFIIFVSALSL 453
            I+ F LS +I  +A+ +
Sbjct: 236 KILQFILSALIVATAIKI 253


>gi|52081735|ref|YP_080526.1| hypothetical protein BL02149 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|319647651|ref|ZP_08001869.1| YunE protein [Bacillus sp. BT1B_CT2]
 gi|404490618|ref|YP_006714724.1| hypothetical protein BLi03429 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|423683731|ref|ZP_17658570.1| hypothetical protein MUY_03584 [Bacillus licheniformis WX-02]
 gi|52004946|gb|AAU24888.1| conserved protein YunE [Bacillus licheniformis DSM 13 = ATCC 14580]
 gi|52349623|gb|AAU42257.1| DUF81 transmembrane protein YunE [Bacillus licheniformis DSM 13 =
           ATCC 14580]
 gi|317389992|gb|EFV70801.1| YunE protein [Bacillus sp. BT1B_CT2]
 gi|383440505|gb|EID48280.1| hypothetical protein MUY_03584 [Bacillus licheniformis WX-02]
          Length = 273

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 43/89 (48%), Gaps = 8/89 (8%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLEL-GIP-------PQVSSATALFAITFSSSMSVVE 395
            G  AG +G L+GLGGG ++ P  L L G+P       PQ++  T+L  I F+   S + 
Sbjct: 9   LGFIAGTIGSLVGLGGGIVIVPSLLFLAGLPTLFDHVTPQMAVGTSLVVIIFTGLSSTIA 68

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           Y   K       L FF  S   + VG HV
Sbjct: 69  YMKYKTVDYKSGLIFFIGSGPGSIVGAHV 97


>gi|242058939|ref|XP_002458615.1| hypothetical protein SORBIDRAFT_03g036735 [Sorghum bicolor]
 gi|241930590|gb|EES03735.1| hypothetical protein SORBIDRAFT_03g036735 [Sorghum bicolor]
          Length = 110

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 52/111 (46%), Gaps = 19/111 (17%)

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           P+F  +G+PP+ +SA  +F I  SSSMS  +                  SI     G   
Sbjct: 16  PIFFHVGVPPRSASARTMFLILLSSSMSTAQ------------------SITLGAEGATD 57

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM-IKRIEHKEYMGFD 474
               I+  GRAS+I+  ++ ++ ++A  +   G+ ++  +    +EYMGF 
Sbjct: 58  ATSAIRKSGRASLIVLAVATVMALTAAVVLRSGVPRVWAQYTGGREYMGFK 108


>gi|375361158|ref|YP_005129197.1| hypothetical protein BACAU_0468 [Bacillus amyloliquefaciens subsp.
           plantarum CAU B946]
 gi|421732851|ref|ZP_16171967.1| UPF0721 transmembrane protein [Bacillus amyloliquefaciens subsp.
           plantarum M27]
 gi|371567152|emb|CCF04002.1| UPF0721 transmembrane protein [Bacillus amyloliquefaciens subsp.
           plantarum CAU B946]
 gi|407073212|gb|EKE46209.1| UPF0721 transmembrane protein [Bacillus amyloliquefaciens subsp.
           plantarum M27]
          Length = 258

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 38/99 (38%), Positives = 53/99 (53%), Gaps = 9/99 (9%)

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 389
            KW AS L F        G V G+LG GG FIL P+ L  L IP +++ A++L AITF S
Sbjct: 141 NKWLASSLAFLI------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 193

Query: 390 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           S+ + V   +  +     AL     S+IA+ +G  V +K
Sbjct: 194 SIGATVGKVITGQVLFTPALVLMIASLIASPIGASVGQK 232


>gi|154246428|ref|YP_001417386.1| hypothetical protein Xaut_2487 [Xanthobacter autotrophicus Py2]
 gi|154160513|gb|ABS67729.1| protein of unknown function DUF81 [Xanthobacter autotrophicus Py2]
          Length = 307

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 34/53 (64%), Gaps = 3/53 (5%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 392
           A G+  G + G+ G+GGGF++ PL +  G+PP V+ A+    + A +FS ++S
Sbjct: 19  AMGLAVGFISGMFGVGGGFLMTPLLIFTGVPPAVAVASVSPYMAASSFSGALS 71


>gi|420198663|ref|ZP_14704355.1| hypothetical protein HMPREF9980_00100 [Staphylococcus epidermidis
           NIHLM031]
 gi|394273839|gb|EJE18266.1| hypothetical protein HMPREF9980_00100 [Staphylococcus epidermidis
           NIHLM031]
          Length = 254

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 60/119 (50%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYLLKRFPVPYALYFFAL--------SIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++    +  +  A   F L        +IIA ++   +LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIAGKMFPLIFVASGGGAIIATYIPAIILKPLI-IIALSLVLIYTI 118


>gi|154685010|ref|YP_001420171.1| hypothetical protein RBAM_005480 [Bacillus amyloliquefaciens FZB42]
 gi|154350861|gb|ABS72940.1| YrkJ [Bacillus amyloliquefaciens FZB42]
          Length = 258

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 9/100 (9%)

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 389
            KW AS L F        G V G+LG GG FIL P+ L  L IP +++ A++L AITF S
Sbjct: 141 NKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 193

Query: 390 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           S+ + V   +  +     AL     S+IA+ +G  V +K+
Sbjct: 194 SIGATVGKVITGQVLFAPALVLMIASLIASPIGASVGQKV 233


>gi|376294959|ref|YP_005166189.1| hypothetical protein DND132_0167 [Desulfovibrio desulfuricans
           ND132]
 gi|323457520|gb|EGB13385.1| protein of unknown function DUF81 [Desulfovibrio desulfuricans
           ND132]
          Length = 425

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 11/130 (8%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           A G+ AG++ G +G GGGFI+ P  +  GI   ++  T LF I   + M  V +  L   
Sbjct: 86  AIGVGAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKLGNV 145

Query: 403 PVPYALYFFALSIIAAFVG---QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 459
            V  A+ F   +I+ A  G     VL ++  IL  A        FI  V +L LG +G  
Sbjct: 146 SVSLAVVFLIGAILGATAGGVINRVLYEINPILSDA--------FITTVYSLMLGFLGFY 197

Query: 460 KMIKRIEHKE 469
            +   +  ++
Sbjct: 198 ALTDFLRSRK 207


>gi|206598110|gb|ACI15920.1| hypothetical protein [Bodo saltans]
          Length = 507

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 87/410 (21%), Positives = 168/410 (40%), Gaps = 36/410 (8%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDI----PVIDYDLALL 157
           I VPM   ++     ++  +S+  + G +   ++Y + +  PT       P+I+Y    +
Sbjct: 96  ILVPMFAALIEMPLIAAVGLSQAAICGQSAFNMIYQVTKTMPTQTPNEVRPLINYQYLAI 155

Query: 158 FQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ 217
             PL ++G  +G     +  DW+   LL +L   +  R   K    + ++  T    ++ 
Sbjct: 156 LLPLSLIGTLLGSLGGRVVPDWLRLALLFLLLTSVLHRVIEKAKRQYMQDASTS-SGSRI 214

Query: 218 LELIVLGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYW------------- 264
            E++ L +         D +   N+   E     ++   S + NI +             
Sbjct: 215 AEVVKLTSTRVVVHASPD-DANPNEDDGEGGDGIRQGVASELANINFVNESASLVLADSF 273

Query: 265 ----------KEFGLLVAVWAIILALQIAKNYTTTC-SVLYWVLNLLQIPVAGGV----S 309
                     +E  LL   + ++L   I ++  T C SV + +L ++ I V  G+     
Sbjct: 274 DGPLTIRSSRREMALLAIGFVVLLVCNIVRSSMTVCGSVDHALLFIIPISVLVGLWWLGK 333

Query: 310 AYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE 369
              A+ L   +    S      +W    + ++    I AG    LLG GGG +L  +  E
Sbjct: 334 EMAAMTLTNVQSGFLSPDHMTFQWNDRTMTWFPFAAIIAGAGTALLGNGGGLVLSFVLFE 393

Query: 370 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 429
             + P+ +SAT+ FA    +S S +      +    YA  +F   +++  +GQ      I
Sbjct: 394 AELTPEEASATSGFANLLIASESALLMLFQGQLVPDYATMYFVCGVVSTILGQSGFMAYI 453

Query: 430 KILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE--YMGFDNIC 477
           +   +  +I+ +L+ II  S L L   GL ++I++++  +     F +IC
Sbjct: 454 RWSKKRFLIVTSLACIIGGSLLMLLSYGLFEVIEQVKRHDGSLWAFGHIC 503


>gi|378550344|ref|ZP_09825560.1| hypothetical protein CCH26_09660 [Citricoccus sp. CH26A]
          Length = 298

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 7/77 (9%)

Query: 313 AVALYKGRRKI-ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 371
           A A+ +GRRK  +S GDE  K    ++V     G+  G+V GL+G GGGF++ P  + LG
Sbjct: 113 AAAMIRGRRKAPSSGGDEAKKLPVVKVVLE---GLVVGLVTGLVGAGGGFLVVPALVLLG 169

Query: 372 ---IPPQVSSATALFAI 385
              +P  V ++  + A+
Sbjct: 170 GLSMPAAVGTSLVVIAM 186


>gi|451348137|ref|YP_007446768.1| hypothetical protein KSO_017010 [Bacillus amyloliquefaciens IT-45]
 gi|449851895|gb|AGF28887.1| hypothetical protein KSO_017010 [Bacillus amyloliquefaciens IT-45]
          Length = 258

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 38/99 (38%), Positives = 53/99 (53%), Gaps = 9/99 (9%)

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 389
            KW AS L F        G V G+LG GG FIL P+ L  L IP +++ A++L AITF S
Sbjct: 141 NKWLASSLAFLI------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 193

Query: 390 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           S+ + V   +  +     AL     S+IA+ +G  V +K
Sbjct: 194 SIGATVGKVITGQVLFTPALVLMIASLIASPIGASVGQK 232


>gi|393762657|ref|ZP_10351283.1| hypothetical protein AGRI_06745 [Alishewanella agri BL06]
 gi|392606279|gb|EIW89164.1| hypothetical protein AGRI_06745 [Alishewanella agri BL06]
          Length = 262

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 39/87 (44%), Gaps = 1/87 (1%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           C+    AG +  + G GGG +  P  L  G+PP V   T   A TF S  + V YY  K 
Sbjct: 16  CSVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSLTASVTYYRKKL 74

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKL 428
           F   Y       + I A +G  V+ +L
Sbjct: 75  FNPVYWRNSLLFTAIGALLGTLVVDQL 101


>gi|359795015|ref|ZP_09297681.1| hypothetical protein MAXJ12_35646 [Mesorhizobium alhagi CCNWXJ12-2]
 gi|359248639|gb|EHK52372.1| hypothetical protein MAXJ12_35646 [Mesorhizobium alhagi CCNWXJ12-2]
          Length = 307

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 67/130 (51%), Gaps = 8/130 (6%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G + G+ G+GGGF++ PL +   IPP ++ AT    +  SS    + ++  KR
Sbjct: 18  LAMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPAIAVATGANQVIASSFSGALAHF--KR 75

Query: 402 FPVPYAL--YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 459
             + + L     A  I+ + +G +    L+++LG+  + I +L +++F+    +GG+ L 
Sbjct: 76  GTLDFKLGTVLLAGGIVGSTIGIYAF-TLLRVLGQLDLFI-SLLYVVFLGL--VGGLMLV 131

Query: 460 KMIKRIEHKE 469
           + I  I   +
Sbjct: 132 ESINAIRRAK 141


>gi|407850242|gb|EKG04697.1| hypothetical protein TCSYLVIO_004241 [Trypanosoma cruzi]
          Length = 368

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 53/110 (48%), Gaps = 2/110 (1%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT--LDIPVIDYDLALLFQPLL 162
           PM  L++G     +  +S+  + G ++  V+  +R+R P      P+I+Y    L  PL 
Sbjct: 108 PMFCLLMGLPMDFAVGLSQSTICGQSILNVLIAIRKRFPCAGCSRPLINYQYLTLLVPLG 167

Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
           V+G  IG   N +  D    VLL +L   +  R+  K +  ++K+   +R
Sbjct: 168 VIGTLIGGVLNRLCPDLFRLVLLFLLLTTVLYRSVRKMIAQYRKDQSERR 217


>gi|332186970|ref|ZP_08388711.1| hypothetical protein SUS17_2059 [Sphingomonas sp. S17]
 gi|332012980|gb|EGI55044.1| hypothetical protein SUS17_2059 [Sphingomonas sp. S17]
          Length = 304

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 7/117 (5%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G++ G+ G+GGGF+  PL +  GIPP V++A+A   +T +S   V  ++    
Sbjct: 18  IALGAGVGLLSGMFGVGGGFLTTPLLIVYGIPPTVAAASAASQVTGASVSGVFAHWQRGG 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 458
             V       A  I+ +F+G  + + L     +AS  I T+  II+V  L LG +GL
Sbjct: 78  VDVKMGGVLVAGGIVGSFIGAWIFRLL-----QASGQIDTVIAIIYV--LMLGWIGL 127


>gi|384264092|ref|YP_005419799.1| hypothetical protein BANAU_0461 [Bacillus amyloliquefaciens subsp.
           plantarum YAU B9601-Y2]
 gi|380497445|emb|CCG48483.1| UPF0721 transmembrane protein [Bacillus amyloliquefaciens subsp.
           plantarum YAU B9601-Y2]
          Length = 258

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 64/128 (50%), Gaps = 13/128 (10%)

Query: 307 GVSAYEAVALY----KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 362
           G+ A  AV L     KG+     +     KW AS L F        G V G+LG GG FI
Sbjct: 113 GILAILAVVLMFIPKKGQEHSEDEEVIFNKWLASSLAFII------GGVSGILGAGGAFI 166

Query: 363 LGPLFLE-LGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFV 420
           L P+ L  L IP +++ A++L AITF SS+ + V   +  +     AL     S+IA+ +
Sbjct: 167 LVPIMLSILNIPVRITVASSL-AITFLSSIGATVGKVITGQVLFAPALVLMIASLIASPI 225

Query: 421 GQHVLKKL 428
           G  V +K+
Sbjct: 226 GASVGQKV 233


>gi|423480829|ref|ZP_17457519.1| hypothetical protein IEQ_00607 [Bacillus cereus BAG6X1-2]
 gi|401146715|gb|EJQ54226.1| hypothetical protein IEQ_00607 [Bacillus cereus BAG6X1-2]
          Length = 255

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 48/86 (55%), Gaps = 12/86 (13%)

Query: 312 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
            A+ ++  +R    KG EG    KW AS L F      + G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGSEGIMYNKWLASLLAF------SVGSVSGIIGAGGAFLLVPIML 169

Query: 369 -ELGIPPQVSSATALFAITFSSSMSV 393
             L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194


>gi|387897005|ref|YP_006327301.1| hypothetical protein MUS_0502 [Bacillus amyloliquefaciens Y2]
 gi|387171115|gb|AFJ60576.1| conserved hypothetical protein YrkJ [Bacillus amyloliquefaciens Y2]
          Length = 276

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 9/100 (9%)

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 389
            KW AS L F        G V G+LG GG FIL P+ L  L IP +++ A++L AITF S
Sbjct: 159 NKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 211

Query: 390 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           S+ + V   +  +     AL     S+IA+ +G  V +K+
Sbjct: 212 SIGATVGKVITGQVLFAPALVLMIASLIASPIGASVGQKV 251


>gi|288959672|ref|YP_003450013.1| hypothetical protein AZL_028310 [Azospirillum sp. B510]
 gi|288911980|dbj|BAI73469.1| hypothetical protein AZL_028310 [Azospirillum sp. B510]
          Length = 305

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 62/128 (48%), Gaps = 4/128 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL + +G+PP ++  T    +  +S   V+ ++    
Sbjct: 18  LGMGWLVGFLSGMFGVGGGFLMTPLLIFIGVPPAIAVGTQANQLVAASVSGVLAHWRRGN 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V   +      ++   VG  V   L + LG+  I I TLS++ F+   ++GG+ L + 
Sbjct: 78  VDVKLGVVMLGGGVLGTAVGVWVFGILQR-LGQIDIAI-TLSYVFFLG--TIGGMMLVES 133

Query: 462 IKRIEHKE 469
            + I  + 
Sbjct: 134 SRAILRRR 141


>gi|206890428|ref|YP_002248551.1| hypothetical protein THEYE_A0709 [Thermodesulfovibrio yellowstonii
           DSM 11347]
 gi|206742366|gb|ACI21423.1| membrane protein, putative [Thermodesulfovibrio yellowstonii DSM
           11347]
          Length = 422

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 7/89 (7%)

Query: 328 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAIT 386
           DE    R    +F   CG   G++  ++G+GGGF+  PLF+  LG+    +  T +  I 
Sbjct: 242 DEDFGGRRISWLFVAMCGFLVGLLAAVMGVGGGFVTFPLFVYVLGVSTPTTVGTDILQII 301

Query: 387 FSSSMSVVEYYLLKRFPVPYALYFFALSI 415
           F++  S +  Y +      Y   F+ L++
Sbjct: 302 FTAGYSSISQYAI------YGYVFYTLAM 324



 Score = 38.9 bits (89), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 43/86 (50%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           A G+ AG++ G +G GGGFI+ P  +  G+   ++  T  F I   + M  V +  L   
Sbjct: 79  AVGLAAGLITGCIGAGGGFIIAPALMAAGVKGILAVGTDQFHIFAKAIMGTVIHKKLGNV 138

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKL 428
            V  A+ F   S++ A  G ++ + +
Sbjct: 139 CVGLAVAFVIGSVLGATTGGYIQRTI 164


>gi|297583461|ref|YP_003699241.1| hypothetical protein [Bacillus selenitireducens MLS10]
 gi|297141918|gb|ADH98675.1| protein of unknown function DUF81 [Bacillus selenitireducens MLS10]
          Length = 272

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 43/90 (47%), Gaps = 2/90 (2%)

Query: 321 RKIASKGDEGTKWR-ASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSS 378
           R +     +G +W    Q V     G   G + GL G+GGG ++ P + L    PP ++ 
Sbjct: 136 RNLYWTDHQGVEWTYGYQPVIGIMLGFIVGSISGLFGIGGGSLMVPAMILLFAFPPHLAV 195

Query: 379 ATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           AT++F I  S+ +  V + L+      YAL
Sbjct: 196 ATSMFMIMLSAIVGSVSHILVGNVHWLYAL 225


>gi|84515899|ref|ZP_01003260.1| membrane protein [Loktanella vestfoldensis SKA53]
 gi|84510341|gb|EAQ06797.1| membrane protein [Loktanella vestfoldensis SKA53]
          Length = 305

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 53/101 (52%), Gaps = 2/101 (1%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF++ PL   +GIPP V+ AT+   I  SS   V+ ++  K            
Sbjct: 29  GMFGVGGGFLITPLLFFIGIPPAVAVATSTNQIVASSISGVLAHFKRKTVDPVMGGVLLV 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
             +I + +G  +   L + LG+  +++  L +++F+ A+ +
Sbjct: 89  GGVIGSAIGIVIFNYL-RALGQVDLLV-QLCYVVFLGAIGM 127


>gi|346994770|ref|ZP_08862842.1| membrane protein [Ruegeria sp. TW15]
          Length = 306

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 52/100 (52%), Gaps = 2/100 (2%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  +   +    
Sbjct: 25  GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDLRMGT 84

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
                 +  A +G  V   L K LG+  +++  L ++IF+
Sbjct: 85  VLLVGGLAGAALGVVVFNYL-KSLGQVDLLV-KLCYVIFL 122


>gi|120402827|ref|YP_952656.1| hypothetical protein Mvan_1828 [Mycobacterium vanbaalenii PYR-1]
 gi|119955645|gb|ABM12650.1| protein of unknown function DUF81 [Mycobacterium vanbaalenii PYR-1]
          Length = 291

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 5/86 (5%)

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 370
            A+A+ +GR+ I + G+ G +    ++V   A G+  G+V GL+G GGGF++ P L L  
Sbjct: 109 TAIAMLRGRKTIET-GETGHRLPVPKIV---AEGLIVGLVTGLVGAGGGFLVVPALALLG 164

Query: 371 GIPPQVSSATALFAITFSSSMSVVEY 396
           G+P  ++  T+L  I   S   +  Y
Sbjct: 165 GLPMPIAVGTSLIVIAMKSFAGLAGY 190


>gi|408526472|emb|CCK24646.1| hypothetical protein BN159_0267 [Streptomyces davawensis JCM 4913]
          Length = 306

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 6/85 (7%)

Query: 313 AVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELG 371
           AVA+ +GRRK  +K           ++     GI  G+V GL+G GGGF++ P L L  G
Sbjct: 110 AVAMIRGRRKATTKTHH-----ELPVLHVLLDGIVVGLVTGLVGAGGGFLVVPALALLGG 164

Query: 372 IPPQVSSATALFAITFSSSMSVVEY 396
           +P  V+  T+L  I+  S   +  Y
Sbjct: 165 LPMTVAVGTSLLVISMKSFAGLAGY 189


>gi|126740488|ref|ZP_01756175.1| membrane protein [Roseobacter sp. SK209-2-6]
 gi|126718289|gb|EBA15004.1| membrane protein [Roseobacter sp. SK209-2-6]
          Length = 306

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 51/96 (53%), Gaps = 2/96 (2%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  +   +   +    
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPTVAVATEANQIVASSFSGVLAHFRRRTVDIKMGVVLQI 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
             +I A +G  V   L K LG+  +++  L +++F+
Sbjct: 89  GGVIGAALGVVVFNYL-KSLGQVDLLV-KLCYVVFL 122


>gi|157962478|ref|YP_001502512.1| hypothetical protein Spea_2657 [Shewanella pealeana ATCC 700345]
 gi|157847478|gb|ABV87977.1| protein of unknown function DUF81 [Shewanella pealeana ATCC 700345]
          Length = 257

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 1/84 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G  AG +  + G GGG +  P  L +G+PP ++  T   A +F SSM+   YY  + F  
Sbjct: 17  GFIAGFIDAVSG-GGGLLSIPALLTMGVPPHLALGTNKLAASFGSSMAAYTYYKQRLFSP 75

Query: 405 PYALYFFALSIIAAFVGQHVLKKL 428
               + F  + I A +G  ++  +
Sbjct: 76  SLWYHTFIATFIGAVIGTFIVYHI 99


>gi|410030356|ref|ZP_11280186.1| hypothetical protein MaAK2_14171 [Marinilabilia sp. AK2]
          Length = 291

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 330 GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSS 389
           G  W A   +++   G  A  + G LG+  G     + L +GIPP V+SA   FA  F+S
Sbjct: 40  GENWNAEMFLWFLLVGFIAQTIDGALGMAYGVSSNSILLGIGIPPAVASAWVHFAQVFTS 99

Query: 390 --------SMSVVEYYLLKRFPVP 405
                    +  V++ L KR  +P
Sbjct: 100 LASGISHLKLGNVQWSLAKRLMIP 123


>gi|225352706|ref|ZP_03743729.1| hypothetical protein BIFPSEUDO_04335 [Bifidobacterium
           pseudocatenulatum DSM 20438 = JCM 1200]
 gi|225156671|gb|EEG70065.1| hypothetical protein BIFPSEUDO_04335 [Bifidobacterium
           pseudocatenulatum DSM 20438 = JCM 1200]
          Length = 308

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 47/90 (52%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+  G++ G+ G+GGG ++ P  + LG+  + ++AT++ AI  +S   V+ Y        
Sbjct: 61  GVAVGLLSGMFGIGGGTVIVPALVWLGLTQRNAAATSMLAIVPTSISGVISYATCGNVDW 120

Query: 405 PYALYFFALSIIAAFVGQHVLKKLIKILGR 434
             AL  F    +   +G  +L +L +++ R
Sbjct: 121 LAALLLFCGMFVGGQIGSWLLSRLPELVLR 150


>gi|429504024|ref|YP_007185208.1| hypothetical protein B938_02510 [Bacillus amyloliquefaciens subsp.
           plantarum AS43.3]
 gi|429485614|gb|AFZ89538.1| hypothetical protein B938_02510 [Bacillus amyloliquefaciens subsp.
           plantarum AS43.3]
          Length = 258

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 55/106 (51%), Gaps = 13/106 (12%)

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 389
            KW AS L F        G V G+LG GG FIL P+ L  L IP +++ A++L AITF S
Sbjct: 141 NKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 193

Query: 390 SM-SVVEYYLLKRFPVPYALYFFALSIIA----AFVGQHVLKKLIK 430
           S+ + V   +  +     AL     S+IA    A VGQ V  K ++
Sbjct: 194 SIGATVGKVITGQVLFAPALVLMIASLIASPIGATVGQKVNTKFLQ 239


>gi|255263607|ref|ZP_05342949.1| membrane protein, putative [Thalassiobium sp. R2A62]
 gi|255105942|gb|EET48616.1| membrane protein, putative [Thalassiobium sp. R2A62]
          Length = 304

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 2/96 (2%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  K   +        
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRKTVDLKMGTVLML 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
             ++ A +G  +   L K LG+  +++  L +++F+
Sbjct: 89  GGLVGAALGVFIFNYL-KSLGQVDLLV-KLCYVVFL 122



 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 5/71 (7%)

Query: 331 TKWRASQLVFYC----ACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAI 385
           TK+R S L        A G+  G++  ++G+GGGFI+ P +   LG+P +V   T+LF I
Sbjct: 161 TKFRVSGLYISVLPPLAVGVVVGILAAIMGVGGGFIMVPAMIYLLGMPTKVVVGTSLFQI 220

Query: 386 TFSSSMSVVEY 396
            F ++ + + +
Sbjct: 221 IFVTAFTTMLH 231


>gi|154489152|ref|ZP_02030001.1| hypothetical protein BIFADO_02467 [Bifidobacterium adolescentis
           L2-32]
 gi|154083289|gb|EDN82334.1| hypothetical protein BIFADO_02467 [Bifidobacterium adolescentis
           L2-32]
          Length = 292

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 70/143 (48%), Gaps = 5/143 (3%)

Query: 307 GVSAYEAVALYKGRRKIASK-GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 365
           G +A  A A+    +K+ S+ G+      A  ++     G+  G++ G  G+GGG ++ P
Sbjct: 5   GETADRADAVRTESQKMKSRAGETALDGSARSIIVLIVVGMLVGVLSGTFGIGGGTMIVP 64

Query: 366 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 425
             + LG+  + ++AT++ AI  +S   V+ Y          A   F    I   +G  +L
Sbjct: 65  ALVWLGLSQRNAAATSMLAIVPTSISGVISYATGGHVDWLAAALLFCGMFIGGQIGSWLL 124

Query: 426 KKLIKILGRASIIIFTLSFIIFV 448
            +L +++ R    IF ++F++FV
Sbjct: 125 SRLPELVLR---WIF-VAFLVFV 143


>gi|400753245|ref|YP_006561613.1| hypothetical protein PGA2_c03480 [Phaeobacter gallaeciensis 2.10]
 gi|398652398|gb|AFO86368.1| hypothetical protein PGA2_c03480 [Phaeobacter gallaeciensis 2.10]
          Length = 307

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 31/49 (63%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++
Sbjct: 25  GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 73


>gi|220925134|ref|YP_002500436.1| hypothetical protein Mnod_5288 [Methylobacterium nodulans ORS 2060]
 gi|219949741|gb|ACL60133.1| protein of unknown function DUF81 [Methylobacterium nodulans ORS
           2060]
          Length = 307

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 30/44 (68%)

Query: 351 VGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVV 394
           + G+ G+GGGF++ P+ + LGIPP ++ AT    I  SS+ SV+
Sbjct: 27  ISGVFGIGGGFLMTPVLIVLGIPPAIAVATQTAPIVASSTTSVL 70


>gi|399991606|ref|YP_006571846.1| hypothetical protein PGA1_c03930 [Phaeobacter gallaeciensis DSM
           17395 = CIP 105210]
 gi|398656161|gb|AFO90127.1| hypothetical protein PGA1_c03930 [Phaeobacter gallaeciensis DSM
           17395 = CIP 105210]
          Length = 307

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 31/49 (63%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++
Sbjct: 25  GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 73


>gi|418327069|ref|ZP_12938243.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
 gi|365223948|gb|EHM65221.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
          Length = 254

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 61/118 (51%), Gaps = 10/118 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  +  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWDLSIILMIVAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 441
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASAGGAIIATYIPANILKPLI-IIALSLVLIYT 117


>gi|257062790|ref|YP_003142462.1| permease [Slackia heliotrinireducens DSM 20476]
 gi|256790443|gb|ACV21113.1| predicted permease [Slackia heliotrinireducens DSM 20476]
          Length = 277

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 1/53 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVEY 396
           G+  G++ GL G+GGG I+ PLF L  G+PP  ++AT+LF +  +S   +V++
Sbjct: 11  GLLVGVLSGLFGIGGGTIMVPLFRLAFGLPPIGATATSLFTVIPTSLSGLVKH 63


>gi|226314422|ref|YP_002774318.1| hypothetical protein BBR47_48370 [Brevibacillus brevis NBRC 100599]
 gi|226097372|dbj|BAH45814.1| conserved hypothetical membrane protein [Brevibacillus brevis NBRC
           100599]
          Length = 274

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 8/102 (7%)

Query: 339 VFYCACGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSM 391
           + +   G+ AG++G + GLGGG    P  L          + PQV++AT+L  I  ++  
Sbjct: 7   ILFVIIGVVAGVIGSIAGLGGGIFFVPALLYFANWYTPGSMNPQVAAATSLIVIAVTALS 66

Query: 392 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 433
           S + Y   K+     AL FF  S   A VG + L  L+ + G
Sbjct: 67  SSLSYLKQKKVDKQSALLFFIGSAPGAIVGVY-LNTLLAVEG 107


>gi|240102617|ref|YP_002958926.1| putative permease [Thermococcus gammatolerans EJ3]
 gi|239910171|gb|ACS33062.1| Predicted permease [Thermococcus gammatolerans EJ3]
          Length = 243

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 7/82 (8%)

Query: 320 RRKIASKGDEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVS 377
           R+K     +EG   +    LV     GI +G+V GLLG+ GG +  PLF   +GIP + +
Sbjct: 116 RKKCRPHDEEGAVHYSRVPLV-----GILSGLVSGLLGISGGVLNVPLFHTFVGIPMRYA 170

Query: 378 SATALFAITFSSSMSVVEYYLL 399
             T+ FA+ F++    +E+Y L
Sbjct: 171 VGTSSFALFFTALAGAIEHYRL 192


>gi|428224218|ref|YP_007108315.1| hypothetical protein GEI7407_0765 [Geitlerinema sp. PCC 7407]
 gi|427984119|gb|AFY65263.1| protein of unknown function DUF81 [Geitlerinema sp. PCC 7407]
          Length = 275

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK--RF 402
           G  AG++ G LG+GGG IL PL + LG  P  + AT+  AI  +S    ++ + +   RF
Sbjct: 25  GFLAGILAGFLGIGGGTILVPLMVTLGFAPVQAVATSSLAILITSLSGSLQNWRMGYLRF 84

Query: 403 PVPYALYFFAL--SIIAAFVGQHV 424
           P   AL   AL  + I AF+  ++
Sbjct: 85  PQVLALGLPALLTAQIGAFLANYI 108


>gi|340503379|gb|EGR29975.1| hypothetical protein IMG5_145240 [Ichthyophthirius multifiliis]
          Length = 162

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 39/63 (61%), Gaps = 1/63 (1%)

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA-FVGQH 423
           P +LELGI P  ++A ++  + F+S  S  +Y+L   + +   +YFF LS+I++ FV  +
Sbjct: 52  PKWLELGISPTRTTACSICILFFTSFNSFFQYFLANVYNIEEIIYFFILSVISSQFVCNY 111

Query: 424 VLK 426
           +L 
Sbjct: 112 ILN 114


>gi|326386778|ref|ZP_08208399.1| hypothetical protein Y88_2671 [Novosphingobium nitrogenifigens DSM
           19370]
 gi|326208831|gb|EGD59627.1| hypothetical protein Y88_2671 [Novosphingobium nitrogenifigens DSM
           19370]
          Length = 345

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 61/136 (44%), Gaps = 14/136 (10%)

Query: 280 LQIAKNYTTTCSVLYWVL-----NLLQIPVAGGVSAYEAVALYKGRRKIASK--GDEGTK 332
           LQ      TT +VLY V+     +L+ +   G + A +     K RR+           +
Sbjct: 145 LQALGQIDTTINVLYVVMLGSIGSLMGVESFGAIRARQTGRAQKARRRRHHPLVASLPLR 204

Query: 333 WRASQLVFYCA------CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAI 385
           WR  +   Y +       GI  G++  L+G+GGGFIL P  L  LG+   V   T+LF I
Sbjct: 205 WRFYRSGLYISPLAPLLLGIATGILTMLMGIGGGFILVPAMLYILGMSVTVVVGTSLFQI 264

Query: 386 TFSSSMSVVEYYLLKR 401
            F +  + + + L  R
Sbjct: 265 LFVTMATTMMHALTTR 280


>gi|316932351|ref|YP_004107333.1| hypothetical protein [Rhodopseudomonas palustris DX-1]
 gi|315600065|gb|ADU42600.1| protein of unknown function DUF81 [Rhodopseudomonas palustris DX-1]
          Length = 320

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 69/134 (51%), Gaps = 6/134 (4%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            ++   A G   G V G+ G+GGGF++ PL + +GI P V+ A+    +  SS    + Y
Sbjct: 26  NVLLVLAMGAAVGFVSGMFGVGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALSY 85

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHV-LKKLIKILGRASIIIFTLSFIIFVSALSLGG 455
           +  +R  +   L F  L    A  G  V    L++ +G+  ++I  LS+++ ++A  +GG
Sbjct: 86  W--RRRAIDPLLAFILLCGGIAGTGLGVWFFVLMRSVGQLDLVI-ALSYVVLLTA--VGG 140

Query: 456 VGLAKMIKRIEHKE 469
           + +++ I+ I    
Sbjct: 141 LMVSEGIRAIRRTS 154


>gi|418323949|ref|ZP_12935206.1| sulfite exporter TauE/SafE [Staphylococcus pettenkoferi VCU012]
 gi|365228878|gb|EHM70051.1| sulfite exporter TauE/SafE [Staphylococcus pettenkoferi VCU012]
          Length = 274

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 53/109 (48%), Gaps = 2/109 (1%)

Query: 318 KGRRKIASKGDEGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQ 375
           K + + +   +EG  +  +   +F     +  G++ GL G+GGG ++ PL L     PP 
Sbjct: 136 KDKYRRSYTDNEGVTYHYNVPPLFAFVATLFIGILTGLFGIGGGALMTPLMLIVFRFPPH 195

Query: 376 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           V+  T++  I FSS MS + + +       Y+L     SII A +G  +
Sbjct: 196 VAVGTSMLMIFFSSLMSSLGHMVQGHVAWGYSLVLIISSIIGAQIGVRI 244


>gi|335423623|ref|ZP_08552644.1| hypothetical protein SSPSH_13047 [Salinisphaera shabanensis E1L3A]
 gi|334891448|gb|EGM29696.1| hypothetical protein SSPSH_13047 [Salinisphaera shabanensis E1L3A]
          Length = 287

 Score = 40.4 bits (93), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 16/136 (11%)

Query: 340 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY-- 397
            +   G  A +V G LG+G G     + + LGI P V+SA+  FA   +++ S   +Y  
Sbjct: 1   MFGIAGFVAQLVDGSLGMGYGLTSSTMLIALGISPAVASASIHFAEIATTAASGTAHYRF 60

Query: 398 ------LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
                 LLKR  VP AL         AFVG   L  +   L +  +  F L   ++V+A 
Sbjct: 61  GNVDVALLKRMVVPGAL--------GAFVGALFLGSISGDLIKPFVSAFLLLLGVYVTAS 112

Query: 452 SLGGVGLAKMIKRIEH 467
            L G+   +  K + H
Sbjct: 113 VLLGLRRTREKKLLSH 128


>gi|160902447|ref|YP_001568028.1| hypothetical protein Pmob_0983 [Petrotoga mobilis SJ95]
 gi|160360091|gb|ABX31705.1| protein of unknown function DUF81 [Petrotoga mobilis SJ95]
          Length = 246

 Score = 40.4 bits (93), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 16/97 (16%)

Query: 340 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMS------- 392
           F+   G  A +V G LG+  G I   L L  G+PP +SSA+  FA  F++S+S       
Sbjct: 6   FFIIVGFLAQVVDGALGMAYGTISNALLLSAGVPPAISSASVHFAEIFTTSISGFSHLKL 65

Query: 393 -VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
             V+  L K+  +P         I+   +G ++L  +
Sbjct: 66  GNVDKSLFKKLLIP--------GIVGGVLGAYILTNI 94


>gi|145514235|ref|XP_001443028.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410389|emb|CAK75631.1| unnamed protein product [Paramecium tetraurelia]
          Length = 473

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 26/41 (63%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 383
           A G  AG V G LG+G GF++ P  L  G+ P+ +SAT+ F
Sbjct: 339 AGGFMAGFVSGFLGMGAGFVMVPTLLYSGLIPRCASATSAF 379



 Score = 39.3 bits (90), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 24/114 (21%), Positives = 58/114 (50%), Gaps = 2/114 (1%)

Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQPLLV 163
           P+L +I+ +    +  ++ C++ GG +   V  + +++P  D  P+I+Y +A++F   + 
Sbjct: 72  PLLEMILNYSQSEATHIAYCLMFGGTLLNTVLLMFEKNPEDDRRPIINYRIAIIFNLAVP 131

Query: 164 LGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ 217
              ++G +    F   + T++L  LFL        +  ++ K+  +   ++AK+
Sbjct: 132 FATNLGSSL-ASFLPQLYTLILQELFLFGVAPILWQKAKSAKEAELKISDSAKE 184


>gi|56695290|ref|YP_165638.1| hypothetical protein SPO0375 [Ruegeria pomeroyi DSS-3]
 gi|56677027|gb|AAV93693.1| membrane protein [Ruegeria pomeroyi DSS-3]
          Length = 316

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 31/49 (63%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           G++ G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++
Sbjct: 35  GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 83


>gi|254440248|ref|ZP_05053742.1| conserved domain protein, putative [Octadecabacter antarcticus 307]
 gi|198255694|gb|EDY80008.1| conserved domain protein, putative [Octadecabacter antarcticus 307]
          Length = 305

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 50/96 (52%), Gaps = 2/96 (2%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ +   K   +        
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEAVQIVASSFSGVLAHLRRKTVDLRMGTVLLI 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
             +I A +G  VL   +K LG+  +++  L +++F+
Sbjct: 89  GGLIGAAIGV-VLFNYLKSLGQVDLLV-RLCYVVFL 122


>gi|223042876|ref|ZP_03612924.1| putative membrane protein [Staphylococcus capitis SK14]
 gi|417905704|ref|ZP_12549503.1| putative membrane protein [Staphylococcus capitis VCU116]
 gi|222443730|gb|EEE49827.1| putative membrane protein [Staphylococcus capitis SK14]
 gi|341598598|gb|EGS41098.1| putative membrane protein [Staphylococcus capitis VCU116]
          Length = 275

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 50/102 (49%), Gaps = 2/102 (1%)

Query: 329 EGTKWRASQLVFYC-ACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +  S   F+     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYHYSVPPFFAFVATLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           FSS MS + + +       Y++     SII A +G  V + L
Sbjct: 208 FSSVMSSIGHIVQGHVAWGYSVALIISSIIGAQIGVKVNRSL 249


>gi|77360916|ref|YP_340491.1| hypothetical protein PSHAa1992 [Pseudoalteromonas haloplanktis
           TAC125]
 gi|76875827|emb|CAI87048.1| conserved protein of unknown functions similar to YfcA
           [Pseudoalteromonas haloplanktis TAC125]
          Length = 258

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 4/133 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           CA  + AG +  + G GGG +  P  L  G+PP ++  T   A +F S  +   YY    
Sbjct: 15  CAVALAAGFIDAIAG-GGGMLTVPALLTAGLPPHLTLGTNKLAASFGSLTASYTYYKKDL 73

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKL-IKILGR--ASIIIFTLSFIIFVSALSLGGVGL 458
           F   + L     + I A +G  ++  L I  L +    III    + +F S  +  G  L
Sbjct: 74  FNPKFWLASIFATAIGALIGTLIVDHLSIDFLNKLLPIIIIVVACYSLFGSLSTTEGSEL 133

Query: 459 AKMIKRIEHKEYM 471
            K+ + ++ K+++
Sbjct: 134 PKLTQTMKIKQWL 146


>gi|423643404|ref|ZP_17619022.1| hypothetical protein IK9_03349 [Bacillus cereus VD166]
 gi|401275408|gb|EJR81375.1| hypothetical protein IK9_03349 [Bacillus cereus VD166]
          Length = 266

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            G+ AG +G L+GLGGG I+ PL + L  + PQ++  T++  + F+   S + Y   KR
Sbjct: 9   IGLIAGTIGSLVGLGGGIIIVPLLIGLHSLSPQLAVGTSMVTVVFTGLSSTLTYMKHKR 67


>gi|77462758|ref|YP_352262.1| hypothetical protein RSP_2206 [Rhodobacter sphaeroides 2.4.1]
 gi|77387176|gb|ABA78361.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1]
          Length = 306

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 2/100 (2%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G + G+ G+GGGF++ PL   +GIPP V+ AT    +  SS   V+     K       L
Sbjct: 25  GFMSGMFGVGGGFLITPLLFFIGIPPAVAVATGANQVVASSVSGVLAQLKRKTVDFRMGL 84

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
              A  +  +  G  V  +L   LG+  + +  LS+++F+
Sbjct: 85  VLLAGGVAGSAFGVAVFARL-NTLGQVDLFV-QLSYVLFL 122


>gi|46201920|ref|ZP_00054109.2| COG0730: Predicted permeases [Magnetospirillum magnetotacticum
           MS-1]
          Length = 308

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 5/64 (7%)

Query: 332 KWRASQL----VFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAIT 386
           ++R SQL    +   A G+  G++  L+G+GGGFI+ P +   LG+P  V   T+LF I 
Sbjct: 166 RFRRSQLYISAIVPAAIGLLVGILAALMGVGGGFIMVPAMIYLLGMPTAVVVGTSLFQII 225

Query: 387 FSSS 390
           F ++
Sbjct: 226 FVTA 229


>gi|407972997|ref|ZP_11153910.1| hypothetical protein NA8A_01815 [Nitratireductor indicus C115]
 gi|407431768|gb|EKF44439.1| hypothetical protein NA8A_01815 [Nitratireductor indicus C115]
          Length = 306

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 47/98 (47%), Gaps = 1/98 (1%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G + G+ G+GGGF++ PL +   IPP ++ AT    +  SS    + ++  K 
Sbjct: 18  LAMGGAVGFLSGMFGVGGGFLITPLLIFYNIPPSIAVATGANQVIASSFSGALAHFKRKS 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 439
             V       A  I+ A +G  V   L + LG+  +I+
Sbjct: 78  LDVKLGTVLLAGGIVGASLGAVVFAYL-RRLGQLDLIV 114


>gi|420163862|ref|ZP_14670596.1| hypothetical protein HMPREF9995_07828 [Staphylococcus epidermidis
           NIHLM095]
 gi|420168565|ref|ZP_14675173.1| hypothetical protein HMPREF9993_07496 [Staphylococcus epidermidis
           NIHLM087]
 gi|394232988|gb|EJD78599.1| hypothetical protein HMPREF9995_07828 [Staphylococcus epidermidis
           NIHLM095]
 gi|394233274|gb|EJD78882.1| hypothetical protein HMPREF9993_07496 [Staphylococcus epidermidis
           NIHLM087]
          Length = 254

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 60/118 (50%), Gaps = 10/118 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++     G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVTFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 441
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117


>gi|126461650|ref|YP_001042764.1| hypothetical protein Rsph17029_0881 [Rhodobacter sphaeroides ATCC
           17029]
 gi|221638621|ref|YP_002524883.1| hypothetical protein RSKD131_0522 [Rhodobacter sphaeroides KD131]
 gi|332557643|ref|ZP_08411965.1| hypothetical protein RSWS8N_01290 [Rhodobacter sphaeroides WS8N]
 gi|429206978|ref|ZP_19198238.1| Sulfite exporter TauE/SafE [Rhodobacter sp. AKP1]
 gi|126103314|gb|ABN75992.1| protein of unknown function DUF81 [Rhodobacter sphaeroides ATCC
           17029]
 gi|221159402|gb|ACM00382.1| Hypothetical Protein RSKD131_0522 [Rhodobacter sphaeroides KD131]
 gi|332275355|gb|EGJ20670.1| hypothetical protein RSWS8N_01290 [Rhodobacter sphaeroides WS8N]
 gi|428189973|gb|EKX58525.1| Sulfite exporter TauE/SafE [Rhodobacter sp. AKP1]
          Length = 306

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 2/100 (2%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G + G+ G+GGGF++ PL   +GIPP V+ AT    +  SS   V+     K       L
Sbjct: 25  GFMSGMFGVGGGFLITPLLFFIGIPPAVAVATGANQVVASSVSGVLAQLKRKTVDFRMGL 84

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
              A  +  +  G  V  +L   LG+  + +  LS+++F+
Sbjct: 85  VLLAGGVAGSAFGVAVFARL-NTLGQVDLFV-QLSYVLFL 122


>gi|261420421|ref|YP_003254103.1| hypothetical protein GYMC61_3061 [Geobacillus sp. Y412MC61]
 gi|319768087|ref|YP_004133588.1| hypothetical protein [Geobacillus sp. Y412MC52]
 gi|261376878|gb|ACX79621.1| protein of unknown function DUF81 [Geobacillus sp. Y412MC61]
 gi|317112953|gb|ADU95445.1| protein of unknown function DUF81 [Geobacillus sp. Y412MC52]
          Length = 300

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 51/116 (43%), Gaps = 14/116 (12%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 397
           G  AG VG L GLGGG I+ P  L  G       + PQ +  T+L  I F+   S + Y 
Sbjct: 10  GFIAGTVGSLAGLGGGVIIVPSLLFFGALGWLSAVTPQTAVGTSLVVIIFNGLSSTLSYM 69

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
             K       L F   S+  A +G  V   L       S   F+L F +F+ A+SL
Sbjct: 70  KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118


>gi|407777273|ref|ZP_11124543.1| hypothetical protein NA2_04876 [Nitratireductor pacificus pht-3B]
 gi|407300973|gb|EKF20095.1| hypothetical protein NA2_04876 [Nitratireductor pacificus pht-3B]
          Length = 306

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 62/128 (48%), Gaps = 4/128 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G + G+ G+GGGF++ PL +   IPP ++ AT    +  SS    + ++  K 
Sbjct: 18  LAMGGAVGFLSGMFGVGGGFLITPLLIFYNIPPAIAVATGANQVIASSFSGALVHFKRKT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V       A  I+ A +G      L + LG+  +I+ +L +++F+    +GG+ L + 
Sbjct: 78  LDVKLGTVLLAGGIVGASLGAMAFAYL-RRLGQLDLIV-SLLYVLFLGV--VGGLMLLES 133

Query: 462 IKRIEHKE 469
           ++ +    
Sbjct: 134 VRALRRAR 141


>gi|404445797|ref|ZP_11010928.1| hypothetical protein MVAC_21158 [Mycobacterium vaccae ATCC 25954]
 gi|403651615|gb|EJZ06726.1| hypothetical protein MVAC_21158 [Mycobacterium vaccae ATCC 25954]
          Length = 291

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 45/79 (56%), Gaps = 5/79 (6%)

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 370
            AVA+ +GRR + +   EG +    ++V   A G+  G+V GL+G GGGF++ P L L  
Sbjct: 109 TAVAMLRGRRAVGTS-REGHRLPVPKIV---AEGLVVGLVTGLVGAGGGFLVVPALALLG 164

Query: 371 GIPPQVSSATALFAITFSS 389
           G+P  ++  T+L  I   S
Sbjct: 165 GLPMPIAVGTSLIVIAMKS 183


>gi|390960291|ref|YP_006424125.1| putative permease [Thermococcus sp. CL1]
 gi|390518599|gb|AFL94331.1| putative permease [Thermococcus sp. CL1]
          Length = 254

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 43/85 (50%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           + V Y A G+  G++  L GLGGGF++ P    LG+    +  T+  A+ F+S  S + Y
Sbjct: 3   KYVSYFAVGVFIGILAALFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSALAY 62

Query: 397 YLLKRFPVPYALYFFALSIIAAFVG 421
               R      L   + ++I A++G
Sbjct: 63  SRQGRIHYKVGLLLASTAVIGAYIG 87


>gi|297528876|ref|YP_003670151.1| hypothetical protein GC56T3_0521 [Geobacillus sp. C56-T3]
 gi|297252128|gb|ADI25574.1| protein of unknown function DUF81 [Geobacillus sp. C56-T3]
          Length = 300

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 51/116 (43%), Gaps = 14/116 (12%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 397
           G  AG VG L GLGGG I+ P  L  G       + PQ +  T+L  I F+   S + Y 
Sbjct: 10  GFIAGTVGSLAGLGGGVIIVPSLLFFGALGWLSAVTPQTAVGTSLVVIIFNGLSSTLSYM 69

Query: 398 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
             K       L F   S+  A +G  V   L       S   F+L F +F+ A+SL
Sbjct: 70  KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118


>gi|418412109|ref|ZP_12985374.1| hypothetical protein HMPREF9281_00978 [Staphylococcus epidermidis
           BVS058A4]
 gi|420184668|ref|ZP_14690777.1| hypothetical protein HMPREF9986_02847 [Staphylococcus epidermidis
           NIHLM040]
 gi|394257319|gb|EJE02241.1| hypothetical protein HMPREF9986_02847 [Staphylococcus epidermidis
           NIHLM040]
 gi|410890123|gb|EKS37923.1| hypothetical protein HMPREF9281_00978 [Staphylococcus epidermidis
           BVS058A4]
          Length = 254

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 60/118 (50%), Gaps = 10/118 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++     G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVTFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 441
           S +++       LK     + L F A    +IIA ++  ++LK LI I+  + ++I+T
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117


>gi|229056545|ref|ZP_04195952.1| hypothetical protein bcere0026_6680 [Bacillus cereus AH603]
 gi|228720758|gb|EEL72315.1| hypothetical protein bcere0026_6680 [Bacillus cereus AH603]
          Length = 257

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 42/122 (34%), Positives = 63/122 (51%), Gaps = 13/122 (10%)

Query: 312 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
            A+ ++  +R    KG EG    KW AS L F        G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-DRKGSEGIMYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169

Query: 369 -ELGIPPQVSSATALFAITFSSSMSVVEYYLLK-RFPVPYALYFFALSIIAAFVGQHVLK 426
             L +P + + AT++ AITF SS+ +    ++  +  V  AL     S+ AA +G  V K
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGITTGKVITGQIVVIPALIVAIASLFAAPLGVKVGK 228

Query: 427 KL 428
           +L
Sbjct: 229 RL 230


>gi|229165721|ref|ZP_04293489.1| hypothetical protein bcere0007_6980 [Bacillus cereus AH621]
 gi|423595218|ref|ZP_17571249.1| hypothetical protein IIG_04086 [Bacillus cereus VD048]
 gi|228617722|gb|EEK74779.1| hypothetical protein bcere0007_6980 [Bacillus cereus AH621]
 gi|401222489|gb|EJR29079.1| hypothetical protein IIG_04086 [Bacillus cereus VD048]
          Length = 257

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)

Query: 312 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
            A+ ++  +R    KG EG    KW AS L F        G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169

Query: 369 -ELGIPPQVSSATALFAITFSSSMSV 393
             L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194


>gi|86356207|ref|YP_468099.1| permease [Rhizobium etli CFN 42]
 gi|86280309|gb|ABC89372.1| putative permease protein [Rhizobium etli CFN 42]
          Length = 307

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             +          +  A VG  +   L++ +G+  +II +L ++IF+   ++GG+ L + 
Sbjct: 78  LDIKLGTVLLVGGLSGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           IK +  
Sbjct: 134 IKAMRR 139


>gi|326332008|ref|ZP_08198293.1| putative membrane protein [Nocardioidaceae bacterium Broad-1]
 gi|325950146|gb|EGD42201.1| putative membrane protein [Nocardioidaceae bacterium Broad-1]
          Length = 311

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 40/78 (51%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G+V GL G+GGG ++ P  + LG+PP  + A  L A + + S+    ++      +  A 
Sbjct: 21  GIVVGLTGMGGGALMTPALIFLGVPPTTAVANDLVAASVNKSVGAAVHWRHGSPNIRLAT 80

Query: 409 YFFALSIIAAFVGQHVLK 426
           Y    S+  AF+G  V K
Sbjct: 81  YLIIGSVPCAFLGTFVHK 98


>gi|260907045|ref|ZP_05915367.1| hypothetical protein BlinB_17054 [Brevibacterium linens BL2]
          Length = 266

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 53/107 (49%), Gaps = 4/107 (3%)

Query: 316 LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG-IPP 374
           + +GRR+   + +   +   S+++   A G   G+V G +G GGGF++ P  + L  +P 
Sbjct: 126 MIRGRRQTGDQHERPPELPISKVL---ATGAVVGLVSGFVGAGGGFLIVPALVLLARLPM 182

Query: 375 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
             + AT+L  IT  S+  +  Y L        A+    L++  +F+G
Sbjct: 183 PRAVATSLLIITMQSASGLAGYALSTPLDWSLAVAIAGLAVAGSFIG 229


>gi|423455682|ref|ZP_17432535.1| hypothetical protein IEE_04426 [Bacillus cereus BAG5X1-1]
 gi|401134319|gb|EJQ41936.1| hypothetical protein IEE_04426 [Bacillus cereus BAG5X1-1]
          Length = 255

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)

Query: 312 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
            A+ ++  +R    KG EG    KW AS L F        G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGIMYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169

Query: 369 -ELGIPPQVSSATALFAITFSSSMSV 393
             L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194


>gi|332185052|ref|ZP_08386801.1| hypothetical protein SUS17_418 [Sphingomonas sp. S17]
 gi|332014776|gb|EGI56832.1| hypothetical protein SUS17_418 [Sphingomonas sp. S17]
          Length = 253

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 30/129 (23%), Positives = 59/129 (45%), Gaps = 12/129 (9%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           + A G  A +V G LG+  G I   L + LG+PP  +SA      TF++++S + + + +
Sbjct: 14  FVAVGFAAQVVDGALGMAFGVISNTLLMSLGVPPAAASAGVHSVETFTTAVSGISHAVHR 73

Query: 401 RFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAK 460
                  L      +I   +G +VL  +   + +  I+++            L G+G+  
Sbjct: 74  NVDWKLFLRLMIPGVIGGALGAYVLSNIDAAVVKPFILMY------------LAGIGVYL 121

Query: 461 MIKRIEHKE 469
           +I+ + H+ 
Sbjct: 122 LIRALRHRP 130


>gi|423367325|ref|ZP_17344758.1| hypothetical protein IC3_02427 [Bacillus cereus VD142]
 gi|401085435|gb|EJP93677.1| hypothetical protein IC3_02427 [Bacillus cereus VD142]
          Length = 257

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)

Query: 312 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
            A+ ++  +R    KG EG    KW AS L F        G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169

Query: 369 -ELGIPPQVSSATALFAITFSSSMSV 393
             L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194


>gi|423474379|ref|ZP_17451118.1| hypothetical protein IEM_05680 [Bacillus cereus BAG6O-2]
 gi|402423143|gb|EJV55362.1| hypothetical protein IEM_05680 [Bacillus cereus BAG6O-2]
          Length = 255

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)

Query: 312 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
            A+ ++  +R    KG EG    KW AS L F        G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGIMYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169

Query: 369 -ELGIPPQVSSATALFAITFSSSMSV 393
             L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194


>gi|395790715|ref|ZP_10470175.1| hypothetical protein MEC_00166 [Bartonella alsatica IBS 382]
 gi|395409467|gb|EJF76057.1| hypothetical protein MEC_00166 [Bartonella alsatica IBS 382]
          Length = 304

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 1/54 (1%)

Query: 342 CACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 394
            A G+  G++  ++G+GGGF + P L   L +P  V   T+LF ITF SS + V
Sbjct: 179 LAIGLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232


>gi|358639366|dbj|BAL26663.1| permease, putative [Azoarcus sp. KH32C]
          Length = 276

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 4/87 (4%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS--ATALFAITFSSSMSVVEYYLLKRF 402
           G   G + G+LG+GGGF++ P  L   +P  + S  AT+L AI  +SS++ V   L  R 
Sbjct: 163 GAATGFLSGMLGVGGGFVIVPA-LRATLPLSMHSAVATSLMAIALTSSITFVTCILQGRI 221

Query: 403 -PVPYALYFFALSIIAAFVGQHVLKKL 428
            P+  A+ F A S++   +G+ V  ++
Sbjct: 222 VPLLAAVPFVAGSVLGMVLGRRVAPRI 248


>gi|119026567|ref|YP_910412.1| hypothetical protein BAD_1549 [Bifidobacterium adolescentis ATCC
           15703]
 gi|118766151|dbj|BAF40330.1| widely conserved hypothetical transmembrane protein with dub81
           [Bifidobacterium adolescentis ATCC 15703]
          Length = 292

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 5/143 (3%)

Query: 307 GVSAYEAVALYKGRRKIASK-GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 365
           G +A  A A+    +K+ S+ G+      A  ++     G+  G++ G  G+GGG ++ P
Sbjct: 5   GETADRADAVRTESQKMESRAGETALDGSARSIIVLIVVGMLVGVLSGTFGIGGGTVIVP 64

Query: 366 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 425
             + LG+  + ++AT+  AI  +S   VV Y          A   F    +   +G  +L
Sbjct: 65  ALVWLGLSQRNAAATSTLAIVPTSISGVVSYATGGHVDWLAAALLFCGMFVGGQIGSWLL 124

Query: 426 KKLIKILGRASIIIFTLSFIIFV 448
            +L +++ R    IF ++F++FV
Sbjct: 125 SRLPELVLR---WIF-VAFLVFV 143


>gi|423556322|ref|ZP_17532625.1| hypothetical protein II3_01527 [Bacillus cereus MC67]
 gi|401195511|gb|EJR02467.1| hypothetical protein II3_01527 [Bacillus cereus MC67]
          Length = 257

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)

Query: 312 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
            A+ ++  +R    KG EG    KW AS L F        G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169

Query: 369 -ELGIPPQVSSATALFAITFSSSMSV 393
             L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194


>gi|347523588|ref|YP_004781158.1| hypothetical protein Pyrfu_1038 [Pyrolobus fumarii 1A]
 gi|343460470|gb|AEM38906.1| protein of unknown function DUF81 [Pyrolobus fumarii 1A]
          Length = 241

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 42/81 (51%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
            G+ AG+VG + GLGGG I+ PL +  G+ P  + + +LFA+  +S   ++  +      
Sbjct: 5   VGLLAGIVGSIAGLGGGAIMVPLLILAGVEPHHAVSASLFAVLGTSLGGIIHLHKEGLLD 64

Query: 404 VPYALYFFALSIIAAFVGQHV 424
              A+     S+  A +G +V
Sbjct: 65  TRLAVTLATASVTGAVIGAYV 85


>gi|443325967|ref|ZP_21054637.1| putative permease [Xenococcus sp. PCC 7305]
 gi|442794404|gb|ELS03821.1| putative permease [Xenococcus sp. PCC 7305]
          Length = 260

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 48/93 (51%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
             L+F    G  +G++ G LG+GGG +L PL + LG  P  + AT+  AIT ++    ++
Sbjct: 4   DNLLFIAFGGFISGILAGFLGIGGGTVLVPLLVALGYSPIQAVATSALAITITALSGTLQ 63

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
            + +        LY    +++ A VG ++ + L
Sbjct: 64  NWRMGYIKPKSLLYLGVPALLFAQVGVYLSEGL 96


>gi|218887074|ref|YP_002436395.1| hypothetical protein DvMF_1984 [Desulfovibrio vulgaris str.
           'Miyazaki F']
 gi|218758028|gb|ACL08927.1| protein of unknown function DUF81 [Desulfovibrio vulgaris str.
           'Miyazaki F']
          Length = 415

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 41/86 (47%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           A G+ AG++ G +G GGGFI+ P  +  G+   ++  T LF I   + M    +  L   
Sbjct: 79  AVGLAAGLITGCIGAGGGFIITPALMAAGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNV 138

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKL 428
            V  A+ F   S    F+G  + K L
Sbjct: 139 SVKLAIAFLVGSGAGTFIGGAINKGL 164


>gi|163938698|ref|YP_001643582.1| hypothetical protein BcerKBAB4_0692 [Bacillus weihenstephanensis
           KBAB4]
 gi|229131719|ref|ZP_04260596.1| hypothetical protein bcere0014_6720 [Bacillus cereus BDRD-ST196]
 gi|423485997|ref|ZP_17462679.1| hypothetical protein IEU_00620 [Bacillus cereus BtB2-4]
 gi|423491721|ref|ZP_17468365.1| hypothetical protein IEW_00619 [Bacillus cereus CER057]
 gi|423501487|ref|ZP_17478104.1| hypothetical protein IEY_04714 [Bacillus cereus CER074]
 gi|423515545|ref|ZP_17492026.1| hypothetical protein IG7_00615 [Bacillus cereus HuA2-4]
 gi|163860895|gb|ABY41954.1| protein of unknown function DUF81 [Bacillus weihenstephanensis
           KBAB4]
 gi|228651773|gb|EEL07733.1| hypothetical protein bcere0014_6720 [Bacillus cereus BDRD-ST196]
 gi|401153579|gb|EJQ61004.1| hypothetical protein IEY_04714 [Bacillus cereus CER074]
 gi|401158654|gb|EJQ66044.1| hypothetical protein IEW_00619 [Bacillus cereus CER057]
 gi|401166933|gb|EJQ74231.1| hypothetical protein IG7_00615 [Bacillus cereus HuA2-4]
 gi|402440558|gb|EJV72550.1| hypothetical protein IEU_00620 [Bacillus cereus BtB2-4]
          Length = 257

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)

Query: 312 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 368
            A+ ++  +R    KG EG    KW AS L F        G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169

Query: 369 -ELGIPPQVSSATALFAITFSSSMSV 393
             L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194


>gi|418327458|ref|ZP_12938613.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
 gi|418630881|ref|ZP_13193353.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
 gi|365232948|gb|EHM73921.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
 gi|374836191|gb|EHR99779.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
          Length = 254

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 61/119 (51%), Gaps = 10/119 (8%)

Query: 332 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 391
            W  S ++   A G  A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  
Sbjct: 2   DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60

Query: 392 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 442
           S +++       LK     + L F A    +IIA ++   +LK LI I+  + ++I+T+
Sbjct: 61  SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPAIILKPLI-IIALSLVLIYTI 118


>gi|152974977|ref|YP_001374494.1| hypothetical protein Bcer98_1177 [Bacillus cytotoxicus NVH 391-98]
 gi|152023729|gb|ABS21499.1| protein of unknown function DUF81 [Bacillus cytotoxicus NVH 391-98]
          Length = 266

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 35/58 (60%), Gaps = 1/58 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           G+ AG +G L+GLGGG I+ PL + L  + PQ++  T++  + F+   S + Y   KR
Sbjct: 10  GLIAGTIGSLVGLGGGIIVVPLLIGLHSLSPQIAVGTSIVTVVFTGLSSTLAYVKHKR 67


>gi|18313308|ref|NP_559975.1| hypothetical protein PAE2388 [Pyrobaculum aerophilum str. IM2]
 gi|18160832|gb|AAL64157.1| hypothetical protein PAE2388 [Pyrobaculum aerophilum str. IM2]
          Length = 243

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 31/47 (65%)

Query: 80  VGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMV 126
           +G ++ F G    S+ GVGGG IFVP L L+ G DAK +AA+S  ++
Sbjct: 123 LGYLLVFIGGFVSSLFGVGGGTIFVPALILLAGLDAKLAAAMSMGII 169


>gi|384222369|ref|YP_005613535.1| hypothetical protein BJ6T_87050 [Bradyrhizobium japonicum USDA 6]
 gi|354961268|dbj|BAL13947.1| hypothetical protein BJ6T_87050 [Bradyrhizobium japonicum USDA 6]
          Length = 305

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALSYW 73


>gi|334341160|ref|YP_004546140.1| hypothetical protein [Desulfotomaculum ruminis DSM 2154]
 gi|334092514|gb|AEG60854.1| protein of unknown function DUF81 [Desulfotomaculum ruminis DSM
           2154]
          Length = 426

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 2/98 (2%)

Query: 336 SQLVFY--CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
           +  +FY   A G+ AG++ G +G GGGF++ P  + LG+   ++  T  F I   + M  
Sbjct: 70  TPFMFYGSMAVGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMGT 129

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 431
           V +  L    +P A+ F   S I    G  + + L  +
Sbjct: 130 VIHKKLGNVNIPLAIAFLCGSGIGVTAGGTLNRALFNM 167


>gi|254422238|ref|ZP_05035956.1| conserved domain protein, putative [Synechococcus sp. PCC 7335]
 gi|196189727|gb|EDX84691.1| conserved domain protein, putative [Synechococcus sp. PCC 7335]
          Length = 266

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 10/88 (11%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           S +V     G+ +G++ G LG+GGG +L P+ L+LG     ++AT+  AI  +S+    +
Sbjct: 4   SSVVLLSIAGLFSGILAGFLGIGGGTLLVPVLLQLGFESHAATATSSLAILVTSTTGSAQ 63

Query: 396 YY----------LLKRFPVPYALYFFAL 413
            +          LL   P   A +F AL
Sbjct: 64  NWRMGYLDPKQILLLGIPAAIAGFFAAL 91


>gi|298674473|ref|YP_003726223.1| hypothetical protein Metev_0515 [Methanohalobium evestigatum
           Z-7303]
 gi|298287461|gb|ADI73427.1| protein of unknown function DUF81 [Methanohalobium evestigatum
           Z-7303]
          Length = 252

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 29/46 (63%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSS 389
            G   G+V GLLGL GG  + PL + LG   + +SAT+LF I F+S
Sbjct: 140 SGFMIGIVAGLLGLSGGVFIVPLLVYLGYNIKTASATSLFIIVFTS 185


>gi|229817013|ref|ZP_04447295.1| hypothetical protein BIFANG_02268 [Bifidobacterium angulatum DSM
           20098 = JCM 7096]
 gi|229785758|gb|EEP21872.1| hypothetical protein BIFANG_02268 [Bifidobacterium angulatum DSM
           20098 = JCM 7096]
          Length = 280

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 34/136 (25%), Positives = 63/136 (46%), Gaps = 15/136 (11%)

Query: 311 YEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL 370
           YE  AL+   R+               L    A G+ AG+  G+ G+GGG ++ P  + +
Sbjct: 3   YEQAALHGSARR--------------TLCILLAVGLIAGLFSGMFGIGGGTVIVPALVWI 48

Query: 371 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           G+  + ++AT++ AI  +S   V+ Y          AL      ++ + +G  +L +L +
Sbjct: 49  GLTQRQAAATSICAIIITSISGVISYAHGGDVDWIAALLLAVGMMVGSQIGSWLLSRLSE 108

Query: 431 ILGRASIIIFTLSFII 446
           +  R   ++F L F+I
Sbjct: 109 VFLRWFYVVF-LCFVI 123


>gi|119485715|ref|ZP_01619990.1| hypothetical protein L8106_25070 [Lyngbya sp. PCC 8106]
 gi|119457040|gb|EAW38167.1| hypothetical protein L8106_25070 [Lyngbya sp. PCC 8106]
          Length = 297

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 33/54 (61%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSS 389
           S+L+     GI AG++ G LG+GGG +L P+ L LG  P  + AT+  +I  +S
Sbjct: 4   SELLTLAVSGIFAGILAGFLGIGGGTVLVPMMLGLGYEPINAVATSTLSIMITS 57


>gi|374572587|ref|ZP_09645683.1| putative permease [Bradyrhizobium sp. WSM471]
 gi|374420908|gb|EHR00441.1| putative permease [Bradyrhizobium sp. WSM471]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73


>gi|83313505|ref|YP_423769.1| permease [Magnetospirillum magneticum AMB-1]
 gi|82948346|dbj|BAE53210.1| Predicted permease [Magnetospirillum magneticum AMB-1]
          Length = 317

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 5/64 (7%)

Query: 332 KWRASQL----VFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAIT 386
           ++R SQL    +   A G+  G++  L+G+GGGFI+ P +   LG+P  V   T+LF I 
Sbjct: 175 RFRRSQLYISAIVPAAIGLLVGILAALMGVGGGFIMVPAMIYLLGMPTAVVVGTSLFQII 234

Query: 387 FSSS 390
           F ++
Sbjct: 235 FVTA 238


>gi|423677376|ref|ZP_17652315.1| hypothetical protein IKS_04919 [Bacillus cereus VDM062]
 gi|401306991|gb|EJS12457.1| hypothetical protein IKS_04919 [Bacillus cereus VDM062]
          Length = 255

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 11/72 (15%)

Query: 326 KGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATA 381
           KG EG    KW AS L F        G V G++G GG F+L P+ L  L +P + + AT+
Sbjct: 130 KGTEGIMYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIATS 183

Query: 382 LFAITFSSSMSV 393
           + AITF SS+ +
Sbjct: 184 I-AITFISSIGI 194


>gi|371777478|ref|ZP_09483800.1| hypothetical protein AnHS1_08668 [Anaerophaga sp. HS1]
          Length = 265

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 55/108 (50%)

Query: 320 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 379
           RRK + K DE   +  S+++     G+ AG+V GL G  G   +    L L +P ++ + 
Sbjct: 128 RRKRSGKTDEMKLFSRSKVIQSSVYGLLAGVVTGLFGTSGTAPVLAGLLALELPLKIVAG 187

Query: 380 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           T+L  +T ++  ++  + L+    +   L+  A +I+ AF+G   + +
Sbjct: 188 TSLLVVTTNTFSALGAHLLIGEIDMTLILFLTAGTILGAFMGPRRMAR 235


>gi|423666571|ref|ZP_17641600.1| hypothetical protein IKO_00268 [Bacillus cereus VDM034]
 gi|401305297|gb|EJS10838.1| hypothetical protein IKO_00268 [Bacillus cereus VDM034]
          Length = 255

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 11/72 (15%)

Query: 326 KGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATA 381
           KG EG    KW AS L F        G V G++G GG F+L P+ L  L +P + + AT+
Sbjct: 130 KGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIATS 183

Query: 382 LFAITFSSSMSV 393
           + AITF SS+ +
Sbjct: 184 I-AITFISSIGI 194


>gi|314933135|ref|ZP_07840500.1| putative membrane protein [Staphylococcus caprae C87]
 gi|313653285|gb|EFS17042.1| putative membrane protein [Staphylococcus caprae C87]
          Length = 275

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 2/102 (1%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +  S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYHYSVPPLFAFVATLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           FSS MS V + +       Y++     SII A +G  V + L
Sbjct: 208 FSSVMSSVGHIVQGHVAWGYSVALIISSIIGAQIGVKVNRSL 249


>gi|414153131|ref|ZP_11409458.1| conserved membrane hypothetical protein [Desulfotomaculum
           hydrothermale Lam5 = DSM 18033]
 gi|411455513|emb|CCO07360.1| conserved membrane hypothetical protein [Desulfotomaculum
           hydrothermale Lam5 = DSM 18033]
          Length = 428

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 7/138 (5%)

Query: 339 VFY--CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           +FY   A G+ AG++ G +G GGGF++ P  + LG+   ++  T  F I   + M  V +
Sbjct: 73  MFYGSMAVGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMGTVIH 132

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 456
             L    VP A+ F   S +    G  + + L       S  + +L +++      LG +
Sbjct: 133 KKLGNVNVPLAIAFLVGSGLGVTAGGTLNRALFNANPVLSDFVISLVYVVM-----LGFL 187

Query: 457 GLAKMIKRIEHKEYMGFD 474
           G   +   I+ +   G D
Sbjct: 188 GFYSLYDFIKTRHQKGGD 205


>gi|423601769|ref|ZP_17577769.1| hypothetical protein III_04571 [Bacillus cereus VD078]
 gi|401228892|gb|EJR35412.1| hypothetical protein III_04571 [Bacillus cereus VD078]
          Length = 257

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 11/72 (15%)

Query: 326 KGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATA 381
           KG EG    KW AS L F        G V G++G GG F+L P+ L  L +P + + AT+
Sbjct: 130 KGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIATS 183

Query: 382 LFAITFSSSMSV 393
           + AITF SS+ +
Sbjct: 184 I-AITFISSIGI 194


>gi|398824727|ref|ZP_10583048.1| putative permease [Bradyrhizobium sp. YR681]
 gi|398224594|gb|EJN10895.1| putative permease [Bradyrhizobium sp. YR681]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73


>gi|365896889|ref|ZP_09434939.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3843]
 gi|365422354|emb|CCE07481.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3843]
          Length = 308

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 62/126 (49%), Gaps = 8/126 (6%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+  + 
Sbjct: 18  LAMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPGVAVASVSSHIAASSFSGALSYWRRRA 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV----G 457
                A        I   +G +    L++ LG+  ++I  LS++  ++  S+GGV    G
Sbjct: 78  IDPSLASVLLIGGAIGTALGVYTF-TLLRALGQLDLMI-ALSYVALLT--SVGGVMFWEG 133

Query: 458 LAKMIK 463
           L  M++
Sbjct: 134 LRAMLR 139


>gi|325959155|ref|YP_004290621.1| hypothetical protein Metbo_1410 [Methanobacterium sp. AL-21]
 gi|325330587|gb|ADZ09649.1| protein of unknown function DUF81 [Methanobacterium sp. AL-21]
          Length = 263

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 33/134 (24%), Positives = 63/134 (47%), Gaps = 8/134 (5%)

Query: 321 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 380
           +++  +  E T  +  +       G+ +G++ G+ G+ G   +      LG+P  +   T
Sbjct: 129 KELNGENQEKTTLKGPKRAVASLFGVASGLLAGVFGISGTPPIIAGLYSLGLPALMVVGT 188

Query: 381 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI-----KILG-- 433
           ++F + F+S   +  +YLL R  +   +     + + AF+G  +LKK+      KI G  
Sbjct: 189 SVFVLIFNSLAGIGGFYLLGRLNLTLIILLGGSAAVGAFIGPLLLKKINPNTFEKIYGPV 248

Query: 434 -RASIIIFTLSFII 446
              ++IIF L  II
Sbjct: 249 IMGTMIIFALGLII 262


>gi|386399941|ref|ZP_10084719.1| putative permease [Bradyrhizobium sp. WSM1253]
 gi|385740567|gb|EIG60763.1| putative permease [Bradyrhizobium sp. WSM1253]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73


>gi|384085444|ref|ZP_09996619.1| hypothetical protein AthiA1_08064 [Acidithiobacillus thiooxidans
           ATCC 19377]
          Length = 304

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 42/87 (48%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            + G+  G + GL G+GGG ++ PL + +G+PP V+  T    I   S++    ++ +  
Sbjct: 18  VSFGLAVGFLSGLTGVGGGILITPLLIFIGVPPLVAVGTGAAQIVGGSAIGSYAHWRMGN 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKL 428
             +  A    A S     +G H+ + L
Sbjct: 78  VDMRMAFVLLAGSWTGGLIGVHIARIL 104


>gi|110632388|ref|YP_672596.1| hypothetical protein Meso_0026 [Chelativorans sp. BNC1]
 gi|110283372|gb|ABG61431.1| protein of unknown function DUF81 [Chelativorans sp. BNC1]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 37/138 (26%), Positives = 66/138 (47%), Gaps = 14/138 (10%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMSV 393
            ++   A G   G + G+ G+GGGF++ PL +   +PP ++ AT    + A +FS ++S 
Sbjct: 13  NMLVLLAMGGAVGFLSGMFGVGGGFLITPLLIFYNVPPAIAVATGANQVIASSFSGTLS- 71

Query: 394 VEYYLLKR--FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
                LKR    V   L      II +  G +V   L ++     ++  T+S +  V   
Sbjct: 72  ----HLKRGTLDVKLGLVLLVGGIIGSAAGIYVFAYLRQL----GLLDLTVSLLYVVLLG 123

Query: 452 SLGGVGLAKMIKRIEHKE 469
           S+GG+ L + ++ I+   
Sbjct: 124 SVGGLMLVESVRAIQRSR 141


>gi|398812927|ref|ZP_10571633.1| putative permease [Brevibacillus sp. BC25]
 gi|398039917|gb|EJL33039.1| putative permease [Brevibacillus sp. BC25]
          Length = 274

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 8/102 (7%)

Query: 339 VFYCACGITAGMVGGLLGLGGGFILGPLFLELG---IP----PQVSSATALFAITFSSSM 391
           + +   G+ AG++G + GLGGG    P  L      IP    PQV++AT+L  I  ++  
Sbjct: 7   ILFVIIGVVAGVIGSIAGLGGGIFFVPALLYFANWYIPGSMNPQVAAATSLIVIAVTALS 66

Query: 392 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 433
           S + Y    +     AL FF  S   A VG + L  L+ + G
Sbjct: 67  SSLSYLKQNKVDKQSALLFFIGSAPGAIVGVY-LNTLLAVEG 107


>gi|354583034|ref|ZP_09001934.1| protein of unknown function DUF81 [Paenibacillus lactis 154]
 gi|353198451|gb|EHB63921.1| protein of unknown function DUF81 [Paenibacillus lactis 154]
          Length = 263

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 3/83 (3%)

Query: 320 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSS 378
           R  I + G E     A  L F     +  G +  L G+GGG ++ P + L L  PP +++
Sbjct: 132 RNFIDASGAE--HRYAFNLWFGSGVALFVGFISSLFGIGGGSVMVPTMILFLAFPPHIAT 189

Query: 379 ATALFAITFSSSMSVVEYYLLKR 401
           AT++F+I  SS +  + + +L  
Sbjct: 190 ATSMFSILLSSLVGTISHVVLDH 212


>gi|83949994|ref|ZP_00958727.1| membrane protein [Roseovarius nubinhibens ISM]
 gi|83837893|gb|EAP77189.1| membrane protein [Roseovarius nubinhibens ISM]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 50/96 (52%), Gaps = 2/96 (2%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ ++  K   +        
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRKTVDLRMGTVLLV 88

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
             +  A VG  VL   +K +G+  +++  L +++F+
Sbjct: 89  GGLCGAAVGV-VLFNYLKSMGQVDLLV-KLCYVVFL 122


>gi|146278323|ref|YP_001168482.1| hypothetical protein Rsph17025_2287 [Rhodobacter sphaeroides ATCC
           17025]
 gi|145556564|gb|ABP71177.1| protein of unknown function DUF81 [Rhodobacter sphaeroides ATCC
           17025]
          Length = 306

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 49/100 (49%), Gaps = 2/100 (2%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 408
           G + G+ G+GGGF++ PL   +GIPP V+ AT    +  SS   V+     K       +
Sbjct: 25  GFMSGMFGVGGGFLITPLLFFIGIPPAVAVATGANQVVASSVSGVLAQIKRKAVDFRMGM 84

Query: 409 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
              A  +  +  G  V  +L   LG+  + +  LS+++F+
Sbjct: 85  VLLAGGVAGSAFGVAVFARL-NTLGQVDLFV-QLSYVLFL 122


>gi|383935848|ref|ZP_09989281.1| UPF0721 transmembrane protein [Rheinheimera nanhaiensis E407-8]
 gi|383703167|dbj|GAB59372.1| UPF0721 transmembrane protein [Rheinheimera nanhaiensis E407-8]
          Length = 260

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 41/93 (44%), Gaps = 1/93 (1%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           S L+  C    +AG +  + G GGG +  P  L  G+PP V   T   A TF S  + + 
Sbjct: 10  SVLLILCLVAFSAGFIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLT 68

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           YY  K F   Y       + + A +G  V+  L
Sbjct: 69  YYRKKLFNPLYWRRSLGYTALGAVLGTLVVDHL 101


>gi|383769051|ref|YP_005448114.1| hypothetical protein S23_07820 [Bradyrhizobium sp. S23321]
 gi|381357172|dbj|BAL74002.1| hypothetical protein S23_07820 [Bradyrhizobium sp. S23321]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73


>gi|251810296|ref|ZP_04824769.1| hypothetical membrane protein [Staphylococcus epidermidis
           BCM-HMP0060]
 gi|251806178|gb|EES58835.1| hypothetical membrane protein [Staphylococcus epidermidis
           BCM-HMP0060]
          Length = 275

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +R S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEYYLLKRFP 403
           FSS MS + +      P
Sbjct: 208 FSSVMSSIGHIFQGHVP 224


>gi|120404979|ref|YP_954808.1| hypothetical protein Mvan_4025 [Mycobacterium vanbaalenii PYR-1]
 gi|119957797|gb|ABM14802.1| protein of unknown function DUF81 [Mycobacterium vanbaalenii PYR-1]
          Length = 291

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 48/86 (55%), Gaps = 5/86 (5%)

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 370
            A+A+ +GR+ I + G+ G +    +++   A G+  G+V GL+G GGGF++ P L L  
Sbjct: 109 TAIAMLRGRKTIET-GETGHRLPVPKIL---AEGLIVGLVTGLVGAGGGFLVVPALALLG 164

Query: 371 GIPPQVSSATALFAITFSSSMSVVEY 396
           G+P  ++  T+L  I   S   +  Y
Sbjct: 165 GLPMPIAVGTSLIVIAMKSFAGLAGY 190


>gi|340622187|ref|YP_004740639.1| hypothetical protein Ccan_14160 [Capnocytophaga canimorsus Cc5]
 gi|339902453|gb|AEK23532.1| UPF0721 transmembrane protein [Capnocytophaga canimorsus Cc5]
          Length = 266

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 19/95 (20%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELG-IPPQVSSATALFAITFSS----------SMSV 393
           G+T G++ GL+G GGGF++ P  + L  +P +++  T+LF I   S          S ++
Sbjct: 153 GLTVGVLTGLVGAGGGFLIIPALVLLARMPMKLAVGTSLFIIAIKSLLGFLGDLQTSYTL 212

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           + + LL        LYF  LS+I  F+G  + KK+
Sbjct: 213 INWRLL--------LYFTLLSVIGIFIGMLLAKKI 239


>gi|189345900|ref|YP_001942429.1| hypothetical protein Clim_0357 [Chlorobium limicola DSM 245]
 gi|189340047|gb|ACD89450.1| conserved hypothetical protein [Chlorobium limicola DSM 245]
          Length = 121

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSS-MSVV 394
           Q VF    GI AG++ G+ G+GGG I+ P L L  G+    ++AT+L A+      + V+
Sbjct: 2   QTVFLLIIGIAAGLLSGMFGVGGGIIIVPALVLLFGMSQHSANATSLVALLLPVGILGVM 61

Query: 395 EYY 397
           EYY
Sbjct: 62  EYY 64


>gi|124028506|ref|YP_001013826.1| hypothetical protein Hbut_1668 [Hyperthermus butylicus DSM 5456]
 gi|123979200|gb|ABM81481.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
          Length = 255

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 45/85 (52%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
            G+ AG VG LLG+GGG I+ P  + LG+P +V++  +L AI  +S   +   +      
Sbjct: 11  VGVVAGFVGTLLGIGGGAIMVPALVLLGVPVKVAAPASLVAILGTSLGGIRRLFRRGLVD 70

Query: 404 VPYALYFFALSIIAAFVGQHVLKKL 428
           V  A++    S + A VG   + +L
Sbjct: 71  VMLAVFLETASGLGALVGVIAVGRL 95


>gi|347732908|ref|ZP_08865979.1| hypothetical protein DA2_2276 [Desulfovibrio sp. A2]
 gi|347518384|gb|EGY25558.1| hypothetical protein DA2_2276 [Desulfovibrio sp. A2]
          Length = 415

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 41/86 (47%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           A G+ AG++ G +G GGGFI+ P  +  G+   ++  T LF I   + M    +  L   
Sbjct: 79  AVGLAAGLITGCIGAGGGFIITPALMAAGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNV 138

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKL 428
            V  A+ F   S    F+G  + K L
Sbjct: 139 SVKLAIAFLVGSGAGTFIGGAINKGL 164


>gi|393795138|ref|ZP_10378502.1| hypothetical protein CNitlB_01991 [Candidatus Nitrosoarchaeum
           limnia BG20]
          Length = 258

 Score = 39.3 bits (90), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 34/58 (58%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            G  AG++G ++GLGGG ++ P+    G PP ++++ +LFA   ++  S   Y   KR
Sbjct: 11  LGFAAGVLGSMIGLGGGIVVVPVLTFFGFPPTLAASNSLFAAFSNAVGSTFSYSRQKR 68


>gi|15616016|ref|NP_244321.1| hypothetical protein BH3454 [Bacillus halodurans C-125]
 gi|10176077|dbj|BAB07173.1| BH3454 [Bacillus halodurans C-125]
          Length = 273

 Score = 39.3 bits (90), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 46/93 (49%), Gaps = 13/93 (13%)

Query: 340 FYCAC-GITAGMVGGLLGLGGGFILGPLFLELG--------IPPQVSSATALFAITFSSS 390
           F   C G+ AG +G L+GLGGG I+ P  L LG        + PQV+  T+L  + F+  
Sbjct: 4   FILVCVGLLAGTIGSLMGLGGGIIVVPALLWLGGTALLSEAVTPQVAVGTSLLIMIFTGL 63

Query: 391 MSVVEYYLLKRFPVPY--ALYFFALSIIAAFVG 421
            S   Y  +K+  V Y   L FF  S   A VG
Sbjct: 64  SSTFAY--MKKGTVDYKSGLIFFIGSGPGALVG 94


>gi|209547817|ref|YP_002279734.1| hypothetical protein Rleg2_0208 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
 gi|424915621|ref|ZP_18338985.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM597]
 gi|209533573|gb|ACI53508.1| protein of unknown function DUF81 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
 gi|392851797|gb|EJB04318.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM597]
          Length = 307

 Score = 39.3 bits (90), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V          +  A VG  +   L++ +G+  +II +L ++IF+   ++GG+ L + 
Sbjct: 78  LDVKLGTVLLVGGLTGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           I  +  
Sbjct: 134 INAMRR 139


>gi|94498747|ref|ZP_01305296.1| hypothetical protein SKA58_14342 [Sphingomonas sp. SKA58]
 gi|94421797|gb|EAT06849.1| hypothetical protein SKA58_14342 [Sphingomonas sp. SKA58]
          Length = 304

 Score = 39.3 bits (90), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 74/168 (44%), Gaps = 23/168 (13%)

Query: 280 LQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAV----ALYKGRRKIASKGDEG----- 330
           LQ      T   +LY    +L +   GG+ A E++    AL  GR+  ASK         
Sbjct: 104 LQSLGQIDTVIGILY----VLMLGGIGGLMAKESIQALIALKTGRKPQASKRRHHPLVAA 159

Query: 331 --TKWRASQLVFYCA------CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATA 381
              +WR  +   Y +       G+  G++  LLG+GGGFIL P  L  LG+  Q    T+
Sbjct: 160 LPMRWRFYRSGLYISPLAPFLLGMATGILTMLLGVGGGFILVPAMLYLLGMTTQSVVGTS 219

Query: 382 LFAITFSSSMSVVEYYL-LKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           LF I F +  + + + +  K   +  A+     S+  A +G  +  KL
Sbjct: 220 LFQILFVTMATTMMHAMTTKAVDLVLAMLLLIGSVTGAQIGTRLSMKL 267


>gi|395788323|ref|ZP_10467887.1| hypothetical protein ME7_01222 [Bartonella birtlesii LL-WM9]
 gi|395408240|gb|EJF74851.1| hypothetical protein ME7_01222 [Bartonella birtlesii LL-WM9]
          Length = 306

 Score = 39.3 bits (90), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 1/54 (1%)

Query: 342 CACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 394
            A G+  G++  ++G+GGGF + P L   L +P  V   T+LF ITF SS + +
Sbjct: 179 LAIGLIVGLLSSIMGIGGGFFMIPALIYLLHVPTSVVIGTSLFQITFVSSFTTI 232


>gi|329765383|ref|ZP_08256963.1| hypothetical protein Nlim_0726 [Candidatus Nitrosoarchaeum limnia
           SFB1]
 gi|329138289|gb|EGG42545.1| hypothetical protein Nlim_0726 [Candidatus Nitrosoarchaeum limnia
           SFB1]
          Length = 258

 Score = 39.3 bits (90), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 37/59 (62%), Gaps = 2/59 (3%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM-SVVEYYLLKR 401
            G  AG++G ++GLGGG ++ P+    G PP ++++ +LFA  FS+++ S   Y   KR
Sbjct: 11  LGFAAGVLGSMIGLGGGIVVVPVLTFFGFPPTLAASNSLFA-AFSNAIGSTFSYSRQKR 68


>gi|260914000|ref|ZP_05920474.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325]
 gi|260632087|gb|EEX50264.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325]
          Length = 266

 Score = 39.3 bits (90), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 8/130 (6%)

Query: 96  GVGGGGIFVPML-NLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           G+GGG + VP+L  L   F  +  AA S  +V G A+  + Y  +Q+        I++  
Sbjct: 24  GIGGGVLMVPILYTLFPEFPLQMIAATSLTIVIGSALINLTYFYKQKIQ------INFKA 77

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
            LL+   +++G+ +G   +    D  I V+ I +  +L+ R       T K +++ K E 
Sbjct: 78  MLLWSLGMIIGVQLGFEASFYLPDIFIIVVFISVLTILAIRTIFFSKST-KADSVEKNEN 136

Query: 215 AKQLELIVLG 224
            K   L + G
Sbjct: 137 LKGAGLCLFG 146


>gi|395777819|ref|ZP_10458333.1| hypothetical protein MCU_00034 [Bartonella elizabethae Re6043vi]
 gi|423715745|ref|ZP_17689966.1| hypothetical protein MEE_01163 [Bartonella elizabethae F9251]
 gi|395418604|gb|EJF84926.1| hypothetical protein MCU_00034 [Bartonella elizabethae Re6043vi]
 gi|395429259|gb|EJF95328.1| hypothetical protein MEE_01163 [Bartonella elizabethae F9251]
          Length = 327

 Score = 39.3 bits (90), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 332 KWRASQL----VFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAIT 386
           ++RAS +    +     G+  G++  ++G+GGGF + P L   L +P  V   T+LF IT
Sbjct: 186 RFRASMIYVSIIPVLGIGLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQIT 245

Query: 387 FSSSMSVV 394
           F SS + V
Sbjct: 246 FVSSFTTV 253


>gi|86141473|ref|ZP_01060019.1| membrane protein, putative [Leeuwenhoekiella blandensis MED217]
 gi|295133642|ref|YP_003584318.1| hypothetical protein ZPR_1792 [Zunongwangia profunda SM-A87]
 gi|85832032|gb|EAQ50487.1| membrane protein, putative [Leeuwenhoekiella blandensis MED217]
 gi|294981657|gb|ADF52122.1| membrane protein, putative [Zunongwangia profunda SM-A87]
          Length = 267

 Score = 39.3 bits (90), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 52/105 (49%), Gaps = 5/105 (4%)

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQV 376
            G R+  S+  E    + S  +     G+  G+V G++G GGGF++ P L L   +P + 
Sbjct: 133 NGNRQFNSEVIE----KKSNYLLLSVQGLFIGVVTGMVGAGGGFLIIPALVLLAKLPMKK 188

Query: 377 SSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           + AT+LF I  +S +  +          P+ L F  +SI+  F+G
Sbjct: 189 AVATSLFIIAINSLIGFLGDVQNLDIDWPFLLIFTGISIVGIFIG 233


>gi|423525299|ref|ZP_17501772.1| hypothetical protein IGC_04682 [Bacillus cereus HuA4-10]
 gi|401167981|gb|EJQ75250.1| hypothetical protein IGC_04682 [Bacillus cereus HuA4-10]
          Length = 257

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 10/113 (8%)

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQV 376
           +  RK+ ++G    KW AS L F        G V G++G GG F+L P+ L  L +P + 
Sbjct: 126 RSNRKV-TEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRT 178

Query: 377 SSATALFAITFSSSMSVVEYYLLK-RFPVPYALYFFALSIIAAFVGQHVLKKL 428
           + AT++ AITF SS+ +    ++  +  V  AL     S+ AA +G  V K+L
Sbjct: 179 TIATSI-AITFISSIGITTGKVITGQIVVIPALIVAIASLFAAPLGVKVGKRL 230


>gi|298710141|emb|CBJ31853.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 105

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 43/81 (53%)

Query: 375 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 434
            V+SAT+   I ++S  +   + +       YA+  F + + A  VGQ V+  L+K   R
Sbjct: 1   MVASATSAVMILYTSFTATTSFMVFGLLEEDYAIALFVVGLAATAVGQVVVNHLVKKYKR 60

Query: 435 ASIIIFTLSFIIFVSALSLGG 455
            S I+ ++  ++ +SA+ +GG
Sbjct: 61  TSFIVLSIGAVVALSAVMMGG 81


>gi|297570449|ref|YP_003691793.1| protein of unknown function DUF81 [Desulfurivibrio alkaliphilus
           AHT2]
 gi|296926364|gb|ADH87174.1| protein of unknown function DUF81 [Desulfurivibrio alkaliphilus
           AHT2]
          Length = 435

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 57/117 (48%), Gaps = 8/117 (6%)

Query: 339 VFYCACGIT--AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           +F+ + G+   AG++ G +G GGGFI+ P  +  G+   ++  T LF I   + M  V +
Sbjct: 80  IFFASIGVGLGAGLITGCIGAGGGFIIAPALMSAGVKGIMAVGTDLFHIFAKAIMGSVLH 139

Query: 397 YLLKRFPVPYALYFFALSIIAAFVG---QHVLKKLIKILGRA---SIIIFTLSFIIF 447
             L    V  A+ F   +I  A VG      L +L  +L  A   ++ +F L F+ F
Sbjct: 140 RKLGNISVALAVTFLIGAIGGATVGGLINRTLYELNPVLSDAFITTVYVFMLGFLGF 196


>gi|258514257|ref|YP_003190479.1| hypothetical protein Dtox_0967 [Desulfotomaculum acetoxidans DSM
           771]
 gi|257777962|gb|ACV61856.1| protein of unknown function DUF81 [Desulfotomaculum acetoxidans DSM
           771]
          Length = 303

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 55/102 (53%), Gaps = 3/102 (2%)

Query: 329 EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITF 387
           E +   +S +V + A G+  G++  L+G+GGGFI+ P+ + L G+P   +  T++F I F
Sbjct: 165 EVSGIESSAIVLF-ALGLLVGVLAALMGVGGGFIMLPVMIYLIGMPTHNAVGTSIFVIIF 223

Query: 388 SS-SMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           ++ +++V    L     +  A+     S I A  G  + KKL
Sbjct: 224 TAINVTVANSALNHTVDLVLAMVLLVGSSIGAQFGAKLGKKL 265


>gi|425063283|ref|ZP_18466408.1| Putative permease [Pasteurella multocida subsp. gallicida X73]
 gi|404382846|gb|EJZ79303.1| Putative permease [Pasteurella multocida subsp. gallicida X73]
          Length = 266

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 32/132 (24%), Positives = 62/132 (46%), Gaps = 8/132 (6%)

Query: 96  GVGGGGIFVPML-NLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           G+GGG + VP+L  L   F  +  AA S  +V G ++  ++Y  +Q+        I++  
Sbjct: 24  GIGGGVLMVPILYTLFPDFPLQMVAATSLTIVIGSSLINLIYFYKQKIQ------INFKA 77

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
            LL+   +++G+ +G   +    D +I ++ I +  +L+ R      +  K +   + E 
Sbjct: 78  MLLWSVGMIIGVQLGFEASFYLPDTLIVIVFISVLTILAIRTIFHS-KIVKTDQAERHEN 136

Query: 215 AKQLELIVLGNG 226
            K   L + G G
Sbjct: 137 LKGAGLCLFGGG 148


>gi|420199664|ref|ZP_14705335.1| hypothetical protein HMPREF9980_05334 [Staphylococcus epidermidis
           NIHLM031]
 gi|394271414|gb|EJE15907.1| hypothetical protein HMPREF9980_05334 [Staphylococcus epidermidis
           NIHLM031]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +R S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEYYL 398
           FSS MS + + L
Sbjct: 208 FSSVMSSIGHIL 219


>gi|420178833|ref|ZP_14685159.1| hypothetical protein HMPREF9989_11288 [Staphylococcus epidermidis
           NIHLM057]
 gi|420181508|ref|ZP_14687706.1| hypothetical protein HMPREF9988_11921 [Staphylococcus epidermidis
           NIHLM053]
 gi|394245744|gb|EJD91018.1| hypothetical protein HMPREF9989_11288 [Staphylococcus epidermidis
           NIHLM057]
 gi|394245936|gb|EJD91205.1| hypothetical protein HMPREF9988_11921 [Staphylococcus epidermidis
           NIHLM053]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +R S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEYYL 398
           FSS MS + + L
Sbjct: 208 FSSVMSSIGHIL 219


>gi|57866438|ref|YP_188101.1| hypothetical protein SERP0509 [Staphylococcus epidermidis RP62A]
 gi|282875582|ref|ZP_06284453.1| putative membrane protein [Staphylococcus epidermidis SK135]
 gi|417656395|ref|ZP_12306082.1| putative membrane protein [Staphylococcus epidermidis VCU028]
 gi|417913194|ref|ZP_12556865.1| putative membrane protein [Staphylococcus epidermidis VCU109]
 gi|418611361|ref|ZP_13174451.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
 gi|418663817|ref|ZP_13225324.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
 gi|420171339|ref|ZP_14677883.1| hypothetical protein HMPREF9992_10312 [Staphylococcus epidermidis
           NIHLM070]
 gi|420182552|ref|ZP_14688688.1| hypothetical protein HMPREF9987_04152 [Staphylococcus epidermidis
           NIHLM049]
 gi|420210378|ref|ZP_14715806.1| hypothetical protein HMPREF9976_10849 [Staphylococcus epidermidis
           NIHLM003]
 gi|420212014|ref|ZP_14717369.1| hypothetical protein HMPREF9975_06216 [Staphylococcus epidermidis
           NIHLM001]
 gi|421607567|ref|ZP_16048806.1| hypothetical protein B440_04399 [Staphylococcus epidermidis
           AU12-03]
 gi|57637096|gb|AAW53884.1| conserved hypothetical protein [Staphylococcus epidermidis RP62A]
 gi|281295609|gb|EFA88132.1| putative membrane protein [Staphylococcus epidermidis SK135]
 gi|329736846|gb|EGG73111.1| putative membrane protein [Staphylococcus epidermidis VCU028]
 gi|341656570|gb|EGS80284.1| putative membrane protein [Staphylococcus epidermidis VCU109]
 gi|374411142|gb|EHQ81861.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
 gi|374823447|gb|EHR87443.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
 gi|394238212|gb|EJD83690.1| hypothetical protein HMPREF9992_10312 [Staphylococcus epidermidis
           NIHLM070]
 gi|394250097|gb|EJD95299.1| hypothetical protein HMPREF9987_04152 [Staphylococcus epidermidis
           NIHLM049]
 gi|394276430|gb|EJE20770.1| hypothetical protein HMPREF9976_10849 [Staphylococcus epidermidis
           NIHLM003]
 gi|394280281|gb|EJE24565.1| hypothetical protein HMPREF9975_06216 [Staphylococcus epidermidis
           NIHLM001]
 gi|406656772|gb|EKC83172.1| hypothetical protein B440_04399 [Staphylococcus epidermidis
           AU12-03]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +R S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEY 396
           FSS MS + +
Sbjct: 208 FSSVMSSIGH 217


>gi|347734809|ref|ZP_08867798.1| permease [Azospirillum amazonense Y2]
 gi|346922127|gb|EGY02616.1| permease [Azospirillum amazonense Y2]
          Length = 307

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 58/109 (53%), Gaps = 20/109 (18%)

Query: 306 GGVSA---YEAV-ALYKGRRKIASKGD-------EGTKWRA----SQLVFYCACGITAGM 350
           GG+S     E+V AL++GRRK A++G         G  ++     S+L       +  G 
Sbjct: 123 GGISGLMLMESVRALFRGRRKGATRGKLHHHTWLHGLPFKMRFPRSRLYISVLLPVGIGF 182

Query: 351 VGGLL----GLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVV 394
           +GG+L    G+GGGF+L P  +  LG+P  + + T+L  I  +++++ V
Sbjct: 183 IGGVLVAIMGIGGGFLLVPAMIYILGMPTTLVAGTSLLQIILTTAVAAV 231


>gi|242242213|ref|ZP_04796658.1| hypothetical membrane protein [Staphylococcus epidermidis W23144]
 gi|416123952|ref|ZP_11595138.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
 gi|418327602|ref|ZP_12938754.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
 gi|418613538|ref|ZP_13176544.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
 gi|418632622|ref|ZP_13195052.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
 gi|418633694|ref|ZP_13196100.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
 gi|420175320|ref|ZP_14681760.1| hypothetical protein HMPREF9990_06284 [Staphylococcus epidermidis
           NIHLM061]
 gi|420189924|ref|ZP_14695872.1| hypothetical protein HMPREF9984_04365 [Staphylococcus epidermidis
           NIHLM037]
 gi|420193318|ref|ZP_14699172.1| hypothetical protein HMPREF9983_09537 [Staphylococcus epidermidis
           NIHLM023]
 gi|420203830|ref|ZP_14709391.1| hypothetical protein HMPREF9978_01579 [Staphylococcus epidermidis
           NIHLM015]
 gi|242234332|gb|EES36644.1| hypothetical membrane protein [Staphylococcus epidermidis W23144]
 gi|319401800|gb|EFV90008.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
 gi|365232855|gb|EHM73831.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
 gi|374823394|gb|EHR87391.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
 gi|374832192|gb|EHR95912.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
 gi|374838641|gb|EHS02179.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
 gi|394243782|gb|EJD89143.1| hypothetical protein HMPREF9990_06284 [Staphylococcus epidermidis
           NIHLM061]
 gi|394259217|gb|EJE04075.1| hypothetical protein HMPREF9984_04365 [Staphylococcus epidermidis
           NIHLM037]
 gi|394260170|gb|EJE04990.1| hypothetical protein HMPREF9983_09537 [Staphylococcus epidermidis
           NIHLM023]
 gi|394274412|gb|EJE18833.1| hypothetical protein HMPREF9978_01579 [Staphylococcus epidermidis
           NIHLM015]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +R S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEYYL 398
           FSS MS + + L
Sbjct: 208 FSSVMSSIGHIL 219


>gi|417644318|ref|ZP_12294319.1| putative membrane protein [Staphylococcus warneri VCU121]
 gi|330684914|gb|EGG96596.1| putative membrane protein [Staphylococcus epidermidis VCU121]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 4/99 (4%)

Query: 329 EGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAI 385
           EG  +  S   LV + A  +  G++ GL G+GGG ++ PL L     PP V+  T++  I
Sbjct: 148 EGKTYHYSVPPLVAFVAT-LLIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
            FSS MS + + +       Y++     S I A +G  V
Sbjct: 207 FFSSVMSSIGHIIQGHVAWDYSIVLIISSYIGAKIGVKV 245


>gi|27467533|ref|NP_764170.1| hypothetical protein SE0615 [Staphylococcus epidermidis ATCC 12228]
 gi|293368299|ref|ZP_06614927.1| conserved hypothetical protein [Staphylococcus epidermidis
           M23864:W2(grey)]
 gi|417646541|ref|ZP_12296396.1| putative membrane protein [Staphylococcus epidermidis VCU144]
 gi|417658571|ref|ZP_12308195.1| putative membrane protein [Staphylococcus epidermidis VCU045]
 gi|417910002|ref|ZP_12553734.1| putative membrane protein [Staphylococcus epidermidis VCU037]
 gi|417911750|ref|ZP_12555450.1| putative membrane protein [Staphylococcus epidermidis VCU105]
 gi|418604210|ref|ZP_13167570.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
 gi|418608065|ref|ZP_13171279.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
 gi|418609993|ref|ZP_13173126.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
 gi|418617244|ref|ZP_13180148.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
 gi|418620984|ref|ZP_13183774.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
 gi|418624342|ref|ZP_13187018.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
 gi|419770080|ref|ZP_14296166.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-250]
 gi|419770736|ref|ZP_14296803.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-K]
 gi|420166657|ref|ZP_14673339.1| hypothetical protein HMPREF9994_10576 [Staphylococcus epidermidis
           NIHLM088]
 gi|420172159|ref|ZP_14678674.1| hypothetical protein HMPREF9991_01925 [Staphylococcus epidermidis
           NIHLM067]
 gi|420187849|ref|ZP_14693865.1| hypothetical protein HMPREF9985_06315 [Staphylococcus epidermidis
           NIHLM039]
 gi|420194410|ref|ZP_14700224.1| hypothetical protein HMPREF9982_02468 [Staphylococcus epidermidis
           NIHLM021]
 gi|420198269|ref|ZP_14703984.1| hypothetical protein HMPREF9981_09791 [Staphylococcus epidermidis
           NIHLM020]
 gi|420203118|ref|ZP_14708702.1| hypothetical protein HMPREF9979_10294 [Staphylococcus epidermidis
           NIHLM018]
 gi|420206739|ref|ZP_14712244.1| hypothetical protein HMPREF9977_04533 [Staphylococcus epidermidis
           NIHLM008]
 gi|420215409|ref|ZP_14720677.1| hypothetical protein HMPREF9974_10136 [Staphylococcus epidermidis
           NIH05005]
 gi|420216635|ref|ZP_14721838.1| hypothetical protein HMPREF9973_04057 [Staphylococcus epidermidis
           NIH05001]
 gi|420221146|ref|ZP_14726099.1| hypothetical protein HMPREF9972_12757 [Staphylococcus epidermidis
           NIH04008]
 gi|420222261|ref|ZP_14727183.1| hypothetical protein HMPREF1390_04404 [Staphylococcus epidermidis
           NIH08001]
 gi|420225187|ref|ZP_14730022.1| hypothetical protein HMPREF1389_06703 [Staphylococcus epidermidis
           NIH06004]
 gi|420226740|ref|ZP_14731518.1| hypothetical protein HMPREF1388_01657 [Staphylococcus epidermidis
           NIH05003]
 gi|420229060|ref|ZP_14733770.1| hypothetical protein HMPREF1387_01666 [Staphylococcus epidermidis
           NIH04003]
 gi|420231420|ref|ZP_14736071.1| hypothetical protein HMPREF1386_01642 [Staphylococcus epidermidis
           NIH051668]
 gi|420234086|ref|ZP_14738658.1| hypothetical protein HMPREF1385_01791 [Staphylococcus epidermidis
           NIH051475]
 gi|27315077|gb|AAO04212.1|AE016746_2 conserved hypothetical protein [Staphylococcus epidermidis ATCC
           12228]
 gi|291317546|gb|EFE57964.1| conserved hypothetical protein [Staphylococcus epidermidis
           M23864:W2(grey)]
 gi|329726803|gb|EGG63263.1| putative membrane protein [Staphylococcus epidermidis VCU144]
 gi|329737583|gb|EGG73829.1| putative membrane protein [Staphylococcus epidermidis VCU045]
 gi|341651884|gb|EGS75675.1| putative membrane protein [Staphylococcus epidermidis VCU037]
 gi|341652261|gb|EGS76050.1| putative membrane protein [Staphylococcus epidermidis VCU105]
 gi|374402560|gb|EHQ73581.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
 gi|374405432|gb|EHQ76366.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
 gi|374405979|gb|EHQ76886.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
 gi|374819091|gb|EHR83222.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
 gi|374827860|gb|EHR91717.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
 gi|374830843|gb|EHR94603.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
 gi|383357543|gb|EID35012.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-250]
 gi|383363082|gb|EID40427.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-K]
 gi|394233386|gb|EJD78993.1| hypothetical protein HMPREF9994_10576 [Staphylococcus epidermidis
           NIHLM088]
 gi|394243630|gb|EJD88992.1| hypothetical protein HMPREF9991_01925 [Staphylococcus epidermidis
           NIHLM067]
 gi|394255694|gb|EJE00641.1| hypothetical protein HMPREF9985_06315 [Staphylococcus epidermidis
           NIHLM039]
 gi|394264655|gb|EJE09330.1| hypothetical protein HMPREF9981_09791 [Staphylococcus epidermidis
           NIHLM020]
 gi|394264800|gb|EJE09471.1| hypothetical protein HMPREF9982_02468 [Staphylococcus epidermidis
           NIHLM021]
 gi|394268449|gb|EJE13006.1| hypothetical protein HMPREF9979_10294 [Staphylococcus epidermidis
           NIHLM018]
 gi|394276842|gb|EJE21175.1| hypothetical protein HMPREF9977_04533 [Staphylococcus epidermidis
           NIHLM008]
 gi|394282277|gb|EJE26480.1| hypothetical protein HMPREF9974_10136 [Staphylococcus epidermidis
           NIH05005]
 gi|394285105|gb|EJE29191.1| hypothetical protein HMPREF9972_12757 [Staphylococcus epidermidis
           NIH04008]
 gi|394289497|gb|EJE33378.1| hypothetical protein HMPREF1390_04404 [Staphylococcus epidermidis
           NIH08001]
 gi|394291602|gb|EJE35399.1| hypothetical protein HMPREF9973_04057 [Staphylococcus epidermidis
           NIH05001]
 gi|394293931|gb|EJE37628.1| hypothetical protein HMPREF1389_06703 [Staphylococcus epidermidis
           NIH06004]
 gi|394298189|gb|EJE41769.1| hypothetical protein HMPREF1388_01657 [Staphylococcus epidermidis
           NIH05003]
 gi|394299585|gb|EJE43124.1| hypothetical protein HMPREF1387_01666 [Staphylococcus epidermidis
           NIH04003]
 gi|394302741|gb|EJE46177.1| hypothetical protein HMPREF1386_01642 [Staphylococcus epidermidis
           NIH051668]
 gi|394304583|gb|EJE47981.1| hypothetical protein HMPREF1385_01791 [Staphylococcus epidermidis
           NIH051475]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +R S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEY 396
           FSS MS + +
Sbjct: 208 FSSVMSSIGH 217


>gi|423609344|ref|ZP_17585205.1| hypothetical protein IIM_00059 [Bacillus cereus VD107]
 gi|401251962|gb|EJR58230.1| hypothetical protein IIM_00059 [Bacillus cereus VD107]
          Length = 257

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 8/70 (11%)

Query: 325 SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALF 383
           S+G    KW AS L F        G V G++G GG F+L P+ L  L +P + + AT++ 
Sbjct: 132 SEGTMYNKWLASMLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIATSI- 184

Query: 384 AITFSSSMSV 393
           AITF SS+ +
Sbjct: 185 AITFISSVGI 194


>gi|148252390|ref|YP_001236975.1| permease [Bradyrhizobium sp. BTAi1]
 gi|146404563|gb|ABQ33069.1| putative permease [Bradyrhizobium sp. BTAi1]
          Length = 308

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 4/115 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+  + 
Sbjct: 18  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 456
                AL       +   +G      L++ LG+  ++I  LS++  ++  S+GGV
Sbjct: 78  IDPSLALVLMIGGTLGTALGVSTF-TLLRSLGQLDLMI-ALSYVALLT--SVGGV 128


>gi|18976740|ref|NP_578097.1| hypothetical protein PF0368 [Pyrococcus furiosus DSM 3638]
 gi|397650869|ref|YP_006491450.1| hypothetical protein PFC_00925 [Pyrococcus furiosus COM1]
 gi|18892327|gb|AAL80492.1| hypothetical protein PF0368 [Pyrococcus furiosus DSM 3638]
 gi|393188460|gb|AFN03158.1| hypothetical protein PFC_00925 [Pyrococcus furiosus COM1]
          Length = 251

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 43/84 (51%)

Query: 338 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           ++F    GI  G++  + GLGGGF++ P    LG+    +  T+  +I F+S  S + Y 
Sbjct: 1   MIFLFFVGIVVGILAAMFGLGGGFLIVPTLNILGVEIHSAIGTSSASIVFTSLSSSIAYS 60

Query: 398 LLKRFPVPYALYFFALSIIAAFVG 421
             +R      +   + +II A++G
Sbjct: 61  RQRRIHYRAGILLASTAIIGAYIG 84


>gi|421252075|ref|ZP_15707880.1| hypothetical protein AAUPMB_06233, partial [Pasteurella multocida
           subsp. multocida str. Anand1_buffalo]
 gi|401696812|gb|EJS89423.1| hypothetical protein AAUPMB_06233, partial [Pasteurella multocida
           subsp. multocida str. Anand1_buffalo]
          Length = 270

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 32/132 (24%), Positives = 62/132 (46%), Gaps = 8/132 (6%)

Query: 96  GVGGGGIFVPML-NLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           G+GGG + VP+L  L   F  +  AA S  +V G ++  ++Y  +Q+        I++  
Sbjct: 28  GIGGGVLMVPILYTLFPDFPLQMVAATSLTIVIGSSLINLIYFYKQKIQ------INFKA 81

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
            LL+   +++G+ +G   +    D +I ++ I +  +L+ R      +  K +   + E 
Sbjct: 82  MLLWSVGMIIGVQLGFEASFYLPDTLIVIVFISVLTILAIRTIFHS-KIVKTDQAERHEN 140

Query: 215 AKQLELIVLGNG 226
            K   L + G G
Sbjct: 141 LKGAGLCLFGGG 152


>gi|418326457|ref|ZP_12937641.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
 gi|420162808|ref|ZP_14669563.1| hypothetical protein HMPREF9995_02315 [Staphylococcus epidermidis
           NIHLM095]
 gi|420167251|ref|ZP_14673912.1| hypothetical protein HMPREF9993_01067 [Staphylococcus epidermidis
           NIHLM087]
 gi|420185285|ref|ZP_14691380.1| hypothetical protein HMPREF9986_06026 [Staphylococcus epidermidis
           NIHLM040]
 gi|365225378|gb|EHM66622.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
 gi|394235805|gb|EJD81355.1| hypothetical protein HMPREF9995_02315 [Staphylococcus epidermidis
           NIHLM095]
 gi|394238880|gb|EJD84337.1| hypothetical protein HMPREF9993_01067 [Staphylococcus epidermidis
           NIHLM087]
 gi|394255019|gb|EJD99979.1| hypothetical protein HMPREF9986_06026 [Staphylococcus epidermidis
           NIHLM040]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +R S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEY 396
           FSS MS + +
Sbjct: 208 FSSVMSSIGH 217


>gi|333987091|ref|YP_004519698.1| hypothetical protein MSWAN_0870 [Methanobacterium sp. SWAN-1]
 gi|333825235|gb|AEG17897.1| protein of unknown function DUF81 [Methanobacterium sp. SWAN-1]
          Length = 265

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 50/107 (46%), Gaps = 4/107 (3%)

Query: 326 KGDEG----TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 381
           KGDE     T     + V     G+ +G++ G+ G+ G   +      LG+P  +   T 
Sbjct: 132 KGDESKKEFTTLTGPKRVIASFFGVASGLMAGVFGISGTPPVTAGLYSLGLPAIMVVGTT 191

Query: 382 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           +F + F+S   +  Y+LL R  +   +     + + AF+G  +LKK+
Sbjct: 192 VFVLIFNSIAGIGGYFLLGRLDLTLIILLGGGAAVGAFIGPKLLKKI 238


>gi|15602461|ref|NP_245533.1| hypothetical protein PM0596 [Pasteurella multocida subsp. multocida
           str. Pm70]
 gi|378774303|ref|YP_005176546.1| transmembrane protein [Pasteurella multocida 36950]
 gi|383310238|ref|YP_005363048.1| hypothetical protein PMCN06_0625 [Pasteurella multocida subsp.
           multocida str. HN06]
 gi|386834283|ref|YP_006239598.1| hypothetical protein [Pasteurella multocida subsp. multocida str.
           3480]
 gi|421263329|ref|ZP_15714383.1| hypothetical protein KCU_03134 [Pasteurella multocida subsp.
           multocida str. P52VAC]
 gi|425065382|ref|ZP_18468502.1| putative permease [Pasteurella multocida subsp. gallicida P1059]
 gi|12720866|gb|AAK02680.1| unknown [Pasteurella multocida subsp. multocida str. Pm70]
 gi|356596851|gb|AET15577.1| transmembrane protein [Pasteurella multocida 36950]
 gi|380871510|gb|AFF23877.1| hypothetical protein PMCN06_0625 [Pasteurella multocida subsp.
           multocida str. HN06]
 gi|385200984|gb|AFI45839.1| protein of unknown function, putative [Pasteurella multocida subsp.
           multocida str. 3480]
 gi|401689722|gb|EJS85109.1| hypothetical protein KCU_03134 [Pasteurella multocida subsp.
           multocida str. P52VAC]
 gi|404383758|gb|EJZ80203.1| putative permease [Pasteurella multocida subsp. gallicida P1059]
          Length = 266

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 32/132 (24%), Positives = 62/132 (46%), Gaps = 8/132 (6%)

Query: 96  GVGGGGIFVPML-NLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           G+GGG + VP+L  L   F  +  AA S  +V G ++  ++Y  +Q+        I++  
Sbjct: 24  GIGGGVLMVPILYTLFPDFPLQMVAATSLTIVIGSSLINLIYFYKQKIQ------INFKA 77

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
            LL+   +++G+ +G   +    D +I ++ I +  +L+ R      +  K +   + E 
Sbjct: 78  MLLWSVGMIIGVQLGFEASFYLPDTLIVIVFISVLTILAIRTIFHS-KIVKTDQAERHEN 136

Query: 215 AKQLELIVLGNG 226
            K   L + G G
Sbjct: 137 LKGAGLCLFGGG 148


>gi|418245013|ref|ZP_12871425.1| hypothetical protein KIQ_05893 [Corynebacterium glutamicum ATCC
           14067]
 gi|354511028|gb|EHE83945.1| hypothetical protein KIQ_05893 [Corynebacterium glutamicum ATCC
           14067]
          Length = 248

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 8/143 (5%)

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 370
            + A+ +GR++      + + WR          G+  G V GL+G GGGF++ P L L  
Sbjct: 112 TSTAMLRGRKE-KRGASKSSLWR------VLVDGLVVGAVTGLVGAGGGFLVVPALALLG 164

Query: 371 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           G+   V+  T+L  IT  S   +  Y    +      L   A +I+ + VG  +  ++ +
Sbjct: 165 GLSMPVAVGTSLVVITMKSFAGLAGYLTSVQLDWGLVLMVTAAAIVGSLVGSRLAGRMPE 224

Query: 431 ILGRASIIIFTLSFIIFVSALSL 453
            L R    +F L   +FV +L L
Sbjct: 225 TLLRKGFGVFVLVMGVFVLSLEL 247


>gi|152996364|ref|YP_001341199.1| hypothetical protein Mmwyl1_2342 [Marinomonas sp. MWYL1]
 gi|150837288|gb|ABR71264.1| protein of unknown function DUF81 [Marinomonas sp. MWYL1]
          Length = 253

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 4/117 (3%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
             ++Y + G   G+  G+ G+GGG ++ PL L  G PP ++  T L     + S  VV +
Sbjct: 2   DFLWYISAGAAVGLAVGITGVGGGSLMTPLLLMFGFPPHIAIGTDLMYAGIAKSTGVVMH 61

Query: 397 YLLKRFPVPYALYF--FALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
              KR  V + + F   A SI A+ +    L +  K      I+  TL  ++ ++A+
Sbjct: 62  A--KRGNVNWKIVFAMAAGSIPASLITVWALSQFEKPDHYQEILTSTLGMMLIITAM 116


>gi|418626680|ref|ZP_13189277.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
 gi|374831454|gb|EHR95194.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 329 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 386
           EG  +R S   +F     +  G++ GL G+GGG ++ PL L     PP V+  T++  I 
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207

Query: 387 FSSSMSVVEY 396
           FSS MS + +
Sbjct: 208 FSSVMSSIGH 217


>gi|224370873|ref|YP_002605037.1| hypothetical protein HRM2_38150 [Desulfobacterium autotrophicum
           HRM2]
 gi|223693590|gb|ACN16873.1| conserved hypothetical protein [Desulfobacterium autotrophicum
           HRM2]
          Length = 424

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 26/108 (24%), Positives = 54/108 (50%), Gaps = 10/108 (9%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMS 392
           R+   +F    G+  G+  G++G+GGGF+  P+F+  +G+    +  T +F I F++  +
Sbjct: 247 RSISWLFLVMSGVLVGLAAGIMGVGGGFLTFPIFVYVMGVSSMTTVGTDIFQIIFTAGYA 306

Query: 393 VVEYYLLKRFPVPYALYFFALSI---IAAFVGQHVLKKLIKILGRASI 437
            +  Y +      Y   F+ L+I   + + VG  +   + K++   +I
Sbjct: 307 SITQYAI------YGFIFYTLAIGMLLGSLVGIQIGAMVTKVVSDITI 348



 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 52/108 (48%), Gaps = 6/108 (5%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
            G+ AG++ G +G GGGFI+ P  +  GI   ++  T LF I   + M  V +  L    
Sbjct: 87  VGLLAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKLGNVS 146

Query: 404 VPYALYFFALSIIAAFVG---QHVLKKLIKILGRA---SIIIFTLSFI 445
           VP A  F   +++ A  G      L +L  +L  A   ++ +F L F+
Sbjct: 147 VPLAFVFVIGALMGATGGGLLNRYLYELNPVLSDAFISTVYVFMLGFL 194


>gi|158521364|ref|YP_001529234.1| hypothetical protein Dole_1353 [Desulfococcus oleovorans Hxd3]
 gi|158510190|gb|ABW67157.1| protein of unknown function DUF81 [Desulfococcus oleovorans Hxd3]
          Length = 427

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 73/161 (45%), Gaps = 9/161 (5%)

Query: 297 LNLLQIPVAGGVSAYE---AVALYKGRRKIASKGDEGTK-WRASQLVFYCACGITAGMVG 352
           L LL I + G V A E    +  Y G +   S     TK + AS L+  CA     G++ 
Sbjct: 41  LMLLPILIGGYVFADEIANGIPQYLGGKHAYSPSFFTTKIFIASILIGLCA-----GLIT 95

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 412
           G +G GGGFI+ P  +  GI   ++  T LF I   + M  V +  L    V  A+ F  
Sbjct: 96  GCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVLHRKLGNISVVLAVTFLI 155

Query: 413 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
            SI  A +G  + + L +I    S    T  +++ + AL L
Sbjct: 156 GSIGGATLGGWINRVLYEINPIISDAFITTIYVVMLGALGL 196


>gi|90407709|ref|ZP_01215888.1| hypothetical membrane spanning protein [Psychromonas sp. CNPT3]
 gi|90311178|gb|EAS39284.1| hypothetical membrane spanning protein [Psychromonas sp. CNPT3]
          Length = 274

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 1/93 (1%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           S L    + G+ AG +  ++G GGG +  P  L LG+ P ++  T   A +F S+ +   
Sbjct: 24  SNLFLLASVGLLAGFIDAIVG-GGGLLTVPTLLSLGLSPHLTLGTNKLAASFGSATAAFT 82

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           Y+    F   +  + F  S+I A  G  V+  +
Sbjct: 83  YFKKNLFDPVFWKHSFYSSLIGALAGTLVINAI 115


>gi|407451351|ref|YP_006723075.1| hypothetical protein B739_0573 [Riemerella anatipestifer RA-CH-1]
 gi|403312336|gb|AFR35177.1| hypothetical protein B739_0573 [Riemerella anatipestifer RA-CH-1]
          Length = 264

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 56/110 (50%), Gaps = 2/110 (1%)

Query: 320 RRKIASKGDEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVS 377
           +  + SK +  T +   S+     + G+  G++ GL+G GGGF++ P L + LG+  + +
Sbjct: 126 KMIVGSKTEPSTLEISPSRNTLLISQGLLVGIITGLVGAGGGFLIIPALVMILGLKMKEA 185

Query: 378 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
             T+LF IT SS++  V           + L F  LSI+   +G  + K+
Sbjct: 186 IGTSLFIITLSSTIGFVSSLDKVAIDWHFLLSFTGLSIVGVLLGLALSKR 235


>gi|389807654|ref|ZP_10204191.1| Sulfate transporter, CysZ-type [Rhodanobacter thiooxydans LCS2]
 gi|388443779|gb|EIL99914.1| Sulfate transporter, CysZ-type [Rhodanobacter thiooxydans LCS2]
          Length = 250

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 60/116 (51%), Gaps = 6/116 (5%)

Query: 336 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 395
           S+ + + A G  A +V G LG+  G +   + L LG+PP V+SA+  +A TF+   S + 
Sbjct: 5   SEFLTFAAVGALAQLVDGALGMAYGVVSAAMLLGLGLPPAVASASVHYAETFTCGASGLS 64

Query: 396 YYLLKRFPVPYALYFFALSI---IAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 448
           +  L    V   L F+ L+I   I A +G +V+  +     R ++ I+ L   +F+
Sbjct: 65  H--LAAGNVRRQL-FWTLAIPGAIGAVIGAYVVIHVPADGMRLALTIYMLGMGVFL 117


>gi|445060181|ref|YP_007385585.1| sulfite exporter TauE/SafE-like protein [Staphylococcus warneri
           SG1]
 gi|443426238|gb|AGC91141.1| sulfite exporter TauE/SafE-like protein [Staphylococcus warneri
           SG1]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 4/99 (4%)

Query: 329 EGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAI 385
           EG  +  S   LV + A  +  G++ GL G+GGG ++ PL L     PP V+  T++  I
Sbjct: 148 EGKTYHYSVPPLVAFVAT-LLIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 386 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
            FSS MS + + +       Y++     S I A +G  V
Sbjct: 207 FFSSVMSSIGHIIQGHVAWDYSIVLIISSYIGAKIGVKV 245


>gi|120599392|ref|YP_963966.1| hypothetical protein Sputw3181_2588 [Shewanella sp. W3-18-1]
 gi|146292612|ref|YP_001183036.1| hypothetical protein Sputcn32_1512 [Shewanella putrefaciens CN-32]
 gi|386313288|ref|YP_006009453.1| hypothetical protein [Shewanella putrefaciens 200]
 gi|120559485|gb|ABM25412.1| protein of unknown function DUF81 [Shewanella sp. W3-18-1]
 gi|145564302|gb|ABP75237.1| protein of unknown function DUF81 [Shewanella putrefaciens CN-32]
 gi|319425913|gb|ADV53987.1| protein of unknown function DUF81 [Shewanella putrefaciens 200]
          Length = 257

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 5/77 (6%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG +  ++G GGG +  P  L LGI P  +  T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAIVG-GGGLLSIPALLTLGIAPHTALGTNKLAASFGSSMAAWTYYRQHLF-T 74

Query: 405 PYALYFFALSIIAAFVG 421
           P    F+ ++ IA F+G
Sbjct: 75  P---SFWYMAFIATFIG 88


>gi|410457114|ref|ZP_11310951.1| hypothetical protein BABA_24600 [Bacillus bataviensis LMG 21833]
 gi|409926453|gb|EKN63624.1| hypothetical protein BABA_24600 [Bacillus bataviensis LMG 21833]
          Length = 273

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 52/113 (46%), Gaps = 11/113 (9%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEY 396
            G+ A  +G L+G+GGG I+ P  L L        + PQV   T+LF + F+   S + Y
Sbjct: 9   VGLVASSIGSLIGMGGGIIVVPALLYLATLSTFSHLTPQVVVGTSLFTMIFTGLSSTLSY 68

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 449
             LK       L F   S   + +G  V +K    L   S  I+   FI+FVS
Sbjct: 69  MKLKTVDYKSGLIFLIGSGPGSILGAWVTEK----LNLHSFNIYFGIFILFVS 117


>gi|387772993|ref|ZP_10128593.1| sulfite exporter TauE/SafE [Haemophilus parahaemolyticus HK385]
 gi|386905753|gb|EIJ70511.1| sulfite exporter TauE/SafE [Haemophilus parahaemolyticus HK385]
          Length = 264

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 61/130 (46%), Gaps = 8/130 (6%)

Query: 96  GVGGGGIFVPML-NLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           G+GGG + VP+L  L   F  +  AA S  +V G ++  +VY  RQ+        I+Y  
Sbjct: 24  GIGGGVLMVPILYTLFPEFPLQMVAATSLTIVMGSSMINLVYFYRQKVE------INYKA 77

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
            L++   +++G+ +G   +    D +I  + ++    L+ R  +   +  + ++    E 
Sbjct: 78  MLIWSVGMMIGVQLGFEVSFYVPDIVIISVFVITLSALAIRTLMSK-QAVENDSQNANET 136

Query: 215 AKQLELIVLG 224
           AK + L   G
Sbjct: 137 AKGIRLCTFG 146


>gi|163761304|ref|ZP_02168379.1| hypothetical protein HPDFL43_21639 [Hoeflea phototrophica DFL-43]
 gi|162281461|gb|EDQ31757.1| hypothetical protein HPDFL43_21639 [Hoeflea phototrophica DFL-43]
          Length = 308

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPAVAVATGANQVVASSISGALAHFRRGS 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             +   L      +  A VG  +   L++ LG+  +II +L +++F+   S+GG+ L + 
Sbjct: 78  LDIKLGLVLLIGGLGGATVGIQIF-TLLRQLGQLDLII-SLLYVVFLG--SVGGLMLWES 133

Query: 462 IKRIEH 467
           +  +  
Sbjct: 134 VNSMRR 139


>gi|219851806|ref|YP_002466238.1| hypothetical protein Mpal_1173 [Methanosphaerula palustris E1-9c]
 gi|219546065|gb|ACL16515.1| protein of unknown function DUF81 [Methanosphaerula palustris
           E1-9c]
          Length = 267

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 3/92 (3%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            LV + A G   G+V G+ G  GG +  P  + LG+P  ++ AT+L  I  +S      +
Sbjct: 152 HLVIWGAMG---GLVSGITGTSGGALFVPALVVLGVPIHLAVATSLLTIIPTSITGAATH 208

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
             L    +PY + + A +++ AF G  +  ++
Sbjct: 209 IALGNISLPYVVVYGAGAVLGAFAGASLAPRI 240


>gi|298208928|ref|YP_003717107.1| hypothetical protein CA2559_11828 [Croceibacter atlanticus
           HTCC2559]
 gi|83848855|gb|EAP86724.1| membrane protein, putative [Croceibacter atlanticus HTCC2559]
          Length = 266

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 53/110 (48%), Gaps = 5/110 (4%)

Query: 313 AVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELG 371
           +V++ + +RK   +  E T           A GI  G + G++G GGGF++ P L L   
Sbjct: 127 SVSMIRNKRKETDEEAEIT----YNYPLIIAEGIIVGAITGIVGAGGGFLIIPALVLLAK 182

Query: 372 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           +P + + AT+LF I   S +  +          P+ L F  LSII  F+G
Sbjct: 183 LPMKKAVATSLFIIAIKSLIGFLGDVQNLDIDWPFLLIFTGLSIIGIFIG 232


>gi|241202991|ref|YP_002974087.1| hypothetical protein Rleg_0237 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
 gi|424879924|ref|ZP_18303556.1| putative permease [Rhizobium leguminosarum bv. trifolii WU95]
 gi|240856881|gb|ACS54548.1| protein of unknown function DUF81 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
 gi|392516287|gb|EIW41019.1| putative permease [Rhizobium leguminosarum bv. trifolii WU95]
          Length = 307

 Score = 39.3 bits (90), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGS 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V          +  A VG  +   L++ +G+  +II +L +++F+   ++GG+ L + 
Sbjct: 78  LDVKLGTVLLVGGLTGATVGIWIF-SLLRAIGQLDLII-SLMYVVFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           I  +  
Sbjct: 134 INAMRR 139


>gi|327398711|ref|YP_004339580.1| hypothetical protein Hipma_0550 [Hippea maritima DSM 10411]
 gi|327181340|gb|AEA33521.1| protein of unknown function DUF81 [Hippea maritima DSM 10411]
          Length = 259

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 30/47 (63%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 390
            G+  G V G+LG+GGGF+  P+ + +G     ++AT  + +TFSS+
Sbjct: 144 SGLLIGFVAGMLGIGGGFLAAPILMSMGYGAIRAAATTAYIVTFSSA 190


>gi|336311114|ref|ZP_08566081.1| hypothetical protein SOHN41_01564 [Shewanella sp. HN-41]
 gi|335865330|gb|EGM70356.1| hypothetical protein SOHN41_01564 [Shewanella sp. HN-41]
          Length = 257

 Score = 38.9 bits (89), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG +  ++G GGG +  P  L LGI P     T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAAWTYYRQHLFKP 75

Query: 405 PYALYFFALSIIAAFVGQHVLKKLIKIL 432
                F+ ++ IA FVG  +   L+ +L
Sbjct: 76  A----FWYIAFIATFVGAALGSLLVYLL 99


>gi|389578235|ref|ZP_10168262.1| putative permease [Desulfobacter postgatei 2ac9]
 gi|389399870|gb|EIM62092.1| putative permease [Desulfobacter postgatei 2ac9]
          Length = 427

 Score = 38.9 bits (89), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 51/109 (46%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
             G+ AG++ G +G GGGFI+ P  + +G+   ++  T LF I   + M  V +  L   
Sbjct: 86  GVGLCAGLITGCIGAGGGFIIAPALMSVGVKGILAVGTDLFHIFAKAIMGSVLHKKLGNI 145

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
            V  AL F   SI  A +G  + + L       S    T  ++I + AL
Sbjct: 146 SVSIALIFLIGSIGGATLGGWINRVLYDFNPVISDAFITTIYVIMLGAL 194


>gi|374995932|ref|YP_004971431.1| permease [Desulfosporosinus orientis DSM 765]
 gi|357214298|gb|AET68916.1| putative permease [Desulfosporosinus orientis DSM 765]
          Length = 257

 Score = 38.9 bits (89), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 61/125 (48%), Gaps = 21/125 (16%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFA---ITFSSSMSV 393
            LV  C+ G  A MV  + G GGG I  P  L +GIPP ++  T  FA    +F+SS++ 
Sbjct: 11  NLVIICSLGFLAAMVDAIAG-GGGLISLPALLMVGIPPHLALGTNKFAASVASFNSSLTF 69

Query: 394 VE----YYLLKRFPVPYALYFFALSIIAAFVGQ-HVLKKLIKILGRASIIIFTLSFIIFV 448
                 ++ L ++ +P+ L       + AF+G   VLK   + L +A      L  I+FV
Sbjct: 70  ARSGKVHFPLVKWQIPFTL-------LGAFLGAWAVLKVSSEFLNKA-----VLFLILFV 117

Query: 449 SALSL 453
              +L
Sbjct: 118 GVYTL 122


>gi|421587991|ref|ZP_16033328.1| hypothetical protein RCCGEPOP_04960 [Rhizobium sp. Pop5]
 gi|403707384|gb|EJZ22401.1| hypothetical protein RCCGEPOP_04960 [Rhizobium sp. Pop5]
          Length = 307

 Score = 38.9 bits (89), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V          +  A VG  +   L++ +G+  +II +L ++IF+   ++GG+ L + 
Sbjct: 78  LDVKLGTVLLVGGLTGATVGIWIF-SLLRRVGQLDLII-SLMYVIFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           I  +  
Sbjct: 134 INAMRR 139


>gi|424888735|ref|ZP_18312338.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM2012]
 gi|393174284|gb|EJC74328.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM2012]
          Length = 307

 Score = 38.9 bits (89), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V          +  A VG  +   L++ +G+  +II +L ++IF+   ++GG+ L + 
Sbjct: 78  LDVKLGTVLLIGGLTGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           I  +  
Sbjct: 134 INAMRR 139


>gi|427430367|ref|ZP_18920221.1| membrane protein, putative [Caenispirillum salinarum AK4]
 gi|425878827|gb|EKV27538.1| membrane protein, putative [Caenispirillum salinarum AK4]
          Length = 307

 Score = 38.9 bits (89), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 31/49 (63%)

Query: 349 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           G++ G+ G+GGGF+L PL + LGIPP V+  T    +  +S   V+ ++
Sbjct: 25  GVLSGMFGVGGGFLLTPLLIFLGIPPTVAVGTGANQVVGASVSGVLAHW 73


>gi|374703196|ref|ZP_09710066.1| hypothetical protein PseS9_07346 [Pseudomonas sp. S9]
          Length = 260

 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 44/78 (56%), Gaps = 10/78 (12%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPL----FLELGIPPQV----SSATALFAITFS 388
           +L+ Y   G  AG++ GL G+GGG I+ P+    F   G  P+V    +  T+L  I F+
Sbjct: 2   ELLLYLVLGAAAGVMAGLFGVGGGLIIVPVLVASFAAHGFAPEVLTHLAVGTSLATIIFT 61

Query: 389 SSMSVVEYYLLK--RFPV 404
           S  SV+E++ L   R+P+
Sbjct: 62  SINSVLEHHRLGAVRWPL 79


>gi|452851399|ref|YP_007493083.1| conserved membrane protein of unknown function [Desulfovibrio
           piezophilus]
 gi|451895053|emb|CCH47932.1| conserved membrane protein of unknown function [Desulfovibrio
           piezophilus]
          Length = 420

 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 340 FYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYL 398
           F  ACG+  G +  ++G+GGGF+  P+F+  LG+    +  T +  I F++  + +  Y 
Sbjct: 253 FVAACGLIVGFMAAIMGVGGGFLTFPMFVYVLGVSSFTTVGTDILQIIFTAGFAAISQYA 312

Query: 399 LKRF 402
           +  F
Sbjct: 313 IYGF 316


>gi|163868912|ref|YP_001610139.1| hypothetical protein Btr_1874 [Bartonella tribocorum CIP 105476]
 gi|161018586|emb|CAK02144.1| putative membrane protein [Bartonella tribocorum CIP 105476]
          Length = 308

 Score = 38.9 bits (89), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 394
           G+  G++  ++G+GGGF + P L   L +P  V   T+LF ITF SS + V
Sbjct: 184 GLIVGLLSSIMGIGGGFFMIPALIYLLRVPTNVVIGTSLFQITFVSSFTTV 234


>gi|332296379|ref|YP_004438302.1| hypothetical protein Thena_1559 [Thermodesulfobium narugense DSM
           14796]
 gi|332179482|gb|AEE15171.1| protein of unknown function DUF81 [Thermodesulfobium narugense DSM
           14796]
          Length = 433

 Score = 38.9 bits (89), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 7/73 (9%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           CG T G V  ++G+GGGF+  PLF+  LG+    +  T +  I F+++ S +  Y +   
Sbjct: 259 CGFTVGFVAAIMGVGGGFLTFPLFVYGLGVSTFTTVGTDILQIIFTTAYSSIFNYAI--- 315

Query: 403 PVPYALYFFALSI 415
              Y   F++++I
Sbjct: 316 ---YGYIFYSIAI 325


>gi|145479593|ref|XP_001425819.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392891|emb|CAK58421.1| unnamed protein product [Paramecium tetraurelia]
          Length = 487

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 21/76 (27%), Positives = 42/76 (55%), Gaps = 5/76 (6%)

Query: 96  GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDL 154
            +GGG +  P L  I+ F++ +S  ++ C++ G  +   V    Q HP T + P++++++
Sbjct: 79  ALGGGVVKRPFLEAILNFNSSTSGDITACLMFGAQLVNQVIIFFQSHPDTPERPLVNFEI 138

Query: 155 ALLFQPLLVLGISIGV 170
            L++     LGI I +
Sbjct: 139 GLVY----ALGIPISM 150


>gi|402828737|ref|ZP_10877622.1| sulfite exporter TauE/SafE [Slackia sp. CM382]
 gi|402285895|gb|EJU34375.1| sulfite exporter TauE/SafE [Slackia sp. CM382]
          Length = 300

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 40/62 (64%), Gaps = 2/62 (3%)

Query: 345 GITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 403
           G+  G++ G+LG+GGG ++ PLF L  G+ P  ++AT+LF+I   +S++ +  +L     
Sbjct: 11  GVAIGILSGMLGIGGGTVMVPLFRLAFGLDPLAATATSLFSI-IPTSLAGLSKHLRNGTC 69

Query: 404 VP 405
           +P
Sbjct: 70  IP 71


>gi|385232877|ref|YP_005794219.1| permease protein [Ketogulonicigenium vulgare WSH-001]
 gi|343461788|gb|AEM40223.1| Permease protein [Ketogulonicigenium vulgare WSH-001]
          Length = 309

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 2/109 (1%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
             G   G++ G+ G+GGGF++ PL +  GIP  V+ ATA   I  SS   +  +   K  
Sbjct: 24  GLGWIVGILSGMFGVGGGFLITPLLIFSGIPSAVAVATATNQIVASSVSGLFAHLRRKNV 83

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
            +          ++ A +G  +   + + +G+  ++I TLS++I +  +
Sbjct: 84  DLRMGTVLMIGGLVGAAIGVGIFNWM-RRMGQVDLMI-TLSYVILLGTI 130


>gi|157374704|ref|YP_001473304.1| hypothetical protein Ssed_1565 [Shewanella sediminis HAW-EB3]
 gi|157317078|gb|ABV36176.1| protein of unknown function DUF81 [Shewanella sediminis HAW-EB3]
          Length = 257

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 5/77 (6%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           G+ AG +  + G GGG +  P  L +GIPP ++  T   A  F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVAG-GGGLLSIPALLTVGIPPHLALGTNKLAACFGSSMAAFTYYRKNLF-T 74

Query: 405 PYALYFFALSIIAAFVG 421
           P   Y    + IA F+G
Sbjct: 75  PNLWYH---TFIATFIG 88


>gi|219669184|ref|YP_002459619.1| hypothetical protein Dhaf_3160 [Desulfitobacterium hafniense DCB-2]
 gi|219539444|gb|ACL21183.1| protein of unknown function DUF81 [Desulfitobacterium hafniense
           DCB-2]
          Length = 443

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 37/67 (55%), Gaps = 3/67 (4%)

Query: 321 RKIASKGDEGTKWRASQLVFY--CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS 378
             I  K   G  + ++++ FY   A G+ AG+V G +G GGGF++ P  + LG+   +S 
Sbjct: 57  EAIGGKAAYGPSYFSAKM-FYGSIAIGVVAGLVTGCIGAGGGFLITPALMSLGVKGILSV 115

Query: 379 ATALFAI 385
            T +F I
Sbjct: 116 GTDMFHI 122


>gi|86751708|ref|YP_488204.1| hypothetical protein RPB_4610 [Rhodopseudomonas palustris HaA2]
 gi|86574736|gb|ABD09293.1| Protein of unknown function DUF81 [Rhodopseudomonas palustris HaA2]
          Length = 307

 Score = 38.9 bits (89), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 3/59 (5%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 392
            ++   A G   G V G+ G+GGGF+L PL + +GI P V+ A+    + A +FS ++S
Sbjct: 13  NVLLILAMGAAVGFVSGMFGVGGGFLLTPLLIFVGIAPAVAVASVTSHMAASSFSGALS 71


>gi|298527660|ref|ZP_07015064.1| protein of unknown function DUF81 [Desulfonatronospira
           thiodismutans ASO3-1]
 gi|298511312|gb|EFI35214.1| protein of unknown function DUF81 [Desulfonatronospira
           thiodismutans ASO3-1]
          Length = 254

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 323 IASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATA 381
           +A +G     W  ++++     G+  G + G+LG+GGG  + PL +  L + P+ ++AT 
Sbjct: 120 VAGQGTRAHVWSLARVLQASFLGLVIGFMAGMLGIGGGVFVVPLLIYMLQVNPRTAAATT 179

Query: 382 LFAITFSS 389
            F + FSS
Sbjct: 180 AFIVCFSS 187


>gi|126734854|ref|ZP_01750600.1| integral membrane protein, putative [Roseobacter sp. CCS2]
 gi|126715409|gb|EBA12274.1| integral membrane protein, putative [Roseobacter sp. CCS2]
          Length = 306

 Score = 38.9 bits (89), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ +  LKR
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 75


>gi|91975280|ref|YP_567939.1| hypothetical protein RPD_0800 [Rhodopseudomonas palustris BisB5]
 gi|91681736|gb|ABE38038.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
           BisB5]
          Length = 307

 Score = 38.9 bits (89), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 3/59 (5%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 392
            ++   A G   G V G+ G+GGGF++ PL + +GI P V+ A+    + A +FS ++S
Sbjct: 13  NVLLILAMGAAVGFVSGMFGIGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALS 71


>gi|423392833|ref|ZP_17370059.1| hypothetical protein ICG_04681 [Bacillus cereus BAG1X1-3]
 gi|423421123|ref|ZP_17398212.1| hypothetical protein IE3_04595 [Bacillus cereus BAG3X2-1]
 gi|401099674|gb|EJQ07676.1| hypothetical protein IE3_04595 [Bacillus cereus BAG3X2-1]
 gi|401632812|gb|EJS50595.1| hypothetical protein ICG_04681 [Bacillus cereus BAG1X1-3]
          Length = 252

 Score = 38.9 bits (89), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 8/74 (10%)

Query: 321 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSA 379
           +K  S+     KW AS L F        G V G++G GG F+L P+ L  L +P + + A
Sbjct: 125 KKNNSRAVNYNKWLASMLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIA 178

Query: 380 TALFAITFSSSMSV 393
           T++ AITF SS+ +
Sbjct: 179 TSI-AITFISSVGI 191


>gi|39933890|ref|NP_946166.1| hypothetical protein RPA0813 [Rhodopseudomonas palustris CGA009]
 gi|192289309|ref|YP_001989914.1| hypothetical protein Rpal_0881 [Rhodopseudomonas palustris TIE-1]
 gi|39647737|emb|CAE26257.1| possible permease [Rhodopseudomonas palustris CGA009]
 gi|192283058|gb|ACE99438.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
           TIE-1]
          Length = 320

 Score = 38.9 bits (89), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 3/59 (5%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 392
            ++   A G   G V G+ G+GGGF++ PL + +GI P V+ A+    + A +FS ++S
Sbjct: 26  NVLLVLAMGAAVGFVSGMFGVGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALS 84


>gi|339504564|ref|YP_004691984.1| hypothetical protein RLO149_c030650 [Roseobacter litoralis Och 149]
 gi|338758557|gb|AEI95021.1| hypothetical protein DUF1925 [Roseobacter litoralis Och 149]
          Length = 312

 Score = 38.9 bits (89), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ +  LKR
Sbjct: 37  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 83


>gi|212640166|ref|YP_002316686.1| permease [Anoxybacillus flavithermus WK1]
 gi|212561646|gb|ACJ34701.1| Predicted permease [Anoxybacillus flavithermus WK1]
          Length = 258

 Score = 38.9 bits (89), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 58/113 (51%), Gaps = 9/113 (7%)

Query: 318 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQV 376
           KG   I        KW A+ L F    G++AG+VG      G FIL P+ L  L IP ++
Sbjct: 128 KGIDNIPLDQVTFNKWLAAALAFIV--GVSAGIVGA----AGAFILVPIMLVVLKIPTRM 181

Query: 377 SSATALFAITFSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 428
           + AT+L AITF SS+ + V      +F    A      S+IA+ +G +V KK+
Sbjct: 182 TIATSL-AITFISSIGATVGKITTGQFDFWTAFIMVVASLIASPIGANVGKKV 233


>gi|375111110|ref|ZP_09757321.1| hypothetical protein AJE_14130 [Alishewanella jeotgali KCTC 22429]
 gi|374568652|gb|EHR39824.1| hypothetical protein AJE_14130 [Alishewanella jeotgali KCTC 22429]
          Length = 262

 Score = 38.9 bits (89), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 1/87 (1%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           C     AG +  + G GGG +  P  L  G+PP V   T   A TF S  + + YY  + 
Sbjct: 16  CGVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLTYYRKQL 74

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKL 428
           F   Y       + I A +G  V+ +L
Sbjct: 75  FNPHYWRNSLVFTAIGALLGTLVVDQL 101


>gi|365879405|ref|ZP_09418829.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 375]
 gi|365292656|emb|CCD91360.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 375]
          Length = 308

 Score = 38.5 bits (88), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 8/126 (6%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
            A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+  + 
Sbjct: 18  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV----G 457
                A        +   +G      L++ LG+  ++I  LS+++ ++  S+GGV    G
Sbjct: 78  IDPSLASVLMIGGALGTALGVSTF-TLLRSLGQLDLMI-ALSYVVLLT--SVGGVMFWEG 133

Query: 458 LAKMIK 463
           L  M++
Sbjct: 134 LRAMMR 139


>gi|229149118|ref|ZP_04277359.1| hypothetical protein bcere0011_6830 [Bacillus cereus m1550]
 gi|228634317|gb|EEK90905.1| hypothetical protein bcere0011_6830 [Bacillus cereus m1550]
          Length = 235

 Score = 38.5 bits (88), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 9/110 (8%)

Query: 321 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSA 379
           +K  S   +  KW AS L F        G V G++G GG F+L P+ L  L +P +++ A
Sbjct: 106 KKNNSGAVKYNKWLASVLAFIV------GGVSGIIGAGGAFLLVPIMLVILKLPLRMTIA 159

Query: 380 TALFAITFSSSMSVVEYYLLK-RFPVPYALYFFALSIIAAFVGQHVLKKL 428
           T++ AITF SS+ +    ++  +  +  AL     SI AA VG  V K++
Sbjct: 160 TSI-AITFISSVGITTGKVITGQVVIIPALIIAVASIFAAPVGARVGKRI 208


>gi|397169771|ref|ZP_10493201.1| hypothetical protein AEST_09670 [Alishewanella aestuarii B11]
 gi|396088666|gb|EJI86246.1| hypothetical protein AEST_09670 [Alishewanella aestuarii B11]
          Length = 260

 Score = 38.5 bits (88), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 1/87 (1%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           C     AG +  + G GGG +  P  L  G+PP V   T   A TF S  + + YY  + 
Sbjct: 14  CGVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLTYYRKQL 72

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKL 428
           F   Y       + I A +G  V+ +L
Sbjct: 73  FNPHYWRNSLVFTAIGALLGTLVVDQL 99


>gi|310815340|ref|YP_003963304.1| hypothetical protein EIO_0856 [Ketogulonicigenium vulgare Y25]
 gi|308754075|gb|ADO42004.1| integral membrane protein, putative [Ketogulonicigenium vulgare
           Y25]
          Length = 304

 Score = 38.5 bits (88), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 2/109 (1%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
             G   G++ G+ G+GGGF++ PL +  GIP  V+ ATA   I  SS   +  +   K  
Sbjct: 19  GLGWIVGILSGMFGVGGGFLITPLLIFSGIPSAVAVATATNQIVASSVSGLFAHLRRKNV 78

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 451
            +          ++ A +G  +   + + +G+  ++I TLS++I +  +
Sbjct: 79  DLRMGTVLMIGGLVGAAIGVGIFNWM-RRMGQVDLMI-TLSYVILLGTI 125


>gi|223478732|ref|YP_002582987.1| hypothetical protein [Thermococcus sp. AM4]
 gi|214033958|gb|EEB74784.1| conserved hypothetical protein [Thermococcus sp. AM4]
          Length = 243

 Score = 38.5 bits (88), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
            GI +G+V GLLG+ GG +  PLF   +GIP + +  T+ FA+ F++     E+Y L + 
Sbjct: 136 VGILSGLVSGLLGISGGVLNVPLFHTFVGIPMRYAVGTSSFALFFTALAGAFEHYRLGQV 195

Query: 403 PVPYAL 408
            +  AL
Sbjct: 196 DLHMAL 201


>gi|21322817|dbj|BAB97446.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC
           13032]
          Length = 263

 Score = 38.5 bits (88), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 65/144 (45%), Gaps = 10/144 (6%)

Query: 312 EAVALYKGRRKIASKG-DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLE 369
            + A+ +GR++   KG  + + WR          G+  G V GL+G GGGF++ P L L 
Sbjct: 127 TSTAMLRGRKQ--KKGASKSSLWR------VLVDGLVVGAVTGLVGAGGGFLVVPALALL 178

Query: 370 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 429
            G+   V+  T+L  IT  S   +  Y    +      L   A +I+ +  G  +  ++ 
Sbjct: 179 GGLSMPVAVGTSLVVITMKSFAGLAGYLTSVQLDWGLVLMVTAAAIVGSLAGSRLAGRVP 238

Query: 430 KILGRASIIIFTLSFIIFVSALSL 453
           + L R    +F L   +FV  L L
Sbjct: 239 ETLLRKGFGVFVLVMGVFVLGLEL 262


>gi|325959191|ref|YP_004290657.1| hypothetical protein Metbo_1448 [Methanobacterium sp. AL-21]
 gi|325330623|gb|ADZ09685.1| protein of unknown function DUF81 [Methanobacterium sp. AL-21]
          Length = 273

 Score = 38.5 bits (88), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 46/99 (46%)

Query: 329 EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFS 388
           E  K    + +F    G+  G++ G+ GL G   +      LG+P  +   T +F + F+
Sbjct: 148 ETLKLTGPKRIFASLFGVAGGVLAGVFGLSGTPPISAGLYSLGLPTLMVVGTTVFVLVFN 207

Query: 389 SSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 427
           S   +  Y  L RF V   +     +++ +F+G  +L+K
Sbjct: 208 SLAGIGGYLFLGRFDVVLTVLLGGGAVVGSFLGPKLLEK 246


>gi|395793084|ref|ZP_10472493.1| hypothetical protein MEI_01114 [Bartonella vinsonii subsp.
           arupensis Pm136co]
 gi|423713627|ref|ZP_17687887.1| hypothetical protein ME1_00633 [Bartonella vinsonii subsp.
           arupensis OK-94-513]
 gi|395422254|gb|EJF88462.1| hypothetical protein ME1_00633 [Bartonella vinsonii subsp.
           arupensis OK-94-513]
 gi|395431922|gb|EJF97928.1| hypothetical protein MEI_01114 [Bartonella vinsonii subsp.
           arupensis Pm136co]
          Length = 306

 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 394
           G+  G++  ++G+GGGF + P L   L +P  V   T+LF ITF SS + V
Sbjct: 182 GLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232


>gi|333908013|ref|YP_004481599.1| hypothetical protein Mar181_1641 [Marinomonas posidonica
           IVIA-Po-181]
 gi|333478019|gb|AEF54680.1| protein of unknown function DUF81 [Marinomonas posidonica
           IVIA-Po-181]
          Length = 253

 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
           + ++Y   G   G+  G+ G+GGG ++ PL L  G PP ++  T L     + S  VV +
Sbjct: 2   EFIWYIFAGAVVGLAVGITGVGGGSLMTPLLLLFGFPPHIAVGTDLMYAGIAKSTGVVMH 61


>gi|116250358|ref|YP_766196.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841]
 gi|424873568|ref|ZP_18297230.1| putative permease [Rhizobium leguminosarum bv. viciae WSM1455]
 gi|115255006|emb|CAK06080.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae
           3841]
 gi|393169269|gb|EJC69316.1| putative permease [Rhizobium leguminosarum bv. viciae WSM1455]
          Length = 307

 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGS 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V          +  A VG  +   L++ +G+  +II +L ++IF+   ++GG+ L + 
Sbjct: 78  LDVKLGTVLLVGGLAGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           +  +  
Sbjct: 134 VNAMRR 139


>gi|409435971|ref|ZP_11263175.1| conserved membrane hypothetical protein [Rhizobium mesoamericanum
           STM3625]
 gi|408752280|emb|CCM74324.1| conserved membrane hypothetical protein [Rhizobium mesoamericanum
           STM3625]
          Length = 306

 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V          +  A VG  +   L++ +G+  +II +L ++IF+   ++GG+ L + 
Sbjct: 78  LDVKLGTVLLVGGLTGATVGIWIF-SLLRQIGQLDLII-SLLYVIFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           I  +  
Sbjct: 134 INALRR 139


>gi|424898188|ref|ZP_18321762.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM2297]
 gi|393182415|gb|EJC82454.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM2297]
          Length = 307

 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V          +  A VG  +   L++ +G+  +II +L ++IF+   ++GG+ L + 
Sbjct: 78  LDVKLGTVLLIGGLTGATVGIWIF-SLLRSIGQLDLII-SLMYVIFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           I  +  
Sbjct: 134 INAMRR 139


>gi|325964992|ref|YP_004242898.1| permease [Arthrobacter phenanthrenivorans Sphe3]
 gi|323471079|gb|ADX74764.1| putative permease [Arthrobacter phenanthrenivorans Sphe3]
          Length = 299

 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 65/139 (46%), Gaps = 6/139 (4%)

Query: 313 AVALYKGRRKIASKGDEGTKWRASQLVF--YCACGITAGMVGGLLGLGGGFILGP-LFLE 369
           +VA+ +GR++   K D+G      +L        G   G+V GL+G GGGF++ P L L 
Sbjct: 113 SVAMLRGRKR---KNDDGAAPAKHELPLGRVLLDGAVVGLVTGLVGAGGGFLVVPALALL 169

Query: 370 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 429
            G+P  V+  T+L  I   S   +  Y    +      L   A +++ + +G  +  ++ 
Sbjct: 170 GGLPMSVAVGTSLVVIAMKSFAGLAGYLTTVQLDWGVTLGVTAAAVVGSLIGAKLAGRIP 229

Query: 430 KILGRASIIIFTLSFIIFV 448
           + + R +   F L+   FV
Sbjct: 230 EAVLRKAFGWFVLAMGTFV 248


>gi|110678948|ref|YP_681955.1| hypothetical protein RD1_1642 [Roseobacter denitrificans OCh 114]
 gi|109455064|gb|ABG31269.1| integral membrane protein, putative [Roseobacter denitrificans OCh
           114]
          Length = 312

 Score = 38.5 bits (88), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           G+ G+GGGF++ PL   +GIPP V+ AT    I  SS   V+ +  LKR
Sbjct: 37  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 83


>gi|395785666|ref|ZP_10465394.1| hypothetical protein ME5_00712 [Bartonella tamiae Th239]
 gi|423717442|ref|ZP_17691632.1| hypothetical protein MEG_01172 [Bartonella tamiae Th307]
 gi|395424124|gb|EJF90311.1| hypothetical protein ME5_00712 [Bartonella tamiae Th239]
 gi|395427657|gb|EJF93748.1| hypothetical protein MEG_01172 [Bartonella tamiae Th307]
          Length = 308

 Score = 38.5 bits (88), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 21/123 (17%)

Query: 293 LYWVLNLLQIPVAGGVSAYEAVALYK-------GRR-KIASKGDEG--------TKWRAS 336
           L  +++LL + + GGV    AV  ++       GR+  I   G            ++RAS
Sbjct: 112 LDLIISLLYVILLGGVGGLMAVESWRAIMRTRSGRQVNIRRPGQHNWIHRLPLKLRFRAS 171

Query: 337 QL----VFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSM 391
            +    +   A G   G++  ++G+GGGFI+ P L   L +P  V   T+LF ITF ++ 
Sbjct: 172 MIYVSIIPVLAIGFIIGLLSSIMGVGGGFIMVPALIYLLRVPTNVVVGTSLFQITFVTAF 231

Query: 392 SVV 394
           + V
Sbjct: 232 TTV 234


>gi|115522742|ref|YP_779653.1| hypothetical protein RPE_0716 [Rhodopseudomonas palustris BisA53]
 gi|115516689|gb|ABJ04673.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
           BisA53]
          Length = 308

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 3/59 (5%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 392
            ++   A G   G V G+ G+GGGF++ PL + +GI P V+ A+    + A +FS ++S
Sbjct: 13  NVLLILAMGAAVGFVSGMFGIGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALS 71


>gi|269216459|ref|ZP_06160313.1| hypothetical membrane protein, conserved, DUF81 family [Slackia
           exigua ATCC 700122]
 gi|269129988|gb|EEZ61070.1| hypothetical membrane protein, conserved, DUF81 family [Slackia
           exigua ATCC 700122]
          Length = 297

 Score = 38.5 bits (88), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 33/46 (71%), Gaps = 1/46 (2%)

Query: 345 GITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSS 389
           G+  G++ G+LG+GGG ++ PLF L  G+ P  ++AT+LF+I  +S
Sbjct: 11  GVAIGILSGMLGIGGGTVMVPLFRLAFGLDPLAATATSLFSIIPTS 56


>gi|414153246|ref|ZP_11409573.1| conserved membrane hypothetical protein [Desulfotomaculum
           hydrothermale Lam5 = DSM 18033]
 gi|411455628|emb|CCO07476.1| conserved membrane hypothetical protein [Desulfotomaculum
           hydrothermale Lam5 = DSM 18033]
          Length = 339

 Score = 38.5 bits (88), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 346 ITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 404
           I +G+V G LG+GGGFI  P+ +  LG+P  ++  T LFAI  ++S     Y L  +  +
Sbjct: 186 IFSGLVSGFLGVGGGFIRVPILIYALGLPTVMAVGTDLFAILITNSWGAYIYALAGKVEI 245

Query: 405 PYALYFFALSIIAAFVGQHV 424
             AL    + ++ A VG  +
Sbjct: 246 IGAL----VMVVGAAVGAQI 261


>gi|407783749|ref|ZP_11130944.1| hypothetical protein P24_15941 [Oceanibaculum indicum P24]
 gi|407200184|gb|EKE70195.1| hypothetical protein P24_15941 [Oceanibaculum indicum P24]
          Length = 306

 Score = 38.5 bits (88), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 1/103 (0%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 396
            ++     G   G + G+ G+GGGF++ PL + +GIPP V+  T    I  +S   V+  
Sbjct: 13  NVILILGMGWVVGFLSGIFGVGGGFLMTPLLIFIGIPPAVAVGTQANQIVAASVSGVIAQ 72

Query: 397 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 439
           +  K           A  +  + +G  V   L + LG+  +II
Sbjct: 73  WRRKNVDFKMGAVLLAGGLFGSTIGVWVFSWL-QTLGQIDLII 114


>gi|414176339|ref|ZP_11430568.1| hypothetical protein HMPREF9695_04214 [Afipia broomeae ATCC 49717]
 gi|410886492|gb|EKS34304.1| hypothetical protein HMPREF9695_04214 [Afipia broomeae ATCC 49717]
          Length = 308

 Score = 38.5 bits (88), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 33/53 (62%), Gaps = 3/53 (5%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 392
           A G   G V G+ G+GGGF++ PL + +GI P V+ A+    + A +FS ++S
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFVGISPAVAVASVSSHMAASSFSGALS 71


>gi|199597888|ref|ZP_03211313.1| Predicted permease [Lactobacillus rhamnosus HN001]
 gi|258507211|ref|YP_003169962.1| permease [Lactobacillus rhamnosus GG]
 gi|385826935|ref|YP_005864707.1| putative transporter protein [Lactobacillus rhamnosus GG]
 gi|199591145|gb|EDY99226.1| Predicted permease [Lactobacillus rhamnosus HN001]
 gi|257147138|emb|CAR86111.1| Permease [Lactobacillus rhamnosus GG]
 gi|259648580|dbj|BAI40742.1| putative transporter protein [Lactobacillus rhamnosus GG]
          Length = 280

 Score = 38.5 bits (88), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 32/134 (23%), Positives = 70/134 (52%), Gaps = 14/134 (10%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVE 395
           Q++     G+  G++G +LG+GGG I+ P+  + +G+P Q +   ++ ++  +SS + + 
Sbjct: 3   QMLLLMGTGVLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGG 455
           Y   +   +  A++    + + A +G  V+  L+   G+   I+F  + +IF SA +   
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGA-VITGLVN--GKVLYILFG-ALLIF-SAFN--- 114

Query: 456 VGLAKMIKRIEHKE 469
                MI+++  K+
Sbjct: 115 -----MIRKLRMKD 123


>gi|159045387|ref|YP_001534181.1| hypothetical protein Dshi_2847 [Dinoroseobacter shibae DFL 12]
 gi|157913147|gb|ABV94580.1| conserved hypothetical protein [Dinoroseobacter shibae DFL 12]
          Length = 306

 Score = 38.5 bits (88), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 1/49 (2%)

Query: 353 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
           G+ G+GGGF++ PL   +GIPP V+ AT    I  +SS S V  +L +R
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIV-ASSFSGVLAHLRRR 76


>gi|403525486|ref|YP_006660373.1| hypothetical protein ARUE_c04000 [Arthrobacter sp. Rue61a]
 gi|403227913|gb|AFR27335.1| putative integral membrane protein [Arthrobacter sp. Rue61a]
          Length = 299

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 6/139 (4%)

Query: 313 AVALYKGRRKIASKGDEGTKWRASQLVF--YCACGITAGMVGGLLGLGGGFILGP-LFLE 369
           +VA+ +GR+K   K D+G      +L        G   G++ GL+G GGGF++ P L L 
Sbjct: 113 SVAMLRGRKK---KNDDGAAPVKHELPIGRVLLDGAVVGLITGLVGAGGGFLVVPALALL 169

Query: 370 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 429
            G+P  V+  T+L  I   S   +  Y    +      L   A +II + +G  +  ++ 
Sbjct: 170 GGLPMSVAVGTSLVVIAMKSFAGLAGYLTTVQLDWGLTLGVTAAAIIGSLIGSKLAGRIP 229

Query: 430 KILGRASIIIFTLSFIIFV 448
           +   R +   F L+   FV
Sbjct: 230 ESALRKAFGWFVLAMGTFV 248


>gi|71395121|ref|XP_802308.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70861404|gb|EAN80862.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 261

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 2/113 (1%)

Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT--LDIPVIDYDLALLFQ 159
           I VPM  L++G     +  +S+  + G ++  V+  +R+R P      P+I+Y    L  
Sbjct: 111 ILVPMFCLLMGLPMDFAVGLSQSTICGQSILNVLIAIRKRFPCAGCSRPLINYQYLTLLV 170

Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
           PL V+G  IG   N +  D    VLL +L   +  R+  K +  ++++   +R
Sbjct: 171 PLGVIGTLIGGVLNRLCPDLFRLVLLFLLLTAVLYRSVRKMIAQYREDQSERR 223


>gi|448644624|ref|ZP_21679080.1| hypothetical protein C436_20063 [Haloarcula sinaiiensis ATCC 33800]
 gi|445757585|gb|EMA08928.1| hypothetical protein C436_20063 [Haloarcula sinaiiensis ATCC 33800]
          Length = 342

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 349 GMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 407
           G++ G LG+GGGFI  P LF  +G+P  ++  T LF I FS  +    Y L     +   
Sbjct: 191 GLLSGFLGVGGGFIRMPALFYLIGVPVPIAVGTDLFEIVFSGGLGSFLYALDGGVDLSIV 250

Query: 408 LYFFALSIIAAFVGQ 422
           L   A S   A VG 
Sbjct: 251 LPLLAGSAFGARVGS 265


>gi|395789707|ref|ZP_10469217.1| hypothetical protein ME9_00934 [Bartonella taylorii 8TBB]
 gi|395428545|gb|EJF94621.1| hypothetical protein ME9_00934 [Bartonella taylorii 8TBB]
          Length = 306

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 32/51 (62%), Gaps = 1/51 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVV 394
           G+  G++  ++G+GGGF + P  + L  +P +V   T+LF ITF SS + V
Sbjct: 182 GLIVGLLSSIMGIGGGFFMIPALIYLMRVPTRVVIGTSLFQITFVSSFTTV 232


>gi|301064479|ref|ZP_07204889.1| conserved hypothetical protein [delta proteobacterium NaphS2]
 gi|300441417|gb|EFK05772.1| conserved hypothetical protein [delta proteobacterium NaphS2]
          Length = 472

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 56/126 (44%), Gaps = 8/126 (6%)

Query: 75  GWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAV 134
           GW  +VG   G F  A  S  GVGGG +FVP L  + G      A  S   V  G + ++
Sbjct: 343 GWIPIVG---GLFIAALASFLGVGGGFLFVPFLTAVAGLPMFLVAGTSGLAVLVGMIVSI 399

Query: 135 VYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLST 194
              +  +   +   +I  +L  +F     +G  IG       +D  +++L IVL  ++  
Sbjct: 400 FTYMVAKQIPVHWSLIGAELVGIF-----VGAMIGPRTQKYLSDKALSILFIVLAFLVGI 454

Query: 195 RAFLKG 200
           R  LKG
Sbjct: 455 RYTLKG 460


>gi|257465039|ref|ZP_05629410.1| permease [Actinobacillus minor 202]
 gi|257450699|gb|EEV24742.1| permease [Actinobacillus minor 202]
          Length = 266

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 62/125 (49%), Gaps = 11/125 (8%)

Query: 96  GVGGGGIFVPMLN-LIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           G+GGG + VP+L+ L   F  +  AA S   V G A+  ++   +Q+      P++ + +
Sbjct: 24  GIGGGVLMVPILHTLFPEFPLQMVAATSLTTVMGTALINLISFYKQKISVELKPLLLWSV 83

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
            +      ++G+ +G   +  F +++I  + I   LVL+ R+FL      K +  T   A
Sbjct: 84  GM------IVGVQLGFELSFYFPNFLIITIFISTLLVLAWRSFLAK----KGQNQTASTA 133

Query: 215 AKQLE 219
            ++L+
Sbjct: 134 NEKLK 138


>gi|395765751|ref|ZP_10446343.1| hypothetical protein MCO_01219 [Bartonella sp. DB5-6]
 gi|395410946|gb|EJF77488.1| hypothetical protein MCO_01219 [Bartonella sp. DB5-6]
          Length = 306

 Score = 38.5 bits (88), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 1/52 (1%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVV 394
            G+  G++  ++G+GGGF + P  + L  +P +V   T+LF ITF SS + V
Sbjct: 181 IGLIVGLLSSIMGIGGGFFMIPALIYLMRVPTRVVIGTSLFQITFVSSFTTV 232


>gi|389852005|ref|YP_006354239.1| permease [Pyrococcus sp. ST04]
 gi|388249311|gb|AFK22164.1| putative permease [Pyrococcus sp. ST04]
          Length = 254

 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 50/91 (54%), Gaps = 2/91 (2%)

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           PL   LG+P   + AT+ FAI F+S+ S +++YLL    V +        II A +G  +
Sbjct: 161 PLLTWLGMPIHYAVATSSFAIVFTSTSSAIKHYLLGNVEVQWLPLLVPGLIIGAQIGAKI 220

Query: 425 LKKL-IKILGRA-SIIIFTLSFIIFVSALSL 453
            KK   K L  A ++++  L+F + + AL+L
Sbjct: 221 AKKTRAKSLKNAFAVVMVILAFRMILKALNL 251


>gi|398307487|ref|ZP_10511073.1| hypothetical protein BvalD_19187 [Bacillus vallismortis DV1-F-3]
          Length = 256

 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 32/69 (46%), Positives = 41/69 (59%), Gaps = 10/69 (14%)

Query: 326 KGDEGT--KWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATAL 382
           KG+E    KW AS L F        G V G+LG GG FIL P + L L IP +++ A++L
Sbjct: 132 KGEEVIIHKWLASFLAFII------GGVSGILGAGGAFILVPIMLLILKIPVRITVASSL 185

Query: 383 FAITFSSSM 391
            AITF SS+
Sbjct: 186 -AITFLSSI 193


>gi|240850515|ref|YP_002971914.1| hypothetical membrane protein [Bartonella grahamii as4aup]
 gi|240267638|gb|ACS51226.1| hypothetical membrane protein [Bartonella grahamii as4aup]
          Length = 306

 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 394
           G+  G++  ++G+GGGF + P L   L +P  V   T+LF ITF SS + V
Sbjct: 182 GLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232


>gi|418023649|ref|ZP_12662634.1| protein of unknown function DUF81 [Shewanella baltica OS625]
 gi|353537532|gb|EHC07089.1| protein of unknown function DUF81 [Shewanella baltica OS625]
          Length = 257

 Score = 38.5 bits (88), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
             S        G+ AG +  ++G GGG +  P  L LGI P     T   A +F SSM+ 
Sbjct: 6   EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVG 421
             YY    F       F+ ++ IA F+G
Sbjct: 65  WTYYRQHLFKPS----FWYMAFIATFIG 88


>gi|126173862|ref|YP_001050011.1| hypothetical protein Sbal_1628 [Shewanella baltica OS155]
 gi|386340622|ref|YP_006036988.1| hypothetical protein [Shewanella baltica OS117]
 gi|125997067|gb|ABN61142.1| protein of unknown function DUF81 [Shewanella baltica OS155]
 gi|334863023|gb|AEH13494.1| protein of unknown function DUF81 [Shewanella baltica OS117]
          Length = 257

 Score = 38.5 bits (88), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
             S        G+ AG +  ++G GGG +  P  L LGI P     T   A +F SSM+ 
Sbjct: 6   EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVG 421
             YY    F       F+ ++ IA F+G
Sbjct: 65  WTYYRQHLFKPS----FWYMAFIATFIG 88


>gi|55380318|ref|YP_138167.1| hypothetical protein rrnB0294 [Haloarcula marismortui ATCC 43049]
 gi|55233043|gb|AAV48461.1| unknown [Haloarcula marismortui ATCC 43049]
          Length = 342

 Score = 38.5 bits (88), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 349 GMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 407
           G++ G LG+GGGFI  P LF  +G+P  ++  T LF I FS  +    Y L     +   
Sbjct: 191 GLLSGFLGVGGGFIRMPALFYLIGVPVPIAVGTDLFEIVFSGGLGSFLYALDGGVDLSIV 250

Query: 408 LYFFALSIIAAFVGQ 422
           L   A S   A VG 
Sbjct: 251 LPLLAGSAFGARVGS 265


>gi|218681756|ref|ZP_03529531.1| hypothetical protein RetlC8_23708 [Rhizobium etli CIAT 894]
          Length = 191

 Score = 38.5 bits (88), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G + G+ G+GGGF++ PL +   IPP V+ AT    +  SS    + ++    
Sbjct: 18  LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77

Query: 402 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 461
             V          +  A VG  +   L++ +G+  +II +L ++IF+   ++GG+ L + 
Sbjct: 78  LDVKLGTVLLIGGLSGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133

Query: 462 IKRIEH 467
           I  +  
Sbjct: 134 INAMRR 139


>gi|451942306|ref|YP_007462943.1| putative membrane protein [Bartonella vinsonii subsp. berkhoffii
           str. Winnie]
 gi|451901693|gb|AGF76155.1| putative membrane protein [Bartonella vinsonii subsp. berkhoffii
           str. Winnie]
          Length = 306

 Score = 38.5 bits (88), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 345 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 394
           G+  G++  ++G+GGGF + P L   L +P  V   T+LF ITF SS + V
Sbjct: 182 GLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232


>gi|448651767|ref|ZP_21680806.1| hypothetical protein C435_06850 [Haloarcula californiae ATCC 33799]
 gi|445770230|gb|EMA21297.1| hypothetical protein C435_06850 [Haloarcula californiae ATCC 33799]
          Length = 342

 Score = 38.5 bits (88), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 349 GMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 407
           G++ G LG+GGGFI  P LF  +G+P  ++  T LF I FS  +    Y L     +   
Sbjct: 191 GLLSGFLGVGGGFIRMPALFYLIGVPVPIAVGTDLFEIVFSGGLGSFLYALDGGVDLSIV 250

Query: 408 LYFFALSIIAAFVGQ 422
           L   A S   A VG 
Sbjct: 251 LPLLAGSAFGARVGS 265


>gi|425737948|ref|ZP_18856217.1| permease [Staphylococcus massiliensis S46]
 gi|425480853|gb|EKU48016.1| permease [Staphylococcus massiliensis S46]
          Length = 273

 Score = 38.5 bits (88), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 1/82 (1%)

Query: 344 CGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 402
           C    G++ GL G+GGG ++ PL L     PP V+  T++  I FSS  S   + L    
Sbjct: 162 CSFLIGIITGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIFFSSLTSSFGHILQGHV 221

Query: 403 PVPYALYFFALSIIAAFVGQHV 424
              YAL     S I A +G  +
Sbjct: 222 DWFYALILVISSYIGAKIGVRI 243


>gi|146343498|ref|YP_001208546.1| permease membrane protein [Bradyrhizobium sp. ORS 278]
 gi|146196304|emb|CAL80331.1| putative permease; putative membrane protein [Bradyrhizobium sp.
           ORS 278]
          Length = 331

 Score = 38.5 bits (88), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 343 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
           A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+
Sbjct: 42  AMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYW 96


>gi|240948000|ref|ZP_04752418.1| permease [Actinobacillus minor NM305]
 gi|240297670|gb|EER48144.1| permease [Actinobacillus minor NM305]
          Length = 266

 Score = 38.1 bits (87), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 52/104 (50%), Gaps = 7/104 (6%)

Query: 96  GVGGGGIFVPMLN-LIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
           G+GGG + VP+L+ L   F  +  AA S   V G A   ++   +Q+      P++ + +
Sbjct: 24  GIGGGVLMVPILHTLFPEFSLQMVAATSLTTVMGTACINLISFYKQKFSVQIKPLLLWSV 83

Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
            +      + G+ +G   +  F +++I  + I   LVL+ R+FL
Sbjct: 84  GM------IAGVQLGFELSFCFPNFLIITIFIGTLLVLAWRSFL 121


>gi|383828154|ref|ZP_09983243.1| putative permease [Saccharomonospora xinjiangensis XJ-54]
 gi|383460807|gb|EID52897.1| putative permease [Saccharomonospora xinjiangensis XJ-54]
          Length = 289

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 62/138 (44%), Gaps = 4/138 (2%)

Query: 312 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 370
            A A+ +G++K   +          +++     G+  G+V GL+G GGGF++ P L L  
Sbjct: 112 TATAMIRGKKKATGENSGHGDLPLKRIILD---GLVVGLVTGLVGAGGGFLVVPALALLG 168

Query: 371 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 430
           G+P  ++  T+L  I   S   +  Y        P  L   A +I+ + +G  +  ++ +
Sbjct: 169 GLPMAIAVGTSLVVIAMKSFAGLAGYLTTVALNWPLVLGVTAAAIVGSILGAMLTSRVPE 228

Query: 431 ILGRASIIIFTLSFIIFV 448
              R    IF L   +FV
Sbjct: 229 AALRKGFGIFVLVMGVFV 246


>gi|384135922|ref|YP_005518636.1| hypothetical protein TC41_2211 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
 gi|339290007|gb|AEJ44117.1| protein of unknown function DUF81 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
          Length = 256

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 6/115 (5%)

Query: 358 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 417
           GGG I  P  L +G+PP  +  T  FA T S+  S   Y L  +  +      F L ++A
Sbjct: 28  GGGLISLPALLSIGLPPAAALGTNKFAGTISAITSFSSYLLSGKVRLKLVGPLFPLGVMA 87

Query: 418 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL--GGVGLAKMIKRIEHKEY 470
           + +G +++ +L     R     F L  ++ V+A +L    +GL + I+ +  K +
Sbjct: 88  SALGAYIVHQLPSSFLRP----FVLVMLVIVAAYTLWKKDLGLVESIRPLTRKTF 138


>gi|330813138|ref|YP_004357377.1| transmembrane permease [Candidatus Pelagibacter sp. IMCC9063]
 gi|327486233|gb|AEA80638.1| putative transmembrane permease [Candidatus Pelagibacter sp.
           IMCC9063]
          Length = 352

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 31/44 (70%), Gaps = 1/44 (2%)

Query: 345 GITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITF 387
           GI  G++  +LG+GGGF+L P+ +  LG+P ++ + T+LF + F
Sbjct: 182 GIVVGILSSMLGVGGGFLLVPIMIYILGMPARLVAGTSLFVMIF 225


>gi|289551261|ref|YP_003472165.1| hypothetical protein SLGD_01950 [Staphylococcus lugdunensis
           HKU09-01]
 gi|385784877|ref|YP_005761050.1| hypothetical protein SLUG_19420 [Staphylococcus lugdunensis
           N920143]
 gi|418414556|ref|ZP_12987771.1| hypothetical protein HMPREF9308_00936 [Staphylococcus lugdunensis
           ACS-027-V-Sch2]
 gi|289180793|gb|ADC88038.1| hypothetical protein SLGD_01950 [Staphylococcus lugdunensis
           HKU09-01]
 gi|339895133|emb|CCB54450.1| putative membrane protein [Staphylococcus lugdunensis N920143]
 gi|410877163|gb|EKS25060.1| hypothetical protein HMPREF9308_00936 [Staphylococcus lugdunensis
           ACS-027-V-Sch2]
          Length = 275

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 4/82 (4%)

Query: 318 KGRRKIASKGDEGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPP 374
           K + + +    EG  +  S   LV + A  +  G++ GL G+GGG ++ PL L +   PP
Sbjct: 137 KPQYRKSYTDSEGVTYHYSVPPLVAFVAT-LFIGVLTGLFGIGGGALMTPLMLIIFRFPP 195

Query: 375 QVSSATALFAITFSSSMSVVEY 396
            V+  T++  I FSS MS + +
Sbjct: 196 HVAVGTSMMMIFFSSVMSSIGH 217


>gi|365886110|ref|ZP_09425076.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3809]
 gi|365338422|emb|CCD97607.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3809]
          Length = 319

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 31/58 (53%)

Query: 340 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 397
              A G   G V G+ G+GGGF++ PL + +GI P V+ A+    I  SS    + Y+
Sbjct: 27  LLLAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYW 84


>gi|326795569|ref|YP_004313389.1| hypothetical protein Marme_2312 [Marinomonas mediterranea MMB-1]
 gi|326546333|gb|ADZ91553.1| protein of unknown function DUF81 [Marinomonas mediterranea MMB-1]
          Length = 253

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 31/57 (54%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
           ++ +Y A G+  G+  G+ G+GGG ++ PL L  G PP ++  T L     + S  V
Sbjct: 2   EIFWYIAAGVGVGLAVGITGVGGGSLMTPLLLLFGFPPHIAIGTDLMYAGIAKSTGV 58


>gi|217973886|ref|YP_002358637.1| hypothetical protein Sbal223_2726 [Shewanella baltica OS223]
 gi|217499021|gb|ACK47214.1| protein of unknown function DUF81 [Shewanella baltica OS223]
          Length = 257

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
             S        G+ AG +  ++G GGG +  P  L LGI P     T   A +F SSM+ 
Sbjct: 6   EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVG 421
             YY    F       F+ ++ IA F+G
Sbjct: 65  WTYYRQHLFKPS----FWYMAFIATFIG 88


>gi|153000146|ref|YP_001365827.1| hypothetical protein Shew185_1617 [Shewanella baltica OS185]
 gi|373949018|ref|ZP_09608979.1| protein of unknown function DUF81 [Shewanella baltica OS183]
 gi|386325143|ref|YP_006021260.1| hypothetical protein [Shewanella baltica BA175]
 gi|151364764|gb|ABS07764.1| protein of unknown function DUF81 [Shewanella baltica OS185]
 gi|333819288|gb|AEG11954.1| protein of unknown function DUF81 [Shewanella baltica BA175]
 gi|373885618|gb|EHQ14510.1| protein of unknown function DUF81 [Shewanella baltica OS183]
          Length = 257

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
             S        G+ AG +  ++G GGG +  P  L LGI P     T   A +F SSM+ 
Sbjct: 6   EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVG 421
             YY    F       F+ ++ IA F+G
Sbjct: 65  WTYYRQHLFKPS----FWYMAFIATFIG 88


>gi|395782829|ref|ZP_10463201.1| hypothetical protein MCY_01598 [Bartonella rattimassiliensis 15908]
 gi|395416707|gb|EJF83077.1| hypothetical protein MCY_01598 [Bartonella rattimassiliensis 15908]
          Length = 306

 Score = 38.1 bits (87), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 1/52 (1%)

Query: 344 CGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 394
            G+  G++  ++G+GGGF + P L   L +P  V   T+LF ITF SS + V
Sbjct: 181 IGLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232


>gi|328952894|ref|YP_004370228.1| hypothetical protein Desac_1185 [Desulfobacca acetoxidans DSM
           11109]
 gi|328453218|gb|AEB09047.1| protein of unknown function DUF81 [Desulfobacca acetoxidans DSM
           11109]
          Length = 427

 Score = 38.1 bits (87), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 1/65 (1%)

Query: 339 VFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYY 397
           VF   CG   G+V  ++G+GGGF+  P+F+  LG+    +  T +  I F++  + +  Y
Sbjct: 258 VFVALCGFVVGVVAAIMGVGGGFLAFPMFVYLLGVSSFTTVGTDILQIIFTAGYAAISQY 317

Query: 398 LLKRF 402
            +  F
Sbjct: 318 AIYGF 322


>gi|433544184|ref|ZP_20500574.1| hypothetical protein D478_10790 [Brevibacillus agri BAB-2500]
 gi|432184540|gb|ELK42051.1| hypothetical protein D478_10790 [Brevibacillus agri BAB-2500]
          Length = 262

 Score = 38.1 bits (87), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 45/71 (63%), Gaps = 6/71 (8%)

Query: 326 KGDEGTKWRASQLVFYCACGITA----GMVGGLLGLGGGFILGPLFLE-LGIPPQVSSAT 380
           K ++GT+ +A ++ +  A  I A    G+V G++G GG FIL P+ L  L IP +V+ A+
Sbjct: 130 KKNKGTQEQADEIEYNRAVAIPAAFVVGIVSGIVGAGGAFILIPIMLTVLKIPTRVTIAS 189

Query: 381 ALFAITFSSSM 391
           +L AI F S++
Sbjct: 190 SL-AIVFISAI 199


>gi|418636065|ref|ZP_13198419.1| sulfite exporter TauE/SafE [Staphylococcus lugdunensis VCU139]
 gi|374841224|gb|EHS04701.1| sulfite exporter TauE/SafE [Staphylococcus lugdunensis VCU139]
          Length = 275

 Score = 38.1 bits (87), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 4/82 (4%)

Query: 318 KGRRKIASKGDEGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPP 374
           K + + +    EG  +  S   LV + A  +  G++ GL G+GGG ++ PL L +   PP
Sbjct: 137 KPQYRKSYTDSEGVTYHYSVPPLVAFVAT-LFIGVLTGLFGIGGGALMTPLMLIIFRFPP 195

Query: 375 QVSSATALFAITFSSSMSVVEY 396
            V+  T++  I FSS MS + +
Sbjct: 196 HVAVGTSMMMIFFSSVMSSIGH 217


>gi|160874768|ref|YP_001554084.1| hypothetical protein Sbal195_1651 [Shewanella baltica OS195]
 gi|378708021|ref|YP_005272915.1| hypothetical protein [Shewanella baltica OS678]
 gi|160860290|gb|ABX48824.1| protein of unknown function DUF81 [Shewanella baltica OS195]
 gi|315267010|gb|ADT93863.1| protein of unknown function DUF81 [Shewanella baltica OS678]
          Length = 257

 Score = 38.1 bits (87), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)

Query: 334 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 393
             S        G+ AG +  ++G GGG +  P  L LGI P     T   A +F SSM+ 
Sbjct: 6   EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64

Query: 394 VEYYLLKRFPVPYALYFFALSIIAAFVG 421
             YY    F       F+ ++ IA F+G
Sbjct: 65  WTYYRQHLFKPS----FWYMAFIATFIG 88


>gi|313206800|ref|YP_004045977.1| hypothetical protein Riean_1314 [Riemerella anatipestifer ATCC
           11845 = DSM 15868]
 gi|383486108|ref|YP_005395020.1| hypothetical protein RA0C_1590 [Riemerella anatipestifer ATCC 11845
           = DSM 15868]
 gi|386321217|ref|YP_006017379.1| hypothetical protein RIA_0901 [Riemerella anatipestifer RA-GD]
 gi|416109380|ref|ZP_11591339.1| hypothetical protein RAYM_00625 [Riemerella anatipestifer RA-YM]
 gi|442313984|ref|YP_007355287.1| hypothetical protein G148_0288 [Riemerella anatipestifer RA-CH-2]
 gi|312446116|gb|ADQ82471.1| protein of unknown function DUF81 [Riemerella anatipestifer ATCC
           11845 = DSM 15868]
 gi|315023873|gb|EFT36875.1| hypothetical protein RAYM_00625 [Riemerella anatipestifer RA-YM]
 gi|325335760|gb|ADZ12034.1| Protein of unknown function DUF81 [Riemerella anatipestifer RA-GD]
 gi|380460793|gb|AFD56477.1| hypothetical protein RA0C_1590 [Riemerella anatipestifer ATCC 11845
           = DSM 15868]
 gi|441482907|gb|AGC39593.1| hypothetical protein G148_0288 [Riemerella anatipestifer RA-CH-2]
          Length = 264

 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 58/111 (52%), Gaps = 8/111 (7%)

Query: 320 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSS 378
           R K      E +  R + L+   + G+  G++ GL+G GGGF++ P L + LG+  + + 
Sbjct: 130 RSKTEPSTLEISPSRNTLLI---SQGLLVGIITGLVGAGGGFLIIPALVMILGLKMKEAI 186

Query: 379 ATALFAITFSSSMSVVEYYLLKRFPVP--YALYFFALSIIAAFVGQHVLKK 427
            T+LF IT SS++  V    L +  +   + L F  LSI+   +G  + K+
Sbjct: 187 GTSLFIITLSSTIGFVSS--LDKVAIDWYFLLSFTGLSILGVLLGLALSKR 235


>gi|19551303|ref|NP_599305.1| hypothetical protein NCgl0052 [Corynebacterium glutamicum ATCC
           13032]
 gi|62388948|ref|YP_224350.1| hypothetical protein cg0072 [Corynebacterium glutamicum ATCC 13032]
 gi|41324281|emb|CAF18621.1| conserved hypothetical membrane protein [Corynebacterium glutamicum
           ATCC 13032]
          Length = 248

 Score = 38.1 bits (87), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 64/141 (45%), Gaps = 10/141 (7%)

Query: 315 ALYKGRRKIASKG-DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGI 372
           A+ +GR++   KG  + + WR          G+  G V GL+G GGGF++ P L L  G+
Sbjct: 115 AMLRGRKQ--KKGASKSSLWR------VLVDGLVVGAVTGLVGAGGGFLVVPALALLGGL 166

Query: 373 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 432
              V+  T+L  IT  S   +  Y    +      L   A +I+ +  G  +  ++ + L
Sbjct: 167 SMPVAVGTSLVVITMKSFAGLAGYLTSVQLDWGLVLMVTAAAIVGSLAGSRLAGRVPETL 226

Query: 433 GRASIIIFTLSFIIFVSALSL 453
            R    +F L   +FV  L L
Sbjct: 227 LRKGFGVFVLVMGVFVLGLEL 247


>gi|448406174|ref|ZP_21572696.1| hypothetical protein C475_00150 [Halosimplex carlsbadense 2-9-1]
 gi|445678013|gb|ELZ30508.1| hypothetical protein C475_00150 [Halosimplex carlsbadense 2-9-1]
          Length = 343

 Score = 38.1 bits (87), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 342 CACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSM 391
            A     G++ G LG+GGGFI  P LF  +G+P  V+  T LF I FS  +
Sbjct: 183 LAVAFATGLLSGFLGVGGGFIRMPALFYLIGVPVPVAVGTDLFEIVFSGGI 233


>gi|398385949|ref|ZP_10543964.1| putative permease [Sphingobium sp. AP49]
 gi|397719442|gb|EJK80010.1| putative permease [Sphingobium sp. AP49]
          Length = 304

 Score = 38.1 bits (87), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 65/119 (54%), Gaps = 11/119 (9%)

Query: 342 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 401
              G   G++ G+ G+GGGF+  PL +  GIPP V++A+A   +T +S   VV +  + R
Sbjct: 18  IGLGGVVGLLSGMFGVGGGFLTTPLLIFYGIPPTVAAASAASQVTGASVSGVVTH--MSR 75

Query: 402 FPVPYAL--YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 458
             V + +     A  I+ A +G  +  +L++ +G+   +I     I++V  L LGG+GL
Sbjct: 76  GTVDFRMGGVLIAGGIVGAGLGVLIF-RLLQAIGQIDTVI----GILYV--LMLGGIGL 127


>gi|387128098|ref|YP_006296703.1| permease [Methylophaga sp. JAM1]
 gi|386275160|gb|AFI85058.1| putative permease [Methylophaga sp. JAM1]
          Length = 269

 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 8/67 (11%)

Query: 341 YCACGITAGMVGGLLGLGGGFILGP----LFLELGIPPQ----VSSATALFAITFSSSMS 392
           Y A G+ AG + GL G+GGG ++ P    LF + G+P Q    ++ AT+L  I  +S  S
Sbjct: 13  YAAAGMVAGTLAGLFGIGGGLVIVPVLVWLFSKQGLPDQYLIHMAVATSLMTIVVTSMSS 72

Query: 393 VVEYYLL 399
           +  ++ L
Sbjct: 73  IFAHWRL 79


>gi|423434381|ref|ZP_17411362.1| hypothetical protein IE9_00562 [Bacillus cereus BAG4X12-1]
 gi|401126850|gb|EJQ34583.1| hypothetical protein IE9_00562 [Bacillus cereus BAG4X12-1]
          Length = 254

 Score = 38.1 bits (87), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 58/110 (52%), Gaps = 9/110 (8%)

Query: 321 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSA 379
           +K  S   +  KW AS L F        G V G++G GG F+L P+ L  L +P +++ A
Sbjct: 125 KKNNSGAVKYNKWLASVLAFIV------GGVSGIIGAGGAFLLVPIMLVILKLPLRMTIA 178

Query: 380 TALFAITFSSSMSVVEYYLLK-RFPVPYALYFFALSIIAAFVGQHVLKKL 428
           T++ AITF SS+ +    ++  +  V  AL     SI AA +G  V KK+
Sbjct: 179 TSI-AITFISSVGITTGKVITGQVVVIPALIIAVASIFAAPLGARVGKKI 227


>gi|417985491|ref|ZP_12626075.1| membrane protein [Lactobacillus casei 32G]
 gi|410528519|gb|EKQ03371.1| membrane protein [Lactobacillus casei 32G]
          Length = 280

 Score = 38.1 bits (87), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 21/89 (23%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVE 395
           Q++     G+  G++G +LG+GGG I+ P+  + +G+P Q +   ++ ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           Y   +   +  A++    + + A +G  V
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91


>gi|336423425|ref|ZP_08603555.1| hypothetical protein HMPREF0993_02932 [Lachnospiraceae bacterium
           5_1_57FAA]
 gi|336004677|gb|EGN34737.1| hypothetical protein HMPREF0993_02932 [Lachnospiraceae bacterium
           5_1_57FAA]
          Length = 253

 Score = 38.1 bits (87), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 339 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY-- 396
           +  C     AGMV  + G GGG I  P +L  G+PP  + AT   +  F ++++   +  
Sbjct: 9   IIVCPLVFLAGMVDAIAG-GGGLISLPAYLIAGLPPHAAVATNKMSSPFGTALATYRFAR 67

Query: 397 -YLLK-RFPVPYALYFFALSIIAAFVGQHV 424
            +L+  R  VP  +     +II +F+G HV
Sbjct: 68  NHLINVRLAVPSVMA----AIIGSFIGSHV 93


>gi|325106350|ref|YP_004276004.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324975198|gb|ADY54182.1| protein of unknown function DUF81 [Pedobacter saltans DSM 12145]
          Length = 265

 Score = 38.1 bits (87), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 345 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV-EYYLLKRF 402
           GI  G++ GL+G GGGF++ P + L +G+P  ++  T+L  I  +S +    ++  L   
Sbjct: 154 GIIVGVLSGLIGAGGGFLIVPAITLFMGVPVHLAVGTSLLIIALNSLVGFTGDFSHLSEI 213

Query: 403 PVPYALYFFALSIIAAFVGQHVLKKL 428
              Y L F   S +   VG ++ KKL
Sbjct: 214 NWSYLLEFSLFSSVGVLVGVYLGKKL 239


>gi|227534549|ref|ZP_03964598.1| permease [Lactobacillus paracasei subsp. paracasei ATCC 25302]
 gi|227187798|gb|EEI67865.1| permease [Lactobacillus paracasei subsp. paracasei ATCC 25302]
          Length = 280

 Score = 38.1 bits (87), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 21/89 (23%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 337 QLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVE 395
           Q++     G+  G++G +LG+GGG I+ P+  + +G+P Q +   ++ ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 396 YYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           Y   +   +  A++    + + A +G  V
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91


>gi|209881434|ref|XP_002142155.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209557761|gb|EEA07806.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 525

 Score = 38.1 bits (87), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 72/167 (43%), Gaps = 35/167 (20%)

Query: 83  IIGFFGTAC-GSVG---GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
           II      C GS+    G GGG I +P+  + +      + A+S  ++ GG++ +V+ ++
Sbjct: 61  IIALITVGCLGSIAVSAGAGGGVISIPVFLVFMNMPFYQAIALSTSVILGGSLCSVILDI 120

Query: 139 RQRHPTL-----------------------DIPVIDYDLALLFQPLL----VLGISIGVA 171
            ++ P L                        +P++D  L +    LL    +LGI I   
Sbjct: 121 VKKKPELPEYVNTLNYFKSALDNKLDVNLNKLPIMDLPLVIFLSSLLSAGTLLGILISKV 180

Query: 172 FNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
            N I +  ++  LLI +F     + ++K  +  + E IT+ E   ++
Sbjct: 181 LNSILSFLILEFLLIYVFY----KTWIKFWKIREMEKITQSENNMEI 223


>gi|399887998|ref|ZP_10773875.1| hypothetical protein CarbS_05635 [Clostridium arbusti SL206]
          Length = 274

 Score = 38.1 bits (87), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 1/91 (1%)

Query: 331 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 390
           ++   S L+  C  G  A  V  + G GGG I  P FL  G+P +++  T  F  T  + 
Sbjct: 21  SQTTPSILLLLCIFGFIAAAVDSIAG-GGGLISLPAFLLAGLPARLALGTNKFCSTSGTL 79

Query: 391 MSVVEYYLLKRFPVPYALYFFALSIIAAFVG 421
           MS  EYY   +       Y F  ++I A +G
Sbjct: 80  MSSFEYYKNGKMNFKLLKYLFPFTLIGAVIG 110


>gi|427412871|ref|ZP_18903063.1| hypothetical protein HMPREF9282_00470 [Veillonella ratti
           ACS-216-V-Col6b]
 gi|425715687|gb|EKU78673.1| hypothetical protein HMPREF9282_00470 [Veillonella ratti
           ACS-216-V-Col6b]
          Length = 271

 Score = 38.1 bits (87), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 54/113 (47%), Gaps = 6/113 (5%)

Query: 342 CACGITAGMVGGLLGLGGGFI-LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 400
           C   +  G +  + G+GGG I +  L   LG P  +++AT+ F +  S+ M V  +Y L 
Sbjct: 156 CLISVVVGFISSVFGIGGGLIHVAALVYLLGFPTHIATATSQFILFLSTIMGVTTHYFLG 215

Query: 401 RFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 453
                 A+     ++I A +G  + K+L     +A+ I+   S  + + AL L
Sbjct: 216 HIQWNIAIACGIGAVIGAQLGAAIAKRL-----KATSILMVFSVGVGILALQL 263


>gi|197106230|ref|YP_002131607.1| hypothetical protein PHZ_c2769 [Phenylobacterium zucineum HLK1]
 gi|196479650|gb|ACG79178.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1]
          Length = 269

 Score = 38.1 bits (87), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 59/105 (56%), Gaps = 4/105 (3%)

Query: 365 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 424
           P+ + LGIPP V+ A+    +  SS  SV+ Y   +   +       A  ++ AF+G   
Sbjct: 3   PMLVFLGIPPAVAVASMSNHVAASSMSSVIAYGRRRAVDLRMGGVLAAGGVVGAFLGVE- 61

Query: 425 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 469
           L +L+++LG+A +++ ++S+++F+S   +GG+ L + +  I  + 
Sbjct: 62  LFRLLRLLGQADLVV-SVSYLVFLSI--IGGLMLTESLGAILRRR 103


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.324    0.140    0.423 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,306,450,149
Number of Sequences: 23463169
Number of extensions: 303328800
Number of successful extensions: 1344461
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1384
Number of HSP's successfully gapped in prelim test: 2794
Number of HSP's that attempted gapping in prelim test: 1332744
Number of HSP's gapped (non-prelim): 13400
length of query: 479
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 333
effective length of database: 8,933,572,693
effective search space: 2974879706769
effective search space used: 2974879706769
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 79 (35.0 bits)