BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011811
         (477 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224092328|ref|XP_002309560.1| predicted protein [Populus trichocarpa]
 gi|222855536|gb|EEE93083.1| predicted protein [Populus trichocarpa]
          Length = 479

 Score =  733 bits (1891), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/478 (79%), Positives = 426/478 (89%), Gaps = 2/478 (0%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGLKIE--TLRFNGTERQDSNYLLKAVNF 58
           MA+ G KW+ LRSV+M+ LNF LAFV V AERGLK E  + R N T    S+Y+LKAVNF
Sbjct: 1   MAEFGGKWRGLRSVLMVLLNFLLAFVFVSAERGLKHEATSTRMNETGDSVSSYVLKAVNF 60

Query: 59  LWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPK 118
           LWQP+  GY HVWPE+KFGW+IVLG+IIGFFGAAFGSVGGVGGGGIFVPMLSL+IGFDPK
Sbjct: 61  LWQPDHKGYQHVWPEMKFGWQIVLGSIIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPK 120

Query: 119 SATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIF 178
           SATAISKCMIMGAA+STVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNV+F
Sbjct: 121 SATAISKCMIMGAAISTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVVF 180

Query: 179 ADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSG 238
           ADWMVTVLLI+LF+GTSTKAFLKGVETWK+ETI+K+EAA+ L S+GA AGEVEYK LP G
Sbjct: 181 ADWMVTVLLIILFLGTSTKAFLKGVETWKKETIMKREAAKRLESDGASAGEVEYKPLPGG 240

Query: 239 PRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQI 298
           P + PQK   E EV+ILEN+ WKELGLLVFVWVAFL LQI K  T++CS  YWVL+LLQI
Sbjct: 241 PSNSPQKADKEEEVSILENVYWKELGLLVFVWVAFLVLQITKEGTSTCSTTYWVLNLLQI 300

Query: 299 PVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFI 358
           PVS+ VS+YEA+SLYKG R+IASKG +G +F + QLV YC FG+LAG+VGGLLGLGGGFI
Sbjct: 301 PVSVGVSMYEAVSLYKGHRIIASKGTEGTNFTILQLVIYCLFGILAGVVGGLLGLGGGFI 360

Query: 359 MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQ 418
           MGPLFLELG+PPQVSSATATFAMTFSSSMSVVEYYLLKRFPV YA+YFVAVAT AAF+GQ
Sbjct: 361 MGPLFLELGIPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVSYAVYFVAVATFAAFIGQ 420

Query: 419 HVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           H+VR+LII+ GRASLIIFILA  IF+SAISLGGVG++NMIGK HRHEYMGFENLCKYD
Sbjct: 421 HIVRRLIIVFGRASLIIFILASTIFISAISLGGVGVANMIGKIHRHEYMGFENLCKYD 478


>gi|255551046|ref|XP_002516571.1| conserved hypothetical protein [Ricinus communis]
 gi|223544391|gb|EEF45912.1| conserved hypothetical protein [Ricinus communis]
          Length = 476

 Score =  709 bits (1829), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/481 (79%), Positives = 427/481 (88%), Gaps = 11/481 (2%)

Query: 1   MADLGA-KWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQD----SNYLLKA 55
           MA+ G  KW+ LRSV+M+ LNF LAFV V AERG++ E   F GT + +    S Y +KA
Sbjct: 1   MAEFGGGKWRGLRSVLMVSLNFLLAFVLVSAERGMRRE---FTGTPQTEGEGLSTYFVKA 57

Query: 56  VNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGF 115
           V+FLWQP+ +GY HVWPE++FGW+IVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGF
Sbjct: 58  VDFLWQPDQTGYRHVWPEMRFGWQIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGF 117

Query: 116 DPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN 175
           DPKSATAISKCMIMGAA STVYYNLKLRHPTLDMPIIDYDL LLIQPMLMLGISIGVAFN
Sbjct: 118 DPKSATAISKCMIMGAAASTVYYNLKLRHPTLDMPIIDYDLTLLIQPMLMLGISIGVAFN 177

Query: 176 VIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSL 235
           VIFADWMVTVLLI+LFIGTSTKAF KGVETWK+ETI+KKEAA+ L +NGA   EVEYK L
Sbjct: 178 VIFADWMVTVLLIILFIGTSTKAFFKGVETWKKETIMKKEAAKRLEANGA---EVEYKPL 234

Query: 236 PSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDL 295
           PSGP +GP+K+A EPEVT+LEN+ WKELGLLVFVWVAFL L IAKNH  +CSI YWV++L
Sbjct: 235 PSGPSTGPEKEAKEPEVTLLENVYWKELGLLVFVWVAFLILHIAKNHLPACSIEYWVVNL 294

Query: 296 LQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGG 355
           LQIPVSL VS YEA+SLYKGRR+IASKG +G +F+V QLV YC+ GVLAG+VGGLLGLGG
Sbjct: 295 LQIPVSLGVSGYEAVSLYKGRRIIASKGKEGTNFKVHQLVLYCSCGVLAGVVGGLLGLGG 354

Query: 356 GFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAF 415
           GFIMGPLFLELG+PPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFV+VATIAA 
Sbjct: 355 GFIMGPLFLELGIPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVSVATIAAL 414

Query: 416 VGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           +GQH+VR++II+ GRASLIIFILAF IFVSAISLGGVGISNMI K   +EYMGFENLCKY
Sbjct: 415 IGQHIVRRMIIMFGRASLIIFILAFTIFVSAISLGGVGISNMIWKIQHNEYMGFENLCKY 474

Query: 476 D 476
           +
Sbjct: 475 E 475


>gi|297821995|ref|XP_002878880.1| hypothetical protein ARALYDRAFT_481426 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324719|gb|EFH55139.1| hypothetical protein ARALYDRAFT_481426 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 479

 Score =  682 bits (1759), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/478 (75%), Positives = 408/478 (85%), Gaps = 4/478 (0%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGL-KIETLRFNG--TERQDSNYLLKAVN 57
           MA L +KW  LRSV+M+ +NF+LAF  V AER + K ++LR N   T   +S++ LKAVN
Sbjct: 1   MAKLRSKWLGLRSVIMVLINFSLAFAFVSAERRIIKGKSLRLNSDETRENESSFFLKAVN 60

Query: 58  FLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDP 117
           FLW+ +  GY HVWPE +F W+IVLGT++GFFGAAFGSVGGVGGGGIFVPMLSL+IGFDP
Sbjct: 61  FLWESDQIGYRHVWPEFEFNWQIVLGTLVGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDP 120

Query: 118 KSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVI 177
           KSATAISKCMIMGA+VSTVYYNL+LRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVI
Sbjct: 121 KSATAISKCMIMGASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVI 180

Query: 178 FADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPS 237
           F DW+VTVLLI+LF+GTSTKAFLKG ETW +ETI KKEAA+ L SNG    EVEY  LP+
Sbjct: 181 FPDWLVTVLLIILFLGTSTKAFLKGSETWNKETIEKKEAAKRLESNGVSGAEVEYVPLPA 240

Query: 238 GPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQ 297
            P + P     E EV+I+EN+ WKELGLLVFVWV FL LQI+K + A+CS+ YWV++LLQ
Sbjct: 241 APSTNPGNKKKE-EVSIIENVYWKELGLLVFVWVVFLALQISKQNLANCSVAYWVINLLQ 299

Query: 298 IPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGF 357
           IPV++ VS YEA++LY+GRR+IASKG    +F V QLV YC FG+LAGIVGGLLGLGGGF
Sbjct: 300 IPVAVGVSGYEAVALYQGRRIIASKGQGDSNFTVGQLVMYCTFGILAGIVGGLLGLGGGF 359

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG 417
           IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY V VATIAAFVG
Sbjct: 360 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLVGVATIAAFVG 419

Query: 418 QHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           QHVVR+LI  LGRASLIIFILA MIF+SAISLGGVGI NMIGK  RHEYMGFENLCKY
Sbjct: 420 QHVVRRLIAALGRASLIIFILASMIFISAISLGGVGIVNMIGKIQRHEYMGFENLCKY 477


>gi|225430021|ref|XP_002284175.1| PREDICTED: uncharacterized protein LOC100267889 [Vitis vinifera]
 gi|296081886|emb|CBI20891.3| unnamed protein product [Vitis vinifera]
          Length = 478

 Score =  681 bits (1758), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/481 (77%), Positives = 417/481 (86%), Gaps = 9/481 (1%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQ--DSNYLLKAVNF 58
           MA    KW +L    MI  +F LA   V  ER +K E   FN T+    +SNYL K VNF
Sbjct: 1   MAGFATKWLML----MILCSFMLASAFVSGERSIKHEASTFNVTKEAGFNSNYLSKVVNF 56

Query: 59  LWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPK 118
           LWQ + SGY HVWPE++FGW+IV+G++IGFFGAAFGSVGGVGGGGIFVPMLSL+IGFDPK
Sbjct: 57  LWQSDRSGYQHVWPEMEFGWQIVVGSVIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPK 116

Query: 119 SATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIF 178
           SATA+SKCMIMGAA STVYYNLKLRHPTLDMPIIDYDLALL QPMLM+GISIGVAFNV+F
Sbjct: 117 SATALSKCMIMGAAGSTVYYNLKLRHPTLDMPIIDYDLALLFQPMLMMGISIGVAFNVLF 176

Query: 179 ADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSG 238
           ADWMVTVLLIVLF+GTSTKAFLKGVETWK+ETI+K+EAA+ LG+NG G  EVEYK LPSG
Sbjct: 177 ADWMVTVLLIVLFLGTSTKAFLKGVETWKKETIMKREAAKRLGTNGNGTEEVEYKPLPSG 236

Query: 239 PRSGPQK---DAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDL 295
           P +G Q     + EPEV+I+EN+ WKELGLLVFVWVAFL LQIAKN+TA+CS+ YWVL+ 
Sbjct: 237 PSNGTQNATNKSKEPEVSIIENVYWKELGLLVFVWVAFLALQIAKNNTATCSMAYWVLNF 296

Query: 296 LQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGG 355
           +QIPVS+ VSLYEA+SLYKGRR+IASKGD G +FRV QL+ YC  GVLAGIVGGLLGLGG
Sbjct: 297 MQIPVSVGVSLYEAVSLYKGRRIIASKGDAGTNFRVHQLILYCFCGVLAGIVGGLLGLGG 356

Query: 356 GFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAF 415
           GFI+GPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA+Y V VATIAAF
Sbjct: 357 GFILGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYAVYLVLVATIAAF 416

Query: 416 VGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           +GQHVVR+LI +LGRASLIIFILAF IF+SAISLGGVG+SNMIGK  RHEYMGFENLC+Y
Sbjct: 417 IGQHVVRRLISILGRASLIIFILAFTIFISAISLGGVGLSNMIGKIERHEYMGFENLCRY 476

Query: 476 D 476
           +
Sbjct: 477 E 477


>gi|30682879|ref|NP_850068.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|22531098|gb|AAM97053.1| unknown protein [Arabidopsis thaliana]
 gi|25083770|gb|AAN72117.1| unknown protein [Arabidopsis thaliana]
 gi|26450521|dbj|BAC42374.1| unknown protein [Arabidopsis thaliana]
 gi|28951043|gb|AAO63445.1| At2g25737 [Arabidopsis thaliana]
 gi|330252652|gb|AEC07746.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 476

 Score =  672 bits (1733), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/475 (74%), Positives = 401/475 (84%), Gaps = 1/475 (0%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLW 60
           M  L +KW  LRSV +  +NF+LAF  V AER  K   L  + T   +S++ LKA+NFLW
Sbjct: 1   MTKLRSKWLGLRSVTIFLINFSLAFAFVSAERRGKSLRLSTDETRENESSFFLKAINFLW 60

Query: 61  QPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSA 120
           + +  GY HVWPE +F W+IVLGT++GFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSA
Sbjct: 61  ESDQIGYRHVWPEFEFNWQIVLGTLVGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSA 120

Query: 121 TAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFAD 180
           TAISKCMIMGA+VSTVYYNL+LRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIF D
Sbjct: 121 TAISKCMIMGASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFPD 180

Query: 181 WMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPR 240
           W+VTVLLIVLF+GTSTKAFLKG ETW +ETI KKEAA+ L SNG    EVEY  LP+ P 
Sbjct: 181 WLVTVLLIVLFLGTSTKAFLKGSETWNKETIEKKEAAKRLESNGVSGTEVEYVPLPAAPS 240

Query: 241 SGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPV 300
           + P     E EV+I+EN+ WKELGLLVFVW+ FL LQI+K + A+CS+ YWV++LLQIPV
Sbjct: 241 TNPGNKKKE-EVSIIENVYWKELGLLVFVWIVFLALQISKQNLANCSVAYWVINLLQIPV 299

Query: 301 SLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMG 360
           ++ VS YEA++LY+GRR+IASKG    +F V QLV YC FG++AGIVGGLLGLGGGFIMG
Sbjct: 300 AVGVSGYEAVALYQGRRIIASKGQGDSNFTVGQLVMYCTFGIIAGIVGGLLGLGGGFIMG 359

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY V VATIAA+VGQHV
Sbjct: 360 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLVGVATIAAWVGQHV 419

Query: 421 VRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           VR+LI  +GRASLIIFILA MIF+SAISLGGVGI NMIGK  RHEYMGFENLCKY
Sbjct: 420 VRRLIAAIGRASLIIFILASMIFISAISLGGVGIVNMIGKIQRHEYMGFENLCKY 474


>gi|224143013|ref|XP_002324818.1| predicted protein [Populus trichocarpa]
 gi|222866252|gb|EEF03383.1| predicted protein [Populus trichocarpa]
          Length = 448

 Score =  671 bits (1732), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/448 (80%), Positives = 393/448 (87%), Gaps = 2/448 (0%)

Query: 31  ERGLKIETLRFNGTERQDS--NYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGF 88
           ERGL  E       E  DS  +Y+LKAV+ LWQP+  GY HVWPE+KFGW+IVLG+IIGF
Sbjct: 1   ERGLTREATSARTDETGDSVSSYVLKAVSCLWQPDQRGYQHVWPEMKFGWQIVLGSIIGF 60

Query: 89  FGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLD 148
            GAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVS+VYYNLKLRHPT+D
Sbjct: 61  LGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSSVYYNLKLRHPTID 120

Query: 149 MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKR 208
           MPIIDYDLALLIQPMLMLGIS+GVA NVIFADWMVTVLLIVLF  TSTKAF KGVETWK+
Sbjct: 121 MPIIDYDLALLIQPMLMLGISLGVALNVIFADWMVTVLLIVLFTVTSTKAFFKGVETWKK 180

Query: 209 ETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVF 268
           ETI+K+EAA+ L SNG  AGEVEYK L  GP + PQK+  E E+T+ EN+ WKELGLLVF
Sbjct: 181 ETIMKREAAKHLESNGTDAGEVEYKPLRGGPSNSPQKETKELEITVFENVYWKELGLLVF 240

Query: 269 VWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS 328
           VWVAFL LQIAK  T +CSI YWVL+LLQIPVS+ V+LYE +SLYKGRRV+ASKGD+G +
Sbjct: 241 VWVAFLVLQIAKESTYTCSIGYWVLNLLQIPVSVGVTLYEVVSLYKGRRVVASKGDEGTN 300

Query: 329 FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS 388
           FRV QL++YCAFGVLAG+VGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS
Sbjct: 301 FRVLQLMTYCAFGVLAGVVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS 360

Query: 389 VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAIS 448
           VVEYYLLKRFPVPYA+YFV VAT AAF GQH+VR+LII+ GRASLIIFILAF IFVSAI 
Sbjct: 361 VVEYYLLKRFPVPYAVYFVVVATFAAFTGQHIVRRLIIVFGRASLIIFILAFTIFVSAIL 420

Query: 449 LGGVGISNMIGKFHRHEYMGFENLCKYD 476
           LGG GISNMIGK HR EYMGFEN CKYD
Sbjct: 421 LGGFGISNMIGKIHRQEYMGFENFCKYD 448


>gi|356510003|ref|XP_003523730.1| PREDICTED: uncharacterized protein LOC100809318 [Glycine max]
          Length = 472

 Score =  653 bits (1685), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/478 (73%), Positives = 403/478 (84%), Gaps = 11/478 (2%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGL-KIETLRFNGTERQ-DSNYLLKAVNF 58
           M + G +   + S+  +   FA  F  V AER + K+E  RFN T R+   ++L KA+NF
Sbjct: 1   MVEFGTR---MGSIWTVLFCFASVFAIVSAERSIVKMEVPRFNVTSREPQQSFLTKALNF 57

Query: 59  LWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPK 118
           LWQ   SGY HVWP+++FGWEIVLGT +GF GAAFGSVGGVGGGGIFVPMLSLIIGFDPK
Sbjct: 58  LWQSGESGYQHVWPDMEFGWEIVLGTFVGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPK 117

Query: 119 SATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIF 178
           S+TAISKCMIMGAAVSTVYYNLKLRHPTL+MPIIDYDLALLIQPMLMLGISIGV FNV+F
Sbjct: 118 SSTAISKCMIMGAAVSTVYYNLKLRHPTLNMPIIDYDLALLIQPMLMLGISIGVVFNVVF 177

Query: 179 ADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSG 238
            DW+VT+LLIVLF+GTSTKAF KGVETWK+ETI+KKEAA+   SNG+GA  VEYK LPSG
Sbjct: 178 PDWIVTILLIVLFLGTSTKAFFKGVETWKKETIMKKEAAKRQESNGSGA-VVEYKPLPSG 236

Query: 239 PRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAK-NHTASCSIVYWVLDLLQ 297
           P    +KD  E E++I+EN+ WKE GLLVFVWV+FL LQIAK N+T +CS +YWVL+LLQ
Sbjct: 237 P----EKDTKEQEMSIIENVYWKEFGLLVFVWVSFLALQIAKENYTTTCSTLYWVLNLLQ 292

Query: 298 IPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGF 357
           +PVS+ V+ YEA +L+ GRRVIAS G+ GK F V QL+ YC FGVLAG+VGG+LGLGGGF
Sbjct: 293 VPVSVGVTAYEAAALFSGRRVIASTGEQGKDFTVLQLMIYCVFGVLAGVVGGMLGLGGGF 352

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG 417
           +MGPLFLELGVPPQVSSATATFAMTFSSSMSV+EYYLLKRFPVPYALYF+ VATIAAFVG
Sbjct: 353 VMGPLFLELGVPPQVSSATATFAMTFSSSMSVIEYYLLKRFPVPYALYFILVATIAAFVG 412

Query: 418 QHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           QH+VRKLIIL GRASLIIFILA  IFVSA+SLGGVGI NM+ K   HEYMGFE+LCKY
Sbjct: 413 QHIVRKLIILFGRASLIIFILASTIFVSAVSLGGVGIVNMVHKIQNHEYMGFEDLCKY 470


>gi|356518316|ref|XP_003527825.1| PREDICTED: uncharacterized protein LOC100810784 [Glycine max]
          Length = 477

 Score =  652 bits (1682), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/479 (72%), Positives = 401/479 (83%), Gaps = 8/479 (1%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERG-LKIETLRFNGTERQD--SNYLLKAVN 57
           MA+ G +     S+  +   FA     V +ER  +K+E  RFN T  ++   ++L KA+N
Sbjct: 1   MAEFGTR---TGSIWTVLFCFASVVAIVASERSIMKMEVARFNVTTSREPQQSFLTKALN 57

Query: 58  FLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDP 117
           FLWQ   SGY HVWP+++FGW+IV+GT +GF GAAFGSVGGVGGGGIFVPMLSLI+GFD 
Sbjct: 58  FLWQSGESGYQHVWPDMEFGWQIVVGTFVGFCGAAFGSVGGVGGGGIFVPMLSLIVGFDQ 117

Query: 118 KSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVI 177
           KS+TAISKCMIMGAAVSTVYYNLKLRHPTL+MPIIDYDLALLIQPMLMLGISIGV FNV+
Sbjct: 118 KSSTAISKCMIMGAAVSTVYYNLKLRHPTLNMPIIDYDLALLIQPMLMLGISIGVVFNVV 177

Query: 178 FADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPS 237
           F DW+VT+LLIVLF+GTSTKAF KG+ETWK+ETI+KKEAA+   SNG+GA EVEYK LPS
Sbjct: 178 FPDWIVTILLIVLFLGTSTKAFFKGIETWKKETIMKKEAAKRQESNGSGA-EVEYKPLPS 236

Query: 238 GPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAK-NHTASCSIVYWVLDLL 296
           GP    +KD  E EVTI+EN+ WKE GLLVFVWV+FL LQIAK N+T +CS  YWVL+LL
Sbjct: 237 GPNGANEKDTKEQEVTIIENVYWKEFGLLVFVWVSFLALQIAKENYTTTCSTFYWVLNLL 296

Query: 297 QIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGG 356
           Q+PVS+ V+ YEA +L+ GRRVIAS G+ GK F V QL+ YC FGVLAG+VGG+LGLGGG
Sbjct: 297 QVPVSVGVTAYEAAALFSGRRVIASTGEQGKDFTVLQLIIYCVFGVLAGVVGGMLGLGGG 356

Query: 357 FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFV 416
           F+MGPLFLELGVPPQVSSATATFAMTFSSSMSV+EYYLLKRFP+PYALYFV VATIAAFV
Sbjct: 357 FVMGPLFLELGVPPQVSSATATFAMTFSSSMSVIEYYLLKRFPIPYALYFVLVATIAAFV 416

Query: 417 GQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           GQH+VRKLIIL GRASLIIFILA  IFVSA+SLGGVGI NM+ K   HEYMGFE+LCKY
Sbjct: 417 GQHIVRKLIILFGRASLIIFILASTIFVSAVSLGGVGIVNMVHKIDNHEYMGFEDLCKY 475


>gi|358344801|ref|XP_003636475.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
 gi|355502410|gb|AES83613.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
          Length = 470

 Score =  639 bits (1649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/470 (72%), Positives = 397/470 (84%), Gaps = 12/470 (2%)

Query: 11  LRSVMMIFLNFALAFVCVKAERGL-KIETLRFNGT---ERQDSNYLLKAVNFLWQPEGSG 66
           +RS++ + L    +F+ V  ER L KI+   FN T   ++ + ++L KAVNFLW+ +GSG
Sbjct: 6   MRSILWMMLLLFGSFLIVSGERKLVKIQLPSFNVTTQPQKHEQSFLTKAVNFLWKSDGSG 65

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y HVWPE++FGW+IVLG+ IGF GAAFGSVGGVGGGGIFVPMLSLIIGFDPKS+TAISKC
Sbjct: 66  YQHVWPEMEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAISKC 125

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MIMGAA+STVYYNL+LRHPTLDMPIIDYDLALLIQPMLMLGISIGV FNV+F DW+VT+L
Sbjct: 126 MIMGAALSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVTIL 185

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LIVLF+GTSTKAF KG+ETW +ETI+KKEAAR   SNG+G    EYK+LP+GP    +K 
Sbjct: 186 LIVLFLGTSTKAFFKGLETWNKETIMKKEAARRQESNGSG----EYKALPTGPNVAIEK- 240

Query: 247 AMEPEVTILENINWKELGLLVFVWVAFLGLQIAK-NHTASCSIVYWVLDLLQIPVSLVVS 305
             E +V+I+EN+ WKE GLL FVW++FL LQIAK N+T +CS  YW+L+LLQIP+++ V+
Sbjct: 241 --ENKVSIIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQIPIAVGVT 298

Query: 306 LYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE 365
            YEA +L+ GRRVIAS GD GK+F V QLV YC FGVLAGIVGGLLGLGGGF+MGPLFLE
Sbjct: 299 AYEATALFTGRRVIASTGDQGKAFTVGQLVIYCVFGVLAGIVGGLLGLGGGFVMGPLFLE 358

Query: 366 LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
           LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY   VATIAA VGQH+VR+LI
Sbjct: 359 LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLSLVATIAALVGQHIVRRLI 418

Query: 426 ILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           IL GRASLIIFILA  IF+SAISLGGVGISNM+ K   HEYMGFEN+CKY
Sbjct: 419 ILFGRASLIIFILAGTIFISAISLGGVGISNMVHKIANHEYMGFENICKY 468


>gi|225442593|ref|XP_002284423.1| PREDICTED: uncharacterized protein LOC100266912 [Vitis vinifera]
 gi|297743252|emb|CBI36119.3| unnamed protein product [Vitis vinifera]
          Length = 481

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 351/480 (73%), Positives = 406/480 (84%), Gaps = 4/480 (0%)

Query: 1   MADLGAKWKILRSVMMIFLNF-ALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFL 59
           MA +G+KW  LR V +I + F  +A V V AER LK E+   +G+   +   + K VNFL
Sbjct: 1   MAGIGSKWWGLRRVALILVTFLGVASVLVSAERTLKQESSSHDGSREVEPGLMTKIVNFL 60

Query: 60  WQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKS 119
           W P  S Y HVWP++KFGW+I++GTIIGFFGAAFGSVGGVGGGGIFVPML+L+IGFD KS
Sbjct: 61  WDPNLSAYEHVWPDMKFGWQIIVGTIIGFFGAAFGSVGGVGGGGIFVPMLTLVIGFDAKS 120

Query: 120 ATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFA 179
           +TAISKCMIMGAA STVYYNLKLRHPTLDMPIIDYDLALL QPML+LGISIGVAFNVIFA
Sbjct: 121 STAISKCMIMGAAASTVYYNLKLRHPTLDMPIIDYDLALLFQPMLVLGISIGVAFNVIFA 180

Query: 180 DWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGP 239
           DWMVTVLLIVLFIGTSTKAF KGVETWK+ETI+KKEAA+ L SNG G+ + EYK+LP+GP
Sbjct: 181 DWMVTVLLIVLFIGTSTKAFFKGVETWKKETIIKKEAAKRLESNGNGSEDGEYKALPAGP 240

Query: 240 RSGPQKDAMEP---EVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLL 296
             G Q+D   P   EV+I+EN+ WKELGLL  VWV  LGLQI KN+T++CS  YW L+LL
Sbjct: 241 NDGTQRDTNAPKDKEVSIIENVCWKELGLLFAVWVIILGLQIGKNYTSTCSAGYWTLNLL 300

Query: 297 QIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGG 356
           Q+PV++ VS YEA+ LYKGRRVIASKG++G +++V  L+ YC  GVLAGIVGGLLGLGGG
Sbjct: 301 QVPVAVGVSGYEAVCLYKGRRVIASKGEEGTNWKVHNLIFYCGVGVLAGIVGGLLGLGGG 360

Query: 357 FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFV 416
           FI+GPLFLELG+PPQVSSATATFAMTFS+SMSVVEYYLLKRFPVPYALYFVAVAT+AAF+
Sbjct: 361 FILGPLFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFVAVATVAAFL 420

Query: 417 GQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           GQHVVRKLI +LGRASLIIFIL+F IFVSAISLGGVGI +MI K  + +YMGFENLC Y+
Sbjct: 421 GQHVVRKLINILGRASLIIFILSFTIFVSAISLGGVGIKDMIEKIEQKDYMGFENLCTYE 480


>gi|358344803|ref|XP_003636476.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
 gi|355502411|gb|AES83614.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
          Length = 480

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 342/480 (71%), Positives = 397/480 (82%), Gaps = 22/480 (4%)

Query: 11  LRSVMMIFLNFALAFVCVKAERGL-KIETLRFNGT---ERQDSNYLLKAVNFLWQPEGSG 66
           +RS++ + L    +F+ V  ER L KI+   FN T   ++ + ++L KAVNFLW+ +GSG
Sbjct: 6   MRSILWMMLLLFGSFLIVSGERKLVKIQLPSFNVTTQPQKHEQSFLTKAVNFLWKSDGSG 65

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y HVWPE++FGW+IVLG+ IGF GAAFGSVGGVGGGGIFVPMLSLIIGFDPKS+TAISKC
Sbjct: 66  YQHVWPEMEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAISKC 125

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MIMGAA+STVYYNL+LRHPTLDMPIIDYDLALLIQPMLMLGISIGV FNV+F DW+VT+L
Sbjct: 126 MIMGAALSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVTIL 185

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LIVLF+GTSTKAF KG+ETW +ETI+KKEAAR   SNG+G    EYK+LP+GP    +K 
Sbjct: 186 LIVLFLGTSTKAFFKGLETWNKETIMKKEAARRQESNGSG----EYKALPTGPNVAIEK- 240

Query: 247 AMEPEVTILENINWKELGLLVFVWVAFLGLQIAK-NHTASCSIVYWVLDLLQIPVSLVVS 305
             E +V+I+EN+ WKE GLL FVW++FL LQIAK N+T +CS  YW+L+LLQIP+++ V+
Sbjct: 241 --ENKVSIIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQIPIAVGVT 298

Query: 306 LYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE 365
            YEA +L+ GRRVIAS GD GK+F V QLV YC FGVLAGIVGGLLGLGGGF+MGPLFLE
Sbjct: 299 AYEATALFTGRRVIASTGDQGKAFTVGQLVIYCVFGVLAGIVGGLLGLGGGFVMGPLFLE 358

Query: 366 LGVPPQ----------VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAF 415
           LGVPPQ          VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY   VATIAA 
Sbjct: 359 LGVPPQVSFKLQYINHVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLSLVATIAAL 418

Query: 416 VGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           VGQH+VR+LIIL GRASLIIFILA  IF+SAISLGGVGISNM+ K   HEYMGFEN+CKY
Sbjct: 419 VGQHIVRRLIILFGRASLIIFILAGTIFISAISLGGVGISNMVHKIANHEYMGFENICKY 478


>gi|255549968|ref|XP_002516035.1| conserved hypothetical protein [Ricinus communis]
 gi|223544940|gb|EEF46455.1| conserved hypothetical protein [Ricinus communis]
          Length = 483

 Score =  621 bits (1602), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 342/484 (70%), Positives = 406/484 (83%), Gaps = 10/484 (2%)

Query: 1   MADLGAKWKIL-----RSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKA 55
           MA +G++WK L     ++ ++ F+      +  +A +  +  + R   +E  +S +L++ 
Sbjct: 1   MAMIGSEWKSLSLRYVKASLICFIGVVSMTIVSEARQQQEGSSDR--NSEDVESAFLMRV 58

Query: 56  VNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGF 115
           VNFLWQ    GY HVWP++KFGW+IV+G+IIGFFGAAFGSVGGVGGGGIFVPML+LIIGF
Sbjct: 59  VNFLWQKGQLGYTHVWPDMKFGWQIVVGSIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGF 118

Query: 116 DPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN 175
           D KS+TAISKCMI GAA STVYYNLKLRHPTLDMPIIDYDLALL QP+L+LGISIGVAFN
Sbjct: 119 DAKSSTAISKCMITGAAASTVYYNLKLRHPTLDMPIIDYDLALLFQPVLVLGISIGVAFN 178

Query: 176 VIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSL 235
           VIFADWM+TVLLI+LFIGTSTKAFLKGVETWK+ETI+KKEAAR L SNG  + EVEYK L
Sbjct: 179 VIFADWMITVLLIILFIGTSTKAFLKGVETWKKETIIKKEAARRLASNGGASEEVEYKPL 238

Query: 236 PSGPRSGPQKDAMEP---EVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWV 292
           P GP +G   ++ EP   EV+I+EN+ WKELG+L  VW+  L L+I KN+T +CS+VYWV
Sbjct: 239 PGGPTNGTGPESNEPKREEVSIIENVRWKELGILFAVWLIILALEITKNYTTTCSVVYWV 298

Query: 293 LDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLG 352
            +LLQIPV+L VS Y+A+SLYKG+R IASKG+ G ++RV QLV YCA GVLAG+VGGLLG
Sbjct: 299 CNLLQIPVALGVSSYQAVSLYKGKRRIASKGEAGTNWRVHQLVLYCACGVLAGMVGGLLG 358

Query: 353 LGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATI 412
           LGGGFI+GPLFLELGVPPQVSSATATFAMTFS+SMSV+EYYLLKRFPVPYALYFVAV+TI
Sbjct: 359 LGGGFILGPLFLELGVPPQVSSATATFAMTFSASMSVIEYYLLKRFPVPYALYFVAVSTI 418

Query: 413 AAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENL 472
           AAFVGQHVVR+LI +LGRASLIIFIL+F IFVSAISLGGVGI+NMI K   HEYMGF+N+
Sbjct: 419 AAFVGQHVVRRLISILGRASLIIFILSFTIFVSAISLGGVGIANMIEKIENHEYMGFDNI 478

Query: 473 CKYD 476
           C Y+
Sbjct: 479 CSYE 482


>gi|449523247|ref|XP_004168635.1| PREDICTED: uncharacterized LOC101214783 [Cucumis sativus]
          Length = 471

 Score =  617 bits (1590), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 322/465 (69%), Positives = 379/465 (81%), Gaps = 7/465 (1%)

Query: 12  RSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVW 71
           RS+ +   NFA+    V  ER +K+E         +D N    A+NFLWQ + +GYHHVW
Sbjct: 13  RSMALTMFNFAVVLALVSGERKMKLEG------SSEDLNLFQGALNFLWQSDKTGYHHVW 66

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           PE+ FGW+IV+G++IGF GAAFGSVGGVGGGGI+VPMLSLIIGFDPKS+TAISKCMIMGA
Sbjct: 67  PEMAFGWQIVVGSVIGFMGAAFGSVGGVGGGGIYVPMLSLIIGFDPKSSTAISKCMIMGA 126

Query: 132 AVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLF 191
           A STVYYNLKLRHPTL++PIIDYDL LLI PMLMLGIS+GV FNVIFADWMVTVLLIVLF
Sbjct: 127 AASTVYYNLKLRHPTLELPIIDYDLVLLIMPMLMLGISVGVVFNVIFADWMVTVLLIVLF 186

Query: 192 IGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPE 251
           + TSTK+FLKGVETWK+ETI+K EAAR   SN   A +  Y  LPSGP   P+ +  + E
Sbjct: 187 LATSTKSFLKGVETWKKETIMKTEAARRAESNDIQA-DTAYAPLPSGPSHRPETNYTDRE 245

Query: 252 VTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAIS 311
           V ILEN+ WKE+GLL+ VW AFL +QI K H+ +CS  YW+L+ LQ+P++  VS YEA+ 
Sbjct: 246 VPILENVYWKEMGLLLLVWFAFLAIQITKKHSPNCSWEYWLLNFLQVPIAFGVSGYEAVG 305

Query: 312 LYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
           LYKG+R IAS G+    FR++QLV YC  G++AG+VGGLLGLGGGFIMGPLFLELG+PPQ
Sbjct: 306 LYKGKRKIASLGNQKTDFRIYQLVLYCFMGIIAGVVGGLLGLGGGFIMGPLFLELGIPPQ 365

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
           VSSA+ATF MTFSSSMSV++YYLL RFPVPYALYF  VA +AAFVGQHV+RKLI+L+GRA
Sbjct: 366 VSSASATFGMTFSSSMSVIQYYLLNRFPVPYALYFTIVAAVAAFVGQHVIRKLILLIGRA 425

Query: 432 SLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           SLIIFIL+F IFVSA+SLGGVGIS MIG+  RHEYMGFENLCKYD
Sbjct: 426 SLIIFILSFTIFVSALSLGGVGISKMIGQIQRHEYMGFENLCKYD 470


>gi|449437030|ref|XP_004136295.1| PREDICTED: uncharacterized protein LOC101214783 [Cucumis sativus]
          Length = 471

 Score =  617 bits (1590), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 322/465 (69%), Positives = 379/465 (81%), Gaps = 7/465 (1%)

Query: 12  RSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVW 71
           RS+ +   NFA+    V  ER +K+E         +D N    A+NFLWQ + +GYHHVW
Sbjct: 13  RSMALTMFNFAVVLALVSGERKMKLEG------SSEDLNLFQGALNFLWQSDKTGYHHVW 66

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           PE+ FGW+IV+G++IGF GAAFGSVGGVGGGGI+VPMLSLIIGFDPKS+TAISKCMIMGA
Sbjct: 67  PEMAFGWQIVVGSVIGFMGAAFGSVGGVGGGGIYVPMLSLIIGFDPKSSTAISKCMIMGA 126

Query: 132 AVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLF 191
           A STVYYNLKLRHPTL++PIIDYDL LLI PMLMLGIS+GV FNVIFADWMVTVLLIVLF
Sbjct: 127 AASTVYYNLKLRHPTLELPIIDYDLVLLIMPMLMLGISVGVVFNVIFADWMVTVLLIVLF 186

Query: 192 IGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPE 251
           + TSTK+FLKGVETWK+ETI+K EAAR   SN   A +  Y  LPSGP   P+ +  + E
Sbjct: 187 LATSTKSFLKGVETWKKETIMKTEAARRAESNDIQA-DTAYAPLPSGPSHRPETNNTDRE 245

Query: 252 VTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAIS 311
           V ILEN+ WKE+GLL+ VW AFL +QI K H+ +CS  YW+L+ LQ+P++  VS YEA+ 
Sbjct: 246 VPILENVYWKEMGLLLLVWFAFLAIQITKKHSPNCSWEYWLLNFLQVPIAFGVSGYEAVG 305

Query: 312 LYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
           LYKG+R IAS G+    FR++QLV YC  G++AG+VGGLLGLGGGFIMGPLFLELG+PPQ
Sbjct: 306 LYKGKRKIASLGNQKTDFRIYQLVLYCFMGIIAGVVGGLLGLGGGFIMGPLFLELGIPPQ 365

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
           VSSA+ATF MTFSSSMSV++YYLL RFPVPYALYF  VA +AAFVGQHV+RKLI+L+GRA
Sbjct: 366 VSSASATFGMTFSSSMSVIQYYLLNRFPVPYALYFTIVAAVAAFVGQHVIRKLILLIGRA 425

Query: 432 SLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           SLIIFIL+F IFVSA+SLGGVGIS MIG+  RHEYMGFENLCKYD
Sbjct: 426 SLIIFILSFTIFVSALSLGGVGISKMIGQIQRHEYMGFENLCKYD 470


>gi|224143011|ref|XP_002324817.1| predicted protein [Populus trichocarpa]
 gi|222866251|gb|EEF03382.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  609 bits (1570), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 328/475 (69%), Positives = 388/475 (81%), Gaps = 4/475 (0%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLW 60
           MA LGAK+  LR   ++F  F LA   + AER +K    R +  E  + NYL+K ++FLW
Sbjct: 1   MAFLGAKFLSLRPAAILFFTFMLASAFISAERSIKHGVSRLSEAEELEPNYLVKIISFLW 60

Query: 61  QPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSA 120
           Q + SGY HVWPEL F W+ V+GTIIGFFGAAFGSVGGVGGGGIFVPML+LIIGFDPKSA
Sbjct: 61  QSDESGYQHVWPELSFNWQAVIGTIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSA 120

Query: 121 TAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFAD 180
           TAISKCMIMGAAVSTVY+NL+LRHPTLD+P+IDYDL LLIQPMLMLGISIGV FNVIF D
Sbjct: 121 TAISKCMIMGAAVSTVYHNLRLRHPTLDLPVIDYDLVLLIQPMLMLGISIGVTFNVIFPD 180

Query: 181 WMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPR 240
           W++TV LI+L IGTS KAF KG++TWK ETILK+ AA+   S G G+ EVEYK LP G  
Sbjct: 181 WVITVTLIILCIGTSVKAFFKGLDTWKIETILKEVAAKRFESTG-GSEEVEYKPLPDGQG 239

Query: 241 SGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPV 300
           + P K +   +VTIL+NI WKELGLLVFVWV++L +QIAKN+TA CS  +WVL+LLQIP+
Sbjct: 240 NDPPKAS---KVTILQNIYWKELGLLVFVWVSYLAVQIAKNYTAPCSTTFWVLNLLQIPI 296

Query: 301 SLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMG 360
           S+ V LYEAI LYKGRR I+SKGD+   ++V +L+ +   GV+AGIVGGLLG+GGGF+MG
Sbjct: 297 SIGVFLYEAIGLYKGRRRISSKGDEIMDWQVHRLLMFSVCGVVAGIVGGLLGIGGGFVMG 356

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           PLFLE+G+PPQVSS TATF M FSSSMSVVEYYLL RFPVPYALYF+AVA IAAF+GQ++
Sbjct: 357 PLFLEMGIPPQVSSGTATFGMLFSSSMSVVEYYLLDRFPVPYALYFIAVAVIAAFIGQNI 416

Query: 421 VRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           + KLI + GRASLIIF+LAF IF+SAI+LGGVGI+  IG   RHEYMGFENLCKY
Sbjct: 417 ITKLITISGRASLIIFVLAFTIFISAIALGGVGITRTIGMVERHEYMGFENLCKY 471


>gi|218201973|gb|EEC84400.1| hypothetical protein OsI_30977 [Oryza sativa Indica Group]
          Length = 465

 Score =  599 bits (1545), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 320/433 (73%), Positives = 370/433 (85%), Gaps = 8/433 (1%)

Query: 47  QDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFV 106
           ++   L K  NFLWQ +G+ YHHVWPE++ GW+IVLG++IGFFGAAFGSVGGVGGGGIFV
Sbjct: 37  EEVGILRKVANFLWQTDGNSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFV 96

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PML+LIIGFDPKS+TAISKCMIMGAAVSTVYYNLKL+HPTLDMP+IDYDLALLIQPMLML
Sbjct: 97  PMLTLIIGFDPKSSTAISKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLML 156

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           GISIGV FNV+F DW+VTVLLI+LF+GTSTKAFLKGVETWK+ETI+K+EAA+ L      
Sbjct: 157 GISIGVIFNVLFPDWLVTVLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLEQT--- 213

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPE---VTILENINWKELGLLVFVWVAFLGLQIAKNHT 283
           + E EY  LP+GP  G   DA  P     ++++NI WKE GLL FVW+AFL LQ+ KN+T
Sbjct: 214 SEEPEYAPLPTGP--GAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYT 271

Query: 284 ASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVL 343
           A+CS  YW+L+LLQIPVS+ V+LYEA+ L  G+RV++SKG++  + +  QL  YC FG+ 
Sbjct: 272 ATCSSWYWILNLLQIPVSVGVTLYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGIT 331

Query: 344 AGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA 403
           AG+VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FSSSMSVVEYYLL RFPVPYA
Sbjct: 332 AGVVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYA 391

Query: 404 LYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHR 463
           LYFV VA IAA +GQHVVR+LI  LGRASLIIFILAFMIFVSAISLGGVGISNMI K ++
Sbjct: 392 LYFVIVAFIAAIIGQHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHKINQ 451

Query: 464 HEYMGFENLCKYD 476
           HEYMGFENLCKYD
Sbjct: 452 HEYMGFENLCKYD 464


>gi|115478679|ref|NP_001062933.1| Os09g0344800 [Oryza sativa Japonica Group]
 gi|50252864|dbj|BAD29095.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|113631166|dbj|BAF24847.1| Os09g0344800 [Oryza sativa Japonica Group]
 gi|215692830|dbj|BAG88213.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 481

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 318/433 (73%), Positives = 370/433 (85%), Gaps = 8/433 (1%)

Query: 47  QDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFV 106
           ++   L K  NFLWQ +G+ YHHVWPE++ GW+IVLG++IGFFGAAFGSVGGVGGGGIFV
Sbjct: 53  EEVGILRKVANFLWQTDGNSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFV 112

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PML+LIIGFDPKS+TAISKCMIMGAAVSTVYYNLKL+HPTLDMP+IDYDLALLIQPMLML
Sbjct: 113 PMLTLIIGFDPKSSTAISKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLML 172

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           GISIGV FNV+F DW+VTVLLI+LF+GTSTKAFLKGVETWK+ETI+K+EAA+ L      
Sbjct: 173 GISIGVIFNVLFPDWLVTVLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLEQT--- 229

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPE---VTILENINWKELGLLVFVWVAFLGLQIAKNHT 283
           + E EY  LP+GP  G   DA  P     ++++NI WKE GLL FVW+AFL LQ+ KN+T
Sbjct: 230 SEEPEYAPLPTGP--GAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYT 287

Query: 284 ASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVL 343
           A+CS  YW+L+LLQIPVS+ V++YEA+ L  G+RV++SKG++  + +  QL  YC FG+ 
Sbjct: 288 ATCSSWYWILNLLQIPVSVGVTMYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGIT 347

Query: 344 AGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA 403
           AG+VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FSSSMSVVEYYLL RFPVPYA
Sbjct: 348 AGVVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYA 407

Query: 404 LYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHR 463
           LYFV VA IAA +GQHVVR+LI  LGRASLIIFILAFMIFVSAISLGGVGISNMI + ++
Sbjct: 408 LYFVIVAFIAAIIGQHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHRINQ 467

Query: 464 HEYMGFENLCKYD 476
           HEYMGFENLCKYD
Sbjct: 468 HEYMGFENLCKYD 480


>gi|222641386|gb|EEE69518.1| hypothetical protein OsJ_28979 [Oryza sativa Japonica Group]
          Length = 481

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 318/433 (73%), Positives = 370/433 (85%), Gaps = 8/433 (1%)

Query: 47  QDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFV 106
           ++   L K  NFLWQ +G+ YHHVWPE++ GW+IVLG++IGFFGAAFGSVGGVGGGGIFV
Sbjct: 53  EEVGILRKVANFLWQTDGNSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFV 112

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PML+LIIGFDPKS+TAISKCMIMGAAVSTVYYNLKL+HPTLDMP+IDYDLALLIQPMLML
Sbjct: 113 PMLTLIIGFDPKSSTAISKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLML 172

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           GISIGV FNV+F DW+VTVLLI+LF+GTSTKAFLKGVETWK+ETI+K+EAA+ L      
Sbjct: 173 GISIGVIFNVLFPDWLVTVLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLEQT--- 229

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPE---VTILENINWKELGLLVFVWVAFLGLQIAKNHT 283
           + E EY  LP+GP  G   DA  P     ++++NI WKE GLL FVW+AFL LQ+ KN+T
Sbjct: 230 SEEPEYAPLPTGP--GAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYT 287

Query: 284 ASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVL 343
           A+CS  YW+L+LLQIPVS+ V++YEA+ L  G+RV++SKG++  + +  QL  YC FG+ 
Sbjct: 288 ATCSSWYWILNLLQIPVSVGVTMYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGIT 347

Query: 344 AGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA 403
           AG+VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FSSSMSVVEYYLL RFPVPYA
Sbjct: 348 AGVVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYA 407

Query: 404 LYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHR 463
           LYFV VA IAA +GQHVVR+LI  LGRASLIIFILAFMIFVSAISLGGVGISNMI + ++
Sbjct: 408 LYFVIVAFIAAIIGQHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHRINQ 467

Query: 464 HEYMGFENLCKYD 476
           HEYMGFENLCKYD
Sbjct: 468 HEYMGFENLCKYD 480


>gi|326489334|dbj|BAK01650.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  592 bits (1527), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 317/450 (70%), Positives = 370/450 (82%), Gaps = 10/450 (2%)

Query: 31  ERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFG 90
           +RGL +   R       + + L K  NF+WQ +G+ Y HVWP ++FGW++VLG++IGFFG
Sbjct: 27  DRGLSLRAHRAVVAAPDEISLLNKIANFMWQTDGNSYQHVWPPMEFGWKMVLGSLIGFFG 86

Query: 91  AAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMP 150
           A+FGSVGGVGGGGIFVPML+LIIGFDPKS+TAISKCMI GAAVSTVYYNLKL+HPTLDMP
Sbjct: 87  ASFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMITGAAVSTVYYNLKLKHPTLDMP 146

Query: 151 IIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET 210
           +IDYDLALLIQPMLMLGISIGV FNVIF DW+VTVLLIVLF+GTSTKAFLKGVETWK+ET
Sbjct: 147 VIDYDLALLIQPMLMLGISIGVIFNVIFPDWLVTVLLIVLFLGTSTKAFLKGVETWKKET 206

Query: 211 ILKKEAARCLGSNGAGAGEVEYKSLPSGPRS----GPQKDAMEPEVTILENINWKELGLL 266
           I+K+EAA+ L      + E EY  LP+GP +     PQ D      ++++NI WKE GLL
Sbjct: 207 IIKREAAKRLEQT---SEEPEYAPLPTGPGAVADVKPQSDEAP---SLMKNIYWKEFGLL 260

Query: 267 VFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDG 326
            FVW+AFL +QI KN+  +CS  YWVL+ LQIPVS+ V++YEA+ L  G+RV++SKGD+ 
Sbjct: 261 TFVWLAFLAIQITKNYAPTCSTWYWVLNFLQIPVSVGVTMYEALGLMNGKRVLSSKGDEQ 320

Query: 327 KSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSS 386
            + +  QL  YC FGV AG+VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FS+S
Sbjct: 321 TTLKFHQLCIYCFFGVTAGLVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSAS 380

Query: 387 MSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSA 446
           MSVVEYYLL RFPVPYA YFVAVA +AA +GQHVVR+LI  LGRASLIIFILAFMIFVSA
Sbjct: 381 MSVVEYYLLNRFPVPYAAYFVAVAFVAAIIGQHVVRRLISWLGRASLIIFILAFMIFVSA 440

Query: 447 ISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           ISLGGVGISNMI K  +HEYMGFENLCKYD
Sbjct: 441 ISLGGVGISNMIHKIEQHEYMGFENLCKYD 470


>gi|357158060|ref|XP_003578003.1| PREDICTED: uncharacterized protein LOC100827421 [Brachypodium
           distachyon]
          Length = 469

 Score =  590 bits (1520), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 311/433 (71%), Positives = 366/433 (84%), Gaps = 4/433 (0%)

Query: 45  ERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGI 104
           + ++ + L K  NFLWQ +G+ YHHVWP ++FGW++VLGT+IGFFGA+FGSVGGVGGGGI
Sbjct: 39  QPEEISLLNKIANFLWQSDGNSYHHVWPPMEFGWKVVLGTLIGFFGASFGSVGGVGGGGI 98

Query: 105 FVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPML 164
           FVPML+LIIGFDPKS+TAISKCMI GAAVSTVYYNLKL+HPTLDMP+IDYDLALLIQPML
Sbjct: 99  FVPMLTLIIGFDPKSSTAISKCMITGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPML 158

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNG 224
           MLGISIGV FNVIF DW+VTVLLI+LF+GTSTKAFLKG+ETWK+ETILK+EAA+ L    
Sbjct: 159 MLGISIGVIFNVIFPDWLVTVLLIILFLGTSTKAFLKGLETWKKETILKREAAKRLEQT- 217

Query: 225 AGAGEVEYKSLPSGPRSGPQ-KDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHT 283
             + E EY  LP+GP +    K   +   ++++NI W+E GLL  VW++FL +QI KN+T
Sbjct: 218 --SEEPEYAPLPTGPGAAADVKIPSDEAPSLMKNIYWREFGLLTLVWLSFLAIQITKNYT 275

Query: 284 ASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVL 343
           ++CS  YW+L+ LQIPVS+ V+LYEA+ L  G+RV++SKG +  + +  QL  YC FGV 
Sbjct: 276 STCSTWYWILNFLQIPVSVGVTLYEALGLMNGKRVLSSKGGEQTTLKFHQLCIYCFFGVT 335

Query: 344 AGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA 403
           AG+VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FS+SMSVVEYYLL RFPVPYA
Sbjct: 336 AGLVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSASMSVVEYYLLNRFPVPYA 395

Query: 404 LYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHR 463
           +YFVAVA IAA +GQH VRKLI  LGRASLIIFILAFMIFVSAISLGGVG+SNMI K  R
Sbjct: 396 VYFVAVAFIAAIIGQHAVRKLISWLGRASLIIFILAFMIFVSAISLGGVGVSNMIQKIQR 455

Query: 464 HEYMGFENLCKYD 476
           HEYMGFENLCKYD
Sbjct: 456 HEYMGFENLCKYD 468


>gi|115476246|ref|NP_001061719.1| Os08g0389700 [Oryza sativa Japonica Group]
 gi|40253412|dbj|BAD05341.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|40253788|dbj|BAD05726.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|113623688|dbj|BAF23633.1| Os08g0389700 [Oryza sativa Japonica Group]
          Length = 465

 Score =  585 bits (1507), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 312/431 (72%), Positives = 363/431 (84%), Gaps = 5/431 (1%)

Query: 47  QDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFV 106
           ++  Y+ K VNFLW  E S YHHVWP ++FGW+IVLG +IGFFGAAFGSVGGVGGGGIFV
Sbjct: 38  EEVGYMRKVVNFLWSGEAS-YHHVWPPMEFGWKIVLGILIGFFGAAFGSVGGVGGGGIFV 96

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PML+LIIGFD KS+TAISKCMIMGAAVSTVYYNLKL+HPTLDMP+IDYDLALLIQPMLML
Sbjct: 97  PMLTLIIGFDAKSSTAISKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLML 156

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           GISIGV FNVIF DW++TVLLI+LF+GTSTKAFLKGVETWK+ETILK+EAA+ L      
Sbjct: 157 GISIGVLFNVIFPDWLITVLLIILFLGTSTKAFLKGVETWKKETILKREAAKRLEQI--- 213

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPEV-TILENINWKELGLLVFVWVAFLGLQIAKNHTAS 285
           A E EY  LP+GP +  +  A   E  ++ +N+ WKE GLL FVW+AFL LQ+ KN+  +
Sbjct: 214 AEEPEYSPLPTGPNATAESKAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPT 273

Query: 286 CSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAG 345
           CS  YWVL+ LQIPVS+ V++YE + L +GRRVI+SKGD+  + +  QL+ YC FG++AG
Sbjct: 274 CSTWYWVLNFLQIPVSVGVTMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAG 333

Query: 346 IVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY 405
           +VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FSSSMSVVEYYLL RFPVPYAL+
Sbjct: 334 VVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALF 393

Query: 406 FVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHE 465
           F  VA  AA +GQH+VRKLI  LGRASLIIFIL+FMIF+SAISLGGVGISNMIGK  RHE
Sbjct: 394 FTVVAFFAAIIGQHIVRKLINWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHE 453

Query: 466 YMGFENLCKYD 476
           YMGF+N+C YD
Sbjct: 454 YMGFDNICNYD 464


>gi|218201092|gb|EEC83519.1| hypothetical protein OsI_29108 [Oryza sativa Indica Group]
          Length = 466

 Score =  582 bits (1499), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 309/431 (71%), Positives = 362/431 (83%), Gaps = 5/431 (1%)

Query: 47  QDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFV 106
           ++  Y+ K VNFLW  E S YHHVWP ++FGW+I+LG +IGFFGAAFGSVGGVGGGGIFV
Sbjct: 39  EEVGYMRKVVNFLWSGEAS-YHHVWPPMEFGWKIILGILIGFFGAAFGSVGGVGGGGIFV 97

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PML+LIIGFD KS+TAISKCMIMGAAVSTVYYNLKL+HPTLDMP+IDYDLALLIQPMLML
Sbjct: 98  PMLTLIIGFDAKSSTAISKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLML 157

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           GISIGV FNVIF DW++TVLLI+LF+GTSTKAFLKGVETWK+ETI+K+EAA+ L      
Sbjct: 158 GISIGVLFNVIFPDWLITVLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLEQI--- 214

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPEV-TILENINWKELGLLVFVWVAFLGLQIAKNHTAS 285
             E EY  LP+GP +  +  A   E  ++ +N+ WKE GLL FVW+AFL LQ+ KN+  +
Sbjct: 215 TEEPEYSPLPTGPNATAESKAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPT 274

Query: 286 CSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAG 345
           CS  YWVL+ LQIPVS+ V++YE + L +GRRVI+SKGD+  + +  QL+ YC FG++AG
Sbjct: 275 CSTWYWVLNFLQIPVSVGVTMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAG 334

Query: 346 IVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY 405
           +VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FSSSMSVVEYYLL RFPVPYAL+
Sbjct: 335 VVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALF 394

Query: 406 FVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHE 465
           F  VA  AA +GQH+VRKLI  LGRASLIIFIL+FMIF+SAISLGGVGISNMIGK  RHE
Sbjct: 395 FTTVAFFAAIIGQHIVRKLINWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHE 454

Query: 466 YMGFENLCKYD 476
           YMGF+N+C YD
Sbjct: 455 YMGFDNICNYD 465


>gi|224070901|ref|XP_002303288.1| predicted protein [Populus trichocarpa]
 gi|222840720|gb|EEE78267.1| predicted protein [Populus trichocarpa]
          Length = 426

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 328/426 (76%), Positives = 368/426 (86%), Gaps = 3/426 (0%)

Query: 53  LKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLI 112
           ++ +NFLWQ    GY HVWP++KFGW+IV+GTIIGFFGAA GSVGGVGGGGIFVPML+LI
Sbjct: 1   MRVLNFLWQKGLYGYTHVWPDMKFGWKIVVGTIIGFFGAALGSVGGVGGGGIFVPMLTLI 60

Query: 113 IGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGV 172
           IGFD KS+TAISKCMI GAA STVYYNLKLRHPTLDMP+IDYDLALL QPML+LGISIGV
Sbjct: 61  IGFDAKSSTAISKCMITGAAASTVYYNLKLRHPTLDMPVIDYDLALLFQPMLVLGISIGV 120

Query: 173 AFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEY 232
           AFNVIFADWM+TVLLI+LFIGTSTKAFLKGVETWK+ETILK+EAAR L SN     EVEY
Sbjct: 121 AFNVIFADWMITVLLIILFIGTSTKAFLKGVETWKKETILKQEAARRLESNDDDNEEVEY 180

Query: 233 KSLPSGPRSGPQKDAMEPE---VTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIV 289
           + LP GP  G + +  EP+   V+I+EN+ WKELGLL  VW A L L+I KN+T +CS+ 
Sbjct: 181 QPLPGGPSGGGEAENKEPKKEAVSIIENVYWKELGLLFAVWGAILALEIGKNYTTTCSMA 240

Query: 290 YWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGG 349
           YW L+LLQIPV++ VS YEA+SLYKG R IASKG+ G ++R  QLV YCA GVLAGIVGG
Sbjct: 241 YWALNLLQIPVAVGVSSYEAVSLYKGTRKIASKGETGTNWRAHQLVLYCACGVLAGIVGG 300

Query: 350 LLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAV 409
           LLGLGGGFI+GPLFLELG+PPQVSSATATFAMTFS+SMSVVEYYLLKRFPVPYALYFVAV
Sbjct: 301 LLGLGGGFILGPLFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFVAV 360

Query: 410 ATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGF 469
           AT++AFVGQ VVRKLI LLGRASLIIFILAF IFVSAISLGGVGI NMI K   H+YMGF
Sbjct: 361 ATVSAFVGQFVVRKLINLLGRASLIIFILAFTIFVSAISLGGVGIVNMIEKIEHHDYMGF 420

Query: 470 ENLCKY 475
           EN+C Y
Sbjct: 421 ENICSY 426


>gi|414885094|tpg|DAA61108.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
          Length = 470

 Score =  578 bits (1490), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 305/432 (70%), Positives = 367/432 (84%), Gaps = 4/432 (0%)

Query: 47  QDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFV 106
           ++ + L K  N +W  +G+ Y HVWP ++FGW+IVLG++IGFFGAAFGSVGGVGGGGIFV
Sbjct: 40  EEVSLLRKVANLMWNGDGNSYQHVWPPMEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFV 99

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PML+LIIGFDPKS+TAISKCMIMGA+VSTVYYNLKL+HPTLDMP+IDYDLA+L+QPMLML
Sbjct: 100 PMLTLIIGFDPKSSTAISKCMIMGASVSTVYYNLKLKHPTLDMPVIDYDLAVLMQPMLML 159

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           GISIGV FNVIF DW+VTVLLI+LF+GTSTKAFLKG+ETWK+ET++++EAA+ L    AG
Sbjct: 160 GISIGVIFNVIFPDWLVTVLLIILFLGTSTKAFLKGIETWKKETVIQREAAKLL-EQTAG 218

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPE--VTILENINWKELGLLVFVWVAFLGLQIAKNHTA 284
             E EY +LP+GP     K A+  +   ++++NI+WK++GLL FVWVAFL LQ+ KN+TA
Sbjct: 219 E-EAEYAALPTGPDVAANKKALTSDEAPSLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTA 277

Query: 285 SCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLA 344
           +CS  YWVL+LLQ+PVS+ V+LYE   L  G+RV++SKG    + +  Q+V Y  FG+ A
Sbjct: 278 TCSPWYWVLNLLQVPVSVGVTLYEGFGLMSGKRVLSSKGSGQTTMKFHQVVVYGLFGIAA 337

Query: 345 GIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYAL 404
           G+VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FSSSMSVVEYYLL RFPVPYA+
Sbjct: 338 GLVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYAV 397

Query: 405 YFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRH 464
           YF+ VA +AA +GQHVVRKLI  LGRASLIIFILAFMIFVSAISLGGVG+SNM+ K  RH
Sbjct: 398 YFIVVAFVAAIIGQHVVRKLINWLGRASLIIFILAFMIFVSAISLGGVGVSNMVHKIERH 457

Query: 465 EYMGFENLCKYD 476
           EYMGFE+LCKYD
Sbjct: 458 EYMGFEDLCKYD 469


>gi|242044426|ref|XP_002460084.1| hypothetical protein SORBIDRAFT_02g022510 [Sorghum bicolor]
 gi|241923461|gb|EER96605.1| hypothetical protein SORBIDRAFT_02g022510 [Sorghum bicolor]
          Length = 470

 Score =  576 bits (1485), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 304/444 (68%), Positives = 365/444 (82%), Gaps = 4/444 (0%)

Query: 35  KIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFG 94
           ++  +R      ++ +   K  N +W+ +G+ Y HVWP ++ GW+IVLG++IGFFGAAFG
Sbjct: 28  RLSLVRAAVAPEEEMSLFRKVANLMWKSDGNSYQHVWPTMELGWQIVLGSLIGFFGAAFG 87

Query: 95  SVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDY 154
           SVGGVGGGGIFVPML+LIIGFDPKS+TAISKCMIMGA+VSTVYYNLKL+HPTLDMP+IDY
Sbjct: 88  SVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGASVSTVYYNLKLKHPTLDMPVIDY 147

Query: 155 DLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKK 214
           DLA+LIQPMLMLGISIGV FNVIF DW+VTVLLI+LF+GTSTKAFLKG+ETWK+ETI+++
Sbjct: 148 DLAVLIQPMLMLGISIGVIFNVIFPDWLVTVLLIILFLGTSTKAFLKGIETWKKETIIQR 207

Query: 215 EAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPE--VTILENINWKELGLLVFVWVA 272
           EAA+ L        E EY  LP+GP +   K  +  +   ++++NI+WKE+GLL FVWVA
Sbjct: 208 EAAKRL--EQIAGEEAEYAPLPTGPGAAANKKTLSSDEAPSLIKNIHWKEVGLLSFVWVA 265

Query: 273 FLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVF 332
           FL LQ+ KN+TA+CS  YWVL+LLQ+PVS+ V+LYE   L  G+RV++SKG +  + +  
Sbjct: 266 FLVLQVTKNYTATCSPWYWVLNLLQVPVSVGVTLYEGFGLMSGKRVLSSKGSEQTTMKFH 325

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           Q+V Y  FG+ AG+VGGLLGLGGGFIMGPLFLELG+PPQVSSATATFAM FSSSMSVVEY
Sbjct: 326 QVVVYGLFGIAAGLVGGLLGLGGGFIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEY 385

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGV 452
           YLL RFPVPYA+YF  VA  AA +GQH VRKLI  LGRASLIIFILAFMIFVSA+SLGGV
Sbjct: 386 YLLNRFPVPYAVYFTIVAFAAALIGQHAVRKLINWLGRASLIIFILAFMIFVSALSLGGV 445

Query: 453 GISNMIGKFHRHEYMGFENLCKYD 476
           GISNM+ K  RHEYMGFENLCKYD
Sbjct: 446 GISNMVHKIARHEYMGFENLCKYD 469


>gi|255551044|ref|XP_002516570.1| conserved hypothetical protein [Ricinus communis]
 gi|223544390|gb|EEF45911.1| conserved hypothetical protein [Ricinus communis]
          Length = 465

 Score =  570 bits (1470), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 309/478 (64%), Positives = 375/478 (78%), Gaps = 18/478 (3%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETL-RFNGTERQDSNYLLKAVNFL 59
           MA  GAK+ +L+  +++     L  +       +K E + R +G + ++      +   L
Sbjct: 1   MAVFGAKFLVLKCTVIVLFFSFLFLLVSAERLSIKHEQVPRLSGEKEKEKQEAESS---L 57

Query: 60  WQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKS 119
           WQ +GSGY HVWP ++ GW++V+G+IIGFFGAAFGSVGGVGGGGIFVPML+LIIGFDPKS
Sbjct: 58  WQSDGSGYQHVWPGMELGWQVVVGSIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKS 117

Query: 120 ATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFA 179
           ATAISKCMIMGAA+STVYYNLKLRHPTLD+PI+DYDL LLIQP ++LGIS GV  NV F 
Sbjct: 118 ATAISKCMIMGAAMSTVYYNLKLRHPTLDLPIVDYDLVLLIQPAVLLGISTGVTLNVFFP 177

Query: 180 DWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGP 239
           DW VTVLLI+L IGTS KA+ KGV+TW +ET LK+EAA+            +YK LP GP
Sbjct: 178 DWAVTVLLIILLIGTSVKAYFKGVDTWNKETTLKEEAAK------------QYKLLPGGP 225

Query: 240 RS-GP-QKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQ 297
            +  P QK   E EV +LENI+WKE  LLVF WVA+L LQIAKN+TA+CS  YWVL+LLQ
Sbjct: 226 SNVNPRQKHTREREVPVLENIHWKEFALLVFDWVAYLALQIAKNYTATCSTAYWVLNLLQ 285

Query: 298 IPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGF 357
           +PVS+ V LY A+ LY+GRR IASKGD+G + + +QL+++CA+GV+AGIVGGLLGLGGGF
Sbjct: 286 VPVSVGVFLYAAVGLYRGRRRIASKGDEGMNLKAYQLLAFCAYGVMAGIVGGLLGLGGGF 345

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG 417
           IMGPLFLELG+PPQV+SATATF M FSSSMSV EYYLL RFPVPYALYF+AVATIAAF G
Sbjct: 346 IMGPLFLELGIPPQVTSATATFGMAFSSSMSVAEYYLLSRFPVPYALYFIAVATIAAFAG 405

Query: 418 QHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           QH++ KLI + GRASLIIF+LAF IFVSA++LGGVGISN+IGK  + E M F+NLC Y
Sbjct: 406 QHIITKLIAITGRASLIIFVLAFTIFVSALTLGGVGISNIIGKIEKGESMEFQNLCNY 463


>gi|125603285|gb|EAZ42610.1| hypothetical protein OsJ_27176 [Oryza sativa Japonica Group]
          Length = 442

 Score =  553 bits (1424), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 300/408 (73%), Positives = 346/408 (84%), Gaps = 6/408 (1%)

Query: 72  PE--LKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIM 129
           PE  ++FGW+IVLG +IGFFGAAFGSVGGVGGGGIFVPML+LIIGFD KS+TAISKCMIM
Sbjct: 37  PEEPMEFGWKIVLGKLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAISKCMIM 96

Query: 130 GAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIV 189
           GAAVSTVYYNLKL+HPTLDMP+IDYDLALLIQPMLMLGISIGV FNVIF DW++TVLLI+
Sbjct: 97  GAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLITVLLII 156

Query: 190 LFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAME 249
           LF+GTSTKAFLKGVETWK+ETILK+EAA+ L      A E EY  LP+GP +  +  A  
Sbjct: 157 LFLGTSTKAFLKGVETWKKETILKREAAKRLEQI---AEEPEYSPLPTGPNATAESKAPS 213

Query: 250 PEVTIL-ENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYE 308
            E   L +N+ WKE GLL FVW+AFL LQ+ KN+  +CS  YWVL+ LQIPVS+ V++YE
Sbjct: 214 DEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLNFLQIPVSVGVTMYE 273

Query: 309 AISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGV 368
            + L +GRRVI+SKGD+  + +  QL+ YC FG++AG+VGGLLGLGGGFIMGPLFLELG+
Sbjct: 274 GLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLGGGFIMGPLFLELGI 333

Query: 369 PPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
           PPQVSSATATFAM FSSSMSVVEYYLL RFPVPYAL+F  VA  AA +GQH+VRKLI  L
Sbjct: 334 PPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTVVAFFAAIIGQHIVRKLINWL 393

Query: 429 GRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           GRASLIIFIL+FMIF+SAISLGGVGISNMIGK  RHEYMGF+N+C YD
Sbjct: 394 GRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNICNYD 441


>gi|414885096|tpg|DAA61110.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
          Length = 404

 Score =  551 bits (1421), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 295/405 (72%), Positives = 351/405 (86%), Gaps = 4/405 (0%)

Query: 74  LKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAV 133
           ++FGW+IVLG++IGFFGAAFGSVGGVGGGGIFVPML+LIIGFDPKS+TAISKCMIMGA+V
Sbjct: 1   MEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGASV 60

Query: 134 STVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIG 193
           STVYYNLKL+HPTLDMP+IDYDLA+L+QPMLMLGISIGV FNVIF DW+VTVLLI+LF+G
Sbjct: 61  STVYYNLKLKHPTLDMPVIDYDLAVLMQPMLMLGISIGVIFNVIFPDWLVTVLLIILFLG 120

Query: 194 TSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPE-- 251
           TSTKAFLKG+ETWK+ET++++EAA+ L    AG  E EY +LP+GP     K A+  +  
Sbjct: 121 TSTKAFLKGIETWKKETVIQREAAKLL-EQTAGE-EAEYAALPTGPDVAANKKALTSDEA 178

Query: 252 VTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAIS 311
            ++++NI+WK++GLL FVWVAFL LQ+ KN+TA+CS  YWVL+LLQ+PVS+ V+LYE   
Sbjct: 179 PSLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQVPVSVGVTLYEGFG 238

Query: 312 LYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
           L  G+RV++SKG    + +  Q+V Y  FG+ AG+VGGLLGLGGGFIMGPLFLELG+PPQ
Sbjct: 239 LMSGKRVLSSKGSGQTTMKFHQVVVYGLFGIAAGLVGGLLGLGGGFIMGPLFLELGIPPQ 298

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
           VSSATATFAM FSSSMSVVEYYLL RFPVPYA+YF+ VA +AA +GQHVVRKLI  LGRA
Sbjct: 299 VSSATATFAMMFSSSMSVVEYYLLNRFPVPYAVYFIVVAFVAAIIGQHVVRKLINWLGRA 358

Query: 432 SLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           SLIIFILAFMIFVSAISLGGVG+SNM+ K  RHEYMGFE+LCKYD
Sbjct: 359 SLIIFILAFMIFVSAISLGGVGVSNMVHKIERHEYMGFEDLCKYD 403


>gi|356556244|ref|XP_003546436.1| PREDICTED: uncharacterized protein LOC100816528 [Glycine max]
          Length = 476

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 297/432 (68%), Positives = 354/432 (81%), Gaps = 3/432 (0%)

Query: 48  DSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVP 107
           ++++L K +NFLW   GSGY H WPE++FGW I+ GTIIGF G+AFG+VGGVGGGGIFV 
Sbjct: 44  ETSFLGKVMNFLWSSSGSGYEHTWPEIEFGWRIITGTIIGFVGSAFGTVGGVGGGGIFVT 103

Query: 108 MLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLG 167
           MLSLIIGFD KSATAISKCMI G A +TV+YNL+ +HPTLDMP+IDYDLALL QP+L+LG
Sbjct: 104 MLSLIIGFDAKSATAISKCMITGGAAATVFYNLRQKHPTLDMPVIDYDLALLFQPVLVLG 163

Query: 168 ISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG- 226
           ISIGVAFNVIFADWM+TVLL+++F+G +TKAFLKGVETWK+ETI+KKE AR    NG   
Sbjct: 164 ISIGVAFNVIFADWMITVLLLIIFVGIATKAFLKGVETWKKETIIKKETARQSQFNGTER 223

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPEVT--ILENINWKELGLLVFVWVAFLGLQIAKNHTA 284
             EV Y+ LP GP +   K+  + + T  ILEN+ WK LG+L  VWV  L  +IAK+HT 
Sbjct: 224 TAEVAYEPLPGGPNTSNHKEPKKSKETGSILENVRWKALGVLFTVWVLILASEIAKSHTT 283

Query: 285 SCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLA 344
           +CS+ YWVL+LLQ+PV+L  + Y+A+ LY G+RVIASKGD    +R  QL+ YC+ G+ A
Sbjct: 284 TCSVEYWVLNLLQVPVALGATSYQAVLLYTGKRVIASKGDQRTQWRAHQLIMYCSCGICA 343

Query: 345 GIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYAL 404
           GIVGGLLGLGGGFI+GPLFLELG+PPQVSSATATFAMTFS+SMSVVEYYLLKRFP+PY L
Sbjct: 344 GIVGGLLGLGGGFILGPLFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPIPYTL 403

Query: 405 YFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRH 464
           YFVAV+T AAFVGQ +VRKL+ +LGRASLIIFIL+  IFVSAISLGGVGISNMI +    
Sbjct: 404 YFVAVSTFAAFVGQVLVRKLVAILGRASLIIFILSSTIFVSAISLGGVGISNMIQRIANK 463

Query: 465 EYMGFENLCKYD 476
           EYMGFENLC Y 
Sbjct: 464 EYMGFENLCTYS 475


>gi|326510405|dbj|BAJ87419.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 474

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 300/449 (66%), Positives = 356/449 (79%), Gaps = 15/449 (3%)

Query: 39  LRFNG--TERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSV 96
           L F G  T   +++YL K  N LWQ +G+ YHHVWP LK GW+IVLG+ IGFFGAAFGS+
Sbjct: 27  LSFAGGATPPGEASYLRKLANSLWQSDGTSYHHVWPPLKLGWKIVLGSFIGFFGAAFGSI 86

Query: 97  GGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDL 156
           GGVGGGG+FVPML+LIIGFDPKS+ A+SKCMIMGAAVSTVY N+KL+HPTLDMP+IDYDL
Sbjct: 87  GGVGGGGVFVPMLTLIIGFDPKSSAAMSKCMIMGAAVSTVYCNVKLKHPTLDMPVIDYDL 146

Query: 157 ALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEA 216
           ALLIQPMLMLG+SIGV  NVIF DW+VTVLLI+LFI TSTKAFLK VETWK+ETI    A
Sbjct: 147 ALLIQPMLMLGVSIGVICNVIFPDWLVTVLLIILFIVTSTKAFLKAVETWKKETITIAVA 206

Query: 217 ARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAM----------EPEVTILENINWKELGLL 266
            +    +   + E EY  L +G  +  Q + +          +  V+I +N+ WKE+GLL
Sbjct: 207 TK---RSEETSEEQEYMQLSAGLDAASQSEILSDHDQNETPPDEAVSIWKNVYWKEVGLL 263

Query: 267 VFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDG 326
            FVWVAFL LQ+AKN+ A+CSI YWVL+LLQIPVS+ V++YEA+ L +GRRVI+SK ++ 
Sbjct: 264 GFVWVAFLALQLAKNYMATCSISYWVLNLLQIPVSVGVAMYEAVGLMQGRRVISSKRNEE 323

Query: 327 KSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSS 386
            S R  QL+ YC  GV AG+V GLLG+GGG IMGPLFLELGVPPQV+SATATFAM FS+S
Sbjct: 324 NSLRAHQLLVYCFLGVTAGVVAGLLGVGGGSIMGPLFLELGVPPQVASATATFAMMFSAS 383

Query: 387 MSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSA 446
           MSVVEYYLL RFPVPYALY   +A +AA +GQ VVRKLI LLGRAS+IIFIL+FMIF+SA
Sbjct: 384 MSVVEYYLLNRFPVPYALYLTILAFLAAIIGQRVVRKLIDLLGRASIIIFILSFMIFISA 443

Query: 447 ISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           +SLGGVGISN I K  R EYMGFEN+CKY
Sbjct: 444 LSLGGVGISNTIHKIARREYMGFENICKY 472


>gi|356547396|ref|XP_003542098.1| PREDICTED: uncharacterized protein LOC100789733 [Glycine max]
          Length = 487

 Score =  548 bits (1413), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 301/462 (65%), Positives = 361/462 (78%), Gaps = 11/462 (2%)

Query: 26  VCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTI 85
           V V A + +  + +     E   +++L K VNFLW   GSGY H WP+++FGW I+ GTI
Sbjct: 25  VSVSALQHVSHKNINPTTVETTQTSFLGKVVNFLWSSSGSGYQHTWPDIEFGWRIITGTI 84

Query: 86  IGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP 145
           IGF G+AFG+VGGVGGGGIFV MLSLIIGFD KSATAISKCMI G A +TV+YNLK +HP
Sbjct: 85  IGFLGSAFGTVGGVGGGGIFVTMLSLIIGFDAKSATAISKCMITGGAAATVFYNLKQKHP 144

Query: 146 TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVET 205
           TLDMP+IDYDLALL QP+L+LGISIGVAFNVIFADWM+TVLL+++F+G +TKAFLKGVET
Sbjct: 145 TLDMPVIDYDLALLFQPVLVLGISIGVAFNVIFADWMITVLLLIIFVGIATKAFLKGVET 204

Query: 206 WKRETILKKEAARCLGSNGAG-AGEVEYKSLPSGPRSGPQKDAMEPEVT----------I 254
           WK+ETI+KKE AR    NG   + EV Y+ LP GP +    +  + + T          +
Sbjct: 205 WKKETIIKKETARQSQFNGTERSEEVAYEPLPGGPNTSNHNEPKKSKETTVRLRHHKGSV 264

Query: 255 LENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYK 314
           LEN+ WK LG+L  VWV  L  +IAK+HT +CS+ YW+L+LLQ+PV+L  + Y+A+ LY 
Sbjct: 265 LENVRWKALGVLFTVWVLILASEIAKSHTTTCSVEYWILNLLQVPVALGATSYQAVLLYT 324

Query: 315 GRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSS 374
           G+RVIASKGD    +R  QLV YC+ G+ AGIVGGLLGLGGGFI+GPLFLELG+PPQVSS
Sbjct: 325 GKRVIASKGDQRTQWRAHQLVLYCSCGICAGIVGGLLGLGGGFILGPLFLELGIPPQVSS 384

Query: 375 ATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLI 434
           ATATFAMTFS+SMSVVEYYLLKRFP+PY LYFVAV+T AAFVGQ +VRKL+ +LGRASLI
Sbjct: 385 ATATFAMTFSASMSVVEYYLLKRFPIPYTLYFVAVSTFAAFVGQVLVRKLVAILGRASLI 444

Query: 435 IFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           IFIL+  IFVSAISLGGVGISNMI K    EYMGFENLC Y 
Sbjct: 445 IFILSGTIFVSAISLGGVGISNMIQKIANKEYMGFENLCTYS 486


>gi|357147619|ref|XP_003574412.1| PREDICTED: uncharacterized protein LOC100846223 [Brachypodium
           distachyon]
          Length = 468

 Score =  546 bits (1406), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 308/474 (64%), Positives = 374/474 (78%), Gaps = 8/474 (1%)

Query: 4   LGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPE 63
           +G +W    +V  + ++  +    V A+RG ++        E +D ++L K  +++WQ +
Sbjct: 1   MGRQW---HAVAALCISCTVMAAAVGADRG-RLFASTAAAPEPEDVSFLRKVASYMWQKD 56

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
           G+ YHHVWP +KFGW+IVLG+  GF G AFGS+GGVGGGGIFVPML+LIIGFDPKSA A+
Sbjct: 57  GTTYHHVWPPMKFGWKIVLGSFFGFIGGAFGSIGGVGGGGIFVPMLTLIIGFDPKSAAAM 116

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGAAVSTVY NLKL+HPTLDMP+IDYDL LLIQPMLMLG+SIGV  NVIF +W+V
Sbjct: 117 SKCMIMGAAVSTVYCNLKLKHPTLDMPVIDYDLVLLIQPMLMLGVSIGVICNVIFPNWLV 176

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGP 243
           TVLLI+LF+ TSTKAFLKGVETWK+ETI+ KEA +    +   + E EY  LP+GP S  
Sbjct: 177 TVLLIILFLVTSTKAFLKGVETWKKETIIIKEAEK---RSEQISEETEYTPLPTGPDSPA 233

Query: 244 QKDAMEPE-VTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSL 302
           +      E V+I +N++WKE GLLVFVWVAFL LQ+ KN+ A+CS  YWVL+LLQIPVS+
Sbjct: 234 ETITPSDEAVSIWKNVHWKEFGLLVFVWVAFLALQVTKNYIATCSTWYWVLNLLQIPVSV 293

Query: 303 VVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPL 362
            V++YEA+ L +G+RVI+SK +D  S +V QL+ YC  G++AG+V GLLG+GGG IMGPL
Sbjct: 294 GVAMYEAVGLLQGKRVISSKANDQTSLKVHQLLLYCFLGLMAGVVAGLLGVGGGSIMGPL 353

Query: 363 FLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVR 422
           FLELGVPPQVSSATATFAM FSSSMSVVEY+LL RFPVPYAL+F  +A  AA VGQ VVR
Sbjct: 354 FLELGVPPQVSSATATFAMMFSSSMSVVEYFLLNRFPVPYALFFTVLAFFAAIVGQRVVR 413

Query: 423 KLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           KLI LLGRASLI+FIL+ MIFVSA SLGGVGISNMI K  RHEYMGFEN+CKYD
Sbjct: 414 KLIGLLGRASLIVFILSMMIFVSAFSLGGVGISNMIHKITRHEYMGFENICKYD 467


>gi|224094753|ref|XP_002310222.1| predicted protein [Populus trichocarpa]
 gi|222853125|gb|EEE90672.1| predicted protein [Populus trichocarpa]
          Length = 454

 Score =  546 bits (1406), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 306/460 (66%), Positives = 369/460 (80%), Gaps = 9/460 (1%)

Query: 17  IFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKF 76
           IFL   L      AER LK +       +     Y++K ++FL+Q   S Y  VWP++KF
Sbjct: 1   IFLVQVLMTNMASAERLLKDKEAETLVKKEMSPGYIVKMIHFLFQGGKSSYEPVWPDMKF 60

Query: 77  GWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTV 136
            W +V+GTI+GFFGAA GSVGGVGGGGIFVPML+LIIGFDPKS+TAISKCMIMGAA +TV
Sbjct: 61  DWRVVVGTIVGFFGAALGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGAAGATV 120

Query: 137 YYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTST 196
           YYNL+LRHPTLDMPIIDYDLALL QPMLMLGISIGVAFNV+FADWMVTVLLI+LFIGTST
Sbjct: 121 YYNLRLRHPTLDMPIIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIGTST 180

Query: 197 KAFLKGVETWKRETILKKEAARCLGSN---GAGAGEVEYKSLPSGPRSGPQKDAMEPEVT 253
           KA  KG++TWK+E+++KKEAA+ L S    G GA E EY+ LPSGP         + EV 
Sbjct: 181 KALFKGIDTWKKESVMKKEAAKQLESESKPGDGAEE-EYQPLPSGPVV-----LHDDEVP 234

Query: 254 ILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLY 313
           + ENI WKEL LLV+VWVAFL +QI + +  +CS++YW+L+ LQ+P++  V+L+EAI L 
Sbjct: 235 LRENIRWKELALLVYVWVAFLVVQIVQTNVPTCSVMYWILNSLQVPIAASVTLFEAICLC 294

Query: 314 KGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVS 373
           KG RVIASKG +  ++++ Q++ YC+ G++AGIVGGLLGLGGGFI+GPLFLELG+PPQV+
Sbjct: 295 KGTRVIASKGKEITNWKLHQVLLYCSCGIIAGIVGGLLGLGGGFILGPLFLELGIPPQVA 354

Query: 374 SATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASL 433
           SAT+TFAM FSSSMSVVEYYLL RFPVPYA YFV VATIAAF GQHV+R++I +LGRAS+
Sbjct: 355 SATSTFAMAFSSSMSVVEYYLLNRFPVPYAAYFVLVATIAAFAGQHVIRRIIAILGRASI 414

Query: 434 IIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           IIFILA  IFVSAISLGGVGI+NM+ K    EYMGFENLC
Sbjct: 415 IIFILALTIFVSAISLGGVGIANMVEKLENDEYMGFENLC 454


>gi|356496755|ref|XP_003517231.1| PREDICTED: uncharacterized protein LOC100801441 [Glycine max]
          Length = 470

 Score =  540 bits (1390), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 302/469 (64%), Positives = 368/469 (78%), Gaps = 15/469 (3%)

Query: 5   GAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEG 64
            A W +L  + MI  N +LA      +R LK + L  N   ++    L   VNFLW+   
Sbjct: 14  AATWMVLCIITMI-CNVSLA------DRILKEKELG-NDVPKERQGILKAIVNFLWEEGK 65

Query: 65  SGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
           S Y  VWP +KFGW I++G+IIGF GAA GSVGGVGGGGIFVPML+LIIGFDPKS+TAIS
Sbjct: 66  SSYEPVWPNMKFGWRIIVGSIIGFLGAALGSVGGVGGGGIFVPMLALIIGFDPKSSTAIS 125

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVT 184
           KCMIMGA++STVYYNL+LRHPTLDMP+IDYDLAL+ QPMLMLGISIGV  NV+FADWMVT
Sbjct: 126 KCMIMGASISTVYYNLRLRHPTLDMPLIDYDLALIFQPMLMLGISIGVICNVMFADWMVT 185

Query: 185 VLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQ 244
           VLLI+LFI TSTKA  KG++TWK+ETI KKEA++ L +      +  YKSLPSGP     
Sbjct: 186 VLLIILFIATSTKATYKGIDTWKKETIAKKEASKLLEAEPKSGDD--YKSLPSGPT---- 239

Query: 245 KDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVV 304
            +++  E  +L+NI WKEL LL +VWVAF  +QI K +T  CSI +W+L+ LQ+PV++ V
Sbjct: 240 -ESLFEEAPLLKNIYWKELSLLAYVWVAFFIVQIVKEYTKPCSIQFWLLNFLQVPVAVSV 298

Query: 305 SLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL 364
           +L+EAI LYKG RVIASKG +  ++++ Q+  YC+ G++AG+VGGLLGLGGGFI+GPLFL
Sbjct: 299 TLFEAIGLYKGTRVIASKGKEVTNWKIHQICLYCSTGIMAGMVGGLLGLGGGFILGPLFL 358

Query: 365 ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           ELG+PPQV+SAT+TFAM FSSSMSVV+YYLL RFPVPYA YF  VATIAAF GQHVVRK+
Sbjct: 359 ELGIPPQVASATSTFAMVFSSSMSVVQYYLLDRFPVPYASYFALVATIAAFTGQHVVRKV 418

Query: 425 IILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           I++LGRAS+IIFILA  IF+SAISLGGVGI N+I K   HEYMGFE+LC
Sbjct: 419 IVVLGRASIIIFILALTIFISAISLGGVGIENIIEKIENHEYMGFEDLC 467


>gi|255561190|ref|XP_002521607.1| conserved hypothetical protein [Ricinus communis]
 gi|223539285|gb|EEF40878.1| conserved hypothetical protein [Ricinus communis]
          Length = 478

 Score =  539 bits (1388), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 290/432 (67%), Positives = 359/432 (83%), Gaps = 9/432 (2%)

Query: 45  ERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGI 104
             ++ ++L KA +FLWQ   S Y  VWPE++F ++IV+GTI+GF GAA GSVGGVGGGGI
Sbjct: 52  HEKEKSFLQKAAHFLWQSGKSSYEPVWPEMEFNYKIVVGTIVGFLGAALGSVGGVGGGGI 111

Query: 105 FVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPML 164
           FVPML+L+IGFDPKS+TAISKCMIMGAA STVYYN++LRHPTLDMP+IDYDLALL QPML
Sbjct: 112 FVPMLTLLIGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPTLDMPLIDYDLALLFQPML 171

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNG 224
           MLGISIGVAFNV+FADWMVTVLLI+LFIGTSTKA  KG++TWK+E+++KKEAAR L S  
Sbjct: 172 MLGISIGVAFNVMFADWMVTVLLIILFIGTSTKALFKGIDTWKKESMMKKEAARQLESES 231

Query: 225 A---GAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKN 281
               G G+ +YK LPSGP +       + E+++ +NI WKEL LLV+VWV FL +QI K+
Sbjct: 232 KPNDGEGQ-DYKPLPSGPVA-----LEDEEISLFQNIYWKELALLVYVWVGFLAVQIVKS 285

Query: 282 HTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFG 341
           +  +CS+ YW+L+ LQ+P++  V+L+EAI L KG RVIAS+G +  +++  Q++ YC+ G
Sbjct: 286 YVRTCSVAYWILNALQVPIAASVTLFEAICLRKGTRVIASRGKEITNWKYHQILLYCSCG 345

Query: 342 VLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVP 401
           ++AG+VGGLLGLGGGFI+GPLFLELG+PPQV+SAT+TFAM FSSSMSVV+YYLLKRFPVP
Sbjct: 346 IIAGMVGGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLKRFPVP 405

Query: 402 YALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKF 461
           YA +FV VAT+AA VGQHVVRK+I +LGRAS+IIFILA  IFVSAISLGGVGI +M+ K 
Sbjct: 406 YATFFVLVATVAALVGQHVVRKIIAILGRASIIIFILALTIFVSAISLGGVGIVDMVEKL 465

Query: 462 HRHEYMGFENLC 473
              EYMGFENLC
Sbjct: 466 TNEEYMGFENLC 477


>gi|224134334|ref|XP_002327812.1| predicted protein [Populus trichocarpa]
 gi|222836897|gb|EEE75290.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  536 bits (1381), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 295/461 (63%), Positives = 363/461 (78%), Gaps = 7/461 (1%)

Query: 16  MIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELK 75
           +IFL   L      AER LK +    +  +     Y++K ++FL+Q   S Y  VWP++K
Sbjct: 20  LIFLVQVLMTNMASAERLLKDKEPEASAEKEMSPGYIVKVLHFLFQGGKSSYEPVWPDMK 79

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F W IV+GTI+GF GAA GSVGGVGGGGIFVPML+LIIGFDPKS+TAISKCMIMGAA +T
Sbjct: 80  FDWRIVVGTIVGFLGAALGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGAAGAT 139

Query: 136 VYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTS 195
           VYYNL+LRHPTLDMP+IDYDL LL QPMLMLGISIGV+F+V+FADWMVTVLLI+LFIGTS
Sbjct: 140 VYYNLRLRHPTLDMPLIDYDLTLLFQPMLMLGISIGVSFSVMFADWMVTVLLIILFIGTS 199

Query: 196 TKAFLKGVETWKRETILKKEAARCLGSNG--AGAGEVEYKSLPSGPRSGPQKDAMEPEVT 253
           TKA  KG++TWK+ET+++K+AA+ L S        E +YK LPSGP   P     + EV 
Sbjct: 200 TKALFKGIDTWKKETMMRKDAAKQLESESKPEDGAEEDYKPLPSGPVVLP-----DDEVP 254

Query: 254 ILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLY 313
           + ENI WKE+ LL +VW  FL +QI + +  +CS+ YW+L+ LQ+P++  V+L+EAI L 
Sbjct: 255 LRENIYWKEVALLFYVWAGFLVVQIVQTYFPTCSVTYWILNSLQVPIAASVTLFEAICLC 314

Query: 314 KGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVS 373
           KG RVIASKG +  ++++ Q++ YC+ G++AG+VGGLLGLGGGFI+GPLFLELG+PPQV+
Sbjct: 315 KGTRVIASKGKEITNWKLHQILLYCSCGIIAGMVGGLLGLGGGFILGPLFLELGIPPQVA 374

Query: 374 SATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASL 433
           SAT+TFAM FSSSMSVVEYYLL RFPVPYA YF  VATI+AFVGQHVVRK+I  LGRASL
Sbjct: 375 SATSTFAMAFSSSMSVVEYYLLNRFPVPYAAYFALVATISAFVGQHVVRKIIAFLGRASL 434

Query: 434 IIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
           IIFILA  IFVSA+SLGGVGI+NM+ K    EYMGFENLC+
Sbjct: 435 IIFILALTIFVSAVSLGGVGIANMVEKLENEEYMGFENLCQ 475


>gi|125561413|gb|EAZ06861.1| hypothetical protein OsI_29096 [Oryza sativa Indica Group]
          Length = 469

 Score =  533 bits (1374), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 279/428 (65%), Positives = 344/428 (80%), Gaps = 8/428 (1%)

Query: 50  NYLLKAVNFLWQPEG-SGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPM 108
           ++L K  NFLW+  G + YHHVWP ++FGW+IVLG+ +GF GAAFGS+GGVGGGG FVPM
Sbjct: 48  SHLRKVANFLWRSGGENSYHHVWPPMEFGWQIVLGSFVGFIGAAFGSIGGVGGGGFFVPM 107

Query: 109 LSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGI 168
           L+LIIGFD KS+ AISKCMIMGAAVSTVY NLK +HPTLDMP+IDYDLALLIQPMLMLGI
Sbjct: 108 LTLIIGFDAKSSVAISKCMIMGAAVSTVYCNLKRKHPTLDMPVIDYDLALLIQPMLMLGI 167

Query: 169 SIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAG 228
           SIGV FNVIF DW+VTVLLI+LF+GTSTKAFLKG+ETWK+ETI+K+EA +    +   + 
Sbjct: 168 SIGVIFNVIFPDWLVTVLLIILFLGTSTKAFLKGIETWKKETIIKREAEK---RSEQTSE 224

Query: 229 EVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSI 288
           E+EY+ +P+     P  +A    V+IL N+ WKE GLL FVW+AFL LQ+ KN+  +CS 
Sbjct: 225 ELEYRPVPASESKPPSDEA----VSILHNVYWKEFGLLAFVWIAFLALQVTKNYMPTCST 280

Query: 289 VYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVG 348
            YWVL+LLQIPVS+ V++YE + L +GRRVI+S G++  + +  QL+ YC FG+ AGIV 
Sbjct: 281 WYWVLNLLQIPVSVGVTMYEGLGLMQGRRVISSNGNEQTNLKFHQLLMYCFFGITAGIVA 340

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVA 408
           GLLG+GGG I+GP+FL+LGVPPQV+SATATF+M FS+SMS VEYY L RFPVPYALY   
Sbjct: 341 GLLGVGGGSILGPMFLDLGVPPQVASATATFSMMFSASMSAVEYYFLDRFPVPYALYLTV 400

Query: 409 VATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMG 468
           VA  +A VGQ +VRK+I  LGRAS+IIF L+ MIF+S I LGG+GI N IGK  RHEYMG
Sbjct: 401 VAFFSAIVGQRMVRKVINWLGRASIIIFTLSIMIFLSTIPLGGIGIVNWIGKIERHEYMG 460

Query: 469 FENLCKYD 476
           FE++CK+D
Sbjct: 461 FEDICKFD 468


>gi|356509743|ref|XP_003523605.1| PREDICTED: uncharacterized protein LOC100816633 [Glycine max]
          Length = 474

 Score =  533 bits (1372), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 293/452 (64%), Positives = 360/452 (79%), Gaps = 12/452 (2%)

Query: 26  VCVKAERGLKIETLR-FNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGT 84
           VC+ AER LK +    F   ERQ    +   ++F W         VWPE+KFGW IV+G+
Sbjct: 30  VCL-AERVLKDQKPESFVVKERQ--GVMNSIIDFFWNDGEPTNDRVWPEMKFGWRIVVGS 86

Query: 85  IIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRH 144
           I+GFFGAA GSVGGVGGGGIF+PML+L+IGFD KS+TA+SKCMIMGAAVSTVYYNL+ RH
Sbjct: 87  IVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCMIMGAAVSTVYYNLRFRH 146

Query: 145 PTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVE 204
           PTLD+P+IDYDLALL QPMLMLGISIGVAFNV+FADWMVTVLLI+LFI TSTKA  KG++
Sbjct: 147 PTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIATSTKALFKGID 206

Query: 205 TWKRETILKKEAARCLGSNGAG--AGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKE 262
           TWK+ETI+KKEAA+ L S+ +     E +YKSLP+G       D  + EV +L+NI WKE
Sbjct: 207 TWKKETIMKKEAAKMLESDSSPGYVSEEDYKSLPAG-----SADPRDEEVPLLKNIYWKE 261

Query: 263 LGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASK 322
           L +L +VWVAFL +QI K +T +CSI+YWVL+ LQ+P+++ V+LYEAI L  G RVIASK
Sbjct: 262 LLVLAYVWVAFLIVQIIKTYTKTCSILYWVLNSLQVPIAISVTLYEAICLCNGTRVIASK 321

Query: 323 GDDGKSF-RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 381
           G +   + ++ ++  YC+ G++AGIV GLLGLGGGFI+GPLFLELG+PPQV+SAT+TFAM
Sbjct: 322 GKENTDWMKLHKICLYCSCGIIAGIVSGLLGLGGGFILGPLFLELGIPPQVASATSTFAM 381

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
            FSSSMSVV+YYLL+RFPVPYA YF+ VATIAA  GQHVVRK+I + GRAS+IIF+LAF 
Sbjct: 382 VFSSSMSVVQYYLLERFPVPYASYFILVATIAALTGQHVVRKIIAIFGRASIIIFVLAFT 441

Query: 442 IFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           IF+SAISLGGVGI NM+ K   +EYMGF N+C
Sbjct: 442 IFLSAISLGGVGIENMVEKMENNEYMGFANIC 473


>gi|225430784|ref|XP_002267318.1| PREDICTED: uncharacterized protein LOC100253061 [Vitis vinifera]
          Length = 476

 Score =  531 bits (1368), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 300/447 (67%), Positives = 365/447 (81%), Gaps = 9/447 (2%)

Query: 29  KAERGLKIETLRFNG--TERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTII 86
            AER LK +         + ++  +L K V+FLWQ   S Y HVWP++KFGW++V+ +II
Sbjct: 34  NAERLLKADDRPAEHLVNKNKEPGFLSKVVHFLWQSGKSSYQHVWPDMKFGWKLVVASII 93

Query: 87  GFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPT 146
           GFFGAA GSVGGVGGGGIFVPML+LI+GFDPKS+TAISKCMIMGAA STVYYN++LRHPT
Sbjct: 94  GFFGAALGSVGGVGGGGIFVPMLTLIVGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPT 153

Query: 147 LDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETW 206
           LDMPIIDYDLALL QPMLMLGISIGVAFNVIFADWMVTVLLI+LF+GTSTKA  KG+ETW
Sbjct: 154 LDMPIIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIILFLGTSTKALFKGIETW 213

Query: 207 KRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLL 266
           K+ETI+KKEAAR L +     G+  Y+ LPSGP +       +  V ++ NI WKEL LL
Sbjct: 214 KKETIMKKEAARLLEAETKPTGD--YRPLPSGPAT-----IRDDHVPVIYNICWKELALL 266

Query: 267 VFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDG 326
           V+VWVAFL +QI K +T +CSI YWVL+ LQ+P++  VS+YEA+ LYKG RVIASKG + 
Sbjct: 267 VYVWVAFLAVQIVKTYTVTCSIEYWVLNCLQVPIAASVSIYEAVCLYKGTRVIASKGKEV 326

Query: 327 KSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSS 386
            ++++ Q+  YC+ G++AG+VGGLLGLGGGFI+GPLFLELG+PPQV+SAT+TF+M FSSS
Sbjct: 327 TNWKIHQIFLYCSCGIVAGLVGGLLGLGGGFILGPLFLELGIPPQVASATSTFSMAFSSS 386

Query: 387 MSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSA 446
           MSVV+Y+LL+RFPVPYA YFV VATIAA VGQHVVRK+I L+GRAS+IIFILA  IF+SA
Sbjct: 387 MSVVQYHLLRRFPVPYASYFVLVATIAALVGQHVVRKVIKLVGRASIIIFILALTIFISA 446

Query: 447 ISLGGVGISNMIGKFHRHEYMGFENLC 473
           ISLGGVGI  M+ K   +EYMGF++LC
Sbjct: 447 ISLGGVGIMTMVEKLANNEYMGFDDLC 473


>gi|356518101|ref|XP_003527721.1| PREDICTED: uncharacterized protein LOC100789748 [Glycine max]
 gi|83853827|gb|ABC47859.1| membrane protein-like protein [Glycine max]
          Length = 491

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 291/458 (63%), Positives = 362/458 (79%), Gaps = 11/458 (2%)

Query: 23  LAFVCVKAERGLKIE-TLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIV 81
           +A+    AER LK + +  F   ERQ    +   ++F           VWPE+KFGW IV
Sbjct: 27  MAYNVSLAERVLKDQKSESFVAKERQ--GVMNSIIDFFRNDGEPTNDRVWPEMKFGWRIV 84

Query: 82  LGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLK 141
           +G+I+GFFGAA GSVGGVGGGGIF+PML+L+IGFD KS+TA+SKCMIMGAAVSTVYYNL+
Sbjct: 85  VGSIVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCMIMGAAVSTVYYNLR 144

Query: 142 LRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLK 201
            RHPTLD+P+IDYDLALL QPMLMLGISIGVAFNV+FADWMVTVLLI+LFI TSTKA  K
Sbjct: 145 FRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIATSTKALFK 204

Query: 202 GVETWKRETILKKEAARCLGSNGAG--AGEVEYKSLPSGPRSGPQKDAMEPEVTILENIN 259
           G++TWK+ETI+KKEAA+ L S+ +     E +YKSLP+G       D  + EV +L+NI 
Sbjct: 205 GIDTWKKETIMKKEAAKMLESDSSPGYVSEEDYKSLPAG-----SADPRDEEVPLLKNIY 259

Query: 260 WKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVI 319
           WKEL +LV+VWVAFL +QI K +T +CSI+YWVL+ LQ+P+++ V+LYEAI L  G RVI
Sbjct: 260 WKELLVLVYVWVAFLIVQIIKTYTKTCSILYWVLNSLQVPIAISVTLYEAICLCNGTRVI 319

Query: 320 ASKGDDGKSF-RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATAT 378
           +SKG +   + ++ ++  YC+ G++AGIV GLLGLGGGFI+GPLFLELG+PPQV+SAT+T
Sbjct: 320 SSKGKENTDWMKLHKICLYCSCGIIAGIVSGLLGLGGGFILGPLFLELGIPPQVASATST 379

Query: 379 FAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFIL 438
           FAM FSSSMSVV+YYLL+RFPVPYA YF+ VATIAA  GQHVVRK+I + GRAS+IIF+L
Sbjct: 380 FAMVFSSSMSVVQYYLLERFPVPYASYFILVATIAALTGQHVVRKIIAIFGRASIIIFVL 439

Query: 439 AFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           AF IF+SAISLGGVGI NM+ K   +EYMGF N+C  +
Sbjct: 440 AFTIFLSAISLGGVGIENMVEKMENNEYMGFANICHVN 477


>gi|217073872|gb|ACJ85296.1| unknown [Medicago truncatula]
          Length = 474

 Score =  530 bits (1365), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 280/433 (64%), Positives = 347/433 (80%), Gaps = 16/433 (3%)

Query: 52  LLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGG-GGIFVPMLS 110
           ++K  +F W    S Y  VWPE+KFGW IV+G+I+GFFGAA GSVGGVGG  GIF+PML+
Sbjct: 47  VMKKTSFFWYSGESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGWKGIFIPMLT 106

Query: 111 LIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISI 170
           LIIGFDPKS+TA+SKCMI GAA STVY NL+LRHPTLDMP+IDYDLALL QPMLMLGISI
Sbjct: 107 LIIGFDPKSSTALSKCMITGAAGSTVYCNLRLRHPTLDMPLIDYDLALLFQPMLMLGISI 166

Query: 171 GVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEA----ARCL--GSNG 224
           GVAFNV+FADWMVT+LLI+LFIGTSTKA +KG++TWK+E I+KKEA    A+ L  GS  
Sbjct: 167 GVAFNVMFADWMVTILLIILFIGTSTKALVKGIDTWKKEAIMKKEAFEEAAQMLESGSTP 226

Query: 225 AGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTA 284
             A E +YKSLP+        D  + EV +L+NI+WKEL +L++VWVAFL +QI K ++ 
Sbjct: 227 DYASEEDYKSLPA--------DLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQILKTYSK 278

Query: 285 SCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLA 344
           +CSI YW+L+ LQ+P+++ V+L+EAI L KG RVIAS+G +  +++  ++  YC  G++A
Sbjct: 279 TCSIEYWLLNSLQVPIAISVTLFEAICLCKGTRVIASRGKE-ITWKFHKICLYCFCGIIA 337

Query: 345 GIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYAL 404
           G+V GLLGLGGGFI+GPLFLELG+PPQV+SAT+TFAM FSSSMSVV+YY L RFP+PYA 
Sbjct: 338 GMVSGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRFPIPYAS 397

Query: 405 YFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRH 464
           Y V VATIAA  GQHVVRK+I + GRAS+I+FILAF IFVSAISLGGVGI NM+ K    
Sbjct: 398 YLVLVATIAALTGQHVVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMVEKMENA 457

Query: 465 EYMGFENLCKYDR 477
           EYMGF+N C + +
Sbjct: 458 EYMGFDNFCHHGK 470


>gi|148906357|gb|ABR16333.1| unknown [Picea sitchensis]
          Length = 505

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 323/505 (63%), Positives = 373/505 (73%), Gaps = 30/505 (5%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSN---------- 50
           MA  G KW +L  VM+      LA     AER         + TER              
Sbjct: 1   MAGGGRKW-LLVVVMVATFGLLLAVTLSSAERTFLDGNSASSETERSMEKIVDPVEGNTL 59

Query: 51  --YLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPM 108
             +LLKA NFLW+   S Y  VWPE+KFGW IV+G+I+GFFGAA GSVGGVGGGGIFVPM
Sbjct: 60  KYHLLKARNFLWRSNISDYQRVWPEMKFGWRIVIGSIVGFFGAAVGSVGGVGGGGIFVPM 119

Query: 109 LSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGI 168
           L+LIIGFD KS+TAISKCMIMGAA STVYYNLKL+HPTLD+PIIDYDLA+L QPMLMLGI
Sbjct: 120 LTLIIGFDAKSSTAISKCMIMGAAGSTVYYNLKLKHPTLDLPIIDYDLAMLFQPMLMLGI 179

Query: 169 SIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGA- 227
           SIGVAFNVIFADWMVTVLLI+LF+GTSTKAFLKGVETWK+ETILKKEA R    N     
Sbjct: 180 SIGVAFNVIFADWMVTVLLIILFLGTSTKAFLKGVETWKQETILKKEAERVRVENVTADV 239

Query: 228 -----GEVEYKSLPSGP-----------RSGPQKDAMEPEVTILENINWKELGLLVFVWV 271
                 EV++K LPS P            S  ++   +   TI  N+ WKELG+LV VW+
Sbjct: 240 EPEEHQEVDFKPLPSEPVPNEESNGNNEGSNNERLPTDVTNTIWFNVRWKELGILVVVWI 299

Query: 272 AFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRV 331
            FL LQ+ K  + +CS+ YWVL+LLQIPV++ VSLYEAI LYKG + +ASKG+ G ++++
Sbjct: 300 IFLVLQVVKAKSTTCSVEYWVLNLLQIPVAMSVSLYEAIGLYKGTKAVASKGEAGINWKI 359

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVE 391
            QLV Y   G+LAG+VGGLLGLGGGFI+GPL LELGVPPQVSSATATF MTFSSSMSVVE
Sbjct: 360 SQLVLYFFCGILAGVVGGLLGLGGGFILGPLLLELGVPPQVSSATATFVMTFSSSMSVVE 419

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGG 451
           YY LKRFPVPYA Y   V  IAAF+GQHV+R+L+ILLGRASLIIFILAF+IF+SAI+LGG
Sbjct: 420 YYFLKRFPVPYAAYLFGVCVIAAFMGQHVIRRLVILLGRASLIIFILAFVIFLSAITLGG 479

Query: 452 VGISNMIGKFHRHEYMGFENLCKYD 476
           VGI  MI KF   +YMGFENLC Y 
Sbjct: 480 VGIVKMIHKFENGDYMGFENLCSYS 504


>gi|338762845|gb|AEI98632.1| hypothetical protein 111018.19 [Coffea canephora]
          Length = 465

 Score =  529 bits (1363), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 296/446 (66%), Positives = 350/446 (78%), Gaps = 17/446 (3%)

Query: 29  KAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGF 88
            AER LK         +     ++++ V+FLWQ   S Y HVWPE++FGW++V+GT++GF
Sbjct: 35  SAERLLKTVEPGHIVEKEMRQGFIVRLVHFLWQSGKSSYQHVWPEMEFGWKLVVGTVVGF 94

Query: 89  FGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLD 148
            GAA GSVGGVGGGGIFVPMLSLIIGFDPKS+TAISKCMIMGAA STVYYNL+LRHPTLD
Sbjct: 95  LGAALGSVGGVGGGGIFVPMLSLIIGFDPKSSTAISKCMIMGAAGSTVYYNLRLRHPTLD 154

Query: 149 MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKR 208
           +P+IDYDLALL QPMLMLGISIGVAFNVIFADWMVTVLLI+LFIGTS+KAF KG+ETW +
Sbjct: 155 LPLIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLILLFIGTSSKAFFKGLETWNK 214

Query: 209 ETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVF 268
           ET +K            G G  EYK LP GP +   +     +V +L NI WKEL LL+F
Sbjct: 215 ETKMK-----------LGPGN-EYKPLPGGPAAHADE-----KVPLLYNIYWKELCLLLF 257

Query: 269 VWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS 328
           VW+AFL +QI K +T +CS  YW L+ LQ+P++  VSLYEA  LYKG R IASKG +   
Sbjct: 258 VWIAFLAIQIMKTYTQTCSAKYWTLNFLQVPIAASVSLYEATCLYKGTRTIASKGKEITV 317

Query: 329 FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS 388
           ++   +  YC  G++AG+VGGLLGLGGGF++GPLFLELGVPPQV+SAT+TFAMTFSSSMS
Sbjct: 318 WKPNLIFLYCCLGIVAGVVGGLLGLGGGFVLGPLFLELGVPPQVASATSTFAMTFSSSMS 377

Query: 389 VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAIS 448
           VV+YYLL RFPVPYA YFV VAT+AA VGQHVVR++I +LGRASLIIFILA  IFVSAIS
Sbjct: 378 VVQYYLLNRFPVPYATYFVLVATVAALVGQHVVRRIIAILGRASLIIFILALTIFVSAIS 437

Query: 449 LGGVGISNMIGKFHRHEYMGFENLCK 474
           LGGVGI++MI K   HEYMGF+NLC 
Sbjct: 438 LGGVGIASMIEKLQNHEYMGFDNLCH 463


>gi|297735172|emb|CBI17534.3| unnamed protein product [Vitis vinifera]
          Length = 481

 Score =  529 bits (1362), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 301/452 (66%), Positives = 367/452 (81%), Gaps = 14/452 (3%)

Query: 29  KAERGLKIETLRFNG--TERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTII 86
            AER LK +         + ++  +L K V+FLWQ   S Y HVWP++KFGW++V+ +II
Sbjct: 34  NAERLLKADDRPAEHLVNKNKEPGFLSKVVHFLWQSGKSSYQHVWPDMKFGWKLVVASII 93

Query: 87  GFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPT 146
           GFFGAA GSVGGVGGGGIFVPML+LI+GFDPKS+TAISKCMIMGAA STVYYN++LRHPT
Sbjct: 94  GFFGAALGSVGGVGGGGIFVPMLTLIVGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPT 153

Query: 147 LDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETW 206
           LDMPIIDYDLALL QPMLMLGISIGVAFNVIFADWMVTVLLI+LF+GTSTKA  KG+ETW
Sbjct: 154 LDMPIIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIILFLGTSTKALFKGIETW 213

Query: 207 KRETILKKEAARCLG-----SNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWK 261
           K+ETI+KKEAAR L      ++  G G+  Y+ LPSGP +       +  V ++ NI WK
Sbjct: 214 KKETIMKKEAARLLEAETKPTDNTGGGD--YRPLPSGPAT-----IRDDHVPVIYNICWK 266

Query: 262 ELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIAS 321
           EL LLV+VWVAFL +QI K +T +CSI YWVL+ LQ+P++  VS+YEA+ LYKG RVIAS
Sbjct: 267 ELALLVYVWVAFLAVQIVKTYTVTCSIEYWVLNCLQVPIAASVSIYEAVCLYKGTRVIAS 326

Query: 322 KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 381
           KG +  ++++ Q+  YC+ G++AG+VGGLLGLGGGFI+GPLFLELG+PPQV+SAT+TF+M
Sbjct: 327 KGKEVTNWKIHQIFLYCSCGIVAGLVGGLLGLGGGFILGPLFLELGIPPQVASATSTFSM 386

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
            FSSSMSVV+Y+LL+RFPVPYA YFV VATIAA VGQHVVRK+I L+GRAS+IIFILA  
Sbjct: 387 AFSSSMSVVQYHLLRRFPVPYASYFVLVATIAALVGQHVVRKVIKLVGRASIIIFILALT 446

Query: 442 IFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           IF+SAISLGGVGI  M+ K   +EYMGF++LC
Sbjct: 447 IFISAISLGGVGIMTMVEKLANNEYMGFDDLC 478


>gi|357483379|ref|XP_003611976.1| Membrane protein-like protein [Medicago truncatula]
 gi|358344383|ref|XP_003636269.1| Membrane protein-like protein [Medicago truncatula]
 gi|355502204|gb|AES83407.1| Membrane protein-like protein [Medicago truncatula]
 gi|355513311|gb|AES94934.1| Membrane protein-like protein [Medicago truncatula]
          Length = 470

 Score =  526 bits (1354), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 305/470 (64%), Positives = 375/470 (79%), Gaps = 15/470 (3%)

Query: 4   LGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPE 63
           + A W I+  ++MI  N +LA      ER LK E       E++   +L   V+FLW+  
Sbjct: 14  IAATWLIMCILVMI-CNVSLA------ERVLK-EKEPAKFVEKETKGFLKAMVDFLWESG 65

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S Y  VWPE+KF W+I++G+IIGF GAA GSVGGVGGGGIFVPML+LIIGFDPKS+TAI
Sbjct: 66  KSSYEPVWPEMKFDWKIIVGSIIGFLGAALGSVGGVGGGGIFVPMLALIIGFDPKSSTAI 125

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGAA+STVYYN++LR+PTLDMP+IDYDLALL QPMLMLGISIGV  NV+FADWMV
Sbjct: 126 SKCMIMGAALSTVYYNMRLRNPTLDMPLIDYDLALLFQPMLMLGISIGVICNVMFADWMV 185

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGP 243
           TVLLI+LFIGTSTKA +KG+ TWK+ET+LKKE A+ L       GE +YK LP GP  G 
Sbjct: 186 TVLLIILFIGTSTKALIKGINTWKKETMLKKETAKQLEEE-PKTGE-DYKPLPKGP--GE 241

Query: 244 QKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLV 303
            +D + P   +L+NI WKEL LLV+VWVAFL +QI K +T +CSI YW+L+ LQ+P+++ 
Sbjct: 242 IQDEVVP---LLKNIYWKELSLLVYVWVAFLIVQIVKTYTKTCSIEYWILNFLQVPIAIS 298

Query: 304 VSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLF 363
           V+L+EA+ +YKG RVI SKG + K+ +++Q++ YC+ GV+AG+VGGLLGLGGGFI+GPLF
Sbjct: 299 VTLFEAVCIYKGTRVIKSKGKEVKNMKIYQILLYCSIGVIAGMVGGLLGLGGGFILGPLF 358

Query: 364 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRK 423
           LE+G+PPQV+SAT+TF+M FSSSMSVV+YY L RFPVPYA YFV VATIAAF GQHVVR+
Sbjct: 359 LEMGIPPQVASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAGQHVVRR 418

Query: 424 LIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           +I +LGRAS+IIFILA  IF+SAISLGGVGI NMI K   HEYMGFENLC
Sbjct: 419 IIAILGRASIIIFILASTIFISAISLGGVGIQNMIVKLENHEYMGFENLC 468


>gi|115476232|ref|NP_001061712.1| Os08g0387200 [Oryza sativa Japonica Group]
 gi|40253486|dbj|BAD05436.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623681|dbj|BAF23626.1| Os08g0387200 [Oryza sativa Japonica Group]
 gi|125603275|gb|EAZ42600.1| hypothetical protein OsJ_27163 [Oryza sativa Japonica Group]
          Length = 469

 Score =  526 bits (1354), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 278/428 (64%), Positives = 344/428 (80%), Gaps = 8/428 (1%)

Query: 50  NYLLKAVNFLWQPEG-SGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPM 108
           ++L K  NFLW+  G + YHHVWP ++FGW+IVLG+ +GF GAAFGS+GGVGGGG F+PM
Sbjct: 48  SHLRKVANFLWRSGGENSYHHVWPPMEFGWQIVLGSFVGFIGAAFGSIGGVGGGGFFMPM 107

Query: 109 LSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGI 168
           L+LIIGFD KS+ AISKCMIMGAAVSTVY NLK +HPTLDMP+IDYDLALLIQPMLMLGI
Sbjct: 108 LTLIIGFDAKSSVAISKCMIMGAAVSTVYCNLKRKHPTLDMPVIDYDLALLIQPMLMLGI 167

Query: 169 SIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAG 228
           SIGV FNVIF DW+VTVLLI+LF+GTSTKAFLKG+ETWK+ETI+K+EA +    +   + 
Sbjct: 168 SIGVIFNVIFPDWLVTVLLIILFLGTSTKAFLKGIETWKKETIIKREAEK---RSEQTSE 224

Query: 229 EVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSI 288
           E+EY+ +P+     P  +A    V+IL N+ WKE GLL FVW+AFL LQ+ KN+  +CS 
Sbjct: 225 ELEYRPVPASESKPPSDEA----VSILHNVYWKEFGLLAFVWIAFLALQVTKNYMPTCST 280

Query: 289 VYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVG 348
            YWVL+LLQIPVS+ V++YE + L +GRRVI+S G++  + +  QL+ YC FG+ AGIV 
Sbjct: 281 WYWVLNLLQIPVSVGVTMYEGLGLMQGRRVISSNGNEQTNLKFHQLLMYCFFGITAGIVA 340

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVA 408
           GLLG+GGG I+GP+FL+LGVPPQV+SATATF+M FS+SMS VEYY L RFPVPYALY   
Sbjct: 341 GLLGVGGGSILGPMFLDLGVPPQVASATATFSMMFSASMSAVEYYFLDRFPVPYALYLTV 400

Query: 409 VATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMG 468
           VA  +A VGQ +VRK+I  LGRAS+IIF L+ MIF+S I LGG+GI N IGK  RHEYMG
Sbjct: 401 VAFFSAIVGQRMVRKVINWLGRASIIIFTLSIMIFLSTIPLGGIGIVNWIGKIERHEYMG 460

Query: 469 FENLCKYD 476
           FE++CK+D
Sbjct: 461 FEDICKFD 468


>gi|357147621|ref|XP_003574413.1| PREDICTED: uncharacterized protein LOC100846525 [Brachypodium
           distachyon]
          Length = 461

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 279/412 (67%), Positives = 337/412 (81%), Gaps = 3/412 (0%)

Query: 65  SGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
           + YHHVWP +K GW ++LG+ I FFGAAFGS+GGVGGGGIFVPML+LIIGFD KS+ AIS
Sbjct: 52  TTYHHVWPPMKVGWRVLLGSFIVFFGAAFGSIGGVGGGGIFVPMLTLIIGFDTKSSAAIS 111

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVT 184
           KCMIMGAA+STVY NLKL+HP+LDMP+IDYDLALLIQPMLMLG+SIGV  NVIF DW+VT
Sbjct: 112 KCMIMGAALSTVYCNLKLKHPSLDMPLIDYDLALLIQPMLMLGVSIGVICNVIFPDWLVT 171

Query: 185 VLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQ 244
           VLLI LF+ TSTKAFLKGVETWK+ET+ K+EA + L      + + EY   PSG      
Sbjct: 172 VLLISLFLVTSTKAFLKGVETWKKETLTKREATKQLEQT---SEQWEYTLPPSGADDAAS 228

Query: 245 KDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVV 304
           K   +  V+I +NI WKE GLL FVW+AFL LQI K++ A+CS  YWVL++LQIPVS+ V
Sbjct: 229 KAPSDEAVSIWKNIYWKEFGLLAFVWIAFLVLQITKDYVATCSTWYWVLNILQIPVSVGV 288

Query: 305 SLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL 364
           ++Y+A+ L +G+RVI+SK ++  S + +QL  YC+ G+ AG +GGLLG+GGG IMGPLFL
Sbjct: 289 TMYQAVGLAQGKRVISSKANNQTSLKAYQLFVYCSLGLTAGSMGGLLGVGGGIIMGPLFL 348

Query: 365 ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           ELGVPPQVSSAT+TFAM FS+SMSVVEYYLLKRFPVPYA +F  VA +AA VGQ VVRK+
Sbjct: 349 ELGVPPQVSSATSTFAMMFSASMSVVEYYLLKRFPVPYAAFFTIVAFLAAIVGQGVVRKM 408

Query: 425 IILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           I  LGRASLIIF+L+FMIF+SA+SLGGVG SNMI K  +HEY+GFEN+CKYD
Sbjct: 409 ISWLGRASLIIFVLSFMIFISALSLGGVGTSNMIHKITQHEYLGFENICKYD 460


>gi|388493882|gb|AFK35007.1| unknown [Medicago truncatula]
          Length = 473

 Score =  517 bits (1331), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 282/432 (65%), Positives = 349/432 (80%), Gaps = 15/432 (3%)

Query: 52  LLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSL 111
           ++K  +F W    S Y  VWPE+KFGW IV+G+I+GFFGAA GSVGGVGGGGIF+PML+L
Sbjct: 47  VMKKTSFFWYSGESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTL 106

Query: 112 IIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIG 171
           IIGFDPKS+TA+SKCMI GAA STVY NL+LRHPTLDMP+IDYDLALL QPMLMLGISIG
Sbjct: 107 IIGFDPKSSTALSKCMITGAAGSTVYCNLRLRHPTLDMPLIDYDLALLFQPMLMLGISIG 166

Query: 172 VAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKK----EAARCL--GSNGA 225
           VAFNV+FADWMVT+LLI+LFIGTSTKA +KG++TWK+E I+KK    EAA+ L  GS   
Sbjct: 167 VAFNVMFADWMVTILLIILFIGTSTKALVKGIDTWKKEAIMKKEAFEEAAQMLESGSTPD 226

Query: 226 GAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTAS 285
            A E +YKSLP+        D  + EV +L+NI+WKEL +L++VWVAFL +QI K ++ +
Sbjct: 227 YASEEDYKSLPA--------DLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQILKTYSKT 278

Query: 286 CSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAG 345
           CSI YW+L+ LQ+P+++ V+L+EAI L KG RVIAS+G +  +++  ++  YC  G++AG
Sbjct: 279 CSIEYWLLNSLQVPIAISVTLFEAICLCKGTRVIASRGKE-ITWKFHKICLYCFCGIIAG 337

Query: 346 IVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY 405
           +V GLLGLGGGFI+GPLFLELG+PPQV+SAT+TFAM FSSSMSVV+YY L RFP+PYA Y
Sbjct: 338 MVSGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRFPIPYASY 397

Query: 406 FVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHE 465
            V VATIAA  GQHVVRK+I + GRAS+I+FILAF IFVSAISLGGVGI NM+ K    E
Sbjct: 398 LVLVATIAALTGQHVVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMVEKMENAE 457

Query: 466 YMGFENLCKYDR 477
           YMGF+NLC + +
Sbjct: 458 YMGFDNLCHHGK 469


>gi|357518359|ref|XP_003629468.1| hypothetical protein MTR_8g077830 [Medicago truncatula]
 gi|355523490|gb|AET03944.1| hypothetical protein MTR_8g077830 [Medicago truncatula]
          Length = 480

 Score =  516 bits (1328), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 284/439 (64%), Positives = 351/439 (79%), Gaps = 22/439 (5%)

Query: 52  LLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSL 111
           ++K  +F W    S Y  VWPE+KFGW IV+G+I+GFFGAA GSVGGVGGGGIF+PML+L
Sbjct: 47  VMKKTSFFWYSGESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTL 106

Query: 112 IIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIG 171
           IIGFDPKS+TA+SKCMI GAA STVYYNL+LRHPTLDMP+IDYDLALL QPMLMLGISIG
Sbjct: 107 IIGFDPKSSTALSKCMITGAAGSTVYYNLRLRHPTLDMPLIDYDLALLFQPMLMLGISIG 166

Query: 172 VAFNVIFADWMVTVLLIVLFIG-------TSTKAFLKGVETWKRETILKK----EAARCL 220
           VAFNV+FADWMVT+LLI+LFIG       TSTKA +KG++TWK+ETI+KK    EAA+ L
Sbjct: 167 VAFNVMFADWMVTILLIILFIGNNHYLFSTSTKALVKGIDTWKKETIMKKEAFEEAAQML 226

Query: 221 --GSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQI 278
             GS    A E +YKSLP+        D  + EV +L+NI+WKEL +L++VWVAFL +QI
Sbjct: 227 ESGSTPDYASEEDYKSLPA--------DLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQI 278

Query: 279 AKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYC 338
            K ++ +CSI YW+L+ LQ+P+++ V+L+EAI L KG RVIAS+G +  +++  ++  YC
Sbjct: 279 LKTYSKTCSIEYWLLNSLQVPIAISVTLFEAICLCKGTRVIASRGKE-ITWKFHKICLYC 337

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
             G++AG+V GLLGLGGGFI+GPLFLELG+PPQV+SAT+TFAM FSSSMSVV+YY L RF
Sbjct: 338 FCGIIAGMVSGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRF 397

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMI 458
           P+PYA Y V VATIAA  GQHVVRK+I + GRAS+I+FILAF IFVSAISLGGVGI NM+
Sbjct: 398 PIPYASYLVLVATIAALTGQHVVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMV 457

Query: 459 GKFHRHEYMGFENLCKYDR 477
            K    EYMGF+NLC + +
Sbjct: 458 EKMENEEYMGFDNLCHHGK 476


>gi|255549972|ref|XP_002516037.1| conserved hypothetical protein [Ricinus communis]
 gi|223544942|gb|EEF46457.1| conserved hypothetical protein [Ricinus communis]
          Length = 481

 Score =  511 bits (1315), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 298/450 (66%), Positives = 360/450 (80%), Gaps = 3/450 (0%)

Query: 30  AERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFF 89
           +E  LK+    +   +  + +++ +A NF +Q    GY HVWPE++FGW+IVLG +IG F
Sbjct: 31  SETRLKLLGSDYKRPKEVEFHHVTRASNFFYQKGRLGYKHVWPEMRFGWKIVLGGVIGVF 90

Query: 90  GAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDM 149
           GAA GSVGGVGGGGIFVPML+LI+GFD KS+TA+SKCMI GAA +TVYYNLKLRHPTL++
Sbjct: 91  GAACGSVGGVGGGGIFVPMLTLIVGFDAKSSTALSKCMITGAAAATVYYNLKLRHPTLEL 150

Query: 150 PIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE 209
           PIIDYDLALL QPML+LGIS+GVA NVIFADW +T+LLI+LFI TS+KAF KGVETWK+E
Sbjct: 151 PIIDYDLALLFQPMLVLGISLGVALNVIFADWAITILLIILFIVTSSKAFFKGVETWKQE 210

Query: 210 TILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQ---KDAMEPEVTILENINWKELGLL 266
           T LKKEAAR L  +     E+E KS P+G     Q    ++   +V+I EN+ WKELGLL
Sbjct: 211 TKLKKEAARHLEIDEDTTEEMEEKSPPTGSTIVAQAQFSESKREKVSIGENVCWKELGLL 270

Query: 267 VFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDG 326
           V VW   L LQI KN++ +CS+ YWVL  LQIPV++ V+ YEAISLYKG+R IAS+G+  
Sbjct: 271 VAVWSIILALQIGKNYSTTCSVTYWVLSTLQIPVAVGVTGYEAISLYKGQRKIASRGEVS 330

Query: 327 KSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSS 386
            ++++ QLVSY A G+LAG+VGG+LGLGGGFI+GPLFLE+G+PPQVSSATATF M FS+S
Sbjct: 331 TNWKIHQLVSYSAIGILAGVVGGMLGLGGGFILGPLFLEMGIPPQVSSATATFIMAFSAS 390

Query: 387 MSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSA 446
           MSV+EYYLLKRFPVPYALY  AVA IAAFVGQHVVRKLI LLGRASLIIFILAF IF SA
Sbjct: 391 MSVIEYYLLKRFPVPYALYLFAVAIIAAFVGQHVVRKLISLLGRASLIIFILAFTIFASA 450

Query: 447 ISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           ISLGGVGI+  I K  R EYMGFE++C Y+
Sbjct: 451 ISLGGVGIARAIKKIERKEYMGFEDICSYE 480


>gi|224059076|ref|XP_002299704.1| predicted protein [Populus trichocarpa]
 gi|222846962|gb|EEE84509.1| predicted protein [Populus trichocarpa]
          Length = 424

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 294/411 (71%), Positives = 343/411 (83%), Gaps = 3/411 (0%)

Query: 66  GYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISK 125
           GY HVWPE++FGW+IVLG+IIGFFGAA GSVGGVGGGGIFVPMLSLIIGFD KSATA+SK
Sbjct: 14  GYTHVWPEMRFGWKIVLGSIIGFFGAACGSVGGVGGGGIFVPMLSLIIGFDTKSATAMSK 73

Query: 126 CMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTV 185
           CMI G A +TVYYNLKLRHPTL++P+IDYDLALL QPML+LGISIGVA NVIFA+WM+T+
Sbjct: 74  CMITGGAAATVYYNLKLRHPTLELPVIDYDLALLFQPMLVLGISIGVALNVIFANWMITI 133

Query: 186 LLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQK 245
           LLIVLFI TSTKAF KGVETWK+ET  KKEA R L SN     EVE +  P GP  G Q 
Sbjct: 134 LLIVLFIVTSTKAFFKGVETWKKETKTKKEATRSLESNDDRNEEVEGEPPPVGPTVGAQT 193

Query: 246 DAME---PEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSL 302
           +  E    +V+I++N+ W EL LLV VW+  L LQI KN++ +CS+ YW+L++LQIPV+ 
Sbjct: 194 ETKEYKKEQVSIIKNVYWWELSLLVAVWLIILALQIGKNYSTTCSMAYWLLNILQIPVAF 253

Query: 303 VVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPL 362
            VS YEA +LYKGRR IASKGD   ++++ QLV YC  G+LAG+VGG+LGLGGGFI+GPL
Sbjct: 254 GVSSYEAFNLYKGRRKIASKGDAVTNWKIRQLVLYCIIGLLAGVVGGMLGLGGGFILGPL 313

Query: 363 FLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVR 422
           FLE+G+PPQVSSATATFAM FS+SMSVVEYYLLKRFPVPYALYF AVAT+AA VGQHV+R
Sbjct: 314 FLEMGIPPQVSSATATFAMMFSASMSVVEYYLLKRFPVPYALYFFAVATVAALVGQHVIR 373

Query: 423 KLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           KLI +LGRASLIIF LAF IFVSAI LGGVGI+ M+ +  R EYMGFEN+C
Sbjct: 374 KLISILGRASLIIFTLAFTIFVSAILLGGVGIARMVKRIERKEYMGFENIC 424


>gi|67043799|gb|AAY63998.1| unknown [Brassica napus]
          Length = 348

 Score =  505 bits (1300), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 264/348 (75%), Positives = 302/348 (86%), Gaps = 1/348 (0%)

Query: 129 MGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLI 188
           MGA+VSTVYYNL+LRHPTLDMPIIDYDLALLIQPMLMLGIS+GVAFNV+F DWMVTVLLI
Sbjct: 1   MGASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISVGVAFNVMFPDWMVTVLLI 60

Query: 189 VLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAM 248
           +LF+GTSTKAFLKG ETW +ETI K EAA+ L S+G  A EVEY  LP+ P + P  +  
Sbjct: 61  ILFLGTSTKAFLKGRETWNKETIEKMEAAKRLESDGVSATEVEYMPLPAAPNTNPGNNK- 119

Query: 249 EPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYE 308
           + EV+I+EN+ WKELGLLVFVW+ FL LQIAK +  +CS+ YWV++LLQIPV++ VS YE
Sbjct: 120 KREVSIIENVYWKELGLLVFVWIVFLALQIAKKNMPTCSVGYWVINLLQIPVAVGVSGYE 179

Query: 309 AISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGV 368
           A++LY+GRR+IAS G  G +F + QL+ YC+FGVLAG+VGGLLGLGGGFIMGPLFLELGV
Sbjct: 180 AVALYQGRRIIASNGQGGSNFTIGQLILYCSFGVLAGVVGGLLGLGGGFIMGPLFLELGV 239

Query: 369 PPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
           PPQVSSATATFAMTFSSSMSVVEYYLLKRFP+PYALY V VATIAA VGQHVVR+LI +L
Sbjct: 240 PPQVSSATATFAMTFSSSMSVVEYYLLKRFPIPYALYLVGVATIAALVGQHVVRRLIAVL 299

Query: 429 GRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           GRASLIIFILA  IF+SAISLGGVGI NM+GK  RHEYMGFENLCKY 
Sbjct: 300 GRASLIIFILASTIFISAISLGGVGIVNMMGKLQRHEYMGFENLCKYS 347


>gi|356554517|ref|XP_003545592.1| PREDICTED: uncharacterized protein LOC100781779 [Glycine max]
          Length = 462

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 273/469 (58%), Positives = 343/469 (73%), Gaps = 11/469 (2%)

Query: 11  LRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHV 70
           +R V++   NF L  V + A    K E    N  E Q  +Y +KA+ F+W+    GY HV
Sbjct: 1   MRHVVIFSFNFLLLCVFISAITSTKSEVSSTN--EEQSFSYHIKALEFIWK--HLGYQHV 56

Query: 71  WPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMG 130
           WPE++F W IV+GT+IG  GAAFGSVGGVGGGGIFVPML LIIGFDPKSA AISKCM+ G
Sbjct: 57  WPEMEFSWRIVVGTLIGILGAAFGSVGGVGGGGIFVPMLILIIGFDPKSAVAISKCMVTG 116

Query: 131 AAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVL 190
           AA+S V++ +K RHPTLD P+IDYDL LLIQP LMLGISIGV  +VIFADWMVT+LLI+L
Sbjct: 117 AAISAVFFCMKQRHPTLDEPVIDYDLMLLIQPTLMLGISIGVILSVIFADWMVTILLIIL 176

Query: 191 FIGTSTKAFLKGVETWKRETILKKEAARC--LGSNGAGAGEVE-YKSLPSGPRSGPQKDA 247
            I TS +AF  G +TWK+ET +K+E+     L  + A   E E YK LP     G +KD 
Sbjct: 177 CIVTSIRAFFMGADTWKKETKMKEESQETIKLSESTATCSEEEGYKYLPGCSDEGYEKDT 236

Query: 248 MEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLY 307
            +PEV+IL N+ WKE  L+  VW+AF+ LQIAKN+T SCS+ YW+L L QIP+++   LY
Sbjct: 237 RKPEVSILGNMYWKEFVLIFIVWLAFVVLQIAKNYTVSCSVTYWILILSQIPITVGFYLY 296

Query: 308 EAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELG 367
           +A +LY+GR    S+        +F L S C+  +LAGIVGGLLG G GF+MGPLFLE+G
Sbjct: 297 QARALYQGRAA-GSQHTHWPLHHLF-LASICS--LLAGIVGGLLGTGSGFVMGPLFLEVG 352

Query: 368 VPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIIL 427
           + PQV+SATATF M +SSS+SV++YYLL RFPVPYAL+   VA IAAF+GQ+++ KL+ +
Sbjct: 353 IAPQVASATATFGMMYSSSLSVIQYYLLNRFPVPYALFLTLVAAIAAFLGQYLIDKLVNI 412

Query: 428 LGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
             RASLIIF+LAF IFVS+I+LGGVGISNMI K  R+EYMGF+N C+ D
Sbjct: 413 FQRASLIIFVLAFTIFVSSIALGGVGISNMILKIQRNEYMGFDNFCRND 461


>gi|83853809|gb|ABC47842.1| membrane protein-like protein [Glycine max]
          Length = 469

 Score =  494 bits (1271), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 278/452 (61%), Positives = 347/452 (76%), Gaps = 17/452 (3%)

Query: 26  VCVKAERGLKIETLR-FNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGT 84
           VC+ AER LK +    F   ERQ    +   ++F W         VWPE+KFGW IV+G+
Sbjct: 30  VCL-AERVLKDQKPESFVVKERQ--GVMNSIIDFFWNDGEPTNDRVWPEMKFGWRIVVGS 86

Query: 85  IIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRH 144
           I+GFFGAA GSVGGVGGGGIF+PML+L+IGFD KS+TA+SKCMIMGAAVSTVYYNL+ RH
Sbjct: 87  IVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCMIMGAAVSTVYYNLRFRH 146

Query: 145 PTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVE 204
           PTLD+P+IDYDLALL QPMLMLGISIGVAFNV+FADWMVTVLLI+LFI TSTKA  KG++
Sbjct: 147 PTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIATSTKALFKGID 206

Query: 205 TWKRETILKKEAARCLGSNGAG--AGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKE 262
           TWK+ETI+KKEAA+ L S+ +     E +YKSLP+G      ++ +   +  +   N+ E
Sbjct: 207 TWKKETIMKKEAAKMLESDSSPGYVSEEDYKSLPAGSADPRDEEVIPMAIQGIPYSNFIE 266

Query: 263 LGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASK 322
           +         FL L +   +T +CSI+YWVL+ LQ+P+++ V+LYEAI L  G RVIASK
Sbjct: 267 V---------FLALHVV-TYTKTCSILYWVLNSLQVPIAISVTLYEAICLCNGTRVIASK 316

Query: 323 GDDGKSF-RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 381
           G +   + ++ ++  YC+ G++AGIV GLLGLGGGFI+GPLFLELG+PPQV+SAT+TFAM
Sbjct: 317 GKENTDWMKLHKICLYCSCGIIAGIVSGLLGLGGGFILGPLFLELGIPPQVASATSTFAM 376

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
            FSSSMSVV+YYLL+RFPVPYA YF+ VATIAA  GQHVVRK+I + GRAS+IIF+LAF 
Sbjct: 377 VFSSSMSVVQYYLLERFPVPYASYFILVATIAALTGQHVVRKIIAIFGRASIIIFVLAFT 436

Query: 442 IFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           IF+SAISLGGVGI NM+ K   +EYMGF N+C
Sbjct: 437 IFLSAISLGGVGIENMVEKMENNEYMGFANIC 468


>gi|3643608|gb|AAC42255.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197510|gb|AAM15102.1| hypothetical protein [Arabidopsis thaliana]
          Length = 902

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 262/347 (75%), Positives = 288/347 (82%), Gaps = 18/347 (5%)

Query: 129 MGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLI 188
           MGA+VSTVYYNL+LRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIF DW+VTVLLI
Sbjct: 1   MGASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFPDWLVTVLLI 60

Query: 189 VLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAM 248
           VLF+GTSTKAFLKG ETW +ETI KKEAA+ L SNG    EVEY  LP+ P + P     
Sbjct: 61  VLFLGTSTKAFLKGSETWNKETIEKKEAAKRLESNGVSGTEVEYVPLPAAPSTNPGNKKK 120

Query: 249 EPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYE 308
           E EV+I+EN+ WKELGLLVFVW+ FL LQI+K                 IPV++ VS YE
Sbjct: 121 E-EVSIIENVYWKELGLLVFVWIVFLALQISK-----------------IPVAVGVSGYE 162

Query: 309 AISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGV 368
           A++LY+GRR+IASKG    +F V QLV YC FG++AGIVGGLLGLGGGFIMGPLFLELGV
Sbjct: 163 AVALYQGRRIIASKGQGDSNFTVGQLVMYCTFGIIAGIVGGLLGLGGGFIMGPLFLELGV 222

Query: 369 PPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
           PPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY V VATIAA+VGQHVVR+LI  +
Sbjct: 223 PPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLVGVATIAAWVGQHVVRRLIAAI 282

Query: 429 GRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           GRASLIIFILA MIF+SAISLGGVGI NMIGK  RHEYMGFENLCKY
Sbjct: 283 GRASLIIFILASMIFISAISLGGVGIVNMIGKIQRHEYMGFENLCKY 329


>gi|356528375|ref|XP_003532779.1| PREDICTED: uncharacterized protein LOC100801788 [Glycine max]
          Length = 469

 Score =  489 bits (1259), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 258/411 (62%), Positives = 325/411 (79%)

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y H WP +KFGW I++G ++GF G+AFG+VGGVGGGGIFVPML+LI+GFD KSATAISKC
Sbjct: 59  YKHHWPSMKFGWRIIVGAVVGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDQKSATAISKC 118

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G A +TV+YNL+ RHPTLD+P+IDYDLALL QPMLMLGISIGVAFNVIF +WM+TVL
Sbjct: 119 MITGGATATVFYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVIFPEWMLTVL 178

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LI+ F+G S K+F KGV+TWK+ETI+KKEA +    +  G+ E +   + +G  +    +
Sbjct: 179 LIIFFVGISVKSFFKGVDTWKKETIMKKEAKKNSRIDDIGSPEDDAHYIQTGDPTKDDTN 238

Query: 247 AMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSL 306
               +V+I+ENI WKELGLL   W+  L L+I K HT +CS +YW+ +LLQ+P+++ +S 
Sbjct: 239 QSRKKVSIIENIRWKELGLLFAGWIMILALEIGKKHTTTCSRLYWLSNLLQVPIAVGMSS 298

Query: 307 YEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL 366
           YEA+ LYKG+R+IASKGD    + V QLV +CA G LAG++ GLLGLGGGFI+GPLFL L
Sbjct: 299 YEAVRLYKGKRIIASKGDKQTHWCVLQLVLFCACGTLAGMIAGLLGLGGGFILGPLFLGL 358

Query: 367 GVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           G+PPQV+SAT+T  M FS+SM+VVEYYLLKRFPVPYALYFVA+AT AA VGQH+VRK I 
Sbjct: 359 GIPPQVASATSTLVMAFSASMAVVEYYLLKRFPVPYALYFVAIATAAALVGQHLVRKAIA 418

Query: 427 LLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYDR 477
           +LGRAS+IIFIL   + VSA+ LGGVGI++MI K    EYMGF +LC Y +
Sbjct: 419 ILGRASVIIFILTLTLSVSAVLLGGVGIAHMIQKIENKEYMGFGDLCTYRK 469


>gi|356511005|ref|XP_003524222.1| PREDICTED: uncharacterized protein LOC100818778 [Glycine max]
          Length = 474

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 262/419 (62%), Positives = 327/419 (78%), Gaps = 11/419 (2%)

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y H WP +KFGW I++G I+GF G+AFG+VGGVGGGGIFVPML+LI+GFD KSATAISKC
Sbjct: 59  YKHHWPSMKFGWRIIVGAIVGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDQKSATAISKC 118

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G A +TV+YNL+ RHPTLD+P+IDYDLALL QPMLMLGISIGVAFNVIF +WM+TVL
Sbjct: 119 MITGGATATVFYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVIFPEWMLTVL 178

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LI+ F+G S K+F KGV+TWK+ETI+KKEA +    +  G+ E +   + +G    P KD
Sbjct: 179 LIIFFVGISVKSFFKGVDTWKKETIMKKEAKKNSRIDDIGSPEDDAHYIQTG---DPAKD 235

Query: 247 AMEP--------EVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQI 298
                       +V+I+ENI WKELGLL   W+  L L+I K HT +CS ++W+L+LLQ+
Sbjct: 236 DTNQSRKKKTVMQVSIIENIRWKELGLLFAGWIMILALEIGKKHTTTCSRLFWLLNLLQV 295

Query: 299 PVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFI 358
           P+++ +S YEA+ LYKG+R+IASKGD    + V QLV +CA G LAG++ GLLGLGGGFI
Sbjct: 296 PIAVGMSSYEAVRLYKGKRIIASKGDQQTHWCVLQLVLFCACGTLAGMIAGLLGLGGGFI 355

Query: 359 MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQ 418
           +GPLFL LG+PPQV+SAT+T  M FS+SM+VVEYYLLKRFPVPYALYFVA+AT AA VGQ
Sbjct: 356 LGPLFLGLGIPPQVASATSTLVMAFSASMAVVEYYLLKRFPVPYALYFVAIATAAALVGQ 415

Query: 419 HVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYDR 477
           H+VRK I +LGRAS+IIFIL   + VSA+ LGGVGI++MI K    EYMGF +LC Y +
Sbjct: 416 HLVRKAIAILGRASVIIFILTLTLSVSAVLLGGVGIAHMIQKIENKEYMGFGDLCTYRK 474


>gi|215694369|dbj|BAG89362.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 253/351 (72%), Positives = 293/351 (83%), Gaps = 4/351 (1%)

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MIMGAAVSTVYYNLKL+HPTLDMP+IDYDLALLIQPMLMLGISIGV FNVIF DW++TVL
Sbjct: 1   MIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLITVL 60

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LI+LF+GTSTKAFLKGVETWK+ETILK+EAA+ L      A E EY  LP+GP +  +  
Sbjct: 61  LIILFLGTSTKAFLKGVETWKKETILKREAAKRLEQI---AEEPEYSPLPTGPNATAESK 117

Query: 247 AMEPEVTIL-ENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVS 305
           A   E   L +N+ WKE GLL FVW+AFL LQ+ KN+  +CS  YWVL+ LQIPVS+ V+
Sbjct: 118 APSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLNFLQIPVSVGVT 177

Query: 306 LYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE 365
           +YE + L +GRRVI+SKGD+  + +  QL+ YC FG++AG+VGGLLGLGGGFIMGPLFLE
Sbjct: 178 MYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLGGGFIMGPLFLE 237

Query: 366 LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
           LG+PPQVSSATATFAM FSSSMSVVEYYLL RFPVPYAL+F  VA  AA +GQH+VRKLI
Sbjct: 238 LGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTVVAFFAAIIGQHIVRKLI 297

Query: 426 ILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
             LGRASLIIFIL+FMIF+SAISLGGVGISNMIGK  RHEYMGF+N+C YD
Sbjct: 298 NWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNICNYD 348


>gi|414589355|tpg|DAA39926.1| TPA: hypothetical protein ZEAMMB73_327447 [Zea mays]
          Length = 497

 Score =  484 bits (1245), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 274/440 (62%), Positives = 334/440 (75%), Gaps = 28/440 (6%)

Query: 63  EGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATA 122
           +G  YHHVWP + FGW +V+G++IGFFGAA GSVGGVGGGGIFVPML+LI+GFDPKS+TA
Sbjct: 57  DGKAYHHVWPPMGFGWRVVVGSLIGFFGAACGSVGGVGGGGIFVPMLALIVGFDPKSSTA 116

Query: 123 ISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWM 182
           ISKCMIMG +VSTVYYNLK +HP+LDMP+IDYDLALL+QPMLMLG+SIGV FNVIF DW+
Sbjct: 117 ISKCMIMGGSVSTVYYNLKRKHPSLDMPLIDYDLALLMQPMLMLGVSIGVIFNVIFPDWL 176

Query: 183 VTVLLIVLFIGTSTKAFLKGVETWKRETILKKE-----------AARCLGSNGAGAGEVE 231
           +T LLI +F+ TSTKA+LKG ETWK+ET+ K++             R +         + 
Sbjct: 177 ITALLITIFLVTSTKAYLKGFETWKKETVKKRQMLLRLHRAARQETRQIMKRYGNLACIA 236

Query: 232 YKSLPSGPRSGPQ-----KDAMEPEV-------TILENINWKELGLLVFVWVAFLGLQIA 279
            +++    R          + ME          ++L+NI WKELGLL FVW+AFLGLQ+ 
Sbjct: 237 VRTIAYIVRIRTCVHIYIDELMETACLALFQTSSVLKNIYWKELGLLAFVWIAFLGLQLT 296

Query: 280 K-NHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIAS-KG---DDGKSFRVFQL 334
           + N+ ASCS+ YWVL+ LQ+PV++ V++YEA  L  GRR ++S KG       + R+ QL
Sbjct: 297 RTNYAASCSVWYWVLNSLQVPVAVGVTVYEACGLRTGRRALSSNKGGSQQQQSALRIRQL 356

Query: 335 VSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL 394
           + YC FGVLAG+VGGLLG+GGGFIMGPLFLELG+PPQVSSATATF M FSSSMSVVEYYL
Sbjct: 357 LVYCLFGVLAGLVGGLLGMGGGFIMGPLFLELGIPPQVSSATATFTMMFSSSMSVVEYYL 416

Query: 395 LKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGI 454
           L RFPVPYA YF  VA +AA  GQH VRKLI  LGRASLIIFILA MIFVSA++LGGVGI
Sbjct: 417 LHRFPVPYAAYFTGVAFVAAITGQHCVRKLIAWLGRASLIIFILASMIFVSALTLGGVGI 476

Query: 455 SNMIGKFHRHEYMGFENLCK 474
           SN++ +  RH+YMGFE+LCK
Sbjct: 477 SNIVHRVERHQYMGFESLCK 496


>gi|242044424|ref|XP_002460083.1| hypothetical protein SORBIDRAFT_02g022500 [Sorghum bicolor]
 gi|241923460|gb|EER96604.1| hypothetical protein SORBIDRAFT_02g022500 [Sorghum bicolor]
          Length = 455

 Score =  479 bits (1233), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 274/417 (65%), Positives = 329/417 (78%), Gaps = 20/417 (4%)

Query: 63  EGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATA 122
           +G  YHHVWP ++FGW +V+G++IGFFGAA GSVGGVGGGGIFVPML+LIIGFDPKS+TA
Sbjct: 53  DGKAYHHVWPPMEFGWRVVVGSLIGFFGAACGSVGGVGGGGIFVPMLALIIGFDPKSSTA 112

Query: 123 ISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWM 182
           ISK           YYNLKL+HP+LDMP+IDYDLALL+QPMLMLG+SIGV FNVIF +W+
Sbjct: 113 ISK----------FYYNLKLKHPSLDMPLIDYDLALLMQPMLMLGVSIGVIFNVIFPNWL 162

Query: 183 VTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSG 242
           +T LLI +F+ TSTKA+LKG ETWK+ETI K+E AR          E E+ ++P G    
Sbjct: 163 ITALLITIFLVTSTKAYLKGFETWKKETIKKREDAR---RKEQICQEPEH-AIPIG-SDA 217

Query: 243 PQKDAMEPE---VTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIP 299
           P   A  P     ++L+NI WKE GLL FVWVAFLGLQ+ KN+ ASCS+ YWVL+ LQIP
Sbjct: 218 PADKAKTPTDEATSVLKNIYWKEFGLLAFVWVAFLGLQVTKNYVASCSVWYWVLNSLQIP 277

Query: 300 VSLVVSLYEAISLYKGRRVIASKGDDGK--SFRVFQLVSYCAFGVLAGIVGGLLGLGGGF 357
           V++ V++YEA  L  G+RV++SKG   +  + R+ QL+ YC FG+LAG++GGLLG+GGGF
Sbjct: 278 VAVGVTVYEAYGLRTGKRVLSSKGSSQQQSTLRIRQLLVYCLFGILAGLIGGLLGMGGGF 337

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG 417
           IMGPLFLELG+PPQVSSATATF M FSSSMSVVEYYLL RFPVPYA YF  VA +AA  G
Sbjct: 338 IMGPLFLELGIPPQVSSATATFTMMFSSSMSVVEYYLLHRFPVPYAAYFTGVAFVAAITG 397

Query: 418 QHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
           QH VRKLI  LGRASLIIFILA MIFVSA++LGGVGISN++ +  RHEYMGFE+LCK
Sbjct: 398 QHFVRKLIAWLGRASLIIFILASMIFVSALTLGGVGISNIVHRMERHEYMGFESLCK 454


>gi|356511003|ref|XP_003524221.1| PREDICTED: uncharacterized protein LOC100818240 [Glycine max]
          Length = 473

 Score =  479 bits (1232), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 259/426 (60%), Positives = 330/426 (77%), Gaps = 16/426 (3%)

Query: 64  GSGYH------HVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDP 117
           G+GYH      H WP +K+GW+I++GTI+GF G+AFG+VGGVGGGGIFVPML+LIIGFD 
Sbjct: 46  GTGYHAKVFYQHTWPSMKYGWKIIVGTIVGFLGSAFGNVGGVGGGGIFVPMLTLIIGFDA 105

Query: 118 KSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVI 177
           KSA AISKCMI G A +TV+YNL+ RHPTLD+P+IDYDLALL QPMLMLGISIGV+FNVI
Sbjct: 106 KSAIAISKCMITGGATATVFYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVSFNVI 165

Query: 178 FADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPS 237
           F DWM+T LLI+ F G S K+F KGV+TWK+ET++ KEA++   ++G G     Y   P 
Sbjct: 166 FPDWMLTTLLIISFTGISVKSFFKGVDTWKQETLMMKEASKNSLTDGRGGST--YIGNPE 223

Query: 238 GPRS-----GPQKDAM---EPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIV 289
                     P KD       +V++++N++WKELGLL  VW+  L L+I KN+T +CS V
Sbjct: 224 DDAHYIKTIDPVKDNTNQSRKKVSVIDNVHWKELGLLFAVWIMILALEIGKNYTTTCSGV 283

Query: 290 YWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGG 349
           YWV++LLQ+P+++ +S Y+A+ LYKG+R+IASKGD    +RV QL+ +CA G+LAG + G
Sbjct: 284 YWVINLLQVPIAVGMSSYQAMRLYKGQRIIASKGDQQTHWRVLQLIVFCACGILAGTIAG 343

Query: 350 LLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAV 409
           LLGLGGGFI+ PLFL LG+PPQV+SAT+  AM FS+S++VVEYYLLKRFP+ YALYFVAV
Sbjct: 344 LLGLGGGFILAPLFLGLGIPPQVASATSILAMAFSASIAVVEYYLLKRFPISYALYFVAV 403

Query: 410 ATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGF 469
           AT AA VGQH+VRK+I +LGRAS+IIFIL   + VSA+ LGGVG++NMI +    EYMGF
Sbjct: 404 ATAAALVGQHLVRKVIAMLGRASVIIFILTLTLCVSAVLLGGVGVANMIKRIENKEYMGF 463

Query: 470 ENLCKY 475
            NLC Y
Sbjct: 464 GNLCTY 469


>gi|356528373|ref|XP_003532778.1| PREDICTED: uncharacterized protein LOC100801244 [Glycine max]
          Length = 466

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 250/409 (61%), Positives = 323/409 (78%)

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y H WP +++GW+I++GT++GF G+AFG+VGGVGGGGIFVPML+LIIGFD KSA AISKC
Sbjct: 54  YKHTWPSMEYGWKIIVGTMVGFLGSAFGNVGGVGGGGIFVPMLTLIIGFDAKSAIAISKC 113

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G A +TV+YNL+ RHPTLD+P+IDYDLALL QPMLMLGISIGV+FNVIF DWM+T L
Sbjct: 114 MITGGATATVFYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVSFNVIFPDWMLTTL 173

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LI+ F G S K+F KGV+TWK+ET++ KEA +    +  G+ E     + +G       +
Sbjct: 174 LIIFFTGISVKSFFKGVDTWKQETLIVKEARKNSQIDDIGSPEDAAHYIQTGDPVKDNTN 233

Query: 247 AMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSL 306
               +V+++ENI+W ELGLL  VW+  L L+I KN+T +CS VYWV++LLQ+P+++ +S 
Sbjct: 234 QSRKKVSVIENIHWNELGLLFAVWIMILALEIGKNYTTTCSGVYWVINLLQVPIAVGMSS 293

Query: 307 YEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL 366
           Y+A+ LYKG+R+I SKGD   ++RV QL+ +CA G+LAG + GLLGLGGGFI+ PLFL +
Sbjct: 294 YQAMRLYKGQRIIGSKGDQQTNWRVLQLILFCACGILAGTIAGLLGLGGGFILAPLFLGI 353

Query: 367 GVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           G+PPQV+SAT+  AM FS+SM+VVEYYLLKRFP+ YALYFVAVAT AA VGQH+VRK+I 
Sbjct: 354 GIPPQVASATSILAMAFSASMAVVEYYLLKRFPISYALYFVAVATAAALVGQHLVRKVIA 413

Query: 427 LLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           +LGRAS+IIFIL   + VSA+ LGGVG++NMI +    EYMGF NLC Y
Sbjct: 414 ILGRASVIIFILTLTLCVSAVLLGGVGVANMIKRIENKEYMGFGNLCTY 462


>gi|255549970|ref|XP_002516036.1| conserved hypothetical protein [Ricinus communis]
 gi|223544941|gb|EEF46456.1| conserved hypothetical protein [Ricinus communis]
          Length = 466

 Score =  477 bits (1227), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 272/412 (66%), Positives = 332/412 (80%)

Query: 65  SGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
           S Y HVWPE++ GW IV+G IIGFFGAAFGS GGVGGGGI+VPML+LIIGFD KS+ AIS
Sbjct: 55  SDYKHVWPEMRIGWRIVVGAIIGFFGAAFGSAGGVGGGGIYVPMLTLIIGFDAKSSIAIS 114

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVT 184
           KCMI GAA STVYYNLK RHPTL+MP+IDYDLALL QPML+LG+SIGV FNVIFADWM+T
Sbjct: 115 KCMITGAAASTVYYNLKQRHPTLEMPVIDYDLALLFQPMLVLGVSIGVTFNVIFADWMIT 174

Query: 185 VLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQ 244
           VLLI++FI  STKAF KG++TWK+ETI KK A   LG N           + S       
Sbjct: 175 VLLIIIFIFMSTKAFSKGIQTWKKETIKKKIAPLLLGDNAEVVIPTPPPEILSDNAQIET 234

Query: 245 KDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVV 304
           K   + +V+++EN+ WK LGLL+ +W   L LQIAKN+T +C++ YW+LD  QIPV+  +
Sbjct: 235 KGPKKEKVSLIENVYWKALGLLLVIWFMILALQIAKNYTKTCTVPYWILDFAQIPVAASL 294

Query: 305 SLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL 364
           ++Y+A+ LYKGRR IAS+G+   ++RV +L+ YC  G+LAGI+GGLLGLGGGFI+GPLFL
Sbjct: 295 TIYQAVRLYKGRRKIASRGEAESNWRVHKLLLYCFSGLLAGIIGGLLGLGGGFILGPLFL 354

Query: 365 ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           E+G+PPQVSSATATFAMTFS+SMSVVEYYLLKRFPVPYALY  AV T+A FVGQ+VV+K+
Sbjct: 355 EMGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYLFAVTTLAGFVGQYVVKKV 414

Query: 425 IILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           I +LGRAS+IIFIL+F I VSAISLGGVG+++MI K  R EYMGFE++C Y 
Sbjct: 415 IDMLGRASIIIFILSFTILVSAISLGGVGLADMIKKIERKEYMGFEDICLYS 466


>gi|224054356|ref|XP_002298219.1| predicted protein [Populus trichocarpa]
 gi|222845477|gb|EEE83024.1| predicted protein [Populus trichocarpa]
          Length = 421

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 276/425 (64%), Positives = 333/425 (78%), Gaps = 21/425 (4%)

Query: 65  SGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
           SGY HVWP++KFGW+ V+GTII F GAA GSVGGVGGGGIFVPML+LI+GFD KS+ AIS
Sbjct: 2   SGYKHVWPDMKFGWKTVVGTIITFLGAACGSVGGVGGGGIFVPMLTLIVGFDSKSSAAIS 61

Query: 125 KC-----------MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVA 173
           KC           MI GAA +TV+YNL+LRHPTL++PIIDYDLALL QPML+LGISIGV 
Sbjct: 62  KCNCCRLCLLLSGMITGAAATTVFYNLRLRHPTLELPIIDYDLALLFQPMLILGISIGVT 121

Query: 174 FNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYK 233
            NV+FADWM+ +LLI+ FI TST++F KGVETWK+ET  K+ +      N   A E   K
Sbjct: 122 LNVLFADWMIIILLIIFFIATSTRSFFKGVETWKKETKSKQVSHFLYPKNLIEAAE--EK 179

Query: 234 SLPSGPRSGPQKDAMEPE-----VTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSI 288
            LP G  S    D ++PE     V+I+ENI WK+LG+LV VW+  L LQI KN+T +CS 
Sbjct: 180 PLPGGTTS---HDQIKPEPNKEKVSIIENIKWKDLGILVIVWLIILALQIGKNYTTTCSA 236

Query: 289 VYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVG 348
           +YW+L++LQIPV+  V+ YEA+SLYKGRR I S G+   ++ V +L+ YC  GV+AG VG
Sbjct: 237 IYWLLNILQIPVAAGVTSYEAVSLYKGRRKITSMGEASNNWPVHELILYCFIGVMAGTVG 296

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVA 408
           G+LGLGGGFI+GPLFLE+G+PPQVSSATATFAM FS+SMSVVE+YLLKRFPVPYALYF  
Sbjct: 297 GMLGLGGGFILGPLFLEMGIPPQVSSATATFAMMFSASMSVVEFYLLKRFPVPYALYFFT 356

Query: 409 VATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMG 468
           VATIAA VGQHVVRKLI +LGRASLIIFILA  IFVSAI +GG G+++MI KF R EYMG
Sbjct: 357 VATIAAVVGQHVVRKLISILGRASLIIFILASTIFVSAILVGGAGMASMIEKFERKEYMG 416

Query: 469 FENLC 473
           FE++C
Sbjct: 417 FESMC 421


>gi|217075809|gb|ACJ86264.1| unknown [Medicago truncatula]
          Length = 481

 Score =  467 bits (1201), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 268/411 (65%), Positives = 326/411 (79%), Gaps = 2/411 (0%)

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y H WP +KFGW I++GTIIGF G++FG+VGGVGGGGIFVPML+LI+GFD KSATAISK 
Sbjct: 67  YKHTWPGMKFGWRIIVGTIIGFLGSSFGTVGGVGGGGIFVPMLTLIVGFDAKSATAISKF 126

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G A +TV YNLK RHPTLD+P+IDYDLALL QPMLMLGIS+GVAFN+IF DWM+T L
Sbjct: 127 MITGGAGATVLYNLKKRHPTLDLPVIDYDLALLFQPMLMLGISLGVAFNLIFPDWMLTTL 186

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LI+ F G S  AFLKGV TWK+ET+ KKEA      N     +           +  Q +
Sbjct: 187 LIIFFTGISVNAFLKGVNTWKKETLTKKEAKDNSQLNDIRTEDATLDLQIGESVNENQTN 246

Query: 247 AMEP--EVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVV 304
              P  +V+++EN+ WKELGLL  VW+  L LQI KN+T +CS++YW+L+LLQ+P+++ V
Sbjct: 247 TNLPRKKVSVIENVYWKELGLLFSVWIMILALQIGKNYTTTCSLIYWILNLLQVPIAIGV 306

Query: 305 SLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL 364
           S YEAI LYKG+RVIAS GD G ++ V QL+ YC+ G++AGI+GGLLGLGGGFI+GPLF+
Sbjct: 307 SGYEAILLYKGKRVIASNGDQGINWSVKQLILYCSCGIIAGIIGGLLGLGGGFILGPLFI 366

Query: 365 ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            LG+PPQV+SAT+TFAMTFS+SMSVVEYYLLKRFP+PYALYFV VAT+AA VGQH+VRKL
Sbjct: 367 GLGIPPQVASATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQHLVRKL 426

Query: 425 IILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           I LLGRAS+IIFILA  +FVS ISLGG GI+N+I +    EYMGF NLC Y
Sbjct: 427 IALLGRASIIIFILALTVFVSGISLGGTGIANLIKRLENKEYMGFGNLCSY 477


>gi|357519217|ref|XP_003629897.1| hypothetical protein MTR_8g088080 [Medicago truncatula]
 gi|355523919|gb|AET04373.1| hypothetical protein MTR_8g088080 [Medicago truncatula]
          Length = 479

 Score =  466 bits (1200), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 268/411 (65%), Positives = 326/411 (79%), Gaps = 2/411 (0%)

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y H WP +KFGW I++GTIIGF G++FG+VGGVGGGGIFVPML+LI+GFD KSATAISK 
Sbjct: 65  YKHTWPGMKFGWRIIVGTIIGFLGSSFGTVGGVGGGGIFVPMLTLIVGFDAKSATAISKF 124

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G A +TV YNLK RHPTLD+P+IDYDLALL QPMLMLGIS+GVAFN+IF DWM+T L
Sbjct: 125 MITGGAGATVLYNLKKRHPTLDLPVIDYDLALLFQPMLMLGISLGVAFNLIFPDWMLTTL 184

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LI+ F G S  AFLKGV TWK+ET+ KKEA      N     +           +  Q +
Sbjct: 185 LIIFFTGISVNAFLKGVNTWKKETLTKKEAKDNSQLNDIRTEDATLDLQIGESVNENQTN 244

Query: 247 AMEP--EVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVV 304
              P  +V+++EN+ WKELGLL  VW+  L LQI KN+T +CS++YW+L+LLQ+P+++ V
Sbjct: 245 TNLPRKKVSVIENVYWKELGLLFSVWIMILALQIGKNYTTTCSLIYWILNLLQVPIAIGV 304

Query: 305 SLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL 364
           S YEAI LYKG+RVIAS GD G ++ V QL+ YC+ G++AGI+GGLLGLGGGFI+GPLF+
Sbjct: 305 SGYEAILLYKGKRVIASNGDQGINWSVKQLILYCSCGIIAGIIGGLLGLGGGFILGPLFI 364

Query: 365 ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            LG+PPQV+SAT+TFAMTFS+SMSVVEYYLLKRFP+PYALYFV VAT+AA VGQH+VRKL
Sbjct: 365 GLGIPPQVASATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQHLVRKL 424

Query: 425 IILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           I LLGRAS+IIFILA  +FVS ISLGG GI+N+I +    EYMGF NLC Y
Sbjct: 425 IALLGRASIIIFILALTVFVSGISLGGTGIANLIKRLENKEYMGFGNLCSY 475


>gi|337733644|gb|AEI72271.1| hypothetical protein [Citrus trifoliata]
          Length = 353

 Score =  457 bits (1175), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 248/352 (70%), Positives = 290/352 (82%), Gaps = 3/352 (0%)

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G AV+TV YNL+ RHPTLDMP+IDYDLALL QPML+LGISIGVAFNVIFADWM+TVL
Sbjct: 1   MITGTAVATVVYNLRQRHPTLDMPVIDYDLALLFQPMLVLGISIGVAFNVIFADWMITVL 60

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LI+LFIG S KAFLKGVETWK+ETI K EAA+ L  +  G    E+K  P    +   K+
Sbjct: 61  LIILFIGMSAKAFLKGVETWKKETITKVEAAKRLELDRNGDQAEEFKCEPENLSNTTPKE 120

Query: 247 AMEP---EVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLV 303
             EP   E +IL+NI WKELGLLV VWV  L LQIAKN+TA+CS++YWVL+ LQIPV+  
Sbjct: 121 TKEPVKSEGSILKNIYWKELGLLVAVWVIVLALQIAKNYTATCSVLYWVLNFLQIPVAGG 180

Query: 304 VSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLF 363
           VS YEA++LYKGRR IASKGD G ++R  QLV YCA G+ AG+VGGLLGLGGGFI+GPLF
Sbjct: 181 VSAYEAVALYKGRRTIASKGDVGTNWRPHQLVFYCACGITAGVVGGLLGLGGGFILGPLF 240

Query: 364 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRK 423
           LELG+PPQVSSATATFAMTFS+SMSVVEYYLLKRFPVPYALYF AV+TIAAFVGQHVVRK
Sbjct: 241 LELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFFAVSTIAAFVGQHVVRK 300

Query: 424 LIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           LI +LGRAS+IIF L+  IFVSA+SL G+G++ M+ +  R EYMGF+++C Y
Sbjct: 301 LINILGRASIIIFTLSLTIFVSALSLSGIGLAKMVPRIDRKEYMGFDSICSY 352


>gi|356511007|ref|XP_003524223.1| PREDICTED: uncharacterized protein LOC100819316 [Glycine max]
          Length = 454

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 267/409 (65%), Positives = 325/409 (79%), Gaps = 22/409 (5%)

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y H WPE+KFGW+I++G+IIGF G+AFG+VGGVGGGGIFVPML+LIIGFD +SATAISKC
Sbjct: 64  YKHTWPEMKFGWKIIVGSIIGFLGSAFGTVGGVGGGGIFVPMLTLIIGFDARSATAISKC 123

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G A +TV+YNLK RHPTLDMP+IDYDLALL QPMLMLGIS+GVAFNVIF DWM+T L
Sbjct: 124 MITGGAGATVFYNLKQRHPTLDMPVIDYDLALLFQPMLMLGISVGVAFNVIFPDWMLTAL 183

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LI++FIG S  AF KGV+TWK+ETI+KK    C            ++ L         K 
Sbjct: 184 LIIVFIGLSVNAFFKGVKTWKKETIIKK----CF----------HFQHL-----FFLMKS 224

Query: 247 AMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSL 306
               +V+++ENI WKELG+LV VW+  L LQI KN+T +CS +YW+++LLQ+P+++  + 
Sbjct: 225 KTVLQVSVIENIYWKELGILVSVWILILALQIGKNYTTNCSALYWIMNLLQVPITVGTTF 284

Query: 307 YEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL 366
           YEA+ LYKG+RVIASKGD    +RV QL+ YC+ G++AGI+GGLLGLGGGFI+GPLF+ L
Sbjct: 285 YEAVLLYKGQRVIASKGDQQTRWRVQQLILYCSCGIIAGIIGGLLGLGGGFILGPLFIGL 344

Query: 367 GVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           G+ PQVSSAT+TFAMTFS+SMSVVEYYLLKRFP+PYALYFVAVAT AA VGQH+VRK+I 
Sbjct: 345 GIHPQVSSATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVAVATAAALVGQHLVRKVIA 404

Query: 427 LLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           +LGR SLIIFILA  +FVS ISLGGVGI+ +I K      M FENLC Y
Sbjct: 405 ILGRTSLIIFILALTVFVSGISLGGVGIAKLIEK---KVQMKFENLCSY 450


>gi|356528377|ref|XP_003532780.1| PREDICTED: uncharacterized protein LOC100802321 [Glycine max]
          Length = 434

 Score =  446 bits (1148), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 264/409 (64%), Positives = 316/409 (77%), Gaps = 40/409 (9%)

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y H WPE+KFGW+I++G+IIGF G+AFG+VGGVGGGGIFVPML+LIIGFD +SATAISKC
Sbjct: 62  YKHTWPEMKFGWKIIVGSIIGFLGSAFGTVGGVGGGGIFVPMLTLIIGFDARSATAISKC 121

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G A +T++YNLK RHPTLDMP+IDYDLALL QPMLMLGISIGVAFNVIF DWM+T L
Sbjct: 122 MITGGAGATIFYNLKQRHPTLDMPVIDYDLALLFQPMLMLGISIGVAFNVIFPDWMLTAL 181

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           LIV+FIG S  AF KGV+TWK+ETI+ K                                
Sbjct: 182 LIVVFIGLSFNAFFKGVKTWKKETIVNK-------------------------------- 209

Query: 247 AMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSL 306
                V+++ENI WKELG+LV VW+  L LQI KN+T +CS +YW L+LLQ+P+++  ++
Sbjct: 210 -----VSVIENIYWKELGILVSVWILILALQIGKNYTTNCSALYWALNLLQVPITVGTTV 264

Query: 307 YEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL 366
           YEA+ LYKG+R IASKGD    +RV QL+ YC  G++AGI+GGLLGLGGGFI+GPLF+ L
Sbjct: 265 YEAVLLYKGKRKIASKGDQQTRWRVHQLILYCTCGIIAGIIGGLLGLGGGFILGPLFIGL 324

Query: 367 GVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           G+ PQVSSAT+TFAMTFS+SMSVVEYYLLKRFP+PYALYFVAVAT AA VGQH+VRK+I 
Sbjct: 325 GIHPQVSSATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVAVATAAALVGQHLVRKVIA 384

Query: 427 LLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           +LGR SLIIFILA  +FVS ISLGGVGI+ +I K      M FENLC Y
Sbjct: 385 ILGRTSLIIFILALTVFVSGISLGGVGIAKLIEK---KVQMKFENLCSY 430


>gi|357519215|ref|XP_003629896.1| hypothetical protein MTR_8g088070 [Medicago truncatula]
 gi|355523918|gb|AET04372.1| hypothetical protein MTR_8g088070 [Medicago truncatula]
          Length = 448

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 249/387 (64%), Positives = 309/387 (79%), Gaps = 4/387 (1%)

Query: 67  YHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKC 126
           Y H WP +KFGW I++GTI+ F G++FG+VGGVGGG   VPML+LIIGFD KSATAISKC
Sbjct: 60  YKHTWPGMKFGWRIIVGTILAFLGSSFGTVGGVGGG-GIVPMLTLIIGFDAKSATAISKC 118

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           MI G A +TV YNL+ RHPTLD+P+IDYDLALL QPMLMLGIS+GVAFNVIF DWM+T L
Sbjct: 119 MITGGAGATVLYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISLGVAFNVIFPDWMITSL 178

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPR---SGP 243
           +++ F G S KAFLKGV TWK ET+ KKEA      N     E     L +G     +  
Sbjct: 179 ILIFFTGISVKAFLKGVNTWKNETLAKKEARDNSQLNDIRRIEDVELDLQAGESVNDNHT 238

Query: 244 QKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLV 303
             +    +V+++EN++WKE+GLL FVW+  L LQI K++T +CS+ YW+L+LLQ+P+++ 
Sbjct: 239 NTNIQRKKVSVIENVSWKEVGLLFFVWIMILALQIGKSYTITCSVAYWILNLLQVPIAVG 298

Query: 304 VSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLF 363
           VS YEA+ LYKG++VIASKGD G ++ V QL+ YC+ G++AG++GGLLGLGGGFI+ PLF
Sbjct: 299 VSSYEAVLLYKGKKVIASKGDQGINWSVKQLILYCSCGIIAGLIGGLLGLGGGFILAPLF 358

Query: 364 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRK 423
           L LG+PPQVSSAT+TFAMTFS+SMSVVEYYLLKRFP+PYALYFV VAT+AA VGQH+VRK
Sbjct: 359 LGLGIPPQVSSATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQHLVRK 418

Query: 424 LIILLGRASLIIFILAFMIFVSAISLG 450
           LI+LLGRAS+IIFILA  +FVS ISLG
Sbjct: 419 LIVLLGRASIIIFILALTVFVSGISLG 445


>gi|356570810|ref|XP_003553577.1| PREDICTED: uncharacterized protein LOC100807336 [Glycine max]
          Length = 459

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 227/421 (53%), Positives = 305/421 (72%), Gaps = 19/421 (4%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
           G G   +WPELK GW + L T+IGF G+AFG+VGGVGGGGIFVPML+LI+GFD KSA A+
Sbjct: 44  GFGNDKLWPELKVGWRVALATVIGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDTKSAAAL 103

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ ++V+YNL++ HPT ++PIIDYDLALL QPMLMLGI++GV  +V+F  W++
Sbjct: 104 SKCMIMGASTASVWYNLRVPHPTKEVPIIDYDLALLFQPMLMLGITVGVVLSVVFPYWLI 163

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEV----EYKSLPSGP 239
           TVL+I+LFIG+S+++F KG + W+ ET+LKKE AR   +     GE+    EY+ L   P
Sbjct: 164 TVLIIILFIGSSSRSFFKGTQMWREETLLKKEMARQRATLVNFRGELLIDTEYEQL--FP 221

Query: 240 RSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIP 299
           +   +K +M+       N+ WK + +L+FVWV+FL LQ+ KN    CS+ YWVL  LQ P
Sbjct: 222 KE--EKSSMQ---IFCFNLKWKRILILMFVWVSFLLLQVIKNDVKICSVWYWVLFCLQFP 276

Query: 300 VSLVVSLYEAISLYKGRRVIASKGDDGK------SFRVFQLVSYCAFG-VLAGIVGGLLG 352
           ++L+V  YEA+ LYKG +   S G+          + V  ++ +CA   +L G VGGLLG
Sbjct: 277 IALLVFGYEAVKLYKGHKERVSTGNPESICEASIEWTVLHIL-FCALCGILGGTVGGLLG 335

Query: 353 LGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATI 412
            GGGFI+GPL +E+GV PQV+SATATF M FSSS+SVVE+YLLKRFP+PYALY  AV+ +
Sbjct: 336 SGGGFILGPLLIEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPIPYALYLTAVSVL 395

Query: 413 AAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENL 472
           A F GQ+ VRKL+++L RAS+I+FIL+ +IF SA+++G VGI   I     HE+MGF + 
Sbjct: 396 AGFWGQYFVRKLMVILKRASIIVFILSGVIFASALTMGVVGIDKSIKMIQHHEFMGFLDF 455

Query: 473 C 473
           C
Sbjct: 456 C 456


>gi|224068935|ref|XP_002326235.1| predicted protein [Populus trichocarpa]
 gi|222833428|gb|EEE71905.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 223/424 (52%), Positives = 294/424 (69%), Gaps = 17/424 (4%)

Query: 61  QPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSA 120
            P  S    VWP+L+F W  VL T+IG  G+A G+VGGVGGGGIFVPML+LI+GFD KSA
Sbjct: 42  SPYLSTTDKVWPKLEFSWRTVLATVIGLLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSA 101

Query: 121 TAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFAD 180
            A+SKCMIM A+ S+V+YNL++ HPT ++PIIDYDLALL QPML+LGI++GV+ +V+F  
Sbjct: 102 AALSKCMIMAASASSVWYNLRVPHPTREVPIIDYDLALLFQPMLLLGITLGVSLSVVFPY 161

Query: 181 WMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKE----AARCLGSNGAGAGEVEYKSLP 236
           W++TVL+I+LFIGTS+++F KG+E WK ETILKKE        + S G    + EY+  P
Sbjct: 162 WLITVLIIILFIGTSSRSFFKGIEMWKEETILKKEMVIQQETIVNSRGELLIDTEYE--P 219

Query: 237 SGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLL 296
             PR   +K  M+    +  N+ WK L +L  VW +FL LQ+ KN  A CS  YWVL  L
Sbjct: 220 LIPRE--EKSKMQ---ILCFNLKWKRLLILFLVWTSFLLLQVIKNDVAVCSTWYWVLFCL 274

Query: 297 QIPVSLVVSLYEAISLYKGRRVIASKGD-----DGKSFRVFQLVSYCAF-GVLAGIVGGL 350
           Q P++  V  YEA+ LY+  +   S G+     +         + +CA  G++ G VGGL
Sbjct: 275 QFPIAFGVFGYEAVKLYRENKKRISTGNTETICEASIEWTPMHILFCALCGIIGGTVGGL 334

Query: 351 LGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVA 410
           LG GGGF++GPL LE+GV P V+SAT+TF M FSSS+SVVE+YLLKRFP+P+ALY + V+
Sbjct: 335 LGSGGGFVLGPLLLEIGVSPHVASATSTFVMMFSSSLSVVEFYLLKRFPIPFALYLMGVS 394

Query: 411 TIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFE 470
            +A F GQ  VRKL+ +LGRASLI+FIL+ +IFVSA+++GGVGI   I     HE+MGF 
Sbjct: 395 VLAGFWGQFFVRKLVKILGRASLIVFILSGVIFVSALTMGGVGIDTSITMIRNHEFMGFL 454

Query: 471 NLCK 474
             C 
Sbjct: 455 EFCS 458


>gi|297827147|ref|XP_002881456.1| hypothetical protein ARALYDRAFT_902780 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327295|gb|EFH57715.1| hypothetical protein ARALYDRAFT_902780 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 212/421 (50%), Positives = 293/421 (69%), Gaps = 27/421 (6%)

Query: 68  HHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCM 127
             +WP+LKF W++VL T+I F G+A G+VGGVGGGGIFVPML+LI+GFD KSA AISKCM
Sbjct: 52  EKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVPMLTLILGFDTKSAAAISKCM 111

Query: 128 IMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLL 187
           IMGA+ S+V+YN+++RHPT ++PI+DYDLALL QPML+LGI++GV+ +V+F  W++TVL+
Sbjct: 112 IMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLGITVGVSLSVVFPYWLITVLI 171

Query: 188 IVLFIGTSTKAFLKGVETWKRETILKKEAAR----CLGSNGAGAGEVEYKSLPSGPRSGP 243
           I+LF+GTS+++F KG+E WK ET+LK E A+     + S G    + EY+ L       P
Sbjct: 172 IILFVGTSSRSFFKGIEMWKEETLLKNEMAQQRANMVNSRGELLIDTEYEPL------YP 225

Query: 244 QKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLV 303
           +++  E E+ I  N+ WK L +LV VW+AFL +QI KN    CS +YW+L +LQ PV+L 
Sbjct: 226 REEKSELEI-IRSNLKWKRLLVLVTVWLAFLLIQIIKNEIKVCSTIYWILFILQFPVALA 284

Query: 304 VSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLL-----------G 352
           V  +EAI LY   +   S G+           +   +  L+ I  GL            G
Sbjct: 285 VFGFEAIKLYTENKKRLSSGNTECICE-----ATIKWTPLSLIFCGLCGVIGGIVGGLLG 339

Query: 353 LGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATI 412
            GGGF++GPL LE+GV PQV+SATATF M FSSS+SVVE+YLLKRFP+PYA+Y ++V+ +
Sbjct: 340 SGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPIPYAMYLISVSIL 399

Query: 413 AAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENL 472
           A F GQ  +RKL+ +L RAS+I+F+L+ +I  SA+++G +GI   I   H HE+MGF   
Sbjct: 400 AGFWGQSFIRKLVAILKRASIIVFVLSGVICASALTMGVIGIEKSIKMIHNHEFMGFLGF 459

Query: 473 C 473
           C
Sbjct: 460 C 460


>gi|356548305|ref|XP_003542543.1| PREDICTED: uncharacterized protein LOC100804918 [Glycine max]
          Length = 464

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 227/443 (51%), Positives = 308/443 (69%), Gaps = 26/443 (5%)

Query: 46  RQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIF 105
             D N   KAV  L     S    VWP+L+  W ++L T+IGF G+A G+VGGVGGGGIF
Sbjct: 32  HDDDN---KAVTLL-ASRVSETLKVWPDLEPSWRLLLATVIGFLGSACGTVGGVGGGGIF 87

Query: 106 VPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLM 165
           VPML+LI+GFD KSA A+SKCMIMGA+ S+V+YN+++ HPT ++PI+DYDLALL QPMLM
Sbjct: 88  VPMLNLILGFDTKSAAALSKCMIMGASTSSVWYNVRVPHPTKEVPILDYDLALLFQPMLM 147

Query: 166 LGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEA----ARCLG 221
           LGI++GVA +V+F  W++TVL+I+LFIGTS+++F KG+E W+ ETI K+E     A  + 
Sbjct: 148 LGITVGVALSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWREETIFKREKTMQRATLVD 207

Query: 222 SNGAG-AGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAK 280
           S G    G ++ K  P  P+   +K  ME    +  N+ WK + +L+ VWV FL +Q+ K
Sbjct: 208 SQGEDKTGIIDTKYEPLIPKE--KKSTME---ILCLNLRWKRILVLIVVWVGFLLVQVIK 262

Query: 281 NHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQL------ 334
           N   +CS  YWVL  LQ+P++L+V  YEA+ LYK  +    + + G S  + +       
Sbjct: 263 NDVEACSAWYWVLFGLQLPIALLVFGYEAVKLYKEHK---RRMNTGNSECICEASIEWTA 319

Query: 335 --VSYCAFG-VLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVE 391
             +++CA   ++ GIVGGLLG GGGF++GPL LE+GV PQV+SATATF M FSSS+SVVE
Sbjct: 320 INLAFCALCGIVGGIVGGLLGSGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVE 379

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGG 451
           +YLLKRFP+PYALY  +V+ +A F GQ  VR+LI  LGRAS+I+FIL+ +IF SA+++G 
Sbjct: 380 FYLLKRFPIPYALYLTSVSVLAGFWGQFFVRRLITCLGRASIIVFILSGVIFASALTMGV 439

Query: 452 VGISNMIGKFHRHEYMGFENLCK 474
           VGI N I   + HE+MGF   C 
Sbjct: 440 VGIENSIQMINNHEFMGFLGFCS 462


>gi|30686851|ref|NP_850267.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|18700129|gb|AAL77676.1| At2g36630/F1O11.26 [Arabidopsis thaliana]
 gi|22137236|gb|AAM91463.1| At2g36630/F1O11.26 [Arabidopsis thaliana]
 gi|330254181|gb|AEC09275.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 459

 Score =  370 bits (949), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 209/425 (49%), Positives = 295/425 (69%), Gaps = 27/425 (6%)

Query: 65  SGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
           S    +WP+LKF W++VL T+I F G+A G+VGGVGGGGIFVPML+LI+GFD KSA AIS
Sbjct: 45  SATEKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVPMLTLILGFDTKSAAAIS 104

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVT 184
           KCMIMGA+ S+V+YN+++RHPT ++PI+DYDLALL QPML+LGI++GV+ +V+F  W++T
Sbjct: 105 KCMIMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLGITVGVSLSVVFPYWLIT 164

Query: 185 VLLIVLFIGTSTKAFLKGVETWKRETILKKEAAR----CLGSNGAGAGEVEYKSLPSGPR 240
           VL+I+LF+GTS+++F KG+E WK ET+LK E A+     + S G    + EY+ L     
Sbjct: 165 VLIIILFVGTSSRSFFKGIEMWKEETLLKNEMAQQRANMVNSRGELLIDTEYEPL----- 219

Query: 241 SGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPV 300
             P+++  E E+ I  N+ WK L +LV VW+ FL +QI KN    CS +YW+L ++Q PV
Sbjct: 220 -YPREEKSELEI-IRSNLKWKGLLILVTVWLTFLLIQIVKNEIKVCSTIYWILFIVQFPV 277

Query: 301 SLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLL--------- 351
           +L V  +EA  LY   +    + + G +  + +  +   +  L+ I  GL          
Sbjct: 278 ALAVFGFEASKLYTANK---KRLNSGNTECICE--ATIEWTPLSLIFCGLCGLIGGIVGG 332

Query: 352 --GLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAV 409
             G GGGF++GPL LE+GV PQV+SATATF M FSSS+SVVE+YLLKRFP+PYA+Y ++V
Sbjct: 333 LLGSGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPIPYAMYLISV 392

Query: 410 ATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGF 469
           + +A F GQ  +RKL+ +L RAS+I+F+L+ +I  SA+++G +GI   I   H HE+MGF
Sbjct: 393 SILAGFWGQSFIRKLVAILRRASIIVFVLSGVICASALTMGVIGIEKSIKMIHNHEFMGF 452

Query: 470 ENLCK 474
              C 
Sbjct: 453 LGFCS 457


>gi|225441056|ref|XP_002283988.1| PREDICTED: uncharacterized protein LOC100240792 isoform 1 [Vitis
           vinifera]
 gi|297740033|emb|CBI30215.3| unnamed protein product [Vitis vinifera]
          Length = 469

 Score =  368 bits (944), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 229/422 (54%), Positives = 298/422 (70%), Gaps = 19/422 (4%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWPELKF W IVL T+IGF G+AFG+VGGVGGGGIFVPML+LI+GFD KSA A+
Sbjct: 54  SSSSEKVWPELKFSWRIVLATVIGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDTKSAAAL 113

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + PIIDYDLALL QPMLMLGI++GVA +V+F  W++
Sbjct: 114 SKCMIMGASTSSVWYNLRVNHPTKEAPIIDYDLALLFQPMLMLGITVGVALSVVFPYWLI 173

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAAR----CLGSNGAGAGEVEYKSL-PSG 238
           T+L+I+LF+GTST++F K V+ W  ET+ KKE        + S+G    + EY  L P+ 
Sbjct: 174 TILIIILFMGTSTRSFFKAVQMWNEETLFKKELEEQRRTMVNSHGELLIDAEYDPLIPNE 233

Query: 239 PRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQI 298
            +S  Q         +  N+ WK + +LV VWV FL LQI KN +  CS  YWVL LLQ 
Sbjct: 234 EKSWIQ--------ILCSNLRWKRILVLVTVWVIFLLLQIFKNDSTVCSAWYWVLFLLQF 285

Query: 299 PVSLVVSLYEAISLYK--GRRVIASKGDDGKSFRVFQL---VSYCAF-GVLAGIVGGLLG 352
           PV++VV  YEA+ L+K   +R+I+   +      +      +++CA  G+L G VGGLLG
Sbjct: 286 PVAVVVFGYEAVKLHKEYKKRIISGNTEAVCEANIEWTGLNLAFCALCGILGGTVGGLLG 345

Query: 353 LGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATI 412
            GGGFI+GPL LE+GV PQV+SATATF M FSSS+SVVE+YLLKRFP+PYALY ++V+ +
Sbjct: 346 SGGGFILGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPIPYALYLISVSVV 405

Query: 413 AAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENL 472
           A F GQ  +RKL+  L RAS+IIFIL+ +IF SA+++G VG    I   H HE+MGF + 
Sbjct: 406 AGFWGQLFIRKLVAFLRRASIIIFILSGVIFASALTMGVVGTKESITMIHNHEFMGFLSF 465

Query: 473 CK 474
           C 
Sbjct: 466 CD 467


>gi|449451245|ref|XP_004143372.1| PREDICTED: uncharacterized protein LOC101206149 [Cucumis sativus]
 gi|449523213|ref|XP_004168618.1| PREDICTED: uncharacterized protein LOC101229265 [Cucumis sativus]
          Length = 456

 Score =  366 bits (940), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 228/414 (55%), Positives = 297/414 (71%), Gaps = 17/414 (4%)

Query: 71  WPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMG 130
           WP+L+  W +V  T+IGF G+A G+VGGVGGGGIFVPML+LIIGFD KSA AISKCMIMG
Sbjct: 48  WPDLEPSWRLVAATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMG 107

Query: 131 AAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVL 190
           A+ S+V+YNL++ HPT D+PIID+DLALL QPMLMLGI++GV+ +V+F  W++T+L+I+L
Sbjct: 108 ASTSSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITILIIIL 167

Query: 191 FIGTSTKAFLKGVETWKRETILKKE-AARC---LGSNGAGAGEVEYKSLPSGPRSGPQKD 246
           FIGTS+++F KG+E WK ETILKKE A RC   + S+G    +VEY  L       P++ 
Sbjct: 168 FIGTSSRSFFKGIEMWKEETILKKEFAKRCETVVNSHGELLIDVEYDPLI------PKEQ 221

Query: 247 AMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSL 306
             E E+    N+ WK   +L  VW++FL LQ+ KN  A+CSI YWV+  LQ P+++VV  
Sbjct: 222 KTELELMCF-NLRWKRTSILFAVWISFLILQVVKNDVAACSIWYWVVFFLQFPIAIVVFG 280

Query: 307 YEAISLYK--GRRVIASKGDDGKSFRVFQLVSYCAFGVLAGI----VGGLLGLGGGFIMG 360
           YEA  LYK   +R+ A   +      +    S+ AF  L GI    VGGLLG GGGF++G
Sbjct: 281 YEARKLYKEHKKRMEAGNLEQICEASIGWTGSHLAFCALCGIVGGTVGGLLGSGGGFVLG 340

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           PL LE+GV PQV+SATATF M FSSS+SVVE+YLL RFP+PYALY  +V+ +A F GQ  
Sbjct: 341 PLLLEIGVVPQVASATATFVMMFSSSLSVVEFYLLNRFPIPYALYLTSVSVLAGFWGQFF 400

Query: 421 VRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
           VRKLI +L RASLI+FIL+ +IF SAI++G VG++  I     HE+MGF + C 
Sbjct: 401 VRKLITILRRASLIVFILSGVIFASAITMGIVGVTKSITMIQNHEFMGFLDFCS 454


>gi|356533415|ref|XP_003535260.1| PREDICTED: uncharacterized protein LOC100788770 [Glycine max]
          Length = 464

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 216/419 (51%), Positives = 302/419 (72%), Gaps = 22/419 (5%)

Query: 70  VWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIM 129
           VWP+L+  W +VL T+IGF G+A G+VGGVGGGGIFVPML+L++GFD KSA A+SKCMIM
Sbjct: 52  VWPDLEPSWRLVLATVIGFLGSACGTVGGVGGGGIFVPMLNLLLGFDTKSAAALSKCMIM 111

Query: 130 GAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIV 189
           GA+ S+V+YN+++ HPT ++PI+DYDLALL QPMLMLGI++GVA +V+F  W++TVL+I+
Sbjct: 112 GASASSVWYNVRVPHPTKEVPILDYDLALLFQPMLMLGITVGVALSVVFPYWLITVLIII 171

Query: 190 LFIGTSTKAFLKGVETWKRETILKKEAAR-------CLGSNGAGAGEVEYKSLPSGPRSG 242
           LFIGTS+++F KG+E W+ ETI K+E  +         G +  G  + +Y+ L       
Sbjct: 172 LFIGTSSRSFFKGIEMWREETIFKREKTKQRATLVDSHGEDKTGIIDTKYEPL------I 225

Query: 243 PQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSL 302
           P+++    ++  L N+ WK + +L+ VWVAFL +Q+ KN   +CS  YWVL  LQ P++L
Sbjct: 226 PKEEKSTIQILCL-NLRWKRILVLIVVWVAFLLVQVIKNDVEACSPWYWVLFGLQFPIAL 284

Query: 303 VVSLYEAISLYK-GRRVIASKG-----DDGKSFRVFQLVSYCAFG-VLAGIVGGLLGLGG 355
           +V  YEA+ LYK  +R ++++      +    +    L ++C+   ++ GIVGGLLG GG
Sbjct: 285 LVFGYEAVKLYKEHKRRMSTRNLECICEASIEWTAMNL-AFCSLCGIVGGIVGGLLGSGG 343

Query: 356 GFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAF 415
           GF++GPL LE+GV PQV+SATATF M FSSS+SVVE+YLLKRFP+PYALY  +V+ +A F
Sbjct: 344 GFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPIPYALYLTSVSVLAGF 403

Query: 416 VGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
            GQ  VR+LI  LGRAS+I+FIL+ +IF SA+++G VGI N I   + HE+MGF   C 
Sbjct: 404 WGQFFVRRLIACLGRASIIVFILSGVIFASALTMGVVGIENSIQMINNHEFMGFLGFCS 462


>gi|115455071|ref|NP_001051136.1| Os03g0726500 [Oryza sativa Japonica Group]
 gi|41469272|gb|AAS07154.1| expressed protein [Oryza sativa Japonica Group]
 gi|50428734|gb|AAT77085.1| expressed protein [Oryza sativa Japonica Group]
 gi|108710852|gb|ABF98647.1| expressed protein [Oryza sativa Japonica Group]
 gi|113549607|dbj|BAF13050.1| Os03g0726500 [Oryza sativa Japonica Group]
 gi|215697621|dbj|BAG91615.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 475

 Score =  363 bits (933), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 213/421 (50%), Positives = 289/421 (68%), Gaps = 11/421 (2%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWP+L F W +V+ T++GF G+AFG+VGGVGGGGIFVPML+L++GFD KSA A+
Sbjct: 55  SSRSARVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLLVGFDTKSAAAL 114

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + P+IDY LALL QPMLMLGI+IGV  +VIF  W++
Sbjct: 115 SKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYWLI 174

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPR--- 240
           TVL+I+LFIGTS+++F KG+  WK ET ++ E       + +     +    PS      
Sbjct: 175 TVLIIILFIGTSSRSFYKGILMWKDETRIQMETREREEESKSSCAARDVVIDPSCEEPLL 234

Query: 241 SGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPV 300
             PQ        T L N+ WK + +L+ VW +FL LQI KN++ SCS  YWV+++LQ+PV
Sbjct: 235 CQPQPKEKSALETFLFNLRWKNILVLMTVWSSFLVLQIFKNNSQSCSTFYWVINILQVPV 294

Query: 301 SLVVSLYEAISLYKGRRVIASKG------DDGKSFRVFQLVSYCAFG-VLAGIVGGLLGL 353
           +L V L+E + L +  R     G      +    +   QL+ +CAF  +L G VGGLLG 
Sbjct: 295 ALSVFLWEGVQLCRESRARRMDGNWECVCEASIEWSPAQLI-FCAFCGLLGGTVGGLLGS 353

Query: 354 GGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIA 413
           GGGFI+GPL LELG  PQV+SATATF M FSSS+SVVE+Y L RFP+P+A+Y + ++ +A
Sbjct: 354 GGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLNRFPIPFAVYLICISILA 413

Query: 414 AFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            F GQ +VRKL+ +L RASLI+FIL+ +IF SA+++G VG    I   + HEYMGF + C
Sbjct: 414 GFWGQSLVRKLVHVLKRASLIVFILSSVIFASALTMGVVGTQKSISMINNHEYMGFLDFC 473

Query: 474 K 474
           +
Sbjct: 474 E 474


>gi|255575461|ref|XP_002528632.1| conserved hypothetical protein [Ricinus communis]
 gi|223531921|gb|EEF33735.1| conserved hypothetical protein [Ricinus communis]
          Length = 449

 Score =  363 bits (932), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 232/436 (53%), Positives = 295/436 (67%), Gaps = 25/436 (5%)

Query: 48  DSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVP 107
           D +  +K    L     S    VWPEL+F W IVL T+IGF G+A G+VGGVGGGGIFVP
Sbjct: 26  DHHPYVKNSTLLSSHYISTTERVWPELEFSWRIVLATVIGFLGSACGTVGGVGGGGIFVP 85

Query: 108 MLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLG 167
           ML+LI+GFD KSA AISKCMIMGA+ S+V+YNL++ HPT ++PI+DYDLALL QPMLMLG
Sbjct: 86  MLTLIVGFDTKSAAAISKCMIMGASASSVWYNLRVPHPTKEVPILDYDLALLFQPMLMLG 145

Query: 168 ISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAAR----CLGSN 223
           I++GVA +V+F  W++TVL+I+LFIGTS+++F KGVE WK ETILKKE A+     + S 
Sbjct: 146 ITVGVASSVVFPYWLITVLIIILFIGTSSRSFFKGVEMWKEETILKKELAKQQEAVVNSR 205

Query: 224 GAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHT 283
           G    + EY+ L       P+++  E    ++E      + L V     FL LQ+ KN  
Sbjct: 206 GELLIDTEYEPL------VPKEEKSE----MVEKAFCAAICLSV-----FLLLQVIKNDV 250

Query: 284 ASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGD-----DGKSFRVFQLVSYC 338
           A+CS  YWVL  LQ PV+L V  YEA+ LYK  +   S G+     +         +S+C
Sbjct: 251 ATCSKWYWVLFCLQFPVALAVFGYEAVKLYKEHKKRISTGNTESICEASIAWTPMHISFC 310

Query: 339 AFG-VLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           A   +L G VGGLLG GGGFI+GPL LE+GV PQV+SATATF M FSSS+SVVE+YLLKR
Sbjct: 311 ALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKR 370

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNM 457
           FP+PYALY   V+ +A F GQ  VRKLI +L R SLI+FIL+ +IF SAI++G VG    
Sbjct: 371 FPMPYALYLTGVSVLAGFWGQFFVRKLITILKRGSLIVFILSGVIFASAITMGVVGTEKS 430

Query: 458 IGKFHRHEYMGFENLC 473
           I   + HE+MGF   C
Sbjct: 431 IRMINNHEFMGFLGFC 446


>gi|125545587|gb|EAY91726.1| hypothetical protein OsI_13369 [Oryza sativa Indica Group]
          Length = 475

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 213/421 (50%), Positives = 289/421 (68%), Gaps = 11/421 (2%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWP+L F W +V+ T++GF G+AFG+VGGVGGGGIFVPML+L++GFD KSA A+
Sbjct: 55  SSRSARVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLLVGFDTKSAAAL 114

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + P+IDY LALL QPMLMLGI+IGV  +VIF  W++
Sbjct: 115 SKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYWLI 174

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPR--- 240
           TVL+I+LFIGTS+++F KG+  WK ET ++ E       + +     +    PS      
Sbjct: 175 TVLIIILFIGTSSRSFYKGILMWKDETRIQMETREREEESKSSCAARDVVIDPSCEEPLL 234

Query: 241 SGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPV 300
             PQ        T L N+ WK + +L+ VW +FL LQI KN++ SCS  YWV+++LQ+PV
Sbjct: 235 CQPQPKEKSALETFLFNLRWKNILVLMTVWSSFLVLQIFKNNSQSCSTFYWVINILQVPV 294

Query: 301 SLVVSLYEAISLYKGRRVIASKG------DDGKSFRVFQLVSYCAFG-VLAGIVGGLLGL 353
           +L V L+E + L +  R     G      +    +   QL+ +CAF  +L G VGGLLG 
Sbjct: 295 ALSVFLWEGVQLCRESRARRMDGNWECVCEASIEWSPAQLI-FCAFCGLLGGTVGGLLGS 353

Query: 354 GGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIA 413
           GGGFI+GPL LELG  PQV+SATATF M FSSS+SVVE+Y L RFP+P+A+Y + ++ +A
Sbjct: 354 GGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLNRFPIPFAVYLICISILA 413

Query: 414 AFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            F GQ +VRKL+ +L RASLI+FIL+ +IF SA+++G VG    I   + HEYMGF + C
Sbjct: 414 GFWGQSLVRKLVHVLKRASLIVFILSSVIFASALTMGVVGTQKSISMINNHEYMGFLDFC 473

Query: 474 K 474
           +
Sbjct: 474 E 474


>gi|326524532|dbj|BAK00649.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 474

 Score =  362 bits (928), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 213/425 (50%), Positives = 299/425 (70%), Gaps = 19/425 (4%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWP+L+F W +V+ T++GF G+AFG+VGGVGGGGIFVP+L+L++GFD KSA A+
Sbjct: 53  SSRSARVWPDLEFNWRLVVATVVGFLGSAFGTVGGVGGGGIFVPLLNLVLGFDTKSAAAL 112

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + P++DY LALL QPMLMLGI+IGV  +V+F  W++
Sbjct: 113 SKCMIMGASASSVWYNLQVSHPTKEAPVLDYKLALLFQPMLMLGITIGVELSVVFPYWLI 172

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRET-ILKKEAARCLGSNGAGAGEVEYKSL------P 236
           TVL+I+LF+GTS+++F KG+  W+ ET IL +   R   S  +GA       L      P
Sbjct: 173 TVLIIILFVGTSSRSFYKGILMWREETRILMETRERETQSKSSGASATNDVVLDPSYAEP 232

Query: 237 SGPRSGP-QKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDL 295
             P+S P QK  +E   T++ N+ WK + +L+ VW +FL LQI KN++ +CS  YWV+++
Sbjct: 233 LLPQSKPTQKSGLE---TLMFNLRWKHILVLLAVWSSFLVLQILKNNSKTCSTFYWVINI 289

Query: 296 LQIPVSLVVSLYEAISLYKGRRVIASKG------DDGKSFRVFQLVSYCAFG-VLAGIVG 348
            QIP ++ V +++A+ L +  R     G      +    +   QL+ +CAF  +L G VG
Sbjct: 290 FQIPAAVSVFVWKAVQLCRESRARRMNGNLECVCEANIEWSPTQLI-FCAFCGLLGGTVG 348

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVA 408
           GLLG GGGFI+GPL LELG  PQV+SATATF M FSSS+SVVE+Y L RFP+PYALY + 
Sbjct: 349 GLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLGRFPIPYALYLIF 408

Query: 409 VATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMG 468
           ++ +A F GQ +VRK++++L RASLI+FIL+ +IF SA+++G VG    I   ++HEYMG
Sbjct: 409 ISILAGFWGQCLVRKIVLVLKRASLIVFILSSVIFASALTMGVVGTEKSISMINKHEYMG 468

Query: 469 FENLC 473
           F + C
Sbjct: 469 FLSFC 473


>gi|242033121|ref|XP_002463955.1| hypothetical protein SORBIDRAFT_01g009520 [Sorghum bicolor]
 gi|241917809|gb|EER90953.1| hypothetical protein SORBIDRAFT_01g009520 [Sorghum bicolor]
          Length = 482

 Score =  354 bits (908), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 213/421 (50%), Positives = 290/421 (68%), Gaps = 11/421 (2%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWP+L F W +V+ T++GF G+AFG+VGGVGGGGIFVPML+L++GFD KSA A+
Sbjct: 63  SSRSPRVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLVVGFDTKSAAAL 122

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + P+IDY LALL QPMLMLGI+IGV  +VIF  W++
Sbjct: 123 SKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYWLI 182

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRET-ILKKEAARCLGSNG-AGAGEVEYKSLPSGP-R 240
           TVL+I+LF+GTS+++F KG+  W+ ET IL +   R   S     AG+V        P  
Sbjct: 183 TVLIIILFVGTSSRSFYKGILMWREETRILMETREREEQSKSTCAAGDVIIDPSYEEPLL 242

Query: 241 SGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPV 300
           + PQ        T L N+ WK + +L+ VW +FL LQ+ KN+  SCS  YWV+++LQ+PV
Sbjct: 243 TQPQPKKKSALETFLFNLRWKNILVLMSVWSSFLVLQVLKNNAKSCSTFYWVVNILQVPV 302

Query: 301 SLVVSLYEAISLYKGRRVIASKG------DDGKSFRVFQLVSYCAFG-VLAGIVGGLLGL 353
           ++ V L+EA+ L +  R     G      +    +   QL+ +CAF  +L G VGGLLG 
Sbjct: 303 AVSVFLWEAMQLCRESRARRMNGNLECVCEASIEWSPAQLI-FCAFCGLLGGTVGGLLGS 361

Query: 354 GGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIA 413
           GGGFI+GPL LELG  PQV+SATATF M FSSS+SVVE+Y L RFP+P+A Y + ++ +A
Sbjct: 362 GGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLHRFPLPFAGYLIFISILA 421

Query: 414 AFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            F GQ +VRK++ +L RAS+I+FIL+ +IF SA+++G VG    I     HEYMGF + C
Sbjct: 422 GFWGQCLVRKIVHVLKRASVIVFILSSVIFASALTMGVVGTQKSISMIKNHEYMGFLDFC 481

Query: 474 K 474
           +
Sbjct: 482 E 482


>gi|293331891|ref|NP_001168709.1| uncharacterized protein LOC100382501 [Zea mays]
 gi|223950337|gb|ACN29252.1| unknown [Zea mays]
 gi|238009716|gb|ACR35893.1| unknown [Zea mays]
 gi|414872565|tpg|DAA51122.1| TPA: hypothetical protein ZEAMMB73_068409 [Zea mays]
          Length = 479

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 215/424 (50%), Positives = 296/424 (69%), Gaps = 18/424 (4%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWP+L F W +V+ T++GF G+AFG+VGGVGGGGIFVPML+L++GFD KSA A+
Sbjct: 61  SSRSPRVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLVVGFDTKSAAAL 120

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + P+IDY LALL QPMLMLGI+IGV  +VIF  W++
Sbjct: 121 SKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYWLI 180

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRET-ILKKEAARCLGSNGAGAG-----EVEYKSLPS 237
           TVL+I+LF+GTS+++F KG+  W+ E+ IL +   R   S    A      +  Y   P 
Sbjct: 181 TVLIIILFVGTSSRSFYKGILMWREESRILMETRGREEQSKSTCAASDVVIDPSYGE-PL 239

Query: 238 GPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQ 297
             +  P+K A E   T + N+ WK + +L+ VW  FL LQ+ KN++ SCS  YWV+++LQ
Sbjct: 240 LTQPQPKKSAWE---TFMFNLRWKNILVLMSVWSLFLVLQVLKNNSKSCSSFYWVINVLQ 296

Query: 298 IPVSLVVSLYEAISLYK---GRRV---IASKGDDGKSFRVFQLVSYCAFG-VLAGIVGGL 350
           +PV++ V L+EA+ L +    RR+   +    +    +   QL+ +CAF  +L G VGGL
Sbjct: 297 VPVAVSVFLWEAMQLCRESHARRMNGNLECVCEASIEWSPAQLI-FCAFCGLLGGTVGGL 355

Query: 351 LGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVA 410
           LG GGGFI+GPL LELG  PQV+SATATF M FSSS+SVVE+Y L RFP+P+A Y + ++
Sbjct: 356 LGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLHRFPLPFAGYLIFIS 415

Query: 411 TIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFE 470
            +A F GQ +VRK++ +L RASLI+FIL+ +IFVSA+++G VG    I   + HEYMGF 
Sbjct: 416 ILAGFWGQCLVRKIVHVLKRASLIVFILSSVIFVSALTMGVVGTQKSISMINNHEYMGFL 475

Query: 471 NLCK 474
           N C+
Sbjct: 476 NFCE 479


>gi|357117223|ref|XP_003560372.1| PREDICTED: uncharacterized protein LOC100844938 [Brachypodium
           distachyon]
          Length = 479

 Score =  344 bits (883), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 213/425 (50%), Positives = 298/425 (70%), Gaps = 19/425 (4%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWP+L+F W +V+ TI+GF G+AFG+VGGVGGGGIFVP+L+L++GFD KSA A+
Sbjct: 60  SSRSARVWPDLEFNWRLVVATIVGFLGSAFGTVGGVGGGGIFVPLLNLVLGFDTKSAAAL 119

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + P++DY LALL QPMLMLGI+IGV  +V+F  W++
Sbjct: 120 SKCMIMGASASSVWYNLQVSHPTKEAPVLDYKLALLFQPMLMLGITIGVELSVVFPYWLI 179

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRET-ILKKEAARCLGSNGAGAGE-----VEYKSLPS 237
           T+L+I+LF+GTS+++F KG+  W+ ET IL +   R   S  + A         Y+  P 
Sbjct: 180 TILIIILFVGTSSRSFYKGILMWREETRILMETQEREAQSKSSSAVSDVVLGTSYEE-PL 238

Query: 238 GPRSGP-QKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLL 296
            P+  P +K  +E   T+L N+ WK + +L+ VW +FL LQI KN++ +CS  YWV+++L
Sbjct: 239 LPQPKPTEKSGLE---TLLFNLRWKNILVLMAVWSSFLFLQILKNNSKTCSTFYWVINIL 295

Query: 297 QIPVSLVVSLYEAISLYKGRRVIASKG------DDGKSFRVFQLVSYCAF-GVLAGIVGG 349
           Q+P ++ V L++A+ L +  R     G      +    +   QLV +CAF G+L G VGG
Sbjct: 296 QVPAAVSVFLWKAVQLCRESRARRMNGNLECVCEASIEWSPTQLV-FCAFCGLLGGTVGG 354

Query: 350 LLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAV 409
           LLG GGGFI+GPL LELG  PQV+SATATF M FSSS+SVVE+Y L RFP+PYA Y + +
Sbjct: 355 LLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLHRFPIPYAAYLIFI 414

Query: 410 ATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGF 469
           + +A F GQ +VRK++++L RASLI+FIL+ +IF SA+++G VG    I   + HEYMGF
Sbjct: 415 SILAGFWGQCLVRKIVLVLKRASLIVFILSSVIFASAVTMGVVGTEKSISMINNHEYMGF 474

Query: 470 ENLCK 474
              C+
Sbjct: 475 LGFCE 479


>gi|242044420|ref|XP_002460081.1| hypothetical protein SORBIDRAFT_02g022480 [Sorghum bicolor]
 gi|241923458|gb|EER96602.1| hypothetical protein SORBIDRAFT_02g022480 [Sorghum bicolor]
          Length = 383

 Score =  343 bits (881), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 209/415 (50%), Positives = 270/415 (65%), Gaps = 79/415 (19%)

Query: 74  LKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAV 133
           ++FGW IVLG++IGFFGAAFGSVGGVGGGG+F+PML+LIIGFDPKS+ A+SKCMIMGA+V
Sbjct: 33  MEFGWRIVLGSLIGFFGAAFGSVGGVGGGGVFLPMLALIIGFDPKSSAAMSKCMIMGASV 92

Query: 134 STVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIG 193
           STVY NLKL+HP+LD+P+IDYDLA+LIQPMLMLGISIGV FNV+F DW+VT LLI+LF+ 
Sbjct: 93  STVYCNLKLKHPSLDIPLIDYDLAMLIQPMLMLGISIGVIFNVVFPDWLVTALLIILFLV 152

Query: 194 TSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVT 253
           TSTKAFLKGVETWK+ETI K+++                                    +
Sbjct: 153 TSTKAFLKGVETWKKETIKKRKS------------------------------------S 176

Query: 254 ILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLY 313
           +L+N+ WKE GLL FVW+AFLGLQIA  ++A CS +YW L+ LQIPV++ V++YEA  L 
Sbjct: 177 LLKNVYWKEFGLLAFVWLAFLGLQIANKYSAYCSALYWALNSLQIPVAVGVTMYEAHGLA 236

Query: 314 KGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVS 373
            G++V++SKG    + +  QL  YC FG++AG+VGGLLG+GGGFIMGPLFLELG+PPQ +
Sbjct: 237 TGKKVLSSKGSQQNALKSCQLFVYCQFGIIAGLVGGLLGVGGGFIMGPLFLELGIPPQPA 296

Query: 374 -------SATATFAMTFS------SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
                  SA A F              +   Y++LK   +    Y      + +++  H 
Sbjct: 297 LCEEVDQSARACFTHHLHLGFHDLCQRNFSWYHILKLAEINQNPY----GALDSYINNH- 351

Query: 421 VRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
                                     +  GGVGISN++ K   H+YMGFE+LC Y
Sbjct: 352 -------------------------RLPAGGVGISNIVHKIECHQYMGFESLCNY 381


>gi|226503253|ref|NP_001145740.1| uncharacterized protein LOC100279247 [Zea mays]
 gi|219884241|gb|ACL52495.1| unknown [Zea mays]
 gi|414885095|tpg|DAA61109.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
          Length = 291

 Score =  340 bits (871), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 172/254 (67%), Positives = 216/254 (85%), Gaps = 4/254 (1%)

Query: 47  QDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFV 106
           ++ + L K  N +W  +G+ Y HVWP ++FGW+IVLG++IGFFGAAFGSVGGVGGGGIFV
Sbjct: 40  EEVSLLRKVANLMWNGDGNSYQHVWPPMEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFV 99

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PML+LIIGFDPKS+TAISKCMIMGA+VSTVYYNLKL+HPTLDMP+IDYDLA+L+QPMLML
Sbjct: 100 PMLTLIIGFDPKSSTAISKCMIMGASVSTVYYNLKLKHPTLDMPVIDYDLAVLMQPMLML 159

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           GISIGV FNVIF DW+VTVLLI+LF+GTSTKAFLKG+ETWK+ET++++EAA+ L    AG
Sbjct: 160 GISIGVIFNVIFPDWLVTVLLIILFLGTSTKAFLKGIETWKKETVIQREAAKLL-EQTAG 218

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPE--VTILENINWKELGLLVFVWVAFLGLQIAKNHTA 284
             E EY +LP+GP     K A+  +   ++++NI+WK++GLL FVWVAFL LQ+ KN+TA
Sbjct: 219 E-EAEYAALPTGPDVAANKKALTSDEAPSLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTA 277

Query: 285 SCSIVYWVLDLLQI 298
           +CS  YWVL+LLQ+
Sbjct: 278 TCSPWYWVLNLLQV 291


>gi|217075755|gb|ACJ86237.1| unknown [Medicago truncatula]
          Length = 238

 Score =  333 bits (854), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 174/245 (71%), Positives = 208/245 (84%), Gaps = 8/245 (3%)

Query: 74  LKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAV 133
           ++FGW+IVLG+ IGF GAAFGSVGGVGGGGIFVPMLSLIIGFDPKS+TAISKCMIMGAA+
Sbjct: 1   MEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAISKCMIMGAAL 60

Query: 134 STVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIG 193
           STVYYNL+LRHPTLDMPIIDYDLALLIQPMLMLGISIGV FNV+F DW+VT+LLIVLF+G
Sbjct: 61  STVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVTILLIVLFLG 120

Query: 194 TSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVT 253
           TSTKAF KG+ETW +ETI+KKEAAR   SNG+G    EYK+LP+GP    +K   E +V+
Sbjct: 121 TSTKAFFKGLETWNKETIMKKEAARRQESNGSG----EYKALPTGPNVAIEK---ENKVS 173

Query: 254 ILENINWKELGLLVFVWVAFLGLQIAK-NHTASCSIVYWVLDLLQIPVSLVVSLYEAISL 312
           I+EN+ WKE GLL FVW++FL LQIAK N+T +CS  YW+L+LLQIP+++ V+ YE +  
Sbjct: 174 IIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQIPIAVGVTAYEQLHC 233

Query: 313 YKGRR 317
            +G  
Sbjct: 234 SQGEE 238


>gi|388514887|gb|AFK45505.1| unknown [Lotus japonicus]
          Length = 290

 Score =  320 bits (821), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 175/284 (61%), Positives = 216/284 (76%)

Query: 193 GTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEV 252
           GTS KAF KGVETWK+ETI+K+E    L S    +G+ +Y  LP  P  GPQK   + EV
Sbjct: 6   GTSIKAFFKGVETWKKETIIKEETVMLLESTAPCSGDEQYNLLPGSPDGGPQKKGRKHEV 65

Query: 253 TILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISL 312
           +IL NI WK+ GLL FVW+A+L LQIAK +TA+CS   W++ LLQIPVS+ V LY+AI L
Sbjct: 66  SILGNIYWKQFGLLSFVWLAYLLLQIAKTYTATCSTTCWMIILLQIPVSVGVFLYKAIGL 125

Query: 313 YKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
           Y+G  VI++KGD    +++  L+   A  +LAGI+GGLLG+GGGF+MGPLFLELG+ PQV
Sbjct: 126 YQGWSVISAKGDQAIQWKLHYLILTSACALLAGIIGGLLGIGGGFVMGPLFLELGIAPQV 185

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
           +SATATF MTFS+S+SVV+YYLL RFPVPYALY   VA IAA+ GQ ++ KL+ +  RAS
Sbjct: 186 ASATATFGMTFSASISVVQYYLLNRFPVPYALYLTLVAAIAAYRGQFIIDKLVNMFQRAS 245

Query: 433 LIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           LIIF LAF IFVSAI LGG GIS+MIG+  R EYMGFE+LCKYD
Sbjct: 246 LIIFTLAFTIFVSAIVLGGEGISDMIGQIQRSEYMGFEDLCKYD 289


>gi|147812176|emb|CAN61520.1| hypothetical protein VITISV_033969 [Vitis vinifera]
          Length = 382

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 175/305 (57%), Positives = 202/305 (66%), Gaps = 60/305 (19%)

Query: 13  SVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQ--DSNYLLKAVNFLWQPEGSGYHHV 70
           S   I  +F LA   V  ER +K E   FN T+    +SNYL K VNFLWQ + SGY HV
Sbjct: 72  SDFWIIWSFMLASAFVSGERSIKHEASTFNVTKEAGFNSNYLSKVVNFLWQSDRSGYQHV 131

Query: 71  WPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMG 130
           WPE++FGW+IV+G+IIGFFGAAFGSVGGVGGGGIFVPMLSL+IGFDPKSATA+SKCMIMG
Sbjct: 132 WPEMEFGWQIVVGSIIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATALSKCMIMG 191

Query: 131 AAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVL 190
           AA STVYYNLKLRHPTLDMPIIDYDLALL QPMLM+GISIGVAFNV+FADWM        
Sbjct: 192 AAGSTVYYNLKLRHPTLDMPIIDYDLALLFQPMLMMGISIGVAFNVLFADWM-------- 243

Query: 191 FIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEP 250
                                   EAA+ LG+NG G  EVEYK LPSGP +G Q    + 
Sbjct: 244 ------------------------EAAKRLGTNGNGTEEVEYKPLPSGPSNGTQXATNKS 279

Query: 251 EVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAI 310
                     KE                 +N+TA+CS+ YWVL+ +QIPVS+ VSLYEA+
Sbjct: 280 ----------KE----------------XENNTATCSMAYWVLNFMQIPVSVGVSLYEAV 313

Query: 311 SLYKG 315
           SL +G
Sbjct: 314 SLVQG 318


>gi|302816689|ref|XP_002990023.1| hypothetical protein SELMODRAFT_184956 [Selaginella moellendorffii]
 gi|300142334|gb|EFJ09036.1| hypothetical protein SELMODRAFT_184956 [Selaginella moellendorffii]
          Length = 356

 Score =  283 bits (724), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 176/377 (46%), Positives = 240/377 (63%), Gaps = 37/377 (9%)

Query: 111 LIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISI 170
           L+ GFD K+A A+SKCMI     STV YNL LR    +  +IDYDLALL QPML+LGIS+
Sbjct: 2   LVCGFDAKTAPALSKCMIFAQCASTVAYNLGLRSSQTNSSLIDYDLALLFQPMLLLGISV 61

Query: 171 GVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEV 230
           GV FNV+F +W++T+LLIV+ +  +++AF KG+ETWK+ET  K+               +
Sbjct: 62  GVTFNVLFPNWLITLLLIVVSLAMASRAFSKGLETWKKETNEKRL--------------I 107

Query: 231 EYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVY 290
              SL  GP +    D++         + WK+L LL  VW    GLQ+ K +TA+CS  +
Sbjct: 108 LEGSLTPGPANFTTLDSL------WTTVEWKKLSLLFAVWCLITGLQVLKAYTANCSTAF 161

Query: 291 WVLDLLQIPVSLVVSLYEAI-----SLYKGRRVIASKGDD---------GKSFRVFQLVS 336
           W+ ++LQ PV+L V++ +A+     S YK R  +  +  +         G++  VF+   
Sbjct: 162 WIYNILQAPVTLAVTVTQALRMREHSSYKLRESLLDESSESSSASLKAAGRALDVFR--- 218

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           Y  FGVLAG + GLLG+GGG  +GP+ LE GVPPQ +SATAT AM FSSS+SVVE+Y L 
Sbjct: 219 YVFFGVLAGSIAGLLGVGGGATLGPIMLEFGVPPQTASATATLAMLFSSSLSVVEFYFLG 278

Query: 397 RFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISN 456
           R  V YALYF A+  ++AF+ Q +V+KL+ LLGR S+I F L F+I VS ISLGGVGI +
Sbjct: 279 RIKVSYALYFGAICIVSAFLSQKLVQKLLNLLGRTSVITFTLVFVIVVSVISLGGVGIVD 338

Query: 457 MIGKFHRHEYMGFENLC 473
            I +  R +YMGF +LC
Sbjct: 339 SIQELKRGKYMGFGSLC 355


>gi|255645402|gb|ACU23197.1| unknown [Glycine max]
          Length = 261

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/236 (61%), Positives = 178/236 (75%), Gaps = 10/236 (4%)

Query: 23  LAFVCVKAERGLKIE-TLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIV 81
           +A+    AER LK + +  F   ERQ    +   ++F           VWPE++FGW IV
Sbjct: 27  MAYNVSLAERVLKDQKSESFVAKERQ--GVMNSIIDFFRNDGEPTNDRVWPEMEFGWRIV 84

Query: 82  LGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLK 141
           +G+I+GFFGAA GSVGGVGGGGIF+PML+L+IGFD KS+TA+SKCMIMGAAVSTVYYNL+
Sbjct: 85  VGSIVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCMIMGAAVSTVYYNLR 144

Query: 142 LRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLK 201
            RHPTLD+P+IDYDLALL QPMLMLGISIGVAFNV+FADWMVTVLLI+LFI TSTKA  K
Sbjct: 145 FRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIATSTKALFK 204

Query: 202 GVETWKRETILKKEAARCLGSNGAG--AGEVEYKSLPSGPRSGPQKDAMEPEVTIL 255
           G++TWK+ETI+KKEAA+ L S+ +     E +YKSLP+G       D  + EV +L
Sbjct: 205 GIDTWKKETIMKKEAAKMLESDSSPGYVSEEDYKSLPAG-----SADPRDEEVPLL 255


>gi|357440251|ref|XP_003590403.1| hypothetical protein MTR_1g061710 [Medicago truncatula]
 gi|355479451|gb|AES60654.1| hypothetical protein MTR_1g061710 [Medicago truncatula]
          Length = 558

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 139/265 (52%), Positives = 189/265 (71%), Gaps = 9/265 (3%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
           GSG   +WPEL+  W++VL T+IGFFG+A G+VGGVGGGGIFVPML+L++GFD KSA A+
Sbjct: 48  GSGTDRIWPELEPSWKLVLATVIGFFGSACGTVGGVGGGGIFVPMLNLLLGFDTKSAAAL 107

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT  +PI+DYDLALL QPMLMLGI++GV  +V+F  W++
Sbjct: 108 SKCMIMGASASSVWYNLRVAHPTKQVPILDYDLALLFQPMLMLGITVGVTLSVVFPFWLI 167

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAAR---CLGSNGA-GAGEVEYKSLPSGP 239
           TVL+I+LFIGTS ++F KG+E W+ ET+LKKE A     + SNG   A  V+ K  P  P
Sbjct: 168 TVLIIILFIGTSLRSFFKGIEMWRVETLLKKETAEQAALVNSNGEFKAVLVDTKFEPLIP 227

Query: 240 RSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIP 299
           +   QK  M+    +  N+ WK + +L+ +W +FL +QI K H   CS+ YWVL  LQ P
Sbjct: 228 KE--QKSTMQ---ILCFNLRWKRILVLLSIWFSFLLIQIIKTHVEDCSVWYWVLFGLQFP 282

Query: 300 VSLVVSLYEAISLYKGRRVIASKGD 324
           ++ +V  +EA+ LYK  R+    G 
Sbjct: 283 IAFLVFGFEAVKLYKDHRMRMGTGS 307



 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 59/131 (45%), Positives = 76/131 (58%), Gaps = 28/131 (21%)

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
           V+SATATF M FSSS+SV E+Y+LKRFP+PYALY + V+ +A F GQ  VRKL+  LGRA
Sbjct: 427 VASATATFVMMFSSSLSVFEFYILKRFPMPYALYLITVSVLAGFWGQFFVRKLVAFLGRA 486

Query: 432 SLIIFILAFMIFVSAISLGG----------------------------VGISNMIGKFHR 463
           S+I+FIL+ +IF SA+++G                             VG    I   H 
Sbjct: 487 SVIVFILSGVIFASALTMGNNGNLPQFADLYKMILAQISMLYIFISGIVGTEKSIEMIHN 546

Query: 464 HEYMGFENLCK 474
           HE+MGF   C 
Sbjct: 547 HEFMGFLGFCS 557


>gi|302808025|ref|XP_002985707.1| hypothetical protein SELMODRAFT_446339 [Selaginella moellendorffii]
 gi|300146616|gb|EFJ13285.1| hypothetical protein SELMODRAFT_446339 [Selaginella moellendorffii]
          Length = 392

 Score =  255 bits (652), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 191/402 (47%), Positives = 252/402 (62%), Gaps = 18/402 (4%)

Query: 80  IVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYN 139
           +VLG+++ F GA F +  GVGGGG+FVP+  L+IGFD K++ A+SK MIMG AV++  YN
Sbjct: 1   MVLGSVVAFLGAMFANAAGVGGGGLFVPLFRLLIGFDVKTSAALSKSMIMGGAVASALYN 60

Query: 140 LKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAF 199
           L  +HP LD P+IDYDLALLIQPML+LGISIGV  NV+F DW++TVLL+ +  G + K F
Sbjct: 61  LPSKHPVLDKPLIDYDLALLIQPMLLLGISIGVMCNVMFPDWILTVLLVAVLTGMAFKTF 120

Query: 200 LKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSG--PQKDAMEPEVTILEN 257
            KG  TW  E+   ++ A C  +   G          S    G   + +   PE  +  +
Sbjct: 121 NKGACTWSTES---EQKAHCCDTGEEGFVSNSRTEPSSSLEQGLLAKSEQCAPE-GLPSS 176

Query: 258 INWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRR 317
           I W  + LL  VW  +L LQ+ K+  A+C  +YW+L +LQ PVS+  +   AI +++   
Sbjct: 177 IKWANICLLCTVWTVYLILQLLKSGAATCGRLYWILTVLQAPVSIGAT---AIGVWR--- 230

Query: 318 VIASKGDDGKSF-----RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
            I  +GD GK          QL  Y  + VLAG+VGGLLG+GGG I+ PLFLELG+ PQV
Sbjct: 231 -IYRRGDFGKEKPASGPTCEQLFLYPLYAVLAGVVGGLLGIGGGMILAPLFLELGIIPQV 289

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
           +SAT TF + FSSSMSVVE+YLL R PV +A YF  +  IAA VG H  +  I   GR S
Sbjct: 290 TSATTTFIVVFSSSMSVVEFYLLGRLPVRFAAYFTTLCGIAALVGLHATKLFIRRYGRTS 349

Query: 433 LIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
            IIFILA +I  SAI LG +G    I ++   EYMGF +LC+
Sbjct: 350 FIIFILAAIIGSSAIILGIIGGFADIRRYIAGEYMGFHSLCE 391


>gi|16660290|gb|AAL27555.1|AF420410_1 hypothetical protein [Musa acuminata AAA Group]
          Length = 238

 Score =  248 bits (634), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 147/199 (73%), Positives = 173/199 (86%), Gaps = 1/199 (0%)

Query: 273 FLGLQIAK-NHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRV 331
           FL LQ+ K N+T++CS+ YW+L+LLQ+PVSL VS YEA+SLY+G+R+I+SKG +G  F V
Sbjct: 1   FLILQVLKQNYTSTCSLWYWILNLLQVPVSLGVSGYEAVSLYRGKRIISSKGLEGTDFTV 60

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVE 391
            QLV YC  GVLAG+VGGLLGLGGGFI+GP+FLELGVPPQVSSATATFAMTFSSSMSVVE
Sbjct: 61  IQLVFYCLIGVLAGVVGGLLGLGGGFILGPVFLELGVPPQVSSATATFAMTFSSSMSVVE 120

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGG 451
           YYLLKRFP+PYALYFV+VA +AAFVGQH+V++LI +LGRASLIIFILA  IF+SAISLGG
Sbjct: 121 YYLLKRFPIPYALYFVSVALVAAFVGQHLVKRLIEILGRASLIIFILASTIFISAISLGG 180

Query: 452 VGISNMIGKFHRHEYMGFE 470
           VGISNM+ K   HE  G  
Sbjct: 181 VGISNMVQKIQHHESWGLR 199


>gi|359482297|ref|XP_003632751.1| PREDICTED: uncharacterized protein LOC100240792 isoform 2 [Vitis
           vinifera]
          Length = 369

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 144/268 (53%), Positives = 188/268 (70%), Gaps = 15/268 (5%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWPELKF W IVL T+IGF G+AFG+VGGVGGGGIFVPML+LI+GFD KSA A+
Sbjct: 54  SSSSEKVWPELKFSWRIVLATVIGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDTKSAAAL 113

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + PIIDYDLALL QPMLMLGI++GVA +V+F  W++
Sbjct: 114 SKCMIMGASTSSVWYNLRVNHPTKEAPIIDYDLALLFQPMLMLGITVGVALSVVFPYWLI 173

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAAR----CLGSNGAGAGEVEYKSL-PSG 238
           T+L+I+LF+GTST++F K V+ W  ET+ KKE        + S+G    + EY  L P+ 
Sbjct: 174 TILIIILFMGTSTRSFFKAVQMWNEETLFKKELEEQRRTMVNSHGELLIDAEYDPLIPNE 233

Query: 239 PRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQI 298
            +S  Q         +  N+ WK + +LV VWV FL LQI KN +  CS  YWVL LLQ 
Sbjct: 234 EKSWIQ--------ILCSNLRWKRILVLVTVWVIFLLLQIFKNDSTVCSAWYWVLFLLQF 285

Query: 299 PVSLVVSLYEAISLYK--GRRVIASKGD 324
           PV++VV  YEA+ L+K   +R+I+   +
Sbjct: 286 PVAVVVFGYEAVKLHKEYKKRIISGNTE 313


>gi|302785269|ref|XP_002974406.1| hypothetical protein SELMODRAFT_414563 [Selaginella moellendorffii]
 gi|300158004|gb|EFJ24628.1| hypothetical protein SELMODRAFT_414563 [Selaginella moellendorffii]
          Length = 380

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 177/378 (46%), Positives = 234/378 (61%), Gaps = 18/378 (4%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           +FVP+  L+IGFD K++ A+SK MIMG AV++  YNL  +HP LD P+IDYDLALLIQPM
Sbjct: 13  LFVPLFRLLIGFDVKTSAALSKSMIMGGAVASALYNLPSKHPVLDKPLIDYDLALLIQPM 72

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSN 223
           L+LGISIGV  NV+F DW++TVLL+ +    + K F KG  TW  E+   ++ A C  + 
Sbjct: 73  LLLGISIGVMCNVMFPDWILTVLLVAVLTAMAFKTFNKGACTWSTES---EQKAHCCDTG 129

Query: 224 GAGAGEVEYKSLPSGPRSG--PQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKN 281
               G        S    G   + +   PE  +  +I W  + LL  VW  +L LQ+ K+
Sbjct: 130 EEDFGSNSRTEPSSSLEQGLLAKSEQCAPE-GLSSSIKWANICLLCTVWTVYLILQLLKS 188

Query: 282 HTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSF-----RVFQLVS 336
             A+C  +YW+L++LQ PVS+  +   AI +++    I  +GD GK          QL  
Sbjct: 189 GAATCGRLYWILNVLQAPVSIGAT---AIGVWR----IYRRGDFGKEKPAAGPTCEQLFL 241

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           Y  + VLAG+VGGLLG+GGG I+ PLFLELG+ PQV+SAT TF + FSSSMSVVE+YLL 
Sbjct: 242 YPLYAVLAGVVGGLLGIGGGMILAPLFLELGIIPQVTSATTTFIVVFSSSMSVVEFYLLG 301

Query: 397 RFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISN 456
           R PV +A YF  +  IAA VG H  +  I   GR S IIFILA +I  SAI LG +G   
Sbjct: 302 RLPVRFAAYFTTLCGIAALVGLHATKLFIRRYGRTSFIIFILAAIIGSSAIILGIIGGFA 361

Query: 457 MIGKFHRHEYMGFENLCK 474
            I ++   EYMGF +LC+
Sbjct: 362 DIRRYIAGEYMGFHSLCE 379


>gi|356538176|ref|XP_003537580.1| PREDICTED: uncharacterized protein LOC100817312 [Glycine max]
          Length = 198

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 132/194 (68%), Positives = 166/194 (85%)

Query: 280 KNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCA 339
           + ++  CSI +W+L+ LQ+P+++ V+L+EAI LYKG RVIASKG +  ++++ Q+  YC+
Sbjct: 2   QEYSKPCSIHFWLLNFLQVPIAVSVTLFEAIGLYKGTRVIASKGKEVTNWKIHQICLYCS 61

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
            G++AG+VGGLLGLGGGFI+GPLFLELG+PPQV+SAT+TFAM FSSSMSVV+YYLL RFP
Sbjct: 62  TGIMAGMVGGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLDRFP 121

Query: 400 VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIG 459
           VPYA YF  VATIAAF GQHVVRK+I++LGRAS+IIFILA  IF+SAISLGGVGI N+I 
Sbjct: 122 VPYASYFALVATIAAFTGQHVVRKVIVVLGRASIIIFILALTIFISAISLGGVGIENIIE 181

Query: 460 KFHRHEYMGFENLC 473
           K   HEYMGFE+LC
Sbjct: 182 KIESHEYMGFEDLC 195


>gi|302771113|ref|XP_002968975.1| hypothetical protein SELMODRAFT_170159 [Selaginella moellendorffii]
 gi|300163480|gb|EFJ30091.1| hypothetical protein SELMODRAFT_170159 [Selaginella moellendorffii]
          Length = 309

 Score =  228 bits (581), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 148/329 (44%), Positives = 209/329 (63%), Gaps = 37/329 (11%)

Query: 159 LIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAAR 218
           ++QPML+LGIS+GV FNV+F +W++T+LLIV+ +  +++AF KG+ETWK+ET  K+    
Sbjct: 3   VLQPMLLLGISVGVTFNVLFPNWLITLLLIVVSLAMASRAFSKGLETWKKETNEKRL--- 59

Query: 219 CLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQI 278
                      +   SL  GP +    D++         + WK+L LL  VW    GLQ+
Sbjct: 60  -----------ILEGSLTPGPANFTTLDSL------WTTVEWKKLSLLFAVWCLITGLQV 102

Query: 279 AKNHTASCSIVYWVLDLLQIPVSLVVSLYEAI-----SLYKGRRVIASKGDD-------- 325
            K +TA+CS  +W+ ++LQ PV+L V++ +A+     S YK R  +  +  +        
Sbjct: 103 LKAYTANCSTAFWIYNILQAPVTLAVTVTQALRMREHSSYKLRESLLDESSESSSASLKA 162

Query: 326 -GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
            G++  VF+   Y  FGVLAG + GLLG+GGG  +GP+ LE GVPPQ +SATAT AM FS
Sbjct: 163 AGRALDVFR---YVFFGVLAGSIAGLLGVGGGATLGPIMLEFGVPPQTASATATLAMLFS 219

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           SS+SVVE+Y L R  V YALYF A+  ++AF+ Q +V+KL+ LLGR S+I F L F+I V
Sbjct: 220 SSLSVVEFYFLGRIKVSYALYFGAICIVSAFLSQKLVQKLLNLLGRTSVITFTLVFVIVV 279

Query: 445 SAISLGGVGISNMIGKFHRHEYMGFENLC 473
           S ISLGGVGI + I +  R +YMGF +LC
Sbjct: 280 SVISLGGVGIVDSIQELKRGKYMGFGSLC 308


>gi|168044376|ref|XP_001774657.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673957|gb|EDQ60472.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 380

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 144/392 (36%), Positives = 230/392 (58%), Gaps = 21/392 (5%)

Query: 91  AAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL-DM 149
           AA  S GG+GGGG+F+P+ +L++ FDPK++ A+S  +I+G +V+ + +NL  RHP+L   
Sbjct: 1   AAICSAGGIGGGGLFIPVFNLLLLFDPKTSAALSNFVILGGSVANLIWNLPQRHPSLPHK 60

Query: 150 PIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE 209
             IDYD+AL++QP ++LGISIGV  NV+F  W++ V L ++    +T+++  G++ W+ E
Sbjct: 61  SAIDYDVALILQPNMLLGISIGVICNVMFPGWLIIVQLALILGFITTRSWKNGIKRWRIE 120

Query: 210 TILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFV 269
           + L   A +   +N    G +E  +L   P   P K  +EP           ++  L  V
Sbjct: 121 SQLA--ALKTEEANAESEGPLESDNL-HAPLLAPVKTPLEPSCL------GSKVMCLGLV 171

Query: 270 WVAFLGLQIAKNHTA--------SCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIAS 321
           WVAF  +Q+ +            SC + YW+L L QIP++  V+L+ A   ++  +  + 
Sbjct: 172 WVAFFVIQLLRGGKTTEGILPLKSCGVGYWLLTLTQIPLACFVTLWTA---FRHTQCSSD 228

Query: 322 KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 381
           K D G+  R   L  +    ++AG+ GG+LG+GGG  M PL +E GV  QV++AT  F +
Sbjct: 229 KQDQGEITRHRALTVFPGMALVAGLWGGMLGIGGGMFMNPLLIEAGVHVQVTAATTAFMV 288

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
            FSSS+SVVE++LL R PV +A+ F ++  +A+ +G  VV   I   GRAS+I+F +A  
Sbjct: 289 FFSSSLSVVEFWLLGRIPVDFAIVFSSICFVASLIGLTVVHHAISKYGRASIIVFSVAIA 348

Query: 442 IFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           + +SA+ + G G  N+  ++    YMGF   C
Sbjct: 349 LGISAVMMAGFGSLNVYRQYKDGAYMGFHTPC 380


>gi|222625716|gb|EEE59848.1| hypothetical protein OsJ_12424 [Oryza sativa Japonica Group]
          Length = 465

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 132/361 (36%), Positives = 195/361 (54%), Gaps = 22/361 (6%)

Query: 125 KCMIMGAAVSTVYYNLKLR-HPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           KC +M   +   Y+  + + +     P+   + A+   P L     +G + N    +   
Sbjct: 115 KCSLMLRTLPAKYFPRRCKSNAASHCPVFCTNSAIASAPPLKFSTKVGSSRNACVTN--- 171

Query: 184 TVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPR--- 240
                    GTS+++F KG+  WK ET ++ E       + +     +    PS      
Sbjct: 172 -------STGTSSRSFYKGILMWKDETRIQMETREREEESKSSCAARDVVIDPSCEEPLL 224

Query: 241 SGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPV 300
             PQ        T L N+ WK + +L+ VW +FL LQI KN++ SCS  YWV+++LQ+PV
Sbjct: 225 CQPQPKEKSALETFLFNLRWKNILVLMTVWSSFLVLQIFKNNSQSCSTFYWVINILQVPV 284

Query: 301 SLVVSLYEAISLYKGRRVIASKG------DDGKSFRVFQLVSYCAFGVLAGIVGGLLGLG 354
           +L V L+E + L +  R     G      +    +   QL+ +CAF  L G   G L   
Sbjct: 285 ALSVFLWEGVQLCRESRARRMDGNWECVCEASIEWSPAQLI-FCAFCGLLGGTVGGLLGS 343

Query: 355 GGFIMG-PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIA 413
           GG  +  PL LELG  PQV+SATATF M FSSS+SVVE+Y L RFP+P+A+Y + ++ +A
Sbjct: 344 GGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLNRFPIPFAVYLICISILA 403

Query: 414 AFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            F GQ +VRKL+ +L RASLI+FIL+ +IF SA+++G VG    I   + HEYMGF + C
Sbjct: 404 GFWGQSLVRKLVHVLKRASLIVFILSSVIFASALTMGVVGTQKSISMINNHEYMGFLDFC 463

Query: 474 K 474
           +
Sbjct: 464 E 464


>gi|356566531|ref|XP_003551484.1| PREDICTED: uncharacterized protein LOC100809802 [Glycine max]
          Length = 254

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 104/177 (58%), Positives = 131/177 (74%), Gaps = 6/177 (3%)

Query: 126 CMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTV 185
           C+++   VSTVYYNLKLRHPTL+MPIIDYDLALLIQPMLMLGI+IGV FNV+F+ W+VT+
Sbjct: 3   CLLVIYVVSTVYYNLKLRHPTLNMPIIDYDLALLIQPMLMLGITIGVVFNVVFSYWIVTI 62

Query: 186 LLIVLFIGTSTKAFLKGVETWKRETILKKEA-ARCLGSNGAGAGEVEYKSLPSGPRSGPQ 244
           LLIVLF+GTSTK+  KG+ETWK+ETI+KK+   +  G +    GE      P+  +  P 
Sbjct: 63  LLIVLFLGTSTKSLFKGIETWKKETIIKKDQFTKEAGKHPVSNGEFG----PNHIKVLPD 118

Query: 245 KDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAK-NHTASCSIVYWVLDLLQIPV 300
           +   +  V I+EN+ WKE GLLVFVWV+F  LQI K N+T +C   YW L+LLQI V
Sbjct: 119 QFLSDVSVAIIENVYWKEFGLLVFVWVSFPALQIGKENYTTTCPTFYWALNLLQIMV 175


>gi|168019909|ref|XP_001762486.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686219|gb|EDQ72609.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 390

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 139/404 (34%), Positives = 240/404 (59%), Gaps = 30/404 (7%)

Query: 85  IIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRH 144
           ++G  GA+  S GGVGGGG+F+P+ +L+IGFD KS+ A+S  MI G +++ V++N++  H
Sbjct: 2   VLGTLGASICSAGGVGGGGLFIPLFNLLIGFDAKSSAALSNFMIFGGSIANVWWNIQRDH 61

Query: 145 PTLD-MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGV 203
           P L   P+ID+D+ LL+QP ++LGISIGV  NV F  W +T+  I+     + ++F  G+
Sbjct: 62  PFLPGHPLIDFDVVLLLQPNMLLGISIGVICNVAFPSWFITLEFIITLGYITARSFRSGL 121

Query: 204 ETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEP---EVTILENINW 260
             W+ ET L++  ++           + +++L   P+ G   +A+ P   E     N  +
Sbjct: 122 VRWRNETPLERLDSK-----------LSWQNL--CPKEG--DEAIVPLLGESKPPRNFPY 166

Query: 261 KELGLLVFVWVAFLGLQIAKNHTASCSIV--------YWVLDLLQIPVSLVVSLYEAISL 312
            +L +L  VW AFL +Q+ +   +S SI+        YW+L  +Q+P++ +++ + A  L
Sbjct: 167 VKLLMLTLVWTAFLAVQLLRGGKSSDSILTLEPCGGAYWLLTTMQVPLAFLLTGWSAWHL 226

Query: 313 YKGRRVIASKGDDGKSFRVF---QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVP 369
           +       +   + + + V      V + +  +LAG +GG+LG+GGG I+ P+ +ELG+ 
Sbjct: 227 HHALEEKNTPELELEEWDVIGPRAFVLFPSMALLAGFLGGMLGIGGGMIINPMLIELGMH 286

Query: 370 PQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLG 429
           PQ+++AT  F + FSSS+SVV+++LL R  + +AL F A+  +++ +G HVV++ I   G
Sbjct: 287 PQLTAATTAFMVFFSSSLSVVQFWLLGRLQLDFALLFGAICFVSSLIGLHVVQQAIAKFG 346

Query: 430 RASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           R S+I+F ++ ++ +SA+S    G   +  +F   EYMGF   C
Sbjct: 347 RPSIIVFSVSIVLGISAVSTTICGGFEVWDQFTGGEYMGFHYPC 390


>gi|290979999|ref|XP_002672720.1| hypothetical protein NAEGRDRAFT_81115 [Naegleria gruberi]
 gi|284086299|gb|EFC39976.1| hypothetical protein NAEGRDRAFT_81115 [Naegleria gruberi]
          Length = 610

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 141/470 (30%), Positives = 233/470 (49%), Gaps = 69/470 (14%)

Query: 64  GSGYH-HVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATA 122
           G  YH  ++P+  F +   + T+I F G+   ++ G+GGGG+ VP++     F P+ A  
Sbjct: 150 GVCYHKDLFPQ--FSYLDAIATVISFLGSVLSTMAGIGGGGLIVPLMETAGQFPPQMAVG 207

Query: 123 ISKCMIMGAAVST-VYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADW 181
           ISK MI GA++S  +  +LK RHP  D P+I+YD+ALL+QP  ++G+ +GV  N +  +W
Sbjct: 208 ISKTMIFGASISNFIALSLK-RHPHADRPLINYDVALLLQPTSLVGVLVGVLLNSMTPNW 266

Query: 182 MVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGE--------VEYK 233
           ++ +L  ++    S   F++    W+ E+     AA+ +GSN   +          V   
Sbjct: 267 LIVLLSAIILTIVSLTTFVRAGRMWRAES-----AAKLVGSNNGSSANYHQITDNGVNND 321

Query: 234 SLPSG--------PRSGPQKDAMEP--------EVTILENINWKE--------------- 262
           ++ S         P++   ++ +E         E    ENI  +E               
Sbjct: 322 TIISDDEDETFDRPKNTNNENVLETSLLGEDELEQMEQENIEKEEKKQIARKVLHNEKST 381

Query: 263 ----LGLLVFVWVAFLGLQIAKN--------HTASCSIVYWVLDLLQIPVSLVVSLYEAI 310
               L +LV  WV    L I +         H   CS+ +WVL     P+ + VS    +
Sbjct: 382 PFTKLFVLVLCWVIVFILTIVRGGHGAPSVFHIDHCSVWHWVLFAAMFPIMIAVSYMVGV 441

Query: 311 SL---YKGRRVIASKG----DDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLF 363
            L   +K R  + S G         + V  ++ Y  F  L+G++ G+LG+GGG I  P+ 
Sbjct: 442 YLVRKHKERSALVSHGYRFVAGDPHWSVGTVILYPVFSALSGVIAGMLGVGGGMIKSPML 501

Query: 364 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRK 423
           L LG+ P V++ATA+F M F+SS+S +++ +L   P  Y L++      A  VGQ ++  
Sbjct: 502 LYLGLDPLVAAATASFMMLFTSSISSIQFVILGTVPFDYGLWYFVFGLFAGMVGQ-ILMH 560

Query: 424 LIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            +   GR S+++FI+AF+I VS + + G GI N +     + YMGF ++C
Sbjct: 561 FVFQKGRKSVLVFIVAFIIIVSTLCMTGFGIYNAVIMLQNNMYMGFHSIC 610


>gi|4415908|gb|AAD20139.1| unknown protein [Arabidopsis thaliana]
 gi|20197925|gb|AAM15316.1| unknown protein [Arabidopsis thaliana]
          Length = 288

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 96/153 (62%), Positives = 130/153 (84%)

Query: 65  SGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
           S    +WP+LKF W++VL T+I F G+A G+VGGVGGGGIFVPML+LI+GFD KSA AIS
Sbjct: 45  SATEKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVPMLTLILGFDTKSAAAIS 104

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVT 184
           KCMIMGA+ S+V+YN+++RHPT ++PI+DYDLALL QPML+LGI++GV+ +V+F  W++T
Sbjct: 105 KCMIMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLGITVGVSLSVVFPYWLIT 164

Query: 185 VLLIVLFIGTSTKAFLKGVETWKRETILKKEAA 217
           VL+I+LF+GTS+++F KG+E WK ET+LK   A
Sbjct: 165 VLIIILFVGTSSRSFFKGIEMWKEETLLKFPVA 197


>gi|3023070|gb|AAC12639.1| hypothetical protein [Mesembryanthemum crystallinum]
          Length = 132

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 92/140 (65%), Positives = 111/140 (79%), Gaps = 8/140 (5%)

Query: 151 IIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET 210
           IIDYDLALL QPMLMLGISIGVAFNVIFADWMVTVLLIVLF+GTSTKAF++G++TWK+ET
Sbjct: 1   IIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIVLFVGTSTKAFMRGIDTWKKET 60

Query: 211 ILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVW 270
           +++KEAA+   SNGA    VEY+ LP+GP    +      E  ILEN+ WKE+GLL FVW
Sbjct: 61  LMQKEAAKRAESNGADG--VEYEPLPAGPEKEDR------EAPILENVYWKEVGLLCFVW 112

Query: 271 VAFLGLQIAKNHTASCSIVY 290
           VAFL  +I   +TA+CS+ Y
Sbjct: 113 VAFLAFEIINENTATCSVAY 132


>gi|297737341|emb|CBI26542.3| unnamed protein product [Vitis vinifera]
          Length = 220

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 112/188 (59%), Positives = 128/188 (68%), Gaps = 19/188 (10%)

Query: 74  LKFGWEIVLGTIIGFFGAAFGSVGG---VGGGGIFVPMLSLIIGFDPKSATAIS-KCMIM 129
           +KFGW+I   +IIGFFGA FG  GG   V  GG FV       G   +   +I   CM+M
Sbjct: 1   MKFGWQIDDCSIIGFFGATFGVGGGGSFVTCGGSFV-----TCGRSSEVINSICWSCMVM 55

Query: 130 GAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIV 189
           GA  STVYYNLKLRHPTLDMPII YDLALL QPMLM+GISIGV FNV  ADWMVT+LLIV
Sbjct: 56  GAVGSTVYYNLKLRHPTLDMPIICYDLALLFQPMLMMGISIGVVFNVAVADWMVTILLIV 115

Query: 190 LFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQ---KD 246
           LF+GT TKAF+KGVETWK+ETI+K+           G  EVEYK LPSG  +G Q   + 
Sbjct: 116 LFLGTPTKAFIKGVETWKKETIMKRNKCN-------GTKEVEYKPLPSGLSNGTQNATRK 168

Query: 247 AMEPEVTI 254
             E EV+I
Sbjct: 169 FEELEVSI 176


>gi|290999413|ref|XP_002682274.1| predicted protein [Naegleria gruberi]
 gi|284095901|gb|EFC49530.1| predicted protein [Naegleria gruberi]
          Length = 399

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 128/409 (31%), Positives = 195/409 (47%), Gaps = 29/409 (7%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           +L TI  F G    +  G GGGG+FVP+L +   F P  A  IS  MI GA +  +    
Sbjct: 3   ILATITCFIGGVLSAASGTGGGGVFVPLLHVAGQFPPTLAIPISTLMIFGAGIINIATLS 62

Query: 141 KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFL 200
             RHP  D P+IDYD+AL+++P  +LG  IGV FN++F DW++ V +I+    TS   F 
Sbjct: 63  FKRHPHADRPLIDYDIALMMEPPTLLGTIIGVFFNMMFPDWLIVVFVILTLSITSFVMFK 122

Query: 201 KGVETWKRETILKK---EAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILEN 257
            G +  KRE   +K   E+      N        Y  L         +   E E    + 
Sbjct: 123 NGYKRLKRENEERKKLEESTTEENHNDNNNPTTVYDEL---------QQMYESE----KR 169

Query: 258 INWKELGLLVFVWVAFLGLQIAKN-HTA--------SCSIVYWVLDLLQIPVSLVVSLYE 308
             + ++ +L   W+    L + K  H A         CS+ YW+L  L  P+   ++L  
Sbjct: 170 TPFGKVIVLFICWMTVFTLSLLKGGHGAPSIIPSVTQCSVGYWILTALSFPLLGAMTLGI 229

Query: 309 AISLYKGRRVIASKG----DDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL 364
              L K        G    +    +  + +  Y    V AGI+  +LG+GGG +  PL L
Sbjct: 230 VFYLLKKHERKVQIGYQFVEGDVHWNKYNVTLYPIACVGAGILASMLGIGGGMVKSPLLL 289

Query: 365 ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            LG  P  S AT +F + F+SS+S V+Y +    PV Y L+F+A   +    GQ ++   
Sbjct: 290 ILGSDPVSSQATTSFMILFTSSISTVQYLIAGLLPVDYGLWFLACGILCGVFGQLILDLW 349

Query: 425 IILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           +   GR S++IFI+A +   +   +GG GI +++ +  +  YMGF + C
Sbjct: 350 LDKSGRRSIMIFIVAIVTLAATFLMGGAGIYDVVKQVEKGVYMGFRSPC 398


>gi|384245840|gb|EIE19332.1| hypothetical protein COCSUDRAFT_44684 [Coccomyxa subellipsoidea
           C-169]
          Length = 495

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 143/519 (27%), Positives = 232/519 (44%), Gaps = 69/519 (13%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLW 60
           MA      +I+RS +  +L F  AF+C  A+     E        R DS         L+
Sbjct: 1   MARQSGSLEIMRSALF-WLAFG-AFLCTLAQEDQTHEA------SRSDSG--------LF 44

Query: 61  QPEGSGYHHVWPELKFGW--EIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPK 118
           Q  G+G     P  KF W    +    +    A   +  G+GGG  +VP+L++++GFD K
Sbjct: 45  QILGTG-----PNTKFEWTPRTIFAAALACVCALLANSAGIGGGPFYVPLLNVVLGFDLK 99

Query: 119 SATAISKCMIMGAAVSTVYYNLKLRHPT-LDMPIIDYDLALLIQPMLMLGISIGVAFNVI 177
           +AT +S  ++  +AV++  Y L   HP     P++D+D+AL   P L+LG+S GV  NV+
Sbjct: 100 AATGLSHTIVATSAVASSIYGLIQTHPNDPSRPLVDFDVALTFIPALLLGVSFGVLLNVL 159

Query: 178 FADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKE----AARCLGSNGAGAG---EV 230
             DW+ T LL VL +    K   KG+  W++E    K+    A + LG      G   E 
Sbjct: 160 VPDWLQTALLTVLLLFVINKTVRKGITQWRQEQKAIKQKRSAAQQDLGDEDDEEGVLHEE 219

Query: 231 EYKSLPSGPRSGPQKDAMEPEVTILENIN--------------WKELGLLVFVWVAFLGL 276
            ++  PS   S P     + + T+ +  +                ++  +V +W  FL  
Sbjct: 220 RFERNPSKRFSAPHSSVHQLQTTLSQIFHRLPLFKALRKVCETCVQMAAVVALWAVFLAF 279

Query: 277 QIAKNHTASCSIVYWVLDLLQIPVSLVVSL----YEAISLY---------KGRRVIASKG 323
           Q  K    +C+  Y+ +   Q+   L V+     YEA             + R VI  + 
Sbjct: 280 QQLKARYPNCTWQYFTIFAAQVIFLLSVTAFCIWYEAKKAAGPHADEMDPELRTVILGEQ 339

Query: 324 DDGKS--------FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSA 375
            D ++         R+ ++V   AF   AG   GLLG+GG  I  P+ L+LGV PQV+++
Sbjct: 340 SDSETPIGTADTYKRLAKVVGVMAF---AGFTAGLLGIGGALIFNPVLLQLGVQPQVTAS 396

Query: 376 TATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLII 435
           T+   + F+SS   + +Y        YAL    +  +A+ +G  VV ++I   GR S+I+
Sbjct: 397 TSVLMILFTSSAIALSFYFQGLLNTSYALVLAPLCFVASLIGVTVVGRIIRASGRVSIIV 456

Query: 436 FILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
            +L+ +I    +     G    +        +GF+  C 
Sbjct: 457 LLLSALIIAGTVLTAFFGGIRAVNDIRDGAPIGFKPFCD 495


>gi|407038050|gb|EKE38923.1| hypothetical protein ENU1_147930 [Entamoeba nuttalli P19]
          Length = 459

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 129/445 (28%), Positives = 221/445 (49%), Gaps = 50/445 (11%)

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F W++++G+I   F A   +  G+GGG  ++ +  LI+  DP  A  +SK    G A   
Sbjct: 7   FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66

Query: 136 VYYNLKLRHPTLD-MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT 194
                  RHP +   P+I Y  AL+++P+ + G  IGV FN+I   W++ ++L++L   T
Sbjct: 67  FLILWMKRHPNVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVLLLGFT 126

Query: 195 STKAFLKGVETWKRETILKKEAARCL--------------------GSNGAGAG------ 228
           S K F K ++ WK E   K++AA+                        NG  A       
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDAAKATELVETSKPDIADNDNDDMKPSENGNNAVIVDERV 185

Query: 229 EVEYKSLPSGPRSGPQKDAME--------PEVTILENINWK---ELGLLVFVWVAFLGLQ 277
           + E +   SGP+  PQ ++ E         E T+L+    K    +G+L+ VW     + 
Sbjct: 186 QEEEEGQGSGPKLLPQDESQEAQKEAKKIEEKTLLKREIIKAILSVGILIVVWAVMFFIV 245

Query: 278 IAKNHTASCSIV--------YWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSF 329
           I K      SIV        YWVL  +  P+ L V++   I L+  +R    +G+   + 
Sbjct: 246 ILKGGEKMDSIVGIECGTPWYWVLTAIGGPLMLAVTIVVGIFLWWRQRGEEVQGEVQWTV 305

Query: 330 RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSV 389
           +   ++   AF   AG+    LG+GGG ++GP+ LE+GV PQV++AT+ F + F++S S 
Sbjct: 306 KNCLIIPIGAF--FAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTASSSS 363

Query: 390 VEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
           ++Y +  +  + Y L++ A+  I A  GQ    K++  L R S+I F L  +I +S +++
Sbjct: 364 LQYIIDGKLDIFYGLWYFAIGFIGAAFGQFGFSKIVQKLNRQSIIGFFLGVLIVLSTLAM 423

Query: 450 GGVGISNMIGKFHRHEYMGFENLCK 474
             + +  ++    +++ +GF++LCK
Sbjct: 424 IAITVVQLVSDV-KNDNLGFKHLCK 447


>gi|67466247|ref|XP_649271.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56465670|gb|EAL43888.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449701601|gb|EMD42390.1| Hypothetical protein EHI5A_207490 [Entamoeba histolytica KU27]
          Length = 460

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 128/446 (28%), Positives = 222/446 (49%), Gaps = 51/446 (11%)

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F W++++G+I   F A   +  G+GGG  ++ +  LI+  DP  A  +SK    G A   
Sbjct: 7   FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66

Query: 136 VYYNLKLRHPTLD-MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT 194
                  RHP +   P+I Y  AL+++P+ + G  IGV FN+I   W++ ++L++L   T
Sbjct: 67  FLILWMKRHPNVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVLLLGFT 126

Query: 195 STKAFLKGVETWKRETILKKEAARCL--------------------GSNGAGAGEVEYK- 233
           S K F K ++ WK E   K++AA+                        NG+ A  V+ K 
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDAAKATELVETSKPDIADNDNDDMKPSENGSNAVIVDEKV 185

Query: 234 ------SLPSGPRSGPQKDAME--------PEVTILENINWK---ELGLLVFVWVAFLGL 276
                    +GP+  PQ ++ E         E T+L+    K    +G+L+ VW     +
Sbjct: 186 QEEEEEGQGTGPKLLPQDESQEAQKEAKKIEEKTLLKREIIKAILSVGILIIVWAVMFFI 245

Query: 277 QIAKNHTASCSIV--------YWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS 328
            I K      SIV        YWVL  +  P+ L V++   I L+  +R    +G+   +
Sbjct: 246 VILKGGEKMDSIVGIECGTPWYWVLTAIGGPLMLAVTIVVGIFLWWRQRGEEVEGEVQWT 305

Query: 329 FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS 388
            +   ++   AF   AG+    LG+GGG ++GP+ LE+GV PQV++AT+ F + F++S S
Sbjct: 306 VKNCLIIPIGAF--FAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTASSS 363

Query: 389 VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAIS 448
            ++Y +  +  + Y +++ A+  I A  GQ    K++  L R S+I F L  +I +S ++
Sbjct: 364 SLQYIIDGKLDIFYGIWYFAIGFIGAAFGQFGFSKIVQKLNRQSIIGFFLGVLIVLSTLA 423

Query: 449 LGGVGISNMIGKFHRHEYMGFENLCK 474
           +  + +  ++    +++ +GF++LCK
Sbjct: 424 MIAITVVQLVSDV-KNDNLGFKHLCK 448


>gi|255555059|ref|XP_002518567.1| conserved hypothetical protein [Ricinus communis]
 gi|223542412|gb|EEF43954.1| conserved hypothetical protein [Ricinus communis]
          Length = 463

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 134/417 (32%), Positives = 230/417 (55%), Gaps = 24/417 (5%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P LK    +VL  ++ F  A+  S GG+GGGG+FVP+L+++ G D K+A++ S  M+ G 
Sbjct: 56  PMLKLASPMVLSGVLCFIAASISSAGGIGGGGLFVPILTIVAGLDLKTASSFSAFMVTGG 115

Query: 132 AVSTVYYNLKLRHPTL-DMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVL 190
           +++ V  N  L  P      +IDYD+ALL +P ++LG+S+GV  N+IF +W++TVL ++ 
Sbjct: 116 SIANVLCN--LFSPKFGGKALIDYDIALLSEPCMLLGVSVGVICNLIFPEWLITVLFVLF 173

Query: 191 FIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEP 250
            + ++ K     V  W  E+   +E  R    NG G  E       S   +   K   EP
Sbjct: 174 LVWSTFKTCKNAVAHWNLES---EEVKR----NGHGNLENGRVKDRSSIGNEEIKIIKEP 226

Query: 251 EVTI-LEN---INWKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQI 298
            + I +EN     W++LG+LV +W++F  L + + +           C + YWV+  LQI
Sbjct: 227 LMGIEMENRMSFTWEKLGVLVLIWLSFSFLYLLRGNRYGEGIAPLKPCGVGYWVVSSLQI 286

Query: 299 PVSLVVSLYEAISL--YKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGG 356
           P++++ + +  +    Y+ +       DD    R    +++    +LAGI+GG+ G+GGG
Sbjct: 287 PLAIIFTAWILLKKRHYQNQTANLQDIDDSMEGRAPNKLTFPIMALLAGILGGVFGIGGG 346

Query: 357 FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFV 416
            ++ PL L +G+PP+V++AT +F + FSS+MS  +Y L        AL F ++  +A+ V
Sbjct: 347 MLISPLLLHVGIPPEVTAATCSFMVFFSSTMSAFQYLLSGMEHTDTALMFASICFVASLV 406

Query: 417 GQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           G  VV+++I   GRAS+I+F ++ ++ +S + +   G  ++   +     MGF+  C
Sbjct: 407 GLLVVQRIIQDYGRASIIVFSVSIVMALSTVLITSFGTIDVWRNYESGTNMGFKLPC 463


>gi|225469193|ref|XP_002275818.1| PREDICTED: uncharacterized protein LOC100252710 [Vitis vinifera]
 gi|297741863|emb|CBI33227.3| unnamed protein product [Vitis vinifera]
          Length = 458

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 134/415 (32%), Positives = 227/415 (54%), Gaps = 27/415 (6%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           +LKF   IV   ++ F  A+  S GG+GGGG+FVP+L+++ G D K+A+  S  M+ G +
Sbjct: 57  QLKFSPPIVRAAVLCFIAASISSAGGIGGGGLFVPILAIVGGLDLKTASTFSAFMVAGGS 116

Query: 133 VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI 192
            + +   + +        +ID+D+ALL +P L+LG+SIGV  N++F +W++T+L +V   
Sbjct: 117 TANILCTMFINCIHGGKSVIDFDIALLSEPCLLLGVSIGVVCNIVFPEWLITILFVVFLS 176

Query: 193 GTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEV 252
            T++K   KGV +WK    L+ E  R    NG G  E   +   S   +   K   EP +
Sbjct: 177 WTTSKTCRKGVVSWK----LESEVIR---RNGFGELENGVRRDESNGENEVIKSLKEPLM 229

Query: 253 TILEN----INWKELGLLVFVWVAFLGLQIAKNHTASCSIV--------YWVLDLLQIPV 300
             +EN    I W + G LV +W++F  L I +      SI+        YW+L  LQ P+
Sbjct: 230 GEVENFKISIPWTKFGALVVIWLSFFLLYILRGDRDGQSIIPMEPCGEGYWILSSLQFPL 289

Query: 301 SLVVSLY--EAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFI 358
           ++  + +          + ++   G+   +  +F +++     +LAGI+GG+ G+GGG +
Sbjct: 290 AITFTAWILHRRETSNQQEILGQTGEKPPNL-IFPIMA-----LLAGILGGVFGIGGGML 343

Query: 359 MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQ 418
           + PL L +G+PP+V++AT +  + FSS+MS  +Y L+       AL F  +   A+ +G 
Sbjct: 344 ISPLLLHIGIPPEVTAATCSVMVFFSSTMSSFQYLLIGMEHKEVALIFAIICFFASILGV 403

Query: 419 HVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            VV++ I   GRASLI+F ++ ++ +S + +   G  ++   + R EYMGF+  C
Sbjct: 404 VVVQRAIEKYGRASLIVFSVSTVMALSTVLITSFGAIDVWRDYARGEYMGFKLPC 458


>gi|6554197|gb|AAF16643.1|AC011661_21 T23J18.20 [Arabidopsis thaliana]
          Length = 491

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 135/407 (33%), Positives = 217/407 (53%), Gaps = 26/407 (6%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           ++  ++ FF A+  S GG+GGGG+F+ ++++I G + K+A++ S  M+ G + + V  NL
Sbjct: 97  IIAAVLSFFAASISSAGGIGGGGLFLSIMTIIAGLEMKTASSFSAFMVTGVSFANVGCNL 156

Query: 141 KLRHP-TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAF 199
            LR+P + D  +ID+DLAL IQP L+LG+SIGV  N +F +W+V  L  V    ++ K  
Sbjct: 157 FLRNPKSRDKTLIDFDLALTIQPCLLLGVSIGVICNRMFPNWLVLFLFAVFLAWSTMKTC 216

Query: 200 LKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENIN 259
            KGV  W     L+ E A+          EV    L S  R   ++  M           
Sbjct: 217 KKGVSYWN----LESERAKIKSPRDVDGIEVARSPLLSEEREDVRQRGM-------IRFP 265

Query: 260 WKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSLVVSLY---- 307
           W +LG+LV +W+ F  + + + +           C  +YW L  LQIP+++  +L     
Sbjct: 266 WMKLGVLVIIWLLFFSINLFRGNKYGQGIISIKPCGALYWFLSSLQIPLTIFFTLCIYFS 325

Query: 308 -EAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL 366
               S +       S+ + G   R  +L+      +LAG++GGL G+GGG ++ PL L++
Sbjct: 326 DNVQSNHTSHSNQNSEQETGVGGRQNKLM-LPVMALLAGVLGGLFGIGGGMLISPLLLQI 384

Query: 367 GVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           G+ P+V++AT +F + FSSSMS ++Y LL       A  F  V  +A+ VG  VV+K+I 
Sbjct: 385 GIAPEVTAATCSFMVLFSSSMSAIQYLLLGMEHAGTAAIFALVCFVASLVGLMVVKKVIA 444

Query: 427 LLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
             GRAS+I+F +  ++ +S + +   G  N+   F    YMGF+  C
Sbjct: 445 KYGRASIIVFAVGIVMALSTVLMTTHGAFNVWNDFVSGRYMGFKLPC 491


>gi|302758356|ref|XP_002962601.1| hypothetical protein SELMODRAFT_65204 [Selaginella moellendorffii]
 gi|300169462|gb|EFJ36064.1| hypothetical protein SELMODRAFT_65204 [Selaginella moellendorffii]
          Length = 385

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 132/401 (32%), Positives = 225/401 (56%), Gaps = 34/401 (8%)

Query: 91  AAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLD-M 149
           AA  S GG+GGGG++VP+ +L++ F+ ++A  +S  MI G  ++ + ++   R P  D  
Sbjct: 1   AAVSSAGGIGGGGLYVPLFNLLLRFNTRTAAVLSNFMIFGGMIANMMWSAFQRDPFDDER 60

Query: 150 PIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE 209
           P+ID+D ALL+QP ++LGIS+GV  N++F  W++T+LL +     + ++F  G   WK E
Sbjct: 61  PLIDFDAALLMQPNMLLGISLGVLCNLMFPGWLITLLLTITLAFVTFRSFNCGFRLWKAE 120

Query: 210 TILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFV 269
           +             G+ + + E KS        P  D+ E        +   +L  L+ V
Sbjct: 121 S-------------GSNSSDGEGKSAKYHDAEAPLLDSAEIPHRRFPAL---KLAGLLLV 164

Query: 270 WVAFLGLQIAKN--------HTASCSIVYWVLDLLQIPVSLVVSLY---EAISLYKGRRV 318
           W+ F  +Q+ +         H   C + YW++   Q+P++L+ +++   EA      +  
Sbjct: 165 WLFFFAVQLLRGSKTSEGYFHLDECGLGYWLITGSQLPLTLLFTVWTIREATVSTSCQSD 224

Query: 319 IASKGDDGKSFRVFQLVSYCAFGVL------AGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
           +       KSF+     S  A   L      AGI+GGLLG+GGG ++ P+ LE+G+PPQV
Sbjct: 225 VKFLVSSAKSFKWNSSRSNRAHLTLPLMALLAGILGGLLGIGGGMLISPILLEMGMPPQV 284

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
           ++AT+ F + FSSS+SV +Y+L+ R PV +AL+F  +  + + VG  VV++ I   GRAS
Sbjct: 285 TAATSAFMVFFSSSLSVAQYWLMGRIPVEFALWFSGICFVFSLVGLLVVQRAIQRYGRAS 344

Query: 433 LIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           +I+F++  ++ +SA+ + G G  ++  ++ R +YMGF + C
Sbjct: 345 IIVFLVTSVMGLSALLMAGFGGMDVWKQYERGDYMGFRSPC 385


>gi|328868613|gb|EGG16991.1| hypothetical protein DFA_07972 [Dictyostelium fasciculatum]
          Length = 521

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 120/400 (30%), Positives = 194/400 (48%), Gaps = 34/400 (8%)

Query: 63  EGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATA 122
           EG   H       F +  ++G  + F G A  S GGVGGGGI++P+L L+  F PK+A  
Sbjct: 115 EGICEHKSLFHDHFTFLDIVGLCLLFIGCALSSGGGVGGGGIYIPILILVSKFSPKTAIP 174

Query: 123 ISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWM 182
           +S C++ G A++    N   RHP  +  +IDY + LLI+P+ + G + GV  +     ++
Sbjct: 175 LSNCLVAGCAMANFIQNFPRRHPFANKHLIDYSVVLLIEPLTLGGTTFGVYLHTFLPPFV 234

Query: 183 VTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSG 242
           + +LL+V    T+   F KG+  +K+E   K  +     S  +   E +        +S 
Sbjct: 235 ILILLVVTLTATAITTFRKGLSIYKKENETKSYSQIKNTSINSDGSETQ--------QSN 286

Query: 243 PQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTAS--------CSIVYWVLD 294
           P KDA            W ++  +V V        + K             CS  YW L 
Sbjct: 287 PFKDA-----------EWGKISAIVLVLALSTVFSVLKGGDGEYSMVGIKLCSPTYWTLS 335

Query: 295 LLQIPVSLVVSLYEAISLYKGRRV-----IASKGDDGKSFRVFQLVSYCAFGVLAGIVGG 349
               PV +V  +  A+ LY+  +      I  +GD   S +   L+ +    V+AGI+  
Sbjct: 336 FAIWPVIIVTWILTALYLYRRWKKNQLQGIKVEGDINYSPQTIILLGF--LSVIAGILAS 393

Query: 350 LLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAV 409
           LLG+GGG I GP+ L++G+ P +++AT++F + F+S+ S ++Y LL +  + Y L +   
Sbjct: 394 LLGIGGGMIKGPVLLQMGLSPDITAATSSFMILFTSASSAIQYVLLGKLRLDYGLVYYFT 453

Query: 410 ATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
           A IA FVG   +   +    R S  IF++  +I +S I L
Sbjct: 454 AFIACFVGTQSLLYAVKKSNRKSYFIFLICLVIVISTILL 493


>gi|414872566|tpg|DAA51123.1| TPA: hypothetical protein ZEAMMB73_068409 [Zea mays]
          Length = 199

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 86/139 (61%), Positives = 111/139 (79%), Gaps = 3/139 (2%)

Query: 64  GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAI 123
            S    VWP+L F W +V+ T++GF G+AFG+VGGVGGGGIFVPML+L++GFD KSA A+
Sbjct: 61  SSRSPRVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLVVGFDTKSAAAL 120

Query: 124 SKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMV 183
           SKCMIMGA+ S+V+YNL++ HPT + P+IDY LALL QPMLMLGI+IGV  +VIF  W++
Sbjct: 121 SKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYWLI 180

Query: 184 TVLLIVLFI---GTSTKAF 199
           TVL+I+LF      ST+AF
Sbjct: 181 TVLIIILFAPRRDPSTRAF 199


>gi|168065030|ref|XP_001784459.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663984|gb|EDQ50721.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 405

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 129/417 (30%), Positives = 230/417 (55%), Gaps = 40/417 (9%)

Query: 79  EIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYY 138
            +V   I+G  GA   + GGVGGGG+F+P+ +L++ FD K++ A+S  MI+  +++ + +
Sbjct: 4   RVVAAVILGSLGACICAAGGVGGGGLFIPIFNLLLLFDAKTSAALSNIMILAGSIAVLAW 63

Query: 139 NLKLRHP-TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTK 197
           N++  HP +   P+IDYD+ALL+ P ++LGISIGV  N+ F  W++  +L V+    + +
Sbjct: 64  NIRRTHPLSPGKPLIDYDVALLLNPNMLLGISIGVFCNITFPGWLLISVLTVILFYMTNR 123

Query: 198 AFLKGVETW---KRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTI 254
           +   G   W          KE    + ++    G+ +Y  L  G +S P   A  P   +
Sbjct: 124 SIQNGFTRWKKESAAAAKAKEKIAIVSAHSVETGQSKYPLL--GGQSEPSLFAQCPPQKL 181

Query: 255 LENINWKELGLLVFVWVAFLGLQIAKNHTAS--------CSIVYWVLDLLQIPVSLVVSL 306
           ++         LV +W+ F  +QI +    +        C + YW+L   Q+P  L + L
Sbjct: 182 IK---------LVMMWLLFFAVQILRGGEGTEGILKVKPCGLAYWLLSASQLP--LAIGL 230

Query: 307 YEAISLYKGRRVIASKGDD----------GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGG 356
              I+L    +  A+K  +           +++ VF L++     +LAG++GG+LG+GGG
Sbjct: 231 TAWIALQHSSKSHAAKPSESNEEVDVMLTSRAYTVFPLMA-----LLAGMLGGMLGIGGG 285

Query: 357 FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFV 416
            I+ P+  E+G+ PQ ++ T++F + F++SMSV++++LL R P+ +AL F AV    + V
Sbjct: 286 MIINPMLTEVGMHPQATAGTSSFMIFFATSMSVLQFWLLGRIPMDFALLFGAVCLFWSCV 345

Query: 417 GQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           G  +++  I+  GR S+I+F+++ ++ VSA+ +   G  N+  ++   +YMGF   C
Sbjct: 346 GIGLLQAAIVKHGRPSVIVFLVSSVMGVSALLMATFGGFNVWHQYRAGDYMGFHAAC 402


>gi|302797450|ref|XP_002980486.1| hypothetical protein SELMODRAFT_55219 [Selaginella moellendorffii]
 gi|300152102|gb|EFJ18746.1| hypothetical protein SELMODRAFT_55219 [Selaginella moellendorffii]
          Length = 385

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 131/401 (32%), Positives = 224/401 (55%), Gaps = 34/401 (8%)

Query: 91  AAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLD-M 149
           AA  S GG+GGGG++VP+ +L++ F+ ++A  +S  MI G  ++ + ++   R P  D  
Sbjct: 1   AAVSSAGGIGGGGLYVPLFNLLLRFNTRTAAVLSNFMIFGGMIANMMWSAFQRDPFDDER 60

Query: 150 PIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE 209
           P+ID+D ALL+QP ++LGIS+GV  N++F  W++T+LL +     + ++F  G   WK E
Sbjct: 61  PLIDFDAALLMQPNMLLGISLGVLCNLMFPGWLITLLLTITLAFVTFRSFNCGFRLWKAE 120

Query: 210 TILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFV 269
           +             G+ + + E KS        P  D+ E        +   +L  L+ V
Sbjct: 121 S-------------GSNSSDGEGKSAKYHDAEAPLLDSAEIPHRRFPAL---KLAGLLLV 164

Query: 270 WVAFLGLQIAKN--------HTASCSIVYWVLDLLQIPVSLVVSLY---EAISLYKGRRV 318
           W+ F  +Q+ +         H   C + YW++   Q+P++L+ +++   E       +  
Sbjct: 165 WLFFFAVQLLRGSKTSEGYFHLDECGLGYWLITGSQLPLTLLFTVWTIRETTVSTSCQSD 224

Query: 319 IASKGDDGKSFRVFQLVSYCAFGVL------AGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
           +       KSF+     S  A   L      AGI+GGLLG+GGG ++ P+ LE+G+PPQV
Sbjct: 225 VKFLVSSAKSFKWNSSRSNRAHLTLPLMALLAGILGGLLGIGGGMLISPILLEMGMPPQV 284

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
           ++AT+ F + FSSS+SV +Y+L+ R PV +AL+F  +  + + VG  VV++ I   GRAS
Sbjct: 285 TAATSAFMVFFSSSLSVAQYWLMGRIPVEFALWFSGICFVFSLVGLLVVQRAIQRYGRAS 344

Query: 433 LIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           +I+F++  ++ +SA+ + G G  ++  ++ R +YMGF + C
Sbjct: 345 IIVFLVTSVMGLSALLMAGFGGMDVWKQYERGDYMGFRSPC 385


>gi|290977704|ref|XP_002671577.1| predicted protein [Naegleria gruberi]
 gi|284085147|gb|EFC38833.1| predicted protein [Naegleria gruberi]
          Length = 537

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 126/432 (29%), Positives = 213/432 (49%), Gaps = 44/432 (10%)

Query: 85  IIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRH 144
           +  F G+   S GG GGGG+FVP+L ++  F+ + A  +SK MI GAAV+ V+      H
Sbjct: 107 LTNFIGSVLSSAGGTGGGGVFVPLLHVLGRFNAQEAVPLSKVMIFGAAVTNVFTLFFRSH 166

Query: 145 PTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVE 204
           P  D P+IDYD+AL+++P  +LG  IGV  N+I  +W++ + +I++   T+   F K  +
Sbjct: 167 PYADRPLIDYDIALMMEPATLLGTIIGVFLNIICPEWVIVLSVIIVLTITTILTFRKFFQ 226

Query: 205 TWKRE--------------TILKKEAARCLGSN-------GAGAGEVEYKSLPSG----- 238
             + E                L KE    +  N       G G+   E +SL +      
Sbjct: 227 RARVEFAFLWKKKKKDEETEPLNKEQEPIVSENKEEEQAQGYGSVNKEEESLVNTQPIFV 286

Query: 239 PRSGPQKDAMEPEV-TILENINWKELGLLVFVWVAFLGLQIAKNHTAS---------CSI 288
            +   +KD ++P   TI++   + ++ +L+  W+    L + +    +         CS 
Sbjct: 287 SQQVAEKDFVKPSAWTIVKKTPYWKIFVLIVCWIIIFTLSLLRGGEGAPSVIPGLEMCSP 346

Query: 289 VYWVLDLLQIP----VSLVVSLYEAISLYK--GRRVIASKGDDGKSFRVFQLVSYCAFGV 342
           VYW L  L  P    + ++V+ Y  I   +   R  +  +GD    +    +  Y    +
Sbjct: 347 VYWTLVGLSFPIIGAIMIIVAAYLLIDYRRKVKRGHVFVQGD--VKWNWINVTFYPGACL 404

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPY 402
           +AGI+  +LG+GGG I  PL L LG  P V +ATA F + F+SS+S  ++ ++ R P  Y
Sbjct: 405 IAGILAAMLGIGGGMIKSPLLLLLGSDPAVGAATAAFMIFFTSSISSAQFAIVGRIPFDY 464

Query: 403 ALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFH 462
            + +     ++ FVG   V   +   G+ SLII  +  ++  + + +GGVGI +++    
Sbjct: 465 GMLYGLTGFVSGFVGYFFVTFGVERWGKRSLIILCVGLVLLFATMLMGGVGIYDVVIDLQ 524

Query: 463 RHEYMGFENLCK 474
           +  YMGF + C+
Sbjct: 525 QGVYMGFHDPCR 536


>gi|242058941|ref|XP_002458616.1| hypothetical protein SORBIDRAFT_03g036740 [Sorghum bicolor]
 gi|241930591|gb|EES03736.1| hypothetical protein SORBIDRAFT_03g036740 [Sorghum bicolor]
          Length = 473

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 127/397 (31%), Positives = 199/397 (50%), Gaps = 42/397 (10%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMP---IIDYDLALLI 160
           +F+P+L+L+ G   K ATA S  M+ G A S V YNL  R  T       +IDYD+ALL 
Sbjct: 92  LFLPILNLVAGLGLKRATAYSSFMVTGGAASNVLYNLASRSSTGTGGGGRLIDYDIALLF 151

Query: 161 QPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCL 220
           QP L+LG+SIGV  NV+F +W++T+L  V     + K    GV+ W+ E+     A    
Sbjct: 152 QPCLLLGVSIGVVCNVVFPEWLITLLFSVFLASCTAKTCRAGVKIWRSES---GGAGTAR 208

Query: 221 GSNGAGAG-EVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ-- 277
           G +  G G E     LP G   G   DA            W ++ LLV VW+ F  L   
Sbjct: 209 GDHHHGIGKEPLLLRLPLGTSDG---DAEGGGRGNGAGFPWADVALLVMVWLCFFALHVL 265

Query: 278 IAKNH------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRV 331
           I   H         CS+ YW++ L Q+P ++  + Y  I   + +RV+ S+ +DGK   +
Sbjct: 266 IGDKHGKGVIRIRPCSVAYWLITLSQLPAAVAFTGY-IIHSKRKKRVVPSQ-EDGKQEDL 323

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMG---------------PLFLELGVPPQVSSAT 376
                    GV   +    L L   F+ G               P+ L++G+PPQ ++AT
Sbjct: 324 VDT------GVETTLPSLTLPLAA-FVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAAT 376

Query: 377 ATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIF 436
           ++F + F +SMS+V++ LL    +  A  +  +  +A+ VG  V+++ I   GR SLI+F
Sbjct: 377 SSFMVLFCASMSMVQFILLGVEGIGQASIYAGICFVASVVGVVVIQRAIRKSGRVSLIVF 436

Query: 437 ILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           ++  ++ +S + +   G  ++  ++   EYMGF+ LC
Sbjct: 437 LVTAIMALSTVIVTCFGALDVWMQYTSGEYMGFKLLC 473


>gi|414880188|tpg|DAA57319.1| TPA: hypothetical protein ZEAMMB73_804983 [Zea mays]
          Length = 458

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 117/385 (30%), Positives = 190/385 (49%), Gaps = 22/385 (5%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           +F+P+L+L+ G   K ATA S  M+ G A S V YNL          +IDYD+ALL QP 
Sbjct: 81  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSN 223
           L+LG+SIGV  NV+F +W++T+L  +     + K    GV+ W+ E+      AR     
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSESSGCGSGAR----- 195

Query: 224 GAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ--IAKN 281
            +  G   +   P  PR G      E          W ++ LLV +W+ F  L   I   
Sbjct: 196 -SSRGGHSHSKEPLLPR-GTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFALHVLIGDK 253

Query: 282 H------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS------- 328
           H         C + YW++   Q+P ++  + Y   +  K R V +   +DG         
Sbjct: 254 HGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSKADLADAG 313

Query: 329 FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS 388
                L +      + G + GL G+GGG ++ P+ L++G+PPQ ++AT++F + F +SMS
Sbjct: 314 VEALPLPTLPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFCASMS 373

Query: 389 VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAIS 448
           +V++ LL    V  A  +  V    +  G  V+ + +   GR SLI+F++  ++ +SA+ 
Sbjct: 374 MVQFILLGVEGVGQACVYAGVCFAGSVAGVVVIERAVSKSGRVSLIVFLVTAIMALSAVI 433

Query: 449 LGGVGISNMIGKFHRHEYMGFENLC 473
           +   G  ++  ++   EYMGF+  C
Sbjct: 434 VTCFGALDVWMQYTTGEYMGFKLPC 458


>gi|42561921|ref|NP_172621.2| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|332190628|gb|AEE28749.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 367

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/379 (32%), Positives = 197/379 (51%), Gaps = 26/379 (6%)

Query: 109 LSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQPMLMLG 167
           +++I G + K+A++ S  M+ G + + V  NL LR+P + D  +ID+DLAL IQP L+LG
Sbjct: 1   MTIIAGLEMKTASSFSAFMVTGVSFANVGCNLFLRNPKSRDKTLIDFDLALTIQPCLLLG 60

Query: 168 ISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGA 227
           +SIGV  N +F +W+V  L  V    ++ K   KGV  W  E+    E A+         
Sbjct: 61  VSIGVICNRMFPNWLVLFLFAVFLAWSTMKTCKKGVSYWNLES----ERAKIKSPRDVDG 116

Query: 228 GEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHT---- 283
            EV    L S  R   ++  M           W +LG+LV +W+ F  + + + +     
Sbjct: 117 IEVARSPLLSEEREDVRQRGM-------IRFPWMKLGVLVIIWLLFFSINLFRGNKYGQG 169

Query: 284 ----ASCSIVYWVLDLLQIPVSLVVSLY-----EAISLYKGRRVIASKGDDGKSFRVFQL 334
                 C  +YW L  LQIP+++  +L         S +       S+ + G   R  +L
Sbjct: 170 IISIKPCGALYWFLSSLQIPLTIFFTLCIYFSDNVQSNHTSHSNQNSEQETGVGGRQNKL 229

Query: 335 VSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL 394
           +      +LAG++GGL G+GGG ++ PL L++G+ P+V++AT +F + FSSSMS ++Y L
Sbjct: 230 M-LPVMALLAGVLGGLFGIGGGMLISPLLLQIGIAPEVTAATCSFMVLFSSSMSAIQYLL 288

Query: 395 LKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGI 454
           L       A  F  V  +A+ VG  VV+K+I   GRAS+I+F +  ++ +S + +   G 
Sbjct: 289 LGMEHAGTAAIFALVCFVASLVGLMVVKKVIAKYGRASIIVFAVGIVMALSTVLMTTHGA 348

Query: 455 SNMIGKFHRHEYMGFENLC 473
            N+   F    YMGF+  C
Sbjct: 349 FNVWNDFVSGRYMGFKLPC 367


>gi|224116342|ref|XP_002317274.1| predicted protein [Populus trichocarpa]
 gi|222860339|gb|EEE97886.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 131/454 (28%), Positives = 239/454 (52%), Gaps = 32/454 (7%)

Query: 45  ERQDSNYLLKAVNFLWQPEGSGYHHVWPE---LKFGWEIVLGTIIGFFGAAFGSVGGVGG 101
           E Q  +  LK   FL +     +H +  +   LK    +V+  ++ F  A+  S GG+GG
Sbjct: 27  ETQPLSNNLKIDLFLDKIGKWSHHQIQSQETGLKLAPSMVIAGVLCFIAASVSSAGGIGG 86

Query: 102 GGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL-DMPIIDYDLALLI 160
           GG+++P+L+++   D K+A++ S  M+ G +V+ V  N+  R        ++DYD+A+L 
Sbjct: 87  GGLYIPILTIVASLDLKTASSFSAFMVTGGSVANVMCNMFTRSAKFGGQTLVDYDIAILS 146

Query: 161 QPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCL 220
           +P ++LG+S+GV  N++F +W+VT+L  V    ++ K    GV  WK E+   +E  R  
Sbjct: 147 EPCMLLGVSVGVICNLVFPEWLVTILFAVFLACSTFKTCQNGVFHWKLES---EEVNRNE 203

Query: 221 GSNGAGAGEVEYKSLPSGPR---SGPQKDAMEPEVT-ILENINWKELGLLVFVWVAFLGL 276
             N    G VEY++         S  ++  +  E+T  +    W +LG+L  +W +F  L
Sbjct: 204 SGNLEN-GLVEYETSTKESEEVISSVKEPLLGVELTSSVLRFPWMKLGILFIIWFSFSIL 262

Query: 277 QIAKNH--------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS 328
            + + +          SC   YWV+  LQIP+++   ++ A  LY+         +   S
Sbjct: 263 YLLRGNRYGEGIIPMESCGFGYWVVSSLQIPLAI---MFTAWILYRKESCQHQTINQQLS 319

Query: 329 FRVFQLVS---------YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATF 379
            +  + ++         +    +LAG++GG+ G+GGG ++ PL L +G+ P++++AT +F
Sbjct: 320 VKGMEDLTGGGTSNKLIFPVMALLAGMLGGVFGIGGGMLISPLLLHVGIAPEITAATCSF 379

Query: 380 AMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILA 439
            + FSSSMS ++Y LL    V  A+    +  +A+ +G  VV++ I+  GRAS+I+F ++
Sbjct: 380 MVFFSSSMSALQYLLLGMEHVDTAIILSVICFVASLLGLLVVQRAIVKYGRASMIVFSVS 439

Query: 440 FMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            ++ +S + +   G  N+   ++    MGF+  C
Sbjct: 440 TVMALSTVLMTSFGALNVWRDYNSGRNMGFKLPC 473


>gi|125527980|gb|EAY76094.1| hypothetical protein OsI_04018 [Oryza sativa Indica Group]
          Length = 397

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 127/405 (31%), Positives = 212/405 (52%), Gaps = 25/405 (6%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           VL  I+ F  AAF S GGVGGG ++VP+L+++ G   K+ATA S  M+ G  +S V Y L
Sbjct: 6   VLACILSFLAAAFSSAGGVGGGSLYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTL 65

Query: 141 ---KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTK 197
              +        P+IDYD+A++ QP L+LG+S+GV  NV+F +W++T L  V     + K
Sbjct: 66  IVLRGHEKGGHQPLIDYDIAVVSQPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFK 125

Query: 198 AFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILEN 257
            +  G++ W+ ET   +       S G GAGE        G + G        ++ +L  
Sbjct: 126 TYGTGMKRWRAETAAARRMLEGGSSLGDGAGEALL-----GQKDGDGHRRQCVDLMVLVT 180

Query: 258 INWKELGLLVFVWVAFLGLQIAKN--HTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKG 315
           I W    L  FV   F+G + AK       C + YW++ + QIPV++    + A  +++ 
Sbjct: 181 I-W----LCFFVIHLFIGGEGAKGVFDIEPCGVTYWLITIAQIPVAVA---FTACIVHQK 232

Query: 316 RRVIASKGDD-------GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGV 368
           R+  A    +               +  +    +L G++ GL G+GGG ++ P+ L++GV
Sbjct: 233 RKSHAQNSQEFDQAISVKSKLESLPVYVFPVAALLTGVMSGLFGIGGGLLLNPVLLQIGV 292

Query: 369 PPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
           PP+ +S+T  F + F +SMS+V++ +L    +  AL +     +A+ VG  V++  I   
Sbjct: 293 PPKTASSTTMFMVLFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGAIRKS 352

Query: 429 GRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           GR SLI+F++A ++ +S + +   G   +  ++   +YMGF+  C
Sbjct: 353 GRVSLIVFMVAAILALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 397


>gi|53792418|dbj|BAD53256.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|53792472|dbj|BAD53380.1| membrane protein-like [Oryza sativa Japonica Group]
 gi|125572272|gb|EAZ13787.1| hypothetical protein OsJ_03712 [Oryza sativa Japonica Group]
          Length = 397

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 126/405 (31%), Positives = 212/405 (52%), Gaps = 25/405 (6%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           VL  I+ F  AAF S GGVGGG ++VP+L+++ G   K+ATA S  M+ G  +S V Y L
Sbjct: 6   VLACILSFLAAAFSSAGGVGGGSLYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTL 65

Query: 141 ---KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTK 197
              +        P+IDYD+A++ QP L+LG+S+GV  NV+F +W++T L  V     + K
Sbjct: 66  IVLRGHEKGGHQPLIDYDIAVVSQPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFK 125

Query: 198 AFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILEN 257
            +  G++ W+ ET   +       S G GAGE        G + G        ++ +L  
Sbjct: 126 TYGTGMKRWRAETAAARRMLEGGSSLGDGAGEALL-----GQKDGDGHRRQCVDLMVLVT 180

Query: 258 INWKELGLLVFVWVAFLGLQIAKN--HTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKG 315
           I W    L  FV   F+G + AK       C + YW++ + QIP+++    + A  +++ 
Sbjct: 181 I-W----LCFFVIHLFIGGEGAKGVFDIEPCGVTYWLITIAQIPIAVA---FTACIVHQK 232

Query: 316 RRVIASKGDD-------GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGV 368
           R+  A    +               +  +    +L G++ GL G+GGG ++ P+ L++GV
Sbjct: 233 RKSHAQNSQEFDQAISVKSKLESLPVYVFPVAALLTGVMSGLFGIGGGLLLNPVLLQIGV 292

Query: 369 PPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
           PP+ +S+T  F + F +SMS+V++ +L    +  AL +     +A+ VG  V++  I   
Sbjct: 293 PPKTASSTTMFMVLFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGTIRKS 352

Query: 429 GRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           GR SLI+F++A ++ +S + +   G   +  ++   +YMGF+  C
Sbjct: 353 GRVSLIVFMVAAILALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 397


>gi|226497914|ref|NP_001144009.1| uncharacterized protein LOC100276828 precursor [Zea mays]
 gi|195635291|gb|ACG37114.1| hypothetical protein [Zea mays]
          Length = 458

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 119/385 (30%), Positives = 191/385 (49%), Gaps = 22/385 (5%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           +F+P+L+L+ G   K ATA S  M+ G A S V YNL          +IDYD+ALL QP 
Sbjct: 81  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSN 223
           L+LG+SIGV  NV+F +W++T+L  +     + K    GV+ W+ E+      AR   S 
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSESSGCGSGAR--SSR 198

Query: 224 GAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ--IAKN 281
           G  +   E    P  PR G      E          W ++ LLV +W+ F  L   I   
Sbjct: 199 GGHSHSKE----PLLPR-GTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFALHVLIGDK 253

Query: 282 H------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS------- 328
           H         C + YW++   Q+P ++  + Y   +  K R V +   +DG         
Sbjct: 254 HGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSKADLADAG 313

Query: 329 FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS 388
                L +      + G + GL G+GGG ++ P+ L++G+PPQ ++AT++F + F +SMS
Sbjct: 314 VEALPLPTLPLAAFVTGALXGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFCASMS 373

Query: 389 VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAIS 448
           +V++ LL    V  A  +  V    +  G  V+ + +   GR SLI+F++  ++ +SA+ 
Sbjct: 374 MVQFILLGVEGVGQACVYAGVCFAGSVAGVVVIERAVSKSGRVSLIVFLVTAIMALSAVI 433

Query: 449 LGGVGISNMIGKFHRHEYMGFENLC 473
           +   G  ++  ++   EYMGF+  C
Sbjct: 434 VTCFGALDVWMQYTTGEYMGFKLPC 458


>gi|115440389|ref|NP_001044474.1| Os01g0786800 [Oryza sativa Japonica Group]
 gi|113534005|dbj|BAF06388.1| Os01g0786800, partial [Oryza sativa Japonica Group]
          Length = 434

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 120/389 (30%), Positives = 196/389 (50%), Gaps = 34/389 (8%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQP 162
           +F+P+L+L+ G   K ATA S  M+ G A S V YNL           +IDYD+ALL QP
Sbjct: 61  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAAVIDYDIALLFQP 120

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGS 222
            L+LG+SIGV  NV+F +W++T L  +     +TK    G+  W  E+      A    +
Sbjct: 121 CLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSES----RGATLAVA 176

Query: 223 NGAGAGEVEYKSLPSGPRSG----PQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ- 277
                G  E   LP G  +G     + DA  P         WK++ +LV VW+ F  L  
Sbjct: 177 AATAHGREEPLLLPHGTDAGNGGGARGDAGFP---------WKDVSVLVMVWLCFFVLHV 227

Query: 278 -IAKNH------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS-- 328
            I   H         C + YW++ L Q+P ++  + Y  I   K ++ +    +DGK+  
Sbjct: 228 FIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY--IIYAKRKKQVLHNQEDGKANP 285

Query: 329 ----FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
                     + +     + G + GL G+GGG ++ P+ L++G+PPQ ++AT++F + F 
Sbjct: 286 ESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFC 345

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           +SMS+V++ LL    +  A  +  +  +A+ VG  V+ + I   GR SLI+F++  ++ V
Sbjct: 346 ASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFLVTGIMAV 405

Query: 445 SAISLGGVGISNMIGKFHRHEYMGFENLC 473
           S + +   G  ++  ++    YMGF+  C
Sbjct: 406 STVIITFFGALDVWAQYTSGAYMGFKLPC 434


>gi|125527981|gb|EAY76095.1| hypothetical protein OsI_04019 [Oryza sativa Indica Group]
          Length = 461

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 120/389 (30%), Positives = 196/389 (50%), Gaps = 34/389 (8%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQP 162
           +F+P+L+L+ G   K ATA S  M+ G A S V YNL           +IDYD+ALL QP
Sbjct: 88  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAAVIDYDIALLFQP 147

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGS 222
            L+LG+SIGV  NV+F +W++T L  +     +TK    G+  W  E+      A    +
Sbjct: 148 CLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSES----RGATLAVA 203

Query: 223 NGAGAGEVEYKSLPSGPRSG----PQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ- 277
                G  E   LP G  +G     + DA  P         WK++ +LV VW+ F  L  
Sbjct: 204 AATAHGREEPLLLPHGTDAGNGGGARGDAGFP---------WKDVSVLVMVWLCFFVLHV 254

Query: 278 -IAKNH------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS-- 328
            I   H         C + YW++ L Q+P ++  + Y  I   K ++ +    +DGK+  
Sbjct: 255 FIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY--IIYAKRKKQVLHNQEDGKANP 312

Query: 329 ----FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
                     + +     + G + GL G+GGG ++ P+ L++G+PPQ ++AT++F + F 
Sbjct: 313 ESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFC 372

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           +SMS+V++ LL    +  A  +  +  +A+ VG  V+ + I   GR SLI+F++  ++ V
Sbjct: 373 ASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFLVTGIMAV 432

Query: 445 SAISLGGVGISNMIGKFHRHEYMGFENLC 473
           S + +   G  ++  ++    YMGF+  C
Sbjct: 433 STVIITFFGALDVWAQYTSGAYMGFKLPC 461


>gi|53792419|dbj|BAD53257.1| membrane protein-like [Oryza sativa Japonica Group]
          Length = 461

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 120/389 (30%), Positives = 196/389 (50%), Gaps = 34/389 (8%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQP 162
           +F+P+L+L+ G   K ATA S  M+ G A S V YNL           +IDYD+ALL QP
Sbjct: 88  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAAVIDYDIALLFQP 147

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGS 222
            L+LG+SIGV  NV+F +W++T L  +     +TK    G+  W  E+      A    +
Sbjct: 148 CLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSES----RGATLAVA 203

Query: 223 NGAGAGEVEYKSLPSGPRSG----PQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ- 277
                G  E   LP G  +G     + DA  P         WK++ +LV VW+ F  L  
Sbjct: 204 AATAHGREEPLLLPHGTDAGNGGGARGDAGFP---------WKDVSVLVMVWLCFFVLHV 254

Query: 278 -IAKNH------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS-- 328
            I   H         C + YW++ L Q+P ++  + Y  I   K ++ +    +DGK+  
Sbjct: 255 FIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY--IIYAKRKKQVLHNQEDGKANP 312

Query: 329 ----FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
                     + +     + G + GL G+GGG ++ P+ L++G+PPQ ++AT++F + F 
Sbjct: 313 ESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFC 372

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           +SMS+V++ LL    +  A  +  +  +A+ VG  V+ + I   GR SLI+F++  ++ V
Sbjct: 373 ASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFLVTGIMAV 432

Query: 445 SAISLGGVGISNMIGKFHRHEYMGFENLC 473
           S + +   G  ++  ++    YMGF+  C
Sbjct: 433 STVIITFFGALDVWAQYTSGAYMGFKLPC 461


>gi|330818743|ref|XP_003291498.1| hypothetical protein DICPUDRAFT_89425 [Dictyostelium purpureum]
 gi|325078304|gb|EGC31962.1| hypothetical protein DICPUDRAFT_89425 [Dictyostelium purpureum]
          Length = 463

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 118/413 (28%), Positives = 214/413 (51%), Gaps = 52/413 (12%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           +LG ++ F G A  S GGVGGGGI++P+L L+  +DPKS+  +S C++ G +++    N 
Sbjct: 77  ILGMVLLFIGCALSSGGGVGGGGIYIPILILVSKWDPKSSIPLSNCLVAGCSLANFIQNF 136

Query: 141 KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFL 200
             RHP  +  +ID+ +ALLI+P+ + G   GV  +  F   ++ +LL++    TS K   
Sbjct: 137 PRRHPFSNKHLIDFSVALLIEPLTLAGTIFGVYLHTYFPPLVILLLLVITLGFTSFKTIT 196

Query: 201 KGVETWKRE-----TILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTIL 255
           KGVE +++E     ++L  +  +   SNG+G+             S P  D     V   
Sbjct: 197 KGVEIYRKEIKAKVSLLNNDHHKINDSNGSGS-------------SNPNGDGANSNV--- 240

Query: 256 ENINWKELGLLVFVWV-----------AFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVV 304
                 +  LL+F  +           + +G+++       CS +YWVL  + +PV +++
Sbjct: 241 ------KYNLLIFSTMFSIFKGGDEEYSLIGVKL-------CSPLYWVLSFVMVPVIIIL 287

Query: 305 SLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVL---AGIVGGLLGLGGGFIMGP 361
             + A  LY+   +   +G + +    +   +    G+L   AGI+  LLG+GGG I GP
Sbjct: 288 WGFTARYLYREYEIRRDEGREIEGEIKYTHKNIIVLGILSIVAGILASLLGIGGGMIKGP 347

Query: 362 LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVV 421
           + L++G+ P V++AT+++ + F+S+ S ++Y L+ +    Y + + A+  I+ FVG   +
Sbjct: 348 VLLQMGLSPDVTAATSSYMILFTSASSAIQYILVGKLRWDYGIVYYAIGFISCFVGTQTL 407

Query: 422 RKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
             ++    R S I+F++  +I VS I L    +      F ++  + F+++CK
Sbjct: 408 IWIVKKYQRRSYIVFLIGAVISVSTILL----VVTESIDFVKYRNLSFDSICK 456


>gi|356549443|ref|XP_003543103.1| PREDICTED: uncharacterized protein LOC100790958 [Glycine max]
          Length = 466

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 112/388 (28%), Positives = 212/388 (54%), Gaps = 21/388 (5%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL-DMPIIDYDLALLIQP 162
           +FVP+LS++ G D K+A+++S  M+ G +++ V  N+ +  P      +IDYD+AL  +P
Sbjct: 82  LFVPILSIVAGLDLKTASSLSAFMVTGGSIANVMCNMCITSPKFGGKSLIDYDIALSSEP 141

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET-ILKKEAARCLG 221
            ++LG+S+GV  N++F +W++TVL  +    +++K    G+  WK E+ +++K     L 
Sbjct: 142 CMLLGVSLGVICNLVFPEWLITVLFAIFLAWSTSKTCKSGLLFWKAESEVIRKNG---LI 198

Query: 222 SNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILE-------NINWKELGLLVFVWVAFL 274
           +     G +E +++          +    EV++L         I W +L +L+ +W +F 
Sbjct: 199 NEELEKGLLENETIEQRKVYIENNEPKSIEVSLLAPQGNSKVRIPWFKLAVLLLIWFSFF 258

Query: 275 GLQIAKNH--------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGR-RVIASKGDD 325
            + + + +           C + YW+L  +Q+P+++V + +        R R +  K   
Sbjct: 259 SVYLLRGNRYGEGIIPMEPCGVGYWILSSVQVPLAVVFTAWIVFRKESLRDRTLIPKVPG 318

Query: 326 GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
               R   ++ +    +LAGI+GG+ G+GGG ++ PL L++GV P+V++AT +F + FS+
Sbjct: 319 LTKKRPSNILVFPLMALLAGILGGVFGIGGGMLISPLLLQVGVTPEVTAATCSFMVLFSA 378

Query: 386 SMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVS 445
           +MS ++Y LL    V  AL    +  +A+ +G  VV++ I   GRAS+I+F ++ ++F+S
Sbjct: 379 TMSGLQYLLLGMEHVQAALVLAIMCFVASLLGLLVVQRAIRKYGRASIIVFSVSIVMFIS 438

Query: 446 AISLGGVGISNMIGKFHRHEYMGFENLC 473
            + +   G   +   +   EYMGF+  C
Sbjct: 439 NVLMTSFGAIKVWTDYESGEYMGFKLPC 466


>gi|297837261|ref|XP_002886512.1| hypothetical protein ARALYDRAFT_475155 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332353|gb|EFH62771.1| hypothetical protein ARALYDRAFT_475155 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 461

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 128/429 (29%), Positives = 230/429 (53%), Gaps = 48/429 (11%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P+++     ++  ++ F  ++  S GG+GGGG++VP+++++ G D K+A++ S  M+ G 
Sbjct: 54  PKIELTTSTIIAGLLSFLASSISSAGGIGGGGLYVPIMTIVAGLDLKTASSFSAFMVTGG 113

Query: 132 AVSTVYYNLKLRHPTL-DMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVL 190
           +++ V  NL +R+P      +ID+DLALL++P ++LG+SIGV  N++F +W++T L  V 
Sbjct: 114 SIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGVSIGVICNLVFPNWLITSLFAVF 173

Query: 191 FIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYK----SLPSGPRSGPQKD 246
              ++ K F  G+  W+    L+ E  +   SN  G  + E K     LP        +D
Sbjct: 174 LAWSTLKTFGNGLYYWR----LESEMVKIRESNRIGEDDEEDKIESLKLPL------LED 223

Query: 247 AMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQI 298
              P     +   W +LG+LV +W+++  + + + +           C   YW++   QI
Sbjct: 224 YERP-----KRFPWIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNAYWLISSSQI 278

Query: 299 PVSLVVSLY---------EAISLYKG--RRVIASKGDDGKSFRVFQLVSYCAFGV---LA 344
           P++L  +L+         +  S Y    + V   + +DG         + C F V   LA
Sbjct: 279 PLTLFFTLWICFSDNVQSQQPSDYNVSIKDVEDLRSNDGAR------SNKCMFPVMALLA 332

Query: 345 GIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYAL 404
           G++GG+ G+GGG ++ PL L++G+ P+V++AT +F + FSS+MS ++Y LL       A 
Sbjct: 333 GVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGMEHTGTAS 392

Query: 405 YFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRH 464
            F  +  +A+ VG  VV+K+I   GRAS+I+F +  ++ +S + +   G  ++   +   
Sbjct: 393 IFAVICFVASLVGLKVVQKVITEYGRASIIVFSVCIVMALSIVLMTSYGALDVWNDYVAG 452

Query: 465 EYMGFENLC 473
            YMGF+  C
Sbjct: 453 RYMGFKLPC 461


>gi|223994523|ref|XP_002286945.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220978260|gb|EED96586.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 385

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 121/398 (30%), Positives = 194/398 (48%), Gaps = 29/398 (7%)

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F W  ++GT++ F  A      GVGGGGI++P+  +++ F PK +T +S+  I GA +  
Sbjct: 3   FLWTDLVGTVVWFLTAGIAVSCGVGGGGIYMPLGMILLRFAPKQSTGLSQACIFGAGLGG 62

Query: 136 VYYNLKLRHPTLDM---------PIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           +  N + RHP   +         PIIDYDLAL   PM + G  +GV    +  DW+   +
Sbjct: 63  LIINSRKRHPDRHIRDTKGFYTRPIIDYDLALFQAPMELAGAVVGVTVQRLLPDWLFLSI 122

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
            +V+   T  K F K  E++K++ + KK  A     +     E E + +P  P  G   D
Sbjct: 123 AVVILGLTCFKTFQKFFESYKKDKMQKKHLAFLAQRH---LDEQEAQKIPGCPSPGYNSD 179

Query: 247 AMEPEVTIL--ENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVV 304
             E     L  E++   ++ +L  +W  FL +    +   +  +  W L+ +        
Sbjct: 180 ESEHTTVELCAESVP-DDVMILRCLWRLFLRIFRTSHQALTSMMTQWNLNFVD------S 232

Query: 305 SLYEAISLYKGRRVIASKGD---DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP 361
           SL   ++ ++G R++        D K  R F LVS   FG  AG++GGL+G+  G+  GP
Sbjct: 233 SLQRMLTSFRGLRLLHCLYFGLWDYKQVRDFSLVS---FG--AGMIGGLVGISAGYFTGP 287

Query: 362 LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVV 421
             ++ G+ P+VS+AT    M  +SS   V + L    P  YALYF  V    AFVG+  +
Sbjct: 288 FMIQRGLHPRVSTATTATTMLLTSSSVAVMFVLSGLLPWEYALYFFLVCVTGAFVGKTRI 347

Query: 422 RKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIG 459
              +   G AS++I  LA +I  S +    + ++N+ G
Sbjct: 348 DAYVKKTGMASVLIGALATIIGCSTLGCLVILLTNLAG 385


>gi|15220600|ref|NP_176367.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|13272465|gb|AAK17171.1|AF325103_1 unknown protein [Arabidopsis thaliana]
 gi|4508075|gb|AAD21419.1| Unknown protein [Arabidopsis thaliana]
 gi|111074464|gb|ABH04605.1| At1g61740 [Arabidopsis thaliana]
 gi|332195760|gb|AEE33881.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 458

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 123/425 (28%), Positives = 224/425 (52%), Gaps = 40/425 (9%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P ++     ++  ++ F  ++  S GG+GGGG++VP+++++ G D K+A++ S  M+ G 
Sbjct: 51  PRIELTTSTIIAGLLSFLASSISSAGGIGGGGLYVPIMTIVAGLDLKTASSFSAFMVTGG 110

Query: 132 AVSTVYYNLKLRHPTL-DMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVL 190
           +++ V  NL +R+P      +ID+DLALL++P ++LG+SIGV  N++F +W++T L  V 
Sbjct: 111 SIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGVSIGVICNLVFPNWLITSLFAVF 170

Query: 191 FIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEP 250
              ++ K F  G+  W+ E+ + K              ++E   LP        +D   P
Sbjct: 171 LAWSTLKTFGNGLYYWRLESEMVKIRESNRIEEDDEEDKIESLKLPL------LEDYQRP 224

Query: 251 EVTILENINWKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSL 302
                +   W +LG+LV +W+++  + + + +           C   YW++   QIP++L
Sbjct: 225 -----KRFPWIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNAYWLISSSQIPLTL 279

Query: 303 VVSLYEAIS-----------LYKGRRVIASKGDDGKSFRVFQLVSYCAFGV---LAGIVG 348
             +L+   S               + V   + +DG         + C F V   LAG++G
Sbjct: 280 FFTLWICFSDNVQSQQQSDYHVSVKDVEDLRSNDGAR------SNKCMFPVMALLAGVLG 333

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVA 408
           G+ G+GGG ++ PL L++G+ P+V++AT +F + FSS+MS ++Y LL       A  F  
Sbjct: 334 GVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGMEHTGTASIFAV 393

Query: 409 VATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMG 468
           +  +A+ VG  VV+K+I   GRAS+I+F +  ++ +S + +   G  ++   +    YMG
Sbjct: 394 ICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTSYGALDVWNDYVSGRYMG 453

Query: 469 FENLC 473
           F+  C
Sbjct: 454 FKLPC 458


>gi|4079632|emb|CAA10487.1| hypothetical protein [Arabidopsis thaliana]
          Length = 389

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/393 (29%), Positives = 207/393 (52%), Gaps = 40/393 (10%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQP 162
           ++VP+++++ G D K+A++ S  M+ G +++ V  NL +R+P +    +ID+DLALL++P
Sbjct: 14  LYVPIMTIVAGLDLKTASSFSAFMVTGGSIANVGCNLFVRNPKSGGKTLIDFDLALLLEP 73

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGS 222
            ++LG+SIGV  N++F +W++T L  V    ++ K F  G+  W+ E+ + K        
Sbjct: 74  CMLLGVSIGVICNLVFPNWLITSLFAVFLAWSTLKTFGNGLYYWRLESEMVKIRESNRIE 133

Query: 223 NGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNH 282
                 ++E   LP        +D   P     +   W +LG+LV +W+++  + + + +
Sbjct: 134 EDDEEDKIESLKLPL------LEDYQRP-----KRFPWIKLGVLVIIWLSYFAVYLLRGN 182

Query: 283 T--------ASCSIVYWVLDLLQIPVSLVVSLYEAIS-----------LYKGRRVIASKG 323
                      C   YW++   QIP++L  +L+   S               + V   + 
Sbjct: 183 KYGEGIISIEPCGNAYWLISSSQIPLTLFFTLWICFSDNVQSQQQSDYHVSVKDVEDLRS 242

Query: 324 DDGKSFRVFQLVSYCAFGV---LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFA 380
           +DG         + C F V   LAG++GG+ G+GGG ++ PL L++G+ P+V++AT +F 
Sbjct: 243 NDGAR------SNKCMFPVMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFM 296

Query: 381 MTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAF 440
           + FSS+MS ++Y LL       A  F  +  +A+ VG  VV+K+I   GRAS+I+F +  
Sbjct: 297 VLFSSTMSAIQYLLLGMEHTGTASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVGI 356

Query: 441 MIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           ++ +S + +   G  ++   +    YMGF+  C
Sbjct: 357 VMALSIVLMTSYGALDVWNDYVSGRYMGFKLPC 389


>gi|186512097|ref|NP_193857.3| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|332659032|gb|AEE84432.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 449

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 130/413 (31%), Positives = 228/413 (55%), Gaps = 26/413 (6%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           ELK    I++  ++ F  A   S GG+GGGG+F+P+++++ G D K+A++ S  M+ G +
Sbjct: 51  ELKLSSAIIMAGVLCFLAALISSAGGIGGGGLFIPIMTIVAGVDLKTASSFSAFMVTGGS 110

Query: 133 VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI 192
           ++ V  NL          ++DYDLALL++P ++LG+SIGV  N +  +W++TVL  V   
Sbjct: 111 IANVISNL-----FGGKALLDYDLALLLEPCMLLGVSIGVICNRVLPEWLITVLFAVFLA 165

Query: 193 GTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEV 252
            +S K    GV+ WK E+ + +E+       G G  E E K+L        +   +E + 
Sbjct: 166 WSSLKTCRSGVKFWKLESEIARESGHGRPERGQGQIEEETKNL--------KAPLLEAQA 217

Query: 253 TILEN-INWKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSLV 303
           T  ++ I W +LG+LV VW +F  + + + +           C + YW+L  LQIP++L+
Sbjct: 218 TKNKSKIPWTKLGVLVIVWASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQIPLALI 277

Query: 304 VSLYEAISLYKGRRVIA---SKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMG 360
            +   A+S  + R+  +    K  +G        + + A   LAG++GG+ G+GGG ++ 
Sbjct: 278 FTKL-ALSRTESRQEQSPNDQKNQEGTRLDKSTRLKFPAMSFLAGLLGGIFGIGGGMLIS 336

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           PL L+ G+PPQ+++AT +F + FS++MS V+Y LL       A  F  +  +A+ +G  +
Sbjct: 337 PLLLQSGIPPQITAATTSFMVFFSATMSAVQYLLLGMQNTDTAYVFSFICFLASLLGLVL 396

Query: 421 VRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           V+K +   GRAS+I+F +  ++ +S + +   G  ++   +   + MGF+  C
Sbjct: 397 VQKAVAQFGRASIIVFSVGTVMSLSTVLMTSFGALDVWTDYVAGKDMGFKLPC 449


>gi|357125384|ref|XP_003564374.1| PREDICTED: uncharacterized protein LOC100827655 [Brachypodium
           distachyon]
          Length = 469

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 114/384 (29%), Positives = 191/384 (49%), Gaps = 25/384 (6%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           +F+P+L+L+ G   K A   S  M+ G A S V YNL      +   +IDYD+ALL QP 
Sbjct: 97  LFLPILNLVAGLSLKRAATYSSFMVTGGAASNVLYNLLWNRGRV---LIDYDIALLFQPC 153

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSN 223
           L+LG+SIGV  NV+F +W++T L  +     + K    GV+ W+ E+     A     + 
Sbjct: 154 LLLGVSIGVVCNVMFPEWLITALFSLFLAFCTVKTCRAGVKIWRSESCAASAAVAVAAAR 213

Query: 224 GAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ--IAKN 281
                E     LP+G + G Q  A            WK++ +LV VW+ F  L   I   
Sbjct: 214 HNENKEPLLVLLPAG-QDGDQAAAGNGA-----GFPWKDVSVLVAVWLCFFLLHAFIGDK 267

Query: 282 H------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRV---F 332
           H         C I YW+  + Q+P S+  + Y  I   K ++ +    +DGK+  V    
Sbjct: 268 HGKGMIRITPCGIAYWLFTISQVPFSVAFTAY--IIYAKRKKQLLRNQEDGKANCVETKT 325

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIM---GPLFLELGVPPQVSSATATFAMTFSSSMSV 389
           + +S     + A + G L GL G        P+ L++G+PPQ ++AT++F + F +SMS+
Sbjct: 326 ETMSSLILPLAAFVTGSLSGLFGIGGGLLLNPVLLQMGIPPQTAAATSSFMVLFCASMSM 385

Query: 390 VEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
           V++ LL    +  A  +  +  +A+  G  ++ K +   GR S+I+F++  ++ +S + +
Sbjct: 386 VQFILLGMDGIGEASVYAGICFVASVAGAVLIEKAVRKSGRVSMIVFLVTAIMALSTVIV 445

Query: 450 GGVGISNMIGKFHRHEYMGFENLC 473
              G  ++  +++   YMGF+  C
Sbjct: 446 TCFGALDVWKQYNGGAYMGFKLPC 469


>gi|222640475|gb|EEE68607.1| hypothetical protein OsJ_27142 [Oryza sativa Japonica Group]
          Length = 192

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 83/143 (58%), Positives = 115/143 (80%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           +L F W +V+ T++GF G+ FG+VGGVGGGG F PM++L++GF+ KSA  +SK MIMGA+
Sbjct: 16  DLGFKWRVVVATVVGFPGSGFGTVGGVGGGGKFGPMVNLLVGFETKSAAGVSKGMIMGAS 75

Query: 133 VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI 192
            S+V+YNL++ HPT + P+IDY LALL QPMLMLGI+IGV  +VIF  W++TVL+I+LFI
Sbjct: 76  ASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYWLITVLIIILFI 135

Query: 193 GTSTKAFLKGVETWKRETILKKE 215
           GTS+++F KG+  WK ET ++ E
Sbjct: 136 GTSSRSFYKGILMWKDETRIQME 158


>gi|356524768|ref|XP_003531000.1| PREDICTED: uncharacterized protein LOC100806202 [Glycine max]
          Length = 473

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 117/403 (29%), Positives = 212/403 (52%), Gaps = 48/403 (11%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL-DMPIIDYDLALLIQP 162
           +F+P+L+++   D K+A+++S  M+ G +++ V  NL+  +P L    +IDYD+ALL +P
Sbjct: 86  LFIPILTIVASLDLKTASSLSAFMVTGGSIANVMCNLRATNPKLGGKSLIDYDIALLSEP 145

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGS 222
            ++LG+S+GV  N++F +W++T+L  V    +++K    GV  WK E+   +E  +  G 
Sbjct: 146 CMLLGVSVGVICNLVFPEWLITMLFAVFLTWSTSKTCNSGVVFWKIES---EERRKNDGF 202

Query: 223 NGAGAG----------------EVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLL 266
            G   G                E E + + S      ++  M PE  I   I W +L +L
Sbjct: 203 EGLEKGLLEDESSEEREEGVQVEKEKEKVKS-----IEEQVMVPEENIRVRIPWLKLVVL 257

Query: 267 VFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSLVVSL--------YEAI 310
           + VW +F  L + + +           C + YW++   Q+P++L  +         ++  
Sbjct: 258 LLVWFSFFSLYLLRGNKYGQSIIPMEPCGVGYWIISSAQVPLALFFTAWIVYRKESHQDQ 317

Query: 311 SLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPP 370
           +L +    ++S G   K   +F +++     +LAGI+GG+ G+GGG ++ PL L +G+ P
Sbjct: 318 NLMQEDSCLSSNGPSNK--LIFPMMA-----LLAGILGGVFGIGGGMLISPLLLHVGIAP 370

Query: 371 QVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGR 430
           +V++AT +F + FSS+MS ++Y LL    +  AL    +  +A+ +G  VV+K I   GR
Sbjct: 371 EVTAATCSFMVFFSSTMSALQYLLLGMDHIETALILALICFVASLIGLLVVQKAIQSYGR 430

Query: 431 ASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            SLI+F ++ ++ +S + +   G       +    YMGF+  C
Sbjct: 431 PSLIVFSVSIVMTLSIVLMTSFGAIRTWKDYTSGRYMGFKLPC 473


>gi|297804026|ref|XP_002869897.1| hypothetical protein ARALYDRAFT_492757 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315733|gb|EFH46156.1| hypothetical protein ARALYDRAFT_492757 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 447

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 131/415 (31%), Positives = 227/415 (54%), Gaps = 32/415 (7%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           ELK    I++  ++ F  A   S GG+GGGG+F+P+++++ G D K+A++ S  M+ G +
Sbjct: 51  ELKLSSAIIVAGVLCFLAALISSAGGIGGGGLFIPIMTIVAGLDLKTASSFSAFMVTGGS 110

Query: 133 VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI 192
           ++ V  NL          ++DYDLALL++P ++LG+SIGV  N +  +W++T L  V   
Sbjct: 111 IANVISNL-----FGGKALLDYDLALLLEPCMLLGVSIGVICNRVLPEWLITALFAVFLA 165

Query: 193 GTSTKAFLKGVETWKRETILKKEAARCLGSN--GAGAGEVEYKSLPSGPRSGPQKDAMEP 250
            +S K    GV+ WK E+    E AR  G      G GE+E  +L        +   +E 
Sbjct: 166 WSSLKTCRSGVKFWKIES----EIARGKGHERPEKGQGEIEEDNL--------KAPLLEA 213

Query: 251 EVTILEN-INWKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVS 301
           +V   ++ I W +LG+LV VW +F  + + + +           C + YW+L  LQIP++
Sbjct: 214 QVNRNKSKIPWTKLGVLVIVWASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQIPLA 273

Query: 302 LVVSLYEAISLYKGRRVIA---SKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFI 358
           L+ +   A+S  + R+  +    K  +G        + + A   LAG++GG+ G+GGG +
Sbjct: 274 LIFTKL-ALSRTESRQEQSPNNQKNQEGTRMDQSMRLKFPAMSFLAGLLGGIFGIGGGML 332

Query: 359 MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQ 418
           + PL L+ G+PPQ+++AT +F + FS++MS V+Y LL       A  F  +  +A+ +G 
Sbjct: 333 ISPLLLQSGIPPQITAATTSFMVFFSATMSAVQYLLLGMQNTDTAYVFSFICFLASLLGL 392

Query: 419 HVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            +V+K +   GRAS+I+F +  ++ +S + +   G  ++   +   + MGF+  C
Sbjct: 393 VLVQKAVAQFGRASIIVFSVGTVMSLSTVLMTSFGALDVWTDYMAGKDMGFKLPC 447


>gi|357446189|ref|XP_003593372.1| hypothetical protein MTR_2g010740 [Medicago truncatula]
 gi|355482420|gb|AES63623.1| hypothetical protein MTR_2g010740 [Medicago truncatula]
          Length = 466

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 123/443 (27%), Positives = 238/443 (53%), Gaps = 30/443 (6%)

Query: 50  NYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPML 109
           N++L  ++ L +    G+H     L+    +V+  I+ F  ++  S GG+GGGGIF+P+L
Sbjct: 35  NHVLDKISQL-KNVTQGFHESQ--LQISVPLVVAGILCFIASSISSAGGIGGGGIFIPIL 91

Query: 110 SLIIGFDPKSATAISKCMIMGAAVSTVY-YNLKLRHPTLDMPIIDYDLALLIQPMLMLGI 168
           +++ G D K A++IS  M+ G +++ V  Y            +IDYD+AL  +P ++LG+
Sbjct: 92  TIVAGLDLKVASSISAFMVTGGSIANVICYMFTTSTKFGGKSLIDYDIALSSEPCMLLGV 151

Query: 169 SIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLG----SNG 224
           S+GV  N++F +W++T++  V    +++K    GV  W  E+   +E  + +G      G
Sbjct: 152 SVGVICNLVFPEWLITLMFAVFLAWSTSKTCKSGVMFWNIES---EEIRKNIGVQEIEKG 208

Query: 225 AGAGEVEYKSLPSGPRSGPQKDAMEPEVTILEN-----INWKELGLLVFVWVAFLGLQIA 279
               E+       G ++  +   + P+    EN     I W +LG+L+ +W +F  + + 
Sbjct: 209 LLENEITMHKDNDGSKTVEENLVLVPQ----ENSSKLCIPWLKLGVLLLIWFSFFSIYLI 264

Query: 280 KNH-------TASCSIVYWVLDLLQIPVSLVVSLYEAI--SLYKGRRVIASKGDDGKSFR 330
           + +          C + YW++  +Q+P+++V + +  +     + + +I       ++  
Sbjct: 265 RGNGYGQIIPMEPCGVGYWIISSVQVPLAVVFTAWMVLRKESIQDQTLIPQVQCQNRNCP 324

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
             +LV +    +LAG++GG+ G+GGG ++ PL L++G+ P+V++AT +F + FSS+MS +
Sbjct: 325 SNKLV-FPLMALLAGMLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVFFSSTMSSL 383

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLG 450
           +Y LL    V  AL    +  +A+ +G  VV+K+I   GR S+I+F ++ ++ +S + + 
Sbjct: 384 QYLLLGMEHVETALILAIMCFVASLLGLLVVQKVIRKYGRPSIIVFSVSIVMSLSIVLMT 443

Query: 451 GVGISNMIGKFHRHEYMGFENLC 473
             G   +   +   +YMGF+  C
Sbjct: 444 SFGTLKVWEDYKSGKYMGFKLPC 466


>gi|326504538|dbj|BAJ91101.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 465

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 190/386 (49%), Gaps = 19/386 (4%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           +F+P+L+L+ G   K AT  S  M+ G A S V YNL          +IDYD+ALL QP 
Sbjct: 83  LFLPILNLVAGLTLKHATTYSSFMVTGGAASNVLYNLWRARGRGRAALIDYDIALLFQPC 142

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCL-GS 222
           L+LG+SIGV  NV+F +W++T L  +     + K    G + W+ E+      A    G 
Sbjct: 143 LLLGVSIGVVCNVMFPEWLITALFSLFLAFCTAKTCRAGAKIWRCESAGAGAPAAARHGH 202

Query: 223 NGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ--IAK 280
                 +V     PS    G Q  A +          WK++ +LV VW+ F  L   I  
Sbjct: 203 KVPLLLDVGGLPQPSQDDGGLQA-ARDGGSGGGAGFPWKDVAVLVIVWLCFFLLHVFIGD 261

Query: 281 NH------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQL 334
            H         C I YW+  + Q+P ++  + Y  I   K ++      +DGK+      
Sbjct: 262 KHGKGVIRIKPCGIAYWLATVSQVPFAVAFTAY--IIYAKRKKQATHHHEDGKAHSSVHT 319

Query: 335 VSYCAFGV-------LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSM 387
            S     +       + G + GL G+GGG ++ P+ L++G+PPQ ++AT++F + F +SM
Sbjct: 320 KSETLPALALPLAAFVTGSLSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFCASM 379

Query: 388 SVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
           S+V++ LL    V  A  +  +  +A+  G  ++ +++   GR S+I+F++  ++ +S +
Sbjct: 380 SMVQFILLGMDGVGEASVYAGICFVASIAGVVLIERVVRKSGRVSMIVFLVTAIMALSTV 439

Query: 448 SLGGVGISNMIGKFHRHEYMGFENLC 473
            +   G  ++  ++    YMGF+  C
Sbjct: 440 IVTCFGALDVWTQYTGGAYMGFKLPC 465


>gi|357483383|ref|XP_003611978.1| Membrane protein-like protein [Medicago truncatula]
 gi|355513313|gb|AES94936.1| Membrane protein-like protein [Medicago truncatula]
          Length = 109

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 79/103 (76%), Positives = 88/103 (85%)

Query: 371 QVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGR 430
           +V+SAT+TF+M FSSSMSVV+YY L RFPVPYA YFV VATIAAF GQHVVRK+I +LGR
Sbjct: 4   KVASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAGQHVVRKIIAILGR 63

Query: 431 ASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           AS+IIFILA  IF+SAISLGGVGI  MI K   HEYMGFENLC
Sbjct: 64  ASIIIFILASTIFISAISLGGVGIEKMIVKMENHEYMGFENLC 106


>gi|222619377|gb|EEE55509.1| hypothetical protein OsJ_03713 [Oryza sativa Japonica Group]
          Length = 351

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 181/366 (49%), Gaps = 34/366 (9%)

Query: 127 MIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTV 185
           M+ G A S V YNL           +IDYD+ALL QP L+LG+SIGV  NV+F +W++T 
Sbjct: 1   MVTGGAASNVLYNLLCTGCGRRAAAVIDYDIALLFQPCLLLGVSIGVVCNVMFPEWLITA 60

Query: 186 LLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSG--- 242
           L  +     +TK    G+  W  E+      A    +     G  E   LP G  +G   
Sbjct: 61  LFALFLAFCTTKTLRAGLRIWSSES----RGATLAVAAATAHGREEPLLLPHGTDAGNGG 116

Query: 243 -PQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ--IAKNH------TASCSIVYWVL 293
             + DA  P         WK++ +LV VW+ F  L   I   H         C + YW++
Sbjct: 117 GARGDAGFP---------WKDVSVLVMVWLCFFVLHVFIGDKHGKGMIRIKPCGVAYWLI 167

Query: 294 DLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS------FRVFQLVSYCAFGVLAGIV 347
            L Q+P ++  + Y  I   K ++ +    +DGK+            + +     + G +
Sbjct: 168 TLSQVPFAVAFTAY--IIYAKRKKQVLHNQEDGKANPESTKMDTLPTLLFPLAAFVTGAL 225

Query: 348 GGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFV 407
            GL G+GGG ++ P+ L++G+PPQ ++AT++F + F +SMS+V++ LL    +  A  + 
Sbjct: 226 SGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFCASMSMVQFILLGMQGIGEASVYA 285

Query: 408 AVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYM 467
            +  +A+ VG  V+ + I   GR SLI+F++  ++ VS + +   G  ++  ++    YM
Sbjct: 286 GICFVASVVGAVVIERAIRKSGRVSLIVFLVTGIMAVSTVIITFFGALDVWAQYTSGAYM 345

Query: 468 GFENLC 473
           GF+  C
Sbjct: 346 GFKLPC 351


>gi|358344379|ref|XP_003636267.1| Membrane protein-like protein [Medicago truncatula]
 gi|355502202|gb|AES83405.1| Membrane protein-like protein [Medicago truncatula]
          Length = 110

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 79/103 (76%), Positives = 87/103 (84%)

Query: 371 QVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGR 430
            V+SAT+TF+M FSSSMSVV+YY L RFPVPYA YFV VATIAAF GQHVVRK+I +LGR
Sbjct: 5   SVASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAGQHVVRKIIAILGR 64

Query: 431 ASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           AS+IIFILA  IF+SAISLGGVGI  MI K   HEYMGFENLC
Sbjct: 65  ASIIIFILASTIFISAISLGGVGIEKMIVKMENHEYMGFENLC 107


>gi|363807478|ref|NP_001242649.1| uncharacterized protein LOC100803518 precursor [Glycine max]
 gi|255636709|gb|ACU18690.1| unknown [Glycine max]
          Length = 473

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 128/469 (27%), Positives = 250/469 (53%), Gaps = 44/469 (9%)

Query: 39  LRFNGTERQDSNYLLKAVN--------FLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFG 90
           + FN  + + +  +L +++        + W     G+     +++    IV+  ++ F  
Sbjct: 15  ISFNPCKAEQAQTILNSLSIDQILQKIYEWGNGAQGFQE--GQIQISGPIVVAGVLCFIA 72

Query: 91  AAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL-DM 149
           ++  S GG+GGGG+F+P+L+++   D K+A+++S  M+ G +++ V  NL    P     
Sbjct: 73  SSISSAGGIGGGGLFLPILTIVACLDLKTASSLSAFMVTGGSIANVLCNLCATSPKFGGK 132

Query: 150 PIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE 209
            +IDYD+ALL +P ++LG+S+GV  N++F +W++T+L  V    +++K    GV  WK E
Sbjct: 133 SLIDYDIALLSEPCMLLGVSVGVICNLVFPEWLITMLFAVFLTWSTSKTCNSGVLFWKIE 192

Query: 210 TILKKEA------ARCLGSNGAGAGEVEYKSLPSGPRSGP---QKDAMEPEVTILENINW 260
           +  +++        + L  +G+ + E E +   +  ++G    ++  M PE  I   I W
Sbjct: 193 SEERRKNDGFERLEKGLLEDGS-SEEREERVQVNNEKAGMKSIEEQVMVPEENIRMRIPW 251

Query: 261 KELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSLVVSL------ 306
            +L +L+ VW++F  L + + +           C + YW+L   Q+P++L  +       
Sbjct: 252 LKLVVLLLVWLSFFSLYLLRGNKYGQSIIPMEPCGVGYWILSSAQVPLALFFTAWIVYRK 311

Query: 307 --YEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL 364
             ++  +L +    ++S G   K   +F +++     +LAGI+GG+ G+GGG ++ PL L
Sbjct: 312 ESHQDQNLMQEDPCLSSNGPSNK--LIFPMMA-----LLAGILGGVFGIGGGMLISPLLL 364

Query: 365 ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            +G+ P+V++AT +F + FSS+MS ++Y LL    +  AL    +  +A+ +G  VV++ 
Sbjct: 365 HVGIAPEVTAATCSFMVFFSSTMSALQYLLLGMDHIETALILALICFVASLIGLLVVQRA 424

Query: 425 IILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           +   GR SLI+F ++ ++ +S + +   G+      +    YMGF+  C
Sbjct: 425 VQSYGRPSLIVFSVSIVMTLSIVLMTSFGVIRTWKDYTSGRYMGFKLPC 473


>gi|281207304|gb|EFA81487.1| hypothetical protein PPL_05475 [Polysphondylium pallidum PN500]
          Length = 500

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 118/403 (29%), Positives = 204/403 (50%), Gaps = 31/403 (7%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           ++G  + F G A  S GGVGGGGI++P+L L+  F PK+A  +S C++ G + + +  N 
Sbjct: 113 IVGMALLFLGCALSSGGGVGGGGIYIPILILVNKFSPKTAIPLSNCLVAGCSFANLIQNF 172

Query: 141 KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFL 200
             RHP  +  +IDY + LLI+P+ + G   G+  + +   +++ +LL+V    TS   F 
Sbjct: 173 PRRHPHANKHLIDYSVVLLIEPLTLGGTVFGIYLHTVLPPYVILILLVVTLTATSATTFK 232

Query: 201 KGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINW 260
           KG++  K+E   KKE    L +N + A     K      +  P  DA   ++  + +I  
Sbjct: 233 KGLDLRKKENT-KKEYL--LINNNSDAYLTPEK------KVNPFLDADWVKIFAILSILI 283

Query: 261 KELGLLVF----VWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLY-KG 315
                 VF      V+ +G+++       CS  YWVL     P+ ++  ++ A  LY + 
Sbjct: 284 LSTMFSVFKGGDSEVSLIGIKL-------CSPPYWVLSFAIWPIIIITWIFTARYLYGQW 336

Query: 316 RRVIAS----KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
            R  A     +GD   S +   L+      V+AGI+  LLG+GGG I GP+ L +G+ P 
Sbjct: 337 LRNQADGTIIEGDIRYSRKTIILLG--ILSVVAGILASLLGIGGGMIKGPVLLAMGLSPD 394

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
           + +AT++F + F+S+ S  +Y LL +  + Y L +  +   A FVG   +  ++    + 
Sbjct: 395 IVAATSSFMILFTSASSAFQYILLGKLRLDYGLVYYIIGFAACFVGTQTLIWVVNKYKKR 454

Query: 432 SLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
           S IIF++  +I +S I L    +   +    +++   F+++C 
Sbjct: 455 SYIIFLITAIIVISTILL----VVTEVLDLEKYKNQPFQSICS 493


>gi|301122683|ref|XP_002909068.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099830|gb|EEY57882.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 483

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 190/381 (49%), Gaps = 36/381 (9%)

Query: 103 GIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQP 162
            I VP + LI+GFD K AT IS   I+G AV+  ++N++ RHP ++ P+ID +LAL + P
Sbjct: 84  VIMVPAMVLIMGFDIKRATPISNVAILGGAVANAWFNMRKRHPNVNRPLIDPELALGMIP 143

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLG- 221
           +++ G  +G   N +   +++++L +V+ +   ++   KG+   K+E   ++EA      
Sbjct: 144 VVIGGTVLGALINKLIPSYVLSLLFVVVLLVGGSRTMKKGIRLHKKEVAKRREAEAATSE 203

Query: 222 -------SNGA----------GAGEVEYKSLPSGPRSGPQKDAM------EPEVTILENI 258
                  S GA          G  E       +G  + P K A       +  V ILE  
Sbjct: 204 VTADIPVSPGAYVQVSTPQITGNDEKRLSLSVTGGDAAPVKSAAGDHAGSDSLVQILEKE 263

Query: 259 NWKELGLLVFVWVAFLGLQIAKNHTASCS---IVYWVLDLLQIP---VSLVVSLYEAISL 312
                G  V + V +LG+  A    AS     + YWV+ L++IP   V +V++ +    +
Sbjct: 264 RHFAWGPHVAIMVCYLGVVAASIGDASVDCGGVAYWVILLIEIPWVAVFVVLTSHYLHKV 323

Query: 313 YKGRRVIASKGDDGKSFRVFQLVSYCAFG-VLAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
           Y  +     +  DG      ++V Y   G  +AGIV G+ G+GGG I GP+ +ELG+ P+
Sbjct: 324 YLRKTAANYQYVDGDIKWTKKMVVYFPLGCAVAGIVAGMFGVGGGIITGPIMIELGIVPE 383

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
           V+S+T    + +SS+ +  ++ + K     +AL+  AVA +     Q V+   +   GR 
Sbjct: 384 VASSTTALMILYSSAAATAKFAVFKMVAWDWALFLCAVAFVVTSASQVVILGFVRRTGRQ 443

Query: 432 SLIIFILAFMIFVSAISLGGV 452
           S+I+  +A     +A+ +GGV
Sbjct: 444 SIIVLCIA-----TAVLIGGV 459


>gi|242043762|ref|XP_002459752.1| hypothetical protein SORBIDRAFT_02g009840 [Sorghum bicolor]
 gi|241923129|gb|EER96273.1| hypothetical protein SORBIDRAFT_02g009840 [Sorghum bicolor]
          Length = 432

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 119/390 (30%), Positives = 199/390 (51%), Gaps = 41/390 (10%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDM---------PIIDY 154
           ++VP+LS++ G   K+ATA+S  M+ G  +S V Y L LR               P+IDY
Sbjct: 64  LYVPILSIVAGLSLKTATALSTFMVTGGTLSNVLYTLFLRGGGSGSGQGQGQGQQPLIDY 123

Query: 155 DLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKK 214
           D+A++ QP L+LG+S+GV  NV+F +W++T L  +     + K +  GV  W+ ET    
Sbjct: 124 DIAVVSQPCLLLGVSVGVVCNVVFPEWLITALFSLFLAFATFKTYGAGVRRWRAET---A 180

Query: 215 EAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFL 274
           E  R   + GA     E   L      G                 W +L +LV VW+ F 
Sbjct: 181 ELGRIPDAAGAETAAAEEALLGRNVSGG-------------HRCQWVDLAVLVTVWLCFF 227

Query: 275 GLQI------AKN--HTASCSIVYWVLDLLQIPVSLVVSL---YEAISLYKGRRVIASKG 323
            + +      AK       C  VYW++ + Q+PV++  +     +  S   G+ + A + 
Sbjct: 228 VMHLFIGGEGAKGVFDIEPCGTVYWLITVAQVPVAVAFTACIGQKRKSQAHGQVISAKRK 287

Query: 324 DDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTF 383
            D     VF + +     +L G++ GL G+GGG ++ P+ L++GVPP  +SAT  F + F
Sbjct: 288 LDALPAYVFPVAA-----LLTGVMSGLFGIGGGLLLNPVLLQIGVPPTTASATTMFMVLF 342

Query: 384 SSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIF 443
            +SMS+V++ +L    +  A+ + A   +A+ VG   ++  +   GRASLI+F++A ++ 
Sbjct: 343 CASMSMVQFIILGVDGIASAVLYAATCFVASIVGLVGIQGAVRRSGRASLIVFMVAGVLA 402

Query: 444 VSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           VSA+ +   G + +  ++   +YMGF+  C
Sbjct: 403 VSALVIACSGAARVWEEYMSGQYMGFKMPC 432


>gi|2911082|emb|CAA17544.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268922|emb|CAB79125.1| hypothetical protein [Arabidopsis thaliana]
          Length = 431

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 124/413 (30%), Positives = 216/413 (52%), Gaps = 44/413 (10%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           ELK    I++  ++ F  A   S GG+GGGG+F+P+++++ G D K+A++ S  M+ G +
Sbjct: 51  ELKLSSAIIMAGVLCFLAALISSAGGIGGGGLFIPIMTIVAGVDLKTASSFSAFMVTGGS 110

Query: 133 VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI 192
           ++ V  NL          ++DYDLALL++P ++LG+SIGV  N +  +W++TVL  V   
Sbjct: 111 IANVISNL-----FGGKALLDYDLALLLEPCMLLGVSIGVICNRVLPEWLITVLFAVFLA 165

Query: 193 GTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEV 252
            +  K    GV+ WK E+ + +E+       G G  E E K+L        +   +E + 
Sbjct: 166 WSILKTCRSGVKFWKLESEIARESGHGRPERGQGQIEEETKNL--------KAPLLEAQA 217

Query: 253 TILEN-INWKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSLV 303
           T  ++ I W +LG+LV VW +F  + + + +           C + YW+L  LQIP++L+
Sbjct: 218 TKNKSKIPWTKLGVLVIVWASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQIPLALI 277

Query: 304 VSLYEAISLYKGRRVIA---SKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMG 360
            +   A+S  + R+  +    K  +G        + + A   LAG++GG+ G+GGG ++ 
Sbjct: 278 FTKL-ALSRTESRQEQSPNDQKNQEGTRLDKSTRLKFPAMSFLAGLLGGIFGIGGGMLIS 336

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           PL L+ G+PPQ+++AT +F + FS++MS V+Y LL                         
Sbjct: 337 PLLLQSGIPPQITAATTSFMVFFSATMSAVQYLLL------------------GMQNTDT 378

Query: 421 VRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
             K +   GRAS+I+F +  ++ +S + +   G  ++   +   + MGF+  C
Sbjct: 379 AYKAVAQFGRASIIVFSVGTVMSLSTVLMTSFGALDVWTDYVAGKDMGFKLPC 431


>gi|116783346|gb|ABK22903.1| unknown [Picea sitchensis]
          Length = 95

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 67/95 (70%), Positives = 76/95 (80%)

Query: 381 MTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAF 440
           M FSSSMSVVEYY LKRFPVPYA YF  V  IAAF GQHV+RKL++LLGRAS+IIF LAF
Sbjct: 1   MLFSSSMSVVEYYFLKRFPVPYAAYFFGVCIIAAFTGQHVIRKLVLLLGRASIIIFCLAF 60

Query: 441 MIFVSAISLGGVGISNMIGKFHRHEYMGFENLCKY 475
           MIF+SA  +GGVGIS M+ +     YMGF+NLC Y
Sbjct: 61  MIFISAWIMGGVGISKMVHEIKDGAYMGFQNLCNY 95


>gi|123437805|ref|XP_001309695.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121891432|gb|EAX96765.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 448

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 112/402 (27%), Positives = 194/402 (48%), Gaps = 31/402 (7%)

Query: 75  KFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVS 134
           K+   +++G I  F    F S  G+GGGG+FVP++ L++ F    A   SK +I G +++
Sbjct: 75  KWDTRLIIGIICVFIAGIFVSGAGIGGGGLFVPIMMLLVNFPTSYAIPTSKAIIFGGSLA 134

Query: 135 TVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT 194
              +NL  RHP  + P+I+Y++A +I+P+  LG  IGV FN I  +W++  +  VL   T
Sbjct: 135 VTLFNLNKRHPYYERPLINYNVAAMIEPISWLGTVIGVIFNSIIPEWLLYSVQFVLLTYT 194

Query: 195 STKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTI 254
           +   F KG++        ++ A   +  N         + L  G   GP        + +
Sbjct: 195 AWNTFKKGLKD-------QRNAKLGISPNN--------ELLVKGTYDGPTYSIGLLWLLL 239

Query: 255 LENINWKELGLLVFVWVAFLGLQIAKNHTAS---CSIVYWVLDLLQIPVSLVVSLYEAIS 311
           +  +        VF+ ++FL      +       CS +YW L     P+ L ++ +  + 
Sbjct: 240 IIYV--------VFLAISFLRGGDGADSIIGIKFCSPIYWALTFGPFPIYLGITAW-MVH 290

Query: 312 LYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
           + K   V+  K +  K      ++S    G +AG+  G LG+GGG I GP+ L L +  +
Sbjct: 291 IAKRYPVLGHKNELTKKDIFLLMMS----GFVAGMAAGFLGIGGGMIKGPMMLALEIEAE 346

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
             +AT++F +  +SS + ++Y      P      F ++  ++  +G   +R L+  LG  
Sbjct: 347 EMAATSSFMILMTSSATSIQYIAEGLMPWLEFGVFTSMGFVSFLIGVIFLRWLVKKLGNR 406

Query: 432 SLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           S+ ++ LA +I +SAI +  VGI  +I +   H  MGF   C
Sbjct: 407 SIFLYFLAAIIMISAILMSVVGIEIIILEVKEHASMGFRPFC 448


>gi|66825573|ref|XP_646141.1| hypothetical protein DDB_G0269644 [Dictyostelium discoideum AX4]
 gi|60474235|gb|EAL72172.1| hypothetical protein DDB_G0269644 [Dictyostelium discoideum AX4]
          Length = 549

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 110/416 (26%), Positives = 205/416 (49%), Gaps = 34/416 (8%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           ++G I+ F G A  S GGVGGGGI++P+L L+  +DPK+A  +S C++ G A++    N 
Sbjct: 137 IVGFILLFIGCALSSGGGVGGGGIYIPILILVSKWDPKTAIPLSNCLVAGCALANFIQNF 196

Query: 141 KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFL 200
             RHP  +  +IDY +ALLI+P+ + G   GV  +  F   ++ +LL++    TS K   
Sbjct: 197 PRRHPFSNKHLIDYSVALLIEPLTLAGTIFGVYLHTYFPPLVILLLLVITLGFTSYKTIS 256

Query: 201 KGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAME----------- 249
           KG++ WK E   KK+    L +           +  +   +    +              
Sbjct: 257 KGIDIWKSE---KKKKNSLLSNTDDNNNNNNNNNNNNNKNNNNDNENNNNNNSTGGNSNN 313

Query: 250 ---PEVTILENINWKELGLLVFVWVAFLGLQIAKNHTAS------CSIVYWVLDLLQIPV 300
               E T +ENI +     L+  +     +    +   S      CS VYW+L  + +PV
Sbjct: 314 KNINETTNIENIKYN----LILAFSTMFSIFKGGDQMYSIVGVKLCSPVYWILSFVMVPV 369

Query: 301 SLV---VSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGF 357
            ++   ++    +  Y+ ++    + +    +    ++      V+AG +  LLG+GGG 
Sbjct: 370 IIIAWGITAKFLMREYEKKKEEGREIEGEIKYTYKNILLLGILSVIAGCLASLLGIGGGM 429

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG 417
           I GP+ L++G+ P V++AT+++ + F+S+ S ++Y L+ +    Y + +  +  ++ F+G
Sbjct: 430 IKGPVLLQMGLVPDVTAATSSYMILFTSASSAIQYILVGKLRWDYGIVYYVIGFVSCFIG 489

Query: 418 QHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
              +  ++    R S I+F++ F+I  S I L    I+  I  F ++  + F+++C
Sbjct: 490 TQTLIWIVKKYQRRSYIVFLIGFVITFSTILL---VITESID-FVKYRNLTFDSIC 541


>gi|440293005|gb|ELP86177.1| hypothetical protein EIN_328840 [Entamoeba invadens IP1]
          Length = 492

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 119/462 (25%), Positives = 204/462 (44%), Gaps = 63/462 (13%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           EL F W++ +G+I   F A   +  G+GGG  ++ +  LI+  D   A  +SK    G A
Sbjct: 2   ELTFDWKLAVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDAHEAVPLSKITTFGVA 61

Query: 133 VSTVYYNLKLRHPTLD-MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLF 191
                      HPT+   P+I Y  AL+++P+ + G  +GV  N+    W++ V+L++L 
Sbjct: 62  CGGYLVLWMKMHPTIKYKPLISYATALMVEPLTIYGTMLGVILNITSPSWLIIVVLVLLL 121

Query: 192 IGTSTKAFLKGVETWKRETILKKEAARCLG-------SNGAGAGEVEYKSLPSGPRSG-- 242
             TS K F K  + +K E    K+A   +               ++  +SL +G  S   
Sbjct: 122 GYTSYKTFTKAWKQYKAENEKMKQAKIAIEMKEEKQPDQDTANDDMTTESLKTGAESDDK 181

Query: 243 -----------------PQKDAME----------PEVTILENINWKELGLL--------- 266
                            PQ D  E           E   +E+   KE  +L         
Sbjct: 182 VQSGVIVSDKIDLNESTPQDDKNEQGTGPQLLPQDENETVEDTKKKEEKILFKREVLKSI 241

Query: 267 ------VFVWVAFLGLQIAKNHTASCSIV--------YWVLDLLQIPVSLVVSLYEAISL 312
                 + VW     + I +      S+V        YW+L  +  P+ L V+L     L
Sbjct: 242 LSIIILIAVWAVMFCIVILRGGGKMDSVVGVQCGTPWYWILTGVGAPLMLTVTLIVGGVL 301

Query: 313 YKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
           +   R    +G+   S +   ++   A  + AG+    LG+GGG ++GP+ LE+GV PQV
Sbjct: 302 WYKHRGEHIEGEVQWSVKNCIIIPVGA--LFAGVSAAFLGIGGGMVIGPILLEIGVLPQV 359

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
           ++AT+ F + F++S S ++Y +  +  +   L++  +  I A  GQ    K++  + R S
Sbjct: 360 ATATSAFMIMFTASSSSLQYIIDGKLDLYCGLWYFGIGFIGAAFGQFGFSKIVQKMNRQS 419

Query: 433 LIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
           +I F L  +I +S +++  + +  +I     H  +GF +LCK
Sbjct: 420 VIGFFLGALIVLSTLAMVAMTVIQLIQDVKNHN-LGFHHLCK 460


>gi|167393930|ref|XP_001733526.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165894999|gb|EDR22813.1| hypothetical protein EDI_269710 [Entamoeba dispar SAW760]
          Length = 404

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 176/365 (48%), Gaps = 49/365 (13%)

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F W++++G+I   F A   +  G+GGG  ++ +  LI+  DP  A  +SK    G A   
Sbjct: 7   FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66

Query: 136 VYYNLKLRHPTLD-MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT 194
                  RHP +   P+I Y  AL+++P+ + G  IGV FN+I   W++ ++L++L   T
Sbjct: 67  FLILWMKRHPIVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVILLGFT 126

Query: 195 STKAFLKGVETWKRETILKKEAARCL--------------------GSNGAGAG------ 228
           S K F K ++ WK E   K++A +                        NG  A       
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDATKATELVETSKPDITDNDNDDMKPSENGNNAVIVDERI 185

Query: 229 EVEYKSLPSGPRSGPQKDAME--------PEVTILENINWK---ELGLLVFVWVAFLGLQ 277
           + E     +GP+  PQ ++ E         E T+L+    K    +G+L+ VW     + 
Sbjct: 186 QEEDDEQGTGPKLLPQDESQEAQQQAKIVEEKTLLKREIIKAVLSVGILIIVWAVMFFIV 245

Query: 278 IAKNHTASCSIV--------YWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSF 329
           I K      SIV        YW+L ++  P+ L V++   I L+  +R    +G+   + 
Sbjct: 246 ILKGGEKMDSIVGIECGTPWYWILTVIGGPLMLTVTIIVGIFLWWRQRGEEVEGEVQWTV 305

Query: 330 RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSV 389
           +   ++   AF   AG+    LG+GGG ++GP+ LE+GV PQV++AT+ F + F++S S 
Sbjct: 306 KNCLIIPIGAF--FAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTASSSS 363

Query: 390 VEYYL 394
           ++Y +
Sbjct: 364 LQYII 368


>gi|449519050|ref|XP_004166548.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101231924 [Cucumis sativus]
          Length = 455

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 120/410 (29%), Positives = 219/410 (53%), Gaps = 33/410 (8%)

Query: 80  IVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYN 139
           IV+  ++ F  A+  S GG+GGGG+++P+L+++ G D K+A++++ CM+ G +++ V  N
Sbjct: 63  IVVAGVLCFIAASLSSAGGIGGGGLYLPILTIVAGVDLKTASSLTACMVTGGSIANVLSN 122

Query: 140 LKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAF 199
              +  +    +I++D+ALL +P ++LG+SIGV  N+ F +W+ T+L  +    ++ K  
Sbjct: 123 FFSK--SGGKCLINFDIALLTEPCMLLGVSIGVICNLSFPEWVTTILFAIFLAWSTLKTC 180

Query: 200 LKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQ-KDAMEPEVTILENI 258
             G+  W+RE+         L +NG          L  G ++  + K   EP +   EN 
Sbjct: 181 KSGMVYWERES-------EGLMNNGC--------KLEDGLQNENEAKLVEEPLLPTQENC 225

Query: 259 NWK----ELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSLVVSL 306
             +    +LG LV VW  F  + + +            +C   YW+L  +Q+PV++  +L
Sbjct: 226 RSRFPSMKLGXLVLVWFCFYLIYLLRGDQDGRGLLPIETCGTGYWILSSVQVPVAIAFTL 285

Query: 307 YEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGV---LAGIVGGLLGLGGGFIMGPLF 363
           +        +   +S+ +  ++  V +      F V   LAGI+GG+ G+GGG ++ P  
Sbjct: 286 WILYKQKSPQSEDSSQKELEEARPVVEPSKKLIFPVMAFLAGILGGMFGIGGGMLISPFL 345

Query: 364 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRK 423
           L++G+ P+ ++AT +F + FS++MS  +Y LL       A+ F  +  IA+ +G  VV+K
Sbjct: 346 LQVGIVPEKTAATCSFMVFFSATMSAGQYLLLGMEHAEIAIIFAIICFIASVLGLVVVQK 405

Query: 424 LIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            I   GRAS+IIF ++ ++ +S + +   G  N+   +     MGF++ C
Sbjct: 406 AIRDHGRASVIIFSVSIVMALSTVLMTTFGALNVFRDYIAGNPMGFKSPC 455


>gi|449454696|ref|XP_004145090.1| PREDICTED: uncharacterized protein LOC101208650 [Cucumis sativus]
 gi|449470720|ref|XP_004153064.1| PREDICTED: uncharacterized protein LOC101220005 [Cucumis sativus]
          Length = 455

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 120/410 (29%), Positives = 219/410 (53%), Gaps = 33/410 (8%)

Query: 80  IVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYN 139
           IV+  ++ F  A+  S GG+GGGG+++P+L+++ G D K+A++++ CM+ G +++ V  N
Sbjct: 63  IVVAGVLCFIAASLSSAGGIGGGGLYLPILTIVAGVDLKTASSLTACMVTGGSIANVLSN 122

Query: 140 LKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAF 199
              +  +    +I++D+ALL +P ++LG+SIGV  N+ F +W+ T+L  +    ++ K  
Sbjct: 123 FFSK--SGGKCLINFDIALLTEPCMLLGVSIGVICNLSFPEWVTTILFAIFLAWSTLKTC 180

Query: 200 LKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQ-KDAMEPEVTILENI 258
             G+  W+RE+         L +NG          L  G ++  + K   EP +   EN 
Sbjct: 181 KSGMVYWERES-------EGLMNNGC--------KLEDGLQNENEAKLVEEPLLPTQENC 225

Query: 259 NWK----ELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSLVVSL 306
             +    +LG LV VW  F  + + +            +C   YW+L  +Q+PV++  +L
Sbjct: 226 RSRFPSMKLGALVLVWFCFYLIYLLRGDQDGRGLLPIETCGTGYWILSSVQVPVAIAFTL 285

Query: 307 YEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGV---LAGIVGGLLGLGGGFIMGPLF 363
           +        +   +S+ +  ++  V +      F V   LAGI+GG+ G+GGG ++ P  
Sbjct: 286 WILYKQKSPQSEDSSQKELEEARPVVEPSKKLIFPVMAFLAGILGGMFGIGGGMLISPFL 345

Query: 364 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRK 423
           L++G+ P+ ++AT +F + FS++MS  +Y LL       A+ F  +  IA+ +G  VV+K
Sbjct: 346 LQVGIVPEKTAATCSFMVFFSATMSAGQYLLLGMEHAEIAIIFAIICFIASVLGLVVVQK 405

Query: 424 LIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            I   GRAS+IIF ++ ++ +S + +   G  N+   +     MGF++ C
Sbjct: 406 AIRDHGRASVIIFSVSIVMALSTVLMTTFGALNVFRDYIAGNPMGFKSPC 455


>gi|348688213|gb|EGZ28027.1| hypothetical protein PHYSODRAFT_469591 [Phytophthora sojae]
          Length = 545

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 122/475 (25%), Positives = 216/475 (45%), Gaps = 86/475 (18%)

Query: 79  EIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYY 138
           E + G+I  F  +   +  G+GGGG+ VP+  + +      A  +SK  I G A+++   
Sbjct: 75  EEIAGSISAFLASVVAAGSGLGGGGLLVPLYIMTMSMSSHEAVPLSKATIFGGAIASFLL 134

Query: 139 NLKLRHPTL-DMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTK 197
           N++ RHP +   P+IDY+  LL++PM + G  IGV  N +F +W++T+ ++ L   T+ +
Sbjct: 135 NVRKRHPLVRSRPLIDYETILLMEPMTLAGTIIGVNMNAVFPEWLITLCIVWLLTKTALR 194

Query: 198 AFLKGVE----------------------------------------TWKRETILKKEAA 217
            + KG                                           WK     +KE  
Sbjct: 195 TYSKGKTIWKEEADADTKIVSDIVAYWRLLPYESNFKQFRAVARAYLKWKSYKSPEKEEL 254

Query: 218 RCLGSNGAGAGEVEYKSLPSGPRSGPQK---DAMEPE----------------VTILE-- 256
           R     G  + E ++ S  S   S  ++   D  E E                +++ E  
Sbjct: 255 RLKILAGKASSEEDHSSSNSTEASTEEEASSDENESESLMSWGLQDKKRPVKFLSVEEIA 314

Query: 257 ----NINWKELGLLVFVWVAFLGLQIAKNHTASCSIV--------YWVLDLLQIPVSLVV 304
                +   ++G+L   WV  +   +AK    + S++        YW L ++  P  + V
Sbjct: 315 KARRTVPMADMGVLFLTWVGLVLFSMAKGGHGTPSVIGLSCGSFGYWSLIVVSFPFFMGV 374

Query: 305 SLYEAISLYKGRRVIAS------KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFI 358
           ++Y  + + +   ++ +      KGD    +    ++ Y A    AG+  GLLG+GGG +
Sbjct: 375 TIYFGMKISRFHAMLQASDYTYAKGD--MVWTKHAVIKYPALCTAAGVAAGLLGIGGGMV 432

Query: 359 MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQ 418
            GPL LE+G+ PQVSSAT++  + F+SS + +++ +L    V +AL+   V  IA  +GQ
Sbjct: 433 KGPLLLEMGLIPQVSSATSSSMILFTSSATTIQFIILGTLSVEHALWHGTVGFIAGLIGQ 492

Query: 419 HVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
             +  LI    +++L+IF++A  I VS   +G +G++    +     + GF +LC
Sbjct: 493 LGMSYLIKKYRKSALVIFLIAIFIGVSGGVMGVLGVA----RISEIGFGGFRSLC 543


>gi|223998204|ref|XP_002288775.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975883|gb|EED94211.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 385

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 119/386 (30%), Positives = 185/386 (47%), Gaps = 31/386 (8%)

Query: 86  IGFFGAAFGSV----GGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLK 141
           +GFF AA G V    GG+GGGGI VP+  LI+ F PK A  +S   + G AV+   +N++
Sbjct: 3   LGFFFAALGLVLAAGGGIGGGGILVPVYILILDFLPKHAIPLSNVTVFGGAVANTIFNVR 62

Query: 142 LRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLK 201
            RHP  D P+ID+DL L+++P  +LG  +G   N I ++ ++ V+L+VL   T+     K
Sbjct: 63  KRHPLADRPLIDWDLILVMEPSTLLGALVGANLNKILSETLIAVMLVVLLSFTAYGTLKK 122

Query: 202 GVETWKRET-ILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINW 260
             + + +ET  LK E +   G       +  +     G R G   +  E  VT  E    
Sbjct: 123 AGKMYDKETEDLKNEWSYSDGLREHLVNDYSHMDDEEG-RKGANDNNKEDTVTEYEEFGM 181

Query: 261 KE---LGLLVFVWVAFLGLQIAKNHTAS-----C-SIVYWVLDLLQIPVSLVVSLYEAIS 311
            E   L  L+FV V  + +       AS     C S  +W      I  +L++     IS
Sbjct: 182 HEANSLDRLMFVVVLAINILKGGGGFASPVGIKCGSAAFW------ISQALLLVWIIGIS 235

Query: 312 LYKGRRVIASKG-DDGKSFRVFQ---------LVSYCAFGVLAGIVGGLLGLGGGFIMGP 361
           L   R++I       G  FR  +          + Y  F  +AG   G+ G+GGG + GP
Sbjct: 236 LVARRQLIKDTALKMGAGFRYLKEDMIWDDKSTIIYPLFSTVAGFCAGMFGIGGGIVKGP 295

Query: 362 LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVV 421
           L + +GV P V+SAT+   + F+S  +   + +       YA+  V +  +A  VGQ + 
Sbjct: 296 LMIMMGVHPAVASATSACMILFTSFTATTTFAVYGLLVHDYAIACVILGFVATAVGQTIT 355

Query: 422 RKLIILLGRASLIIFILAFMIFVSAI 447
            +L+    R S I F + F++ +SA+
Sbjct: 356 TRLLKKSRRNSYIAFSIGFVVLLSAL 381


>gi|297737351|emb|CBI26552.3| unnamed protein product [Vitis vinifera]
          Length = 100

 Score =  122 bits (307), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 61/76 (80%), Positives = 68/76 (89%)

Query: 127 MIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVL 186
           M MGAA ST+Y NLKL+HPTLDMPII+YDLALL QPMLM+GISI VAFNV+F D MVT+L
Sbjct: 24  MNMGAASSTIYCNLKLKHPTLDMPIIEYDLALLFQPMLMMGISIEVAFNVVFVDSMVTIL 83

Query: 187 LIVLFIGTSTKAFLKG 202
           LIVLF+GTSTK FLKG
Sbjct: 84  LIVLFLGTSTKTFLKG 99


>gi|301122685|ref|XP_002909069.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099831|gb|EEY57883.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 470

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 106/406 (26%), Positives = 194/406 (47%), Gaps = 43/406 (10%)

Query: 103 GIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQP 162
            I VP + LI+GFD K AT +S   I+G A++  ++N++ RHP +D P+ID DL+  + P
Sbjct: 74  TILVPAMVLIMGFDIKRATPVSNLAIVGGAIANAWFNIRKRHPAVDRPLIDADLSFSMIP 133

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE-------TILKKE 215
           ++M G  +G     +   +++++L +V+ +   T+   KG++ ++ E       T  +++
Sbjct: 134 LVMGGAVVGTVLAKLLPSYLLSLLFVVVLVLGGTRTVSKGIKMYRAEMKSCKVQTTEEQQ 193

Query: 216 A---------ARCLGSNGA-GAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKEL-- 263
           A         + C     A   G+    SL  G  S   +     +  +L  I  +E   
Sbjct: 194 AAAYAAVCSPSSCTEDKFADDGGDSTSHSLLKGTGSLSNECGGSTDEEVLTEIVERERHF 253

Query: 264 -----GLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKG--R 316
                G ++  ++  +   I     +   I YW+L ++++P      +  A+ LY+   R
Sbjct: 254 SLTKHGAIMLCYMGIVAASIGGAAVSCGGITYWLLLIIELPWIAGFGVCTAVYLYRQHCR 313

Query: 317 RVIA----SKGD---DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVP 369
           +V      + GD     K+   F L   CA    AG++ GL G+GGG + GPL +E+G+ 
Sbjct: 314 KVSVNYEFAAGDIHWTKKTVVRFPLA--CAG---AGLIAGLFGVGGGIVTGPLMIEMGIV 368

Query: 370 PQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLG 429
           P+V+SAT    + +SS+ +  ++ +       +AL   AVA +   V Q ++   +   G
Sbjct: 369 PEVASATTALMVLYSSAAATAKFAVFNMTAWDWALLLSAVAFVVTAVSQVIILGFVRRTG 428

Query: 430 RASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFE-NLCK 474
           R S+I+  +   I + A+ +    I + I    +H    FE N+C+
Sbjct: 429 RQSVIVLCIGATICIGAVLMTYQAIKSTI----QHAGDPFEVNVCR 470


>gi|301117154|ref|XP_002906305.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107654|gb|EEY65706.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 543

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 113/453 (24%), Positives = 200/453 (44%), Gaps = 96/453 (21%)

Query: 106 VPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL-DMPIIDYDLALLIQPML 164
           VP+  +++      A  +SK  I G A+++   N++ RHP +   P+IDY+  LL++PM 
Sbjct: 99  VPLYIMLMSMSSHEAVPLSKTTIFGGAIASFLLNVRKRHPLVRSRPLIDYETMLLMEPMT 158

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIGTS----------------------------- 195
           + G  IGV  N +F +W++T+ ++ L   T+                             
Sbjct: 159 LAGTIIGVNMNAVFPEWLITICIVWLLTKTALRTYSKGKKIWKEEVDADNKIIMDIVAYW 218

Query: 196 ---------------TKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPR 240
                           +A+LK    WK     +KE  R L      A  VE +   S   
Sbjct: 219 RLLPYESNFKQFQVVARAYLK----WKAYKSPEKEELR-LKILADEASSVEERKSSSNIT 273

Query: 241 SGPQKDAMEPEVTILENI-NW-------------------------KELGLLVFVWVAFL 274
               ++    +    EN+ +W                          ++G+L   W+  +
Sbjct: 274 EASTEEETSSDENESENLMSWGLQDKRPVKFLSVEEIAKTRRTVPMADMGVLFLTWIGLV 333

Query: 275 GLQIAKNHTASCSIV--------YWVLDLLQIPVSLVVSLYEAISLYKGRRVIAS----- 321
              +AK    + S++        YW+L ++  P  + V++Y  + + +   ++ +     
Sbjct: 334 LFSMAKGGHGTPSVIGLSCGSIGYWLLVIVSFPFFMSVTIYFGMKISRFHTMLQASDYTY 393

Query: 322 -KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFA 380
            KGD    +  F +V Y A    AG+  GLLG+GGG + GPL +E+G+ PQVSSAT++  
Sbjct: 394 AKGD--MIWTKFAVVKYPALCTAAGVAAGLLGIGGGMVKGPLLIEMGLLPQVSSATSSSM 451

Query: 381 MTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAF 440
           + F+SS + +++ +L    V +AL+  AV  +A  +GQ  +  L     +++L+IF++A 
Sbjct: 452 ILFTSSATTIQFIILGTLSVNHALWHGAVGFVAGLIGQLGMSYLFKKYRKSALVIFLVAV 511

Query: 441 MIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            I VS I +G +G      +     + GF +LC
Sbjct: 512 FIGVSGIVMGVLGAV----RISEIGFGGFRSLC 540


>gi|428174477|gb|EKX43372.1| hypothetical protein GUITHDRAFT_95350 [Guillardia theta CCMP2712]
          Length = 572

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 184/389 (47%), Gaps = 39/389 (10%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           I VPM   I  F P     +SK  I G AV+  YYN++ RHP  + P++DY+  ++++P+
Sbjct: 180 ILVPMYLSIGKFSPHYGIPLSKATIFGGAVTNNYYNVQRRHPYANRPLVDYNTCMMLEPV 239

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWK--RETILKKEAARCLG 221
           L+LG  IGV FN +   W++T+LL++    T+ +  +K +ET+    + + ++E    LG
Sbjct: 240 LLLGTIIGVFFNAVSPGWLITILLVLSLTYTTYRTSVKALETYNKEEKAVKEEETKHLLG 299

Query: 222 SNGAGAGEVEYKSLPSGPRSGPQ--KDAMEPEVTILENINWKELGLLVFVWVAFLGLQIA 279
           S              +GP   P    DA  PE   L  I   E  +L   W+      I 
Sbjct: 300 SK-------------AGPEQHPSFMLDANIPE--DLREIYEAESRVLTISWIIIAVCSIL 344

Query: 280 KNHTASCSIV------YWVLDLLQIPVSLVVSLYEAISLYKGRRVIA------SKGD--- 324
           K       IV      YW+L    +P+ L + +Y    L +G           ++GD   
Sbjct: 345 KGGEGGQGIVACGSLGYWLLVAAPLPMVLGLVMYCGDILVRGYEDKLRLGYEFAEGDIHW 404

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
             K+  V+ L  YC   + AG   G LG+  G I+GP+ LELG+ P V +A++ F + F+
Sbjct: 405 TRKNASVYPL--YC---ISAGFAAGALGIAAGTILGPILLELGMLPLVGTASSGFMVIFT 459

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           +S +  ++ ++ +  + YAL+F  +  +   +G  VV   +    +   ++ IL+ ++  
Sbjct: 460 ASSTTFQFLIMGQLQIDYALFFCGIGLLGGAIGNTVVSFFVKKYKKTWFVVAILSAVLAA 519

Query: 445 SAISLGGVGISNMIGKFHRHEYMGFENLC 473
           S + +G  G       +   + MG   LC
Sbjct: 520 STVLMGYAGFERAELSYDHGKNMGIRRLC 548


>gi|358344827|ref|XP_003636488.1| hypothetical protein MTR_042s0030 [Medicago truncatula]
 gi|355502423|gb|AES83626.1| hypothetical protein MTR_042s0030 [Medicago truncatula]
          Length = 150

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 76/119 (63%), Positives = 83/119 (69%), Gaps = 22/119 (18%)

Query: 308 EAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELG 367
           +A +L+  RRVIAS GD GK+F V  L+ YC FGVLAGIVGGLLGLGGGF+MGPLFLELG
Sbjct: 27  KATALFTQRRVIASAGDQGKAFTVGLLIIYCVFGVLAGIVGGLLGLGGGFVMGPLFLELG 86

Query: 368 VPPQ----------------------VSSATATFAMTFSSSMSVVEYYLLKRFPVPYAL 404
           VP Q                      V SAT TFAMTFSSSMSVVEYYLLKRFP+PY L
Sbjct: 87  VPSQSSESFKLQYINRHSLILIIKQLVPSATTTFAMTFSSSMSVVEYYLLKRFPIPYGL 145


>gi|297849536|ref|XP_002892649.1| hypothetical protein ARALYDRAFT_334446 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338491|gb|EFH68908.1| hypothetical protein ARALYDRAFT_334446 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 83/280 (29%), Positives = 138/280 (49%), Gaps = 36/280 (12%)

Query: 41  FNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPEL-----KFGWEIVLGTIIGFFGAAFGS 95
           FN T   D N +L A         S + H  P L     KF    ++  ++ FF A   S
Sbjct: 16  FNPTAIADQNQILNAT--------SQWLHFPPNLNESTIKFSIPTIIAAVLSFFAA---S 64

Query: 96  VGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPIIDY 154
           +   GGG +F+ +++ I G + K+A++ S  MI G +++ V  NL  R+P + D  +ID+
Sbjct: 65  ISSAGGGALFLSIMTTISGLEMKTASSFSAFMITGVSIANVGCNLFARNPKSRDKTLIDF 124

Query: 155 DLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKK 214
           DL+L +QP L+LG+SIGV  N +F +W+V  L  V    ++ K   KGV  W  E+  +K
Sbjct: 125 DLSLTLQPCLLLGVSIGVICNRMFPNWLVLSLFAVFLAWSTMKTCKKGVSYWNLESEREK 184

Query: 215 EAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFL 274
             +R          + +   +   P    + +A E E  ++    W +LG+LV +W+ F 
Sbjct: 185 IRSR---------RDDDRIKVARSPLLANEGEA-EVERGMIR-FPWMKLGVLVIIWLVFF 233

Query: 275 GLQIAKNHT--------ASCSIVYWVLDLLQIPVSLVVSL 306
            + + + +           C  +YW L  LQIP+++  +L
Sbjct: 234 SINLFRGNKYGQGIISIKPCGGLYWFLSSLQIPLTIFFTL 273


>gi|414880189|tpg|DAA57320.1| TPA: hypothetical protein ZEAMMB73_804983 [Zea mays]
          Length = 407

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 134/284 (47%), Gaps = 22/284 (7%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           +F+P+L+L+ G   K ATA S  M+ G A S V YNL          +IDYD+ALL QP 
Sbjct: 81  LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSN 223
           L+LG+SIGV  NV+F +W++T+L  +     + K    GV+ W+ E+      AR     
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSESSGCGSGAR----- 195

Query: 224 GAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQ--IAKN 281
            +  G   +   P  PR G      E          W ++ LLV +W+ F  L   I   
Sbjct: 196 -SSRGGHSHSKEPLLPR-GTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFALHVLIGDK 253

Query: 282 H------TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKS------- 328
           H         C + YW++   Q+P ++  + Y   +  K R V +   +DG         
Sbjct: 254 HGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSKADLADAG 313

Query: 329 FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
                L +      + G + GL G+GGG ++ P+ L++G+PPQV
Sbjct: 314 VEALPLPTLPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQV 357


>gi|320170699|gb|EFW47598.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 642

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 108/433 (24%), Positives = 190/433 (43%), Gaps = 74/433 (17%)

Query: 88  FFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL 147
           FFG++   +GGVGGGG+FVP+L +I  F    A  IS  MI  AA+ ++ + ++ + P  
Sbjct: 199 FFGSSLAVIGGVGGGGLFVPLLMIITHFKADQAVPISSTMITAAAIMSLLFEIRAKRPN- 257

Query: 148 DMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWK 207
             P+IDYD++ L+QP+ + G ++GV  NV+   WM+ + L+V+ + T+TK   KG+  +K
Sbjct: 258 GRPVIDYDVSFLLQPVCLAGTTLGVFLNVLLPGWMIILALLVILVYTTTKTMKKGITMYK 317

Query: 208 RETILKK------------------------------------------EAARCLGSN-- 223
           +E+  ++                                           +A  L  N  
Sbjct: 318 KESQQRRALANGGTSANVAASAPAAAPLQKLNKRHHKHQDKESLVASTDTSAEQLSVNMD 377

Query: 224 ------GAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLL--VFVWVAFLG 275
                    AG ++           P KD +  +  + + + +    +L  + +W+  L 
Sbjct: 378 DSDLSDSEDAGRIQLSQPVPSEAELPSKDQVLYQRQLDQELRFPTTQILGMIAMWLIVLA 437

Query: 276 LQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLV 335
               K   + CS  +W++  L +P++++V+L+    L   R     K   G  F     V
Sbjct: 438 CSTIKRFVSKCSAEFWIVAFLPLPIAILVTLWYGRRL---RDAFELKQRCGHQFEPTDFV 494

Query: 336 SYCAFGVLAGIVGGLLGLGGGFI-----------MGPLFLELGV---PPQVSSATATFAM 381
               F     IV  LL  GGG             +GPL L + V    P V++A +   +
Sbjct: 495 ----FNRRNTIVYPLLSFGGGLAGGMVGVGGAMVIGPLLLNMAVQTPDPSVTTAISNLLV 550

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
            F+++ +V+++ +L      YAL+  A   +A+ + + V++      GR S ++F L   
Sbjct: 551 VFTAASTVIQFVILNTLVYDYALFLSAFTLMASVLSKKVLKPWFDNKGRKSFVVFALVLS 610

Query: 442 IFVSAISLGGVGI 454
           I +S I     GI
Sbjct: 611 ISLSGILTAIEGI 623


>gi|242083980|ref|XP_002442415.1| hypothetical protein SORBIDRAFT_08g019640 [Sorghum bicolor]
 gi|241943108|gb|EES16253.1| hypothetical protein SORBIDRAFT_08g019640 [Sorghum bicolor]
          Length = 170

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/110 (53%), Positives = 73/110 (66%), Gaps = 29/110 (26%)

Query: 117 PKSATAISKC-----------------------------MIMGAAVSTVYYNLKLRHPTL 147
           PKS+TAISKC                             MIMG +VSTVYYNLKL+HP+L
Sbjct: 27  PKSSTAISKCEYYWFYIYSTAIYVYLNLNEKAQVCWFCSMIMGGSVSTVYYNLKLKHPSL 86

Query: 148 DMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTK 197
           DMP+I+YDLALL+QPMLMLG+SIGV FNVIF +W++T LLI +F+G   +
Sbjct: 87  DMPLIEYDLALLMQPMLMLGVSIGVIFNVIFPNWLITALLITIFLGQEPE 136


>gi|15233581|ref|NP_193858.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
 gi|2911083|emb|CAA17545.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268923|emb|CAB79126.1| hypothetical protein [Arabidopsis thaliana]
 gi|332659033|gb|AEE84433.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
          Length = 393

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 119/433 (27%), Positives = 203/433 (46%), Gaps = 74/433 (17%)

Query: 26  VCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTI 85
           +CV A    + E +    T++   N L K   +    + S       ELK    +V+  +
Sbjct: 12  LCVFAINANQQEEINQESTQQTHQNLLYKVQKWRTSLKDSSD----AELKLSPALVVAGV 67

Query: 86  IGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP 145
           + F  A   S  G+                D K+A++ S  M+ G +++ +  N    H 
Sbjct: 68  LCFTAALISSASGI----------------DLKAASSFSAFMVTGGSIANLINN----HF 107

Query: 146 TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVET 205
                +IDYDLALL++P ++LG+S+GV  N +F +W++T L +V  + +S +    G  +
Sbjct: 108 GCK-KLIDYDLALLLEPCMLLGVSVGVICNKVFPEWLITGLFVVFLMWSSMETCENGHTS 166

Query: 206 WKRETILK-KEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELG 264
           WK   IL+ KE  R      +   EV+ +                   TI   I +K L 
Sbjct: 167 WKLSLILREKEDMR-----DSRLAEVKRRR------------------TI---IFFKHLY 200

Query: 265 LLV---FVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIAS 321
           L +       +FLG  +       CS+ YW+L  LQIP++LV ++   ++L +   +   
Sbjct: 201 LKIKKTETKQSFLGRNLGIISIKPCSVEYWILLSLQIPLALVFTI---LALSRTESLQEQ 257

Query: 322 KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 381
              + +                AG++GG+ G+GGG I+ PL L  G+PPQV++AT +F +
Sbjct: 258 SISNQE----------------AGLLGGIFGIGGGMIISPLLLRAGIPPQVTAATTSFMV 301

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
            FS++MS V+Y LL       A  F  +   A+ +G    +K++    RAS+I+F++  M
Sbjct: 302 FFSATMSGVQYLLLGMQNTEAAYVFSVICFFASTLGLVFAQKVVPHFRRASIIVFLVGTM 361

Query: 442 IFVSAISLGGVGI 454
           ++++ I +   GI
Sbjct: 362 MYLTTIVMASFGI 374


>gi|348676056|gb|EGZ15874.1| hypothetical protein PHYSODRAFT_316057 [Phytophthora sojae]
          Length = 437

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 94/372 (25%), Positives = 166/372 (44%), Gaps = 63/372 (16%)

Query: 100 GGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALL 159
             G I VP + LI+GFD K AT IS   I+G A++  ++N++ RHP+ D P+ID DLAL 
Sbjct: 75  ASGVIMVPAMVLIMGFDIKRATPISNVGILGGALANAWFNMQKRHPSADRPLIDADLALG 134

Query: 160 IQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARC 219
           + P+L+               ++V++L +V+   + T+  +KG++ ++ E+  K +    
Sbjct: 135 MIPVLL-------------PSYIVSLLFVVVLAASGTRMMIKGIQLYRAESTKKAQ---- 177

Query: 220 LGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVT---------------------ILE-- 256
                A A   +       P +  Q     P +                      ILE  
Sbjct: 178 -----ADADSKDTADAAMSPDAYAQAFTPNPSIDSDASAAKSASASAQAVKVLAEILEQE 232

Query: 257 -NINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKG 315
            +  W++ G ++  ++  +   I     +   +  WV+ L +IP       + A     G
Sbjct: 233 RHFAWRKHGAILVCYLGVVATSIGDASVSCGGVADWVILLAEIP-------WVARKASVG 285

Query: 316 RRVIASKGD---DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
              I  +GD     K+   F L   CA G   GIV G+ G+GGG I GP+ +E+G+ P+V
Sbjct: 286 YLYI--EGDIRWTQKAVICFPL--GCALG---GIVAGMFGVGGGIITGPIRIEMGIVPEV 338

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
           +S+T    + +SS+ +  +Y +       +A    AV        Q V+   +   GR S
Sbjct: 339 ASSTMALMILYSSAAATAKYTVFNMIAWDWAALLCAVTFAVTSAAQVVILAYVRRSGRQS 398

Query: 433 LIIFILAFMIFV 444
           +++  ++  + +
Sbjct: 399 IVVLCISAAVVI 410


>gi|303279591|ref|XP_003059088.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458924|gb|EEH56220.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 461

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/143 (39%), Positives = 89/143 (62%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
            L +   I + ++  FF +   + GGVGGGG+FVP+L L++G   K A  +S CMI+  A
Sbjct: 9   SLDWDATIAIASVACFFVSILANAGGVGGGGVFVPLLMLVVGLSGKWAIPVSNCMILAGA 68

Query: 133 VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI 192
           +   ++NL  RHPT D P++D + ALL+ P  + G + GV  NV+F +W+V+ +LI L  
Sbjct: 69  IPATFFNLMKRHPTRDRPLLDTNAALLLIPATLAGTTPGVMLNVLFPEWLVSAMLICLLT 128

Query: 193 GTSTKAFLKGVETWKRETILKKE 215
            TST+ F KG   W++E  +K++
Sbjct: 129 YTSTQTFQKGKREWRKEGEIKRK 151


>gi|348675995|gb|EGZ15813.1| hypothetical protein PHYSODRAFT_263156 [Phytophthora sojae]
          Length = 421

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 102/393 (25%), Positives = 182/393 (46%), Gaps = 50/393 (12%)

Query: 106 VPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLM 165
           VP LS +  +D  S       MI+G AVS+          T D P+ID +LAL + P+++
Sbjct: 38  VPSLSEMDTYDALSIA----FMIIGLAVSSA-------GATADRPLIDPELALGLIPVVI 86

Query: 166 LGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGA 225
            G  +G   N +   ++V++L +V+   +  +  L+G+  +K+E   K+  +    +N  
Sbjct: 87  GGTVLGAVINKLIPSYIVSLLFVVVLTFSDARMTLRGIRLFKQEVAQKRAES---SANET 143

Query: 226 GAGEVEYKSL---PSGPRSGPQKD---AMEPEVTILEN-----INWKELGLLV------- 267
            A + E  S+    S P+  P  D   A E  ++I  +     ++  E G +        
Sbjct: 144 KADDPESPSVYIKASTPQ--PSSDDIMAEEHRLSISRSSLKSVLDEDEDGAIRSQILGKE 201

Query: 268 --FVW-------VAFLGLQIAKNHTASCS---IVYWVLDLLQIPVSLVVSLYEAISLYK- 314
             F W       V FLG+       AS     +VYW+L L+++P  +    + +  L+K 
Sbjct: 202 RHFAWGSHSATLVCFLGVVATSIGDASVDCGGVVYWILLLIEVPWVVAFVFFTSHYLHKI 261

Query: 315 --GRRVIASKGDDGKSFRVFQLVSYCAFGV-LAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
              +  ++ +  DG      + V Y   G  +AGIV G+ G+GGG I GP+ +ELG+ P+
Sbjct: 262 YLRKEAVSYQYVDGDIQWTKKTVVYFPLGCAVAGIVAGMFGVGGGIITGPIMIELGIVPE 321

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
           V+S+T    + +SS+ +  ++ + K     +AL   AVA +     Q ++   +   GR 
Sbjct: 322 VASSTTALMILYSSAAATAKFAVFKMIAWDWALLLCAVAFVVTSASQVMILGFVRRTGRQ 381

Query: 432 SLIIFILAFMIFVSAISLGGVGISNMIGKFHRH 464
           S+II  ++  + +  I +    + + I     H
Sbjct: 382 SIIILCISASVTLGTILMTYEAVKDTINDAGNH 414


>gi|224014498|ref|XP_002296911.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968291|gb|EED86639.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 516

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 99/397 (24%), Positives = 168/397 (42%), Gaps = 90/397 (22%)

Query: 82  LGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLK 141
           +GT + F  AA     GVGGGGI+VP+  L++ F PKS++ +S+  I GAA+  +  NL+
Sbjct: 114 VGTFLWFLTAALSITCGVGGGGIYVPVGILLLRFPPKSSSGLSQASIFGAALGGLIVNLR 173

Query: 142 LRHPTL-----------------------------------------------DMPIIDY 154
            RHP                                                   P+IDY
Sbjct: 174 NRHPYTFVRDTKGTPLEDHPGKIVSYEKDKGPAAIEKDREAYLAGGDGKRKFYTRPVIDY 233

Query: 155 DLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKK 214
           D+AL + PM M G  +GV    +F +W+      V+   TS K + K   ++K++ + ++
Sbjct: 234 DMALFLAPMEMAGAVLGVIIQKLFPNWLFLSFATVILGFTSYKTYKKFFSSYKKDKLNRE 293

Query: 215 EAAR----------CLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVT----------I 254
            A R             ++  G  E    +  +   +G    A++ ++            
Sbjct: 294 TAMRLSMAESMNVSASAADATGNEEPSNDADANAEENGTADTAIKDDLDDPKELEKRRMF 353

Query: 255 LENINWK----ELGLLVFVWVAFLGLQIAKNHTASCSIV--------YWVLDLLQIPVSL 302
           LEN + +    ++  L+ +W+    +   K      S++        Y+VL   Q    L
Sbjct: 354 LENDSRQFPKEKVAYLILLWIGLAVITFLKGGKGVDSLIGITCEDSAYYVLVAAQF---L 410

Query: 303 VVSLYEAISLYKGRRVIASKGDDGKSFR-------VFQLVSYCAFGVLAGIVGGLLGLGG 355
               + A   +K  +    +      F+       + +L  Y  F  +AGIV GL+G+GG
Sbjct: 411 WTMGFAAFFGWKNTKRTQERLAVNYPFQEQDVLWDLKKLQFYSFFTFVAGIVAGLIGIGG 470

Query: 356 GFIMGPLFLELGVPPQVSSA-TATFAMTFSSSMSVVE 391
           G ++GPL + +GV P V++A TAT  +  SSS++V+ 
Sbjct: 471 GMVLGPLMMVMGVHPSVATATTATMVVLTSSSVAVIR 507


>gi|268083363|gb|ACY95275.1| unknown [Zea mays]
          Length = 94

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 50/94 (53%), Positives = 70/94 (74%)

Query: 381 MTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAF 440
           M FSSS+SVVE+Y L RFP+P+A Y + ++ +A F GQ +VRK++ +L RASLI+FIL+ 
Sbjct: 1   MMFSSSLSVVEFYFLHRFPLPFAGYLIFISILAGFWGQCLVRKIVHVLKRASLIVFILSS 60

Query: 441 MIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
           +IFVSA+++G VG    I   + HEYMGF N C+
Sbjct: 61  VIFVSALTMGVVGTQKSISMINNHEYMGFLNFCE 94


>gi|428163170|gb|EKX32257.1| hypothetical protein GUITHDRAFT_148761 [Guillardia theta CCMP2712]
          Length = 538

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 114/415 (27%), Positives = 185/415 (44%), Gaps = 57/415 (13%)

Query: 84  TIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLR 143
           T+I F   A  +  G GGGGI VPM  +I  F P SA  +SK  I+G A++    N++ R
Sbjct: 136 TVITFVTIALAAPTGTGGGGILVPMYMIIGHFSPHSAIPLSKATILGGAIANNLINIQRR 195

Query: 144 HPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGV 203
           HP  + P++DYD   ++ P L++G  +GV  N +   W+VT+ L+V  +G S     K  
Sbjct: 196 HPFANRPLVDYDSLQILVPSLLIGTILGVFLNAVSPAWLVTLGLVV-SLGYSFAIAAKKA 254

Query: 204 ETWKRETILKK--EAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTIL----EN 257
                E +LK   E    LG         E K  P+   S   +D +EP++  +      
Sbjct: 255 WAIYVEEVLKSLPEREPLLG---------ERKEQPAQHYSF-DEDKLEPQLREIIKAESR 304

Query: 258 INWKELGLLVFVWVAFLGLQIAK-----NHTASC-SIVYWVLDLLQIPVSLVVSLYEAIS 311
            ++K +G++V  W+      + K     N   +C S  +W++ LL  P+ +++S     S
Sbjct: 305 HDFKAIGMIVISWILVAVCSLIKGGSGPNQFVACGSWSFWMVALLPFPIVMILSWRVGTS 364

Query: 312 L---YKGRRVIA---SKGD---DGKSFRVFQLVSYCAFGVLAGIVGGLLGL---GGGFIM 359
           L   ++ ++      ++GD   D +  R+F  VS    G+LAG +GG+      G     
Sbjct: 365 LNEKFESKKACGYRFAEGDAVWDVQHVRIFPFVSIIV-GILAGALGGVEPCGERGDDGAH 423

Query: 360 GPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQH 419
           G L + L             A++   S      +LL               T+AA +G  
Sbjct: 424 GSLHVLLD----------HHAVSLPRSAQARLRHLLH-----------PRRTVAASIGNT 462

Query: 420 VVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
            +  +     +   ++ ILA  I +SA+ LG VG    I  +   E MGF ++C 
Sbjct: 463 AIHHVSRKYRKTWFVVAILAITIGLSAVLLGYVGYYRAIRSWLEGEDMGFRDICH 517


>gi|401403532|ref|XP_003881498.1| Os03g0726500 protein, related [Neospora caninum Liverpool]
 gi|325115911|emb|CBZ51465.1| Os03g0726500 protein, related [Neospora caninum Liverpool]
          Length = 494

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 82/137 (59%), Gaps = 2/137 (1%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           EL  G  IV   +I   GA      G GGG IFVP++ LI+ F+   ATA S+C++ G+A
Sbjct: 192 ELNNGPTIVCIVLIAIVGA-VSVTAGTGGGAIFVPLMQLIMHFNTFEATATSQCLMTGSA 250

Query: 133 VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI 192
           ++ +  N   R+P +DMP+ID D+ LL+ PM M G S+GV  N +   W++TVLL+V  +
Sbjct: 251 LAGLCLNFVRRNPVVDMPLIDMDMVLLLGPMQMCGSSVGVIVNRVLPAWLITVLLVVCLL 310

Query: 193 GTSTKAFLKGVETWKRE 209
             + +  ++ +   +RE
Sbjct: 311 YETVR-LMRRLRDKQRE 326


>gi|40253413|dbj|BAD05342.1| unknown protein [Oryza sativa Japonica Group]
 gi|40253789|dbj|BAD05727.1| unknown protein [Oryza sativa Japonica Group]
          Length = 132

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 59/82 (71%), Positives = 70/82 (85%), Gaps = 1/82 (1%)

Query: 47  QDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFV 106
           ++  Y+ K VNFLW  E S YHHVWP ++FGW+IVLG +IGFFGAAFGSVGGVGGGGIFV
Sbjct: 38  EEVGYMRKVVNFLWSGEAS-YHHVWPPMEFGWKIVLGILIGFFGAAFGSVGGVGGGGIFV 96

Query: 107 PMLSLIIGFDPKSATAISKCMI 128
           PML+LIIGFD KS+TAISK ++
Sbjct: 97  PMLTLIIGFDAKSSTAISKFIV 118


>gi|148906476|gb|ABR16391.1| unknown [Picea sitchensis]
          Length = 157

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 73/138 (52%), Positives = 85/138 (61%), Gaps = 13/138 (9%)

Query: 1   MADLGAKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSN---------- 50
           MA  G KW +L  VM+      LA     AER         + TER              
Sbjct: 1   MAGGGRKW-LLVVVMVATFGLLLAVTLSSAERTFLDGNSASSETERSMEKIVDPVEGNTL 59

Query: 51  --YLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPM 108
             ++LKA NFLW+   S Y  VWPE+KFGW IV+G+I+GFFGAA GSVGGVGGGGIFVPM
Sbjct: 60  KYHVLKARNFLWRSNISDYQRVWPEMKFGWRIVIGSIVGFFGAAVGSVGGVGGGGIFVPM 119

Query: 109 LSLIIGFDPKSATAISKC 126
           L+LIIGFD KS+TAISKC
Sbjct: 120 LTLIIGFDAKSSTAISKC 137


>gi|219112381|ref|XP_002177942.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410827|gb|EEC50756.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 644

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 106/416 (25%), Positives = 184/416 (44%), Gaps = 60/416 (14%)

Query: 86  IGFFGAAFG----SVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLK 141
           IGFF A  G    + GG+GGGGI VP+  L++GF PK A  +S   ++G AV+    N +
Sbjct: 207 IGFFLATLGLMVAAGGGIGGGGILVPVYILVMGFTPKHAIPLSNVTVLGGAVANTILNAR 266

Query: 142 LRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLK 201
            RHP  D P++D+DL L+++P+ + G  +G   N +  + ++TVLL++L   T+  +  K
Sbjct: 267 KRHPLADRPLVDWDLILVMEPLTIAGALLGAFLNKVLPELLLTVLLVLLLSVTAYTSLTK 326

Query: 202 GVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENIN-- 259
            ++ + RE+     A   +  +G    E+   +     R   Q D  E    +LEN+   
Sbjct: 327 ALKLYARESRAMAAAQGLVRVDGTKESELTVMA-----RLEDQDDHDEAAEVLLENMERD 381

Query: 260 -------------------------------------WKELGLLVFVWVAFLGLQIAKNH 282
                                                   + +LV +++  L + + K  
Sbjct: 382 DDDDESSSDDDMKSVELPASSLQAELDQLLEEECTTPMANISILVTMFIVVLTINVLKGG 441

Query: 283 TA-------SC-SIVYWVLDLLQIP----VSLVVSLYEAISLYKGRRVIASKGDDGKSFR 330
            A        C S  +W+ +L+ +     +S+ +  Y      + RR+     +    + 
Sbjct: 442 GAFPSPLGIRCGSRAFWIANLVMLAWIGIISVGIRAYLVRRFEQKRRLSFPYVEGDIRWD 501

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
               + Y     +AG   G+ G+GGG + GPL L +GV P VSSA++   + F+S  +  
Sbjct: 502 ARATIVYPVVCCMAGFFAGMFGVGGGIVKGPLMLAMGVHPAVSSASSACMILFTSFTATT 561

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSA 446
            + +       YA   +A+  +A F GQ  +  L+    R S I F +  ++ +SA
Sbjct: 562 SFVVFGLLVWDYAYVCMAIGFVATFAGQVGLSYLMRRAQRNSYIAFSIGAVVLLSA 617


>gi|297597753|ref|NP_001044473.2| Os01g0786700 [Oryza sativa Japonica Group]
 gi|255673761|dbj|BAF06387.2| Os01g0786700 [Oryza sativa Japonica Group]
          Length = 300

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 78/216 (36%), Positives = 115/216 (53%), Gaps = 15/216 (6%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           VL  I+ F  AAF S GGVGGG ++VP+L+++ G   K+ATA S  M+ G  +S V Y L
Sbjct: 6   VLACILSFLAAAFSSAGGVGGGSLYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTL 65

Query: 141 ---KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTK 197
              +        P+IDYD+A++ QP L+LG+S+GV  NV+F +W++T L  V     + K
Sbjct: 66  IVLRGHEKGGHQPLIDYDIAVVSQPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFK 125

Query: 198 AFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILEN 257
            +  G++ W+ ET   +       S G GAGE        G + G        ++ +L  
Sbjct: 126 TYGTGMKRWRAETAAARRMLEGGSSLGDGAGEALL-----GQKDGDGHRRQCVDLMVLVT 180

Query: 258 INWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVL 293
           I W    L  FV   F+G + AK  TAS + ++ VL
Sbjct: 181 I-W----LCFFVIHLFIGGEGAK--TASSTTMFMVL 209



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 59/107 (55%)

Query: 367 GVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           G   + +S+T  F + F +SMS+V++ +L    +  AL +     +A+ VG  V++  I 
Sbjct: 194 GEGAKTASSTTMFMVLFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGTIR 253

Query: 427 LLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
             GR SLI+F++A ++ +S + +   G   +  ++   +YMGF+  C
Sbjct: 254 KSGRVSLIVFMVAAILALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 300


>gi|340508148|gb|EGR33919.1| hypothetical protein IMG5_031030 [Ichthyophthirius multifiliis]
          Length = 434

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 160/356 (44%), Gaps = 66/356 (18%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           P+L +++ +  + AT IS C+++G  ++     L+ +HP    P+IDY++ ++I PM++L
Sbjct: 55  PLLMILMNYQQRKATFISYCIVLGGCLANSLLILRKQHPLKKKPLIDYNIIMIINPMVIL 114

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           G +IG+  NVIF + +V+ +L ++F+ T +    K      R+  L+K     + S+   
Sbjct: 115 GTNIGIILNVIFPE-IVSGVLFIIFLCTVSPYLFKKESQLSRDNDLEKVNNSYIISD-VK 172

Query: 227 AGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWV--------------- 271
              +    + +      +   M+ E          +L +L+FV+V               
Sbjct: 173 VDNIAQSQIKNNDPGELKCFLMQEE----RQYPLNKLLILMFVFVSIQFLIFLRGGKGVG 228

Query: 272 AFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASK--------- 322
           +F+G++I       CS  YW+L    +  SLVVS +  I ++  R  I  K         
Sbjct: 229 SFIGIKI-------CSNSYWLLSAGILVYSLVVSYF--IKIFISRNEIQKKMIFQKYGLE 279

Query: 323 ---------GDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVS 373
                     D+ K F ++      A G+L+G + G  G G   ++ P+F+   +PP + 
Sbjct: 280 EYFKDDFDISDNKKYFIIW------ASGLLSGCLSGTFGSGAALLLMPVFISYQLPPIIG 333

Query: 374 SATATFAMTFSSSMSVV----EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
           SA   F   F +  S++    E YL          Y V + +  AF+G  V  +++
Sbjct: 334 SAVCGFNYFFIACASIISVFSEQYL--------TAYEVIIYSFLAFLGGFVCARIL 381


>gi|146183328|ref|XP_001025893.2| hypothetical protein TTHERM_00713420 [Tetrahymena thermophila]
 gi|146143638|gb|EAS05648.2| hypothetical protein TTHERM_00713420 [Tetrahymena thermophila
           SB210]
          Length = 505

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 86/389 (22%), Positives = 164/389 (42%), Gaps = 68/389 (17%)

Query: 70  VWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIM 129
           V  +L FG   ++  ++        +VGG+GGG + VP+L +++ F  K AT +S  +++
Sbjct: 34  VHQDLTFGVVEIISYVLISIIVGLANVGGLGGGIVKVPILVILLNFSVKEATFLSYPILL 93

Query: 130 GAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIV 189
           G  +S     +  RHP  D PIID+DL L++ P ++LG  +G+  NVI ++ ++T + ++
Sbjct: 94  GGVLSNAILLISQRHPRKDKPIIDFDLVLILVPTVLLGTVVGILMNVIISEIILTSVFML 153

Query: 190 LFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGA------------------GEVE 231
                    F+K  +  +++   K+E    +  N                       EV+
Sbjct: 154 FMCLVCVYLFMKARDIQQKQQEDKEEQDSSIQDNNEQNQNKLSKSTIQLVKQNSYLSEVD 213

Query: 232 YKSL--------------------PSGPRSGPQKDA-----MEPEVTILEN--------- 257
            K +                            ++D+      E +   +EN         
Sbjct: 214 QKKIEQIDENCQKELGIQYYQVVSEDSQEQNSEEDSKGVSNKEKQKNKIENQLLAEFLEQ 273

Query: 258 ----INWKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLDLLQIPVSLVVS 305
               +   +L  LV +++ F  + I+K             C  +Y++L  LQ+  S++  
Sbjct: 274 EKKMLPLDKLFYLVLIFLVFTFIGISKGGKGFQSIFGIQKCDNLYYLLTALQLISSIIFM 333

Query: 306 LY---EAISLYKGRRVIASKGD-DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP 361
            +   +   L++ +  I  + D +   F  +        G+ AG + G+LG+G G I+ P
Sbjct: 334 FFIYLQQKRLHEYKISINYQFDREDFYFSNYNFFILSLSGLAAGSITGMLGMGSGLIILP 393

Query: 362 LFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + L LG   +V S+T+ F   F    S++
Sbjct: 394 VLLSLGCHTRVCSSTSGFMYLFIGGTSII 422


>gi|325181581|emb|CCA16031.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 421

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 158/335 (47%), Gaps = 52/335 (15%)

Query: 82  LGTIIG-FFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           L  IIG F  AA  S  G+GGGG+ VP+  L     P+ A  +SK +I GA++S +   L
Sbjct: 95  LSGIIGTFLSAAIASGCGLGGGGLLVPLYILTQHLSPQKAIPLSKAVIFGASISGLIVTL 154

Query: 141 KLRHPTL-DMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAF 199
           + +HP +    II++D  LL++PM + G  IGV    I  D++VT+LL            
Sbjct: 155 RRKHPFVSSRSIINFDAVLLMEPMTLAGTVIGVTMITILPDYIVTILL------------ 202

Query: 200 LKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENIN 259
                   +  IL+         +   + + + +SL     +      M  +       N
Sbjct: 203 -------GQAQILR------YSHHQFASDDCQDQSLTDQSLNLSDGGIMNRQKIQTSQRN 249

Query: 260 -WKELGLLVFVWVAFLGLQIAKNHTASCSIV--------YWVLDLLQIPVSLVVSLYEAI 310
              ++ ++   W++ + + + K    + SIV        YW +      ++ +  ++ AI
Sbjct: 250 LLCDMAVISLTWLSLIVISLVKGGHGAASIVGIGCGSYSYWAV------ITFMPYVFCAI 303

Query: 311 SLYKGRRVIASKG-DDGKSFRVFQ---------LVSYCAFGVLAGIVGGLLGLGGGFIMG 360
           + Y  +R++  +   +   +R  +         +V Y  F  LAG+  G+LG+GGG + G
Sbjct: 304 TAYFAQRILKQRAFLESYDYRYAEGEMRWDRRGVVRYPIFCSLAGVAAGMLGIGGGMVKG 363

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLL 395
           P+ LE+   PQV+SAT++  + F+SS +V++  L+
Sbjct: 364 PILLEIRFHPQVASATSSTMIFFTSSTTVIQEALI 398


>gi|325193669|emb|CCA27930.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 591

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 54/148 (36%), Positives = 83/148 (56%), Gaps = 7/148 (4%)

Query: 330 RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSV 389
           +  +L  YC   V+AGI  GLLG+GGG + GP+ LE+GV P V  ATA F + F+S+ + 
Sbjct: 444 KTTRLPFYC---VIAGITAGLLGIGGGMVKGPIMLEMGVLPMVQVATANFMILFTSASTT 500

Query: 390 VEYYLLKRFP----VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVS 445
           +++ +  +FP      Y  +F     + A+ GQ VV  L+    R S++++ILA MI VS
Sbjct: 501 LQFAIAGQFPGSLQYDYVGWFACAGFVGAYCGQSVVAFLLKKYNRESMLVYILAVMIGVS 560

Query: 446 AISLGGVGISNMIGKFHRHEYMGFENLC 473
           A  +G VG   +  +     ++GF   C
Sbjct: 561 AFCMGIVGFQIVENEIALRMHLGFSGSC 588



 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 49/145 (33%), Positives = 83/145 (57%), Gaps = 6/145 (4%)

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F    ++ TI+     A  +  G+GGGG+ VP   L+IG  PK A  +SK  I+G +V+ 
Sbjct: 118 FAKTDLIATIMTLLITALSAGCGIGGGGLLVPSFILVIGLSPKHAIPLSKATILGTSVAN 177

Query: 136 VYYNLKLRHP-TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT 194
            ++N   RHP   ++PII+Y +A +++P  ++G  IGV  N +  +W++ +LLI L    
Sbjct: 178 FWFNYHRRHPMKNNVPIINYAMAAIMEPPTLIGAVIGVMLNHVVPNWLIFLLLISLLTSI 237

Query: 195 STKAFLKG-----VETWKRETILKK 214
           + + F+KG      ET +R+ ++K 
Sbjct: 238 TLRTFIKGNRLREKETKRRQALIKN 262


>gi|294944275|ref|XP_002784174.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897208|gb|EER15970.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 393

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 98/374 (26%), Positives = 171/374 (45%), Gaps = 61/374 (16%)

Query: 91  AAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMP 150
           A   S  G+GGG +F P+L L+   D K A+A S+ +I  + +++   N   +      P
Sbjct: 20  ALLASPAGIGGGILFTPVLHLVGELDSKEASATSQALIAASQLASCIINFWTQWHEPKKP 79

Query: 151 IIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET 210
           +I     +++ P ++ G  IGV  N+I    ++ +L +++   ++ K  LKGV+ ++ E 
Sbjct: 80  LIILPYVIIMLPCIVAGAVIGVYLNMILPQLIILILYVLVAAFSTIKTTLKGVKQYRSEN 139

Query: 211 ILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENIN----WKELGLL 266
             KK +              E++S    P S  QK  +  E    + ++         +L
Sbjct: 140 ATKKASK-------------EHES----PSSASQKTIVTLEEAKEKKVDPFLVMPSRKVL 182

Query: 267 VFVW-VAFL------------GLQIAKN--HTASCSIVYWVLDLLQIPVSLVVSLYEAIS 311
            F W  AFL            G   AK+      C  VYW L  L+I + L +       
Sbjct: 183 FFYWTTAFLIWVLCLIFPLLRGSSTAKSIAPVPYCGGVYWFLAALEIALLLGI------- 235

Query: 312 LYKGRRVIASKGDDGKSFRVFQLVSYCAFG---VLAGIVGGLLGLGGGFIMGPLFLELGV 368
                    S G      +V +LV  C  G   V  GI+  ++G+GGG +M P+ L+ G+
Sbjct: 236 ---------SSGFIFAKRKVCRLVQCCILGVEIVFTGIISSMVGIGGGILMNPIILDFGL 286

Query: 369 PPQVSSATAT---FAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
            PQ  +AT     FAM+ S+++S   Y +   FP    ++ V +  +   +G+ V+++++
Sbjct: 287 NPQQGTATNAINIFAMSTSTALS---YGMSGYFPGGSDMWIVVLPFVGGIIGKLVLKQIV 343

Query: 426 ILLGRASLIIFILA 439
              GR S+++F+LA
Sbjct: 344 AKTGRMSVLVFLLA 357


>gi|219112005|ref|XP_002177754.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410639|gb|EEC50568.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 629

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 92/385 (23%), Positives = 176/385 (45%), Gaps = 51/385 (13%)

Query: 111 LIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISI 170
           L++G  P  A  I+   + G A+++   N++ RHP  D PIID+DL L+++P+ ++G  +
Sbjct: 220 LVMGLHPHYAIPIASVTVFGGALASTIVNMQRRHPLADRPIIDWDLVLMMEPLTLIGTLL 279

Query: 171 GVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET-----ILKKEAARCLGSNGA 225
           G  F+ I ++ ++ VLL++L   T+     K +  ++ E      ++  +A    G+   
Sbjct: 280 GTLFHRILSEKILIVLLVLLLSITAHSTLSKAMRMYEAEKRYIRHLIAAQADSPRGNPSL 339

Query: 226 GA-----GEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELG---------------- 264
           G      G+ +     +G +   +  A E +  ++ N +++ +                 
Sbjct: 340 GGYVLPFGDEDDSRADTGCKEEARMAAEERQRILILNPDFRTMKTDLLEQEKVTPRSKII 399

Query: 265 --LLVFVWVAFLGLQIAKNHTAS-----C-SIVYWVLDLLQIPVSLVVSLYEAISLYKGR 316
               +F  + FL L +      S     C S  +WV+ ++ I   L+ S + A +    R
Sbjct: 400 ALCCMFSVLIFLNLMVGGGSFDSPWDIKCGSTAFWVVHVVMIAF-LMSSAWMAQTYLIAR 458

Query: 317 RVIAS-------KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVP 369
             I          GD     R    + Y A   +AG+  G+ G+GGG ++ PL L  GV 
Sbjct: 459 HEIKDMVRFDYVHGDIKWDTRTS--IIYPAVFTIAGVFAGMFGIGGGVVIVPLLLHSGVH 516

Query: 370 PQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIIL-- 427
           P V+SAT++  + F+S  SV  Y++       +A+    +  I++ +GQ ++R++     
Sbjct: 517 PGVASATSSAMILFTSLASVSTYFVFGLIVADFAMAGFVIGFISSTLGQILMRRVRQAKS 576

Query: 428 -----LGRASLIIFILAFMIFVSAI 447
                  R S + F++  ++ VSA+
Sbjct: 577 ASGRKFERNSYLAFVIGGVVLVSAL 601


>gi|403353250|gb|EJY76161.1| hypothetical protein OXYTRI_02333 [Oxytricha trifallax]
          Length = 548

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 110/476 (23%), Positives = 183/476 (38%), Gaps = 102/476 (21%)

Query: 95  SVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDY 154
           +V G+GGGG  VP+L     F+ K A AIS   I   AV+   +N + RHP+ D   IDY
Sbjct: 78  TVAGIGGGGDLVPLLMSFFEFEAKEAVAISGFAIFLCAVTRYIFNFRQRHPSKDSVQIDY 137

Query: 155 DLA--------------LLIQ---PMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTK 197
            LA              +LI    P+L L I++ +    +F +     ++I      + +
Sbjct: 138 GLASVMMPTVFIGSFLGVLINIVFPVLYLQIALTLLLCFLFYECTKKAVVIFRKENQAQQ 197

Query: 198 AFLKGVETWK---------------RETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSG 242
             L  V+  K                E +L K   +    N       +  S+ +   + 
Sbjct: 198 PTLSQVQGIKVDENQITSYQNNENINEKLLTKSFTKDSKKNSKIKRSSQRASILAEDYAE 257

Query: 243 PQKDAM--------------EPEVTILEN------------------------------- 257
            QKDA                 E  ILEN                               
Sbjct: 258 YQKDATMTSINSDEIMDLNNNIEQKILENQPLVPQSPLIKFSFSTSRDLIKVDKLLAREK 317

Query: 258 --INWKELGLLVFVWVA------FLGLQIAKN--HTASCSIVYWVLDLLQIPVSLVVSLY 307
               W + G+   V +       F G +  K+     +CS+  W+     I     +S+ 
Sbjct: 318 THYQWDKQGICAIVLIVEVLVSLFRGSKKTKSIIDIQTCSVWDWLCFAFFIVFCFFMSMV 377

Query: 308 EAISLYKGRRVIASKGD---------DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFI 358
              +L K +++    G          +GK      LVS      + G+V G  G+GGG I
Sbjct: 378 AVNNLKKQQQLKMKSGRGLHPSDIKFEGKKVTKLILVSS-----IGGLVSGAFGIGGGTI 432

Query: 359 MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQ 418
             PL L +G PP V+S+T  + + FS+  S + Y +     V +  +       ++ +G 
Sbjct: 433 YNPLLLSMGAPPSVASSTGMYMVMFSNFGSSITYIVYGTLNVKFGFWIGGFCCASSILGL 492

Query: 419 HVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYM-GFENLC 473
            ++ K++  L R S I+ ILA ++ +SA+ +   G  +M+ +    + +  F +LC
Sbjct: 493 FLLNKIVKKLNRQSPIVIILAVVMGISALLVPIFGSIDMVKQVRDGQSITQFSSLC 548


>gi|298710140|emb|CBJ31852.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 378

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 67/107 (62%), Gaps = 2/107 (1%)

Query: 105 FVPMLSLIIGFDPKSATAISKCMIMG-AAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
            VP+  L++GFDPK A  +S   I G A  +TV  NL  RHP  D P++D+DL L+++P+
Sbjct: 146 LVPLYILVLGFDPKHAIPLSNITIFGGAITNTV-LNLSKRHPAADRPLVDWDLILVMEPL 204

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET 210
            + G  +G   N +  DW++ ++LIVL   T+ +   KG++++ +ET
Sbjct: 205 TIGGALVGSFINKVLPDWILAIMLIVLLAATANRTLRKGIKSYNKET 251


>gi|384249010|gb|EIE22493.1| hypothetical protein COCSUDRAFT_47913 [Coccomyxa subellipsoidea
           C-169]
          Length = 628

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 63/234 (26%), Positives = 112/234 (47%), Gaps = 14/234 (5%)

Query: 246 DAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVS 305
           +A EP+V  L+      L LL  ++   +     K+ T   +  YW++ L  +PV L+++
Sbjct: 399 EAAEPQVPPLK------LALLFLMFACIVASDSFKDRTVCGTWQYWLVVLSVLPVILIIT 452

Query: 306 LY------EAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIM 359
           L+         +  +    + ++GD   S R   L  + A   LAG++ G+ G+GGG + 
Sbjct: 453 LFVRAYLVRDFNAKQASGYVWTEGDVEWSRRNTLL--FPALSSLAGLIAGMFGVGGGIVK 510

Query: 360 GPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQH 419
           GPL LE+GV P V++AT+   + F+++ + V Y      P  Y L    V  I   VGQ 
Sbjct: 511 GPLMLEMGVLPDVAAATSATMIMFTAASASVVYLSFGGIPFDYGLATFLVGLIFTMVGQV 570

Query: 420 VVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
               L+  L R S+++  +A ++ +S + +    + + I     H  + F  +C
Sbjct: 571 TCYWLMKALDRRSVVVIAMALLMVISMVIIYYEAVVSTIAAVQDHRLLDFGTIC 624



 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 67/208 (32%), Positives = 101/208 (48%), Gaps = 20/208 (9%)

Query: 52  LLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSL 111
           L    N L   +   +  +WP     W  V+G I         + GG+GGGGI VP+  L
Sbjct: 14  LFGECNRLGNHKDDPHKGLWP---LDWRDVVGYIFAAVSLFIAAGGGIGGGGILVPLYIL 70

Query: 112 IIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIG 171
           ++GF   +A A+S   I+G A+S   +N+  RH   + P+ID+++ L ++P  +LG  +G
Sbjct: 71  VMGFPTNTAVALSNITIVGGAISNFVFNVGRRHAFFNRPLIDWNIILAMEPATILGALLG 130

Query: 172 VAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVE 231
              N    +WM T+LL  L    + K   + V TW++E +  K AA         AG  +
Sbjct: 131 GYLNKATPNWMTTILLAALLTLLTYKLVDRAVVTWRKENLEFKRAA---------AGSSQ 181

Query: 232 YKSLPSGP--RSGPQK------DAMEPE 251
             S PS P  R GPQ+      +A  PE
Sbjct: 182 DGSDPSEPLLRKGPQEQQEILNEAFAPE 209


>gi|440800424|gb|ELR21463.1| hypothetical protein ACA1_184090 [Acanthamoeba castellanii str.
           Neff]
          Length = 512

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 110/201 (54%), Gaps = 18/201 (8%)

Query: 286 CSIVYWVLDLLQIPVSL---VVSLYEAISLYKGRRVIA---SKGDDGKSFRVFQLVSYCA 339
           CS+ +W+L  +  P+ L   V+S   A  +Y  R+       +GD   S +   L+ +  
Sbjct: 308 CSMYFWILWGVMFPIMLGFMVLSCIIARLIYSYRKRNGWPFIEGDVQWSVKSLFLIPFA- 366

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
            G + G   G LG+G G + GP+ LE+G+ P+V++AT++F + F++  +V +Y+++    
Sbjct: 367 -GTIGGTAAGFLGIGSGMVNGPVMLEIGMTPEVATATSSFIIVFTALSTVSQYFIIGALN 425

Query: 400 VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL---GGVGISN 456
              AL+F  +  +AA VGQ+ V+ ++    ++S+I F+LAF+I  S +++   G + I++
Sbjct: 426 WQPALWFFVLGVLAAVVGQYGVQYVVKRFNKSSIISFLLAFVIAGSGVAMIVTGALQIAD 485

Query: 457 MIGKFHRHEYMGFENLCKYDR 477
                      GF +LC+ + 
Sbjct: 486 -------EGITGFADLCELNH 499



 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 57/147 (38%), Positives = 90/147 (61%), Gaps = 2/147 (1%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           EL FGW + LGT + F G A     G+GGGGI+VP+L LI+G+  K A  +SK  + G A
Sbjct: 3   ELLFGWRLYLGTGLAFVGGALSGAVGIGGGGIYVPVLILIVGYTAKEAIPLSKVTVAGVA 62

Query: 133 VSTVYYNLKLRHPTLD-MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLF 191
           +S+   N+  RHP      ++DYD+A+++ P  +LG ++GV   VI  +W++ +LLI++ 
Sbjct: 63  ISSFLVNVLRRHPRAPWRALVDYDVAMVMTPTTLLGTTVGVLVYVILPEWLILILLILVL 122

Query: 192 IGTSTKAFLKGVETWKRETILKKEAAR 218
                + F+  ++ WK+E +  KEA R
Sbjct: 123 GLVDYRTFVAAIKLWKKEKV-AKEAER 148


>gi|326433498|gb|EGD79068.1| hypothetical protein PTSG_02036 [Salpingoeca sp. ATCC 50818]
          Length = 607

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 78/141 (55%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           +  Y  F ++AG++ G LG+GGG ++ PL L+  + P VSSAT  +   F+S+ S  ++ 
Sbjct: 467 VAKYLFFALVAGMLAGFLGIGGGMVLAPLLLQFNMHPLVSSATTAYMTLFTSAGSFTQFV 526

Query: 394 LLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVG 453
           +L R P  Y +    +A  A+ VGQ ++   +   G +S+I FIL F+I ++AI L   G
Sbjct: 527 ILNRVPYDYGIALFLLAAAASVVGQILLHSYVRRTGNSSVIAFILGFVIGLAAIMLLVSG 586

Query: 454 ISNMIGKFHRHEYMGFENLCK 474
              +     R E  GF+ LC 
Sbjct: 587 SLQLKAAHDRGESFGFKPLCS 607



 Score = 45.1 bits (105), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 64/122 (52%), Gaps = 11/122 (9%)

Query: 110 SLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP---TLDMPIIDYDLALLIQPMLML 166
           S+I  F P  A  +S   I+GA++   ++ ++ RHP        +ID+D  L++ P+ + 
Sbjct: 113 SVITLFPPYYAVPLSSTAIVGASIVQFFFQIRRRHPLPGAQHRRVIDFDTILMLLPLALA 172

Query: 167 GISIGVAFNVIF----ADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGS 222
           G  +G    VIF     DW+V V++I++ + T+ K  +KG E  ++E   +    R +  
Sbjct: 173 GTVVG----VIFNTVSPDWLVLVVVIIVLVFTTFKTLVKGRELRRQEQEARALPRRSIVK 228

Query: 223 NG 224
           +G
Sbjct: 229 HG 230


>gi|118348628|ref|XP_001007789.1| hypothetical protein TTHERM_00069510 [Tetrahymena thermophila]
 gi|89289556|gb|EAR87544.1| hypothetical protein TTHERM_00069510 [Tetrahymena thermophila
           SB210]
          Length = 549

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 94/179 (52%), Gaps = 7/179 (3%)

Query: 57  NFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFD 116
           NF    +   Y+ +WP   F  ++++  +I F      +VGG+GGG I VP++++++ + 
Sbjct: 15  NFHCSNKVCVYNDLWPPSAF--QVIVYILIPFI-IGVSNVGGLGGGIIKVPLITVMLNYP 71

Query: 117 PKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNV 176
            K AT IS C++ G++V      +  +HP  + PIIDY++ L+I PM++LG + G+  NV
Sbjct: 72  SKVATFISYCILFGSSVVHSTIIIFKKHPLFNKPIIDYNIVLVINPMVLLGTNAGILLNV 131

Query: 177 IFADWMVTVLLIVLFIGTSTKAFLKGVE----TWKRETILKKEAARCLGSNGAGAGEVE 231
           +  + +  V++ V     +     K +     T K++  ++ EA           GEV+
Sbjct: 132 LMPEIVAGVIICVYLSLIAPYILFKAISLYKITKKQQQQIEPEAKALETVERKNEGEVQ 190



 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 96/223 (43%), Gaps = 27/223 (12%)

Query: 226 GAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKE-------LGLLVFVWVAFLGLQI 278
           G+  V   SL   P    Q+  M  E+    +  +++       L +++F  + ++  Q 
Sbjct: 283 GSIIVSQNSLKETPEEPKQEQIMTEELKAFYDEEYQQFPKKKILLLVIIFCIIQYIVFQR 342

Query: 279 AKNHTAS------CSIVYWVLD---LLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSF 329
                 S      CS  YWV +   L+    ++ V  Y  +   K +  I  K +  K F
Sbjct: 343 GGKGLQSFVGIKTCSASYWVSNGAILVLCVAAIFVIRYYLLKWTKNKNEIIKKYNLQKEF 402

Query: 330 R-VFQLVSYCAF------GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMT 382
              F +++   +      G+ AG+V G +G+G G  + PL L +GV PQV +AT  F   
Sbjct: 403 EGDFNVLNKTHYFVVLLAGLAAGLVAGTVGVGAGLTLVPLLLSIGVHPQVVAATCGFNYL 462

Query: 383 FSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
           F ++ ++++ +          + F    ++ +FVG  ++ K I
Sbjct: 463 FIATTTIIQVFTSHYLSYAQIVLF----SLLSFVGGFIIAKCI 501


>gi|403339320|gb|EJY68918.1| hypothetical protein OXYTRI_10465 [Oxytricha trifallax]
          Length = 539

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 107/426 (25%), Positives = 183/426 (42%), Gaps = 70/426 (16%)

Query: 115 FDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMP---IIDYDLALLIQPMLMLGISIG 171
           FD K+A AIS   I   +V+   Y L  +HP  D     II+Y+LA+++ P +M+G   G
Sbjct: 118 FDTKNAIAISNFAIFTCSVTRYIYTLDKKHPHPDKKQNVIIEYNLAIVMLPTVMMGSLTG 177

Query: 172 VAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETI-LKKEAARC----------- 219
           V  N+IF    +  +L  L I  S ++ +KG + +++ETI  ++EA +            
Sbjct: 178 VFLNIIFPAIALQAILTALLIFLSLQSLMKGKDMYRKETIKFQQEAEKLKKKQDEERLMM 237

Query: 220 -----LGSNGAGA-------GEVEYKSLPSGPRS--------------GPQKDAMEPEVT 253
                L ++  G         E + K   + P+               G   +  + E+ 
Sbjct: 238 EKILKLKTDSQGEIIRESPIDEEQLKKEGTKPQQIQNFEIEEGQPLTKGLINNVNQEELQ 297

Query: 254 ILENINWKELG----------LLVFVWVAFLGLQIAKNHTAS------CSIVYW-VLDLL 296
            LE I  KE G          LL+F  +    L        S      CS++ W VL   
Sbjct: 298 QLERILEKEKGHKQWDKHLTCLLIFSLLVSTNLLRGNKSLDSIIGIERCSLLDWMVLFTF 357

Query: 297 QIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCA-----FGVLAGIVGGLL 351
            I  S +  L   I   +  +++  K   G +    +L          F  + G + G L
Sbjct: 358 YILCSSITFL--GIRRVRKEQILKEKYKRGLADCDIRLTPRNTLRLQIFSFVGGWISGAL 415

Query: 352 GLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVAT 411
           GLGGG I  P+ + LG PP V++AT+ + ++FSS+ S   Y +     +P++++   +  
Sbjct: 416 GLGGGAIFNPILIGLGTPPAVATATSMYMISFSSAGSTATYIIYGLINLPFSIWVGVIGC 475

Query: 412 IAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL---GGVGISNMIGKFHRHEYMG 468
             A  G  +   +     R S I+F+LA ++  SA+ +   GG+ +  +I +    +   
Sbjct: 476 FGATGGLALFNVVTKKYNRQSFIVFVLAGVMGASALLVPIFGGLDLMKLIDR--GEDIFK 533

Query: 469 FENLCK 474
             ++CK
Sbjct: 534 MNSICK 539


>gi|118348622|ref|XP_001007786.1| hypothetical protein TTHERM_00069480 [Tetrahymena thermophila]
 gi|89289553|gb|EAR87541.1| hypothetical protein TTHERM_00069480 [Tetrahymena thermophila
           SB210]
          Length = 575

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 66/113 (58%), Gaps = 4/113 (3%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PML L++ +  K AT IS C++ G+ ++     +  +HP LD PIIDY++ L+I PM++L
Sbjct: 63  PMLMLMLNYQTKVATFISYCILFGSCLANSTLLIFKKHPLLDKPIIDYNIVLMINPMVLL 122

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARC 219
           G +IG+  N++  +    +L I   I  S   F KG+  ++    LKKE  +C
Sbjct: 123 GTNIGIFLNILLPEIAAGILFICFLILISPYMFKKGLNLYR----LKKEQQKC 171



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/182 (23%), Positives = 82/182 (45%), Gaps = 21/182 (11%)

Query: 284 ASCSIVYWVLD----LLQIPVSLVVSLYEAISLYKGRRVIASK----GDDGKSFRVFQLV 335
            +CS  YWV +    LL    + V+  +      K +R+   K     +      V   +
Sbjct: 374 TTCSTAYWVTNGGIVLLAFVAAFVIRFFLQ-KWEKNKRIFIEKYHLEDEFASDLDVNNNM 432

Query: 336 SYCAF---GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           +Y      G+ AG++ G  G+G G  + P+ L  GV PQV++AT  F   F S+ ++++ 
Sbjct: 433 NYIKISLAGLTAGMLAGTFGVGAGLALVPILLASGVHPQVAAATCGFNYFFISTTTIIQV 492

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIF---------ILAFMIF 443
           +      +   + F  ++ I  F+   ++ + +     + LI+F         ILAF+++
Sbjct: 493 FTNDYLNISQIILFSILSFIGGFLCAKLIYRYVEKKKASYLIVFIVFGLAILNILAFIVY 552

Query: 444 VS 445
           ++
Sbjct: 553 LT 554


>gi|85002083|gb|ABC68446.1| unknown [Tragopogon pratensis]
          Length = 134

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 69/135 (51%), Positives = 84/135 (62%), Gaps = 8/135 (5%)

Query: 263 LGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASK 322
           L LL  VW AFL LQI KN    CS  YWVL + Q P +LVV  +E + LYK  +     
Sbjct: 1   LMLLFAVWAAFLVLQIIKNELVVCSTWYWVLTVAQFPATLVVFGFECVKLYKESKKRKRA 60

Query: 323 GDDGK------SFRVFQLVSYCAFG-VLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSA 375
           G+          + +  L+ +CA   +L G VGGLLG GGGFI+GPL LE+GV PQV+SA
Sbjct: 61  GNTESVCEASIEWTIPSLI-FCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASA 119

Query: 376 TATFAMTFSSSMSVV 390
           TATF M FSSS+SVV
Sbjct: 120 TATFVMMFSSSLSVV 134


>gi|294944337|ref|XP_002784205.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897239|gb|EER16001.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 173/366 (47%), Gaps = 43/366 (11%)

Query: 98  GVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDM-PIIDYDL 156
           GVGGG +FVP+L+L+     K+ATA+S+ +IM A V+ V ++L  ++ +     +I+   
Sbjct: 27  GVGGGILFVPLLNLVGRLPSKNATAMSQGLIMSATVAKVLFSLHAQYTSRRRGRVINLPY 86

Query: 157 ALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT----STKAFLKGVETWKRETIL 212
            +L+ P +++G  IG+        W+  ++ ++L++ T    S  + LKG   W+ ET  
Sbjct: 87  VVLMLPCMIVGGLIGIYIY----SWLPVLIQLILYVITALFGSAMSLLKGFHLWRAETSE 142

Query: 213 KKEAARCLGSNGAGAGEVEYKS-----LPSGPRSGPQKDAMEPEVTILENINWKELGLLV 267
           K++A R    +  G  +V   S     +P   R   ++ A+   +TI            V
Sbjct: 143 KEKAKRDAEVSAGGTLKVPASSTVETVVPPLLRPITRRKAVACVITIFA----------V 192

Query: 268 FVWVAFLGLQIAKNHTAS------CSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIAS 321
           ++ V    L +  + T S      C  +YW L ++ + V L+V L  A+       +  S
Sbjct: 193 WILVILSRLILGSSSTRSIVGISYCEGLYWALSVVVVVVLLMVPLAYAL-------IDRS 245

Query: 322 KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 381
            G    +  +    S    G LA +VG    + GG I+ PL +  G+ P  +S T +  +
Sbjct: 246 PGSSKAALTLSG--SLLGIGFLAAVVG----ISGGIIITPLVMFTGLTPPQASGTGSVVI 299

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
             +SS   + + L    P   AL+ +A+    A  G  ++ +++   GR S++ F+L+ +
Sbjct: 300 LVNSSSLALSFGLGGYLPDASALWIIALPFCGALTGDIILTRIMRRTGRTSILAFLLSSL 359

Query: 442 IFVSAI 447
             + AI
Sbjct: 360 AALGAI 365


>gi|299115466|emb|CBN75630.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 236

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 75/119 (63%), Gaps = 2/119 (1%)

Query: 103 GIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQP 162
           GI VP+  L++ F PK A  +S   I G A++  + N+K RHP  D P++D+DL L+++P
Sbjct: 36  GILVPIFILVLRFGPKYAVPLSNITIFGGAITNTFLNMKKRHPLADRPLVDWDLILVMEP 95

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLG 221
           + + G  +G     +  + ++T+ +++L + T+ + F KG++ +K+E+ L++  A+ LG
Sbjct: 96  LTIGGALVGSFIQKVLPEVVLTLSMVLLLVATADRTFRKGLKAFKKESSLQQ--AKELG 152


>gi|356533449|ref|XP_003535276.1| PREDICTED: uncharacterized protein LOC100806242 [Glycine max]
          Length = 408

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 90/134 (67%), Gaps = 2/134 (1%)

Query: 278 IAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSY 337
           + +N+T +CS +YW+++LLQ+P+++  + YE + LYKG+ VIASKGD    + V QL+ Y
Sbjct: 204 VKQNYTTNCSALYWIMNLLQVPITIGTTFYEVVLLYKGQSVIASKGDQQTRWHVQQLILY 263

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVE--YYLL 395
           C  G+++GI+GGLLGLGGGFI+GPLF+ LG+ PQ+ + T       SS+ +      +LL
Sbjct: 264 CMCGIISGIIGGLLGLGGGFILGPLFIGLGIHPQLKTNTKPHPWLLSSTHTQPHPPKFLL 323

Query: 396 KRFPVPYALYFVAV 409
            + P P +   + V
Sbjct: 324 YQSPNPKSSRLITV 337


>gi|302828096|ref|XP_002945615.1| hypothetical protein VOLCADRAFT_85767 [Volvox carteri f. nagariensis]
 gi|300268430|gb|EFJ52610.1| hypothetical protein VOLCADRAFT_85767 [Volvox carteri f. nagariensis]
          Length = 1018

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 68/118 (57%), Gaps = 1/118 (0%)

Query: 357  FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFV 416
             +MGPL L++GV PQV++A++   + FSSS +++++ LL R    YAL F A + +A  V
Sbjct: 897  MVMGPLLLQIGVHPQVTAASSGAMVLFSSSAALIQFVLLHRLNTDYALVFGAASLVAGLV 956

Query: 417  GQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
            G   V   I   GR S+++  LA ++ +  + +   G+ N  G+  R + +GF  +C 
Sbjct: 957  GTQTVSGAIKRSGRPSIVVLALAGVMGIGTVCVAAFGLRNAAGQLRRGD-LGFAGICS 1013



 Score = 47.8 bits (112), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 59/94 (62%), Gaps = 1/94 (1%)

Query: 118 KSATAISKCMIMGAAVSTVYYNLKLRHPTL-DMPIIDYDLALLIQPMLMLGISIGVAFNV 176
           K +TA+S+  I  ++++ V  NL   HP++ + P+ID+ L LL+ P+L++G+ IGV  NV
Sbjct: 11  KPSTALSQACITASSLAAVVSNLPRTHPSVPEAPLIDFPLILLLTPVLLVGVGIGVLLNV 70

Query: 177 IFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET 210
               W++ +LL+VL +    +A  KG   W +E+
Sbjct: 71  ALPSWLLNLLLLVLLLLLLAQAIAKGKALWAQES 104


>gi|301118272|ref|XP_002906864.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108213|gb|EEY66265.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 439

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 81/375 (21%), Positives = 162/375 (43%), Gaps = 31/375 (8%)

Query: 99  VGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLAL 158
           +GGG +   +  L++   P  A  ++   + G AV   + NL  +    + P+I++D  L
Sbjct: 81  IGGGAVLDAVYILVLKLGPDEAIPLASITVFGGAVCDFFLNLWKKPINSNFPLINWDFML 140

Query: 159 LIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAAR 218
           ++QPML++G + G +    F+ W++T+ LIV  +    KAF K         +  +E  R
Sbjct: 141 IMQPMLLMGAAFGASMISWFSTWLLTIALIVYLVYVGKKAFKKA------RAVGHEEGWR 194

Query: 219 CLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVA---FLG 275
              S+       E  SL   P    Q D  +        ++W++LG+   ++ A      
Sbjct: 195 WCSSS-------ETMSLLGAPSMSFQDD--DGSFQYKSGLSWRKLGINFGIFTATVLLTA 245

Query: 276 LQIAKNHTA-----SCSIVYWVLDLLQIPVSLVVSLYE---AISLYKGR---RVIASKGD 324
           LQ  +   +       S  + ++ +L      VVS Y+   A++ Y+ +   R I +  +
Sbjct: 246 LQGGRYFPSPLGIPPTSFFFLIVSMLPFIFLSVVSHYQMKDAVATYQRQQNPRFILAPNE 305

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
              S    + +     G+  G + G  G+GG      L   +   P   SA +  A+ F 
Sbjct: 306 VQWSPDAIRKIPLRLLGI--GAIAGAFGVGGEGATSSLLRGVNFTPAAVSAMSATAVFFV 363

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           S M+  ++ L  +  +  A + + +  +   +G+  + +++      +L++F +A  +F+
Sbjct: 364 SGMASFDFLLWGKLDLNLAKFLMPLGFMMTLLGRLCLIRIVRKAKSRTLLLFAIAAAMFI 423

Query: 445 SAISLGGVGISNMIG 459
           S   L  + +  + G
Sbjct: 424 SIFPLAFMELRGLFG 438


>gi|326433239|gb|EGD78809.1| hypothetical protein PTSG_01784 [Salpingoeca sp. ATCC 50818]
          Length = 589

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/165 (33%), Positives = 90/165 (54%), Gaps = 7/165 (4%)

Query: 68  HHVWPELK--FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISK 125
           HHV   +K    W+  + +I  F  A+     G+GGG +FV +  +I+G D  +A  +SK
Sbjct: 50  HHVTALIKMVLAWQWAVASICVFLCASLAVGAGIGGGALFVGIYMIILGMDAHAAVPLSK 109

Query: 126 CMIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVT 184
             I G A++    NL  RHP +   P+IDYD AL+++PM +LG  +GV  NV+F +W+V 
Sbjct: 110 ATIFGLAIAAYSVNLWKRHPHSQQRPLIDYDTALMLEPMTLLGAIVGVLLNVLFPNWLVL 169

Query: 185 VLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGE 229
           + L +L +  S +   KG+    +E    K     + +N  G+G+
Sbjct: 170 LPLCLLLMVVSYRTIRKGLRLRAKE----KGTPHQVLTNRRGSGD 210



 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 100/206 (48%), Gaps = 20/206 (9%)

Query: 256 ENINWKELGLLVFVWVAFLGLQI----AKNHTASCSIVYWVLDLLQIPVSLVVSLYEAIS 311
             + W++L LL+ VW+ +  + +    A +    CS  + VL L  IP  + ++ Y A  
Sbjct: 358 RTVPWEKLVLLILVWLGYTTITMLLYEANDVIKPCSAGWIVLLLCAIPYVIAIT-YFAGR 416

Query: 312 LYKGRRVIASK-------GD---DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP 361
           + K + V   K       GD   +G +   F      A    AG+   ++G+GGG I  P
Sbjct: 417 MLKRQTVRKRKCNYPFLPGDVMWEGANLNKFP-----ALAFFAGVAAAMMGIGGGMIKSP 471

Query: 362 LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVV 421
           + L +G+ PQV + T++F + F+SS + ++Y +L +         +++    A VGQ VV
Sbjct: 472 IMLAMGLQPQVVTTTSSFMIIFTSSATTLQYLILGKLKPQQLGIVMSMGFAGAVVGQRVV 531

Query: 422 RKLIILLGRASLIIFILAFMIFVSAI 447
             +I    + S +IF+L  +  VS I
Sbjct: 532 NYIIAKYKKQSFLIFLLGGLTIVSGI 557


>gi|307109812|gb|EFN58049.1| hypothetical protein CHLNCDRAFT_50737 [Chlorella variabilis]
          Length = 752

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/125 (36%), Positives = 76/125 (60%), Gaps = 1/125 (0%)

Query: 91  AAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDM 149
           +A  +  GVGGG +FVP+ S+++    K+ATA+S+ +I G A+ +V ++L  +HP     
Sbjct: 339 SALSNAAGVGGGAVFVPLFSVLLSISIKAATALSQAVITGGAIGSVAFSLTRQHPLRPGA 398

Query: 150 PIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE 209
           P+ID+ LAL + P L+LG++ GV  N+    W+VT+LLI L I  + +    G+   + E
Sbjct: 399 PLIDFSLALTLLPPLLLGVATGVLLNLALPAWLVTILLIPLLITFAIRTAATGLSMRRAE 458

Query: 210 TILKK 214
              ++
Sbjct: 459 KQAQQ 463


>gi|167525260|ref|XP_001746965.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774745|gb|EDQ88372.1| predicted protein [Monosiga brevicollis MX1]
          Length = 499

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 47/123 (38%), Positives = 68/123 (55%), Gaps = 1/123 (0%)

Query: 88  FFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-T 146
           F GAA     G+GGG  FV +  L++G D   A  +SK  I G +++    N+  RHP  
Sbjct: 15  FLGAALNVGAGIGGGATFVAIYFLVLGEDAHGAVPLSKATIFGLSLAAFAVNIWKRHPHD 74

Query: 147 LDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETW 206
              P+IDYD A++++PM +LG  +GV  NVIF +W+V + L +L    S K   K     
Sbjct: 75  PKRPLIDYDTAMMLEPMTLLGGILGVILNVIFPNWLVLLPLCLLLGFISYKTLKKAWNMH 134

Query: 207 KRE 209
           K+E
Sbjct: 135 KKE 137



 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 68/256 (26%), Positives = 114/256 (44%), Gaps = 44/256 (17%)

Query: 240 RSGPQKDAMEPEVTILENIN-----WKELGLLVFVWVAFLGLQI----AKNHTASCSIVY 290
             GP  DAM      LE  +     W  + +L  VW+ +  L      A++    C   +
Sbjct: 236 EDGP--DAMAALRAALERDDHRLAKWDRILMLGAVWIGYFVLTFLLYRAEDAVPRCEAGW 293

Query: 291 WVLDLLQIPVSLVVSLYEAISLYKGRRVIASK--------GD---DGKSFRVFQLVSYCA 339
            VL L  IP  + V+   A  L+  R+ I  K        GD   +G++     L  Y  
Sbjct: 294 VVLLLCSIPYVVGVTYLFARRLH--RQTIQKKAVGYVFHPGDVMWEGRN-----LYYYPE 346

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
               AG+   ++G+GGG I  PL L +G+ PQ ++ T++F + F+SS S  + +L     
Sbjct: 347 MAFFAGLCASMMGIGGGLIKSPLMLSMGLNPQTTTTTSSFMIIFTSSASTFQVWLRALVS 406

Query: 400 VPYALYFVAVATI--------------AAFVGQHVVRKLIILLGRASLIIFILAFMIFVS 445
            P A +++ +  +               A +GQ VV  L+    + SL++F+L  +  +S
Sbjct: 407 TPKAPHYLILGKLHGAELAAVMASGFAGALMGQKVVNHLVQKYQKQSLLMFLLGGLTVLS 466

Query: 446 AISLGGVGISN-MIGK 460
            + L  + I++  IGK
Sbjct: 467 VVILFSLAIADGKIGK 482


>gi|301093159|ref|XP_002997428.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110684|gb|EEY68736.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 591

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 88/151 (58%), Gaps = 7/151 (4%)

Query: 330 RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSV 389
           RV    + C+   +AG+  GLLG+GGG + GP+ LE+G+ P V SATA F + F+SS + 
Sbjct: 444 RVLVFPALCS---MAGVAAGLLGIGGGMVKGPIMLEMGILPPVQSATANFMILFTSSSTT 500

Query: 390 VEYYLLKRFP----VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVS 445
           +++ +  +FP      Y  +F  +  I  F GQ VV  L+    R S+++++LA  I +S
Sbjct: 501 LQFAINGQFPGQLQYDYMAWFALMGCIGGFCGQKVVAYLVKKYRRESIMVYLLAVTIGLS 560

Query: 446 AISLGGVGISNMIGKFHRHEYMGFENLCKYD 476
           A+++G +G+ + +    +  ++GF  +C  +
Sbjct: 561 ALAMGIIGLKSTLRDIEKGVHLGFNGICDSE 591



 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/154 (35%), Positives = 83/154 (53%), Gaps = 1/154 (0%)

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F W+ ++ TII     A GS  GVGGGG+ VPM     G  PK A  +SK  I G AVS 
Sbjct: 109 FTWKDIVATIIALVSTALGSGCGVGGGGLLVPMYIFFYGLSPKHAIPLSKATIFGNAVSA 168

Query: 136 VYYNLKLRHP-TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT 194
            ++N   +HP    +P+I+Y +A +++P  ++G   GV  N +F DW++ VLL+ L    
Sbjct: 169 YFFNFNRKHPMNAKLPLINYQVAGVMEPTTLIGAIFGVMMNHMFPDWLILVLLVSLLSYI 228

Query: 195 STKAFLKGVETWKRETILKKEAARCLGSNGAGAG 228
           + K  LKG    ++E+  ++   + +   G   G
Sbjct: 229 TYKTVLKGNTIREKESRYQRAVVKSVLKGGPEGG 262


>gi|428174009|gb|EKX42907.1| hypothetical protein GUITHDRAFT_110958 [Guillardia theta CCMP2712]
          Length = 197

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 49/125 (39%), Positives = 78/125 (62%)

Query: 86  IGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP 145
           + F  +A G+ GG+GGGGI VP+L  + GF    A  +++  ++GA++  + YN++ R+P
Sbjct: 19  LTFCASALGAAGGIGGGGIMVPLLVSVGGFSVHHAIPLTQATVLGASIMNLIYNVRKRNP 78

Query: 146 TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVET 205
            LD P+IDY+ AL+++   +LG  IGV  N I   W++T+LLIV    T+ +   KG+E 
Sbjct: 79  VLDRPLIDYNTALILEVTTLLGTVIGVDVNKISPVWLITILLIVTLGYTTYRTLRKGLEL 138

Query: 206 WKRET 210
              ET
Sbjct: 139 RAIET 143


>gi|340501283|gb|EGR28084.1| hypothetical protein IMG5_183410 [Ichthyophthirius multifiliis]
          Length = 470

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 73/366 (19%), Positives = 160/366 (43%), Gaps = 47/366 (12%)

Query: 67  YHHVWP--ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
           + ++WP   ++    I++  +IG       +VGG GG  + VP+L L++ +   ++  IS
Sbjct: 24  HKNLWPPNTIEICAYILIPILIGI-----SNVGGQGGSIVRVPLLMLMLNYSQSTSVFIS 78

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVT 184
             ++ G+ +      L  RHP  D+P+I++DL L++ P L++G   G+   ++  +++  
Sbjct: 79  FIILFGSCLPNSLLLLTKRHPFKDIPLINFDLVLILLPNLIVGNIYGILLTLVVPEFITI 138

Query: 185 VLLIVLFIGTSTKAFLKGVETWK-------RETILKKEAARCLGSNGAGAGEVEYKSLP- 236
           +L I+     +   + KG++ +K       +E  L+    + +  +        Y+    
Sbjct: 139 ILFILYLFAITPYFYRKGMKLYKEKKHKDQKEVYLQINLNKTIQRHNINENVNTYQDDNN 198

Query: 237 ------------SGPRSGPQKDAMEPEVTILENINWKELGLLVFVWV---AFLGLQIAKN 281
                         P+S   K     +  +   +  K++  ++  ++     L L+ ++ 
Sbjct: 199 SNISNYNNNIEIQSPQSQKNKQIYIRKKKLKSILPIKKILAIIATFLIIQTILMLRCSQK 258

Query: 282 HT----ASCSIVYWVLDLLQIPVSLVVSLY-------EAISLYKGRRVIASKGDDGKSFR 330
                  + ++ Y++++L    V++ + L+       +++ +   ++      +      
Sbjct: 259 FDYLGIKTYNLYYYLINLFLFIVNIAMYLFFKDRFKTKSLIIQIKKQQQIDLNESQYKDN 318

Query: 331 VFQLVSYCAF------GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
            F L SY  F      G ++G+  GL G+G G  + P  L L + P V++AT  F   F 
Sbjct: 319 EFSLQSYKCFLQIISLGFISGVFAGLFGIGSGLTIVPALLYLKIEPTVAAATNGFITFFL 378

Query: 385 SSMSVV 390
           S  SV+
Sbjct: 379 SLNSVI 384


>gi|110740663|dbj|BAE98434.1| hypothetical protein [Arabidopsis thaliana]
          Length = 210

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 107/202 (52%), Gaps = 20/202 (9%)

Query: 286 CSIVYWVLDLLQIPVSLVVSLYEAIS-----------LYKGRRVIASKGDDGKSFRVFQL 334
           C   YW++   QIP++L  +L+   S               + V   + +DG        
Sbjct: 15  CGNAYWLISSSQIPLTLFFTLWICFSDNVQSQQQSDYHVSVKDVEDLRSNDGAR------ 68

Query: 335 VSYCAFGV---LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVE 391
            + C F V   LAG++GG+ G+GGG ++ PL L++G+ P+V++AT +F + FSS+MS ++
Sbjct: 69  SNKCMFPVMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQ 128

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGG 451
           Y LL       A  F  +  +A+ VG  VV+K+I   GRAS+I+F +  ++ +S + +  
Sbjct: 129 YLLLGMEHTGTASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTS 188

Query: 452 VGISNMIGKFHRHEYMGFENLC 473
            G  ++   +    YMGF+  C
Sbjct: 189 YGALDVWNDYVSGRYMGFKLPC 210


>gi|224014684|ref|XP_002297004.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968384|gb|EED86732.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 522

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 73/167 (43%), Gaps = 47/167 (28%)

Query: 82  LGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLK 141
           +G I+ F  A   +  GVGGGGI+VP+  L++ F PKS++ +S+  I GA++  +  N++
Sbjct: 11  VGIIVWFLTAGIATACGVGGGGIYVPLGILLLRFPPKSSSGLSQASIFGASLGGLIVNIR 70

Query: 142 LRHPTLDM-----------------------------------------------PIIDY 154
            RHP L M                                               P+IDY
Sbjct: 71  NRHPDLYMRDTRGKPLQEGSGKIVSYEKDKGPAAIEEDRKAYLAGGDGKQKFYTRPVIDY 130

Query: 155 DLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLK 201
           D+AL + PM M G  +GV    +F DW+      V+   TS K + K
Sbjct: 131 DMALFLAPMEMAGAVLGVIIQRLFPDWLFLSFAAVVLGFTSYKTYTK 177



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 69/120 (57%), Gaps = 2/120 (1%)

Query: 328 SFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSM 387
           +F+  Q  S+  F  +AGIV GL+G+GGG ++GPL L +G+ P VS+AT    +  +SS 
Sbjct: 356 NFKKLQFYSFFTF--VAGIVAGLIGIGGGMVLGPLMLVMGIHPSVSTATTASMILLTSSS 413

Query: 388 SVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
             V + +    P  YALYF  V  + A++G+  +   +   G  S+++  LA +I ++ +
Sbjct: 414 VAVMFVMSGLVPWQYALYFFCVCLVGAYIGKSRIDSYVRKTGMTSVLVGTLATIIALATV 473


>gi|118348626|ref|XP_001007788.1| hypothetical protein TTHERM_00069500 [Tetrahymena thermophila]
 gi|89289555|gb|EAR87543.1| hypothetical protein TTHERM_00069500 [Tetrahymena thermophila
           SB210]
          Length = 570

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 43/140 (30%), Positives = 73/140 (52%), Gaps = 4/140 (2%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PM+ L++ ++ K AT IS C++ G+ ++     +  +HP  D PIIDY++ L+I PM++L
Sbjct: 63  PMVMLMLNYETKIATFISYCVLFGSCLANSTLLIFKKHPLYDKPIIDYNIVLMINPMVLL 122

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           G +IG+  N++  +    +L I   I  S   F KG+  +K    LKKE  +   S    
Sbjct: 123 GTNIGIFLNILLPEIAAGILFIGFLILVSPYMFKKGLNLYK----LKKEQQKNQLSQSLL 178

Query: 227 AGEVEYKSLPSGPRSGPQKD 246
             + E ++     +    KD
Sbjct: 179 ENDQENETKEGFSKQETGKD 198



 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 65/287 (22%), Positives = 119/287 (41%), Gaps = 44/287 (15%)

Query: 196 TKAFLKGVET----WKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPE 251
           + ++ +  ET     KR++IL       +G N     EV Y      P S  + +A   E
Sbjct: 278 SNSYFQAQETNDDRLKRQSIL-------IGHNSIKDNEVSYNQKYVDPTSTEELEAFYQE 330

Query: 252 VTILENINWKELGLLVFVWVAFLGLQIAKNHT--------ASCSIVYWVLD----LLQIP 299
               + +  K+L LL+ V+ +   L   +            +CSI YW+ +    +L + 
Sbjct: 331 E--YKQVPTKKLLLLIVVFFSVQMLVFIRGGKGLKSFVGITTCSISYWITNAGILVLAVA 388

Query: 300 VSLVVSLYEAISLYKGRRVIASK-------GDDGKSFRVFQLVSYCAFGVLAGIVGGLLG 352
              V+  +  +   K ++++  K        DD               G+ AG++ G  G
Sbjct: 389 AVFVIRFF-LLKWEKNKKILIEKYNLKEEFADDLDVTNSIMYAKISVAGLTAGMLAGTFG 447

Query: 353 LGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATI 412
           +G G  + P+ L  GV PQV++AT  F   F S+ ++++ +      +   + F    +I
Sbjct: 448 VGAGLALVPILLASGVNPQVAAATCGFNYFFISTTTIIQVFTNSYLDLSQIVLF----SI 503

Query: 413 AAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIG 459
            +F+G  V+ K I          +++ FM+F       G+ I N+I 
Sbjct: 504 LSFIGGFVIAKFIYNFVERKKASYLVVFMVF-------GLAILNIIS 543


>gi|221481168|gb|EEE19573.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 299

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 61/100 (61%)

Query: 109 LSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGI 168
           + L++ F+   ATA S+C++ G++++ +  N   R+P +DMP+ID D+ LL+ PM M G 
Sbjct: 1   MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60

Query: 169 SIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKR 208
           SIGV  N +   W++ VLL+V  +  + +   +  +  KR
Sbjct: 61  SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100


>gi|237844563|ref|XP_002371579.1| hypothetical protein TGME49_020560 [Toxoplasma gondii ME49]
 gi|211969243|gb|EEB04439.1| hypothetical protein TGME49_020560 [Toxoplasma gondii ME49]
          Length = 299

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 61/100 (61%)

Query: 109 LSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGI 168
           + L++ F+   ATA S+C++ G++++ +  N   R+P +DMP+ID D+ LL+ PM M G 
Sbjct: 1   MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60

Query: 169 SIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKR 208
           SIGV  N +   W++ VLL+V  +  + +   +  +  KR
Sbjct: 61  SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100


>gi|452823720|gb|EME30728.1| hypothetical protein Gasu_19670 [Galdieria sulphuraria]
          Length = 711

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 77/145 (53%), Gaps = 3/145 (2%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P   F W   L  ++ F  A   + GGVGGG +FVP+L L  G+   +A AIS+ ++ GA
Sbjct: 140 PLTDFTWRDGLTFVLVFIIAGLSNAGGVGGGFLFVPVLVLATGYRASTAAAISQALVTGA 199

Query: 132 AVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLF 191
           + +  +Y L  RHP  + P IDY + +   P ++ G SIGV  N +F ++     L  L 
Sbjct: 200 SGANTFYGLIRRHPKRERPRIDYGVVIHFIPSVLCGTSIGVLLNELFPNFFTLFALSALV 259

Query: 192 IGTSTKAFLKGVETWKRETILKKEA 216
           +     +  KG+  WK+E   +KEA
Sbjct: 260 LYVFYVSLKKGISLWKQE---RKEA 281



 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 32/135 (23%), Positives = 70/135 (51%), Gaps = 5/135 (3%)

Query: 342 VLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSM-SVVEYYLLKRFP 399
           ++ G +G  +G GG F+  P+ +  +G+ P V  +TA F M F+S   S ++Y    +  
Sbjct: 576 IILGAIGAWVGAGGSFMSTPILVAGIGMDPVVVQSTAGF-MNFTSGFSSALQYIFDHQMK 634

Query: 400 VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVS-AISLGGVGISNMI 458
           + Y L   A   + +F G +++  L+      ++++ +++ ++F + A+ L   G+  +I
Sbjct: 635 IDYGLSLGATTLVGSFTGLYILNGLVARYNLQAILVIVMSIVMFGAFAVDLYA-GVQELI 693

Query: 459 GKFHRHEYMGFENLC 473
           G    +E+    ++C
Sbjct: 694 GVLDLNEHFPIHSIC 708


>gi|221501858|gb|EEE27611.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 299

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 61/100 (61%)

Query: 109 LSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGI 168
           + L++ F+   ATA S+C++ G++++ +  N   R+P +DMP+ID D+ LL+ PM M G 
Sbjct: 1   MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60

Query: 169 SIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKR 208
           SIGV  N +   W++ VLL+V  +  + +   +  +  KR
Sbjct: 61  SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100


>gi|348690321|gb|EGZ30135.1| hypothetical protein PHYSODRAFT_353731 [Phytophthora sojae]
          Length = 449

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 109/204 (53%), Gaps = 15/204 (7%)

Query: 285 SC-SIVYWVLDLLQIPVSLVVSLY--EAISLYKGRRVIAS----KGD-DGKSFRVFQLVS 336
           SC S  YW+L  L + + + ++L+    + L    +V+      +GD      R+    +
Sbjct: 249 SCNSPDYWILTFLPLLILVGITLWVGYHLRLLNRSKVLCDYPFVEGDIHWIKRRILVFPT 308

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
            C    +AG+  GLLG+GGG + GP+ LE+G+ P V SATA F + F+SS + +++ +  
Sbjct: 309 LC---TMAGVAAGLLGIGGGMVKGPIMLEMGILPPVQSATANFMILFTSSSTTLQFAING 365

Query: 397 RFP----VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGV 452
           +FP      Y  +F  +  I    GQ VV  L+    R S+++++LA  I +SA+++G +
Sbjct: 366 QFPGERQYDYIAWFALIGCIGGLCGQKVVAYLVKKYKRESIMVYLLAVTIGLSALAMGFI 425

Query: 453 GISNMIGKFHRHEYMGFENLCKYD 476
           G+ + +    +  ++GF  +C  +
Sbjct: 426 GLKSTLSDIEKGVHLGFHGICDNE 449



 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 62/116 (53%), Gaps = 1/116 (0%)

Query: 114 GFDPKSATAISKCMIMGAAVSTVYYNLKLRHPT-LDMPIIDYDLALLIQPMLMLGISIGV 172
           G  PK A  +SK  I G AVS   +N   +HPT   +P+I+Y +A +++P  ++G   GV
Sbjct: 7   GLSPKHAIPLSKATIFGNAVSAYLFNFNRKHPTNAKLPLINYQVAGIMEPTTLIGAIFGV 66

Query: 173 AFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAG 228
             N +F DW++ VLL+ L    + K  LKG +  ++E+  +    + +   G   G
Sbjct: 67  MMNHMFPDWLILVLLVSLLSYITYKTILKGNKISEKESKHQLSVVKSVLKGGPDGG 122


>gi|403372026|gb|EJY85900.1| hypothetical protein OXYTRI_16112 [Oxytricha trifallax]
          Length = 499

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 97/437 (22%), Positives = 169/437 (38%), Gaps = 74/437 (16%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           PM     GF  + A A+S   I+   V+   Y+   +HP  D   IDY LA ++ P ++L
Sbjct: 68  PMCIAFFGFGTREAVALSGFCILICQVTKWAYSWSQKHPEKDTVSIDYGLATVMLPTVLL 127

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE----------------- 209
           G  IG    ++    ++ +LL  L    + ++ LK  E +++E                 
Sbjct: 128 GSFIGTFVTILVPPIVLQILLTALLTFLTVQSGLKAKEIYEKENAKIKKLKEAEEAKAAA 187

Query: 210 -----------------TILKKEAARCLGS----NGAGAGEVEYKSLPSGPRS------- 241
                            T+  K  +   GS        A +V        P +       
Sbjct: 188 EADKMAKANRQSVQVVDTVDGKRLSINRGSLREKKPTIASKVHDNDTTDDPSAKLLHGGD 247

Query: 242 ---GPQKDAMEPEVTIL--ENINW---KELGLLVFVWVAFL-----GLQIAKNHTASCSI 288
              GP ++ ++    +L  E  +W   K L  LV +    +     G +  K+    C +
Sbjct: 248 YSEGPTQEEIDKVDAMLRREKTHWQWDKHLTCLVVLVSQVIINIIRGSKSTKSVVEHCGV 307

Query: 289 VYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGK----------SFRVFQLVSYC 338
             W+L      V L  ++   +S+   +R+IA +    K           F    + S  
Sbjct: 308 ADWIL------VGLYAAICITVSVIAIKRIIAEQALKTKVGKGLIPSDIRFNKQTVRSVV 361

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
               + G   G LGL GG I  PL L  GVPP V+S+T  + + FS+  + V Y +    
Sbjct: 362 ITAFVGGWASGCLGLSGGAIFNPLLLNQGVPPSVASSTGMYMILFSTIGTCVVYSIQGSL 421

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMI 458
              +  +      +A+  G + + K++   GR S ++ +L  ++ +S I +   G   + 
Sbjct: 422 NFAFGGWIGGWCCLASVGGMYALDKVVKKFGRQSPLVVVLTGVLALSTILVPIFGYIEIH 481

Query: 459 GKFHRHEYMGFENLCKY 475
           GKF R+      ++  Y
Sbjct: 482 GKFLRNPDYSMWDISSY 498


>gi|302834698|ref|XP_002948911.1| hypothetical protein VOLCADRAFT_89258 [Volvox carteri f.
           nagariensis]
 gi|300265656|gb|EFJ49846.1| hypothetical protein VOLCADRAFT_89258 [Volvox carteri f.
           nagariensis]
          Length = 1005

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 92  AFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPT-LDMP 150
           +  S  GVGGG IF+P+ + ++GF+ KS+TA+S+  I   A +T+  NL   HP+   + 
Sbjct: 29  SVASSSGVGGGAIFIPLFNALVGFNLKSSTALSQACITFGAAATLLANLMHHHPSDPTVA 88

Query: 151 IIDYDLALLIQPMLMLGISIGVAFNVIFADW 181
           ++D+ L  ++ PML++G+ IGV  NV+   W
Sbjct: 89  LVDFTLVAVLTPMLLMGVGIGVLLNVMLPSW 119


>gi|397643932|gb|EJK76166.1| hypothetical protein THAOC_02088 [Thalassiosira oceanica]
          Length = 534

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 61/108 (56%)

Query: 103 GIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQP 162
           GI VP+  LI+ F PK A  +S   + G +++    N + RHP  D P+ID+DL ++++P
Sbjct: 77  GILVPIYILILNFLPKHAIPLSNVTVFGGSIANTLLNWRKRHPVADRPLIDWDLIVVMEP 136

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET 210
             +LG  IG   N +  +  + +LL+VL + TS     K    +++ET
Sbjct: 137 PTLLGALIGANLNKLLPETAIAILLVVLLVYTSFNTLKKAHSMYQKET 184



 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 67/123 (54%), Gaps = 5/123 (4%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
           +GK   ++ ++S      LAG+  GL G+GGG I GPL + LGV P V+SAT+   + F+
Sbjct: 386 NGKKTIIYPMIS-----TLAGVAAGLFGIGGGIIKGPLMIALGVHPAVASATSACMILFT 440

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           S  +   + +       YA+    +  ++  VGQ V+  ++    R+S I + + F++ +
Sbjct: 441 SFTATTTFSVYGLMVRDYAIACSILGFVSTLVGQKVMNSILRKTNRSSYIAYSIGFVVLL 500

Query: 445 SAI 447
           SAI
Sbjct: 501 SAI 503


>gi|307108116|gb|EFN56357.1| hypothetical protein CHLNCDRAFT_144844 [Chlorella variabilis]
          Length = 670

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 60/107 (56%)

Query: 103 GIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQP 162
            I VP+  +++GF    A A+S   I+G  ++ + +N++ RHP    P+ID+DL ++++P
Sbjct: 147 AILVPLNLMVLGFGTAPAVALSNITIVGGTLANLAFNVQRRHPARHGPLIDWDLIMVMEP 206

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRE 209
             +LG  +G   N I   W+ TV L VL    S K F KG E   RE
Sbjct: 207 STILGALVGGYLNKILPGWLTTVSLSVLLALISYKTFNKGREIHARE 253



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/216 (25%), Positives = 94/216 (43%), Gaps = 15/216 (6%)

Query: 277 QIAKNHTASCSIVYWVLDLLQIPVSL-VVSLYEAISLYKGRRVIAS-----KGD-DGKSF 329
            + K+  A  S+ YW+     +PV+L  +S      L KG    AS     +GD +    
Sbjct: 453 DVLKSQAACPSLPYWLAATAMVPVTLATLSAVRTYLLKKGAAQRASGHELLEGDVEWTPG 512

Query: 330 RVFQLVSYCAFGVLAGIVGGLLGLGGGF--------IMGPLFLELGVPPQVSSATATFAM 381
                 + C+F  LA  + G+ G               GPL LE+GV P+V++AT+   +
Sbjct: 513 STLLYPALCSFAGLAAGIFGVGGGIIKARWLAVRENPAGPLMLEMGVLPEVAAATSATMI 572

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
            F+S+ + V +         YA     +  I    GQ +V  +   +   SL++F++A +
Sbjct: 573 FFTSAFASVVFISFGAVQWDYAAMLFTMGLICTAAGQLLVLWVNQHIKSRSLLVFVMATV 632

Query: 442 IFVSAISLGGVGISNMIGKFHRHEYMGFENLCKYDR 477
           + VS+++L   G  +       H+   F  +C  +R
Sbjct: 633 LSVSSVALAVQGAQSTAAAAAAHDLWHFHGICGTNR 668


>gi|294948278|ref|XP_002785683.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|294954883|ref|XP_002788342.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239899706|gb|EER17479.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239903654|gb|EER20138.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 354

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 88/351 (25%), Positives = 157/351 (44%), Gaps = 61/351 (17%)

Query: 98  GVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL--KLRHPTLDMPIIDYD 155
           G+GGG +FVP+L LI     K ATA+S+ +I  A+++ + +N   + R       +I + 
Sbjct: 27  GIGGGVLFVPVLRLIGKVPLKEATALSQALIASASLAAILFNFFEQYRARNESKALIVWP 86

Query: 156 LALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT----STKAFLKGVETWKRETI 211
             +LI P  ++G  IGV    IF+ W+ ++ +++L+       S  A  KG++ WK ET 
Sbjct: 87  FVILIIPCTVIGSLIGV---YIFS-WLPSLFILILYFCYACLGSFMACKKGIKLWKAETR 142

Query: 212 LKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWV 271
            K++      ++ + + EV  +  P                 +LE  N K+L       +
Sbjct: 143 AKRQIPVGDSTDSSRSSEVAQEIPP-----------------LLEMPNRKKL-------I 178

Query: 272 AFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRV 331
           A+           S   + W + L+  P+             KG      +        +
Sbjct: 179 AY----------TSIVALIWAVCLIFPPL-------------KGNSATQKRSPGAVKIGL 215

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVE 391
             + S    G+L+ I+G      G   +  + L LG+ P+ ++ATAT  +  +SS + + 
Sbjct: 216 VLMTSTIVIGLLSSIIGT----AGALFIIRVVLSLGLDPKQATATATVVIFATSSRTALS 271

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMI 442
           + L   FP    L+ V +    A +G+ +V KLI   GR S+++ +LA M+
Sbjct: 272 FALGGYFPPASNLWIVVLPFAGALLGKTIVAKLIAKTGRLSILVLLLAAMV 322


>gi|255635532|gb|ACU18117.1| unknown [Glycine max]
          Length = 114

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 31/68 (45%), Positives = 43/68 (63%)

Query: 26 VCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTI 85
          V V A + +  + +     E   +++L K VNFLW   GSGY H WP+++FGW I+ GTI
Sbjct: 25 VSVSALQHVSHKNINPTTVETTQTSFLGKVVNFLWSSSGSGYQHTWPDIEFGWRIITGTI 84

Query: 86 IGFFGAAF 93
          IGF G+AF
Sbjct: 85 IGFLGSAF 92


>gi|294944335|ref|XP_002784204.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897238|gb|EER16000.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 354

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 160/344 (46%), Gaps = 43/344 (12%)

Query: 98  GVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDM-PIIDYDL 156
           GVGGG +FVP+L+L+     K+ATA+S+ +IM A V+ V ++L  ++ +     +I+   
Sbjct: 27  GVGGGILFVPLLNLVGRLPSKNATAMSQGLIMSATVAKVLFSLHAQYTSRRRGRVINLPY 86

Query: 157 ALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT----STKAFLKGVETWKRETIL 212
            +L+ P +++G  IG+        W+  ++ ++L++ T    S  + LKG   W+ ET  
Sbjct: 87  VVLMLPCMIVGGLIGIYIY----SWLPVLIQLILYVITALFGSAMSLLKGFHLWRAETSE 142

Query: 213 KKEAARCLGSNGAGAGEVEYKS-----LPSGPRSGPQKDAMEPEVTILENINWKELGLLV 267
           K++A R    +  G  +V   S     +P   R   ++ A+   +TI            V
Sbjct: 143 KEKAKRDAEVSAGGTLKVPASSTVETVVPPLLRPITRRKAVACVITIFA----------V 192

Query: 268 FVWVAFLGLQIAKNHTAS------CSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIAS 321
           ++ V    L +  + T S      C  +YW L ++ + V L+V L  A+       +  S
Sbjct: 193 WILVILSRLILGSSSTRSIVGISYCEGLYWALSVVVVVVLLMVPLAYAL-------IDRS 245

Query: 322 KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 381
            G    +  +    S    G LA +V    G+ GG I+ PL +  G+ P  +S T +  +
Sbjct: 246 PGSSKAALTLSG--SLLGIGFLAAVV----GISGGIIITPLVMFTGLTPPQASGTGSVVI 299

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
             +SS   + + L    P   AL+ +A+    A  G  ++ +++
Sbjct: 300 LVNSSSLALSFGLGGYLPDASALWIIALPFCGALTGDIILTRIM 343


>gi|449018085|dbj|BAM81487.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 607

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 74/144 (51%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P  +F W   L   + F  A   S GGVGGG +FVP+L L+ GF  + A A+S+ ++ G 
Sbjct: 124 PLSRFTWRDGLTFGLIFVIAGLSSTGGVGGGFLFVPVLVLLTGFQARRAAALSQALVTGG 183

Query: 132 AVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLF 191
           + +  +Y L  RHP  + P IDY +  +    ++ G S+GV  N++F ++    +L VL 
Sbjct: 184 SGANAFYGLITRHPFRERPRIDYYVVTVFMATILCGTSVGVILNILFPNFFTLFMLAVLV 243

Query: 192 IGTSTKAFLKGVETWKRETILKKE 215
                 +  K ++ WK E    + 
Sbjct: 244 AYVFYISIKKAIQLWKDERAASRN 267



 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 32/133 (24%), Positives = 63/133 (47%), Gaps = 1/133 (0%)

Query: 342 VLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           +  G VG  +G GG F+  P+ +  +G+ P V  ATA F    ++  S ++YY+    P+
Sbjct: 474 IFLGAVGAWVGAGGSFMSTPILIAGVGMDPLVVQATAGFMNFVAAFSSAIQYYVNHELPL 533

Query: 401 PYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGK 460
            Y L     A + +        +L+      ++++FI+A ++F +A+     G   +   
Sbjct: 534 DYGLALGGTAFLGSLSFVVFFNRLVYKFKLQAILVFIMAGVMFGAAVLNIYAGALELKTT 593

Query: 461 FHRHEYMGFENLC 473
            ++ +   F N+C
Sbjct: 594 LNQGKPFPFGNIC 606


>gi|145531341|ref|XP_001451437.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419092|emb|CAK84040.1| unnamed protein product [Paramecium tetraurelia]
          Length = 430

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 77/323 (23%), Positives = 135/323 (41%), Gaps = 42/323 (13%)

Query: 94  GSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIID 153
            +VGG+GGG   +P+L +++ F    AT     +  G  +      +  +HP  D  IID
Sbjct: 46  STVGGLGGGIEKIPILIVMLNFSQSKATLYVYVLTFGTNLVNFLMLIYQKHPLADKQIID 105

Query: 154 YDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILK 213
           Y+L+L++ P  + G + G   + I  D  +  +LIV F     K ++K  +  ++E  + 
Sbjct: 106 YELSLILLPTALFGSAFGNILHQILPDIFLISILIVFFSIFVPKLYIKAKQNREQEMQVD 165

Query: 214 KEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWV-- 271
            +    +          +YKS         +   + P    L       L L++F+ V  
Sbjct: 166 NKQKTVINQEVTHLIAEQYKS---------EDQQIIPLYKFL-------LLLIIFMIVQC 209

Query: 272 -----------AFLGLQIAKNHT--ASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRV 318
                      +F+G+Q   +     +   VYW+   + I V L++S    I  + GR  
Sbjct: 210 VLMIRGGKKQQSFIGIQYCSDGKLINNNQAVYWITTGMIIVVLLLISY--GIKYHLGRET 267

Query: 319 --------IASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPP 370
                      K D       F ++ + + G L GI+GG+ G+G G I+  + +   V  
Sbjct: 268 KTKIEIGYFNEKVDFNFIESKFFMIVWIS-GFLGGIMGGMTGVGAGAIIVSILILQNVNS 326

Query: 371 QVSSATATFAMTFSSSMSVVEYY 393
           +V+SAT  F   F S  + +  Y
Sbjct: 327 RVASATGGFQKLFISLFTTILSY 349


>gi|221505087|gb|EEE30741.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 665

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 66/121 (54%), Gaps = 5/121 (4%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           I+VP++ L++GF    ATA S+ ++ G +++    NL  RHP  D P ID DL LL+ PM
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAF-----LKGVETWKRETILKKEAAR 218
            + G + G+  N  +  +++  LL+VL   T+ K F     LK      R+ + ++  + 
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTFRQMMRLKHEGAAARKQLEQRSGSL 264

Query: 219 C 219
           C
Sbjct: 265 C 265


>gi|221482011|gb|EEE20377.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 665

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 66/121 (54%), Gaps = 5/121 (4%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           I+VP++ L++GF    ATA S+ ++ G +++    NL  RHP  D P ID DL LL+ PM
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAF-----LKGVETWKRETILKKEAAR 218
            + G + G+  N  +  +++  LL+VL   T+ K F     LK      R+ + ++  + 
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTFRQMMRLKHEGAAARKQLEQRSGSL 264

Query: 219 C 219
           C
Sbjct: 265 C 265


>gi|237836929|ref|XP_002367762.1| hypothetical protein TGME49_005300 [Toxoplasma gondii ME49]
 gi|211965426|gb|EEB00622.1| hypothetical protein TGME49_005300 [Toxoplasma gondii ME49]
          Length = 665

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 66/121 (54%), Gaps = 5/121 (4%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           I+VP++ L++GF    ATA S+ ++ G +++    NL  RHP  D P ID DL LL+ PM
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAF-----LKGVETWKRETILKKEAAR 218
            + G + G+  N  +  +++  LL+VL   T+ K F     LK      R+ + ++  + 
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTFRQMMRLKHEGAAARKQLEQRSGSL 264

Query: 219 C 219
           C
Sbjct: 265 C 265


>gi|384252265|gb|EIE25741.1| hypothetical protein COCSUDRAFT_64812 [Coccomyxa subellipsoidea
           C-169]
          Length = 553

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/141 (35%), Positives = 82/141 (58%), Gaps = 3/141 (2%)

Query: 105 FVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQPM 163
           FVP+ ++++ F  K A A+S+ +I G A++ V   L  +HP     P+ID+DLAL++ P+
Sbjct: 91  FVPLFNILLQFSVKGAAALSQAVIAGGALAGVGVTLHKKHPYDPSKPLIDFDLALMLLPV 150

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKE-AARCLGS 222
           ++LG+S+GV  N +F +W++TVLL++L I  +     K +   + E + K E AA    S
Sbjct: 151 ILLGVSVGVLANQLFPNWLITVLLLLLLIFLTHMTVKKALSLHRAEVLYKAEQAADTCSS 210

Query: 223 NGAGAGEVE-YKSLPSGPRSG 242
             A A   E + S  S  + G
Sbjct: 211 ARAKADSSEGHASASSAQQQG 231



 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 65/244 (26%), Positives = 114/244 (46%), Gaps = 11/244 (4%)

Query: 237 SGPRSGPQKDAMEPEVTILEN-INWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDL 295
           SG      +D  +   ++      W ++ +L+  W  F+  Q+  +    CS  YW +  
Sbjct: 300 SGDEGEHDEDRQQEHRSVRSQPATWLQVVVLLGCWGIFVTFQLLLSRWPHCSGPYWAIFA 359

Query: 296 LQIPVSLV--VSLYEAISLYKGRR--------VIASKGDDGKSFRVFQLVSYCAFGVLAG 345
           +Q  + L+  V+    +S  K R         ++AS   +  ++ + +L+      +LAG
Sbjct: 360 VQAGLCLIAEVAFLRLVSARKQRSQESRLEQPMLASVYKEAPAWTLPRLIRSAIITLLAG 419

Query: 346 IVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY 405
            + GLLG+GGG I+ PL LE G  P V++AT+T  + FSSS + + +         +AL 
Sbjct: 420 FIAGLLGIGGGMIVNPLLLEFGTHPHVAAATSTLMVLFSSSSAALSFGFSHLLNAQFALV 479

Query: 406 FVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHE 465
           F      A+ +G  +V +++   G AS+I+F+LA +I   A      G    +     H 
Sbjct: 480 FGLCCMAASLIGVLIVSRIVERSGNASIIVFLLALVIATGATLTAAFGGRFAVQDLIHHR 539

Query: 466 YMGF 469
            +GF
Sbjct: 540 SIGF 543


>gi|154342935|ref|XP_001567413.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134064745|emb|CAM42850.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 510

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 98/391 (25%), Positives = 174/391 (44%), Gaps = 27/391 (6%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP--TLDMPIIDYDLALLIQPML 164
           P+   I+    + A  +S+  I G +  ++Y  ++ +HP  + D P+I+Y    LI P+ 
Sbjct: 120 PLFCGIMKIPMQFAVGMSQATICGQSTVSMYLIVQQKHPDSSWDRPLINYQYLSLILPLG 179

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET-----ILKKEAARC 219
           ++G  IG   + +F D +  +LL V+      +  +K V+ +K +T      ++ + A  
Sbjct: 180 LIGTLIGGILSKLFPDVLRLLLLFVILSTALYRTLVKVVKQYKEDTNAWQATVEADGANA 239

Query: 220 LGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTIL----------EN-----INWKELG 264
               G      E +      R+   K       T+L          EN        +EL 
Sbjct: 240 ASQQGNYGNNDERQGYELINRAVDAKSESLQGTTVLATSAHSLPAFENPPQSQYPQQELA 299

Query: 265 LLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGD 324
           +    ++  L   I + +     ++YW+  L  IP+  + +++  ++  K R+++ S   
Sbjct: 300 MNFTCFLVLLLFNILRMYLVCGGLLYWLCVL--IPLVFLSTVFY-LNYEKLRKLVESDSA 356

Query: 325 DGK-SFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTF 383
               ++     V Y    V+AG    +LG+GGG ++G +  E+G+ PQ +SAT   A  F
Sbjct: 357 QVTFAWTQKNTVMYPMVAVIAGASAAMLGIGGGLVLGFVLNEVGLVPQQASATGGMATLF 416

Query: 384 SSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIF 443
            +  SV+E  +     V + + F  V   +  +GQ V  + I   G   LII  LAF++ 
Sbjct: 417 IAFSSVLELLVTGHLVVDFGIVFCIVGLCSGSLGQFVFMEYIKGHGLNYLIIGSLAFVLG 476

Query: 444 VSAISLGGVGISNMIGKFHR-HEYMGFENLC 473
            S +SLGG GI   +   H     M F +LC
Sbjct: 477 GSLVSLGGYGIYTTVISVHAGRSLMAFGHLC 507


>gi|301093157|ref|XP_002997427.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110683|gb|EEY68735.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 432

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 56/166 (33%), Positives = 88/166 (53%), Gaps = 6/166 (3%)

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F ++ ++ T++ F   A G+ GGVGGGG+ VPM  ++ G +PK A  +SK  I G+AV+ 
Sbjct: 116 FTYKDLITTLVAFSSTALGAGGGVGGGGLLVPMY-ILGGLNPKHAIPLSKVTIFGSAVAM 174

Query: 136 VYYNLKLRHP-TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT 194
              N + +HP   + P+ID+ L  L++P  ++G   GV  N IF +W++ V L+ L    
Sbjct: 175 YTVNFRRKHPLNKNRPLIDFGLVGLMEPTTLIGTVFGVMLNHIFPNWLILVCLVTLLTFI 234

Query: 195 STKAFLKGVETWKRETI----LKKEAARCLGSNGAGAGEVEYKSLP 236
           + K  LKG   +  E      L K   R          E +++SLP
Sbjct: 235 TYKTILKGNTRFDVEVAARRWLNKTRRRRKVRQIREEDEADFQSLP 280


>gi|145501808|ref|XP_001436884.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404029|emb|CAK69487.1| unnamed protein product [Paramecium tetraurelia]
          Length = 424

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 165/368 (44%), Gaps = 54/368 (14%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPIIDYDLALLIQPMLM 165
           P+L +++ ++   AT +S C++ G  V   +  +K  HP     P+++YDL +++   + 
Sbjct: 62  PILLMMLDYEQSYATQLSYCLMFGGCVINTFLLMKKSHPYDQKRPLVNYDLVVILNCSIA 121

Query: 166 LGISIGVAFNVIFADWMVT----VLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLG 221
           LG  +G   NV  A  + T    + LI++      KA        K+E + K    RC  
Sbjct: 122 LGSYLGSILNVFLAPIIETMFQQIFLIIVIPFLLNKA--------KKEKLRK---IRC-- 168

Query: 222 SNGAGAGEVEYKSLPSGPRSGPQKDAM--EPEVTILEN-----INWKELGL-LVFVWVA- 272
                  E++ +          QKD++  E +  +L+N        K+L + L F  V+ 
Sbjct: 169 -----QSELDLEKYLLN-----QKDSIYTEEQQLLLQNEFQNFCPSKKLAIALSFFIVSQ 218

Query: 273 -------FLGLQIAKNHTASCSIVYWV-LDLLQIPVSLVVSLY--EAISLYKGRRVIASK 322
                  +L   I  N       + W+ L ++ I +S +V  Y  +   ++   ++   +
Sbjct: 219 IIMTGGKYLKPFIPLNKCFDFRYMLWIMLFIVNIFMSRLVYTYGLKKEMIFDDYKIYMQE 278

Query: 323 GDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMT 382
               K+  +   VS    G  AG++ GLL LG G IM P+ LELG+ P++++AT+ F   
Sbjct: 279 RYFQKNRFILIYVS----GFFAGLISGLLALGAGLIMVPVLLELGLHPRIATATSAFNYF 334

Query: 383 FSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIF---ILA 439
           F    ++V+     +  +    +F  +A +   +  H   KLI  L    L+I+   +LA
Sbjct: 335 FIGLTNIVKLITDSQISIAEIAWFFGLALVFGTICCHFSLKLIEKLQLVHLVIYFTILLA 394

Query: 440 FMIFVSAI 447
            + F++ I
Sbjct: 395 ILNFIAGI 402


>gi|145495145|ref|XP_001433566.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400684|emb|CAK66169.1| unnamed protein product [Paramecium tetraurelia]
          Length = 399

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 91/320 (28%), Positives = 148/320 (46%), Gaps = 31/320 (9%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           +LGT++G   A     GG+GGG I  P++ +++G   K A   +  M+ G ++       
Sbjct: 19  ILGTLVGLAQA-----GGIGGGPIVSPVMMVLLGCSSKQAIWNTYIMLFGGSIGNYARLG 73

Query: 141 KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFL 200
           + R      P+I+Y L  +  P+L+ G  +GVA       W+  V++++         FL
Sbjct: 74  RERISNGSSPLINYQLVQITLPLLLAGAILGVA----TGKWLPKVVVVIFLFAILLNVFL 129

Query: 201 KGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSG-PQ--KDAMEPEVTILEN 257
           K  + +K+  I +KE +  L        +VE K L     S  PQ  +  +E E  +   
Sbjct: 130 KTKKVYKK--IREKEQSELL-------QQVEMKELMLTDYSAVPQDLQQILENESKLYPT 180

Query: 258 INWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLY---- 313
            N KE+   V + VA   L+ A    +   I Y  ++   I   +    Y  +  Y    
Sbjct: 181 ENLKEIAFSVIIVVALTLLKGAATIPSILGISYCGMEFHFINFIIFGIGYYNVQRYRKWI 240

Query: 314 -KGRRVIASKGDD---GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVP 369
            K      S G D   GK   VF +    +    AG +GGL+GLGGG ++ P++LE G+ 
Sbjct: 241 KKDEEFKQSLGYDFSGGKISEVFGITVKSSMK--AGFLGGLVGLGGGVVLTPIWLETGIH 298

Query: 370 PQVSSATATFAMTFSSSMSV 389
           P  ++A+ATF + F+S +SV
Sbjct: 299 PPRAAASATFTVMFTSFISV 318


>gi|224111746|ref|XP_002332885.1| predicted protein [Populus trichocarpa]
 gi|222833707|gb|EEE72184.1| predicted protein [Populus trichocarpa]
          Length = 151

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 84/133 (63%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
            +LAG++GG+ G+GGG ++ PL L +G+ P++++AT +F + FSSSMS ++Y LL    V
Sbjct: 19  ALLAGMLGGVFGIGGGMLISPLLLHVGIAPEITAATCSFMVFFSSSMSALQYLLLGMEHV 78

Query: 401 PYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGK 460
             A+    +  +A+ +G  VV++ I+  GRAS+I+F ++ ++ +S + +   G  N+   
Sbjct: 79  DTAIILSVICFVASLLGLLVVQRAIVKYGRASMIVFSVSTVMALSTVLMTSFGALNVWRD 138

Query: 461 FHRHEYMGFENLC 473
           ++    MGF+  C
Sbjct: 139 YNSGRNMGFKLPC 151


>gi|118395416|ref|XP_001030058.1| hypothetical protein TTHERM_01164150 [Tetrahymena thermophila]
 gi|89284345|gb|EAR82395.1| hypothetical protein TTHERM_01164150 [Tetrahymena thermophila
           SB210]
          Length = 503

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 148/355 (41%), Gaps = 51/355 (14%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           I VP+L    G++ K + A+    I  A++  +   +K +      P+IDY + +L  P 
Sbjct: 135 IVVPVLMSFFGYETKKSIALVFITIFSASLGNLMSFMKQKSKD-GGPVIDYRIVVLSLPT 193

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSN 223
           +M+G   GVA N  F   +V    +  FI    ++  K  +++KRE    KE      +N
Sbjct: 194 IMVGSIYGVALNK-FIPQIVLAFALAFFI---LQSLTKTYKSYKREK--AKEVQENQNNN 247

Query: 224 GAGAGEVEYK-------SLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWV----- 271
            +      Y+        LP  P S   K    P+ ++L  I    LG  VF  +     
Sbjct: 248 KSDQSSPLYELKQPNENGLP--PISQSSKKEQYPK-SLLSKIFCITLGFAVFSLLRGGSK 304

Query: 272 --AFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKG------------RR 317
             + LG+         C  +Y + +L         S Y A  L KG             +
Sbjct: 305 FDSLLGI-------PPCGFLYQISNL--------ASAYVAYLLVKGIIAGLVIQNKIEEK 349

Query: 318 VIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATA 377
           ++ +   D       ++  +     LAG++G   GLGGG ++ P +LE G+P   ++  +
Sbjct: 350 LVVNTSSDDTQLTAQEMQGFAVTAFLAGLIGSTFGLGGGMVLVPKWLEQGIPSYKTTPCS 409

Query: 378 TFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
              +  ++  S +++ L   +     +YF  +A +++F+    +++ +    +AS
Sbjct: 410 ISLLFLTAFNSAIQFALGGVYETEEIIYFSVIALLSSFIVSSSIQQYVKKTNQAS 464


>gi|348689886|gb|EGZ29700.1| hypothetical protein PHYSODRAFT_309925 [Phytophthora sojae]
          Length = 580

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 55/195 (28%), Positives = 101/195 (51%), Gaps = 8/195 (4%)

Query: 290 YWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKG----DDGKSFRVFQLVSYCAFGVLAG 345
           YWVL L+ + + + +SLY    L    R+    G    +    +   + + + A   +AG
Sbjct: 386 YWVLTLMPLSILVGISLYVGYRLRLTNRLKVVSGYFFVEGDMHWVKRRTLVFPAVCTIAG 445

Query: 346 IVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALY 405
           +  GLLG+GGG + GP+ LE GV P V SATA+F + F++S + +++ +  +FP  +   
Sbjct: 446 VAAGLLGIGGGMVKGPIMLEAGVLPAVQSATASFMILFTASSTTLQFAINGQFPGEFQFD 505

Query: 406 FVA----VATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKF 461
           F+A    V  +  F G   V   +    R S++++ LA  I +SA+++  +G+ + +   
Sbjct: 506 FMAWLAFVGFVGGFCGLKCVGYFVKKYRRESIMVYTLAATIGLSAVAMRFIGLQSTLSDI 565

Query: 462 HRHEYMGFENLCKYD 476
               ++GF  +C  +
Sbjct: 566 ESGVHLGFHGICDNE 580



 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 48/138 (34%), Positives = 76/138 (55%), Gaps = 13/138 (9%)

Query: 76  FGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           F +  +L T+I F   A G+ GG+GGGG+ VPM  +  G +PK A  +SK  I G+AV+ 
Sbjct: 116 FTFNDLLTTLIAFSCTALGAGGGIGGGGLLVPMY-IFAGLNPKHAIPLSKVTIFGSAVA- 173

Query: 136 VYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTS 195
                      ++ P+ID+ L  L++PM ++G   GV  N I  +W++ VLL+ L    +
Sbjct: 174 -----------MNRPLIDFALVALMEPMTLVGTVFGVMLNHISPNWLILVLLVTLMSFIT 222

Query: 196 TKAFLKGVETWKRETILK 213
               LKG +   +E+ L+
Sbjct: 223 YNTVLKGNKIQDKESKLQ 240


>gi|449016482|dbj|BAM79884.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 649

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 95/220 (43%), Gaps = 14/220 (6%)

Query: 27  CVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGSGYHHVWPE---LKFGWEIVLG 83
           C    R  ++++ R        +   L++ N       +GY    P+     F W   + 
Sbjct: 105 CADLGRACQVQSQRCVPCTSSANCSSLQSCNI-----NTGYCD--PKSLFHPFVWNDGIM 157

Query: 84  TIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLR 143
             + F  +A  +  G GGG  ++ +  L++GF   +A+A S   I    ++ V  NL  R
Sbjct: 158 VPLMFISSAVHNAAGTGGGTSYIALFVLVLGFRVATASANSHAFIFAGMIANVLVNLWAR 217

Query: 144 HPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGV 203
           HP  DMP ID++L     P+ + G SIGV  N +F ++ +++LL  L +  +      GV
Sbjct: 218 HPFRDMPRIDWNLVATSVPLFLAGSSIGVFLNQLFPNYFLSLLLAFLMLLLTMLVVWLGV 277

Query: 204 E-TWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSG 242
             TW+    +K+           G    E  SL  GP + 
Sbjct: 278 RLTWRE---MKRHPLESNRDPCPGDVLAEQDSLTCGPETS 314


>gi|223992571|ref|XP_002285969.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977284|gb|EED95610.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 564

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 67/133 (50%)

Query: 103 GIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQP 162
           G+ VP+  LI+GF PK A  +S   + G AV+    N++ RHP  D P++D+DL L+++P
Sbjct: 119 GVLVPIYILIMGFSPKHAIPLSNITVFGGAVANTILNVRKRHPLADRPLVDWDLILVMEP 178

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGS 222
           + + G  IG   N +  + ++ + L+ L   TS     K +  +K E+   +        
Sbjct: 179 LTIAGALIGAFLNKLLPEAVLVLSLVALLSFTSYTTLKKAIRMYKAESKALEAQRGVRKV 238

Query: 223 NGAGAGEVEYKSL 235
            G G  E E   L
Sbjct: 239 RGDGTVESELTRL 251



 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 64/122 (52%), Gaps = 5/122 (4%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
           DG++  V+ LV  C     AG   G+ G+GGG + GPL L +GV P+VSSA++   + F+
Sbjct: 420 DGRATVVYPLV--C---TAAGFFAGMFGVGGGIVKGPLMLAMGVHPKVSSASSACMILFT 474

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           S  +   + +     + YA   + +  +A  VGQ  +  L+    R S I F +  ++ +
Sbjct: 475 SFTATTSFVVFGLLDMDYATVCMTLGFVATLVGQIGLFYLMEKFQRNSYIAFSIGGIVLL 534

Query: 445 SA 446
           SA
Sbjct: 535 SA 536


>gi|348690322|gb|EGZ30136.1| hypothetical protein PHYSODRAFT_475542 [Phytophthora sojae]
          Length = 124

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 68/123 (55%), Gaps = 4/123 (3%)

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP----VPYALYFVAVATIA 413
           + GP+ LE G+ P V SATA+F + F++S + +++ +  +FP      Y  +F  V  + 
Sbjct: 2   VKGPIMLEAGILPPVQSATASFMILFTASSTTLQFAINGQFPGEFQFDYMAWFAFVGFVG 61

Query: 414 AFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            F G   V   +    R S+++++LA  I +SA+++G +G+ + +G      ++GF  +C
Sbjct: 62  GFCGLKCVGYFVKKYKRESIMVYMLAATIGLSAVAMGFIGLQSTLGDLESGVHLGFHGIC 121

Query: 474 KYD 476
             +
Sbjct: 122 DNE 124


>gi|401405669|ref|XP_003882284.1| hypothetical protein NCLIV_020390 [Neospora caninum Liverpool]
 gi|325116699|emb|CBZ52252.1| hypothetical protein NCLIV_020390 [Neospora caninum Liverpool]
          Length = 675

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 70/126 (55%), Gaps = 9/126 (7%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           I+VP++ L++GF    ATA S+ ++ G +++    NL  RHP  D P ID DL LL+ PM
Sbjct: 158 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTCLNLFRRHPFADRPAIDLDLVLLMGPM 217

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKE--AARCLG 221
            + G + G+  N  +  +++  +L+VL   T+ K       T ++   LK+E  AAR L 
Sbjct: 218 QIAGATYGLVINRCWPVYLIMAVLVVLLFATAYK-------TSRQMMRLKREGAAARKLL 270

Query: 222 SNGAGA 227
              +G+
Sbjct: 271 EQRSGS 276


>gi|145528512|ref|XP_001450050.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417650|emb|CAK82653.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 93/370 (25%), Positives = 170/370 (45%), Gaps = 25/370 (6%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           +LG+++G   A     GG+GGG I  P++ +++G   K A   +  M+ G ++       
Sbjct: 19  ILGSLVGLAQA-----GGIGGGPIVSPVMMVLLGCSSKQAIWNTYIMLFGGSIGNFARLG 73

Query: 141 KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFL 200
           + +      P+I+Y L  +  P+L+ G  +GVA       W+  +++++         FL
Sbjct: 74  REKIQDGSSPLINYQLVQITLPLLLAGAILGVA----SGKWLPKLVIVIFLFAILLNVFL 129

Query: 201 KGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINW 260
           K    +K+  + +KE    L        E+        P++  Q    E ++   EN+  
Sbjct: 130 KTKNVYKK--VREKERNDLLIQ--VEMKEININDQNILPQNLQQLKDNESKLYPTENL-- 183

Query: 261 KELGLLVFVWVAFLGLQIAKNHTA-----SCSIVYWVLDLLQIPVSLVVSLYEAISLYKG 315
           KE+ L VF+ VA   L+ A    +      C   Y  ++ +   V           + K 
Sbjct: 184 KEIALSVFIVVALTLLKGAATIPSILGIGYCGYGYHFINFIIFGVGFYNVQRYRQQIKKD 243

Query: 316 RRVIASKGDD---GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
             +  S G D   GK   V+ +    +    AG +GGL+GLGGG ++ P++LE G+ P  
Sbjct: 244 EVLKESIGYDFSGGKISEVYDITVKSSMK--AGFLGGLVGLGGGVVLTPVWLETGINPAR 301

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
           ++A+ATF + F+S +SV    L   + +   L   +V+   +++   +++KL+    R S
Sbjct: 302 AAASATFTVMFTSFISVFIIALSGGYQLSQFLILASVSGCGSYLVAGILKKLVKKYKRES 361

Query: 433 LIIFILAFMI 442
           +II +L  +I
Sbjct: 362 IIIQVLLVVI 371


>gi|253746884|gb|EET01870.1| Hypothetical protein GL50581_856 [Giardia intestinalis ATCC 50581]
          Length = 511

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 57/188 (30%), Positives = 82/188 (43%), Gaps = 12/188 (6%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P L+   E+V+  +   F A   S  G+GGG IFV ML L  G  P  A  +SK MI G 
Sbjct: 8   PGLEVWMEVVVAIVCALF-AMLASAAGIGGGVIFVSMLQLF-GVSPHVAAPLSKAMIFGG 65

Query: 132 AVSTVYYNLKLRHPTLD--MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIV 189
           +       +  +H   D   P I +DL  +I+P  + G  IG   NV+  +W + VL +V
Sbjct: 66  SCVLTCMTI-FQHEDNDPTKPAIIWDLVFIIEPAAVSGALIGALINVVLPEWFLLVLEVV 124

Query: 190 LFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAME 249
             + T+ K     +    RE +   +   C   N      VE     S  R+   K + +
Sbjct: 125 FLLYTTQKMLKNSLTILNRERLAAGKKPVCARRNRT---PVE----SSNRRNPSHKSSHQ 177

Query: 250 PEVTILEN 257
           P  TI E 
Sbjct: 178 PSSTIEEQ 185



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 72/156 (46%), Gaps = 9/156 (5%)

Query: 323 GDDGKSFR-VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAM 381
            D   +FR +   V +   G+ AGI+G +LG+GGG +  P+ +  G+ P+ +   +T  +
Sbjct: 360 SDSLDAFRSILFYVKFILAGLFAGILGAMLGIGGGLLKNPILISFGIEPERARTASTVMI 419

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
            F+S  S++ Y ++      YA   + V  +    G ++   ++      S I FI+  +
Sbjct: 420 AFTSMSSMISYVVIGGLHFEYAWPLMLVIGVFFVSGYYLSGLIMRCFKTKSFIPFIITAL 479

Query: 442 IFVSAISLGGVGISNMIGKF----HRHEYMGFENLC 473
           I V    +    ++NMI  F       +  GF  LC
Sbjct: 480 IIVCTCFI----VANMIIVFVDIAKTGQLPGFAGLC 511


>gi|159469083|ref|XP_001692697.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277950|gb|EDP03716.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1854

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 29/84 (34%), Positives = 54/84 (64%), Gaps = 2/84 (2%)

Query: 89   FGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLD 148
            F +A  +  GVGGG I++P+ + +IGF+ K++TA+S+  I   +++ +  NL  RHP   
Sbjct: 1663 FASAVATASGVGGGAIYIPLFNALIGFELKASTALSQACITAGSLAALGSNLHRRHPLRP 1722

Query: 149  MP--IIDYDLALLIQPMLMLGISI 170
                ++D+ L L++ P+L++G S+
Sbjct: 1723 EAWHLVDFRLMLVLTPVLLVGSSL 1746


>gi|401426825|ref|XP_003877896.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494143|emb|CBZ29440.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 511

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 109/405 (26%), Positives = 176/405 (43%), Gaps = 52/405 (12%)

Query: 101 GGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP--TLDMPIIDYDLAL 158
           GG + +PMLS         A  +S+  I G +   +Y+ ++ +HP  + D P+I+Y    
Sbjct: 123 GGLMEIPMLS---------AVGMSQSTICGQSTLNMYFAIQEKHPDSSWDRPLINYQYLG 173

Query: 159 LIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI-------GTSTKAFLKGV-------- 203
           L+ P  ++G  +G   + +  D +  +LL+VL         GT  K + +          
Sbjct: 174 LLLPPGLIGTLVGGILSKLCPDVLRLILLLVLLSVVLYRSWGTMKKQYRQDTNPSHVTVE 233

Query: 204 -----ETWKRET-ILKKEAARCLGSNGAGAGEV---EYKSLPSGPRSGPQKDAMEPEVTI 254
                ET  RE      E+ RC  +  AG  +    E  ++ S P   PQ     P+   
Sbjct: 234 TGNANETSHRENHDNNDESQRCELTEKAGGVKRELGENTAILSTPEQSPQSLRCPPQ--- 290

Query: 255 LENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYK 314
                 +EL L    ++  L   I + +      +YW+  L+ +    VV  +    L  
Sbjct: 291 -SQYPQQELSLNFACFLVLLLFNIFRTYAVCGGFLYWLCVLVPVAFLSVVFFFNREKL-- 347

Query: 315 GRRVIASKGDDGKSFRVFQL--VSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
             R +A       +F   Q   V+Y    VLAG    +LG+GGG ++G +  E+G+ PQ 
Sbjct: 348 --RKLAGSNPAQMTFAWTQRNSVAYPMVAVLAGASAAMLGIGGGLVLGFVLNEVGIVPQE 405

Query: 373 SSAT---ATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLG 429
           +S T   ATF + FSS++ ++   +     V + + F  V   ++ +GQ V    I   G
Sbjct: 406 ASVTGGMATFFIAFSSALQLL---ITGSLVVDFGIVFSIVGLCSSALGQLVFMTYIKSHG 462

Query: 430 RASLIIFILAFMIFVSAISLGGVGISNMIGKFHR-HEYMGFENLC 473
            + LII  L F++  S +SLGG GI N +         MGF  LC
Sbjct: 463 LSYLIIGSLIFVVGGSLVSLGGFGIYNAVISIQAGGSVMGFGCLC 507


>gi|301093155|ref|XP_002997426.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110682|gb|EEY68734.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 124

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 66/123 (53%), Gaps = 4/123 (3%)

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP----VPYALYFVAVATIA 413
           + GP+ LE GV P V SATA+F + F++S + +++ +  +FP      Y  +F  V  + 
Sbjct: 2   VKGPIMLEAGVLPAVQSATASFMILFTASSTTLQFAINGQFPGEFQFDYMAWFALVGFVG 61

Query: 414 AFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
            F G   V   +    R S+++++LA  I +SA+++G +G+   +       ++GF  +C
Sbjct: 62  GFCGLKCVGYFVKKYKRESIMVYMLATTIGLSALAMGFIGLKLTLADIESGVHLGFHGIC 121

Query: 474 KYD 476
             +
Sbjct: 122 DNE 124


>gi|294944269|ref|XP_002784171.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897205|gb|EER15967.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 409

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/133 (26%), Positives = 74/133 (55%), Gaps = 1/133 (0%)

Query: 342 VLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVP 401
           ++ G++  ++G+ GG  M P+ L LG+ P+ ++AT +  +  +S+ + + + L   FP  
Sbjct: 277 IVIGLISSIVGISGGLFMIPVVLSLGLDPKQATATTSIVIFATSTSTALSFALGGYFPPA 336

Query: 402 YALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMI-GK 460
             L+ V +  I A +G+ +V +LI   GR S+++ +L  ++ +  I+    GI +++ G 
Sbjct: 337 SDLWIVVMPFIGALLGKTIVARLIAKTGRMSILVLLLGTVVIIGGITTISTGIVSVVNGA 396

Query: 461 FHRHEYMGFENLC 473
            +  E + F + C
Sbjct: 397 LNGDEVVQFGSFC 409



 Score = 47.4 bits (111), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 76/146 (52%), Gaps = 11/146 (7%)

Query: 98  GVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMP--IIDYD 155
           G+GGG +FVP+L LI G   K +TA+S+ +I  A+++   +N   ++   + P  +I + 
Sbjct: 27  GIGGGVLFVPVLRLIGGLQLKESTALSQALIASASLAATLFNCFEQYSARNDPKALIVWP 86

Query: 156 LALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIG----TSTKAFLKGVETWKRETI 211
             +L  P  ++G  IGV        W+ ++ +++L+       S  A+ KG+  WK E  
Sbjct: 87  FVILTLPCTVIGSLIGVYLY----SWLPSLFILILYFCFVCLGSFMAYRKGIRLWKAENG 142

Query: 212 LKKEAARCLGSNGAGAGEVEYKSLPS 237
            K+ A     ++ + + EV  + +PS
Sbjct: 143 AKRRAVDGDSTDMSRSSEVTVE-VPS 167


>gi|146096258|ref|XP_001467747.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398020624|ref|XP_003863475.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134072113|emb|CAM70812.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501708|emb|CBZ36789.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 511

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 173/386 (44%), Gaps = 48/386 (12%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP--TLDMPIIDYDLALLIQPML 164
           PM   ++    +SA  +S+  I G +   +Y  ++ +HP  + D P+I+Y    L+ P  
Sbjct: 120 PMFCGLMEIPMQSAVGMSQSTICGQSTLNMYLIIRQKHPDSSWDRPLINYQYLSLLLPPG 179

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILK----------- 213
           ++G  IG   + +  D +  +LL+VL      +++    + ++++T              
Sbjct: 180 LIGTLIGGILSKLCPDVLRLILLLVLLSVVLYRSWETMKKQYRQDTDPMHVTVEANDANA 239

Query: 214 -----------KEAARCLGSNGAGA--GEVEYKSLPSGPRSGPQKDAMEPEVTILENINW 260
                      K  +R L     GA  G  E  ++   P   P      P+   L+    
Sbjct: 240 TSHRESYDGNGKSQSRELTETAGGAKKGLCEDIAVLPTPEQSPPPIERPPQSQYLQ---- 295

Query: 261 KELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIA 320
           +EL + +  ++  L   I + +T     +YW+  L  +PV+ +     ++  Y  R  + 
Sbjct: 296 QELSMNIACFLVLLLFNIFRTYTVCGGFLYWLCVL--VPVAFL-----SVVFYFNREKLR 348

Query: 321 SKGDDGKSFRVF-----QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSA 375
              +   +   F       V+Y    VLAG    +LG+GGG ++G +  E+G+ PQ +SA
Sbjct: 349 KLTESDPAQMTFTWTQRNSVTYPMVAVLAGGSAAMLGIGGGLVLGFVLNEVGIVPQEASA 408

Query: 376 T---ATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRAS 432
           T   ATF + FSS++ ++   +     V +++ F  V   ++ +GQ V+   I   G + 
Sbjct: 409 TGGMATFFIAFSSALHLL---ITGSLVVDFSVVFCIVGLCSSALGQLVIMNYIKRRGLSY 465

Query: 433 LIIFILAFMIFVSAISLGGVGISNMI 458
           LII  L F++  S ++LGG GI N +
Sbjct: 466 LIIGSLVFVVGGSLVALGGYGIYNAV 491


>gi|159115095|ref|XP_001707771.1| Hypothetical protein GL50803_4181 [Giardia lamblia ATCC 50803]
 gi|157435878|gb|EDO80097.1| hypothetical protein GL50803_4181 [Giardia lamblia ATCC 50803]
          Length = 520

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 81/173 (46%), Gaps = 7/173 (4%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P L+   E+++  +   F A   S GG+GGG IFV ML L  G  P  A  +SK MI G 
Sbjct: 8   PGLEVWMEVIVAIVCALF-AMLASAGGIGGGVIFVSMLQLF-GVSPHVAAPLSKAMIFGG 65

Query: 132 AVSTVYYNL---KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLI 188
           +      N+   +   PT   P I +DL  +I+P  + G  IG   NV+  +W++ VL +
Sbjct: 66  SCVLTCMNIFQHEDNEPT--KPSIIWDLVFIIEPAAVSGALIGALINVVLPEWLLLVLEV 123

Query: 189 VLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRS 241
              + T+ K     + T  +E I   +   C   + A A  ++ +  P  P +
Sbjct: 124 AFLLYTTQKMLRSSLATLNKERIAAGKRLLCTRKSRAPALSIDERGSPHQPST 176



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 10/138 (7%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G+ AGI+G +LG+GGG +  P+ +  G+ P+ +   +T  + F+S  S++ Y ++     
Sbjct: 388 GLFAGILGAMLGIGGGLLKNPILISFGIDPERARTASTVMIAFTSMSSMISYVVIGGLHF 447

Query: 401 PYALYFVAVATIAAFVGQHVVRKLII-LLGRASLIIFILAFMIFVSAISLGGVGISNMIG 459
            YA + + +   A FV  + + +LII      S I F++  +I V    +    ++NMI 
Sbjct: 448 EYA-WPLMLTVGAFFVSGYYLSELIIRCFKTKSFIPFLITALIVVCTCFI----VANMII 502

Query: 460 KF----HRHEYMGFENLC 473
            F          GF +LC
Sbjct: 503 VFIDIAKTGHLPGFTSLC 520


>gi|452819360|gb|EME26420.1| hypothetical protein Gasu_59110 [Galdieria sulphuraria]
          Length = 579

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 132/551 (23%), Positives = 204/551 (37%), Gaps = 115/551 (20%)

Query: 11  LRSVMMIFLNFALAFVCVKAE--RGLKIETLRFNGT-----ERQDSNYLLKA---VNFLW 60
           ++ V+++ +  +L ++ ++AE  RG  I TLR N       +   S +L+      +   
Sbjct: 1   MKHVVLLCILLSLIYIVLQAEQVRGETINTLRLNHPNVASGKEITSRFLVGTPCNSSISC 60

Query: 61  QPEGSGYHHVWPELKFGWEIV----LGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFD 116
            PE      V   +K G        +  ++ F  AA  S G + GG IFVP+L +++ F 
Sbjct: 61  GPELECEEGVCVHVKLGSPFTVRDGMACLVFFVAAAMASAGAIAGGPIFVPVLEVMLNFA 120

Query: 117 PKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNV 176
              A  +S+ MI G AV+        RHP  D P + Y++   + P+ ++G  IG   N 
Sbjct: 121 VSQAGGVSQSMITGMAVAVSTIAFLKRHPERDRPAVYYEIVTSMVPICLMGTFIGTYVNQ 180

Query: 177 I----FADWMVTVLLIVLFIGTSTKAF-LKGVETWKRE---------------------- 209
           +    F  ++V ++LI L I TS KA  L+ +E  +R+                      
Sbjct: 181 VLPGYFTAFVVVIILIYLVITTSMKAMSLRRIELHERDLRNTGTSSDYDMADILVSQNPQ 240

Query: 210 -----TILKKEAARCLGSNGAGAGEV----------EYKS---LPS------------GP 239
                   K+ +     SN AG  E           EY S   LPS            G 
Sbjct: 241 DSNNTNATKRNSFSKSLSNAAGTCEEHAWTTTNDSDEYNSNLNLPSQRNNNNNNNATTGN 300

Query: 240 RSGPQKDAMEPEVTIL------------------------------ENINWKELGLLVFV 269
            +G    A  P V I                                   W +L LLV  
Sbjct: 301 ENGESSSACLPNVQITRIFAKIPFLGAYLHPPKSSPDIDSMLKTERRRFQWGDLLLLVTT 360

Query: 270 W---VAFLGLQIAKNHTAS------CSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIA 320
           W   + F+ L+  K    S      C  +YW L  +   V L++S    + LY   +   
Sbjct: 361 WSFVIIFVALRGGKPRIISPLGVHLCGWIYWFLLAILELVLLIISSVTMLRLYSLHQHRV 420

Query: 321 SKG----DDGKSFRVFQLVSYCAFGVLAGIVGGLLGLG-GGFIMGPLFLELGVPPQVSSA 375
             G         +    L+ Y  F  + G+V   +G+       G L + LGV P V   
Sbjct: 421 RLGYLFCRSDVRWTPKTLILYGIFCFVCGLVASWVGISVETLAAGFLLVVLGVDPLVVQL 480

Query: 376 TATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLII 435
           T      F+SS    E          YAL++     + A VG  VV   +      S+I+
Sbjct: 481 TGGVINLFTSSAIAAESAANGSLAWRYALFYAGFTFLGALVGVLVVGHFVKKYHLKSIIV 540

Query: 436 FILAFMIFVSA 446
           F L F + V+ 
Sbjct: 541 FCLVFFLIVAT 551


>gi|401412944|ref|XP_003885919.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325120339|emb|CBZ55893.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 486

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 70/135 (51%), Gaps = 9/135 (6%)

Query: 116 DPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN 175
           D   A  +SK  I GA + +V +NL  + P+ ++ +I Y+LA +++P  ++G  +GV  N
Sbjct: 41  DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLMGGVLGVLLN 100

Query: 176 VIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLG-------SNGAGAG 228
           ++ +D  +   L+V+   T+ K   +G+  ++ E+ L  E A  L        S     G
Sbjct: 101 IVMSDIQIICCLVVVLSFTTYKTTRRGIIQYQTESRLLAERAARLSPMDRQPLSAWDREG 160

Query: 229 EVEYKSLPSG--PRS 241
           E E  SL +G  PRS
Sbjct: 161 EEETSSLLAGDEPRS 175


>gi|294948280|ref|XP_002785684.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239899707|gb|EER17480.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 98/395 (24%), Positives = 179/395 (45%), Gaps = 48/395 (12%)

Query: 98  GVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYN--LKLRHPTLDMPIIDYD 155
           G+GGG +FVP+L +I     + A A S+ ++  A ++ V  +  ++L   +    II+  
Sbjct: 27  GMGGGTLFVPVLHIIGMLSARDAAATSQVLVASATLAKVLSSVYIQLHGNSSVESIINLP 86

Query: 156 LALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT----STKAFLKGVETWKRETI 211
            A+L+ P +++G  +GV        W+  +  +VL++ T    S   F KG+  WK ET 
Sbjct: 87  YAVLMLPPMVVGGLLGVYLY----TWLPEIFQLVLYVFTAILASCMGFKKGIGLWKSET- 141

Query: 212 LKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVT-----ILENINWKE---L 263
              E A   G  G              P     +D + P VT     I  ++ +K+   +
Sbjct: 142 ---EGASSAGQLGVV------------PPPCVTEDTVLPSVTSRARSISLSLKYKKAILI 186

Query: 264 GLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKG 323
             L+ VW+A +  ++    +++ SI         I +     LY A+S+     ++A  G
Sbjct: 187 TTLLAVWIAVILSRLLLGSSSTPSI---------IGIPYCTGLYWALSVIVCIMLMAVPG 237

Query: 324 ---DDGKSFRVFQLVSYCAFGVLA-GIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATF 379
                 KS  + +L    +  +L  G +  L+G GGG ++ PL L + + PQ ++AT + 
Sbjct: 238 LFVVAIKSAAMLKLAVKLSGAMLCIGFIAALVGQGGGSLITPLLLYMELNPQQAAATGSV 297

Query: 380 AMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILA 439
            M  +SS   + + L    P    ++   +    A +G  ++ +L++   R S +  +LA
Sbjct: 298 VMLITSSSLALSFGLGGFLPAASDMWIAVLPFFGALLGDLLLSRLLLWSRRLSFLALLLA 357

Query: 440 FMIFVSAISLGGVGISNMIGKFHRHEY-MGFENLC 473
            +    A+ +   GI N+I ++ R E  + F  LC
Sbjct: 358 ALAMTGAMVIFTTGIVNVISQYRRGESPLSFGTLC 392


>gi|403340471|gb|EJY69522.1| hypothetical protein OXYTRI_09740 [Oxytricha trifallax]
          Length = 715

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/172 (29%), Positives = 87/172 (50%), Gaps = 16/172 (9%)

Query: 286 CSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGK----------SFRVFQLV 335
           C I+ W +      ++L + +   I ++  R+V+  +G   K           F    ++
Sbjct: 523 CGILDWTI------LALFLIMCACICIFSVRKVVKEQGLKAKYNLGLASSDIRFDRQAVM 576

Query: 336 SYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLL 395
           +   FG + G V G LGLGGG +  P+ L +GVPP VSS+T  + + FS+S S + Y L 
Sbjct: 577 NIVVFGFIGGWVSGALGLGGGAVFNPVLLSMGVPPSVSSSTGMYMIMFSTSGSSIVYILY 636

Query: 396 KRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
                 +A++     +IA+ VG  ++ K +    R S I+F+L  ++ +SA+
Sbjct: 637 GMLNYQFAMWLGFWCSIASLVGLQMLNKFVKKYNRQSPIVFLLGLILGLSAL 688



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 57/109 (52%)

Query: 103 GIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQP 162
           G+ +P       F  K+A A+S   I+  ++    Y L   HP  D  IIDY LA ++ P
Sbjct: 46  GVVIPFCMTFFTFQTKNAIAMSGFTILTCSIVKFVYGLNEHHPEKDAVIIDYSLATIMLP 105

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETI 211
           ++M+G  IGV  N++F   ++  +L ++ +  + ++  K    +K+E +
Sbjct: 106 VVMMGSMIGVLLNIMFPSLLLQTILTIVLLLLTWQSLGKARTIYKKENL 154


>gi|403352629|gb|EJY75833.1| hypothetical protein OXYTRI_02776 [Oxytricha trifallax]
          Length = 366

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 59/113 (52%), Gaps = 10/113 (8%)

Query: 115 FDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMP---IIDYDLALLIQPMLMLGISIG 171
           FD K+A AIS   I   +V+   Y L  +HP  D     II+Y+LA+++ P +M+G   G
Sbjct: 118 FDTKNAIAISNFAIFTCSVTRYIYTLDKKHPHPDKKQNVIIEYNLAIVMLPTVMMGSLTG 177

Query: 172 VAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWK-------RETILKKEAA 217
           V  N+IF    +  +L  L I  S ++ +KG +  K       +  ILK++ A
Sbjct: 178 VFLNIIFPAIALQAILTALLIFLSLQSLMKGKDIQKFKGQSHNKSRILKQKKA 230


>gi|449016905|dbj|BAM80307.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 804

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/138 (23%), Positives = 65/138 (47%), Gaps = 5/138 (3%)

Query: 115 FDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAF 174
           F    A+A S+ +I+GA+++   YN + RH   D P ID++L +   P  + G + G   
Sbjct: 131 FPIPQASANSQSLILGASIANSVYNFRKRHVIRDAPRIDWNLVISTMPFFLCGTTPGYFL 190

Query: 175 NVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVE--Y 232
           N+    +    +L  +    + ++FL G    +R+  +++E  R      AG+  ++   
Sbjct: 191 NISLPGYFTGFVLAAMLGALTIQSFLSGTRMTRRQWRMRREFLR---QEAAGSAPLDGPA 247

Query: 233 KSLPSGPRSGPQKDAMEP 250
            S P+ P +      ++P
Sbjct: 248 ASKPTAPTASDGTSVVQP 265


>gi|118401229|ref|XP_001032935.1| hypothetical protein TTHERM_00470450 [Tetrahymena thermophila]
 gi|89287281|gb|EAR85272.1| hypothetical protein TTHERM_00470450 [Tetrahymena thermophila SB210]
          Length = 1325

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 78/149 (52%), Gaps = 12/149 (8%)

Query: 71   WPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMG 130
            +PE+ FG  IV+ TI+GF     G   G+GGG    P+L  +  +D K + A+   ++  
Sbjct: 899  FPEI-FG-SIVIITILGF-----GQAAGIGGGTSITPILLALFLYDTKKSVALVILLVFS 951

Query: 131  AAV-STVYYNLKLRHPTLD-MPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLI 188
            +++ +T+  +   R  T D +P+I+Y L L+  P L++G   GVA N       V +LL+
Sbjct: 952  SSLGNTIQIS---RERTHDGVPVIEYRLILVTLPTLIVGTVYGVAVNKFLPSIAVCILLV 1008

Query: 189  VLFIGTSTKAFLKGVETWKRETILKKEAA 217
            +L      K++L+     K E  L K ++
Sbjct: 1009 ILLAQQIQKSYLRYKNMRKNELKLIKNSS 1037


>gi|358344799|ref|XP_003636474.1| hypothetical protein MTR_042s0017 [Medicago truncatula]
 gi|355502409|gb|AES83612.1| hypothetical protein MTR_042s0017 [Medicago truncatula]
          Length = 107

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 34/69 (49%), Positives = 44/69 (63%), Gaps = 2/69 (2%)

Query: 216 AARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLG 275
           AAR   +  +GA EVEYK L +G     +K A   +VTI+EN+  KE GLL FVWV+ + 
Sbjct: 3   AARRQETIDSGA-EVEYKPLSTGSNGALEKGAKMHKVTIIENVCRKEFGLLAFVWVSLIA 61

Query: 276 LQIAK-NHT 283
            QIAK N+T
Sbjct: 62  AQIAKQNYT 70


>gi|403365874|gb|EJY82729.1| hypothetical protein OXYTRI_19655 [Oxytricha trifallax]
          Length = 683

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/169 (29%), Positives = 90/169 (53%), Gaps = 10/169 (5%)

Query: 286 CSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRV-------IASKGDDGKSFRVFQLVSYC 338
           CS + W + ++ I    ++S Y      + +++       IAS         +F+LV   
Sbjct: 491 CSTMDWAILMIFITFCSMLSAYSIFIAVREQKLKTKYSMGIASSDIQFNKPAIFKLV-IS 549

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           AF  + G V G LGLGGG I  P+ L +GVPP+V+SAT  + + FS+S S V Y + +  
Sbjct: 550 AF--IGGTVSGALGLGGGAIFNPILLSMGVPPKVASATGMYMIMFSTSGSSVIYVMYRML 607

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
            + Y  +    ++  + +G +++ K++ +  R S ++F L F++ +SA+
Sbjct: 608 NIQYGFWLGFWSSSGSILGMYLLNKVVKMYNRQSPVVFCLVFVLALSAV 656



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 57/108 (52%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           P       F+ K+A A+S   I+  +++    ++  RHP  +  +IDY+LA ++ P++++
Sbjct: 100 PFCMTFFSFETKNAIALSGLTILSCSITRYIVSINERHPERNSVVIDYNLAAIMLPIVLV 159

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKK 214
           G  IGV  NV F    + ++L ++ +  +     K    +K+ET + K
Sbjct: 160 GSMIGVLVNVAFPSLYLQIMLTLVLLSLALHTAYKARFIYKQETEMLK 207


>gi|308160584|gb|EFO63064.1| Hypothetical protein GLP15_1793 [Giardia lamblia P15]
          Length = 520

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/151 (31%), Positives = 73/151 (48%), Gaps = 7/151 (4%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P L+   E+++  +   F A   S GG+GGG IFV ML L  G  P  A  +SK MI G 
Sbjct: 8   PGLEVWMEVIVAIVCALF-AMLASAGGIGGGVIFVSMLQLF-GVSPHVAAPLSKAMIFGG 65

Query: 132 AVSTVYYNL---KLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLI 188
           +      N+   +   PT   P I +DL  +I+P  + G  IG   NV+  +W++ VL +
Sbjct: 66  SCVLTCINIFQYEDDEPT--KPSIIWDLVFIIEPAAVSGALIGALINVVLPEWLLLVLEV 123

Query: 189 VLFIGTSTKAFLKGVETWKRETILKKEAARC 219
           +  + T+ K     + T  +E I   +   C
Sbjct: 124 IFLLYTTQKMLRNSLTTLNKERIATGKKPVC 154



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 65/123 (52%), Gaps = 8/123 (6%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G+ AGI+G +LG+GGG +  P+ +  G+ P+ +   +T  + F+S  S++ Y ++     
Sbjct: 388 GLFAGILGAMLGIGGGLLKNPILISFGIDPERARTASTVMIAFTSMSSMISYTVIGGLHF 447

Query: 401 PYA-LYFVAVATIAAFVGQHVVRKLIILLGRA-SLIIFILAFMIFVSAISLGGVGISNMI 458
            YA    + V T   FVG + + +L+I   R  S I FI+  +I V    +    ++NMI
Sbjct: 448 EYAWPLMLTVGTF--FVGGYYLSELMIRCFRTKSFIPFIITALIVVCTCFI----VANMI 501

Query: 459 GKF 461
             F
Sbjct: 502 VVF 504


>gi|302828094|ref|XP_002945614.1| hypothetical protein VOLCADRAFT_85766 [Volvox carteri f.
           nagariensis]
 gi|300268429|gb|EFJ52609.1| hypothetical protein VOLCADRAFT_85766 [Volvox carteri f.
           nagariensis]
          Length = 136

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 59/104 (56%), Gaps = 1/104 (0%)

Query: 371 QVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGR 430
           QV++A+++  + FSSS +++++ LL R    YA+ F A + +A  +G   V + I   GR
Sbjct: 33  QVTAASSSTMVLFSSSAALIQFILLHRLNTDYAIVFGAASLVAGLLGTQAVSRAIKRSGR 92

Query: 431 ASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLCK 474
            S+++  LA +I ++ + +   G+ N   +    E +GF  +CK
Sbjct: 93  PSVVVLALAGVIGIATLCVAIFGLRNAAVQLRAGE-LGFLGICK 135


>gi|449017188|dbj|BAM80590.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 606

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 67/142 (47%), Gaps = 15/142 (10%)

Query: 44  TERQDSNYL--LKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGG 101
           TERQ  N    + AV  L+ P             F W   +  ++ F      + GG+GG
Sbjct: 64  TERQVCNTNTGICAVKDLFHP-------------FLWSDGVLIVLVFLATGLSNAGGIGG 110

Query: 102 GGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQ 161
           G + VP+LSL  GF  K+A+A ++ +I GA+++    N+  RH   D P ID++L     
Sbjct: 111 GILLVPLLSLFAGFTLKNASANAQPLIFGASIANAMVNVPRRHAARDTPRIDWNLVACTV 170

Query: 162 PMLMLGISIGVAFNVIFADWMV 183
           P  ++G + G   N  F  +  
Sbjct: 171 PFFLMGTTPGTFMNQAFPSYFT 192



 Score = 42.4 bits (98), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 101/235 (42%), Gaps = 19/235 (8%)

Query: 259 NWKELGLLVFVWVAFLGLQI----AKNHTAS----CSIVYWVLDLLQIPVSLVV---SLY 307
            W+ +  L+  W+    +QI    A+ H+      C  VYW+L  +Q  V   +   +++
Sbjct: 371 QWRYMLFLLACWLVLFIIQILSGSAEKHSPVGAPLCGAVYWILFAIQESVLFAMGCFTIW 430

Query: 308 EAISLYKGRRVIA----SKGDDGKSFRVFQLV-SYCAFGVLAGIVGGLLGLGGGFIMGPL 362
             +SL + R  +     SK   G       L+  Y  +  +AG+ G  +G+GG  ++ P 
Sbjct: 431 RNLSLNRLRDRLGYPWYSKNGLGDIRWTRALIFGYPVWAFIAGVFGSWVGIGGSSLLIPF 490

Query: 363 FLELG-VPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVV 421
              +G   P    ++ + +    S+     Y    R  + Y L+F   A + ++ G  +V
Sbjct: 491 LNLVGRADPATVQSSMSLSNLLGSASGAFVYLAQGRLNISYGLFFGLFALLGSYTGVWMV 550

Query: 422 RKLIILLGRASLIIFILAFMIFVSAI-SLGGVGISNMIGKFHRHEYMGFENLCKY 475
             L+      +L +F L  + FV+A+ ++  + I N +G         F N+C Y
Sbjct: 551 YFLVERYQIRALFVFALT-ICFVTALGAILYIAIRNTLGVEAAGVGWVFTNICAY 604


>gi|301118274|ref|XP_002906865.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108214|gb|EEY66266.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 260

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/200 (28%), Positives = 94/200 (47%), Gaps = 25/200 (12%)

Query: 257 NINWKELGLLVFVWVAFLGLQI---AKNHTASCSIVYWVLDLLQIPVSLVVS--LYEAIS 311
           NI W++L  L  ++V   G+++    KN  +              PV +  S  LY  + 
Sbjct: 71  NIPWRKLATLASLFVVVAGMRVLRGGKNFDS--------------PVGIDSSSTLYPVLQ 116

Query: 312 LYKGRRVIASKGD-DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPP 370
              G  + A +      S R F + S     + AG V G+ G+GGG I GPL LE+GV P
Sbjct: 117 QSGGYELEAHEIKWTPSSIRYFPMCS-----LAAGAVSGMFGIGGGIINGPLLLEVGVDP 171

Query: 371 QVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGR 430
             +SA     + FSS MS + Y  + +  +  A   + +  +  ++G   + KL+     
Sbjct: 172 SAASAMTATTVLFSSGMSSLNYAAMGKMDLHLAQLMLPMGLVTTYIGHLCLLKLVRHYNC 231

Query: 431 ASLIIFILAFMIFVSAISLG 450
            S+IIF +A ++ +SA+++ 
Sbjct: 232 PSMIIFSMATIVLISAVAMS 251


>gi|157873729|ref|XP_001685369.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128441|emb|CAJ08542.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 511

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 93/400 (23%), Positives = 171/400 (42%), Gaps = 45/400 (11%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP--TLDMPIIDYDLALLIQPML 164
           P+   ++    +S   +S+  I G +   +Y  ++ RHP  + D P+I+Y    L+ P+ 
Sbjct: 120 PLFCGLMEVPMQSGVGMSQSAICGQSTLNIYLAIQQRHPDSSWDRPLINYQYLSLLLPLG 179

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRET-------------- 210
           ++G  IG   + +  D +  +LL+VL      +++    + ++++T              
Sbjct: 180 LIGTLIGGVLSKLCPDVLRLILLLVLLSVVLYRSWETMKKQYRQDTDPMHVTVEASDANA 239

Query: 211 --------ILKKEAARCLGSNGAGAGEV---EYKSLPSGPRSGPQKDAMEPEVTILENIN 259
                      K  +R L     GA +V   +   LP+ P   P      P+        
Sbjct: 240 TSHQKSYDSTGKSQSRELTETAGGAKKVLGEDIAVLPT-PEQSPPSIERPPQ----SQYP 294

Query: 260 WKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVI 319
            +EL +    ++  L   I + +T     +YW+  L  +PV+ +     ++  Y  R  +
Sbjct: 295 QQELSMNFACFLVLLLFNILRTYTVCGGFLYWLCVL--VPVAFL-----SVVFYFNREKL 347

Query: 320 ASKGDDGKSFRVF-----QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSS 374
               +   +   F       V+Y    VLAG+   +LG+GGG ++G +  E+G+ PQ +S
Sbjct: 348 RKLAESDPAQMTFTWTERNSVTYPMVAVLAGVSAAMLGIGGGLVLGFVLNEVGIAPQEAS 407

Query: 375 ATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLI 434
            T+     F +  SV++  +     V + + F  V   ++ +GQ V+   I   G   LI
Sbjct: 408 VTSGMTTFFIAFSSVLQLLITGSLVVDFGVVFCIVGLCSSALGQLVLMNYIKRRGLNYLI 467

Query: 435 IFILAFMIFVSAISLGGVGI-SNMIGKFHRHEYMGFENLC 473
           I  L F++  S ++LGG GI S +I        + F  LC
Sbjct: 468 IGSLVFVVGGSLVALGGYGIYSAVISTQTGGSVLAFGRLC 507


>gi|397620930|gb|EJK66001.1| hypothetical protein THAOC_13100 [Thalassiosira oceanica]
          Length = 344

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 57/112 (50%)

Query: 335 VSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL 394
           V Y A    AG   G+ G+GGG + GPL L +GV P+VSSA++   + F+S  +   + +
Sbjct: 205 VVYPAVCTAAGFFAGMFGVGGGIVKGPLMLAMGVHPKVSSASSACMILFTSFTATTSFVV 264

Query: 395 LKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSA 446
                + YA   + +   A  VGQ  +  L+    R S I F +  ++ +SA
Sbjct: 265 FGLLDMEYAPVCLLLGFAATLVGQIGLIYLMERFQRNSYIAFSIGAIVALSA 316


>gi|397638312|gb|EJK73008.1| hypothetical protein THAOC_05398, partial [Thalassiosira oceanica]
          Length = 318

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 50/88 (56%), Gaps = 12/88 (13%)

Query: 313 YKGRRVIAS---KGDDGKSFR-------VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPL 362
           Y+GRR++AS   K   G  FR          L  Y A+ +LAG+V GL+G+GGG ++GPL
Sbjct: 207 YQGRRIVASAAAKVRAGYPFRDRDVRWDAAALRLYGAWTLLAGVVAGLVGIGGGMVLGPL 266

Query: 363 FLELGVPPQVSSATATFAMTFSSSMSVV 390
            L + + P+VS  TAT     SS   VV
Sbjct: 267 MLAMNIDPRVS--TATTGSELSSIARVV 292


>gi|294887822|ref|XP_002772241.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239876300|gb|EER04057.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 116

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 63/116 (54%), Gaps = 1/116 (0%)

Query: 359 MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQ 418
           M P+ L LG+ P+ ++AT +  +  +S+ + + + L   FP    L+ V +  I A +G+
Sbjct: 1   MIPVVLSLGLDPKQATATTSIVIFATSTSTALSFALGGYFPPASDLWIVVMPFIGALLGK 60

Query: 419 HVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMI-GKFHRHEYMGFENLC 473
            +V +LI   GR S+++ +L  ++ +  I+    GI +++ G  +  E + F + C
Sbjct: 61  TIVARLIAKTGRMSMLVLLLGTVVIIGGITTISTGIVSVVKGALNGEEVVQFGSFC 116


>gi|407410866|gb|EKF33150.1| hypothetical protein MOQ_002989 [Trypanosoma cruzi marinkellei]
          Length = 407

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 87/374 (23%), Positives = 161/374 (43%), Gaps = 27/374 (7%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPT--LDMPIIDYDLALLIQPML 164
           PM  L++G     A  +S+  I G ++  V+  ++ R P      P+I+Y    L+ P+ 
Sbjct: 48  PMFCLLMGLPMDFAVGLSQSTICGQSILNVFIAIRKRFPCAGCSRPLINYQYLTLLVPLG 107

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNG 224
           ++G  IG   N +  D +  +LL +L      ++  K +  ++++   ++      G+N 
Sbjct: 108 VIGTLIGGVLNRLCPDLLRLILLFLLLTAVLYRSVRKMIAQYRKDQSERR------GTNT 161

Query: 225 AGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTA 284
             + E E    P+           +P+        W E+  + F ++  L     ++ T 
Sbjct: 162 VSSAE-EVSGTPTLNSPEEVFHVTQPQYP------WIEISCVFFSFIVNLSFGAWRSRTK 214

Query: 285 SCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLA 344
                Y V   L  PV+L ++++     Y+ R     K      +     + Y    V+A
Sbjct: 215 CGGGAYIVAYCL--PVALNIAIFFG---YRHRLSHMEKLSLVFHWSNSTTIFYPLVSVVA 269

Query: 345 GIVGGLLGLGGGFIMGPLFLELGVPPQVSSATA---TFAMTFSSSMSVVEYYLLKRFPVP 401
           GI   +LG+GGG ++G +  ++G+ P+ +S T    T  + FSS++S+V   +   F + 
Sbjct: 270 GIASAMLGIGGGLVLGFILYDVGLIPEEASVTGGVVTLFLAFSSALSLV---IEGHFLLD 326

Query: 402 YALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKF 461
           Y  +  A   ++   GQ V+ +LI       LII  L  +I  S + L   GI N +   
Sbjct: 327 YGGFLFACGIVSTLFGQFVLMRLIKKYKLRFLIIAALVTIIAGSLVFLTSYGIYNSLNVT 386

Query: 462 HR-HEYMGFENLCK 474
                 + F  LC+
Sbjct: 387 RSGGSIIAFGRLCR 400


>gi|119873364|ref|YP_931371.1| hypothetical protein Pisl_1881 [Pyrobaculum islandicum DSM 4184]
 gi|119674772|gb|ABL89028.1| protein of unknown function DUF81 [Pyrobaculum islandicum DSM 4184]
          Length = 243

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 47/82 (57%), Gaps = 9/82 (10%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P     W   LG ++ F G    S+ GVGGG IFVP L LI GFDPKSA A+S  +I   
Sbjct: 116 PRRNIPW---LGLLLIFIGGFASSLFGVGGGTIFVPALILISGFDPKSAAAMSMGIIFPT 172

Query: 132 AVS--TVYYNLKLRHPTLDMPI 151
           AVS  T+Y  L +    L++P+
Sbjct: 173 AVSSTTIYAALGV----LNLPL 190


>gi|294891216|ref|XP_002773478.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239878631|gb|EER05294.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 385

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 80/306 (26%), Positives = 136/306 (44%), Gaps = 42/306 (13%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           +++P    ++G D   A  +SK  I G AVS   +NL+ RHPT D P+ID D+ LL++P+
Sbjct: 35  LYMPAFVAVMG-DAHWAVPLSKVAINGVAVSATIFNLRQRHPTYDQPLIDLDIGLLLEPL 93

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSN 223
            +LG  +GV  NV      +   L+++   T+   F K ++  +    L+++A+   G  
Sbjct: 94  TLLGSMLGVYLNVAMTSVEIFSCLVLVLSITAALTFRKAIQRRR----LEEDASVDDGLG 149

Query: 224 GAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILE---NINWKELGLLVFVWVAFLGLQIAK 280
           GA  G +   S  + P SG  +  ++    IL    ++   +   L+ +W+A   L    
Sbjct: 150 GAEMGLLVSAS--ARPSSGVDRSVVDKASRILREEASLQPMKAWALLVLWLANGALLYLA 207

Query: 281 NHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDG----KSFRVFQLVS 336
              A         +LL    +  V L     +Y GRR + S+  D      S  V+  V+
Sbjct: 208 EGPA---------ELLCGGTAQKVPLLSGYGIY-GRRFMYSRYWDEAGLPPSPVVYNKVN 257

Query: 337 YCAFGVLAGIVG------------------GLLGLGGGFIMGPLFLELGVPPQVSSATAT 378
              + +L+   G                   L  L      GPL L+LG+ PQ ++AT+ 
Sbjct: 258 TIVYPLLSCFAGVCAGCLGIGGGLIKVQCCSLGKLSSAGCQGPLLLQLGMVPQAATATSI 317

Query: 379 FAMTFS 384
           + + F+
Sbjct: 318 WMILFT 323


>gi|315230493|ref|YP_004070929.1| hypothetical protein TERMP_00729 [Thermococcus barophilus MP]
 gi|315183521|gb|ADT83706.1| hypothetical protein TERMP_00729 [Thermococcus barophilus MP]
          Length = 254

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 47/87 (54%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + +SY A GV+ GI+  L GLGGGF++ P    LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLRYISYFAVGVVIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSAL 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y   KR      L   + A I A++G
Sbjct: 61  AYSRQKRVHYKVGLLLASTAIIGAYIG 87


>gi|294944271|ref|XP_002784172.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897206|gb|EER15968.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 180

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 65/124 (52%), Gaps = 2/124 (1%)

Query: 97  GGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMP--IIDY 154
            G+GGG +FVP+L LI G   K ++++S+ ++  +A++   +N   ++   + P  +I +
Sbjct: 3   AGIGGGILFVPVLRLIGGLSQKESSSLSQALVAASALAANLFNFYAQYRAKNEPKALIVW 62

Query: 155 DLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKK 214
              +L+ P  ++G  IG+         +  +L  ++    S  A+ KG + WK ET  K+
Sbjct: 63  PFVILMLPCAVVGSLIGIYLYSWLPSLLQLILYFIVACFGSLAAYRKGYKLWKAETDAKE 122

Query: 215 EAAR 218
            A R
Sbjct: 123 SAIR 126


>gi|294925749|ref|XP_002778995.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239887841|gb|EER10790.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 397

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 76/304 (25%), Positives = 136/304 (44%), Gaps = 36/304 (11%)

Query: 104 IFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPM 163
           +++P    ++G D   A  +SK  I G AVS   +NL+ RHPT D P+ID D+ LL++P+
Sbjct: 35  LYMPAFVAVMG-DAHWAVPLSKVAINGVAVSATIFNLRQRHPTYDQPLIDLDIGLLLEPL 93

Query: 164 LMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSN 223
            +LG  +GV  NV      +   L+++   T+   F K ++  + E     E    LG  
Sbjct: 94  TLLGSMLGVYLNVAMTSVEIFSCLVLVLSITAALTFRKAIQRRRLEGDASVEDG--LGGA 151

Query: 224 GAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHT 283
             G   V+   +    R   ++ +++P       + W   G L++  +A    ++    T
Sbjct: 152 EMGLLGVDRSVVDKASRILQEEASLQPMKAWALFVLWLANGALLY--LAEGPAELLCGGT 209

Query: 284 ASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDG--KSFRVFQLVSYCAFG 341
           A  ++V           ++VVS      + +G R++  + + G   S  V+  V+   + 
Sbjct: 210 AQKAMVS----------TVVVSCILVTGIVRG-RLLRQQDEAGLPPSPVVYNKVNTIIYS 258

Query: 342 VLAGIVG------------------GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTF 383
           +L+   G                   L  L      GPL L+LG+ PQ ++AT+ + + F
Sbjct: 259 LLSCFAGVCAGCLGIGGGLIKVQYRSLGKLSSAGCQGPLLLQLGMVPQAATATSIWMILF 318

Query: 384 SSSM 387
           +SS+
Sbjct: 319 TSSI 322


>gi|145514580|ref|XP_001443195.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410573|emb|CAK75798.1| unnamed protein product [Paramecium tetraurelia]
          Length = 472

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 75/308 (24%), Positives = 133/308 (43%), Gaps = 33/308 (10%)

Query: 101 GGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDM-PIIDYDLALL 159
           GG +  P+L +I+ +    AT I+ C + G  +      +  ++P  +  PII+Y +A++
Sbjct: 75  GGVLKGPLLEMILNYSQSEATHIAYCFMFGGTLLNTALLMFEKNPEDERRPIINYRIAII 134

Query: 160 IQPMLMLGISIGVAFNVIFADWMVTVLLIVLFI-GTSTKAFLKGVETWKRETIL----KK 214
               +    ++G +    F   + T++L  LF+ G +   + K  +  K E  +    KK
Sbjct: 135 FNLAVPFATNLGSSL-ASFLPQLYTLILQELFLFGVAPILWQKAKDAKKAELKISEGSKK 193

Query: 215 EAARCLGSNGAGAGEVEYKSLPSGPRSGPQK--DAMEPEVTI----LENINWKELGLLVF 268
           E  + L  +     ++E +++         K  DA   E        E+ N   L  ++F
Sbjct: 194 ENTQNLNDSNLNP-KIELQTIEGIHEQQEYKISDANSNESLYSHFKQESENILPLMPILF 252

Query: 269 VWVAF----LGLQIAKNHTASCSIV-----YWVLDLLQIPVSLVVSLYEAISLYKGRRVI 319
           +  +F    + +Q+   +    S V      W  D +   + L   LY+ +    G +  
Sbjct: 253 ILGSFGLNQIFIQMRSTNPNKPSYVGIEDCTWQNDFMIFILILANVLYDYLIWQFGTK-- 310

Query: 320 ASKGDDGKSF----RVFQLVSY----CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
             K  +  +F    R F  +S      A G LAG V G LG+G GF+M P  L  G+ P+
Sbjct: 311 QEKQFNQINFLPKERYFTPISKFFKIYAGGFLAGFVSGFLGMGAGFVMVPTLLFSGLIPR 370

Query: 372 VSSATATF 379
            +SAT+ F
Sbjct: 371 CASATSAF 378


>gi|240102618|ref|YP_002958927.1| putative permease [Thermococcus gammatolerans EJ3]
 gi|239910172|gb|ACS33063.1| Predicted permease [Thermococcus gammatolerans EJ3]
          Length = 254

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 46/87 (52%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + +SY A GV  GI+  L GLGGGF++ P    LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLKYLSYFAVGVFIGILAALFGLGGGFLVVPTLNLLGVEMHHAVGTSSAAVVFTSLSSTI 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y   KR      L   + A I A++G
Sbjct: 61  AYSRQKRIHYRVGLLLASTAVIGAYIG 87


>gi|221485444|gb|EEE23725.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 482

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 68/133 (51%), Gaps = 6/133 (4%)

Query: 116 DPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN 175
           D   A  +SK  I GA + +V +NL  + P+ ++ +I Y+LA +++P  +LG  +GV  N
Sbjct: 41  DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLLGGVLGVLLN 100

Query: 176 VIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLG----SNGAGAGEVE 231
           ++  D  +   L+++   T+ K   +G+  ++ E+ L  E A  L     S      E E
Sbjct: 101 IVMTDIQIISCLVLVLSFTTYKTTRRGLLQYQTESRLLAERAAVLSREHHSLSTRDREEE 160

Query: 232 YKSLPSG--PRSG 242
             SL +G  PR+ 
Sbjct: 161 TSSLLAGEEPRAS 173


>gi|405984262|ref|ZP_11042565.1| hypothetical protein HMPREF9451_01696 [Slackia piriformis YIT
           12062]
 gi|404388094|gb|EJZ83178.1| hypothetical protein HMPREF9451_01696 [Slackia piriformis YIT
           12062]
          Length = 274

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 53/94 (56%), Gaps = 5/94 (5%)

Query: 336 SYCA----FGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSMSVV 390
           +YCA     GV+AG++ G +G+GGGFIM PLF+  L +P +++S T+  A+   S    +
Sbjct: 152 AYCAKVALIGVVAGLLSGYVGVGGGFIMVPLFMTLLSIPMRLASGTSLIAVCILSVPGTI 211

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           E  LL        +   A +   AFVG  +V+K+
Sbjct: 212 EQALLGNIHFLLGIAMAAGSIPGAFVGASMVKKV 245


>gi|167521960|ref|XP_001745318.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776276|gb|EDQ89896.1| predicted protein [Monosiga brevicollis MX1]
          Length = 512

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/157 (24%), Positives = 68/157 (43%), Gaps = 17/157 (10%)

Query: 106 VPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDM---PIIDYDLALLIQP 162
           VPML ++  F    A  +S   I+G ++      ++ +HP   +    +I+Y + LL+ P
Sbjct: 45  VPMLLVVGDFPTYYAIPLSVTAIVGGSIVRFIMQVQRKHPNPKVAHRQLINYPMVLLLLP 104

Query: 163 MLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARC--- 219
           M + G  IGV  N +  +W++   + ++   TS K   KG E   +E     + A     
Sbjct: 105 MALAGTVIGVLLNSVAPNWLILATIFLVLTYTSFKTLKKGKELRAKEKEAHAQMAATELH 164

Query: 220 -----LGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPE 251
                +  NG    +V + +  SG         ++PE
Sbjct: 165 TMELIVDDNGDNKNKVPHVADDSG------DSGIDPE 195



 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 44/78 (56%)

Query: 372 VSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRA 431
           VSSAT ++   F+S  S  ++ +L R PV Y +    +A +A+ +GQ  +   +   G+ 
Sbjct: 371 VSSATTSYMTLFTSISSFTQFLVLNRVPVDYGILLFFLAAVASVLGQLALNSYVRKTGKN 430

Query: 432 SLIIFILAFMIFVSAISL 449
           S+I +IL  +I ++ + L
Sbjct: 431 SIIAYILGVIISLATVLL 448


>gi|294944267|ref|XP_002784170.1| cgmp-dependent protein kinase, putative [Perkinsus marinus ATCC
           50983]
 gi|239897204|gb|EER15966.1| cgmp-dependent protein kinase, putative [Perkinsus marinus ATCC
           50983]
          Length = 311

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 72/140 (51%), Gaps = 12/140 (8%)

Query: 98  GVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLA 157
           G+GGG +FVP+L LI G   K +TA+S+ +I  A+++   +N   ++   + P      A
Sbjct: 4   GIGGGVLFVPVLRLIGGLQLKESTALSQALIASASLAATLFNCFEQYSARNDP-----KA 58

Query: 158 LLIQPMLMLGIS---IGVAFNVIFADWMVTVLLIVLFIG----TSTKAFLKGVETWKRET 210
           L++ P ++L +    IG    V    W+ ++ +++L+       S  A+ KG+  WK E 
Sbjct: 59  LIVWPFVILTLPCTVIGSLIGVYLYSWLPSLFILILYFCFVCLGSFMAYRKGIRLWKAEN 118

Query: 211 ILKKEAARCLGSNGAGAGEV 230
             K+ A     ++ + + EV
Sbjct: 119 GAKRRAVDGDSTDMSRSSEV 138


>gi|408404324|ref|YP_006862307.1| sulfonate transporter [Candidatus Nitrososphaera gargensis Ga9.2]
 gi|408364920|gb|AFU58650.1| putative sulfonate transporter [Candidatus Nitrososphaera gargensis
           Ga9.2]
          Length = 256

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 47/87 (54%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           +  +V   A G+ AG +G ++G+GGG IM P    LG+PP  +++T+  A+T +S  S +
Sbjct: 5   ITTIVVLIAVGLGAGTLGSMIGVGGGIIMVPALTFLGLPPAQAASTSLIAVTSTSVSSTI 64

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
           EY   KR      L   A A   A +G
Sbjct: 65  EYSRQKRIDYRLGLTMSAFAVPGAVLG 91


>gi|237835457|ref|XP_002367026.1| hypothetical protein TGME49_046180 [Toxoplasma gondii ME49]
 gi|211964690|gb|EEA99885.1| hypothetical protein TGME49_046180 [Toxoplasma gondii ME49]
 gi|221506300|gb|EEE31935.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 482

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 66/136 (48%), Gaps = 8/136 (5%)

Query: 116 DPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN 175
           D   A  +SK  I GA + +V +NL  + P+ ++ +I Y+LA +++P  +LG  +GV  N
Sbjct: 41  DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLLGGVLGVLLN 100

Query: 176 VIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSL 235
           ++  D  +   L+++   T+ K   +G+  ++ E+ L  E A  L          E+ SL
Sbjct: 101 IVMTDIQIISCLVLVLSFTTYKTTRRGLLQYQTESRLLAERAAVLSR--------EHHSL 152

Query: 236 PSGPRSGPQKDAMEPE 251
            +  R       +  E
Sbjct: 153 STRDREEETSSLLAEE 168


>gi|145499407|ref|XP_001435689.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402823|emb|CAK68292.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 153/343 (44%), Gaps = 26/343 (7%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           P++ +++G   K A   +  M++G ++       K R    + P+I+Y L  +  P+L+ 
Sbjct: 40  PVMMVLLGCPSKKAIWNTYIMLLGGSLGNFLRLGKERTANGNAPLINYQLVQITLPLLLA 99

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           G  +GVA       W+  +++++   G     FLK    + +     KE    L      
Sbjct: 100 GAILGVAT----GKWLPKLIIVIFLFGILMTVFLKTKSLYAKTR--SKEMNEQLIP---- 149

Query: 227 AGEVEYKSLPSGPRSGPQKD-----AMEPEVTILENINWKELGLLVFVWVAFL---GLQI 278
              VE K L     S   K+       +  +  +E +    L +L+ + V  L   G   
Sbjct: 150 ---VELKDLTVQKESNHSKELNILKEKDARLYPIEPLTEISLTILIIIVVTLLKGSGAVP 206

Query: 279 AKNHTASCSIVYWVLDLLQIPVSL--VVSLYEAISLYKG-RRVIASKGDDGKSFRVFQLV 335
           +      C   Y  L+ +   ++   V    + IS  +  R  I     DGK   VF + 
Sbjct: 207 SLLGVDFCGYGYHFLNFVIFGIAFYNVQRYRKQISKDEEYRESIGYDFADGKMSSVFDIT 266

Query: 336 SYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLL 395
              +  + AG +GGL+GLGGG ++ PL+LE G+ P  ++A+ATF + F+SS+SV    L 
Sbjct: 267 VKSS--LYAGFLGGLVGLGGGVVLTPLWLETGINPPRAAASATFTVLFTSSISVFIIALS 324

Query: 396 KRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFIL 438
             +     +    V+++ +++    ++KL+    R S++I +L
Sbjct: 325 GGYQFSEFIILGLVSSLGSYLVAGFLKKLVKKYKRESILIQVL 367


>gi|23099829|ref|NP_693295.1| hypothetical protein OB2374 [Oceanobacillus iheyensis HTE831]
 gi|22778060|dbj|BAC14330.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
          Length = 285

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 68/149 (45%), Gaps = 11/149 (7%)

Query: 281 NHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRV--------F 332
           N   +    Y    LL I +S ++SL +  S  K RR +       +SF +         
Sbjct: 109 NQYVNMDGFYLYFGLLIIVISFIMSLRKKAS--KLRRNVDILEPGARSFELKGEVYIYKL 166

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSSSMSVVE 391
             +S  +  +  G++ GL G+GGG IM P+ + L G P  +++AT+ F + F S M  + 
Sbjct: 167 SFISAISIALFVGMLSGLFGIGGGAIMVPVMMLLFGFPAHLAAATSMFMIIFVSFMGSIT 226

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           +  L      Y L+F+  A I   +G  V
Sbjct: 227 HIYLGNVVWEYVLFFIPGAWIGGKLGAKV 255


>gi|375082782|ref|ZP_09729829.1| permease [Thermococcus litoralis DSM 5473]
 gi|374742630|gb|EHR79021.1| permease [Thermococcus litoralis DSM 5473]
          Length = 254

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + V Y   GV  GI+  L GLGGGF++ P    LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLKYVGYFGVGVFIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSAI 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y+  +R      L   + A I A++G
Sbjct: 61  AYHRQRRIHYKAGLLLASTAVIGAYIG 87


>gi|403360891|gb|EJY80141.1| hypothetical protein OXYTRI_22577 [Oxytricha trifallax]
          Length = 388

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 66/146 (45%), Gaps = 5/146 (3%)

Query: 75  KFGWEIVLGTIIGFFGAAFGSVGGVGGGG----IFVPMLSLIIGFDPKSATAISKCMIMG 130
           K   EI L    G F   F ++  V GG     I +P L + +    +    ++    M 
Sbjct: 34  KTPLEIGLLQTCGLFAIGFMNMFAVAGGLGGGGIVIPFLMIFMKMPIQECIPLANIFAMI 93

Query: 131 AAVSTVYYNLKLRHPTLDMP-IIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIV 189
           ++V+   YN   +HP      IIDY++  L  PM+  G  IGV    +     + +LL +
Sbjct: 94  SSVTRFVYNFNQKHPYRPFRMIIDYEIVTLTMPMVYFGSLIGVYAGSLMNQLTLVILLQI 153

Query: 190 LFIGTSTKAFLKGVETWKRETILKKE 215
           +   T  K F K ++T+ +ET  +++
Sbjct: 154 VLAFTLYKTFQKALQTYIKETNRRRQ 179



 Score = 43.1 bits (100), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 69/119 (57%), Gaps = 3/119 (2%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           ++++V +C    +A ++ G+LG+ GG I+ P+FL LG+ P V++AT  +    S+    +
Sbjct: 266 IYKVVIFC---FIASVLAGILGIAGGIILSPVFLSLGLLPSVTAATNQYIGMISTFSVSL 322

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
           ++    +    YA    AV    A +G  VV +++   GR S+I+FI++F++ +S + L
Sbjct: 323 QFIYKGQLNYSYAYVIGAVVLFTAIIGLSVVERVVKKSGRQSIIVFIISFVLLISFLVL 381


>gi|348677068|gb|EGZ16885.1| hypothetical protein PHYSODRAFT_503702 [Phytophthora sojae]
          Length = 254

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 63/241 (26%), Positives = 106/241 (43%), Gaps = 28/241 (11%)

Query: 213 KKEAARCLGSNGAGA--GEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVW 270
           +KE  +C  S  A A  G    + + S P   PQ D           + W++L  L  ++
Sbjct: 30  RKERWQCCVSPEATALLGIDSSEKVSSTPEKKPQAD-----------VPWRKLATLSSLF 78

Query: 271 VAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGD-DGKSF 329
           V   G++I +      S          + +    +LY  +    G  + A +      S 
Sbjct: 79  VVIAGMRILRGGKDFDS---------PLGIDSSSALYPMLQQTPGYELEAHEIKWTPLSI 129

Query: 330 RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSV 389
           R F       F + AG V G+ G+GGG I GPL LE+G+    +SA     + FSS MS 
Sbjct: 130 RFFPF-----FSLAAGAVSGMFGIGGGIINGPLLLEVGIDASAASAMTATTVLFSSGMSA 184

Query: 390 VEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
             Y ++ +  +  A   + +  +  ++GQ  + K++      SLIIF +A ++ +SAI++
Sbjct: 185 FNYTVMGKTDIHLAQVLLPMGFLMTYIGQLCLLKVVRRFQCPSLIIFSMAVIVLISAIAM 244

Query: 450 G 450
            
Sbjct: 245 S 245


>gi|145493138|ref|XP_001432565.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124399678|emb|CAK65168.1| unnamed protein product [Paramecium tetraurelia]
          Length = 441

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 65/325 (20%), Positives = 143/325 (44%), Gaps = 38/325 (11%)

Query: 96  VGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPT-LDMPIIDY 154
           +GG+G G I  P+L+L++ +    +T ++ C +   +     +    +HP   ++P+I++
Sbjct: 55  MGGLGSGMIKRPILNLLLNYPSSISTQVADCFLFTTSTLNSLFLFFEKHPDHHNLPLINF 114

Query: 155 DLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGT------STKAFL-KGVETWK 207
           D+++++   + L  S G         +++ ++ +   +G          A+L K +E  K
Sbjct: 115 DISIILNQTIPLAWSAGAFLQTRSPKFVIYIIQLCFLMGAIPFLWKYMLAYLQKRIEQDK 174

Query: 208 RETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLV 267
           RE ++     + + +    A E  +           +K   + +   + + +  ++  L 
Sbjct: 175 RENVIL--VNQKIKTQEDMASETNFD----------EKQLTQYQKFYINDHSKFQIKNLC 222

Query: 268 FVWVAFLGLQ---IAKNHTASCSIVYWVLDLLQIPVSLV------VSLYEAISLYKGRR- 317
           F++ +FL  Q   + +++  + SI+   LD   +  +L+      ++L     +Y  +R 
Sbjct: 223 FIFGSFLVNQTIILMRSNKYNNSII--GLDKCTLENNLILLAILGINLTYTFFIYWSKRN 280

Query: 318 ------VIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ 371
                 ++  + +        Q   Y   G LAG + G LG+GGG IM    L   +  +
Sbjct: 281 EEYYKDIVQYRPNQRFFKDKKQFFFYYMAGFLAGFITGFLGMGGGLIMVTFLLSQKIIAR 340

Query: 372 VSSATATFAMTFSSSMSVVEYYLLK 396
            ++ATA F     S  S++++ L K
Sbjct: 341 EAAATAAFGSFMISLNSLIQFILQK 365


>gi|242398129|ref|YP_002993553.1| permease [Thermococcus sibiricus MM 739]
 gi|242264522|gb|ACS89204.1| Predicted permease [Thermococcus sibiricus MM 739]
          Length = 254

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 46/87 (52%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + + Y + G+  GI+  L GLGGGF++ P    LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLKYLGYFSIGIFVGILAALFGLGGGFLVVPTLNLLGVEIHHAVGTSSAAVVFTSLSSAI 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y+  +R      L   + A I A++G
Sbjct: 61  AYHRQRRIHYKAGLLLASTAVIGAYIG 87


>gi|390960289|ref|YP_006424123.1| hypothetical protein CL1_0114 [Thermococcus sp. CL1]
 gi|390518597|gb|AFL94329.1| hypothetical protein CL1_0114 [Thermococcus sp. CL1]
          Length = 238

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 56/105 (53%), Gaps = 4/105 (3%)

Query: 322 KGD-DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATF 379
            GD DG S   ++ V +   G  +G+  GLLG+ GG I  PLF EL G+P + +  T++ 
Sbjct: 114 SGDGDGVSEVKYRHVPF--IGAFSGLASGLLGVSGGIINVPLFHELAGIPVKYAVGTSSL 171

Query: 380 AMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           A+ F+S  +   +Y L +  +  A   V      +FVG H+V ++
Sbjct: 172 ALFFTSLTAAYAHYTLGQVDITTAAMIVPGLMAGSFVGAHLVSRI 216


>gi|127513412|ref|YP_001094609.1| hypothetical protein Shew_2484 [Shewanella loihica PV-4]
 gi|126638707|gb|ABO24350.1| protein of unknown function DUF81 [Shewanella loihica PV-4]
          Length = 257

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 3/85 (3%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A G +AG +  + G GGG +  P  L +G+PP  +  T   A +F SSM+   YY  K+
Sbjct: 14  AATGFIAGFIDAIAG-GGGLLSIPALLTIGIPPHTALGTNKLAASFGSSMAAFTYY-RKQ 71

Query: 398 FPVPYALYFVAVAT-IAAFVGQHVV 421
           F  P   Y  ++AT I A +G  VV
Sbjct: 72  FFTPRFWYHASIATFIGAVIGTFVV 96


>gi|323456207|gb|EGB12074.1| hypothetical protein AURANDRAFT_9279, partial [Aureococcus
           anophagefferens]
          Length = 89

 Score = 47.8 bits (112), Expect = 0.012,   Method: Composition-based stats.
 Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%)

Query: 128 IMGAAVSTVYYNLKLRHP--TLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTV 185
           I G +++ +Y   +  HP  TL  P+IDYD +LL  P L+ G   G  F+V+F  W+V +
Sbjct: 1   ISGGSIANLYTYTQRYHPNPTLRRPLIDYDASLLFCPALLAGTMFGGLFSVMFPPWLVVI 60

Query: 186 LLIVLFIGTSTKAFLKGVETWKRETILKK 214
            L+VL   +  +   KG+  W  E+  +K
Sbjct: 61  CLVVLLGYSGKRTVKKGIAKWNAESAKQK 89


>gi|422018984|ref|ZP_16365535.1| hypothetical protein OO9_09783 [Providencia alcalifaciens Dmel2]
 gi|414104170|gb|EKT65742.1| hypothetical protein OO9_09783 [Providencia alcalifaciens Dmel2]
          Length = 271

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 37/73 (50%), Gaps = 12/73 (16%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLEL------------GVPPQVSSATATFAMTFSS 385
           C FG ++GI   L G GGGFI  PL   L             V  Q++ AT+TF M FSS
Sbjct: 6   CLFGFISGITTALFGFGGGFITVPLLYALITLVWGPRHDTSEVAMQIAVATSTFVMIFSS 65

Query: 386 SMSVVEYYLLKRF 398
           S+S   +YL   F
Sbjct: 66  SLSSRAHYLKGNF 78


>gi|348688914|gb|EGZ28728.1| hypothetical protein PHYSODRAFT_475263 [Phytophthora sojae]
          Length = 257

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 64/273 (23%), Positives = 119/273 (43%), Gaps = 31/273 (11%)

Query: 181 WMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPR 240
           W++ VLL+++   T T+   K +         +KE  +C    G          + S   
Sbjct: 4   WLLCVLLVLVLSVTGTRTLQKAIRA------RQKERWQC----GVSPESTSLLGVDSAST 53

Query: 241 SGPQKDAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPV 300
            G +  A EP++    ++ W+++  L  ++V   G+++ +                  P+
Sbjct: 54  DGTKAHAGEPQLR--ADVPWRKIATLFGLFVVVAGMRLVRGGQN-----------FDSPI 100

Query: 301 SLVVS--LYEAISLYKGRRVIASKGD-DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGF 357
            L  S  LY  +       + A +      S + F + S+      AG V G+ G+GGG 
Sbjct: 101 GLDSSSALYPVLQQSPAYELEAHEIQWTSSSIKFFPVFSFA-----AGTVSGMFGIGGGI 155

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG 417
           I  PL LE+GV P  +SA     + FSS MS   Y +L +  +  A   + +  +  ++G
Sbjct: 156 INAPLLLEVGVDPSAASAMTAATVLFSSGMSSFNYLMLGKLDLDLARLMLPMGFLTTYIG 215

Query: 418 QHVVRKLIILLGRASLIIFILAFMIFVSAISLG 450
              + K++      SLI+F +A ++ +SA+++ 
Sbjct: 216 HICLLKVVRRFQCPSLIVFSMAAIVLISAVAMS 248


>gi|57640914|ref|YP_183392.1| hypothetical protein TK0979 [Thermococcus kodakarensis KOD1]
 gi|57159238|dbj|BAD85168.1| hypothetical membrane protein, conserved, DUF81 family
           [Thermococcus kodakarensis KOD1]
          Length = 254

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 46/87 (52%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + + Y   G++ GI+  + GLGGGF++ P    LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLKYIGYLGVGIVIGILAAMFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSAI 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y+  +R      L   + A I A++G
Sbjct: 61  AYHRQRRIHYKAGLLLASTAVIGAYIG 87


>gi|390960291|ref|YP_006424125.1| putative permease [Thermococcus sp. CL1]
 gi|390518599|gb|AFL94331.1| putative permease [Thermococcus sp. CL1]
          Length = 254

 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 45/87 (51%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + VSY A GV  GI+  L GLGGGF++ P    LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLKYVSYFAVGVFIGILAALFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSAL 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y    R      L   + A I A++G
Sbjct: 61  AYSRQGRIHYKVGLLLASTAVIGAYIG 87


>gi|223478524|ref|YP_002582988.1| hypothetical protein [Thermococcus sp. AM4]
 gi|214033750|gb|EEB74576.1| conserved hypothetical protein [Thermococcus sp. AM4]
          Length = 254

 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + + Y A GV  GI+  L GLGGGF++ P    LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLKYLGYFAVGVFIGILAALFGLGGGFLIVPTLNLLGVEMHHAVGTSSAAVVFTSLSSAI 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y   +R      L   + A I A++G
Sbjct: 61  AYSRQRRIHYKVGLLLASTAVIGAYIG 87


>gi|71420985|ref|XP_811672.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876359|gb|EAN89821.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 473

 Score = 47.4 bits (111), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 78/333 (23%), Positives = 150/333 (45%), Gaps = 44/333 (13%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPT--LDMPIIDYDLALLIQPML 164
           PM  L++G     A  +S+  I G ++  V   ++ R P      P+I+Y    L+ P+ 
Sbjct: 114 PMFCLLMGLPMDFAVGLSQTTICGQSILNVLIAIRKRFPCAGCSRPLINYQYLTLLVPLG 173

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNG 224
           ++G  IG   N +  D    VLL +L      ++  K +  ++++   ++      G+N 
Sbjct: 174 VIGTLIGGVLNRLCPDLFRLVLLFLLLTAVLYRSVRKMIAQYRKDQSERR------GTNT 227

Query: 225 AGAGEVEYKSLPSGPRSGPQK--DAMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNH 282
             + E E    P+   + P++     +P+        W E+  +VF ++  L     ++ 
Sbjct: 228 VSSTE-EVSGTPT--LNSPEEIFHVTQPQYP------WIEISCVVFSFIVNLSFGAWRSR 278

Query: 283 TASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGDDGKSFR-VFQ-----LVS 336
           T      Y V   L + +++V+        Y+ R        + + FR VF       + 
Sbjct: 279 TKCGGGAYIVAYCLPVVLNIVIFF-----CYRHRL------SNMEKFRLVFHWNNSTTIL 327

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSAT---ATFAMTFSSSMS-VVEY 392
           Y    V+AG+   +LG+GGG ++G +  ++G+ P+ +S T    T  + FSS++S ++E 
Sbjct: 328 YPLVSVVAGVASAMLGIGGGLVLGFILYDVGLIPEEASVTGGVVTLFLAFSSALSLLIES 387

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
           +LL    + Y     A   ++  +GQ V+ +LI
Sbjct: 388 HLL----IDYGGVLFACGIVSTILGQFVLMRLI 416


>gi|212712467|ref|ZP_03320595.1| hypothetical protein PROVALCAL_03561 [Providencia alcalifaciens DSM
           30120]
 gi|212684924|gb|EEB44452.1| hypothetical protein PROVALCAL_03561 [Providencia alcalifaciens DSM
           30120]
          Length = 271

 Score = 47.4 bits (111), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 20/109 (18%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLEL------------GVPPQVSSATATFAMTFSS 385
           C FG ++GI   L G GGGFI  PL   L             V  Q++ AT+TF M FSS
Sbjct: 6   CLFGFISGITTALFGFGGGFIAVPLLYALITLVWGPRHDTSEVAMQIAVATSTFVMIFSS 65

Query: 386 SMSVVEYYL--------LKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           S+S   +YL        ++ F +P ++  +  A IA  V    +R + I
Sbjct: 66  SLSSRAHYLKGNLNWQIIQPFMIPISIGGILGALIALSVDSEWIRWIFI 114


>gi|298290659|ref|YP_003692598.1| hypothetical protein Snov_0646 [Starkeya novella DSM 506]
 gi|296927170|gb|ADH87979.1| protein of unknown function DUF81 [Starkeya novella DSM 506]
          Length = 308

 Score = 47.4 bits (111), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 2/62 (3%)

Query: 336 SYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLL 395
           +  A G+  G + G+ G+GGGF+M PL + LGVPP V+ A+ +  M  SS    + Y  L
Sbjct: 16  TLLALGIAVGFISGMFGVGGGFLMTPLLIFLGVPPAVAVASVSTHMAASSFSGTLTY--L 73

Query: 396 KR 397
           +R
Sbjct: 74  RR 75


>gi|332295235|ref|YP_004437158.1| hypothetical protein Thena_0382 [Thermodesulfobium narugense DSM
           14796]
 gi|332178338|gb|AEE14027.1| protein of unknown function DUF81 [Thermodesulfobium narugense DSM
           14796]
          Length = 253

 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 44/73 (60%), Gaps = 2/73 (2%)

Query: 313 YKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
           YK ++   + G+D +  +V  L      G  AG V GLLG+GGG ++ P+ + +GVP + 
Sbjct: 116 YKQKK--NNSGEDFQINKVQMLTIGVPIGAFAGFVAGLLGIGGGNMIIPILIFIGVPARF 173

Query: 373 SSATATFAMTFSS 385
           ++AT +F + FSS
Sbjct: 174 AAATTSFIVLFSS 186


>gi|145500024|ref|XP_001435996.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403133|emb|CAK68599.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 47.0 bits (110), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 133/296 (44%), Gaps = 30/296 (10%)

Query: 107 PMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLML 166
           P++ +++G   K A   +  M++G ++       K R      P+I+Y L  +  P+L+ 
Sbjct: 40  PVMMVLLGCPSKKAIWNTYIMLLGGSLGNFIRLGKERTANGSAPLINYQLVQITLPLLLA 99

Query: 167 GISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAG 226
           G  +GVA       W+  +L+++   G     FLK    + +     KE    L      
Sbjct: 100 GAILGVAT----GKWLPKLLIVIFLFGILLTVFLKTKSLYTKTR--SKEMNEHLI----- 148

Query: 227 AGEVEYKSLPSGPRSGPQKD---AMEPEVTILENINWKELGLLVFVWVAFLGLQIAKNHT 283
              VE K L    +S   K+     E +  +       E+ L V + +    L+ +    
Sbjct: 149 --PVELKELSIQNQSTHSKELNIIKEKDGRLYPTEPLTEISLTVLIIIVVTLLKGSGAVP 206

Query: 284 ASCSIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGD----------DGKSFRVFQ 333
           +   I Y  L    +  ++V+      ++Y+ R+ I+   +          DGK   VF 
Sbjct: 207 SILGIDYCGLGFHFL--NIVIFGIACYNVYRYRKFISKDEEYKQSIGYDFSDGKMSAVFD 264

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSV 389
           +    +  + AG +GGL+GLGGG ++ PL+LE G+ P  ++A+ATF + F+SS+SV
Sbjct: 265 ITVKSS--LYAGFLGGLVGLGGGVVLTPLWLETGINPPRAAASATFTVLFTSSISV 318


>gi|158423157|ref|YP_001524449.1| permease [Azorhizobium caulinodans ORS 571]
 gi|158330046|dbj|BAF87531.1| putative permease [Azorhizobium caulinodans ORS 571]
          Length = 317

 Score = 47.0 bits (110), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 33/55 (60%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL L +GVPP V+ A+ +  M  SS    + Y+
Sbjct: 19  ALGAAVGFVSGMFGVGGGFLMTPLLLFIGVPPAVAVASVSTHMAASSFSGALSYF 73


>gi|126465319|ref|YP_001040428.1| hypothetical protein Smar_0408 [Staphylothermus marinus F1]
 gi|126014142|gb|ABN69520.1| protein of unknown function DUF81 [Staphylothermus marinus F1]
          Length = 250

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 54/84 (64%), Gaps = 3/84 (3%)

Query: 343 LAGIVGGLLGLGGGFIMGPLF-LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVP 401
           LAG+  G+LG+GGG +  PL  + LG+P + + AT++F +  ++S   +  YLLK +  P
Sbjct: 146 LAGLTSGMLGIGGGVLKVPLMTMVLGLPIRTAIATSSFMVGLTASAGSL-VYLLKGYVNP 204

Query: 402 YALYFVAVATI-AAFVGQHVVRKL 424
           YA+  +A+  I  A +G H+++K+
Sbjct: 205 YAVASLALGIIPGATLGAHMLKKI 228


>gi|212223719|ref|YP_002306955.1| membrane protein [Thermococcus onnurineus NA1]
 gi|212008676|gb|ACJ16058.1| hypothetical membrane protein, conserved [Thermococcus onnurineus
           NA1]
          Length = 254

 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 44/87 (50%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + + Y   GV  GI+  L GLGGGF++ P    LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLKYLGYFTVGVFIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSAL 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y   KR      L   + A I A++G
Sbjct: 61  AYSRQKRIHYKIGLLLASTAVIGAYIG 87


>gi|170741286|ref|YP_001769941.1| hypothetical protein M446_3097 [Methylobacterium sp. 4-46]
 gi|168195560|gb|ACA17507.1| protein of unknown function DUF81 [Methylobacterium sp. 4-46]
          Length = 307

 Score = 46.6 bits (109), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 33/51 (64%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
            G   G V G+ G+GGGF+M P+ + LG+PP ++ AT T  +  SS+ SV+
Sbjct: 20  LGAAVGFVSGVFGIGGGFLMTPILIVLGIPPAIAVATQTAPIVASSTTSVL 70


>gi|297526402|ref|YP_003668426.1| hypothetical protein Shell_0395 [Staphylothermus hellenicus DSM
           12710]
 gi|297255318|gb|ADI31527.1| protein of unknown function DUF81 [Staphylothermus hellenicus DSM
           12710]
          Length = 250

 Score = 46.6 bits (109), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 54/84 (64%), Gaps = 3/84 (3%)

Query: 343 LAGIVGGLLGLGGGFIMGPLF-LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVP 401
           LAG+  G+LG+GGG +  PL  + LG+P + + AT++F +  ++S   +  YLLK +  P
Sbjct: 146 LAGLTSGMLGIGGGVLKVPLMTIVLGLPIRTAIATSSFMVGLTASAGSL-VYLLKGYVNP 204

Query: 402 YALYFVAVATI-AAFVGQHVVRKL 424
           YA+  +A+  I  A +G H+++K+
Sbjct: 205 YAVAALALGIIPGATLGAHMLKKI 228


>gi|212557467|gb|ACJ29921.1| Conserved hypothetical protein [Shewanella piezotolerans WP3]
          Length = 257

 Score = 46.6 bits (109), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 50/99 (50%), Gaps = 8/99 (8%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
           D  S+ +  L+     G++AG +  + G GGG +  P  L +GVPP ++ AT   A  F 
Sbjct: 6   DPSSWALLALI-----GLIAGFIDAVAG-GGGLLSIPALLTMGVPPHMALATNKLAACFG 59

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVAT-IAAFVGQHVVR 422
           SSM+   YY  + F  P   Y   +AT I A +G  +V 
Sbjct: 60  SSMAAYTYYKQRLFS-PSLWYHTFIATFIGAVIGTFIVS 97


>gi|218667447|ref|YP_002426656.1| hypothetical protein AFE_2258 [Acidithiobacillus ferrooxidans ATCC
           23270]
 gi|415968072|ref|ZP_11558253.1| hypothetical protein GGI1_04804 [Acidithiobacillus sp. GGI-221]
 gi|218519660|gb|ACK80246.1| membrane protein, putative [Acidithiobacillus ferrooxidans ATCC
           23270]
 gi|339833297|gb|EGQ61154.1| hypothetical protein GGI1_04804 [Acidithiobacillus sp. GGI-221]
          Length = 296

 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 47/87 (54%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
             FG++ G + GL G+GGGF++ PL + +GVPP V+  T    +  +S++    ++ +  
Sbjct: 9   IGFGLIVGFLSGLTGVGGGFLITPLLIFVGVPPLVAVGTGAAQIVGASAVGSYAHWRMGN 68

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
             +  A   +A + +   +G HV R L
Sbjct: 69  VDMRMAFILLAGSWMGGLLGVHVARIL 95


>gi|402849112|ref|ZP_10897353.1| membrane protein, putative [Rhodovulum sp. PH10]
 gi|402500640|gb|EJW12311.1| membrane protein, putative [Rhodovulum sp. PH10]
          Length = 310

 Score = 46.2 bits (108), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 36/65 (55%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G   G + G+ G+GGGF+M PL + +GV P VS AT    +T SS    + Y+  +  
Sbjct: 19  AMGTAVGFISGMFGIGGGFLMTPLLIFIGVSPAVSVATVAAHITASSFSGALTYWRRRAI 78

Query: 399 PVPYA 403
            +P A
Sbjct: 79  DLPLA 83


>gi|198284012|ref|YP_002220333.1| hypothetical protein Lferr_1905 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|198248533|gb|ACH84126.1| protein of unknown function DUF81 [Acidithiobacillus ferrooxidans
           ATCC 53993]
          Length = 305

 Score = 46.2 bits (108), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 47/87 (54%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
             FG++ G + GL G+GGGF++ PL + +GVPP V+  T    +  +S++    ++ +  
Sbjct: 18  IGFGLIVGFLSGLTGVGGGFLITPLLIFVGVPPLVAVGTGAAQIVGASAVGSYAHWRMGN 77

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
             +  A   +A + +   +G HV R L
Sbjct: 78  VDMRMAFILLAGSWMGGLLGVHVARIL 104


>gi|403374048|gb|EJY86957.1| hypothetical protein OXYTRI_08652 [Oxytricha trifallax]
          Length = 426

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 72/122 (59%), Gaps = 4/122 (3%)

Query: 327 KSFRV-FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
           +++RV F ++  C    ++GI+GG++G+ GG I+ PLFL+LG+ P + ++T  +    S+
Sbjct: 298 ENYRVIFNIILLCH---ISGILGGVVGIAGGIILAPLFLQLGMLPVIVASTNQYLALIST 354

Query: 386 SMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVS 445
                +Y+ L      Y   F  +  I +++G  VV K++   GR S+++ ILAF++F S
Sbjct: 355 IAVSSQYWYLGMMNWQYCSVFGFLGIIGSYIGIKVVMKIVQKSGRQSIMVVILAFVLFTS 414

Query: 446 AI 447
            I
Sbjct: 415 FI 416


>gi|345020875|ref|ZP_08784488.1| hypothetical protein OTW25_06042 [Ornithinibacillus scapharcae
           TW25]
          Length = 271

 Score = 45.8 bits (107), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 57/121 (47%), Gaps = 12/121 (9%)

Query: 316 RRVIASKGDDGKSFRVFQL--------VSYCAFGVLAGIVG---GLLGLGGGFIMGP-LF 363
           +R    K  +G S R FQ+        +S     +LA +VG   GL G+GGG IM P + 
Sbjct: 124 KRNPEKKRMNGNSMRSFQVGDNVYEYSISIPLAIILALVVGTLSGLFGIGGGSIMVPAMI 183

Query: 364 LELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRK 423
           L  G P  +++AT+ F + F S M    +  L      Y L+F+  A I   VG  + + 
Sbjct: 184 LLFGFPVHIATATSMFMIFFVSMMGASSHIFLGHIAWEYVLFFIPGAWIGGKVGAKISQL 243

Query: 424 L 424
           L
Sbjct: 244 L 244


>gi|145494802|ref|XP_001433395.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400512|emb|CAK65998.1| unnamed protein product [Paramecium tetraurelia]
          Length = 466

 Score = 45.8 bits (107), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 64/303 (21%), Positives = 126/303 (41%), Gaps = 36/303 (11%)

Query: 110 SLIIGFDPKSATAISKCMIMGAA-VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGI 168
            +I+ +    AT I+ C++ G   ++T+    +        PII+Y ++++    +    
Sbjct: 71  EMILNYTQSEATHIAYCLMFGGTLLNTILLMFEKNPEDQRRPIINYRISIIFNLAVPFAT 130

Query: 169 SIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETWKRETI----LKKEAARCLGSNG 224
           ++G +    F   + T++L  LF+        K  +  K E +     +K  ++ L  +G
Sbjct: 131 NLGSSL-ASFLPQLYTLILQELFLFAVAPILWKKAQKAKSEELSTPDKQKNESQALNLDG 189

Query: 225 AGAGE------VEYKSLPSGPRSGPQKDAMEPEVTILENINWKELGLLVFVWVAF-LG-- 275
           +   +      +E +       S   +++ +   T+      +   +L F+   F LG  
Sbjct: 190 SNTQQKIELQKIEEQQYSVSLTSFSVENSKQNSNTLYYQFKQETENILPFMPALFILGSF 249

Query: 276 ------LQIAKNHTASCSIV-----YWVLDLLQIPVSLVVSLYEAISLYKGRRVIASKGD 324
                 +Q+   + +  S V      W  D + + + +   L++ ++   G R    K  
Sbjct: 250 GLNQIFIQMRSTNPSKPSYVGIYDCTWQNDFMILILIIANVLFDYVAWNFGSR--QEKYF 307

Query: 325 DGKSF----RVFQLVSY----CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSAT 376
           D  ++    R F  +S      A G  AG V G LG+G GF+M P  L  G+ P+ +SAT
Sbjct: 308 DQLNYLPNERYFTPISRFFKIYAGGFGAGFVSGFLGMGAGFVMVPTLLYSGLIPRCASAT 367

Query: 377 ATF 379
           + F
Sbjct: 368 SAF 370


>gi|157962478|ref|YP_001502512.1| hypothetical protein Spea_2657 [Shewanella pealeana ATCC 700345]
 gi|157847478|gb|ABV87977.1| protein of unknown function DUF81 [Shewanella pealeana ATCC 700345]
          Length = 257

 Score = 45.8 bits (107), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 8/101 (7%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
           D  S+ +  LV     G +AG +  + G GGG +  P  L +GVPP ++  T   A +F 
Sbjct: 6   DPSSWAILALV-----GFIAGFIDAVSG-GGGLLSIPALLTMGVPPHLALGTNKLAASFG 59

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVAT-IAAFVGQHVVRKL 424
           SSM+   YY  + F  P   Y   +AT I A +G  +V  +
Sbjct: 60  SSMAAYTYYKQRLFS-PSLWYHTFIATFIGAVIGTFIVYHI 99


>gi|341581528|ref|YP_004762020.1| putative permease [Thermococcus sp. 4557]
 gi|340809186|gb|AEK72343.1| putative permease [Thermococcus sp. 4557]
          Length = 254

 Score = 45.4 bits (106), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 42/81 (51%)

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           Y A GV  GI+  + GLGGGF++ P    LGV    +  T++ A+ F+S  S + Y   +
Sbjct: 7   YFAVGVFIGILAAMFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSAIAYSRQR 66

Query: 397 RFPVPYALYFVAVATIAAFVG 417
           R      L   + A I A++G
Sbjct: 67  RIHYKAGLLLASTAVIGAYIG 87


>gi|403359664|gb|EJY79493.1| hypothetical protein OXYTRI_23231 [Oxytricha trifallax]
          Length = 420

 Score = 45.4 bits (106), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 68/117 (58%), Gaps = 3/117 (2%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           +F ++  C    ++GI+GG++G+ GG I+ PLFL+LG+ P + ++T  +    S+     
Sbjct: 297 IFNIILLCH---ISGILGGVVGIAGGIILAPLFLQLGMLPVIVASTNQYLALISTIAVSS 353

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
           +Y+ L      Y   F  +  I +++G  VV K++   GR S+++ ILAF++F S I
Sbjct: 354 QYWYLGMMNWQYCSVFGFLGIIGSYIGIKVVMKIVQKSGRQSIMVVILAFVLFTSFI 410


>gi|261343525|ref|ZP_05971170.1| putative membrane protein [Providencia rustigianii DSM 4541]
 gi|282568674|gb|EFB74209.1| putative membrane protein [Providencia rustigianii DSM 4541]
          Length = 271

 Score = 45.4 bits (106), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 51/109 (46%), Gaps = 20/109 (18%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLEL------------GVPPQVSSATATFAMTFSS 385
           C FG ++GI   L G GGGFI  PL   L             +  Q++ AT+T  M FSS
Sbjct: 6   CLFGFISGITTALFGFGGGFITVPLLYALITLVWGSQLDVRDIAMQIAVATSTCVMIFSS 65

Query: 386 SMSVVEYYL--------LKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           S+S   +YL        +K F +P ++  V  A IA  V    +R + I
Sbjct: 66  SVSSRAHYLKGNLDWKMIKPFILPISIGGVLGAMIALSVDSEWIRWIFI 114


>gi|114048031|ref|YP_738581.1| hypothetical protein Shewmr7_2539 [Shewanella sp. MR-7]
 gi|113889473|gb|ABI43524.1| protein of unknown function DUF81 [Shewanella sp. MR-7]
          Length = 257

 Score = 45.1 bits (105), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 11/91 (12%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY---LLKR 397
           G++AG +  ++G GGG +  P  L LG+PP  +  T  FA +F SSM+   YY   LLK 
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
                   F  +A IA F+G  +   L+ LL
Sbjct: 76  -------SFWYMAFIATFIGAVLGSVLVYLL 99


>gi|145549159|ref|XP_001460259.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428088|emb|CAK92862.1| unnamed protein product [Paramecium tetraurelia]
          Length = 482

 Score = 45.1 bits (105), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 94  GSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHP-TLDMPII 152
           G++G +G G +  P L   + FD +SA  ++ C+   + ++ +      RHP   + PII
Sbjct: 77  GALGALGAGVVKRPFLEATLNFDSESAGNMTMCLQFASQLANIIIIFFQRHPEDPERPII 136

Query: 153 DYDLALLIQPMLMLGISIGVAF 174
           +Y++AL+    + L  S+G  F
Sbjct: 137 NYEIALIYCLAIPLSQSLGTEF 158


>gi|113970806|ref|YP_734599.1| hypothetical protein Shewmr4_2471 [Shewanella sp. MR-4]
 gi|113885490|gb|ABI39542.1| protein of unknown function DUF81 [Shewanella sp. MR-4]
          Length = 257

 Score = 45.1 bits (105), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 11/91 (12%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY---LLKR 397
           G++AG +  ++G GGG +  P  L LG+PP  +  T  FA +F SSM+   YY   LLK 
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
                   F  +A IA F+G  +   L+ LL
Sbjct: 76  -------SFWYMAFIATFIGAVLGSVLVYLL 99


>gi|86606127|ref|YP_474890.1| hypothetical protein CYA_1459 [Synechococcus sp. JA-3-3Ab]
 gi|86554669|gb|ABC99627.1| putative membrane protein [Synechococcus sp. JA-3-3Ab]
          Length = 317

 Score = 45.1 bits (105), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 42/69 (60%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
            G++  V + ++    G+ AGI+ G LG+GGG I+ PL + LG+PP  + AT+ F++  +
Sbjct: 15  QGRAIVVEEGLALAIGGLAAGILAGFLGIGGGTILVPLQVSLGIPPLQAVATSNFSILLT 74

Query: 385 SSMSVVEYY 393
           S    ++ +
Sbjct: 75  SLAGSIQNW 83


>gi|145539706|ref|XP_001455543.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423351|emb|CAK88146.1| unnamed protein product [Paramecium tetraurelia]
          Length = 484

 Score = 45.1 bits (105), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 44/79 (55%), Gaps = 1/79 (1%)

Query: 94  GSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL-DMPII 152
            S+G +GGG +  P L  I+ FD   A  I+ C+++ A +  + +     HP + + P+I
Sbjct: 72  ASLGALGGGIVKRPFLQSILNFDASIAGDITACLMISAQIVNMVFIFLQNHPDVPERPVI 131

Query: 153 DYDLALLIQPMLMLGISIG 171
           +YD+A++    + + + +G
Sbjct: 132 NYDIAIIYTLAIPVSLCLG 150


>gi|154250913|ref|YP_001411737.1| hypothetical protein Plav_0457 [Parvibaculum lavamentivorans DS-1]
 gi|154154863|gb|ABS62080.1| protein of unknown function DUF81 [Parvibaculum lavamentivorans
           DS-1]
          Length = 307

 Score = 44.7 bits (104), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 36/64 (56%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
            +++  A G   G + G+ G+GGGF+M PL + LG+PP V+  T T  +  SS    + +
Sbjct: 13  SILTILAMGATVGFLSGMFGIGGGFLMTPLLIFLGIPPAVAVGTQTTQVVASSVTGALAH 72

Query: 393 YLLK 396
           +  K
Sbjct: 73  FTRK 76


>gi|24373383|ref|NP_717426.1| 4-toluene sulfonate uptake permease family protein [Shewanella
           oneidensis MR-1]
 gi|24347649|gb|AAN54870.1| 4-toluene sulfonate uptake permease family protein [Shewanella
           oneidensis MR-1]
          Length = 257

 Score = 44.7 bits (104), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 51/91 (56%), Gaps = 5/91 (5%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A G++AG +  ++G GGG +  P  L LG+PP  +  T  FA +F SSM+   +Y  ++
Sbjct: 14  AAIGLVAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTFY--RQ 70

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
             +  A +++A   +A F+G  +   L+ LL
Sbjct: 71  HLLKPAFWYMAF--VATFIGAVLGSILVYLL 99


>gi|317130121|ref|YP_004096403.1| hypothetical protein Bcell_3430 [Bacillus cellulosilyticus DSM
           2522]
 gi|315475069|gb|ADU31672.1| protein of unknown function DUF81 [Bacillus cellulosilyticus DSM
           2522]
          Length = 274

 Score = 44.7 bits (104), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 62/121 (51%), Gaps = 8/121 (6%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           E+ +G+  ++G +IGF       + G+GGG + VP + L+ GF P  A A S  MI  +A
Sbjct: 148 EITYGYHPIIGILIGFVVGMCSGLFGIGGGSLMVPAMILLFGFPPHIAVATSMLMIFLSA 207

Query: 133 VSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN-VIFADWMVTVLLIVLF 191
           + +   ++ L +       ID+   L + P   +G  +G A N  + +D +V +L + L 
Sbjct: 208 ILSSVSHMALGN-------IDWLYVLALLPGAWIGGQLGAAINRKMQSDAIVNLLRVFLI 260

Query: 192 I 192
           I
Sbjct: 261 I 261



 Score = 39.3 bits (90), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 1/85 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
           G + G+  GL G+GGG +M P + L  G PP ++ AT+   +  S+ +S V +  L    
Sbjct: 162 GFVVGMCSGLFGIGGGSLMVPAMILLFGFPPHIAVATSMLMIFLSAILSSVSHMALGNID 221

Query: 400 VPYALYFVAVATIAAFVGQHVVRKL 424
             Y L  +  A I   +G  + RK+
Sbjct: 222 WLYVLALLPGAWIGGQLGAAINRKM 246


>gi|294500093|ref|YP_003563793.1| hypothetical protein BMQ_3337 [Bacillus megaterium QM B1551]
 gi|294350030|gb|ADE70359.1| putative membrane protein [Bacillus megaterium QM B1551]
          Length = 258

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 64/110 (58%), Gaps = 8/110 (7%)

Query: 342 VLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSM-SVVEYYLLKRFP 399
           +L G V G+LG GG FI+ P+ L  L +P +++ AT + A+TF SS+ S V   ++ + P
Sbjct: 150 LLIGAVAGVLGAGGAFILVPVMLVVLKIPTRITIAT-SLAVTFISSIGSTVGKLIMHQVP 208

Query: 400 VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
              AL  VA + IA+ +G  V +K+        ++ +ILA +I  +A+ +
Sbjct: 209 FIPALILVAASLIASPIGAKVGQKM-----NTKVLQWILAGLILATAVKI 253


>gi|444375855|ref|ZP_21175107.1| hypothetical protein D515_1881 [Enterovibrio sp. AK16]
 gi|443680083|gb|ELT86731.1| hypothetical protein D515_1881 [Enterovibrio sp. AK16]
          Length = 257

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 1/78 (1%)

Query: 344 AGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA 403
           AG +  + G GGG +M P  L +G+PP ++  T   + TF+SS +   YY  K F   + 
Sbjct: 20  AGFIDAVAG-GGGMLMVPALLSIGLPPHIALGTNKLSATFASSTAAFTYYRKKLFSPSFW 78

Query: 404 LYFVAVATIAAFVGQHVV 421
           ++      I A +G  VV
Sbjct: 79  IHAFVATFIGAILGTLVV 96


>gi|407465090|ref|YP_006775972.1| hypothetical protein NSED_06140 [Candidatus Nitrosopumilus sp. AR2]
 gi|407048278|gb|AFS83030.1| hypothetical protein NSED_06140 [Candidatus Nitrosopumilus sp. AR2]
          Length = 255

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 36/64 (56%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G  AGI+G ++GLGGG I+ P+   LG PP  +++ + FA   ++  S + Y   KR   
Sbjct: 11  GFAAGILGSMIGLGGGVIVVPVLTFLGFPPTAAASNSLFAALSNAIASTISYSKQKRIEY 70

Query: 401 PYAL 404
           P  L
Sbjct: 71  PLGL 74



 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 68/139 (48%), Gaps = 10/139 (7%)

Query: 86  IGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAV-STVYYNLKLRH 144
           +GF     GS+ G+GGG I VP+L+  +GF P +A + S    +  A+ ST+ Y+ + R 
Sbjct: 10  LGFAAGILGSMIGLGGGVIVVPVLTF-LGFPPTAAASNSLFAALSNAIASTISYSKQKR- 67

Query: 145 PTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVE 204
                  I+Y L L +  + + G  +G   +   A  +  +L   + I ++   FL+   
Sbjct: 68  -------IEYPLGLKLGLLSIPGTVLGALISSDVAPDVFKILFGFVLIASAAYIFLRKKI 120

Query: 205 TWKRETILKKEAARCLGSN 223
             + +TI K+     +G++
Sbjct: 121 ETREKTISKQMTVFAIGAS 139


>gi|417644318|ref|ZP_12294319.1| putative membrane protein [Staphylococcus warneri VCU121]
 gi|330684914|gb|EGG96596.1| putative membrane protein [Staphylococcus epidermidis VCU121]
          Length = 275

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 69/129 (53%), Gaps = 7/129 (5%)

Query: 325 DGKSFR--VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAM 381
           +GK++   V  LV++ A  +L GI+ GL G+GGG +M PL L     PP V+  T+   +
Sbjct: 148 EGKTYHYSVPPLVAFVA-TLLIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG---QHVVRKLIILLGRASLIIFIL 438
            FSS MS + + +       Y++  +  + I A +G    H ++   +++   ++++ I 
Sbjct: 207 FFSSVMSSIGHIIQGHVAWDYSIVLIISSYIGAKIGVKVNHSIKSDTVVMLLRTVMLLIG 266

Query: 439 AFMIFVSAI 447
           A++I  S I
Sbjct: 267 AYLIIKSII 275



 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 82/189 (43%), Gaps = 26/189 (13%)

Query: 6   AKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGS 65
           +++  L+S  + F  F + FV +      KI+  +     + +  Y+          EG 
Sbjct: 100 SQYLTLKSFNLYFGIF-MIFVAILLMVRHKIKPFKIFNKPKYEKTYI--------DAEGK 150

Query: 66  GYHHVWPEL-KFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
            YH+  P L  F   +++G + G FG        +GGG +  P++ ++  F P  A   S
Sbjct: 151 TYHYSVPPLVAFVATLLIGILTGLFG--------IGGGALMTPLMLIVFRFPPHVAVGTS 202

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN-VIFADWMV 183
             MI  ++V +   ++   H        DY + L+I     +G  IGV  N  I +D +V
Sbjct: 203 MMMIFFSSVMSSIGHIIQGHVAW-----DYSIVLIISS--YIGAKIGVKVNHSIKSDTVV 255

Query: 184 TVLLIVLFI 192
            +L  V+ +
Sbjct: 256 MLLRTVMLL 264


>gi|376296158|ref|YP_005167388.1| hypothetical protein DND132_1375 [Desulfovibrio desulfuricans
           ND132]
 gi|323458719|gb|EGB14584.1| protein of unknown function DUF81 [Desulfovibrio desulfuricans
           ND132]
          Length = 309

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 326 GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
           G S  VF +    A G   G++ G+ G+GGGF+M PL +  G+PP V++A+ +  +  +S
Sbjct: 10  GNSVNVFLVF---ALGGFVGLLSGIFGVGGGFLMTPLLIMFGIPPTVAAASDSNQIVGAS 66

Query: 386 SMSVVEYYLL 395
           +   + +Y L
Sbjct: 67  TSGCLAHYRL 76


>gi|90422261|ref|YP_530631.1| hypothetical protein RPC_0741 [Rhodopseudomonas palustris BisB18]
 gi|90104275|gb|ABD86312.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
           BisB18]
          Length = 308

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ T  M  SS    + Y+  +R 
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGIAPAVAVASVTSHMAASSLSGALSYW--RRR 76

Query: 399 PVPYALYFV 407
            +  AL  V
Sbjct: 77  AIDPALALV 85


>gi|445060181|ref|YP_007385585.1| sulfite exporter TauE/SafE-like protein [Staphylococcus warneri
           SG1]
 gi|443426238|gb|AGC91141.1| sulfite exporter TauE/SafE-like protein [Staphylococcus warneri
           SG1]
          Length = 275

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 69/129 (53%), Gaps = 7/129 (5%)

Query: 325 DGKSFR--VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAM 381
           +GK++   V  LV++ A  +L GI+ GL G+GGG +M PL L     PP V+  T+   +
Sbjct: 148 EGKTYHYSVPPLVAFVA-TLLIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG---QHVVRKLIILLGRASLIIFIL 438
            FSS MS + + +       Y++  +  + I A +G    H ++   +++   ++++ I 
Sbjct: 207 FFSSVMSSIGHIIQGHVAWDYSIVLIISSYIGAKIGVKVNHSIKSDTVVMLLRTVMLLIG 266

Query: 439 AFMIFVSAI 447
           A++I  S I
Sbjct: 267 AYLIIKSII 275



 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 82/189 (43%), Gaps = 26/189 (13%)

Query: 6   AKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGS 65
           +++  L+S  + F  F + FV +      KI+  +     + +  Y+          EG 
Sbjct: 100 SQYLTLKSFNLYFGIF-MIFVAILLMVRHKIKPFKIFNKPKYEKTYI--------DAEGK 150

Query: 66  GYHHVWPEL-KFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
            YH+  P L  F   +++G + G FG        +GGG +  P++ ++  F P  A   S
Sbjct: 151 TYHYSVPPLVAFVATLLIGILTGLFG--------IGGGALMTPLMLIVFRFPPHVAVGTS 202

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN-VIFADWMV 183
             MI  ++V +   ++   H        DY + L+I     +G  IGV  N  I +D +V
Sbjct: 203 MMMIFFSSVMSSIGHIIQGHVAW-----DYSIVLIISS--YIGAKIGVKVNHSIKSDTVV 255

Query: 184 TVLLIVLFI 192
            +L  V+ +
Sbjct: 256 MLLRTVMLL 264


>gi|452853551|ref|YP_007495235.1| conserved membrane protein of unknown function [Desulfovibrio
           piezophilus]
 gi|451897205|emb|CCH50084.1| conserved membrane protein of unknown function [Desulfovibrio
           piezophilus]
          Length = 309

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 326 GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
           G S  VF +    A G   G++ G+ G+GGGF+M PL +  G+PP V++A+ +  +  +S
Sbjct: 10  GNSVNVFLVF---ALGGFVGLLSGIFGVGGGFLMTPLLIMFGIPPTVAAASDSNQIVGAS 66

Query: 386 SMSVVEYYLL 395
           +   + +Y L
Sbjct: 67  TSGCLAHYRL 76



 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 54/105 (51%), Gaps = 2/105 (1%)

Query: 322 KGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFA 380
           + D  KS     ++     GVL GI+  ++G+GGGFIM P+ +  L +P  V   T+ F 
Sbjct: 167 QTDFAKSGVRLSMLMPLVLGVLVGILAAIMGVGGGFIMVPIMVYLLRMPMHVVVGTSLFQ 226

Query: 381 MTFSS-SMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           + F+  ++++++ Y         A+  +  +T+ A  G  + RKL
Sbjct: 227 ILFTCINVTILQSYTNHTVDFVLAVLLLLGSTLGAQFGTRISRKL 271


>gi|420205978|ref|ZP_14711489.1| hypothetical protein HMPREF9977_00722 [Staphylococcus epidermidis
           NIHLM008]
 gi|394278651|gb|EJE22965.1| hypothetical protein HMPREF9977_00722 [Staphylococcus epidermidis
           NIHLM008]
          Length = 253

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVA---VATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGAAIIATYIPANILKPLIII 108


>gi|117921072|ref|YP_870264.1| hypothetical protein Shewana3_2631 [Shewanella sp. ANA-3]
 gi|117613404|gb|ABK48858.1| protein of unknown function DUF81 [Shewanella sp. ANA-3]
          Length = 257

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 11/80 (13%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY---LLKR 397
           G++AG +  ++G GGG +  P  L LG+PP  +  T  FA +F SSM+   YY   LLK 
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75

Query: 398 FPVPYALYFVAVATIAAFVG 417
                   F  +A IA F+G
Sbjct: 76  -------SFWYMAFIATFIG 88


>gi|428224218|ref|YP_007108315.1| hypothetical protein GEI7407_0765 [Geitlerinema sp. PCC 7407]
 gi|427984119|gb|AFY65263.1| protein of unknown function DUF81 [Geitlerinema sp. PCC 7407]
          Length = 275

 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 48/84 (57%), Gaps = 4/84 (4%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK--RF 398
           G LAGI+ G LG+GGG I+ PL + LG  P  + AT++ A+  +S    ++ + +   RF
Sbjct: 25  GFLAGILAGFLGIGGGTILVPLMVTLGFAPVQAVATSSLAILITSLSGSLQNWRMGYLRF 84

Query: 399 PVPYALYFVAV--ATIAAFVGQHV 420
           P   AL   A+  A I AF+  ++
Sbjct: 85  PQVLALGLPALLTAQIGAFLANYI 108


>gi|188025747|ref|ZP_02959652.2| hypothetical protein PROSTU_01536 [Providencia stuartii ATCC 25827]
 gi|188020324|gb|EDU58364.1| hypothetical protein PROSTU_01536 [Providencia stuartii ATCC 25827]
          Length = 277

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 51/109 (46%), Gaps = 20/109 (18%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLEL------------GVPPQVSSATATFAMTFSS 385
           C FG ++GI   L G GGGFI  PL   L             V  Q++ AT+T  M FS+
Sbjct: 12  CLFGFISGITTALFGFGGGFITVPLLYALITLVWGPESEASAVAMQIAVATSTCVMIFSA 71

Query: 386 SMSVVEYY--------LLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           S+S   +Y        +++ F +P +L  +  A IA  V    +R L I
Sbjct: 72  SLSSRAHYRKGHLNWSIIRPFIIPISLGGILGAFIALSVDSEWIRWLFI 120


>gi|352081388|ref|ZP_08952266.1| protein of unknown function DUF81 [Rhodanobacter sp. 2APBS1]
 gi|351683429|gb|EHA66513.1| protein of unknown function DUF81 [Rhodanobacter sp. 2APBS1]
          Length = 250

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 54/112 (48%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           + +S+ A G  A +V G LG+  G +   + L LG+PP V+SA+  +A TF+   S + +
Sbjct: 6   EFLSFAAVGAFAQLVDGALGMAYGVVSAAMLLGLGLPPAVASASVHYAETFTCGASGLSH 65

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
                              I A +G +VV  +     R +L +++LA  +F+
Sbjct: 66  LAAGNVRRQLFWTLAVPGAIGAVIGAYVVSHVPADGMRLALTVYMLAMGVFL 117


>gi|239636555|ref|ZP_04677557.1| conserved hypothetical protein [Staphylococcus warneri L37603]
 gi|239597910|gb|EEQ80405.1| conserved hypothetical protein [Staphylococcus warneri L37603]
          Length = 275

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 68/129 (52%), Gaps = 7/129 (5%)

Query: 325 DGKSFRVFQLVSYCAF--GVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAM 381
           +GK++  + +  + AF   +L GI+ GL G+GGG +M PL L     PP V+  T+   +
Sbjct: 148 EGKTYH-YSVPPFVAFVATLLIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG---QHVVRKLIILLGRASLIIFIL 438
            FSS MS + + +       Y++  +  + I A +G    H ++   +++   ++++ I 
Sbjct: 207 FFSSVMSSIGHIIQGHVAWDYSIVLIISSYIGAKIGVKVNHSIKSDTVVMLLRTVMLLIG 266

Query: 439 AFMIFVSAI 447
           A++I  S I
Sbjct: 267 AYLIIKSII 275



 Score = 38.9 bits (89), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 83/189 (43%), Gaps = 26/189 (13%)

Query: 6   AKWKILRSVMMIFLNFALAFVCVKAERGLKIETLRFNGTERQDSNYLLKAVNFLWQPEGS 65
           +++  L+S  + F  F + FV +      KI+  +     + +  Y+          EG 
Sbjct: 100 SQYLTLKSFNLYFGIF-MIFVAILLMVRHKIKPFKIFNKPQYEKTYI--------DAEGK 150

Query: 66  GYHH-VWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAIS 124
            YH+ V P + F   +++G + G FG        +GGG +  P++ ++  F P  A   S
Sbjct: 151 TYHYSVPPFVAFVATLLIGILTGLFG--------IGGGALMTPLMLIVFRFPPHVAVGTS 202

Query: 125 KCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFN-VIFADWMV 183
             MI  ++V +   ++   H        DY + L+I     +G  IGV  N  I +D +V
Sbjct: 203 MMMIFFSSVMSSIGHIIQGHVAW-----DYSIVLIISS--YIGAKIGVKVNHSIKSDTVV 255

Query: 184 TVLLIVLFI 192
            +L  V+ +
Sbjct: 256 MLLRTVMLL 264


>gi|386744462|ref|YP_006217641.1| hypothetical protein S70_15675 [Providencia stuartii MRSN 2154]
 gi|384481155|gb|AFH94950.1| hypothetical protein S70_15675 [Providencia stuartii MRSN 2154]
          Length = 271

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 51/109 (46%), Gaps = 20/109 (18%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLEL------------GVPPQVSSATATFAMTFSS 385
           C FG ++GI   L G GGGFI  PL   L             V  Q++ AT+T  M FS+
Sbjct: 6   CLFGFISGITTALFGFGGGFITVPLLYALITLVWGPESEASAVAMQIAVATSTCVMIFSA 65

Query: 386 SMSVVEYY--------LLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           S+S   +Y        +++ F +P +L  +  A IA  V    +R L I
Sbjct: 66  SLSSRAHYRKGHLNWSIIRPFIIPISLGGILGAFIALSVDSEWIRWLFI 114


>gi|417646396|ref|ZP_12296255.1| hypothetical protein SEVCU144_0935 [Staphylococcus epidermidis
           VCU144]
 gi|329727892|gb|EGG64342.1| hypothetical protein SEVCU144_0935 [Staphylococcus epidermidis
           VCU144]
          Length = 125

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVA---VATIAAFVGQHVVRKLIIL 427
             LK       L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMLPLIFVASGGAAIIATYIPANILKPLIII 108


>gi|295705457|ref|YP_003598532.1| hypothetical protein BMD_3342 [Bacillus megaterium DSM 319]
 gi|294803116|gb|ADF40182.1| putative membrane protein [Bacillus megaterium DSM 319]
          Length = 258

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 63/110 (57%), Gaps = 8/110 (7%)

Query: 342 VLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSM-SVVEYYLLKRFP 399
           +L G V G+LG GG FI+ P+ L  L +P +++ AT + A+TF SS+ S +   +  + P
Sbjct: 150 LLIGAVAGVLGAGGAFILVPVMLVVLKIPTRITIAT-SLAVTFISSIGSTIGKLITHQVP 208

Query: 400 VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
              AL  VA + IA+ +G  V +K+        ++ +ILA +I  +AI +
Sbjct: 209 FIPALILVAASLIASPIGAKVGQKM-----NTKVLQWILAGLILATAIKI 253


>gi|85713974|ref|ZP_01044963.1| hypothetical protein NB311A_07458 [Nitrobacter sp. Nb-311A]
 gi|85699100|gb|EAQ36968.1| hypothetical protein NB311A_07458 [Nitrobacter sp. Nb-311A]
          Length = 371

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 32/55 (58%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ AT T  +  SS    + Y+
Sbjct: 82  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGIAPAVAVATVTSHIAASSFSGALSYW 136


>gi|242242978|ref|ZP_04797423.1| membrane protein [Staphylococcus epidermidis W23144]
 gi|242233579|gb|EES35891.1| membrane protein [Staphylococcus epidermidis W23144]
          Length = 254

 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|418634984|ref|ZP_13197372.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
 gi|420190314|ref|ZP_14696257.1| hypothetical protein HMPREF9984_06342 [Staphylococcus epidermidis
           NIHLM037]
 gi|420192642|ref|ZP_14698500.1| hypothetical protein HMPREF9983_06044 [Staphylococcus epidermidis
           NIHLM023]
 gi|420204616|ref|ZP_14710174.1| hypothetical protein HMPREF9978_05542 [Staphylococcus epidermidis
           NIHLM015]
 gi|374835742|gb|EHR99339.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
 gi|394258759|gb|EJE03633.1| hypothetical protein HMPREF9984_06342 [Staphylococcus epidermidis
           NIHLM037]
 gi|394260815|gb|EJE05619.1| hypothetical protein HMPREF9983_06044 [Staphylococcus epidermidis
           NIHLM023]
 gi|394273626|gb|EJE18057.1| hypothetical protein HMPREF9978_05542 [Staphylococcus epidermidis
           NIHLM015]
          Length = 254

 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|91975280|ref|YP_567939.1| hypothetical protein RPD_0800 [Rhodopseudomonas palustris BisB5]
 gi|91681736|gb|ABE38038.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
           BisB5]
          Length = 307

 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ T  M  SS    + Y+  +R 
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALSYW--RRR 76

Query: 399 PVPYALYFV 407
            +   L FV
Sbjct: 77  AIDPMLAFV 85


>gi|418614254|ref|ZP_13177232.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
 gi|374820914|gb|EHR84988.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
          Length = 254

 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|433655952|ref|YP_007299660.1| putative permease [Thermoanaerobacterium thermosaccharolyticum
           M0795]
 gi|433294141|gb|AGB19963.1| putative permease [Thermoanaerobacterium thermosaccharolyticum
           M0795]
          Length = 252

 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 64/127 (50%), Gaps = 11/127 (8%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMT---FSSSMSVV 390
           ++S C  G +A  +  + G GGG I  P  + LGVPP  +  T  FA T   F+SSM+ +
Sbjct: 6   ILSLCVIGFVAAFIDSIAG-GGGIISLPGLMVLGVPPAYALGTNKFASTCASFTSSMTFI 64

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLG 450
           +Y   K + V    Y V    I A +G     K ++LL  + L I I+  MIFV+  +L 
Sbjct: 65  KY---KVYDVNLLKYLVFGTLIGAILGV----KAVLLLDSSKLRIIIIILMIFVAIYTLL 117

Query: 451 GVGISNM 457
              + N+
Sbjct: 118 SKNVGNV 124


>gi|422022094|ref|ZP_16368603.1| hypothetical protein OO7_05939 [Providencia sneebia DSM 19967]
 gi|414097844|gb|EKT59497.1| hypothetical protein OO7_05939 [Providencia sneebia DSM 19967]
          Length = 271

 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 35/68 (51%), Gaps = 12/68 (17%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLEL------------GVPPQVSSATATFAMTFSS 385
           C FG ++GI   L G GGGFI  PL   L            GV  Q++ AT+T  M FS+
Sbjct: 6   CIFGFISGITTALFGFGGGFITVPLLYALITLVWGTHSDAGGVAMQIAVATSTCVMIFSA 65

Query: 386 SMSVVEYY 393
           S+S   +Y
Sbjct: 66  SISSRAHY 73


>gi|420174532|ref|ZP_14680982.1| hypothetical protein HMPREF9990_02314 [Staphylococcus epidermidis
           NIHLM061]
 gi|394245037|gb|EJD90364.1| hypothetical protein HMPREF9990_02314 [Staphylococcus epidermidis
           NIHLM061]
          Length = 254

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|27468308|ref|NP_764945.1| hypothetical protein SE1390 [Staphylococcus epidermidis ATCC 12228]
 gi|57867226|ref|YP_188850.1| hypothetical protein SERP1278 [Staphylococcus epidermidis RP62A]
 gi|251811104|ref|ZP_04825577.1| membrane protein [Staphylococcus epidermidis BCM-HMP0060]
 gi|282875864|ref|ZP_06284731.1| putative membrane protein [Staphylococcus epidermidis SK135]
 gi|293366339|ref|ZP_06613019.1| conserved hypothetical protein [Staphylococcus epidermidis
           M23864:W2(grey)]
 gi|417656164|ref|ZP_12305855.1| putative membrane protein [Staphylococcus epidermidis VCU028]
 gi|417658659|ref|ZP_12308280.1| putative membrane protein [Staphylococcus epidermidis VCU045]
 gi|417908781|ref|ZP_12552538.1| putative membrane protein [Staphylococcus epidermidis VCU037]
 gi|417911376|ref|ZP_12555083.1| putative membrane protein [Staphylococcus epidermidis VCU105]
 gi|417914293|ref|ZP_12557945.1| putative membrane protein [Staphylococcus epidermidis VCU109]
 gi|418603691|ref|ZP_13167072.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
 gi|418606180|ref|ZP_13169474.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
 gi|418610089|ref|ZP_13173215.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
 gi|418612531|ref|ZP_13175566.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
 gi|418618135|ref|ZP_13181014.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
 gi|418623251|ref|ZP_13185969.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
 gi|418625217|ref|ZP_13187873.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
 gi|418627260|ref|ZP_13189839.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
 gi|418629290|ref|ZP_13191802.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU127]
 gi|418664063|ref|ZP_13225560.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
 gi|419769208|ref|ZP_14295304.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-250]
 gi|419771352|ref|ZP_14297406.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-K]
 gi|420166604|ref|ZP_14673287.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM088]
 gi|420170403|ref|ZP_14676964.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM070]
 gi|420187092|ref|ZP_14693115.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM039]
 gi|420195366|ref|ZP_14701159.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM021]
 gi|420196947|ref|ZP_14702681.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM020]
 gi|420209206|ref|ZP_14714644.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM003]
 gi|420215110|ref|ZP_14720382.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH05005]
 gi|420216860|ref|ZP_14722054.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH05001]
 gi|420220645|ref|ZP_14725604.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH04008]
 gi|420221508|ref|ZP_14726438.1| hypothetical protein HMPREF1390_00623 [Staphylococcus epidermidis
           NIH08001]
 gi|420225897|ref|ZP_14730724.1| hypothetical protein HMPREF1389_10265 [Staphylococcus epidermidis
           NIH06004]
 gi|420227493|ref|ZP_14732261.1| hypothetical protein HMPREF1388_05408 [Staphylococcus epidermidis
           NIH05003]
 gi|420229811|ref|ZP_14734513.1| hypothetical protein HMPREF1387_05447 [Staphylococcus epidermidis
           NIH04003]
 gi|420232217|ref|ZP_14736858.1| hypothetical protein HMPREF1386_05683 [Staphylococcus epidermidis
           NIH051668]
 gi|420234863|ref|ZP_14739423.1| hypothetical protein HMPREF1385_05672 [Staphylococcus epidermidis
           NIH051475]
 gi|421606803|ref|ZP_16048057.1| hypothetical protein B440_00608 [Staphylococcus epidermidis
           AU12-03]
 gi|27315854|gb|AAO04989.1|AE016748_223 conserved hypothetical protein [Staphylococcus epidermidis ATCC
           12228]
 gi|57637884|gb|AAW54672.1| conserved hypothetical protein [Staphylococcus epidermidis RP62A]
 gi|251805324|gb|EES57981.1| membrane protein [Staphylococcus epidermidis BCM-HMP0060]
 gi|281294889|gb|EFA87416.1| putative membrane protein [Staphylococcus epidermidis SK135]
 gi|291319577|gb|EFE59943.1| conserved hypothetical protein [Staphylococcus epidermidis
           M23864:W2(grey)]
 gi|329737414|gb|EGG73668.1| putative membrane protein [Staphylococcus epidermidis VCU028]
 gi|329737501|gb|EGG73754.1| putative membrane protein [Staphylococcus epidermidis VCU045]
 gi|341652991|gb|EGS76765.1| putative membrane protein [Staphylococcus epidermidis VCU109]
 gi|341653699|gb|EGS77466.1| putative membrane protein [Staphylococcus epidermidis VCU105]
 gi|341656142|gb|EGS79865.1| putative membrane protein [Staphylococcus epidermidis VCU037]
 gi|374405505|gb|EHQ76437.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
 gi|374407009|gb|EHQ77878.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
 gi|374409263|gb|EHQ80062.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
 gi|374410953|gb|EHQ81682.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
 gi|374816377|gb|EHR80581.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
 gi|374818856|gb|EHR82999.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
 gi|374824144|gb|EHR88117.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
 gi|374825718|gb|EHR89642.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
 gi|374829935|gb|EHR93727.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
 gi|374834371|gb|EHR98017.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU127]
 gi|383358277|gb|EID35736.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-250]
 gi|383361578|gb|EID38948.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
           IS-K]
 gi|394233474|gb|EJD79078.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM088]
 gi|394240741|gb|EJD86164.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM070]
 gi|394256839|gb|EJE01765.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM039]
 gi|394263320|gb|EJE08056.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM021]
 gi|394266921|gb|EJE11539.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM020]
 gi|394279434|gb|EJE23742.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIHLM003]
 gi|394282559|gb|EJE26749.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH05005]
 gi|394285998|gb|EJE30064.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH04008]
 gi|394290606|gb|EJE34460.1| hypothetical protein HMPREF1390_00623 [Staphylococcus epidermidis
           NIH08001]
 gi|394291222|gb|EJE35046.1| membrane protein, DUF81 family [Staphylococcus epidermidis
           NIH05001]
 gi|394293331|gb|EJE37054.1| hypothetical protein HMPREF1389_10265 [Staphylococcus epidermidis
           NIH06004]
 gi|394297117|gb|EJE40729.1| hypothetical protein HMPREF1388_05408 [Staphylococcus epidermidis
           NIH05003]
 gi|394298610|gb|EJE42175.1| hypothetical protein HMPREF1387_05447 [Staphylococcus epidermidis
           NIH04003]
 gi|394301540|gb|EJE44996.1| hypothetical protein HMPREF1386_05683 [Staphylococcus epidermidis
           NIH051668]
 gi|394304106|gb|EJE47516.1| hypothetical protein HMPREF1385_05672 [Staphylococcus epidermidis
           NIH051475]
 gi|406657581|gb|EKC83966.1| hypothetical protein B440_00608 [Staphylococcus epidermidis
           AU12-03]
          Length = 254

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|407798212|ref|ZP_11145120.1| hypothetical protein OCGS_0193 [Oceaniovalibus guishaninsula
           JLT2003]
 gi|407059648|gb|EKE45576.1| hypothetical protein OCGS_0193 [Oceaniovalibus guishaninsula
           JLT2003]
          Length = 303

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 58/105 (55%), Gaps = 2/105 (1%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPY 402
           + GI+ G+ G+GGGF++ PL   +G+PP V+ AT+   +  SS   V+ ++  +   +  
Sbjct: 23  MVGILSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIVASSVSGVLAHFRRRTVDLRM 82

Query: 403 ALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
            L  +A   + +  G  +   ++  +G+  L++  L +++F+ AI
Sbjct: 83  GLVLLAGGLVGSAAGVQIF-NVLRQMGQVDLMVR-LCYVVFLGAI 125


>gi|420202259|ref|ZP_14707852.1| hypothetical protein HMPREF9979_05842 [Staphylococcus epidermidis
           NIHLM018]
 gi|394269667|gb|EJE14197.1| hypothetical protein HMPREF9979_05842 [Staphylococcus epidermidis
           NIHLM018]
          Length = 254

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|420173231|ref|ZP_14679726.1| hypothetical protein HMPREF9991_07281 [Staphylococcus epidermidis
           NIHLM067]
 gi|420183366|ref|ZP_14689497.1| hypothetical protein HMPREF9987_08245 [Staphylococcus epidermidis
           NIHLM049]
 gi|394240409|gb|EJD85833.1| hypothetical protein HMPREF9991_07281 [Staphylococcus epidermidis
           NIHLM067]
 gi|394249078|gb|EJD94299.1| hypothetical protein HMPREF9987_08245 [Staphylococcus epidermidis
           NIHLM049]
          Length = 254

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|220925134|ref|YP_002500436.1| hypothetical protein Mnod_5288 [Methylobacterium nodulans ORS 2060]
 gi|219949741|gb|ACL60133.1| protein of unknown function DUF81 [Methylobacterium nodulans ORS
           2060]
          Length = 307

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 31/44 (70%)

Query: 347 VGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + G+ G+GGGF+M P+ + LG+PP ++ AT T  +  SS+ SV+
Sbjct: 27  ISGVFGIGGGFLMTPVLIVLGIPPAIAVATQTAPIVASSTTSVL 70


>gi|384921152|ref|ZP_10021141.1| hypothetical protein C357_18457 [Citreicella sp. 357]
 gi|384464952|gb|EIE49508.1| hypothetical protein C357_18457 [Citreicella sp. 357]
          Length = 302

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 2/58 (3%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            G L GI+ G+ G+GGGF++ PL   +G+PP V+ AT    +  SS   V+ +  LKR
Sbjct: 20  LGALVGILSGMFGVGGGFLITPLLFFMGIPPAVAVATGANQIVASSVSGVLAH--LKR 75


>gi|452824321|gb|EME31325.1| hypothetical protein Gasu_15590 [Galdieria sulphuraria]
          Length = 322

 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 1/86 (1%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           FG  AG +  L+G GGG ++ PLF+   G+  + +  T+  A++F+S ++   Y+L  R 
Sbjct: 74  FGFSAGCIASLVGAGGGVVLTPLFVTFYGLTQKQAQGTSLVAVSFNSLVASFIYFLGGRV 133

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKL 424
            +  A +    A I A VG  V  KL
Sbjct: 134 LLLPAFFLTLTAVICARVGAKVTSKL 159


>gi|154246428|ref|YP_001417386.1| hypothetical protein Xaut_2487 [Xanthobacter autotrophicus Py2]
 gi|154160513|gb|ABS67729.1| protein of unknown function DUF81 [Xanthobacter autotrophicus Py2]
          Length = 307

 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 33/55 (60%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G+  G + G+ G+GGGF+M PL +  GVPP V+ A+ +  M  SS    + Y+
Sbjct: 19  AMGLAVGFISGMFGVGGGFLMTPLLIFTGVPPAVAVASVSPYMAASSFSGALSYW 73


>gi|416125430|ref|ZP_11596028.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
 gi|319401027|gb|EFV89246.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
          Length = 254

 Score = 43.1 bits (100), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  +  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|126458964|ref|YP_001055242.1| hypothetical protein Pcal_0341 [Pyrobaculum calidifontis JCM 11548]
 gi|126248685|gb|ABO07776.1| protein of unknown function DUF81 [Pyrobaculum calidifontis JCM
           11548]
          Length = 244

 Score = 43.1 bits (100), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           VLG ++ F G    ++ GVGGG +FVP L L +GFD K+A A S  +I+  AV++
Sbjct: 121 VLGYLLIFAGGLASALFGVGGGTVFVPALVLAMGFDAKAAAASSMGIILPTAVAS 175


>gi|418327069|ref|ZP_12938243.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
 gi|365223948|gb|EHM65221.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
          Length = 254

 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  +  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASAGGAIIATYIPANILKPLIII 108


>gi|347523588|ref|YP_004781158.1| hypothetical protein Pyrfu_1038 [Pyrolobus fumarii 1A]
 gi|343460470|gb|AEM38906.1| protein of unknown function DUF81 [Pyrolobus fumarii 1A]
          Length = 241

 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 4/80 (5%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY----LLK 396
           G+LAGIVG + GLGGG IM PL +  GV P  + + + FA+  +S   ++  +    L  
Sbjct: 6   GLLAGIVGSIAGLGGGAIMVPLLILAGVEPHHAVSASLFAVLGTSLGGIIHLHKEGLLDT 65

Query: 397 RFPVPYALYFVAVATIAAFV 416
           R  V  A   V  A I A+V
Sbjct: 66  RLAVTLATASVTGAVIGAYV 85


>gi|420178011|ref|ZP_14684345.1| hypothetical protein HMPREF9989_07051 [Staphylococcus epidermidis
           NIHLM057]
 gi|420181101|ref|ZP_14687307.1| hypothetical protein HMPREF9988_09900 [Staphylococcus epidermidis
           NIHLM053]
 gi|394247198|gb|EJD92446.1| hypothetical protein HMPREF9989_07051 [Staphylococcus epidermidis
           NIHLM057]
 gi|394247337|gb|EJD92583.1| hypothetical protein HMPREF9988_09900 [Staphylococcus epidermidis
           NIHLM053]
          Length = 254

 Score = 43.1 bits (100), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  +  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|159479540|ref|XP_001697848.1| hypothetical protein CHLREDRAFT_151478 [Chlamydomonas reinhardtii]
 gi|158273946|gb|EDO99731.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 929

 Score = 43.1 bits (100), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 60/113 (53%), Gaps = 1/113 (0%)

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           PL L LGV PQV++AT+   + FSSS +++++ L       YA  F A + +AA  G  V
Sbjct: 816 PLLLHLGVHPQVTAATSGAMVLFSSSTALLQFALAGELNAQYAAVFAAASAVAALAGTLV 875

Query: 421 VRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHEYMGFENLC 473
           V  L+   GR S+++  LA ++ +  +S+   G+          + +GF  LC
Sbjct: 876 VAGLVRRSGRPSIVVLALAGVMGLGLVSVAVFGLQRAAKDLGAGD-IGFSQLC 927


>gi|39933890|ref|NP_946166.1| hypothetical protein RPA0813 [Rhodopseudomonas palustris CGA009]
 gi|192289309|ref|YP_001989914.1| hypothetical protein Rpal_0881 [Rhodopseudomonas palustris TIE-1]
 gi|39647737|emb|CAE26257.1| possible permease [Rhodopseudomonas palustris CGA009]
 gi|192283058|gb|ACE99438.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
           TIE-1]
          Length = 320

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ T  M  SS    + Y+  +R 
Sbjct: 32  AMGAAVGFVSGMFGVGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALSYW--RRR 89

Query: 399 PVPYALYFV 407
            +   L FV
Sbjct: 90  AIDPLLAFV 98


>gi|145483119|ref|XP_001427582.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394664|emb|CAK60184.1| unnamed protein product [Paramecium tetraurelia]
          Length = 491

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 12/88 (13%)

Query: 94  GSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTV-------YYNLKLRHPT 146
           GS+G +GGG +  P L  I+ +D   A  I+ C++  A +  +       Y   +  HP 
Sbjct: 75  GSLGALGGGTVKRPFLQAILNYDASIAGDITSCLMFSAQLVNMIFIFFQKYIMNQFSHPD 134

Query: 147 LD-MPIIDYDLALLIQPMLMLGISIGVA 173
           +D  PII+Y++ L+      LGI I + 
Sbjct: 135 VDERPIINYEIGLI----YTLGIPISMC 158


>gi|337285350|ref|YP_004624824.1| membrane protein, conserved [Pyrococcus yayanosii CH1]
 gi|334901284|gb|AEH25552.1| hypothetical membrane protein, conserved [Pyrococcus yayanosii CH1]
          Length = 254

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 44/87 (50%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + + Y A G++ G +  L GLGGGF++ P    LGV    +  T++ A+ F+S  S  
Sbjct: 1   MLKYLGYFAVGIVIGTLAALFGLGGGFLIVPTLNLLGVEIHKAVGTSSAAIVFTSLSSAY 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y+   R      L   + A + A++G
Sbjct: 61  AYHRQGRIHYKAGLLLASTAIVGAYIG 87


>gi|83950218|ref|ZP_00958951.1| membrane protein [Roseovarius nubinhibens ISM]
 gi|83838117|gb|EAP77413.1| membrane protein [Roseovarius nubinhibens ISM]
          Length = 307

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 25/97 (25%), Positives = 55/97 (56%), Gaps = 1/97 (1%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           + G++ G++ G+ G+GGGF++ PL   +G+PP V+ AT+   +  +S  +++ +   +  
Sbjct: 19  SIGLMVGLLSGIFGVGGGFLITPLLFFIGIPPAVAVATSANQIVAASFSAILAHLRRRTV 78

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLII 435
            +P  L  +    + + VG ++   L+  LG+  L++
Sbjct: 79  DIPMGLALMIGGLLGSTVGIYIF-NLLRSLGQVDLLV 114


>gi|365876063|ref|ZP_09415587.1| hypothetical protein EAAG1_07322 [Elizabethkingia anophelis Ag1]
 gi|442586736|ref|ZP_21005560.1| siroheme synthase [Elizabethkingia anophelis R26]
 gi|365756294|gb|EHM98209.1| hypothetical protein EAAG1_07322 [Elizabethkingia anophelis Ag1]
 gi|442563465|gb|ELR80676.1| siroheme synthase [Elizabethkingia anophelis R26]
          Length = 503

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 39/77 (50%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G LA IV G +G+G G I   + L L VPP V SA+   A +F+++   + +Y L     
Sbjct: 263 GFLAEIVAGSMGMGYGVICTTVLLLLNVPPPVVSASIHSAESFTTAAGSISHYKLGNVNK 322

Query: 401 PYALYFVAVATIAAFVG 417
                 V VA + A +G
Sbjct: 323 KMVWILVPVAILGAIIG 339


>gi|78042783|ref|YP_361264.1| hypothetical protein CHY_2470 [Carboxydothermus hydrogenoformans
           Z-2901]
 gi|77994898|gb|ABB13797.1| putative membrane protein [Carboxydothermus hydrogenoformans
           Z-2901]
          Length = 266

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 341 GVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
           G+  G++G L+G GGGFI+ P L L     PQ ++ T+ F + F++    + Y   KR  
Sbjct: 10  GIFVGMIGTLIGAGGGFILIPYLILVAKFSPQTAAGTSLFMVFFNALSGSIAYIRQKRVD 69

Query: 400 VPYALYFVAVATI 412
              A YF A+ATI
Sbjct: 70  FRTAFYF-ALATI 81



 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 54/112 (48%), Gaps = 2/112 (1%)

Query: 313 YKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQV 372
           ++ R +I ++G+  K    F L       ++ GI+  LLG+GGG I  PL   LG P  +
Sbjct: 134 HETREIIDAEGN--KYEYSFNLWLGIGISLIVGILSSLLGIGGGIIHVPLMGFLGFPMHI 191

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           ++AT+ F +  +S + V+ +           + +   A I A +G  +  K+
Sbjct: 192 ATATSHFILVITSLIGVISHISYGHVVFAKGIAYALGAIIGAQIGAKISAKI 243


>gi|115522742|ref|YP_779653.1| hypothetical protein RPE_0716 [Rhodopseudomonas palustris BisA53]
 gi|115516689|gb|ABJ04673.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
           BisA53]
          Length = 308

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 32/55 (58%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ T  M  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALSYW 73


>gi|268590380|ref|ZP_06124601.1| putative membrane protein [Providencia rettgeri DSM 1131]
 gi|291314292|gb|EFE54745.1| putative membrane protein [Providencia rettgeri DSM 1131]
          Length = 271

 Score = 42.7 bits (99), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 35/69 (50%), Gaps = 12/69 (17%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLEL------------GVPPQVSSATATFAMTFSS 385
           C FG ++GI   L G GGGFI  PL   L             V  Q++ AT+T  M FSS
Sbjct: 6   CLFGFISGITTALFGFGGGFITVPLLYALITLVWGPQSDAGSVAMQIAVATSTCVMIFSS 65

Query: 386 SMSVVEYYL 394
           ++S   +YL
Sbjct: 66  TLSSRAHYL 74


>gi|393762657|ref|ZP_10351283.1| hypothetical protein AGRI_06745 [Alishewanella agri BL06]
 gi|392606279|gb|EIW89164.1| hypothetical protein AGRI_06745 [Alishewanella agri BL06]
          Length = 262

 Score = 42.7 bits (99), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 40/87 (45%), Gaps = 1/87 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C+   +AG +  + G GGG +  P  L  G+PP V   T   A TF S  + V YY  K 
Sbjct: 16  CSVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSLTASVTYYRKKL 74

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
           F   Y    +    I A +G  VV +L
Sbjct: 75  FNPVYWRNSLLFTAIGALLGTLVVDQL 101


>gi|254410195|ref|ZP_05023975.1| conserved domain protein, putative [Coleofasciculus chthonoplastes
           PCC 7420]
 gi|196183231|gb|EDX78215.1| conserved domain protein, putative [Coleofasciculus chthonoplastes
           PCC 7420]
          Length = 267

 Score = 42.7 bits (99), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 39/63 (61%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           QL+ + + G+ AGI+ G LG+GGG ++ PL + LG   Q + AT+T ++  ++    V+ 
Sbjct: 5   QLLIFASAGLFAGILAGFLGIGGGTVLVPLLVTLGYDYQQAVATSTLSIVITAISGTVQN 64

Query: 393 YLL 395
           + L
Sbjct: 65  WRL 67


>gi|125563332|gb|EAZ08712.1| hypothetical protein OsI_30980 [Oryza sativa Indica Group]
          Length = 94

 Score = 42.7 bits (99), Expect = 0.36,   Method: Composition-based stats.
 Identities = 15/23 (65%), Positives = 18/23 (78%)

Query: 52 LLKAVNFLWQPEGSGYHHVWPEL 74
          L K  NFLWQ +G+ YHHVWPE+
Sbjct: 58 LRKVANFLWQTDGNSYHHVWPEM 80


>gi|334341160|ref|YP_004546140.1| hypothetical protein [Desulfotomaculum ruminis DSM 2154]
 gi|334092514|gb|AEG60854.1| protein of unknown function DUF81 [Desulfotomaculum ruminis DSM
           2154]
          Length = 426

 Score = 42.7 bits (99), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 45/96 (46%)

Query: 329 FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS 388
           F  F      A GV AG++ G +G GGGF++ P  + LGV   ++  T  F +   + M 
Sbjct: 69  FTPFMFYGSMAVGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMG 128

Query: 389 VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            V +  L    +P A+ F+  + I    G  + R L
Sbjct: 129 TVIHKKLGNVNIPLAIAFLCGSGIGVTAGGTLNRAL 164


>gi|262277629|ref|ZP_06055422.1| membrane protein [alpha proteobacterium HIMB114]
 gi|262224732|gb|EEY75191.1| membrane protein [alpha proteobacterium HIMB114]
          Length = 351

 Score = 42.7 bits (99), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 32/45 (71%), Gaps = 1/45 (2%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTF 383
           FGVL G++  LLG+GGGFI+ P+ +  +G+P ++   T+ FAM F
Sbjct: 181 FGVLIGLISSLLGVGGGFILVPILIYIIGMPAKLVPGTSLFAMIF 225


>gi|75674383|ref|YP_316804.1| hypothetical protein Nwi_0184 [Nitrobacter winogradskyi Nb-255]
 gi|74419253|gb|ABA03452.1| Protein of unknown function DUF81 [Nitrobacter winogradskyi Nb-255]
          Length = 343

 Score = 42.7 bits (99), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 56/109 (51%), Gaps = 2/109 (1%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G   G V G+ G+GGGF+M PL + +G+ P V+ AT T  +  SS    + Y+  +  
Sbjct: 54  AMGAAVGFVSGMFGIGGGFLMTPLLIFVGIAPAVAVATVTSHIAASSFSGALSYWRKRAV 113

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
               AL  ++   I   +G     +L   LG+  L+I  L++++ +S +
Sbjct: 114 DPMLALVLLSGGVIGTILGVLTFTRLRS-LGQLDLLI-ALSYLVLLSGV 160


>gi|384085444|ref|ZP_09996619.1| hypothetical protein AthiA1_08064 [Acidithiobacillus thiooxidans
           ATCC 19377]
          Length = 304

 Score = 42.7 bits (99), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 44/87 (50%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            +FG+  G + GL G+GGG ++ PL + +GVPP V+  T    +   S++    ++ +  
Sbjct: 18  VSFGLAVGFLSGLTGVGGGILITPLLIFIGVPPLVAVGTGAAQIVGGSAIGSYAHWRMGN 77

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
             +  A   +A +     +G H+ R L
Sbjct: 78  VDMRMAFVLLAGSWTGGLIGVHIARIL 104


>gi|296188121|ref|ZP_06856513.1| membrane protein, putative [Clostridium carboxidivorans P7]
 gi|296047247|gb|EFG86689.1| membrane protein, putative [Clostridium carboxidivorans P7]
          Length = 256

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 65/146 (44%), Gaps = 14/146 (9%)

Query: 329 FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS 388
           F    L+  C  G  A  V  + G GGG I  P FL  GVPP ++  T  F+ T +S  S
Sbjct: 5   FMTMNLIFLCVAGFFAAFVDSIAG-GGGIISVPAFLLAGVPPHITLGTNKFSSTCASFTS 63

Query: 389 VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAIS 448
            +++    +  +    + +    I A +G + V    +++    L + +L  ++FV   S
Sbjct: 64  SLKFMQSGKVDLKILKFLLPFTVIGAVLGVNTV----LIIDAKYLNMIVLTLLLFVGIYS 119

Query: 449 LGGVGISNMIGKFHRHEYMGFENLCK 474
           L     S  IGK  +     F+ LCK
Sbjct: 120 L----FSKSIGKEDK-----FKGLCK 136


>gi|357419022|ref|YP_004932014.1| hypothetical protein Tlie_0171 [Thermovirga lienii DSM 17291]
 gi|355396488|gb|AER65917.1| protein of unknown function DUF81 [Thermovirga lienii DSM 17291]
          Length = 274

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 37/55 (67%), Gaps = 1/55 (1%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLF-LELGVPPQVSSATATFAMTFSSSMSVVEY 392
           AF  L GI+  LLG+GGG I  P+  L +G+P + ++AT+TF +  ++S+S V Y
Sbjct: 161 AFSTLGGIISALLGIGGGPIKVPIMRLVMGLPLKAATATSTFMVGITASVSAVIY 215



 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 79  EIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMI-MGAAVSTVY 137
            ++LG      G    ++ G+GGG I VP++ L++G   K+ATA S  M+ + A+VS V 
Sbjct: 155 RLLLGMAFSTLGGIISALLGIGGGPIKVPIMRLVMGLPLKAATATSTFMVGITASVSAVI 214

Query: 138 YNLK-LRHPTLDMPII 152
           Y  + +  P L +P +
Sbjct: 215 YLFEGMVKPDLAIPAV 230


>gi|365875113|ref|ZP_09414643.1| hypothetical protein EAAG1_02438 [Elizabethkingia anophelis Ag1]
 gi|442589242|ref|ZP_21008050.1| hypothetical protein D505_15488 [Elizabethkingia anophelis R26]
 gi|365757225|gb|EHM99134.1| hypothetical protein EAAG1_02438 [Elizabethkingia anophelis Ag1]
 gi|442560852|gb|ELR78079.1| hypothetical protein D505_15488 [Elizabethkingia anophelis R26]
          Length = 278

 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 61/130 (46%), Gaps = 17/130 (13%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
            A  +  G    LLG+GGG I  P  +E L  P  +++AT+ F +   S++SV+ +Y   
Sbjct: 159 IALSIFVGFFSPLLGIGGGIIHVPAMVEWLQFPVHIATATSHFILAIMSTVSVIVHYFEG 218

Query: 397 RFPVPYALYFVAVATI----AAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGV 452
            +  P  +  +A   +     AF+G +  RK   + G+         F+I   AISL  V
Sbjct: 219 SYNDPKIIKMIAALILGVIPGAFLGAYFSRK---VKGK---------FIIKALAISLALV 266

Query: 453 GISNMIGKFH 462
           GI  +I   H
Sbjct: 267 GIRILIASMH 276


>gi|339446247|ref|YP_004712251.1| hypothetical protein EGYY_28770 [Eggerthella sp. YY7918]
 gi|338905999|dbj|BAK45850.1| hypothetical protein EGYY_28770 [Eggerthella sp. YY7918]
          Length = 285

 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 55/105 (52%), Gaps = 5/105 (4%)

Query: 322 KGDDGK--SFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATAT 378
           + D G   +F   QL+     G+LAG+  G +G+GGGF+M PL L L G+  + +S T+ 
Sbjct: 151 EADTGTPATFSRRQLLMGAGIGLLAGVASGYVGVGGGFLMVPLMLSLVGIGMRQASGTSL 210

Query: 379 FAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATI-AAFVGQHVVR 422
            A+   +   VVE  LL       A   +++ TI  A +G  +VR
Sbjct: 211 IAVMILAVPGVVEQALLGNIDY-MAGIAISIGTIPGALIGAQLVR 254


>gi|422007860|ref|ZP_16354845.1| hypothetical protein OOC_07073 [Providencia rettgeri Dmel1]
 gi|414095995|gb|EKT57654.1| hypothetical protein OOC_07073 [Providencia rettgeri Dmel1]
          Length = 278

 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 35/69 (50%), Gaps = 12/69 (17%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLEL------------GVPPQVSSATATFAMTFSS 385
           C FG ++GI   L G GGGFI  PL   L             V  Q++ AT+T  M FSS
Sbjct: 13  CLFGFISGITTALFGFGGGFITVPLLYTLITLVWGPQSDAGTVAMQIAVATSTCVMIFSS 72

Query: 386 SMSVVEYYL 394
           ++S   +YL
Sbjct: 73  TLSSRAHYL 81


>gi|145514235|ref|XP_001443028.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410389|emb|CAK75631.1| unnamed protein product [Paramecium tetraurelia]
          Length = 473

 Score = 42.7 bits (99), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 65/138 (47%), Gaps = 16/138 (11%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G +AG V G LG+G GF+M P  L  G+ P+ +SAT+ F     S  +++   LL    +
Sbjct: 341 GFMAGFVSGFLGMGAGFVMVPTLLYSGLIPRCASATSAFIYLMISLNNLIT--LLTNHYL 398

Query: 401 PYALYFVAVATIAAFVGQHVVRKL-IILLGRAS------LIIFILAFMIFVSAISLGGVG 453
            +    + + T  A +G  V  K+  ILLG+        LI+F L     +S I  G V 
Sbjct: 399 DWQT--IILFTGLAIIGGSVFAKIGYILLGKYKIGYFVILIVFCLDIANIISQIYYGVV- 455

Query: 454 ISNMIGKFHRHEYMGFEN 471
                GK +  EY+   N
Sbjct: 456 ----FGKRYGLEYLTHAN 469


>gi|359436413|ref|ZP_09226518.1| UPF0721 transmembrane protein HI_0198 [Pseudoalteromonas sp.
           BSi20311]
 gi|358028868|dbj|GAA62767.1| UPF0721 transmembrane protein HI_0198 [Pseudoalteromonas sp.
           BSi20311]
          Length = 258

 Score = 42.7 bits (99), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C   +LAG +  + G GGG +  P  L  G+PP ++  T   A +F S  + + YY  + 
Sbjct: 15  CGVALLAGFIDAIAG-GGGLLTVPALLTAGLPPHLTLGTNKLAASFGSLTASITYYKKQL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   + L  +    I A +G  +V  L I
Sbjct: 74  FNPKFWLASILATAIGALLGTLLVDHLSI 102


>gi|374995932|ref|YP_004971431.1| permease [Desulfosporosinus orientis DSM 765]
 gi|357214298|gb|AET68916.1| putative permease [Desulfosporosinus orientis DSM 765]
          Length = 257

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 15/101 (14%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFA---MTFSSSM 387
           +F LV  C+ G LA +V  + G GGG I  P  L +G+PP ++  T  FA    +F+SS+
Sbjct: 9   LFNLVIICSLGFLAAMVDAIAG-GGGLISLPALLMVGIPPHLALGTNKFAASVASFNSSL 67

Query: 388 SVVE----YYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           +       ++ L ++ +P+ L       + AF+G   V K+
Sbjct: 68  TFARSGKVHFPLVKWQIPFTL-------LGAFLGAWAVLKV 101


>gi|160902447|ref|YP_001568028.1| hypothetical protein Pmob_0983 [Petrotoga mobilis SJ95]
 gi|160360091|gb|ABX31705.1| protein of unknown function DUF81 [Petrotoga mobilis SJ95]
          Length = 246

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 37/69 (53%), Gaps = 8/69 (11%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS--------VVEY 392
           G LA +V G LG+  G I   L L  GVPP +SSA+  FA  F++S+S         V+ 
Sbjct: 11  GFLAQVVDGALGMAYGTISNALLLSAGVPPAISSASVHFAEIFTTSISGFSHLKLGNVDK 70

Query: 393 YLLKRFPVP 401
            L K+  +P
Sbjct: 71  SLFKKLLIP 79


>gi|297583461|ref|YP_003699241.1| hypothetical protein [Bacillus selenitireducens MLS10]
 gi|297141918|gb|ADH98675.1| protein of unknown function DUF81 [Bacillus selenitireducens MLS10]
          Length = 272

 Score = 42.4 bits (98), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 51  YLLKAVNFLWQPEGSGYH---HVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVP 107
           +L K V  L  P G   +   H   E  +G++ V+G ++GF   +   + G+GGG + VP
Sbjct: 122 WLKKHVRPLSVPVGRNLYWTDHQGVEWTYGYQPVIGIMLGFIVGSISGLFGIGGGSLMVP 181

Query: 108 MLSLIIGFDPKSATAISKCMIMGAAV 133
            + L+  F P  A A S  MIM +A+
Sbjct: 182 AMILLFAFPPHLAVATSMFMIMLSAI 207


>gi|240102617|ref|YP_002958926.1| putative permease [Thermococcus gammatolerans EJ3]
 gi|239910171|gb|ACS33062.1| Predicted permease [Thermococcus gammatolerans EJ3]
          Length = 243

 Score = 42.4 bits (98), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 38/56 (67%), Gaps = 1/56 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSMSVVEYYLL 395
           G+L+G+V GLLG+ GG +  PLF   +G+P + +  T++FA+ F++    +E+Y L
Sbjct: 137 GILSGLVSGLLGISGGVLNVPLFHTFVGIPMRYAVGTSSFALFFTALAGAIEHYRL 192


>gi|406672837|ref|ZP_11080062.1| hypothetical protein HMPREF9700_00604 [Bergeyella zoohelcum CCUG
           30536]
 gi|405587381|gb|EKB61109.1| hypothetical protein HMPREF9700_00604 [Bergeyella zoohelcum CCUG
           30536]
          Length = 263

 Score = 42.4 bits (98), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 63/123 (51%), Gaps = 6/123 (4%)

Query: 303 VVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP- 361
           V+ L+ ++ + K  R I  K + GK      L+S    G+L GI+ G +G GGGF++ P 
Sbjct: 118 VLMLWSSLKMIKKDRSI--KKNIGKENNYTLLISQ---GLLVGIIMGFIGAGGGFLIVPA 172

Query: 362 LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVV 421
           L +  G+  + + AT+ F +  +S++  +          P+ L F  ++ I  F+G  + 
Sbjct: 173 LVMLFGMNMREAVATSLFIIMINSAVGFISSLDKIAIDWPFLLVFSGLSIIGIFIGMLIA 232

Query: 422 RKL 424
           +K+
Sbjct: 233 KKM 235


>gi|255020887|ref|ZP_05292943.1| protein of unknown function DUF81 [Acidithiobacillus caldus ATCC
           51756]
 gi|254969678|gb|EET27184.1| protein of unknown function DUF81 [Acidithiobacillus caldus ATCC
           51756]
          Length = 303

 Score = 42.4 bits (98), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 45/87 (51%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            AFG+L G + GL G+GGGF++ PL + +GVPP ++  T    +  +S++    ++ +  
Sbjct: 18  VAFGLLVGFLSGLTGVGGGFLITPLLIFVGVPPLIAVGTGAAQIVGASAVGSYAHWRMGN 77

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
             +  AL  +  +      G  V R L
Sbjct: 78  VDMRMALILLLGSWTGGLAGVRVARWL 104


>gi|423315709|ref|ZP_17293614.1| hypothetical protein HMPREF9699_00185 [Bergeyella zoohelcum ATCC
           43767]
 gi|405585813|gb|EKB59616.1| hypothetical protein HMPREF9699_00185 [Bergeyella zoohelcum ATCC
           43767]
          Length = 263

 Score = 42.4 bits (98), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 63/123 (51%), Gaps = 6/123 (4%)

Query: 303 VVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP- 361
           V+ L+ ++ + K  R I  K + GK      L+S    G+L GI+ G +G GGGF++ P 
Sbjct: 118 VLMLWSSLKMIKKDRSI--KKNIGKENNYTLLISQ---GLLVGIIMGFIGAGGGFLIVPA 172

Query: 362 LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVV 421
           L +  G+  + + AT+ F +  +S++  +          P+ L F  ++ I  F+G  + 
Sbjct: 173 LVMLFGMNMREAVATSLFIIMINSAVGFISSLDKIAIDWPFLLVFSGLSIIGIFIGMLIA 232

Query: 422 RKL 424
           +K+
Sbjct: 233 KKM 235


>gi|316932351|ref|YP_004107333.1| hypothetical protein [Rhodopseudomonas palustris DX-1]
 gi|315600065|gb|ADU42600.1| protein of unknown function DUF81 [Rhodopseudomonas palustris DX-1]
          Length = 320

 Score = 42.4 bits (98), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 32/55 (58%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ T  M  SS    + Y+
Sbjct: 32  AMGAAVGFVSGMFGVGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALSYW 86


>gi|340781456|ref|YP_004748063.1| hypothetical protein Atc_0714 [Acidithiobacillus caldus SM-1]
 gi|340555609|gb|AEK57363.1| conserved hypothetical protein [Acidithiobacillus caldus SM-1]
          Length = 303

 Score = 42.4 bits (98), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 45/87 (51%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            AFG+L G + GL G+GGGF++ PL + +GVPP ++  T    +  +S++    ++ +  
Sbjct: 18  VAFGLLVGFLSGLTGVGGGFLITPLLIFVGVPPLIAVGTGAAQIVGASAVGSYAHWRMGN 77

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
             +  AL  +  +      G  V R L
Sbjct: 78  VDMRMALILLLGSWTGGLAGVRVARWL 104


>gi|254475159|ref|ZP_05088545.1| membrane protein [Ruegeria sp. R11]
 gi|214029402|gb|EEB70237.1| membrane protein [Ruegeria sp. R11]
          Length = 307

 Score = 42.4 bits (98), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 2/105 (1%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPY 402
           + GI+ G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++  +   +  
Sbjct: 23  MVGILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDIKM 82

Query: 403 ALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
            L       I A +G  VV   +  LG+  L++  L +++F+  +
Sbjct: 83  GLVLQVGGLIGAALGV-VVFNYLKALGQVDLLVK-LCYVVFLGVV 125


>gi|84500291|ref|ZP_00998557.1| membrane protein [Oceanicola batsensis HTCC2597]
 gi|84392225|gb|EAQ04493.1| membrane protein [Oceanicola batsensis HTCC2597]
          Length = 306

 Score = 42.4 bits (98), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 38/57 (66%), Gaps = 1/57 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G + G++ G+ G+GGGF++ PL   +G+PP V+ AT+   +  +SS+S V  +L +R
Sbjct: 21  GGIVGVLSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIV-ASSVSAVLAHLRRR 76


>gi|425737948|ref|ZP_18856217.1| permease [Staphylococcus massiliensis S46]
 gi|425480853|gb|EKU48016.1| permease [Staphylococcus massiliensis S46]
          Length = 273

 Score = 42.4 bits (98), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 7/103 (6%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           C+F  L GI+ GL G+GGG +M PL L     PP V+  T+   + FSS  S   + L  
Sbjct: 162 CSF--LIGIITGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIFFSSLTSSFGHILQG 219

Query: 397 RFPVPYALYFVAVATIAAFVGQHVVRKL----IILLGRASLII 435
                YAL  V  + I A +G  + R +    ++LL R  +I+
Sbjct: 220 HVDWFYALILVISSYIGAKIGVRINRAVKSDTVVLLLRTVMIM 262


>gi|389807654|ref|ZP_10204191.1| Sulfate transporter, CysZ-type [Rhodanobacter thiooxydans LCS2]
 gi|388443779|gb|EIL99914.1| Sulfate transporter, CysZ-type [Rhodanobacter thiooxydans LCS2]
          Length = 250

 Score = 42.4 bits (98), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 59/114 (51%), Gaps = 4/114 (3%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           + +++ A G LA +V G LG+  G +   + L LG+PP V+SA+  +A TF+   S + +
Sbjct: 6   EFLTFAAVGALAQLVDGALGMAYGVVSAAMLLGLGLPPAVASASVHYAETFTCGASGLSH 65

Query: 393 YLLKRFPVPYALYFVAV--ATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
             L    V   L++       I A +G +VV  +     R +L I++L   +F+
Sbjct: 66  --LAAGNVRRQLFWTLAIPGAIGAVIGAYVVIHVPADGMRLALTIYMLGMGVFL 117


>gi|215740460|dbj|BAG97116.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 109

 Score = 42.4 bits (98), Expect = 0.53,   Method: Composition-based stats.
 Identities = 32/38 (84%), Positives = 36/38 (94%)

Query: 91  AAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMI 128
           AAFGSVGGVGGGGIFVPML+LIIGFD KS+TAISK ++
Sbjct: 58  AAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAISKFIV 95


>gi|389797460|ref|ZP_10200502.1| putative transmembrane protein [Rhodanobacter sp. 116-2]
 gi|388447423|gb|EIM03431.1| putative transmembrane protein [Rhodanobacter sp. 116-2]
          Length = 250

 Score = 42.4 bits (98), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 53/112 (47%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           + +S+ A G  A +V G LG+  G +   + L LG+PP V+SA+  +A TF+   S + +
Sbjct: 6   EFLSFAAVGAFAQLVDGALGMAYGVVSAAMLLGLGLPPAVASASVHYAETFTCGASGLSH 65

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
                              I A +G +VV  +     R +L +++L   +F+
Sbjct: 66  LAAGNVRRQLFWTLAVPGAIGAVIGAYVVSHVPADGMRLALTVYMLGMGVFL 117


>gi|171185687|ref|YP_001794606.1| hypothetical protein Tneu_1233 [Pyrobaculum neutrophilum V24Sta]
 gi|170934899|gb|ACB40160.1| protein of unknown function DUF81 [Pyrobaculum neutrophilum V24Sta]
          Length = 242

 Score = 42.4 bits (98), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 36/69 (52%), Gaps = 3/69 (4%)

Query: 68  HHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCM 127
            +V P     W   LG  + F G    S+ GVGGG +FVP L L  G D K A A+S  +
Sbjct: 111 KNVEPRRNAPW---LGPPLVFIGGFASSLFGVGGGTVFVPTLMLTSGLDAKRAAAMSMGI 167

Query: 128 IMGAAVSTV 136
           I   AVS+V
Sbjct: 168 IFPTAVSSV 176


>gi|88608664|ref|YP_506134.1| hypothetical protein NSE_0240 [Neorickettsia sennetsu str.
           Miyayama]
 gi|88600833|gb|ABD46301.1| putative membrane protein [Neorickettsia sennetsu str. Miyayama]
          Length = 313

 Score = 42.4 bits (98), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 37/149 (24%), Positives = 69/149 (46%), Gaps = 12/149 (8%)

Query: 45  ERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGI 104
           E+   N L+K +N L  P    +H    ++      + GT++G       S+ GVGGG I
Sbjct: 155 EKPYKNKLIKLINRL--PLRYHFHSAKGDISLVALALFGTVVG----VILSISGVGGGFI 208

Query: 105 FVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPML 164
            VP+L  +     + A   S    +  +V+TV+++       + +  +D  L  L+    
Sbjct: 209 LVPVLMYVFNLPVRIAIGTSVAQSVLVSVATVFFH------AITLGTVDMLLGFLLSIGA 262

Query: 165 MLGISIGVAFNVIFADWMVTVLLIVLFIG 193
           + G++ G   N+IF   ++ +LL ++  G
Sbjct: 263 ICGVTFGAKLNLIFHPVVIRLLLALVMFG 291


>gi|332185052|ref|ZP_08386801.1| hypothetical protein SUS17_418 [Sphingomonas sp. S17]
 gi|332014776|gb|EGI56832.1| hypothetical protein SUS17_418 [Sphingomonas sp. S17]
          Length = 253

 Score = 42.4 bits (98), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 39/73 (53%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
           D  SF +  ++ + A G  A +V G LG+  G I   L + LGVPP  +SA      TF+
Sbjct: 2   DLSSFDLIHILPFVAVGFAAQVVDGALGMAFGVISNTLLMSLGVPPAAASAGVHSVETFT 61

Query: 385 SSMSVVEYYLLKR 397
           +++S + + + + 
Sbjct: 62  TAVSGISHAVHRN 74


>gi|399991606|ref|YP_006571846.1| hypothetical protein PGA1_c03930 [Phaeobacter gallaeciensis DSM
           17395 = CIP 105210]
 gi|398656161|gb|AFO90127.1| hypothetical protein PGA1_c03930 [Phaeobacter gallaeciensis DSM
           17395 = CIP 105210]
          Length = 307

 Score = 42.4 bits (98), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 32/51 (62%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           + GI+ G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++
Sbjct: 23  MVGILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 73


>gi|400753245|ref|YP_006561613.1| hypothetical protein PGA2_c03480 [Phaeobacter gallaeciensis 2.10]
 gi|398652398|gb|AFO86368.1| hypothetical protein PGA2_c03480 [Phaeobacter gallaeciensis 2.10]
          Length = 307

 Score = 42.4 bits (98), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 32/51 (62%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           + GI+ G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++
Sbjct: 23  MVGILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 73


>gi|254510226|ref|ZP_05122293.1| membrane protein [Rhodobacteraceae bacterium KLH11]
 gi|221533937|gb|EEE36925.1| membrane protein [Rhodobacteraceae bacterium KLH11]
          Length = 321

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 32/51 (62%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           + GI+ G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++
Sbjct: 38  MVGILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 88


>gi|254467336|ref|ZP_05080747.1| permease [Rhodobacterales bacterium Y4I]
 gi|206688244|gb|EDZ48726.1| permease [Rhodobacterales bacterium Y4I]
          Length = 285

 Score = 42.0 bits (97), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 56/105 (53%), Gaps = 2/105 (1%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPY 402
           + G++ G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++  +   +  
Sbjct: 1   MVGVLSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDIKM 60

Query: 403 ALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
            L   A   + A +G  VV   +  LG+  L++  L +++F+  +
Sbjct: 61  GLVLQAGGLMGAALGV-VVFNYLKALGQVDLLVK-LCYVVFLGVV 103


>gi|84685127|ref|ZP_01013026.1| membrane protein [Maritimibacter alkaliphilus HTCC2654]
 gi|84666859|gb|EAQ13330.1| membrane protein [Rhodobacterales bacterium HTCC2654]
          Length = 304

 Score = 42.0 bits (97), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G+ G+GGGF+M PL   +G+PP V+ AT T  +  SS   V+ +  LKR
Sbjct: 31  GMFGVGGGFLMTPLLFMIGIPPAVAVATGTNQIVASSVSGVLAH--LKR 77


>gi|260431503|ref|ZP_05785474.1| putative membrane protein [Silicibacter lacuscaerulensis ITI-1157]
 gi|260415331|gb|EEX08590.1| putative membrane protein [Silicibacter lacuscaerulensis ITI-1157]
          Length = 306

 Score = 42.0 bits (97), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 33/54 (61%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           + GI+ G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++  K
Sbjct: 23  MVGILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRK 76


>gi|420163862|ref|ZP_14670596.1| hypothetical protein HMPREF9995_07828 [Staphylococcus epidermidis
           NIHLM095]
 gi|420168565|ref|ZP_14675173.1| hypothetical protein HMPREF9993_07496 [Staphylococcus epidermidis
           NIHLM087]
 gi|394232988|gb|EJD78599.1| hypothetical protein HMPREF9995_07828 [Staphylococcus epidermidis
           NIHLM095]
 gi|394233274|gb|EJD78882.1| hypothetical protein HMPREF9993_07496 [Staphylococcus epidermidis
           NIHLM087]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 52/98 (53%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
             FG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VTFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|346994770|ref|ZP_08862842.1| membrane protein [Ruegeria sp. TW15]
          Length = 306

 Score = 42.0 bits (97), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 32/51 (62%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           + GI+ G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++
Sbjct: 23  MVGILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 73


>gi|328952971|ref|YP_004370305.1| hypothetical protein Desac_1265 [Desulfobacca acetoxidans DSM
           11109]
 gi|328453295|gb|AEB09124.1| protein of unknown function DUF81 [Desulfobacca acetoxidans DSM
           11109]
          Length = 311

 Score = 42.0 bits (97), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 33/47 (70%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSS 386
            GVL G + G+ G+GGGF++ PL + +G+PP V++A+ +  +  ++S
Sbjct: 20  LGVLVGFLSGMFGVGGGFLLTPLMMMVGIPPAVAAASDSNQIVAAAS 66


>gi|228942852|ref|ZP_04105369.1| permease [Bacillus thuringiensis serovar berliner ATCC 10792]
 gi|228976756|ref|ZP_04137173.1| permease [Bacillus thuringiensis serovar thuringiensis str. T01001]
 gi|228782969|gb|EEM31131.1| permease [Bacillus thuringiensis serovar thuringiensis str. T01001]
 gi|228816808|gb|EEM62916.1| permease [Bacillus thuringiensis serovar berliner ATCC 10792]
          Length = 282

 Score = 42.0 bits (97), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 55/99 (55%), Gaps = 7/99 (7%)

Query: 337 YCAFGVL--AGIVGGLLGLGGG-FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           Y  FGV+  AG++ GLLG+G G F +  + + + +P +VSSAT+ F M  +++ S   Y+
Sbjct: 166 YGGFGVMYGAGVISGLLGIGSGSFKVMAMDVFMKLPLKVSSATSNFMMGVTAAASASVYF 225

Query: 394 LL----KRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
           L      +   P A+  +  ATI A + Q +  K I +L
Sbjct: 226 LRGDIDPKISAPVAIGVLIGATIGAKIMQRLKSKTIRML 264


>gi|375111110|ref|ZP_09757321.1| hypothetical protein AJE_14130 [Alishewanella jeotgali KCTC 22429]
 gi|374568652|gb|EHR39824.1| hypothetical protein AJE_14130 [Alishewanella jeotgali KCTC 22429]
          Length = 262

 Score = 42.0 bits (97), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 49/110 (44%), Gaps = 6/110 (5%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C    +AG +  + G GGG +  P  L  G+PP V   T   A TF S  + + YY  + 
Sbjct: 16  CGVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLTYYRKQL 74

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
           F   Y    +    I A +G  VV +L       +L+  IL  +I  +AI
Sbjct: 75  FNPHYWRNSLVFTAIGALLGTLVVDQL-----STALLEKILPLLILFAAI 119


>gi|418412109|ref|ZP_12985374.1| hypothetical protein HMPREF9281_00978 [Staphylococcus epidermidis
           BVS058A4]
 gi|420184668|ref|ZP_14690777.1| hypothetical protein HMPREF9986_02847 [Staphylococcus epidermidis
           NIHLM040]
 gi|394257319|gb|EJE02241.1| hypothetical protein HMPREF9986_02847 [Staphylococcus epidermidis
           NIHLM040]
 gi|410890123|gb|EKS37923.1| hypothetical protein HMPREF9281_00978 [Staphylococcus epidermidis
           BVS058A4]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 52/98 (53%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
             FG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VTFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++  ++++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLIII 108


>gi|56695290|ref|YP_165638.1| hypothetical protein SPO0375 [Ruegeria pomeroyi DSS-3]
 gi|56677027|gb|AAV93693.1| membrane protein [Ruegeria pomeroyi DSS-3]
          Length = 316

 Score = 42.0 bits (97), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 32/51 (62%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           + GI+ G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++
Sbjct: 33  MVGILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 83


>gi|397169771|ref|ZP_10493201.1| hypothetical protein AEST_09670 [Alishewanella aestuarii B11]
 gi|396088666|gb|EJI86246.1| hypothetical protein AEST_09670 [Alishewanella aestuarii B11]
          Length = 260

 Score = 42.0 bits (97), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 49/110 (44%), Gaps = 6/110 (5%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C    +AG +  + G GGG +  P  L  G+PP V   T   A TF S  + + YY  + 
Sbjct: 14  CGVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLTYYRKQL 72

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
           F   Y    +    I A +G  VV +L       +L+  IL  +I  +AI
Sbjct: 73  FNPHYWRNSLVFTAIGALLGTLVVDQL-----STALLEKILPLLILFAAI 117


>gi|375337107|ref|ZP_09778451.1| hypothetical protein SbacW_09094 [Succinivibrionaceae bacterium
           WG-1]
          Length = 273

 Score = 42.0 bits (97), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 46/170 (27%), Positives = 69/170 (40%), Gaps = 22/170 (12%)

Query: 52  LLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSL 111
           LL A N +            P LK      +  II FF  +   + G+GGG + VP L+L
Sbjct: 111 LLAAANMILFANKKALFDSLPSLK------IQRIIAFFIGSISVMLGIGGGTLTVPTLTL 164

Query: 112 IIGFDPKSATAISKC----MIMGAAVSTVYYNLKL-----------RHPTLDMPIIDYDL 156
              FDPK A   + C    + +  AV T+  +L L             P L +  + +  
Sbjct: 165 -FNFDPKKAIGSAACIGFLISIPGAVVTIITDLMLYCFDDNFTKAHNAPPLTLGHVCFLA 223

Query: 157 ALLIQPMLMLGISIGVAFNVIFADWMVTVLLIVLFIGTSTKAFLKGVETW 206
            + I P  M    +GV  N +     +  L  +L I T TK  L G++ +
Sbjct: 224 VIAIIPFSMAVAPLGVKVNKLLNPITIKRLFALLLIFTGTKMLLGGLQIF 273


>gi|452202437|ref|YP_007482722.1| hypothetical protein H175_328p047 [Bacillus thuringiensis serovar
           thuringiensis str. IS5056]
 gi|452109647|gb|AGG05380.1| hypothetical protein H175_328p047 [Bacillus thuringiensis serovar
           thuringiensis str. IS5056]
          Length = 276

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 55/99 (55%), Gaps = 7/99 (7%)

Query: 337 YCAFGVL--AGIVGGLLGLGGG-FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           Y  FGV+  AG++ GLLG+G G F +  + + + +P +VSSAT+ F M  +++ S   Y+
Sbjct: 160 YGGFGVMYGAGVISGLLGIGSGSFKVMAMDVFMKLPLKVSSATSNFMMGVTAAASASVYF 219

Query: 394 LL----KRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
           L      +   P A+  +  ATI A + Q +  K I +L
Sbjct: 220 LRGDIDPKISAPVAIGVLIGATIGAKIMQRLKSKTIRML 258


>gi|385331945|ref|YP_005885896.1| membrane protein [Marinobacter adhaerens HP15]
 gi|311695095|gb|ADP97968.1| membrane protein containing DUF81 [Marinobacter adhaerens HP15]
          Length = 248

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 1/90 (1%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           V ++V+  + G +AG +  +L  GG  +  PL + LG+PPQV++ T   A+T  S  +V 
Sbjct: 3   VLEIVALLSIGGIAGFIN-VLSAGGSMLTLPLLMFLGLPPQVANGTNRVAITLQSITAVG 61

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
            +Y +    +  +L+    A + + VG  V
Sbjct: 62  SFYRMGHGNLVVSLHLAIPAVLGSLVGAWV 91


>gi|86751708|ref|YP_488204.1| hypothetical protein RPB_4610 [Rhodopseudomonas palustris HaA2]
 gi|86574736|gb|ABD09293.1| Protein of unknown function DUF81 [Rhodopseudomonas palustris HaA2]
          Length = 307

 Score = 42.0 bits (97), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G   G V G+ G+GGGF++ PL + +G+ P V+ A+ T  M  SS    + Y+  +R 
Sbjct: 19  AMGAAVGFVSGMFGVGGGFLLTPLLIFVGIAPAVAVASVTSHMAASSFSGALSYW--RRR 76

Query: 399 PVPYALYFV 407
            +   L FV
Sbjct: 77  AIDPMLAFV 85


>gi|359446644|ref|ZP_09236295.1| UPF0721 transmembrane protein HI_0198 [Pseudoalteromonas sp.
           BSi20439]
 gi|358039450|dbj|GAA72544.1| UPF0721 transmembrane protein HI_0198 [Pseudoalteromonas sp.
           BSi20439]
          Length = 258

 Score = 42.0 bits (97), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C   +LAG +  + G GGG +  P  L  G+PP ++  T   A +F S  + + YY  + 
Sbjct: 15  CGVALLAGFIDAIAG-GGGLLTVPALLTAGLPPHLTLGTNKLAASFGSLTASITYYKKQL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   +    +    I A +G  +V  L I
Sbjct: 74  FNPRFWFASIMATAIGALLGTLIVDHLSI 102


>gi|408381578|ref|ZP_11179127.1| permease [Methanobacterium formicicum DSM 3637]
 gi|407816045|gb|EKF86608.1| permease [Methanobacterium formicicum DSM 3637]
          Length = 267

 Score = 41.6 bits (96), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 52/109 (47%), Gaps = 1/109 (0%)

Query: 316 RRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSA 375
           ++  +SK D     R  Q ++   FGV +G++ G+ G+ G   +      LG+P      
Sbjct: 133 KKETSSKNDPVVLTRPRQFLA-SLFGVASGLLAGIFGISGTPPVSAGLYSLGLPAMTVVG 191

Query: 376 TATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           T  F + F+S   +  Y+LL R  +   +     A + AF+G  +++K+
Sbjct: 192 TTVFVLIFNSLTGIGGYFLLGRLDITLIILLAGGAAVGAFIGPKLLKKI 240


>gi|418327458|ref|ZP_12938613.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
 gi|418630881|ref|ZP_13193353.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
 gi|365232948|gb|EHM73921.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
 gi|374836191|gb|EHR99779.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
          Length = 254

 Score = 41.6 bits (96), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL--- 394
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++     
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 395 --LKRFPVPYALYFVAV---ATIAAFVGQHVVRKLIIL 427
             LK     + L FVA    A IA ++   +++ LII+
Sbjct: 71  VDLKIVGKMFPLIFVASGGGAIIATYIPAIILKPLIII 108


>gi|315127106|ref|YP_004069109.1| hypothetical protein PSM_A2037 [Pseudoalteromonas sp. SM9913]
 gi|315015620|gb|ADT68958.1| hypothetical protein PSM_A2037 [Pseudoalteromonas sp. SM9913]
          Length = 258

 Score = 41.6 bits (96), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C   +LAG +  + G GGG +  P  L  G+PP ++  T   A +F S  + + YY  + 
Sbjct: 15  CGVALLAGFIDAIAG-GGGLLTVPALLTAGLPPHLTLGTNKLAASFGSLTASITYYKKQL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   +    +    I A +G  +V  L I
Sbjct: 74  FNPRFWFASIMATAIGALLGTIIVDHLSI 102


>gi|410463419|ref|ZP_11316934.1| putative permease [Desulfovibrio magneticus str. Maddingley MBC34]
 gi|409983446|gb|EKO39820.1| putative permease [Desulfovibrio magneticus str. Maddingley MBC34]
          Length = 313

 Score = 41.6 bits (96), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 31/42 (73%)

Query: 345 GIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSS 386
           G++ G+ G+GGGF+M PL + +G+PP V++AT +  +  +S+
Sbjct: 29  GLLSGIFGVGGGFLMTPLLIMMGIPPTVAAATDSNQIVGAST 70


>gi|372324344|ref|ZP_09518933.1| hypothetical protein OKIT_0431 [Oenococcus kitaharae DSM 17330]
 gi|366983152|gb|EHN58551.1| hypothetical protein OKIT_0431 [Oenococcus kitaharae DSM 17330]
          Length = 283

 Score = 41.6 bits (96), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           L++Y   GVLAGI G +LGLGGG I+ P L L   +P   + A +  A+  +SS + V Y
Sbjct: 6   LITYILIGVLAGIFGSILGLGGGMIVTPILVLAFNLPIHYAIAASIIAVIGTSSGASVAY 65

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVV 421
                  V  A++     TI A +G  +V
Sbjct: 66  LRDDLLNVRVAMFLEIFTTIGALIGAVLV 94


>gi|392554231|ref|ZP_10301368.1| hypothetical protein PundN2_02215 [Pseudoalteromonas undina NCIMB
           2128]
          Length = 258

 Score = 41.6 bits (96), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C   +LAG +  + G GGG +  P  L  G+PP ++  T   A +F S  + + YY  + 
Sbjct: 15  CGVALLAGFIDAIAG-GGGLLTVPALLTAGLPPHLTLGTNKLAASFGSLTASITYYKKQL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   +    +    I A +G  +V  L I
Sbjct: 74  FNPRFWFASIIATAIGALLGTLIVDHLSI 102


>gi|294954881|ref|XP_002788341.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239903653|gb|EER20137.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 318

 Score = 41.6 bits (96), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 59/243 (24%), Positives = 103/243 (42%), Gaps = 45/243 (18%)

Query: 170 IGVAFNVIFADWMVTVLLIVLFIGT----STKAFLKGVETWKRETILKKEAARCLGSNGA 225
           +G    V    W+  +  +VL++ T    S   F KG+  WK ET    E A   G  G 
Sbjct: 7   VGGLLGVYLYTWLPEIFQLVLYVFTAILASCMGFKKGIGLWKSET----EGASSAGQLGV 62

Query: 226 GAGEVEYKSLPSGPRSGPQKDAMEPEVT-----ILENINWKE---LGLLVFVWVAFLGLQ 277
                        P     +D + P VT     I  ++ +K+   +  L+ VW+A +  +
Sbjct: 63  V------------PPPCVTEDTVLPSVTSRARSISLSLKYKKAILITTLLAVWIAVILSR 110

Query: 278 IAKNHTASCSIV--------YWVLD------LLQIPVSLVVSLYEAISLYKGRRVIASKG 323
           +    +++ SI+        YW L       L+ +P   VV++  A  L    ++  +  
Sbjct: 111 LLLGSSSTPSIIGIPYCTGLYWALSVIVCIMLMAVPGLFVVAIKSAAMLKLAVKLSGAML 170

Query: 324 DDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTF 383
            +  + +   LV     G+  G +  L+G GGG ++ PL L + + PQ ++AT +  M  
Sbjct: 171 CNATA-KSSALVLGNIIGI--GFIAALVGQGGGSLITPLLLYMELNPQQAAATGSVVMLI 227

Query: 384 SSS 386
           +SS
Sbjct: 228 TSS 230


>gi|167624649|ref|YP_001674943.1| hypothetical protein Shal_2731 [Shewanella halifaxensis HAW-EB4]
 gi|167354671|gb|ABZ77284.1| protein of unknown function DUF81 [Shewanella halifaxensis HAW-EB4]
          Length = 257

 Score = 41.6 bits (96), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 8/98 (8%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
           D  S+ +  LV     G +AG +  + G GGG +  P  L +GV P ++  T   A  F 
Sbjct: 6   DPSSWAILALV-----GFIAGFIDAVSG-GGGLLSIPALLTMGVSPHMALGTNKLAACFG 59

Query: 385 SSMSVVEYYLLKRFPVPYALYFVAVAT-IAAFVGQHVV 421
           SSM+   YY  + F  P   Y   +AT I A +G  +V
Sbjct: 60  SSMAAYTYYKQRLFS-PSLWYHTFIATFIGAVIGTFIV 96


>gi|157374704|ref|YP_001473304.1| hypothetical protein Ssed_1565 [Shewanella sediminis HAW-EB3]
 gi|157317078|gb|ABV36176.1| protein of unknown function DUF81 [Shewanella sediminis HAW-EB3]
          Length = 257

 Score = 41.6 bits (96), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG +  + G GGG +  P  L +G+PP ++  T   A  F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVAG-GGGLLSIPALLTVGIPPHLALGTNKLAACFGSSMAAFTYYRKNLF-T 74

Query: 401 PYALYFVAVAT-IAAFVGQHVV 421
           P   Y   +AT I A  G  +V
Sbjct: 75  PNLWYHTFIATFIGAVSGTFIV 96


>gi|99078076|ref|YP_611334.1| hypothetical protein TM1040_3098 [Ruegeria sp. TM1040]
 gi|99035214|gb|ABF62072.1| protein of unknown function DUF81 [Ruegeria sp. TM1040]
          Length = 330

 Score = 41.6 bits (96), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 37/58 (63%), Gaps = 1/58 (1%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            G + G++ G+ G+GGGF++ PL   +G+PP V+ AT    +  +SS+S V  +L +R
Sbjct: 45  LGGMVGVLSGMFGVGGGFLLTPLLFFIGIPPAVAVATGANQIV-ASSVSGVLAHLRRR 101


>gi|414176339|ref|ZP_11430568.1| hypothetical protein HMPREF9695_04214 [Afipia broomeae ATCC 49717]
 gi|410886492|gb|EKS34304.1| hypothetical protein HMPREF9695_04214 [Afipia broomeae ATCC 49717]
          Length = 308

 Score = 41.6 bits (96), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 32/55 (58%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ +  M  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFVGISPAVAVASVSSHMAASSFSGALSYW 73


>gi|407462768|ref|YP_006774085.1| hypothetical protein NKOR_06310 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407046390|gb|AFS81143.1| hypothetical protein NKOR_06310 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 120

 Score = 41.6 bits (96), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 34/57 (59%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G  AG++G ++GLGGG I+ P+   LG PP  +++ + FA   ++  S V Y   +R
Sbjct: 11  GFAAGVLGSMIGLGGGIIVVPVLTFLGFPPIAAASNSLFAALSNAVASTVSYSRQRR 67


>gi|224476019|ref|YP_002633625.1| hypothetical protein Sca_0526 [Staphylococcus carnosus subsp.
           carnosus TM300]
 gi|222420626|emb|CAL27440.1| putative membrane protein [Staphylococcus carnosus subsp. carnosus
           TM300]
          Length = 274

 Score = 41.6 bits (96), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 59/108 (54%), Gaps = 12/108 (11%)

Query: 289 VYWVLDLLQIPVSLVVSLY-EAISLYKGRRVIASK-GDDGKSFR-----VFQLVSYCAFG 341
           +Y+ + L+ + + L+V  Y + ISL++ ++   +   +DG ++      VF    +  FG
Sbjct: 109 LYFGIFLIFVSIILMVRYYIKPISLFEKKKYERTYIANDGTTYHYHVPPVFAF--FATFG 166

Query: 342 VLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSMS 388
           +  GI+ GL G+GGG +M PL L     PP V+  T+   + FSS MS
Sbjct: 167 I--GILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIFFSSVMS 212


>gi|333897807|ref|YP_004471681.1| hypothetical protein Thexy_1991 [Thermoanaerobacterium
           xylanolyticum LX-11]
 gi|333113072|gb|AEF18009.1| protein of unknown function DUF81 [Thermoanaerobacterium
           xylanolyticum LX-11]
          Length = 252

 Score = 41.6 bits (96), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 66/135 (48%), Gaps = 15/135 (11%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMT---FSSSMSVV 390
           ++S C  G +A  +  + G GGG I  P  + LGVPP  +  T  FA T   F+SSM+ +
Sbjct: 6   VLSLCVIGFIAAFIDSIAG-GGGIISLPGLMVLGVPPAYALGTNKFASTCASFTSSMTFI 64

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLG 450
           +Y   K + +    Y V    I A +G     K ++ L  + L I I+  MIFV+  +L 
Sbjct: 65  KY---KVYDINLLKYLVFGTLIGAILGV----KAVLSLDSSKLRIIIIILMIFVAIYTL- 116

Query: 451 GVGISNMIGKFHRHE 465
              +S  +G  +  E
Sbjct: 117 ---LSKNVGSVNNFE 128


>gi|409095075|ref|ZP_11215099.1| permease [Thermococcus zilligii AN1]
          Length = 254

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 43/87 (49%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           + + + Y   GVL G +  + GLGGGF++ P+   LGV    +  T++ A+ F+S  S +
Sbjct: 1   MLRYLGYFGVGVLIGALASMFGLGGGFLIVPVLNLLGVEMHHAVGTSSAAVVFTSLSSAI 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG 417
            Y   +R      L     A   A++G
Sbjct: 61  AYQRQRRIHYKAGLLLALTAVPGAYIG 87


>gi|358450715|ref|ZP_09161168.1| hypothetical protein KYE_15477 [Marinobacter manganoxydans MnI7-9]
 gi|357225091|gb|EHJ03603.1| hypothetical protein KYE_15477 [Marinobacter manganoxydans MnI7-9]
          Length = 248

 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 1/90 (1%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           +F++++  A G +AG +  +L  GG  +  PL + LG+PPQV++ T   A+T  S  +V 
Sbjct: 3   LFEILALLALGGIAGFIN-VLSAGGSMLTLPLLMFLGLPPQVANGTNRVAITLQSITAVG 61

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
            +Y +    +  +L+    A + + +G  V
Sbjct: 62  SFYRMGHGNLMVSLHLAVPAILGSLLGAWV 91


>gi|327398711|ref|YP_004339580.1| hypothetical protein Hipma_0550 [Hippea maritima DSM 10411]
 gi|327181340|gb|AEA33521.1| protein of unknown function DUF81 [Hippea maritima DSM 10411]
          Length = 259

 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 51/92 (55%), Gaps = 3/92 (3%)

Query: 298 IPVSLVVSLYEAISLYKGRRV-IASKGDDGKSFRVFQLVS-YCAF-GVLAGIVGGLLGLG 354
           IP  +V+ L+    L    +V I++K  D  +   F+  + Y    G+L G V G+LG+G
Sbjct: 99  IPTRIVLILFIIAVLTAAVKVFISTKAPDEDNLIEFKKRAIYGGLSGLLIGFVAGMLGIG 158

Query: 355 GGFIMGPLFLELGVPPQVSSATATFAMTFSSS 386
           GGF+  P+ + +G     ++AT  + +TFSS+
Sbjct: 159 GGFLAAPILMSMGYGAIRAAATTAYIVTFSSA 190


>gi|420198663|ref|ZP_14704355.1| hypothetical protein HMPREF9980_00100 [Staphylococcus epidermidis
           NIHLM031]
 gi|394273839|gb|EJE18266.1| hypothetical protein HMPREF9980_00100 [Staphylococcus epidermidis
           NIHLM031]
          Length = 254

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 52/98 (53%), Gaps = 9/98 (9%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            AFG +A  V  ++G GGG I  P  L +G+PP V+  T  FA +FS+  S +++    +
Sbjct: 12  VAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLTSALKFLRSGK 70

Query: 398 FPVPYA-----LYFVAV---ATIAAFVGQHVVRKLIIL 427
             +  A     L FVA    A IA ++   +++ LII+
Sbjct: 71  VDLKIAGKMFPLIFVASGGGAIIATYIPAIILKPLIII 108


>gi|223478732|ref|YP_002582987.1| hypothetical protein [Thermococcus sp. AM4]
 gi|214033958|gb|EEB74784.1| conserved hypothetical protein [Thermococcus sp. AM4]
          Length = 243

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 41/65 (63%), Gaps = 1/65 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
           G+L+G+V GLLG+ GG +  PLF   +G+P + +  T++FA+ F++     E+Y L +  
Sbjct: 137 GILSGLVSGLLGISGGVLNVPLFHTFVGIPMRYAVGTSSFALFFTALAGAFEHYRLGQVD 196

Query: 400 VPYAL 404
           +  AL
Sbjct: 197 LHMAL 201


>gi|379003676|ref|YP_005259348.1| putative permease [Pyrobaculum oguniense TE7]
 gi|375159129|gb|AFA38741.1| putative permease [Pyrobaculum oguniense TE7]
          Length = 240

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 6/64 (9%)

Query: 72  PELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGA 131
           P    G+ +VL       G    S+ GVGGG +FVP L L+ G DPK A A+S  +I+  
Sbjct: 119 PAPALGYALVL------LGGFVSSLFGVGGGTVFVPALVLVAGLDPKLAAAMSMGIILPT 172

Query: 132 AVST 135
           A+++
Sbjct: 173 ALAS 176


>gi|212223721|ref|YP_002306957.1| membrane protein [Thermococcus onnurineus NA1]
 gi|212008678|gb|ACJ16060.1| hypothetical membrane protein, conserved [Thermococcus onnurineus
           NA1]
          Length = 243

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/119 (26%), Positives = 60/119 (50%), Gaps = 7/119 (5%)

Query: 307 YEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE- 365
           Y +  +++GR     +  + K+ R+         GV +G++ GLLG+ GG +  PLF   
Sbjct: 109 YLSFRMWRGRDTFEFEEKEIKTSRI------SVVGVASGLMSGLLGISGGILNVPLFHAY 162

Query: 366 LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           +G+P + +  T++ A+ F++    + +Y L +  V  AL       + A +G  +V +L
Sbjct: 163 VGIPMRYAVGTSSLALFFTALAGSLGHYRLGQVDVHTALLLAPGLLVGARIGALLVHRL 221


>gi|442317193|ref|YP_007357214.1| hypothetical protein MYSTI_00171 [Myxococcus stipitatus DSM 14675]
 gi|441484835|gb|AGC41530.1| hypothetical protein MYSTI_00171 [Myxococcus stipitatus DSM 14675]
          Length = 300

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 4/99 (4%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           GV  GI+ GL+G GGGF++ P    +G+P  V++AT+   +    +  +V +  L    +
Sbjct: 145 GVGVGILSGLVGAGGGFLIVPALAMVGLPTPVATATSLVVIALQCAAGLVGH--LGHLDL 202

Query: 401 PYAL--YFVAVATIAAFVGQHVVRKLIILLGRASLIIFI 437
           P+ L    +AVA + +FVG  +  ++   L R    +F+
Sbjct: 203 PWLLTAQVIAVALVGSFVGGRLAGRVAPGLLRKGFAVFV 241


>gi|326332008|ref|ZP_08198293.1| putative membrane protein [Nocardioidaceae bacterium Broad-1]
 gi|325950146|gb|EGD42201.1| putative membrane protein [Nocardioidaceae bacterium Broad-1]
          Length = 311

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 5/93 (5%)

Query: 333 QLVSYCAFGVLA-----GIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSM 387
            L++  AF VL      GIV GL G+GGG +M P  + LGVPP  + A    A + + S+
Sbjct: 4   DLLTQDAFSVLVVSFGIGIVVGLTGMGGGALMTPALIFLGVPPTTAVANDLVAASVNKSV 63

Query: 388 SVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
               ++      +  A Y +  +   AF+G  V
Sbjct: 64  GAAVHWRHGSPNIRLATYLIIGSVPCAFLGTFV 96


>gi|260430369|ref|ZP_05784342.1| membrane protein [Citreicella sp. SE45]
 gi|260418398|gb|EEX11655.1| membrane protein [Citreicella sp. SE45]
          Length = 301

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G + GI+ GL G+GGGFI+ PL   +G+PP V+ +T    +  SS    + ++  K   +
Sbjct: 21  GGVVGIMSGLFGVGGGFILTPLLFFIGIPPAVAVSTQAVQIVASSCSGALAHWRRKTLDI 80


>gi|259415120|ref|ZP_05739042.1| membrane protein [Silicibacter sp. TrichCH4B]
 gi|259349030|gb|EEW60784.1| membrane protein [Silicibacter sp. TrichCH4B]
          Length = 305

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 37/58 (63%), Gaps = 1/58 (1%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            G + G++ G+ G+GGGF++ PL   +G+PP V+ AT    +  +SS+S V  +L +R
Sbjct: 20  LGGMVGVLSGMFGVGGGFLLTPLLFFIGIPPAVAVATGANQIV-ASSVSGVLAHLRRR 76


>gi|145538953|ref|XP_001455171.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422970|emb|CAK87774.1| unnamed protein product [Paramecium tetraurelia]
          Length = 406

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 134/324 (41%), Gaps = 39/324 (12%)

Query: 128 IMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIF-ADWMVTVL 186
           I GAA+       +   P    P++    A++  P +++G  +G+  N I  A +++++L
Sbjct: 84  IFGAALGNFLNLAQQLDPQTKTPVVKIRSAIVACPAMIIGSMVGLILNKILPAFFLISIL 143

Query: 187 LIVLFIGTSTKAFLKGVETWKRETILKKEAARCLGSNGAGAGEVEYKSLPSGPRSGPQKD 246
              LF+ +  K +   ++ WK E        R L  N      VE   L     +  Q++
Sbjct: 144 QYFLFL-SCQKFYKTALKEWKNEN------QRKLTQN-----SVEIPIL-----NKLQEE 186

Query: 247 AMEPEVTILENINW-KELGLLVFVWVAFLGLQIAKN-----HTASCSIVYWVLDLLQIPV 300
                 ++ ENI+    +G++V   +    L+ + N       + C   YW++ L     
Sbjct: 187 QTLQGASLKENIDLIVAIGVIVVTVLLGFFLRGSPNVESIIGISYCGFFYWIITL----- 241

Query: 301 SLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMG 360
            LVV LY        R     + ++ K  +   L         AGI  G  GLGGG I+ 
Sbjct: 242 GLVVVLYYYFEYIFDRF----QKEELKPLKKECLRD----SFKAGIFNGF-GLGGGMILI 292

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           P++L +G     S+ T++F +  SS     +  +L       A     +  +  ++    
Sbjct: 293 PMYLGMGFTTIQSTGTSSFNVLLSSFQVAAQVIILGYMGTLQAFSLFTMTVVGCYLSSSY 352

Query: 421 VRKLIILLGRASLIIF-ILAFMIF 443
           +   +    R SLI++ ++ F+IF
Sbjct: 353 IFNNLKKRDRLSLIMWGLVGFVIF 376


>gi|422350303|ref|ZP_16431189.1| hypothetical protein HMPREF9465_02079 [Sutterella wadsworthensis
           2_1_59BFAA]
 gi|404657451|gb|EKB30342.1| hypothetical protein HMPREF9465_02079 [Sutterella wadsworthensis
           2_1_59BFAA]
          Length = 266

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           C+F V  G +  +LG+GGG +  P+ +  LG PP V+ AT+TF +  S+++ VV + LL 
Sbjct: 154 CSFFV--GFISSILGIGGGVVHVPMMVFLLGFPPLVAVATSTFVLMVSAAIGVVGHALLA 211

Query: 397 RFPVPYALYFVAVATIAAFVGQHVVRK 423
                 A+     A + A +G  + RK
Sbjct: 212 HIVWAPAVAVGCGAIVGAQLGARLARK 238



 Score = 38.9 bits (89), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 8/88 (9%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSS--------ATATFAMTFSSSMSVVEY 392
           G+  G  G L+G+GGG IM PLF+   +PP  S+         T+ F +  ++      Y
Sbjct: 11  GIGVGSFGALVGIGGGLIMVPLFMYFMMPPSGSTFANVQEVVGTSLFGVLLNALSGTWAY 70

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHV 420
           +  KR  +  A+ F       AF+G +V
Sbjct: 71  FRQKRIILSVAMPFALATVPGAFLGSYV 98


>gi|289550530|ref|YP_003471434.1| hypothetical protein SLGD_01216 [Staphylococcus lugdunensis
           HKU09-01]
 gi|315658024|ref|ZP_07910897.1| membrane protein [Staphylococcus lugdunensis M23590]
 gi|385784159|ref|YP_005760332.1| hypothetical protein SLUG_12140 [Staphylococcus lugdunensis
           N920143]
 gi|418413835|ref|ZP_12987051.1| hypothetical protein HMPREF9308_00216 [Staphylococcus lugdunensis
           ACS-027-V-Sch2]
 gi|418636572|ref|ZP_13198918.1| sulfite exporter TauE/SafE [Staphylococcus lugdunensis VCU139]
 gi|289180062|gb|ADC87307.1| hypothetical protein SLGD_01216 [Staphylococcus lugdunensis
           HKU09-01]
 gi|315496914|gb|EFU85236.1| membrane protein [Staphylococcus lugdunensis M23590]
 gi|339894415|emb|CCB53693.1| putative membrane protein [Staphylococcus lugdunensis N920143]
 gi|374840627|gb|EHS04112.1| sulfite exporter TauE/SafE [Staphylococcus lugdunensis VCU139]
 gi|410877473|gb|EKS25365.1| hypothetical protein HMPREF9308_00216 [Staphylococcus lugdunensis
           ACS-027-V-Sch2]
          Length = 256

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 52/98 (53%), Gaps = 11/98 (11%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY------- 392
           FG LA  +  ++G GGG I  P  L +G+PP ++  T   A +F S  S +++       
Sbjct: 14  FGFLAAFIDAVVG-GGGLISTPALLAIGLPPALALGTNKLASSFGSLTSTIKFIRSGHVD 72

Query: 393 --YLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
              +LK FP  + ++    A++A F+   V++ LII++
Sbjct: 73  LNIVLKLFPFVF-IFAAGGASLAIFLPAEVLKPLIIII 109


>gi|407787737|ref|ZP_11134876.1| hypothetical protein B30_16868 [Celeribacter baekdonensis B30]
 gi|407199016|gb|EKE69040.1| hypothetical protein B30_16868 [Celeribacter baekdonensis B30]
          Length = 302

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 35/58 (60%)

Query: 336 SYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           +    G + G++ G+ G+GGGF++ PL   +G+PP V+ AT+   +  SS   V+ ++
Sbjct: 16  TLLGIGGIVGVLSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIVASSVSGVLAHF 73


>gi|218961280|ref|YP_001741055.1| putative Membrane protein, precursor [Candidatus Cloacamonas
           acidaminovorans]
 gi|167729937|emb|CAO80849.1| putative Membrane protein, precursor [Candidatus Cloacamonas
           acidaminovorans str. Evry]
          Length = 257

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 1/104 (0%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
           F  LAG +  + G GGG I  P +  +G+PP ++  T  F+    +  S   Y+  K   
Sbjct: 17  FIFLAGFIDSIAG-GGGLISLPAYWSVGIPPHLALGTNKFSSCCGTLFSTANYFKAKMID 75

Query: 400 VPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIF 443
           +P AL    +A I +++G     ++   +    LII I A  IF
Sbjct: 76  IPVALVSAGMAIIGSWLGASTALRVSSQVLNYLLIILIPAVAIF 119


>gi|414169802|ref|ZP_11425535.1| hypothetical protein HMPREF9696_03390 [Afipia clevelandensis ATCC
           49720]
 gi|410885534|gb|EKS33349.1| hypothetical protein HMPREF9696_03390 [Afipia clevelandensis ATCC
           49720]
          Length = 308

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 31/54 (57%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
            G   G V G+ G+GGGF+M PL + LG+ P V+ A+ +  M  SS    + Y+
Sbjct: 20  LGAAVGFVSGMFGIGGGFLMTPLLIFLGISPAVAVASVSSHMAASSFSGALSYW 73


>gi|338972859|ref|ZP_08628230.1| protein of unknown function DUF81 [Bradyrhizobiaceae bacterium
           SG-6C]
 gi|338234020|gb|EGP09139.1| protein of unknown function DUF81 [Bradyrhizobiaceae bacterium
           SG-6C]
          Length = 308

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 31/54 (57%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
            G   G V G+ G+GGGF+M PL + LG+ P V+ A+ +  M  SS    + Y+
Sbjct: 20  LGAAVGFVSGMFGIGGGFLMTPLLIFLGISPAVAVASVSSHMAASSFSGALSYW 73


>gi|336423425|ref|ZP_08603555.1| hypothetical protein HMPREF0993_02932 [Lachnospiraceae bacterium
           5_1_57FAA]
 gi|336004677|gb|EGN34737.1| hypothetical protein HMPREF0993_02932 [Lachnospiraceae bacterium
           5_1_57FAA]
          Length = 253

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C    LAG+V  + G GGG I  P +L  G+PP  + AT   +  F ++++   +     
Sbjct: 12  CPLVFLAGMVDAIAG-GGGLISLPAYLIAGLPPHAAVATNKMSSPFGTALATYRFARNHL 70

Query: 398 FPVPYALYFVAVATIAAFVGQHV 420
             V  A+  V  A I +F+G HV
Sbjct: 71  INVRLAVPSVMAAIIGSFIGSHV 93


>gi|77360916|ref|YP_340491.1| hypothetical protein PSHAa1992 [Pseudoalteromonas haloplanktis
           TAC125]
 gi|76875827|emb|CAI87048.1| conserved protein of unknown functions similar to YfcA
           [Pseudoalteromonas haloplanktis TAC125]
          Length = 258

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           CA  + AG +  + G GGG +  P  L  G+PP ++  T   A +F S  +   YY    
Sbjct: 15  CAVALAAGFIDAIAG-GGGMLTVPALLTAGLPPHLTLGTNKLAASFGSLTASYTYYKKDL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   + L  +    I A +G  +V  L I
Sbjct: 74  FNPKFWLASIFATAIGALIGTLIVDHLSI 102


>gi|154686411|ref|YP_001421572.1| hypothetical protein RBAM_019790 [Bacillus amyloliquefaciens FZB42]
 gi|154352262|gb|ABS74341.1| YrkJ1 [Bacillus amyloliquefaciens FZB42]
          Length = 175

 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 63/107 (58%), Gaps = 8/107 (7%)

Query: 345 GIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSM-SVVEYYLLKRFPVPY 402
           G+  G++G GG F++ PL L  L +P +++ AT+  A+TF SS+ + V   ++ + P   
Sbjct: 69  GLASGIVGAGGAFLLFPLMLVVLKIPTKITIATS-LAVTFISSIGTTVTKLVIGQVPFMP 127

Query: 403 ALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
           AL  VA + IA+ +G  + +K+      A+++  IL F+I ++ + +
Sbjct: 128 ALVIVAASLIASPIGVLIGKKM-----NANILQGILIFVILITTVKI 169


>gi|456352291|dbj|BAM86736.1| permease [Agromonas oligotrophica S58]
          Length = 308

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 60/115 (52%), Gaps = 4/115 (3%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ +  +  SS    + Y+  + 
Sbjct: 18  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 77

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGV 452
                AL  +   T+   +G      L+  LG+  L+I  L++++ +S  S+GGV
Sbjct: 78  IDPSLALVLMIGGTLGTALGVSTF-TLLRSLGQLDLMI-ALSYVVLLS--SVGGV 128


>gi|300856495|ref|YP_003781479.1| permease [Clostridium ljungdahlii DSM 13528]
 gi|300436610|gb|ADK16377.1| putative permease [Clostridium ljungdahlii DSM 13528]
          Length = 278

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 54/96 (56%), Gaps = 9/96 (9%)

Query: 338 CAFGVL--AGIVGGLLGLGGG-FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL 394
             FGV+  AGI+ GLLG+G G F +  + L + +P +VSSAT+ F M  +++ S    YL
Sbjct: 161 SGFGVMYGAGIISGLLGIGSGSFKVMAMDLFMKLPLKVSSATSNFMMGVTAAAS-AGVYL 219

Query: 395 LK-----RFPVPYALYFVAVATIAAFVGQHVVRKLI 425
           L+     +   P AL  +A AT+   + Q++  K I
Sbjct: 220 LRGNIDPKISAPVALGVLAGATMGTRIMQNMKSKTI 255


>gi|375150166|ref|YP_005012607.1| siroheme synthase [Niastella koreensis GR20-10]
 gi|361064212|gb|AEW03204.1| siroheme synthase [Niastella koreensis GR20-10]
          Length = 503

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 2/87 (2%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGV-PPQVSSATATFAMTFSSSMSVVEYYLLK 396
           C  G LA IV G +G+G G +   + L LG+ PP +SS+  T A TF+S ++   +Y   
Sbjct: 263 CLTGFLAQIVAGSMGMGYGVMCTTVLLTLGISPPAISSSIHT-AETFTSGVTAYSHYKYG 321

Query: 397 RFPVPYALYFVAVATIAAFVGQHVVRK 423
                 A   V    I A VG  ++ K
Sbjct: 322 NVNAKLAKSLVLPGVIGAVVGSLLLAK 348


>gi|406666360|ref|ZP_11074128.1| hypothetical protein B857_01928 [Bacillus isronensis B3W22]
 gi|405385899|gb|EKB45330.1| hypothetical protein B857_01928 [Bacillus isronensis B3W22]
          Length = 255

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 1/86 (1%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G+LA IVG + G   GF++ P  L +G+P   + A   FA   SS  +V+   L KR 
Sbjct: 18  AIGILASIVGVMFG-AAGFVLLPSLLLVGIPIHATVAINKFATGISSFSTVIVLTLKKRV 76

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKL 424
            +   + F+ +A I    G  +  +L
Sbjct: 77  QLKKMIPFMLIAAIGGISGAFLATRL 102


>gi|347529542|ref|YP_004836290.1| hypothetical protein SLG_31580 [Sphingobium sp. SYK-6]
 gi|345138224|dbj|BAK67833.1| conserved hypothetical membrane protein [Sphingobium sp. SYK-6]
          Length = 304

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 40/76 (52%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVA 408
           G+ G+GGGF+  PL +  G+PP V++A+A   +  +S  SV+ +   K   +P     V 
Sbjct: 29  GMFGVGGGFLTTPLLIFYGIPPTVAAASAATQVMGASVSSVMSHLGRKTVDLPMGGVMVV 88

Query: 409 VATIAAFVGQHVVRKL 424
              I A +G  + R L
Sbjct: 89  GGIIGAAIGTVIFRIL 104


>gi|392955479|ref|ZP_10321010.1| putative permease with tauE sulfite export domain [Bacillus
           macauensis ZFHKF-1]
 gi|391878406|gb|EIT86995.1| putative permease with tauE sulfite export domain [Bacillus
           macauensis ZFHKF-1]
          Length = 290

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 57/125 (45%), Gaps = 8/125 (6%)

Query: 255 LENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSL-YEAISLY 313
           L+NI WK    L    V  L  Q A       S  +W  D + IPV  +V + Y  + L 
Sbjct: 63  LKNIYWKTAITLGISGV--LATQAANPLVVYMSNHHW--DKIVIPVLYIVLMGYFCVQLL 118

Query: 314 KGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQV 372
             RR    + D G +   F L      G ++G +   LG+GGGF+M PL +  L + P+ 
Sbjct: 119 VKRR--KKEQDRGATRHAFSLPKTLLIGFVSGFLSSTLGVGGGFVMVPLMISILKIEPRK 176

Query: 373 SSATA 377
           +  T+
Sbjct: 177 AVGTS 181


>gi|374299305|ref|YP_005050944.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
 gi|332552241|gb|EGJ49285.1| protein of unknown function DUF81 [Desulfovibrio africanus str.
           Walvis Bay]
          Length = 423

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 44/84 (52%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG++ G +G GGGFI+ P  +  G+   ++  T  F +   + M    +  L    V
Sbjct: 81  GLIAGLITGCIGAGGGFIITPALMAAGIKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSV 140

Query: 401 PYALYFVAVATIAAFVGQHVVRKL 424
             AL F+A A+I  F+G  + + L
Sbjct: 141 KLALGFLAGASIGTFIGGAINKGL 164


>gi|315230494|ref|YP_004070930.1| hypothetical protein TERMP_00730 [Thermococcus barophilus MP]
 gi|315183522|gb|ADT83707.1| hypothetical protein TERMP_00730 [Thermococcus barophilus MP]
          Length = 251

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 34/55 (61%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLL 395
           GV++G   GLLG+ GG +  PLF  LG+P   +  T++ A+ F++    V +Y+L
Sbjct: 138 GVVSGFSSGLLGISGGILNVPLFYSLGLPIHYAIGTSSVALFFTALAGTVGHYIL 192


>gi|145589361|ref|YP_001155958.1| hypothetical protein Pnuc_1178 [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
 gi|145047767|gb|ABP34394.1| protein of unknown function DUF81 [Polynucleobacter necessarius
           subsp. asymbioticus QLW-P1DMWA-1]
          Length = 272

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 62/132 (46%), Gaps = 21/132 (15%)

Query: 306 LYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE 365
           LY +  +   R   A++   G +F +F      AFGVL G++  LLG GG FI  P  + 
Sbjct: 119 LYSSTQMILNRSPHAARALPG-TFGLF------AFGVLTGVIASLLGAGGAFITVPFMIW 171

Query: 366 LGVPPQVSSATAT---FAMTFSSSM----------SVVEYYLLKRFPVPYALYFVAVATI 412
             + P  + A ++   F +  SS++          S+ EY L   + VP  L  V  + +
Sbjct: 172 CNIKPHNAMANSSGLGFPIAASSTLGYIYGGWGNASLPEYSLGYVY-VPALLCIVITSML 230

Query: 413 AAFVGQHVVRKL 424
           AA +G  VV KL
Sbjct: 231 AAPLGARVVSKL 242


>gi|414153131|ref|ZP_11409458.1| conserved membrane hypothetical protein [Desulfotomaculum
           hydrothermale Lam5 = DSM 18033]
 gi|411455513|emb|CCO07360.1| conserved membrane hypothetical protein [Desulfotomaculum
           hydrothermale Lam5 = DSM 18033]
          Length = 428

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 43/86 (50%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A GV AG++ G +G GGGF++ P  + LGV   ++  T  F +   + M  V +  L   
Sbjct: 79  AVGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMGTVIHKKLGNV 138

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKL 424
            VP A+ F+  + +    G  + R L
Sbjct: 139 NVPLAIAFLVGSGLGVTAGGTLNRAL 164


>gi|260431501|ref|ZP_05785472.1| membrane protein [Silicibacter lacuscaerulensis ITI-1157]
 gi|260415329|gb|EEX08588.1| membrane protein [Silicibacter lacuscaerulensis ITI-1157]
          Length = 307

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G+  G + G+ G+GGGF++ PL   +G+PP V+ AT+      +SS S V  +L +R
Sbjct: 21  GIAVGFLSGMFGVGGGFLITPLLFLIGIPPAVAVATSA-NQVVASSFSGVLAHLKRR 76


>gi|336311114|ref|ZP_08566081.1| hypothetical protein SOHN41_01564 [Shewanella sp. HN-41]
 gi|335865330|gb|EGM70356.1| hypothetical protein SOHN41_01564 [Shewanella sp. HN-41]
          Length = 257

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG +  ++G GGG +  P  L LG+ P     T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAAWTYYRQHLFKP 75

Query: 401 PYALYFVAVATIAAFVGQHVVRKLIILL 428
                F  +A IA FVG  +   L+ LL
Sbjct: 76  A----FWYIAFIATFVGAALGSLLVYLL 99


>gi|223042876|ref|ZP_03612924.1| putative membrane protein [Staphylococcus capitis SK14]
 gi|417905704|ref|ZP_12549503.1| putative membrane protein [Staphylococcus capitis VCU116]
 gi|222443730|gb|EEE49827.1| putative membrane protein [Staphylococcus capitis SK14]
 gi|341598598|gb|EGS41098.1| putative membrane protein [Staphylococcus capitis VCU116]
          Length = 275

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 4/103 (3%)

Query: 325 DGKSFRVFQLVSYCAF--GVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAM 381
           +GK++  + +  + AF   +  GI+ GL G+GGG +M PL L     PP V+  T+   +
Sbjct: 148 EGKTYH-YSVPPFFAFVATLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            FSS MS + + +       Y++  +  + I A +G  V R L
Sbjct: 207 FFSSVMSSIGHIVQGHVAWGYSVALIISSIIGAQIGVKVNRSL 249


>gi|393199026|ref|YP_006460868.1| permease [Solibacillus silvestris StLB046]
 gi|327438357|dbj|BAK14722.1| predicted permease [Solibacillus silvestris StLB046]
          Length = 255

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 1/86 (1%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G+LA IVG + G   GF++ P  L +G+P   + A   FA   SS  +V+   L KR 
Sbjct: 18  AIGILASIVGVMFG-AAGFVLLPSLLLVGIPIHATVAINKFATGISSFSTVIVLTLKKRV 76

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKL 424
            +   + F+ +A I    G  +  +L
Sbjct: 77  QLKKMIPFMLIAAIGGISGAFLATRL 102


>gi|443327177|ref|ZP_21055809.1| putative permease [Xenococcus sp. PCC 7305]
 gi|442793208|gb|ELS02663.1| putative permease [Xenococcus sp. PCC 7305]
          Length = 270

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 44/83 (53%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G  +GI+ G LG+GGG ++ PL + LG  P  + AT+  A+T ++    ++ + +    +
Sbjct: 13  GFFSGILAGFLGIGGGTVLVPLLVALGYEPIQAVATSALAITITALSGTLQNWRMGYIKL 72

Query: 401 PYALYFVAVATIAAFVGQHVVRK 423
              LY    A +AA VG +   K
Sbjct: 73  QSILYLGLPALLAAQVGVYFADK 95


>gi|197106230|ref|YP_002131607.1| hypothetical protein PHZ_c2769 [Phenylobacterium zucineum HLK1]
 gi|196479650|gb|ACG79178.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1]
          Length = 269

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 58/107 (54%), Gaps = 4/107 (3%)

Query: 359 MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQ 418
           M P+ + LG+PP V+ A+ +  +  SS  SV+ Y   +   +       A   + AF+G 
Sbjct: 1   MTPMLVFLGIPPAVAVASMSNHVAASSMSSVIAYGRRRAVDLRMGGVLAAGGVVGAFLGV 60

Query: 419 HVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFHRHE 465
            + R L+ LLG+A L++ + ++++F+S I  GG+ ++  +G   R  
Sbjct: 61  ELFR-LLRLLGQADLVVSV-SYLVFLSII--GGLMLTESLGAILRRR 103


>gi|27376265|ref|NP_767794.1| hypothetical protein bll1154 [Bradyrhizobium japonicum USDA 110]
 gi|27349405|dbj|BAC46419.1| bll1154 [Bradyrhizobium japonicum USDA 110]
          Length = 380

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+    +  SS    + Y+
Sbjct: 94  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGAISYW 148


>gi|389774756|ref|ZP_10192875.1| hypothetical protein UU7_03077 [Rhodanobacter spathiphylli B39]
 gi|388438355|gb|EIL95110.1| hypothetical protein UU7_03077 [Rhodanobacter spathiphylli B39]
          Length = 250

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 4/92 (4%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           + +++ A G LA ++ G LG+  G     + L LG+PP  +SA+  +A TF+   S + +
Sbjct: 6   EFITFAAVGALAQLIDGALGMAYGITAASMLLGLGLPPVAASASVHYAETFTCGASGLSH 65

Query: 393 YLL----KRFPVPYALYFVAVATIAAFVGQHV 420
            +     +R     A+  VA A I A V  HV
Sbjct: 66  LVAGNVRRRLFWALAMPGVAGAVIGALVLSHV 97


>gi|367478122|ref|ZP_09477444.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 285]
 gi|365269682|emb|CCD89912.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 285]
          Length = 331

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 60/115 (52%), Gaps = 4/115 (3%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ +  +  SS    + Y+  + 
Sbjct: 41  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 100

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGV 452
                AL  +   T+   +G      L+  LG+  L+I  L++++ ++  S+GGV
Sbjct: 101 IDPSLALVLMIGGTLGTALGVATF-TLLRSLGQLDLMI-ALSYVVLLT--SVGGV 151


>gi|256828130|ref|YP_003156858.1| hypothetical protein Dbac_0315 [Desulfomicrobium baculatum DSM
           4028]
 gi|256577306|gb|ACU88442.1| protein of unknown function DUF81 [Desulfomicrobium baculatum DSM
           4028]
          Length = 415

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 41/84 (48%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           GV AG++ G +G GGGFI+ P  +  GV   ++  T  F +   + M  V +  +    V
Sbjct: 78  GVGAGLISGCIGAGGGFIIAPALMSAGVKGILAVGTDLFHIFAKAIMGSVLHRKMGNVSV 137

Query: 401 PYALYFVAVATIAAFVGQHVVRKL 424
           P A  F+  A I   VG  + R L
Sbjct: 138 PLAFVFLIGAIIGTTVGAGINRAL 161


>gi|119485715|ref|ZP_01619990.1| hypothetical protein L8106_25070 [Lyngbya sp. PCC 8106]
 gi|119457040|gb|EAW38167.1| hypothetical protein L8106_25070 [Lyngbya sp. PCC 8106]
          Length = 297

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 34/53 (64%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
           +L++    G+ AGI+ G LG+GGG ++ P+ L LG  P  + AT+T ++  +S
Sbjct: 5   ELLTLAVSGIFAGILAGFLGIGGGTVLVPMMLGLGYEPINAVATSTLSIMITS 57


>gi|89099437|ref|ZP_01172313.1| hypothetical protein B14911_19705 [Bacillus sp. NRRL B-14911]
 gi|89085823|gb|EAR64948.1| hypothetical protein B14911_19705 [Bacillus sp. NRRL B-14911]
          Length = 285

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 65/144 (45%), Gaps = 18/144 (12%)

Query: 255 LENINWKELGLLVFVWVAFLGLQIAKNHTASCSIVYWV----LDLLQIPVSLVVSL-YEA 309
           L+NI WKE         A LG+       A+  +V ++    LD   IP   +V L Y A
Sbjct: 64  LKNILWKE--------GAILGVSGMLATQAAKPLVLFLSEKGLDATVIPACYIVLLSYFA 115

Query: 310 ISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGV 368
            ++++  +    +  +G+      L      G   G V   LG+GGGFIM PL +  LG+
Sbjct: 116 FTMFRQGKKTGEQSREGRP----SLAGMLLIGFSGGFVSAALGVGGGFIMVPLSIAFLGL 171

Query: 369 PPQVSSATATFAMTFSSSMSVVEY 392
            P+ +  T+ FA+    S   + Y
Sbjct: 172 QPRKAVGTSLFAVLLIVSTGFLSY 195


>gi|163751894|ref|ZP_02159108.1| hypothetical protein KT99_00715 [Shewanella benthica KT99]
 gi|161328244|gb|EDP99408.1| hypothetical protein KT99_00715 [Shewanella benthica KT99]
          Length = 259

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 2/85 (2%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G+LAG +  + G GGG +  P  L LG+ P  +  T   A  F SS +   YY    F  
Sbjct: 17  GLLAGFIDAIAG-GGGLLSIPALLTLGINPHTALGTNKLAACFGSSTAAYTYYKQNLF-T 74

Query: 401 PYALYFVAVATIAAFVGQHVVRKLI 425
           P+  Y+  +AT    V    +  LI
Sbjct: 75  PHLWYYTFIATFFGAVAGTFIVSLI 99


>gi|429331524|ref|ZP_19212277.1| sodium:dicarboxylate symporter [Pseudomonas putida CSV86]
 gi|428763685|gb|EKX85847.1| sodium:dicarboxylate symporter [Pseudomonas putida CSV86]
          Length = 462

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 50/164 (30%), Positives = 75/164 (45%), Gaps = 20/164 (12%)

Query: 267 VFVWVAFLG---LQIAKNHTASCSIVYWVLDLLQIPVSLVVSLYEAISLY-----KGRRV 318
           V ++ AFLG   LQ+ K+     S     +D LQ  V+ +V L   ++ Y       + V
Sbjct: 189 VVIFAAFLGIAALQLLKDDAEKGSRALAAIDTLQAWVTRLVRLVMKLTPYGVLALMAKVV 248

Query: 319 IASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATAT 378
            +S  DD      F +VSY    ++ G+ G LLGL G   + PL     V P +     T
Sbjct: 249 ASSNLDDIIKLGSFVVVSYLGLALMFGVHGVLLGLAG---INPLRFLRKVWPVL-----T 300

Query: 379 FAMTFSSSMSVVEYYL---LKRFPVPYALYFVAVATIAAFVGQH 419
           FA T  SS + +   +    +R  VP A+   A A+  A +GQ+
Sbjct: 301 FAFTSRSSAASIPLSIEAQTRRLGVPQAIASFA-ASFGATIGQN 343


>gi|338211621|ref|YP_004655674.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336305440|gb|AEI48542.1| protein of unknown function DUF81 [Runella slithyformis DSM 19594]
          Length = 301

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 33/65 (50%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
            LV Y   G+ A +V G LG+G G       L LGVPP VSS +   +  F++  S + +
Sbjct: 58  DLVFYLLVGIGAQLVDGALGMGYGATATSFLLTLGVPPAVSSTSVHLSEMFTTGASAISH 117

Query: 393 YLLKR 397
           +  K 
Sbjct: 118 FKFKN 122


>gi|257062790|ref|YP_003142462.1| permease [Slackia heliotrinireducens DSM 20476]
 gi|256790443|gb|ACV21113.1| predicted permease [Slackia heliotrinireducens DSM 20476]
          Length = 277

 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 36/53 (67%), Gaps = 1/53 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLF-LELGVPPQVSSATATFAMTFSSSMSVVEY 392
           G+L G++ GL G+GGG IM PLF L  G+PP  ++AT+ F +  +S   +V++
Sbjct: 11  GLLVGVLSGLFGIGGGTIMVPLFRLAFGLPPIGATATSLFTVIPTSLSGLVKH 63


>gi|314933135|ref|ZP_07840500.1| putative membrane protein [Staphylococcus caprae C87]
 gi|313653285|gb|EFS17042.1| putative membrane protein [Staphylococcus caprae C87]
          Length = 275

 Score = 40.4 bits (93), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 1/81 (1%)

Query: 345 GIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA 403
           GI+ GL G+GGG +M PL L     PP V+  T+   + FSS MS V + +       Y+
Sbjct: 169 GILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIFFSSVMSSVGHIVQGHVAWGYS 228

Query: 404 LYFVAVATIAAFVGQHVVRKL 424
           +  +  + I A +G  V R L
Sbjct: 229 VALIISSIIGAQIGVKVNRSL 249


>gi|359449450|ref|ZP_09238943.1| UPF0721 transmembrane protein ORF9 [Pseudoalteromonas sp. BSi20480]
 gi|358044760|dbj|GAA75192.1| UPF0721 transmembrane protein ORF9 [Pseudoalteromonas sp. BSi20480]
          Length = 258

 Score = 40.4 bits (93), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           CA  + AG +  + G GGG +  P  L  G+PP ++  T   A +F S  +   YY    
Sbjct: 15  CAVALAAGFIDAIAG-GGGMLTVPALLTAGLPPHLTLGTNKLAASFGSLTASFTYYKKNL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   + L  +    I A +G  +V  L I
Sbjct: 74  FSPKFWLASIIATAIGALLGTVIVDYLSI 102


>gi|357013613|ref|ZP_09078612.1| hypothetical protein PelgB_29387 [Paenibacillus elgii B69]
          Length = 272

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 14/104 (13%)

Query: 53  LKAVNFLWQPE-------GSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIF 105
           LK +N  W  +       G+ YH+ +  L     +++G ++GF    FG    +GGG +F
Sbjct: 126 LKPLNVKWHMQRTYTDSSGNTYHYGYSILP---ALIIGLLVGFISGLFG----IGGGSLF 178

Query: 106 VPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDM 149
           VP + L+  + P  ATA S  +I+ +++   + +  L    L M
Sbjct: 179 VPAMVLLFQYPPHVATATSMFVILLSSLMGSFTHFSLGEVNLWM 222



 Score = 38.1 bits (87), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 1/85 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
           G+L G + GL G+GGG +  P + L    PP V++AT+ F +  SS M    ++ L    
Sbjct: 160 GLLVGFISGLFGIGGGSLFVPAMVLLFQYPPHVATATSMFVILLSSLMGSFTHFSLGEVN 219

Query: 400 VPYALYFVAVATIAAFVGQHVVRKL 424
           +   L     A +  ++G  +  +L
Sbjct: 220 LWMVLGLAPSAIVGGWLGAKIASRL 244


>gi|291277368|ref|YP_003517140.1| hypothetical protein HMU11600 [Helicobacter mustelae 12198]
 gi|290964562|emb|CBG40415.1| putative inner membrane protein [Helicobacter mustelae 12198]
          Length = 272

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSM-SVVEY 392
           Y   GV +GI  GL GLGGG ++ P+   LG     + A + F M F+S+  S++ Y
Sbjct: 5   YMLVGVFSGITSGLFGLGGGTVIVPVMTSLGFSMHHAVAISVFQMIFASTFGSIINY 61


>gi|392538693|ref|ZP_10285830.1| hypothetical protein Pmarm_11210 [Pseudoalteromonas marina mano4]
          Length = 258

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           CA  + AG +  + G GGG +  P  L  G+PP ++  T   A +F S  +   YY    
Sbjct: 15  CAVALAAGFIDAIAG-GGGMLTVPALLTAGLPPHLTLGTNKLAASFGSLTASFTYYKKNL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   + L  +    I A +G  +V  L I
Sbjct: 74  FSPKFWLASIIATAIGALLGTVIVDYLSI 102


>gi|452965232|gb|EME70258.1| permease [Magnetospirillum sp. SO-1]
          Length = 301

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 42/74 (56%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           ++S  A G   G++ G+ G+GGGF++ PL + LG+PP V+ A+    +  SS   V  ++
Sbjct: 15  ILSLLAVGGGIGVLSGIFGVGGGFLLTPLLIMLGIPPAVAVASGANQVLGSSVSGVFAHW 74

Query: 394 LLKRFPVPYALYFV 407
             +   +  A++ +
Sbjct: 75  RRRNVDIRMAVFLL 88


>gi|119471108|ref|ZP_01613640.1| hypothetical protein ATW7_09031 [Alteromonadales bacterium TW-7]
 gi|119445764|gb|EAW27046.1| hypothetical protein ATW7_09031 [Alteromonadales bacterium TW-7]
          Length = 258

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           CA  + AG +  + G GGG +  P  L  G+PP ++  T   A +F S  +   YY    
Sbjct: 15  CAVALAAGFIDAIAG-GGGMLTVPALLTAGLPPHLTLGTNKLAASFGSLTASFTYYKKNL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   + L  +    I A +G  +V  L I
Sbjct: 74  FSPKFWLASIIATAIGALLGTVIVDYLSI 102


>gi|86159865|ref|YP_466650.1| hypothetical protein Adeh_3446 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85776376|gb|ABC83213.1| protein of unknown function DUF81 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 255

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 49/87 (56%), Gaps = 6/87 (6%)

Query: 299 PVSLVVSLYEAISLYKG-----RRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGL 353
           PV ++ +L+ A+++Y       RR  + +  +G+ + V +  +       AG + GL+G+
Sbjct: 95  PVGVLKALFAAVAVYSALVMWRRRPASPQASEGEPYTVRRWGTGLGASAFAGAISGLIGV 154

Query: 354 GGGFIMGPLF-LELGVPPQVSSATATF 379
           GGGFI  P+  L + +P +V+ AT+ F
Sbjct: 155 GGGFIKVPVMTLAMELPFKVAVATSNF 181


>gi|294141745|ref|YP_003557723.1| hypothetical protein SVI_2974 [Shewanella violacea DSS12]
 gi|293328214|dbj|BAJ02945.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 257

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 17/119 (14%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G+LAG +  + G GGG +  P  L LG+ P  +  T   A  F SS +   YY    F  
Sbjct: 17  GLLAGFIDAIAG-GGGLLSIPALLTLGINPHTALGTNKLAACFGSSTAAYTYYKQNLF-T 74

Query: 401 PYALYFVAVAT---------IAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLG 450
           P   Y   +AT         I   +  H++ KL+       L+I I+A        S+G
Sbjct: 75  PQLWYHTFIATFLGAISGTFIVTLIDSHLLEKLL------PLMIIIIAIYTLAKPNSMG 127


>gi|167045656|gb|ABZ10304.1| putative domain of unknown function DUF81 [uncultured marine
           crenarchaeote HF4000_APKG10L15]
          Length = 251

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 7/74 (9%)

Query: 88  FFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL 147
           FF     S+ G+GGG IFVP++ + +G   K A   S+ ++M A+ S +  +  L HP  
Sbjct: 140 FFAGIISSLFGIGGGLIFVPLMVVALGISMKRAAPTSQFILMFASFSGLIMHSMLGHP-- 197

Query: 148 DMPIIDYDLALLIQ 161
                DY  ALL+ 
Sbjct: 198 -----DYYQALLLS 206


>gi|11499708|ref|NP_070950.1| hypothetical protein AF2125 [Archaeoglobus fulgidus DSM 4304]
 gi|2648399|gb|AAB89123.1| conserved hypothetical transmembrane protein [Archaeoglobus
           fulgidus DSM 4304]
          Length = 325

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 49/95 (51%), Gaps = 4/95 (4%)

Query: 326 GKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
           G+SFRV     +   G L G+   ++G+GGGF+  P    LG+P  V    +T A+ F+ 
Sbjct: 195 GESFRVKNFAPFIV-GFLIGLFSAVIGIGGGFLFVPFLTSLGLPFYVVPGASTLAVFFTQ 253

Query: 386 SMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           + +V+  +L +    P AL     A I  F+G ++
Sbjct: 254 TSTVLG-WLARGVMYPVALIVAGWAGI--FIGSYI 285


>gi|94499052|ref|ZP_01305590.1| hypothetical protein RED65_09699 [Bermanella marisrubri]
 gi|94428684|gb|EAT13656.1| hypothetical protein RED65_09699 [Oceanobacter sp. RED65]
          Length = 256

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 3/81 (3%)

Query: 342 VLAGIVGGLLGL---GGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           +LAG +GG+L     GG FI  P  +  G+PP V++AT TFA +   +  V+ ++   + 
Sbjct: 8   LLAGFIGGVLNTIAGGGSFITFPALMMAGLPPVVANATNTFASSAGYASGVIGFWHEIKT 67

Query: 399 PVPYALYFVAVATIAAFVGQH 419
                LY  AVA + A  G +
Sbjct: 68  LKGTILYIAAVACLGASAGAY 88


>gi|340027139|ref|ZP_08663202.1| hypothetical protein PaTRP_00390 [Paracoccus sp. TRP]
          Length = 307

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 2/86 (2%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSMSVVEYYLLKR-F 398
           G   G++G ++G+GGGFIM P  +  LG+P +V   T+ F +TF S+ + + + +     
Sbjct: 182 GAAVGVLGAIMGVGGGFIMVPAMIYLLGMPTKVVIGTSLFQITFLSAYTTLMHAVSSNTV 241

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKL 424
            V  A+  +    I A VG H+  +L
Sbjct: 242 DVMLAVLLIVGGVIGAQVGTHLGARL 267


>gi|193084351|gb|ACF10008.1| hypothetical protein [uncultured marine crenarchaeote
           SAT1000-49-D2]
          Length = 252

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 7/74 (9%)

Query: 88  FFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL 147
           FF     S+ G+GGG IFVP++ + +G   K A   S+ ++M A+ S +  +  L HP  
Sbjct: 140 FFAGIISSLFGIGGGLIFVPLMVVALGISMKRAAPTSQFILMFASFSGLIVHSMLGHP-- 197

Query: 148 DMPIIDYDLALLIQ 161
                DY  ALL+ 
Sbjct: 198 -----DYYQALLLS 206


>gi|427412871|ref|ZP_18903063.1| hypothetical protein HMPREF9282_00470 [Veillonella ratti
           ACS-216-V-Col6b]
 gi|425715687|gb|EKU78673.1| hypothetical protein HMPREF9282_00470 [Veillonella ratti
           ACS-216-V-Col6b]
          Length = 271

 Score = 40.0 bits (92), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 1/88 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFI-MGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           C   V+ G +  + G+GGG I +  L   LG P  +++AT+ F +  S+ M V  +Y L 
Sbjct: 156 CLISVVVGFISSVFGIGGGLIHVAALVYLLGFPTHIATATSQFILFLSTIMGVTTHYFLG 215

Query: 397 RFPVPYALYFVAVATIAAFVGQHVVRKL 424
                 A+     A I A +G  + ++L
Sbjct: 216 HIQWNIAIACGIGAVIGAQLGAAIAKRL 243


>gi|18976740|ref|NP_578097.1| hypothetical protein PF0368 [Pyrococcus furiosus DSM 3638]
 gi|397650869|ref|YP_006491450.1| hypothetical protein PFC_00925 [Pyrococcus furiosus COM1]
 gi|18892327|gb|AAL80492.1| hypothetical protein PF0368 [Pyrococcus furiosus DSM 3638]
 gi|393188460|gb|AFN03158.1| hypothetical protein PFC_00925 [Pyrococcus furiosus COM1]
          Length = 251

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++ GI+  + GLGGGF++ P    LGV    +  T++ ++ F+S  S + Y   +R   
Sbjct: 8   GIVVGILAAMFGLGGGFLIVPTLNILGVEIHSAIGTSSASIVFTSLSSSIAYSRQRRIHY 67

Query: 401 PYALYFVAVATIAAFVG 417
              +   + A I A++G
Sbjct: 68  RAGILLASTAIIGAYIG 84


>gi|418323949|ref|ZP_12935206.1| sulfite exporter TauE/SafE [Staphylococcus pettenkoferi VCU012]
 gi|365228878|gb|EHM70051.1| sulfite exporter TauE/SafE [Staphylococcus pettenkoferi VCU012]
          Length = 274

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 1/81 (1%)

Query: 345 GIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA 403
           GI+ GL G+GGG +M PL L     PP V+  T+   + FSS MS + + +       Y+
Sbjct: 168 GILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMLMIFFSSLMSSLGHMVQGHVAWGYS 227

Query: 404 LYFVAVATIAAFVGQHVVRKL 424
           L  +  + I A +G  + R +
Sbjct: 228 LVLIISSIIGAQIGVRINRTI 248


>gi|167043268|gb|ABZ07974.1| putative domain of unknown function DUF81 [uncultured marine
           crenarchaeote HF4000_ANIW141M12]
          Length = 251

 Score = 40.0 bits (92), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 7/74 (9%)

Query: 88  FFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTL 147
           FF     S+ G+GGG IFVP++ + +G   K A   S+ ++M A+ S +  +  L HP  
Sbjct: 140 FFAGIISSLFGIGGGLIFVPLMVVALGISMKRAAPTSQFILMFASFSGLIVHSMLGHP-- 197

Query: 148 DMPIIDYDLALLIQ 161
                DY  ALL+ 
Sbjct: 198 -----DYYQALLLS 206


>gi|427430367|ref|ZP_18920221.1| membrane protein, putative [Caenispirillum salinarum AK4]
 gi|425878827|gb|EKV27538.1| membrane protein, putative [Caenispirillum salinarum AK4]
          Length = 307

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 35/56 (62%), Gaps = 2/56 (3%)

Query: 340 FGV--LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           FGV  L G++ G+ G+GGGF++ PL + LG+PP V+  T    +  +S   V+ ++
Sbjct: 18  FGVGGLVGVLSGMFGVGGGFLLTPLLIFLGIPPTVAVGTGANQVVGASVSGVLAHW 73


>gi|404416492|ref|ZP_10998312.1| hypothetical protein SARL_01356 [Staphylococcus arlettae CVD059]
 gi|403491149|gb|EJY96674.1| hypothetical protein SARL_01356 [Staphylococcus arlettae CVD059]
          Length = 297

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 53/112 (47%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           +L+ +   G  A ++ G LG+G G     + L  G  P + SAT  F+   +++ S V +
Sbjct: 3   KLLIFALAGFFAQMIDGSLGMGFGATSSSILLAAGAAPAIVSATIHFSEIATTAASGVSH 62

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           Y  K      A+       IAAF+G  V+  +   L +  + +F+L   +F+
Sbjct: 63  YKFKNVDKRMAIKLAIPGAIAAFIGSAVLSNIHSDLIKPFISLFLLTMGLFI 114


>gi|262276252|ref|ZP_06054061.1| hypothetical protein VHA_003237 [Grimontia hollisae CIP 101886]
 gi|262220060|gb|EEY71376.1| hypothetical protein VHA_003237 [Grimontia hollisae CIP 101886]
          Length = 257

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 6/74 (8%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS 384
           D  +  +  LV++ A G +  + GG     GG +  P  L +G+PP ++  T   + TF+
Sbjct: 6   DPNTIIILGLVAFAA-GFIDAVAGG-----GGMLTVPALLSIGLPPHIALGTNKLSATFA 59

Query: 385 SSMSVVEYYLLKRF 398
           SS + + YY  K F
Sbjct: 60  SSTAALTYYRKKLF 73


>gi|163744253|ref|ZP_02151613.1| integral membrane protein, putative [Oceanibulbus indolifex HEL-45]
 gi|161381071|gb|EDQ05480.1| integral membrane protein, putative [Oceanibulbus indolifex HEL-45]
          Length = 304

 Score = 40.0 bits (92), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 40/62 (64%), Gaps = 2/62 (3%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVA 408
           G+ G+GGGF++ PL   +G+PP V+ AT+T  +  SS  +++ +  LKR  V + + +V 
Sbjct: 29  GMFGVGGGFLITPLLFFVGIPPAVAVATSTNQIVASSFSALLAH--LKRRTVDFRMGWVL 86

Query: 409 VA 410
           +A
Sbjct: 87  LA 88


>gi|387196905|gb|AFJ68782.1| hypothetical protein NGATSA_3034300 [Nannochloropsis gaditana
           CCMP526]
          Length = 278

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 56/99 (56%)

Query: 358 IMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVG 417
           + GPL LE+GV P V++A+A   + ++S+ + V +Y+    P  Y L+F     +   VG
Sbjct: 164 VKGPLMLEMGVLPPVAAASAATMILYTSASATVAFYVFGLIPGDYGLFFFFWGFLCTGVG 223

Query: 418 QHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVGISN 456
           Q ++ +L+    + SLI+  +  +I +SA+ +    +++
Sbjct: 224 QILLLRLLQHSHKQSLIVLSIGLVITLSAVMMSFQAVAD 262


>gi|120599392|ref|YP_963966.1| hypothetical protein Sputw3181_2588 [Shewanella sp. W3-18-1]
 gi|146292612|ref|YP_001183036.1| hypothetical protein Sputcn32_1512 [Shewanella putrefaciens CN-32]
 gi|386313288|ref|YP_006009453.1| hypothetical protein [Shewanella putrefaciens 200]
 gi|120559485|gb|ABM25412.1| protein of unknown function DUF81 [Shewanella sp. W3-18-1]
 gi|145564302|gb|ABP75237.1| protein of unknown function DUF81 [Shewanella putrefaciens CN-32]
 gi|319425913|gb|ADV53987.1| protein of unknown function DUF81 [Shewanella putrefaciens 200]
          Length = 257

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 42/82 (51%), Gaps = 3/82 (3%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG +  ++G GGG +  P  L LG+ P  +  T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAIVG-GGGLLSIPALLTLGIAPHTALGTNKLAASFGSSMAAWTYYRQHLF-T 74

Query: 401 PYALYFVAVAT-IAAFVGQHVV 421
           P   Y   +AT I A +G  +V
Sbjct: 75  PSFWYMAFIATFIGAVLGSVLV 96


>gi|352089809|ref|ZP_08954114.1| protein of unknown function DUF81 [Rhodanobacter sp. 2APBS1]
 gi|389797284|ref|ZP_10200327.1| Sulfate transporter, CysZ-type [Rhodanobacter sp. 116-2]
 gi|351678977|gb|EHA62120.1| protein of unknown function DUF81 [Rhodanobacter sp. 2APBS1]
 gi|388447658|gb|EIM03658.1| Sulfate transporter, CysZ-type [Rhodanobacter sp. 116-2]
          Length = 247

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 48/101 (47%), Gaps = 8/101 (7%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS---- 388
           Q   +   G+LA +V G LG+  G +   + L LG+PP V+SAT   A  F++ +S    
Sbjct: 5   QFFVFAGVGLLAQLVDGALGMAYGLVSNSILLALGLPPAVASATVHTAEVFTTGVSGAAH 64

Query: 389 ----VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
                V + L  +  +P A+  +  AT  A V    +R  +
Sbjct: 65  AWFGNVRWKLFWQLAIPGAIGGILGATFLASVPGEAIRPWV 105


>gi|161528556|ref|YP_001582382.1| hypothetical protein Nmar_1048 [Nitrosopumilus maritimus SCM1]
 gi|160339857|gb|ABX12944.1| protein of unknown function DUF81 [Nitrosopumilus maritimus SCM1]
          Length = 257

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 34/57 (59%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G  AG++G ++GLGGG I+ P+   LG PP  +++ + FA   ++  S + Y   KR
Sbjct: 11  GFAAGVLGSMIGLGGGVIVVPVLTFLGFPPTAAASNSLFAALSNAVASTMSYSKQKR 67


>gi|255534255|ref|YP_003094626.1| hypothetical protein FIC_00088 [Flavobacteriaceae bacterium
           3519-10]
 gi|255340451|gb|ACU06564.1| conserved hypothetical transmembrane protein [Flavobacteriaceae
           bacterium 3519-10]
          Length = 266

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 57/107 (53%), Gaps = 4/107 (3%)

Query: 322 KGDDGKSFRVFQLVSYCAF---GVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATA 377
           + ++    R +  V+Y      G+L GIV GL+G GGGF++ P L + LG+  + + AT+
Sbjct: 129 RKNERPRLRKYDEVNYTILVSQGLLVGIVTGLIGAGGGFLIVPALVMLLGLNMKKAVATS 188

Query: 378 TFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            F +  +S +  +    + +    + L F +++ I  F+G  + +++
Sbjct: 189 LFIIAMNSLIGFLSTMKIVKHDWVFLLSFTSLSVIGIFIGLALSKRM 235


>gi|421872468|ref|ZP_16304086.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
 gi|372458441|emb|CCF13635.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
          Length = 275

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 7/91 (7%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELG-------VPPQVSSATATFAMTFSSSMSVVEYY 393
           G+ AG++G L+G+GGG    P  L  G       + PQ++S T+   +  ++  S + ++
Sbjct: 14  GLFAGVMGSLVGIGGGMFFVPALLYFGNMYEPGSITPQIASGTSLLVIAITALSSSISFW 73

Query: 394 LLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
             K+  V  A+ F   +   A VG +V + L
Sbjct: 74  KQKKVEVQAAVLFFIGSAPGAIVGVYVNKWL 104


>gi|374293444|ref|YP_005040479.1| hypothetical protein AZOLI_3104 [Azospirillum lipoferum 4B]
 gi|357425383|emb|CBS88270.1| conserved membrane protein of unknown function [Azospirillum
           lipoferum 4B]
          Length = 305

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 32/53 (60%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           G L G + G+ G+GGGF+M PL + +GVPP ++  T    +  +S   V+ ++
Sbjct: 21  GWLVGFLSGMFGVGGGFLMTPLLIFIGVPPAIAVGTQANQLVAASVSGVLAHW 73


>gi|84515899|ref|ZP_01003260.1| membrane protein [Loktanella vestfoldensis SKA53]
 gi|84510341|gb|EAQ06797.1| membrane protein [Loktanella vestfoldensis SKA53]
          Length = 305

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 30/45 (66%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           G+ G+GGGF++ PL   +G+PP V+ AT+T  +  SS   V+ ++
Sbjct: 29  GMFGVGGGFLITPLLFFIGIPPAVAVATSTNQIVASSISGVLAHF 73


>gi|197123917|ref|YP_002135868.1| hypothetical protein AnaeK_3526 [Anaeromyxobacter sp. K]
 gi|196173766|gb|ACG74739.1| protein of unknown function DUF81 [Anaeromyxobacter sp. K]
          Length = 255

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 49/87 (56%), Gaps = 6/87 (6%)

Query: 299 PVSLVVSLYEAISLYKG-----RRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGL 353
           PV L+ +L+ A+++Y       RR    +  +G+ + V +  +      +AG + GL+G+
Sbjct: 95  PVGLLKALFAAVAVYSALVMWRRRPAGPQAAEGEPYVVRRWGTGLGASAVAGAISGLIGV 154

Query: 354 GGGFIMGPLF-LELGVPPQVSSATATF 379
           GGGFI  P+  L + +P +V+ AT+ F
Sbjct: 155 GGGFIKVPVMTLAMELPFKVAVATSNF 181


>gi|124028506|ref|YP_001013826.1| hypothetical protein Hbut_1668 [Hyperthermus butylicus DSM 5456]
 gi|123979200|gb|ABM81481.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
          Length = 255

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 45/84 (53%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           GV+AG VG LLG+GGG IM P  + LGVP +V++  +  A+  +S   +   +      V
Sbjct: 12  GVVAGFVGTLLGIGGGAIMVPALVLLGVPVKVAAPASLVAILGTSLGGIRRLFRRGLVDV 71

Query: 401 PYALYFVAVATIAAFVGQHVVRKL 424
             A++    + + A VG   V +L
Sbjct: 72  MLAVFLETASGLGALVGVIAVGRL 95


>gi|339007279|ref|ZP_08639854.1| hypothetical membrane protein [Brevibacillus laterosporus LMG
           15441]
 gi|338776488|gb|EGP36016.1| hypothetical membrane protein [Brevibacillus laterosporus LMG
           15441]
          Length = 275

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 7/91 (7%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELG-------VPPQVSSATATFAMTFSSSMSVVEYY 393
           G+ AG++G L+G+GGG    P  L  G       + PQ++S T+   +  ++  S + ++
Sbjct: 14  GLFAGVMGSLVGIGGGMFFVPALLYFGNMYEPGSITPQIASGTSLLVIAITALSSSISFW 73

Query: 394 LLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
             K+  V  A+ F   +   A VG +V + L
Sbjct: 74  KQKKVEVQAAVLFFIGSAPGAIVGVYVNKWL 104


>gi|417985491|ref|ZP_12626075.1| membrane protein [Lactobacillus casei 32G]
 gi|410528519|gb|EKQ03371.1| membrane protein [Lactobacillus casei 32G]
          Length = 280

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVE 391
           Q++     G+L GI+G +LG+GGG I+ P L + +G+P Q +   +  ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           Y   +   +  A++     T+ A +G  V
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91


>gi|417993330|ref|ZP_12633679.1| membrane protein [Lactobacillus casei CRF28]
 gi|410531802|gb|EKQ06518.1| membrane protein [Lactobacillus casei CRF28]
          Length = 114

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 24/89 (26%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLF-LELGVPPQVSSATATFAMTFSSSMSVVE 391
           Q++     G+L GI+G +LG+GGG I+ P+  + +G+P Q +   +  ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           Y   +   +  A++     T+ A +G  V
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91


>gi|423956077|ref|ZP_17734916.1| sulfite exporter TauE/SafE family protein [Vibrio cholerae HE-40]
 gi|423985147|ref|ZP_17738465.1| sulfite exporter TauE/SafE family protein [Vibrio cholerae HE-46]
 gi|408657933|gb|EKL29008.1| sulfite exporter TauE/SafE family protein [Vibrio cholerae HE-40]
 gi|408664545|gb|EKL35379.1| sulfite exporter TauE/SafE family protein [Vibrio cholerae HE-46]
          Length = 258

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 3/103 (2%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF-PVP 401
           +AG +  + G GGG +  P  L LG+PP ++  T   A TF+S  +   YY  K F P  
Sbjct: 18  IAGFIDAVAG-GGGMLTVPALLSLGLPPHLALGTNKLAATFASLTAAWTYYSQKLFDPAC 76

Query: 402 YALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           +   F+A   I A +G   V  +     R  L + ILA  ++ 
Sbjct: 77  WKRAFIAT-LIGAILGTLAVDAISTDWLRKVLPLIILAAALYT 118


>gi|199597888|ref|ZP_03211313.1| Predicted permease [Lactobacillus rhamnosus HN001]
 gi|258507211|ref|YP_003169962.1| permease [Lactobacillus rhamnosus GG]
 gi|385826935|ref|YP_005864707.1| putative transporter protein [Lactobacillus rhamnosus GG]
 gi|199591145|gb|EDY99226.1| Predicted permease [Lactobacillus rhamnosus HN001]
 gi|257147138|emb|CAR86111.1| Permease [Lactobacillus rhamnosus GG]
 gi|259648580|dbj|BAI40742.1| putative transporter protein [Lactobacillus rhamnosus GG]
          Length = 280

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 58/110 (52%), Gaps = 8/110 (7%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVE 391
           Q++     GVL GI+G +LG+GGG I+ P L + +G+P Q +   +  ++  +SS + + 
Sbjct: 3   QMLLLMGTGVLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHVV-----RKLIILLGRASLIIF 436
           Y   +   +  A++     T+ A +G  +      + L IL G  +L+IF
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVITGLVNGKVLYILFG--ALLIF 110


>gi|395788323|ref|ZP_10467887.1| hypothetical protein ME7_01222 [Bartonella birtlesii LL-WM9]
 gi|395408240|gb|EJF74851.1| hypothetical protein ME7_01222 [Bartonella birtlesii LL-WM9]
          Length = 306

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 2/94 (2%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSS-MSVV 390
            ++   A G++ G++  ++G+GGGF M P L   L VP  V   T+ F +TF SS  +++
Sbjct: 174 SVIPVLAIGLIVGLLSSIMGIGGGFFMIPALIYLLHVPTSVVIGTSLFQITFVSSFTTIL 233

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           +    +   +  A   +   +I A  G  V RKL
Sbjct: 234 QSVTNQSVDIVLAFLLMIGGSIGAQYGTRVGRKL 267


>gi|239637839|ref|ZP_04678801.1| conserved membrane protein YtnM [Staphylococcus warneri L37603]
 gi|239596597|gb|EEQ79132.1| conserved membrane protein YtnM [Staphylococcus warneri L37603]
          Length = 299

 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 45/92 (48%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           +L+ +   G LA +V G LG+G G     + L  GV P + SAT  F+   +++ S   +
Sbjct: 3   KLLVFALAGFLAQLVDGSLGMGFGASSSSILLTFGVAPAIVSATVHFSEIATTAASGTSH 62

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           +  +    P  L       I+AF+G  V+ ++
Sbjct: 63  WKFENVHYPTMLKLAIPGAISAFLGAAVLTRI 94


>gi|392380844|ref|YP_005030040.1| conserved membrane protein of unknown function [Azospirillum
           brasilense Sp245]
 gi|356875808|emb|CCC96556.1| conserved membrane protein of unknown function [Azospirillum
           brasilense Sp245]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 32/54 (59%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
            G L G + G+ G+GGGF+M PL + +GVPP ++  T    +  +S   V+ ++
Sbjct: 20  MGGLVGFLSGMFGVGGGFLMTPLLIFIGVPPAIAVGTQANQLVAASVSGVLAHW 73


>gi|288959672|ref|YP_003450013.1| hypothetical protein AZL_028310 [Azospirillum sp. B510]
 gi|288911980|dbj|BAI73469.1| hypothetical protein AZL_028310 [Azospirillum sp. B510]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 32/56 (57%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
              G L G + G+ G+GGGF+M PL + +GVPP ++  T    +  +S   V+ ++
Sbjct: 18  LGMGWLVGFLSGMFGVGGGFLMTPLLIFIGVPPAIAVGTQANQLVAASVSGVLAHW 73


>gi|347548030|ref|YP_004854358.1| hypothetical protein LIV_0552 [Listeria ivanovii subsp. ivanovii
           PAM 55]
 gi|346981101|emb|CBW85029.1| Conserved hypothetical protein [Listeria ivanovii subsp. ivanovii
           PAM 55]
          Length = 246

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 1/86 (1%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVVEYYLLKRF 398
           FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+  Y ++   
Sbjct: 137 FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSFASIGSYTIIGGS 196

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKL 424
                +Y +  A + A +G H+ + L
Sbjct: 197 DFSIGIYMIPGAILGAIIGTHLNKLL 222


>gi|374581728|ref|ZP_09654822.1| putative permease [Desulfosporosinus youngiae DSM 17734]
 gi|374417810|gb|EHQ90245.1| putative permease [Desulfosporosinus youngiae DSM 17734]
          Length = 250

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 1/89 (1%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
            L+  C FG +A +V  + G GGG I  P  L +G+PP  +  T  FA + +S  S + +
Sbjct: 4   DLIIICTFGFIAAMVDAIAG-GGGLISLPALLMVGLPPHFALGTNKFAASLASLNSSITF 62

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVV 421
               +   P   + +  + + AF+G   V
Sbjct: 63  ARSGKVHFPLVKWQIPFSLLGAFLGAWAV 91


>gi|330813138|ref|YP_004357377.1| transmembrane permease [Candidatus Pelagibacter sp. IMCC9063]
 gi|327486233|gb|AEA80638.1| putative transmembrane permease [Candidatus Pelagibacter sp.
           IMCC9063]
          Length = 352

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 1/58 (1%)

Query: 327 KSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTF 383
           KS     +V    FG++ GI+  +LG+GGGF++ P+ +  LG+P ++ + T+ F M F
Sbjct: 168 KSKLYISIVPPIFFGIVVGILSSMLGVGGGFLLVPIMIYILGMPARLVAGTSLFVMIF 225


>gi|328950601|ref|YP_004367936.1| hypothetical protein Marky_1084 [Marinithermus hydrothermalis DSM
           14884]
 gi|328450925|gb|AEB11826.1| protein of unknown function DUF81 [Marinithermus hydrothermalis DSM
           14884]
          Length = 227

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 1/78 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFP 399
           G LAG   GLLG+GGG +M P L L  G P Q++  T+  AM   + +  + +Y L    
Sbjct: 113 GALAGFASGLLGIGGGTVMVPALVLGAGFPQQLAQGTSLAAMILPAWVGALTHYRLGHVR 172

Query: 400 VPYALYFVAVATIAAFVG 417
              A + +A   + A+ G
Sbjct: 173 GEIAPWLLAGIAVGAYAG 190


>gi|206890428|ref|YP_002248551.1| hypothetical protein THEYE_A0709 [Thermodesulfovibrio yellowstonii
           DSM 11347]
 gi|206742366|gb|ACI21423.1| membrane protein, putative [Thermodesulfovibrio yellowstonii DSM
           11347]
          Length = 422

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 45/90 (50%)

Query: 335 VSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYL 394
           ++  A G+ AG++ G +G GGGFI+ P  +  GV   ++  T  F +   + M  V +  
Sbjct: 75  IASIAVGLAAGLITGCIGAGGGFIIAPALMAAGVKGILAVGTDQFHIFAKAIMGTVIHKK 134

Query: 395 LKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           L    V  A+ FV  + + A  G ++ R +
Sbjct: 135 LGNVCVGLAVAFVIGSVLGATTGGYIQRTI 164


>gi|37521834|ref|NP_925211.1| hypothetical protein gll2265 [Gloeobacter violaceus PCC 7421]
 gi|35212833|dbj|BAC90206.1| gll2265 [Gloeobacter violaceus PCC 7421]
          Length = 119

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 52/96 (54%), Gaps = 6/96 (6%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSSSM-SVV 390
           Q+ S  + G++AG++ G+ G+GGG IM P  + L G   ++++ T+  A+     +  V+
Sbjct: 2   QIASLLSLGLVAGVLAGMFGIGGGAIMVPAMMFLIGFSTKLATGTSLAALLLPFGLFGVL 61

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
           EYY   +  +P AL  VA      FVG +   KL +
Sbjct: 62  EYYKNGQVNIPAALLLVA----GLFVGSYFGAKLTL 93


>gi|395790715|ref|ZP_10470175.1| hypothetical protein MEC_00166 [Bartonella alsatica IBS 382]
 gi|395409467|gb|EJF76057.1| hypothetical protein MEC_00166 [Bartonella alsatica IBS 382]
          Length = 304

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 59/120 (49%), Gaps = 9/120 (7%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           ++   A G++ G++  ++G+GGGF M P L   L VP  V   T+ F +TF SS + V  
Sbjct: 175 IIPVLAIGLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTVLQ 234

Query: 393 YLLKRFPVPYALYFVAV--ATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLG 450
            +  +  V   L F+ +   +I A  G    RKL     +A  +   LAF++ +  + L 
Sbjct: 235 SMTNQ-SVDIVLAFLLMLGGSIGAQYGTRAGRKL-----KAEQLRMALAFLVLIVCMRLA 288


>gi|417821200|ref|ZP_12467814.1| hypothetical protein VCHE39_2703 [Vibrio cholerae HE39]
 gi|340038831|gb|EGQ99805.1| hypothetical protein VCHE39_2703 [Vibrio cholerae HE39]
          Length = 249

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 49/103 (47%), Gaps = 3/103 (2%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF-PVP 401
           +AG +  + G GGG +  P  L LG+PP ++  T   A TF+S  +   YY  K F P  
Sbjct: 9   IAGFIDAVAG-GGGMLTVPALLSLGLPPHLALGTNKLAATFASLTAAWTYYSQKLFDPAC 67

Query: 402 YALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           +   F+A   I A +G   V  +     R  L + ILA  ++ 
Sbjct: 68  WKRAFIAT-LIGAILGTLAVDAISTDWLRKVLPLIILAAALYT 109


>gi|384222369|ref|YP_005613535.1| hypothetical protein BJ6T_87050 [Bradyrhizobium japonicum USDA 6]
 gi|354961268|dbj|BAL13947.1| hypothetical protein BJ6T_87050 [Bradyrhizobium japonicum USDA 6]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+    +  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALSYW 73


>gi|87310558|ref|ZP_01092687.1| putative membrane protein [Blastopirellula marina DSM 3645]
 gi|87286779|gb|EAQ78684.1| putative membrane protein [Blastopirellula marina DSM 3645]
          Length = 262

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 50/99 (50%), Gaps = 8/99 (8%)

Query: 345 GIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF--PVP 401
           G+V  LLG+GGG I  P  +  L +PP  ++AT+ F +TF +  + + +  +  F   + 
Sbjct: 155 GVVSSLLGIGGGIIHVPFLIRALKMPPHFATATSHFVLTFIALTATITHVSMGEFQGELS 214

Query: 402 YALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAF 440
             +Y      + A +G  V  KL     + SLI+ +LA 
Sbjct: 215 TTMYLAVGVMMGAPIGAAVSTKL-----KGSLIVKMLAL 248


>gi|414161496|ref|ZP_11417756.1| hypothetical protein HMPREF9310_02130 [Staphylococcus simulans
           ACS-120-V-Sch1]
 gi|410876392|gb|EKS24303.1| hypothetical protein HMPREF9310_02130 [Staphylococcus simulans
           ACS-120-V-Sch1]
          Length = 274

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 308 EAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGV--LAGIVGGLLGLGGGFIMGPLFL- 364
           + I L++ ++   +  D   +   + +    AF V    GI  GL G+GGG +M PL L 
Sbjct: 129 KPIKLFEKKKYEKTYVDKDGTIYHYHVPPVFAFVVTFFIGITTGLFGIGGGALMTPLMLI 188

Query: 365 ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
               PP V+  T+   + FSS MS + +         YAL  V  + + A +G  + + +
Sbjct: 189 VFRFPPHVAVGTSMMMIFFSSVMSSIGHTFQGHVAWHYALVLVISSYVGAKIGVRINQSI 248


>gi|227534549|ref|ZP_03964598.1| permease [Lactobacillus paracasei subsp. paracasei ATCC 25302]
 gi|227187798|gb|EEI67865.1| permease [Lactobacillus paracasei subsp. paracasei ATCC 25302]
          Length = 280

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVE 391
           Q++     G+L GI+G +LG+GGG I+ P L + +G+P Q +   +  ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           Y   +   +  A++     T+ A +G  V
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91


>gi|219851806|ref|YP_002466238.1| hypothetical protein Mpal_1173 [Methanosphaerula palustris E1-9c]
 gi|219546065|gb|ACL16515.1| protein of unknown function DUF81 [Methanosphaerula palustris
           E1-9c]
          Length = 267

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 47/101 (46%), Gaps = 1/101 (0%)

Query: 324 DDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTF 383
           +  K  RV   +    +G + G+V G+ G  GG +  P  + LGVP  ++ AT+   +  
Sbjct: 141 EHSKPIRV-PCIHLVIWGAMGGLVSGITGTSGGALFVPALVVLGVPIHLAVATSLLTIIP 199

Query: 384 SSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           +S      +  L    +PY + + A A + AF G  +  ++
Sbjct: 200 TSITGAATHIALGNISLPYVVVYGAGAVLGAFAGASLAPRI 240


>gi|258514257|ref|YP_003190479.1| hypothetical protein Dtox_0967 [Desulfotomaculum acetoxidans DSM
           771]
 gi|257777962|gb|ACV61856.1| protein of unknown function DUF81 [Desulfotomaculum acetoxidans DSM
           771]
          Length = 303

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSS-SMSVVEYYLLK 396
           A G+L G++  L+G+GGGFIM P+ + L G+P   +  T+ F + F++ +++V    L  
Sbjct: 178 ALGLLVGVLAALMGVGGGFIMLPVMIYLIGMPTHNAVGTSIFVIIFTAINVTVANSALNH 237

Query: 397 RFPVPYALYFVAVATIAAFVGQHVVRKL 424
              +  A+  +  ++I A  G  + +KL
Sbjct: 238 TVDLVLAMVLLVGSSIGAQFGAKLGKKL 265


>gi|88857895|ref|ZP_01132537.1| hypothetical protein PTD2_10934 [Pseudoalteromonas tunicata D2]
 gi|88819512|gb|EAR29325.1| hypothetical protein PTD2_10934 [Pseudoalteromonas tunicata D2]
          Length = 258

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 1/87 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C     AG +  + G GGG +  P  L  G+PP +   T   A +F S  + V YY  K 
Sbjct: 15  CIVAFAAGFIDAIAG-GGGLLTVPALLTAGLPPHLVLGTNKLAASFGSLTASVTYYKKKL 73

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
           F   +    +A   + A +G  +V  L
Sbjct: 74  FNPSFWKKSIAATALGALIGTLIVDHL 100


>gi|254513249|ref|ZP_05125314.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11]
 gi|221532253|gb|EEE35249.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11]
          Length = 320

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 38/70 (54%), Gaps = 5/70 (7%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPP----QVSSATATFAMTFSSSM 387
           F  ++Y   G+ A +V    G+GGG +  P+F+ LGV P      S A   F MT S S+
Sbjct: 58  FWKLAYFLVGICAALVANSTGVGGGVVFLPVFISLGVSPAEVLATSIAIQCFGMT-SGSL 116

Query: 388 SVVEYYLLKR 397
           S ++Y  +KR
Sbjct: 117 SWLQYRFVKR 126


>gi|149194358|ref|ZP_01871455.1| hypothetical protein CMTB2_07151 [Caminibacter mediatlanticus TB-2]
 gi|149135533|gb|EDM24012.1| hypothetical protein CMTB2_07151 [Caminibacter mediatlanticus TB-2]
          Length = 264

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 52/104 (50%), Gaps = 9/104 (8%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPY 402
           +AG + G+ G G G ++ P F+  G+PPQ++        T  +  ++  +   K+    Y
Sbjct: 25  IAGFIDGIAG-GAGLVLVPSFILAGLPPQIALGQEKLVSTIGTIPAIFNFLKSKQILWDY 83

Query: 403 ALYFVAVATIAAFVGQH--------VVRKLIILLGRASLIIFIL 438
            LY V +A + AFVG          +V K+++LL    LII I+
Sbjct: 84  VLYGVPMALLGAFVGAKFILILDPSIVGKMVVLLMPIGLIISII 127


>gi|418003905|ref|ZP_12643956.1| membrane protein [Lactobacillus casei UW1]
 gi|418006936|ref|ZP_12646836.1| membrane protein [Lactobacillus casei UW4]
 gi|410550290|gb|EKQ24421.1| membrane protein [Lactobacillus casei UW4]
 gi|410551109|gb|EKQ25179.1| membrane protein [Lactobacillus casei UW1]
          Length = 280

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVE 391
           Q++     G+L GI+G +LG+GGG I+ P L + +G+P Q +   +  ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           Y   +   +  A++     T+ A +G  V
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91


>gi|297543867|ref|YP_003676169.1| hypothetical protein Tmath_0381 [Thermoanaerobacter mathranii
           subsp. mathranii str. A3]
 gi|296841642|gb|ADH60158.1| protein of unknown function DUF81 [Thermoanaerobacter mathranii
           subsp. mathranii str. A3]
          Length = 253

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 61/126 (48%), Gaps = 11/126 (8%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFA---MTFSSSMSVV 390
           +V+ C  G  A  +  + G GGG I  P  L LGVPP  +  T  FA    +F+SS++ +
Sbjct: 6   IVALCGIGFAAAFIDSIAG-GGGIISLPGLLVLGVPPAFALGTNKFAASCASFTSSLTFI 64

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLG 450
           +Y +     + Y ++          +G  +  K ++LL  + L I I+  MIFV+  +L 
Sbjct: 65  KYRVYDGDLLKYLVF-------GTLLGAILGVKTVLLLDSSQLRIIIIILMIFVAIYTLF 117

Query: 451 GVGISN 456
              I N
Sbjct: 118 AKNIGN 123


>gi|146343498|ref|YP_001208546.1| permease membrane protein [Bradyrhizobium sp. ORS 278]
 gi|146196304|emb|CAL80331.1| putative permease; putative membrane protein [Bradyrhizobium sp.
           ORS 278]
          Length = 331

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ +  +  SS    + Y+  +R 
Sbjct: 42  AMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYW--RRR 99

Query: 399 PVPYALYFV 407
            +  AL  V
Sbjct: 100 AIDPALASV 108


>gi|148252390|ref|YP_001236975.1| permease [Bradyrhizobium sp. BTAi1]
 gi|146404563|gb|ABQ33069.1| putative permease [Bradyrhizobium sp. BTAi1]
          Length = 308

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 40/80 (50%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ +  +  SS    + Y+  + 
Sbjct: 18  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 77

Query: 398 FPVPYALYFVAVATIAAFVG 417
                AL  +   T+   +G
Sbjct: 78  IDPSLALVLMIGGTLGTALG 97


>gi|116493760|ref|YP_805494.1| permease [Lactobacillus casei ATCC 334]
 gi|191637002|ref|YP_001986168.1| permease [Lactobacillus casei BL23]
 gi|239631015|ref|ZP_04674046.1| transport protein [Lactobacillus paracasei subsp. paracasei 8700:2]
 gi|385818702|ref|YP_005855089.1| hypothetical protein LC2W_0169 [Lactobacillus casei LC2W]
 gi|385821877|ref|YP_005858219.1| hypothetical protein LCBD_0178 [Lactobacillus casei BD-II]
 gi|409995846|ref|YP_006750247.1| permease [Lactobacillus casei W56]
 gi|417979570|ref|ZP_12620261.1| membrane protein [Lactobacillus casei 12A]
 gi|417982367|ref|ZP_12623025.1| membrane protein [Lactobacillus casei 21/1]
 gi|417988477|ref|ZP_12629012.1| membrane protein [Lactobacillus casei A2-362]
 gi|417994871|ref|ZP_12635181.1| membrane protein [Lactobacillus casei M36]
 gi|417998099|ref|ZP_12638329.1| membrane protein [Lactobacillus casei T71499]
 gi|418000877|ref|ZP_12641049.1| membrane protein [Lactobacillus casei UCD174]
 gi|418014128|ref|ZP_12653740.1| membrane protein [Lactobacillus casei Lpc-37]
 gi|116103910|gb|ABJ69052.1| Predicted permease [Lactobacillus casei ATCC 334]
 gi|190711304|emb|CAQ65310.1| Permease [Lactobacillus casei BL23]
 gi|239527298|gb|EEQ66299.1| transport protein [Lactobacillus paracasei subsp. paracasei 8700:2]
 gi|327381029|gb|AEA52505.1| hypothetical protein LC2W_0169 [Lactobacillus casei LC2W]
 gi|327384204|gb|AEA55678.1| hypothetical protein LCBD_0178 [Lactobacillus casei BD-II]
 gi|406356858|emb|CCK21128.1| Predicted permease [Lactobacillus casei W56]
 gi|410527279|gb|EKQ02151.1| membrane protein [Lactobacillus casei 12A]
 gi|410530296|gb|EKQ05077.1| membrane protein [Lactobacillus casei 21/1]
 gi|410539601|gb|EKQ14128.1| membrane protein [Lactobacillus casei M36]
 gi|410541407|gb|EKQ15887.1| membrane protein [Lactobacillus casei A2-362]
 gi|410541887|gb|EKQ16353.1| membrane protein [Lactobacillus casei T71499]
 gi|410549095|gb|EKQ23271.1| membrane protein [Lactobacillus casei UCD174]
 gi|410554457|gb|EKQ28432.1| membrane protein [Lactobacillus casei Lpc-37]
          Length = 280

 Score = 39.3 bits (90), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVE 391
           Q++     G+L GI+G +LG+GGG I+ P L + +G+P Q +   +  ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           Y   +   +  A++     T+ A +G  V
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91


>gi|15616016|ref|NP_244321.1| hypothetical protein BH3454 [Bacillus halodurans C-125]
 gi|10176077|dbj|BAB07173.1| BH3454 [Bacillus halodurans C-125]
          Length = 273

 Score = 39.3 bits (90), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%), Gaps = 10/70 (14%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELG--------VPPQVSSATATFAMTFSSSMSVVEY 392
           G+LAG +G L+GLGGG I+ P  L LG        V PQV+  T+   M F+   S   Y
Sbjct: 10  GLLAGTIGSLMGLGGGIIVVPALLWLGGTALLSEAVTPQVAVGTSLLIMIFTGLSSTFAY 69

Query: 393 YLLKRFPVPY 402
             +K+  V Y
Sbjct: 70  --MKKGTVDY 77



 Score = 38.9 bits (89), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 34/69 (49%), Gaps = 7/69 (10%)

Query: 61  QPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSA 120
            PEG  Y +       G++  LG +I FF      + G+GGG + VP + L+  F P  A
Sbjct: 143 DPEGQRYDY-------GFQAWLGILIAFFVGLLSGLFGIGGGTLLVPAMILLFAFPPHVA 195

Query: 121 TAISKCMIM 129
            A S  MI+
Sbjct: 196 VATSMFMII 204


>gi|374572587|ref|ZP_09645683.1| putative permease [Bradyrhizobium sp. WSM471]
 gi|374420908|gb|EHR00441.1| putative permease [Bradyrhizobium sp. WSM471]
          Length = 305

 Score = 39.3 bits (90), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+    +  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73


>gi|384171573|ref|YP_005552950.1| hypothetical protein [Arcobacter sp. L]
 gi|345471183|dbj|BAK72633.1| conserved hypothetical protein [Arcobacter sp. L]
          Length = 255

 Score = 39.3 bits (90), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 5/86 (5%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G +AG +  + G GGG I  P+ L  G+PP    AT   A  F + M+ ++Y+L K+   
Sbjct: 19  GFVAGYIDSIAG-GGGMIQVPVLLYSGIPPIFVLATNKIASLFGTLMATIKYFLSKKI-- 75

Query: 401 PYALYFVAVATIAAFVGQHVVRKLII 426
             +   VA+  I   +  ++  KL++
Sbjct: 76  --SFRIVAIGLIPCLLASYIGSKLVM 99


>gi|399056281|ref|ZP_10743692.1| putative permease [Brevibacillus sp. CF112]
 gi|433543570|ref|ZP_20499974.1| hypothetical protein D478_07713 [Brevibacillus agri BAB-2500]
 gi|398046272|gb|EJL38897.1| putative permease [Brevibacillus sp. CF112]
 gi|432185127|gb|ELK42624.1| hypothetical protein D478_07713 [Brevibacillus agri BAB-2500]
          Length = 274

 Score = 39.3 bits (90), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 7/87 (8%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFL------ELG-VPPQVSSATATFAMTFSSSMSVVEYY 393
           GV+AG++G + GLGGG    P  +      E G +PPQV++AT+   +  ++  S + Y 
Sbjct: 13  GVVAGVIGSIAGLGGGIFFVPALMFFANWYEPGSMPPQVAAATSLLVIAVTALSSSISYI 72

Query: 394 LLKRFPVPYALYFVAVATIAAFVGQHV 420
             K+  +  AL F   +   A VG ++
Sbjct: 73  KQKKVDMQSALLFFIGSAPGAIVGVYL 99


>gi|420184206|ref|ZP_14690317.1| hypothetical protein HMPREF9986_00402 [Staphylococcus epidermidis
           NIHLM040]
 gi|394257654|gb|EJE02570.1| hypothetical protein HMPREF9986_00402 [Staphylococcus epidermidis
           NIHLM040]
          Length = 299

 Score = 39.3 bits (90), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 40/81 (49%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G LA +V GLLG+G G     + L  G+ P V SAT  F+   +++ S   ++       
Sbjct: 11  GFLAQLVDGLLGMGFGASSSSILLTYGIAPAVVSATVHFSEIATTAASGTSHWRFDNVHK 70

Query: 401 PYALYFVAVATIAAFVGQHVV 421
           P  L      +I+AF+G  V+
Sbjct: 71  PTMLKLAIPGSISAFIGAGVL 91


>gi|152974977|ref|YP_001374494.1| hypothetical protein Bcer98_1177 [Bacillus cytotoxicus NVH 391-98]
 gi|152023729|gb|ABS21499.1| protein of unknown function DUF81 [Bacillus cytotoxicus NVH 391-98]
          Length = 266

 Score = 39.3 bits (90), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 35/58 (60%), Gaps = 1/58 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G++AG +G L+GLGGG I+ PL + L  + PQ++  T+   + F+   S + Y   KR
Sbjct: 10  GLIAGTIGSLVGLGGGIIVVPLLIGLHSLSPQIAVGTSIVTVVFTGLSSTLAYVKHKR 67


>gi|315281248|ref|ZP_07869919.1| integral membrane protein, putative [Listeria marthii FSL S4-120]
 gi|313615105|gb|EFR88579.1| integral membrane protein, putative [Listeria marthii FSL S4-120]
          Length = 246

 Score = 39.3 bits (90), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 130 YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 188

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y L+        +Y +  A I A +G  + + L
Sbjct: 189 SYALIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 222


>gi|410457114|ref|ZP_11310951.1| hypothetical protein BABA_24600 [Bacillus bataviensis LMG 21833]
 gi|409926453|gb|EKN63624.1| hypothetical protein BABA_24600 [Bacillus bataviensis LMG 21833]
          Length = 273

 Score = 39.3 bits (90), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 11/114 (9%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELG-------VPPQVSSATATFAMTFSSSMSVVEYY 393
           G++A  +G L+G+GGG I+ P  L L        + PQV   T+ F M F+   S + Y 
Sbjct: 10  GLVASSIGSLIGMGGGIIVVPALLYLATLSTFSHLTPQVVVGTSLFTMIFTGLSSTLSYM 69

Query: 394 LLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
            LK       L F+  +   + +G  V  K    L   S  I+   F++FVS +
Sbjct: 70  KLKTVDYKSGLIFLIGSGPGSILGAWVTEK----LNLHSFNIYFGIFILFVSLV 119


>gi|89053334|ref|YP_508785.1| hypothetical protein Jann_0843 [Jannaschia sp. CCS1]
 gi|88862883|gb|ABD53760.1| protein of unknown function DUF81 [Jannaschia sp. CCS1]
          Length = 307

 Score = 39.3 bits (90), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 50/99 (50%), Gaps = 2/99 (2%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVA 408
           G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ +   K   +   L  + 
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHLKRKTVDLKMGLVLLI 88

Query: 409 VATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAI 447
              I A +G  V   L   +G+  L++  L +++F+  I
Sbjct: 89  GGLIGAAIGVQVFAALTA-IGQVDLLVR-LCYVVFLGII 125


>gi|57865552|ref|YP_189743.1| hypothetical protein SERP2187 [Staphylococcus epidermidis RP62A]
 gi|57636210|gb|AAW52998.1| conserved hypothetical protein [Staphylococcus epidermidis RP62A]
          Length = 299

 Score = 39.3 bits (90), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 40/81 (49%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G LA +V GLLG+G G     + L  G+ P V SAT  F+   +++ S   ++       
Sbjct: 11  GFLAQLVDGLLGMGFGASSSSILLTYGIAPAVVSATVHFSEIATTAASGTSHWRFDNVHK 70

Query: 401 PYALYFVAVATIAAFVGQHVV 421
           P  L      +I+AF+G  V+
Sbjct: 71  PTMLKLAIPGSISAFIGAGVL 91


>gi|414153246|ref|ZP_11409573.1| conserved membrane hypothetical protein [Desulfotomaculum
           hydrothermale Lam5 = DSM 18033]
 gi|411455628|emb|CCO07476.1| conserved membrane hypothetical protein [Desulfotomaculum
           hydrothermale Lam5 = DSM 18033]
          Length = 339

 Score = 39.3 bits (90), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 342 VLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           + +G+V G LG+GGGFI  P+ +  LG+P  ++  T  FA+  ++S     Y L  +  +
Sbjct: 186 IFSGLVSGFLGVGGGFIRVPILIYALGLPTVMAVGTDLFAILITNSWGAYIYALAGKVEI 245

Query: 401 PYALYFVAVATIAAFVGQ 418
             AL  V  A + A +G 
Sbjct: 246 IGALVMVVGAAVGAQIGS 263


>gi|365879405|ref|ZP_09418829.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 375]
 gi|365292656|emb|CCD91360.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 375]
          Length = 308

 Score = 39.3 bits (90), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
            A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ +  +  SS    + Y+
Sbjct: 18  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYW 73


>gi|251810296|ref|ZP_04824769.1| hypothetical membrane protein [Staphylococcus epidermidis
           BCM-HMP0060]
 gi|251806178|gb|EES58835.1| hypothetical membrane protein [Staphylococcus epidermidis
           BCM-HMP0060]
          Length = 275

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 61/118 (51%), Gaps = 8/118 (6%)

Query: 325 DGKSFRVFQLVSYCAF--GVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAM 381
           +GK++R + +    AF   +  G++ GL G+GGG +M PL L     PP V+  T+   +
Sbjct: 148 EGKTYR-YSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL----IILLGRASLII 435
            FSS MS + +      P  Y++  +  + I A +G  V R +    +++L R  ++I
Sbjct: 207 FFSSVMSSIGHIFQGHVPWGYSIILIISSVIGAQIGVRVNRSMKSDTVVMLLRTVMLI 264


>gi|256419285|ref|YP_003119938.1| hypothetical protein Cpin_0237 [Chitinophaga pinensis DSM 2588]
 gi|256034193|gb|ACU57737.1| protein of unknown function DUF81 [Chitinophaga pinensis DSM 2588]
          Length = 335

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 44/98 (44%)

Query: 327 KSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSS 386
           +S    +L+ Y   G+ A  V G LG+  G     L L LG+PP V+SA+   A  F++ 
Sbjct: 73  QSLFTRELLFYILVGLAAQTVDGALGMAYGATSSSLLLGLGIPPSVASASVHVAEVFTTG 132

Query: 387 MSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            S + ++          LY +    I +  G  ++  +
Sbjct: 133 ASGISHFRFGNVNKKLFLYLLVPGIIGSIAGSFLISNM 170


>gi|383769051|ref|YP_005448114.1| hypothetical protein S23_07820 [Bradyrhizobium sp. S23321]
 gi|381357172|dbj|BAL74002.1| hypothetical protein S23_07820 [Bradyrhizobium sp. S23321]
          Length = 305

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+    +  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73


>gi|386399941|ref|ZP_10084719.1| putative permease [Bradyrhizobium sp. WSM1253]
 gi|385740567|gb|EIG60763.1| putative permease [Bradyrhizobium sp. WSM1253]
          Length = 305

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+    +  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73


>gi|398824727|ref|ZP_10583048.1| putative permease [Bradyrhizobium sp. YR681]
 gi|398224594|gb|EJN10895.1| putative permease [Bradyrhizobium sp. YR681]
          Length = 305

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           A G   G V G+ G+GGGF+M PL + +G+ P V+ A+    +  SS    + Y+
Sbjct: 19  AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73


>gi|323701388|ref|ZP_08113062.1| protein of unknown function DUF81 [Desulfotomaculum nigrificans DSM
           574]
 gi|323533647|gb|EGB23512.1| protein of unknown function DUF81 [Desulfotomaculum nigrificans DSM
           574]
          Length = 433

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 1/90 (1%)

Query: 319 IASKGDDGKS-FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATA 377
           I  K   G S F  F      A GV AG++ G +G GGGF++ P  + LGV   ++  T 
Sbjct: 60  IGGKSAYGPSHFTPFMFYGSMAIGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTD 119

Query: 378 TFAMTFSSSMSVVEYYLLKRFPVPYALYFV 407
            F +   + M  V +  L    V  A+ F+
Sbjct: 120 QFHIFAKAIMGTVMHKKLGNISVSLAIAFL 149


>gi|348688913|gb|EGZ28727.1| hypothetical protein PHYSODRAFT_294193 [Phytophthora sojae]
          Length = 477

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 33/147 (22%), Positives = 56/147 (38%), Gaps = 25/147 (17%)

Query: 81  VLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNL 140
           +LG ++        S    GGG +   +  L++   P  A  ++   I G AV     NL
Sbjct: 63  LLGIVLAALVVLIASGAKTGGGAVLDAVFILVLKLGPDEAIPLASITIFGGAVCDFLLNL 122

Query: 141 KLRHPTLDMPIIDYDLALLIQPMLM-------------------------LGISIGVAFN 175
             +    +  +I++D  L++QPML+                         +G   G +  
Sbjct: 123 WKKPINSNFSLINWDFILMMQPMLLSKFVGGVRPGLSSGMRIKWVWFYRAVGAGFGASLI 182

Query: 176 VIFADWMVTVLLIVLFIGTSTKAFLKG 202
             F+ W++ + LIV       KAF K 
Sbjct: 183 SWFSTWLLIIALIVYLGYIGKKAFKKA 209


>gi|333923117|ref|YP_004496697.1| hypothetical protein Desca_0906 [Desulfotomaculum carboxydivorans
           CO-1-SRB]
 gi|333748678|gb|AEF93785.1| protein of unknown function DUF81 [Desulfotomaculum carboxydivorans
           CO-1-SRB]
          Length = 433

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 1/90 (1%)

Query: 319 IASKGDDGKS-FRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATA 377
           I  K   G S F  F      A GV AG++ G +G GGGF++ P  + LGV   ++  T 
Sbjct: 60  IGGKSAYGPSHFTPFMFYGSMAIGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTD 119

Query: 378 TFAMTFSSSMSVVEYYLLKRFPVPYALYFV 407
            F +   + M  V +  L    V  A+ F+
Sbjct: 120 QFHIFAKAIMGTVMHKKLGNISVSLAIAFL 149


>gi|365886110|ref|ZP_09425076.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3809]
 gi|365338422|emb|CCD97607.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3809]
          Length = 319

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
            A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ +  +  SS    + Y+
Sbjct: 29  LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYW 84


>gi|18313308|ref|NP_559975.1| hypothetical protein PAE2388 [Pyrobaculum aerophilum str. IM2]
 gi|18160832|gb|AAL64157.1| hypothetical protein PAE2388 [Pyrobaculum aerophilum str. IM2]
          Length = 243

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 33/54 (61%)

Query: 82  LGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVST 135
           LG ++ F G    S+ GVGGG IFVP L L+ G D K A A+S  +I   A+++
Sbjct: 123 LGYLLVFIGGFVSSLFGVGGGTIFVPALILLAGLDAKLAAAMSMGIIFPTALAS 176


>gi|237728550|ref|ZP_04559031.1| conserved hypothetical protein [Citrobacter sp. 30_2]
 gi|226910028|gb|EEH95946.1| conserved hypothetical protein [Citrobacter sp. 30_2]
          Length = 265

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 56/113 (49%), Gaps = 7/113 (6%)

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS-SSMSVVEYYLL 395
           +   G ++G + GLL +GGGFI+ PL  +   P  + S  AT  M  +   M  +   ++
Sbjct: 145 FLTVGAVSGFMTGLLAVGGGFIIVPLLRQF-TPLPIHSCIATSLMIVALVGMGGIATAVM 203

Query: 396 K--RFPVPYALYFVAVATIAAFVGQHVVRKL---IILLGRASLIIFILAFMIF 443
           +    P+P+ L+FV       F+G+ +   L   I+  G A L+I +   M+F
Sbjct: 204 QGATLPMPFTLWFVLSVVTGMFIGRRLSHHLPEHIVQKGFAGLLIVVALGMVF 256


>gi|402821984|ref|ZP_10871493.1| hypothetical protein LH128_04228 [Sphingomonas sp. LH128]
 gi|402264465|gb|EJU14319.1| hypothetical protein LH128_04228 [Sphingomonas sp. LH128]
          Length = 304

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A G L G++ G+ G+GGGF+  PL +  G+PP V++A+A   +T +S   V  +   +R
Sbjct: 18  VALGALTGVLSGMFGVGGGFLTTPLMIFYGIPPTVAAASAASQVTGASVSGVFAHT--RR 75

Query: 398 FPVPYALYFVAVA 410
             V Y +  V VA
Sbjct: 76  GGVDYQMGAVMVA 88


>gi|336314523|ref|ZP_08569440.1| Putative permease [Rheinheimera sp. A13L]
 gi|335881063|gb|EGM78945.1| Putative permease [Rheinheimera sp. A13L]
          Length = 259

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 4/78 (5%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C   ++AG +  + G GGG +  P  L  G+PP +   T   A TF S  + V YY  + 
Sbjct: 14  CVVALVAGFIDAIAG-GGGLLTVPALLTAGLPPHLVLGTNKLAATFGSITASVTYYRRRL 72

Query: 398 FPVPY---ALYFVAVATI 412
           F   +   AL F A+  +
Sbjct: 73  FDPAFWRTALIFTAIGAV 90


>gi|374852184|dbj|BAL55123.1| hypothetical conserved protein [uncultured Acidobacteria bacterium]
          Length = 276

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 28/112 (25%), Positives = 54/112 (48%), Gaps = 1/112 (0%)

Query: 314 KGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQV 372
            G R +  +     S++V Q        VLAG+  GL+ +GGG +  P + L   +P ++
Sbjct: 137 SGVRTLIDRQGRVYSYQVCQRGQLLGISVLAGLGSGLVAIGGGELNTPAMVLRCQIPIRI 196

Query: 373 SSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           ++ATA F M  +     + + L+ R     AL+ +  A +   +G ++  ++
Sbjct: 197 AAATAVFTMALTVLAGAITHVLVGRPVWNLALWTIPGAILGGQLGSYLASRI 248


>gi|365896889|ref|ZP_09434939.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3843]
 gi|365422354|emb|CCE07481.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3843]
          Length = 308

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
            A G   G V G+ G+GGGF+M PL + +G+ P V+ A+ +  +  SS    + Y+
Sbjct: 18  LAMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPGVAVASVSSHIAASSFSGALSYW 73


>gi|11497744|ref|NP_068966.1| hypothetical protein AF0127 [Archaeoglobus fulgidus DSM 4304]
 gi|2650522|gb|AAB91106.1| predicted coding region AF_0127 [Archaeoglobus fulgidus DSM 4304]
          Length = 475

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 27/119 (22%), Positives = 55/119 (46%), Gaps = 1/119 (0%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVE 391
           F+ +   A G   G++ G +G GG F++ P  + +G P  ++ A+        + +    
Sbjct: 23  FEAIFLLALGFFGGMLSGFIGSGGAFVLTPGMMSIGTPGPIAVASNMCHKFPKAMIGAYR 82

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFI-LAFMIFVSAISL 449
            Y L++     A+     A     VG  V +++   LG     +++ +AF++ + A+SL
Sbjct: 83  RYKLRQLDPKLAVIMAVSAIAGVQVGIQVQKQIYEALGETGTNLYVSIAFLVVLPAVSL 141


>gi|14590420|ref|NP_142486.1| hypothetical protein PH0518 [Pyrococcus horikoshii OT3]
 gi|3256923|dbj|BAA29606.1| 252aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 252

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 40/77 (51%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++ G +  + GLGGGF++ P    LGV    +  T++ ++ F++  S   YY  K+   
Sbjct: 12  GLITGTLAAMFGLGGGFLLVPTLNILGVEIHHAIGTSSASIIFTALSSSYAYYKQKKIYY 71

Query: 401 PYALYFVAVATIAAFVG 417
              +   + A I A++G
Sbjct: 72  DIGIALASTAVIGAYIG 88


>gi|408375545|ref|ZP_11173209.1| hypothetical protein A11A3_15572 [Alcanivorax hongdengensis A-11-3]
 gi|407764566|gb|EKF73039.1| hypothetical protein A11A3_15572 [Alcanivorax hongdengensis A-11-3]
          Length = 264

 Score = 39.3 bits (90), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 25/53 (47%), Positives = 30/53 (56%), Gaps = 2/53 (3%)

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATF--AMTFSSSM 387
           Y A G L G+  GL G+GGG +M PL L  G PPQ +  T     A+T SS M
Sbjct: 11  YIAAGGLVGLAIGLTGVGGGSLMTPLLLLFGFPPQTAIGTDLLYAAITKSSGM 63


>gi|81428071|ref|YP_395070.1| transporter [Lactobacillus sakei subsp. sakei 23K]
 gi|78609712|emb|CAI54758.1| Putative transport protein [Lactobacillus sakei subsp. sakei 23K]
          Length = 281

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 344 AGIVGGLLGLGGG-FIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY----LLKRF 398
           AGI  GLLG+G G F +  +   + +P + SSAT+   M  +++ S   Y+    +L   
Sbjct: 171 AGIASGLLGIGSGVFKVTAMDTIMKMPLKPSSATSNLMMGVTAAASATVYFFSGAILPEI 230

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKLI 425
             P AL  +A AT+ + + QH+  K I
Sbjct: 231 AAPLALGILAGATVGSRIMQHLQPKFI 257


>gi|423643404|ref|ZP_17619022.1| hypothetical protein IK9_03349 [Bacillus cereus VD166]
 gi|401275408|gb|EJR81375.1| hypothetical protein IK9_03349 [Bacillus cereus VD166]
          Length = 266

 Score = 38.9 bits (89), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 35/58 (60%), Gaps = 1/58 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G++AG +G L+GLGGG I+ PL + L  + PQ++  T+   + F+   S + Y   KR
Sbjct: 10  GLIAGTIGSLVGLGGGIIIVPLLIGLHSLSPQLAVGTSMVTVVFTGLSSTLTYMKHKR 67


>gi|255659129|ref|ZP_05404538.1| putative membrane protein [Mitsuokella multacida DSM 20544]
 gi|260848575|gb|EEX68582.1| putative membrane protein [Mitsuokella multacida DSM 20544]
          Length = 263

 Score = 38.9 bits (89), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 10/140 (7%)

Query: 311 SLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGVP 369
           S +K    I     D +  R   +V+    G L+ I     G+GGG I  PL +  LG P
Sbjct: 126 STHKPVSNIMELPKDFQYNRTLGIVASLGVGFLSSI----FGIGGGVIHVPLMIYLLGFP 181

Query: 370 PQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLG 429
             V++AT+ F +  SS+  V+ +++L       A+     A I A +G  + +K      
Sbjct: 182 VHVATATSHFVLACSSAFGVISHFMLDHIIWVPAICISIGAAIGAQIGAKLSKKT----- 236

Query: 430 RASLIIFILAFMIFVSAISL 449
           ++ +I+ +L+  +F   I L
Sbjct: 237 KSKVILMLLSLAMFALGIRL 256


>gi|288554026|ref|YP_003425961.1| hypothetical protein BpOF4_05025 [Bacillus pseudofirmus OF4]
 gi|288545186|gb|ADC49069.1| hypothetical protein BpOF4_05025 [Bacillus pseudofirmus OF4]
          Length = 274

 Score = 38.9 bits (89), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 42/175 (24%), Positives = 84/175 (48%), Gaps = 7/175 (4%)

Query: 256 ENINWKELGLLVFVWV---AFLGLQIAKNHTASCSIVYWVLDLLQIPVSLVVSLY-EAIS 311
           + +++K  GLL F+     A +G+ + +       ++Y+ L ++ + + L++  Y + I 
Sbjct: 73  KKVDYKS-GLLFFLGSGPGALIGVWLNRYLEVEPFLIYFGLFMIVVAIVLIIRPYLKPIP 131

Query: 312 LYKGRRVIASKGDDGKSFRV-FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLEL-GVP 369
           L +         D G+SF   ++     +   + G++ GL G+GGG +M P  + L   P
Sbjct: 132 LSEKGVKRTYVNDLGESFEYGYRPAVAISIAFVVGMLSGLFGIGGGSLMVPAMIMLFHFP 191

Query: 370 PQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
           P ++ AT+ F +  S+  S V + +L      YAL  +  A +    G  + ++L
Sbjct: 192 PHMAVATSMFMILLSAITSSVSHIVLGNVNWLYALALIPGAYLGGIAGAAINKRL 246


>gi|319638962|ref|ZP_07993720.1| hypothetical protein HMPREF0604_01344 [Neisseria mucosa C102]
 gi|317399866|gb|EFV80529.1| hypothetical protein HMPREF0604_01344 [Neisseria mucosa C102]
          Length = 266

 Score = 38.9 bits (89), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 5/104 (4%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A  ++AG V  + G G G I+ P FL +G+PPQV+ A      T  +  ++  +  +K 
Sbjct: 23  VAASLIAGYVDAIAG-GAGLILIPAFLMVGLPPQVALAQEKLVSTIGTVAAIKNF--MKS 79

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
             + +  + V V  +AA +G  V  K+I++L   ++   ILAF+
Sbjct: 80  SSIVW--HIVPVGIVAALIGAFVGAKVILILPVETINYIILAFL 121


>gi|444919977|ref|ZP_21239821.1| YfcA [Wohlfahrtiimonas chitiniclastica SH04]
 gi|444508844|gb|ELV09012.1| YfcA [Wohlfahrtiimonas chitiniclastica SH04]
          Length = 257

 Score = 38.9 bits (89), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 3/59 (5%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           A  +LAG V  + G GGG I  PL L +G PP V+ +   +   F +S++  EY  +KR
Sbjct: 16  AAAILAGFVDSIAG-GGGLITAPLMLSIGAPPHVALSMGKYMGVFGTSLAAWEY--IKR 71


>gi|323488645|ref|ZP_08093887.1| hypothetical protein GPDM_04854 [Planococcus donghaensis MPA1U2]
 gi|323397663|gb|EGA90467.1| hypothetical protein GPDM_04854 [Planococcus donghaensis MPA1U2]
          Length = 275

 Score = 38.9 bits (89), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 62/122 (50%), Gaps = 10/122 (8%)

Query: 73  ELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAA 132
           E  +G+ I    ++ FF      + G+GGG I VP + L+  F P  A   S  M+  +A
Sbjct: 149 EYVYGYPIWFALLLTFFVGVASGLFGIGGGSIIVPAMILLFLFPPHVAVGTSMLMVFLSA 208

Query: 133 VSTVYYNLKLRHPTL-DMPIIDYDLALLIQPMLMLGISIGVAFNV-IFADWMVTVLLIVL 190
           +        + H +L ++P I Y LA++  P   +G  IG A N  + ++ +VT+L IVL
Sbjct: 209 LVN-----SVTHISLGNVPWI-YTLAVV--PGAYIGAKIGAALNKRLNSEVLVTILRIVL 260

Query: 191 FI 192
            +
Sbjct: 261 LV 262


>gi|220918686|ref|YP_002493990.1| hypothetical protein A2cp1_3594 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219956540|gb|ACL66924.1| protein of unknown function DUF81 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 255

 Score = 38.9 bits (89), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 48/87 (55%), Gaps = 6/87 (6%)

Query: 299 PVSLVVSLYEAISLYKG-----RRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGL 353
           PV L+ +L+ A+++Y       RR       +G+ + V +  +      +AG + GL+G+
Sbjct: 95  PVGLLKALFAAVAVYSALVMWRRRPAGPPAAEGEPYVVRRWGTGLGASAVAGAISGLIGV 154

Query: 354 GGGFIMGPLF-LELGVPPQVSSATATF 379
           GGGFI  P+  L + +P +V+ AT+ F
Sbjct: 155 GGGFIKVPVMTLAMELPFKVAVATSNF 181


>gi|228990560|ref|ZP_04150525.1| hypothetical protein bpmyx0001_13220 [Bacillus pseudomycoides DSM
           12442]
 gi|228769086|gb|EEM17684.1| hypothetical protein bpmyx0001_13220 [Bacillus pseudomycoides DSM
           12442]
          Length = 281

 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 50/96 (52%), Gaps = 5/96 (5%)

Query: 345 GIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYA 403
           G + GL G+GGG ++ P + L    P Q++ AT+ F +  S+ +S + +  L      YA
Sbjct: 175 GFISGLFGIGGGVLLVPAMMLLFAFPAQIAVATSMFIVFLSAIVSSLTHISLGNVSWIYA 234

Query: 404 LYFVAVATIAAFVGQHVVRKL----IILLGRASLII 435
           L  +  A I   +G ++  KL    II L R +LII
Sbjct: 235 LILIPGAWIGGKIGAYINTKLSGNAIINLLRITLII 270


>gi|374579212|ref|ZP_09652306.1| putative permease [Desulfosporosinus youngiae DSM 17734]
 gi|374415294|gb|EHQ87729.1| putative permease [Desulfosporosinus youngiae DSM 17734]
          Length = 349

 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 59/123 (47%), Gaps = 17/123 (13%)

Query: 336 SYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSAT--------ATFAMTFSSSM 387
           S    G L G++ G +G GG FIM P  + LGVP  V+ ++        A       S M
Sbjct: 25  SIAGLGFLGGMLSGFIGSGGAFIMTPGMMALGVPGLVAVSSNLAHKLGKAIVGARKHSKM 84

Query: 388 SVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFI-LAFMIFVSA 446
             V++    R  +  +++ +    +A F+ +H+       +G+A   ++I L F++ +S 
Sbjct: 85  GNVDF----RLGICISIFLLLGVNLAVFLNEHIFN----YMGKAGSDLYINLTFVVLLSG 136

Query: 447 ISL 449
           +SL
Sbjct: 137 VSL 139


>gi|417316892|ref|ZP_12103522.1| hypothetical protein LM220_02535 [Listeria monocytogenes J1-220]
 gi|328475699|gb|EGF46445.1| hypothetical protein LM220_02535 [Listeria monocytogenes J1-220]
          Length = 176

 Score = 38.9 bits (89), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 60  YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 118

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y ++        +Y +  A I A +G  + + L
Sbjct: 119 SYAIIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 152


>gi|301065336|ref|YP_003787359.1| permease [Lactobacillus casei str. Zhang]
 gi|300437743|gb|ADK17509.1| Predicted permease [Lactobacillus casei str. Zhang]
          Length = 280

 Score = 38.9 bits (89), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVE 391
           Q++     G+L GI+G +LG+GGG I+ P L + +G+P Q +   +  ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIG 62

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           Y   +   +  A++     T+ A +G  V
Sbjct: 63  YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91


>gi|452990317|emb|CCQ98539.1| conserved membrane hypothetical protein [Clostridium ultunense Esp]
          Length = 309

 Score = 38.9 bits (89), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 49/180 (27%), Positives = 77/180 (42%), Gaps = 32/180 (17%)

Query: 85  IIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRH 144
           +IG     +G++ G GGG IFVP L L+  +DP  A      +++  A+S +   ++ + 
Sbjct: 11  VIGLIAGTYGTIVGAGGGFIFVPALLLLFHYDPPVAAGTGLVVVLINALSGMIGYIRQKR 70

Query: 145 PTLDMPIIDYDLALLIQPMLMLGISI-----GVAFNVIFADWMVTVLLIVLFIGTSTKAF 199
                 +I   L+L   P   LG+ +     G AF + FA  M+  L I L +    K+ 
Sbjct: 71  VDYRFGVI---LSLAAIPGTFLGVWLSRAVTGHAFFLTFAI-MLIGLGIFLLVKKEPKS- 125

Query: 200 LKGVETWK------------RETI--LKKEAARCLGSNGAGAGEV-------EYKSLPSG 238
            KG +  +            RE +   + E  R   S  AG GE+         + LPSG
Sbjct: 126 -KGEQVTEGSFGEEAFPSGLREEVAATRSEGTRTYDSADAGEGEIPADLFLQNREKLPSG 184


>gi|402841864|ref|ZP_10890301.1| sulfite exporter TauE/SafE [Klebsiella sp. OBRC7]
 gi|402281797|gb|EJU30419.1| sulfite exporter TauE/SafE [Klebsiella sp. OBRC7]
          Length = 265

 Score = 38.9 bits (89), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 58/113 (51%), Gaps = 7/113 (6%)

Query: 337 YCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFS-SSMSVVEYYLL 395
           +  FG + G + GLL +GGGFI+ PL  +   P  + S  AT  M  +  S+  +   ++
Sbjct: 145 FLMFGAVTGFMTGLLAVGGGFIIVPLLRQF-TPLPIHSCIATSLMIVALVSIGGIAVAVI 203

Query: 396 K--RFPVPYALYFVAVATIAAFVGQHVVRKL---IILLGRASLIIFILAFMIF 443
           +    P+P+ L+FV  A     +G+ + + L   I+  G + L+I +   M+F
Sbjct: 204 QGATLPLPFTLWFVLSAATGMSIGRRLSQHLPEHIVQKGFSGLLIVVALGMVF 256


>gi|258424129|ref|ZP_05687011.1| conserved hypothetical protein [Staphylococcus aureus A9635]
 gi|418889517|ref|ZP_13443650.1| sulfite exporter TauE/SafE family protein [Staphylococcus aureus
           subsp. aureus CIG1524]
 gi|257845750|gb|EEV69782.1| conserved hypothetical protein [Staphylococcus aureus A9635]
 gi|377753025|gb|EHT76943.1| sulfite exporter TauE/SafE family protein [Staphylococcus aureus
           subsp. aureus CIG1524]
          Length = 256

 Score = 38.9 bits (89), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 16/110 (14%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY------- 392
           FG +A  +  ++G GGG I  P  L +G+PP V+  T   A +FSS  S +++       
Sbjct: 14  FGFIAAFIDSVVG-GGGLISTPALLAIGLPPSVALGTNKLASSFSSLTSAIKFIRSGKVD 72

Query: 393 -YLLKRFPVPYALYFVAVAT---IAAFVGQHVVRKLIILLGRASLIIFIL 438
            Y++ +    +   F+A A    IA  V   +++ LII +  +S+ IF L
Sbjct: 73  LYVVAKL---FGFVFLASACGAYIATMVPSQILKPLII-IALSSVFIFTL 118


>gi|339504564|ref|YP_004691984.1| hypothetical protein RLO149_c030650 [Roseobacter litoralis Och 149]
 gi|338758557|gb|AEI95021.1| hypothetical protein DUF1925 [Roseobacter litoralis Och 149]
          Length = 312

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ +  LKR
Sbjct: 37  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 83


>gi|171915322|ref|ZP_02930792.1| hypothetical protein VspiD_29145 [Verrucomicrobium spinosum DSM
           4136]
          Length = 264

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 60/125 (48%), Gaps = 14/125 (11%)

Query: 297 QIPVSLVVSLYEAISLYKGRRVIASKG--DDGKSFR-------VFQLVSYCAFG---VLA 344
            +P +L++S +  +  Y G R+   +G  DD    R        F    Y   G     A
Sbjct: 95  HLPAALLLSTFALLMAYIGARMWRGRGAEDDAPPSRCVARGPGAFGPECYLRLGSGGAAA 154

Query: 345 GIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSSSMSVVEYYLLKR-FPVPY 402
           G++ GL G+GGGFI+ P+ L + G+    + AT+   +   S   V+ + L  + FP+P 
Sbjct: 155 GLLSGLFGVGGGFIIVPVLLFVTGMSIHRAVATSLLVIFLISVSGVIAHMLHGQLFPMPL 214

Query: 403 ALYFV 407
           +L F+
Sbjct: 215 SLLFI 219


>gi|255263607|ref|ZP_05342949.1| membrane protein, putative [Thalassiobium sp. R2A62]
 gi|255105942|gb|EET48616.1| membrane protein, putative [Thalassiobium sp. R2A62]
          Length = 304

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++  K
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRK 76


>gi|255036228|ref|YP_003086849.1| hypothetical protein Dfer_2466 [Dyadobacter fermentans DSM 18053]
 gi|254948984|gb|ACT93684.1| protein of unknown function DUF81 [Dyadobacter fermentans DSM
           18053]
          Length = 293

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 36/80 (45%)

Query: 318 VIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATA 377
           VI+   +    F       Y   G+ A +V G LG+  G       L +GV P VSSA+ 
Sbjct: 36  VISVSSEQVTEFLTGDFALYVLVGLAAQLVDGALGMAYGVTSNSFLLSVGVTPAVSSASV 95

Query: 378 TFAMTFSSSMSVVEYYLLKR 397
            FA  F++  S + ++  K 
Sbjct: 96  HFAEMFTTGASAISHFRFKN 115


>gi|383753044|ref|YP_005431947.1| membrane protein [Selenomonas ruminantium subsp. lactilytica
           TAM6421]
 gi|381365096|dbj|BAL81924.1| hypothetical membrane protein [Selenomonas ruminantium subsp.
           lactilytica TAM6421]
          Length = 263

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           +FGV  G +  + G+GGG I  PL + L G P  +++AT+ F +  SS+  VV ++LL  
Sbjct: 152 SFGV--GFLSSIFGIGGGVIHVPLMIYLLGFPVHMATATSHFVLACSSAFGVVSHFLLDH 209

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRK 423
                A+     A I A +G  + +K
Sbjct: 210 IIWMPAICISIGAAIGAQIGAKISKK 235


>gi|282916974|ref|ZP_06324732.1| hypothetical protein SATG_00467 [Staphylococcus aureus subsp.
           aureus D139]
 gi|282319461|gb|EFB49813.1| hypothetical protein SATG_00467 [Staphylococcus aureus subsp.
           aureus D139]
          Length = 249

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 16/110 (14%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY------- 392
           FG +A  +  ++G GGG I  P  L +G+PP V+  T   A +F S  S +++       
Sbjct: 14  FGFIAAFIDAVVG-GGGLISTPALLAIGLPPSVALGTNKLASSFGSLTSTIKFIRSGKVD 72

Query: 393 -YLLKRFPVPYALYFVAVAT---IAAFVGQHVVRKLIILLGRASLIIFIL 438
            Y++ +    +   F+A A    IA  V   +++ LII +  +S+ IF L
Sbjct: 73  LYVVAKL---FGFVFLASACGAYIATMVPSQILKPLII-IALSSVFIFTL 118


>gi|224370873|ref|YP_002605037.1| hypothetical protein HRM2_38150 [Desulfobacterium autotrophicum
           HRM2]
 gi|223693590|gb|ACN16873.1| conserved hypothetical protein [Desulfobacterium autotrophicum
           HRM2]
          Length = 424

 Score = 38.9 bits (89), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 42/84 (50%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G+LAG++ G +G GGGFI+ P  +  G+   ++  T  F +   + M  V +  L    V
Sbjct: 88  GLLAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKLGNVSV 147

Query: 401 PYALYFVAVATIAAFVGQHVVRKL 424
           P A  FV  A + A  G  + R L
Sbjct: 148 PLAFVFVIGALMGATGGGLLNRYL 171


>gi|443325967|ref|ZP_21054637.1| putative permease [Xenococcus sp. PCC 7305]
 gi|442794404|gb|ELS03821.1| putative permease [Xenococcus sp. PCC 7305]
          Length = 260

 Score = 38.9 bits (89), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 337 YCAFG-VLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLL 395
           + AFG  ++GI+ G LG+GGG ++ PL + LG  P  + AT+  A+T ++    ++ + +
Sbjct: 8   FIAFGGFISGILAGFLGIGGGTVLVPLLVALGYSPIQAVATSALAITITALSGTLQNWRM 67

Query: 396 KRFPVPYALYFVAVATIAAFVGQHVVRKL 424
                   LY    A + A VG ++   L
Sbjct: 68  GYIKPKSLLYLGVPALLFAQVGVYLSEGL 96


>gi|310815340|ref|YP_003963304.1| hypothetical protein EIO_0856 [Ketogulonicigenium vulgare Y25]
 gi|308754075|gb|ADO42004.1| integral membrane protein, putative [Ketogulonicigenium vulgare
           Y25]
          Length = 304

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 30/47 (63%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
             G + GI+ G+ G+GGGF++ PL +  G+P  V+ ATAT  +  SS
Sbjct: 19  GLGWIVGILSGMFGVGGGFLITPLLIFSGIPSAVAVATATNQIVASS 65


>gi|126740488|ref|ZP_01756175.1| membrane protein [Roseobacter sp. SK209-2-6]
 gi|126718289|gb|EBA15004.1| membrane protein [Roseobacter sp. SK209-2-6]
          Length = 306

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 28/45 (62%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPTVAVATEANQIVASSFSGVLAHF 73


>gi|110678948|ref|YP_681955.1| hypothetical protein RD1_1642 [Roseobacter denitrificans OCh 114]
 gi|109455064|gb|ABG31269.1| integral membrane protein, putative [Roseobacter denitrificans OCh
           114]
          Length = 312

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ +  LKR
Sbjct: 37  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 83


>gi|442610970|ref|ZP_21025676.1| FIG011065: hypothetical protein [Pseudoalteromonas luteoviolacea B
           = ATCC 29581]
 gi|441746898|emb|CCQ11738.1| FIG011065: hypothetical protein [Pseudoalteromonas luteoviolacea B
           = ATCC 29581]
          Length = 257

 Score = 38.9 bits (89), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 1/89 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C   + AG +  + G GGG +  P  L  G+PP ++  T   A +F S  +   YY  K 
Sbjct: 14  CFVAMAAGFIDAIAG-GGGMLTVPALLTAGLPPHLALGTNKLAASFGSVTASFAYYRQKL 72

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLII 426
           F   +    +    I A +G  +V  L I
Sbjct: 73  FDPSFWWQSIVATAIGALLGTLLVDHLSI 101


>gi|242058939|ref|XP_002458615.1| hypothetical protein SORBIDRAFT_03g036735 [Sorghum bicolor]
 gi|241930590|gb|EES03735.1| hypothetical protein SORBIDRAFT_03g036735 [Sorghum bicolor]
          Length = 110

 Score = 38.9 bits (89), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 52/111 (46%), Gaps = 19/111 (17%)

Query: 361 PLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHV 420
           P+F  +GVPP+ +SA   F +  SSSMS  +               + +    A      
Sbjct: 16  PIFFHVGVPPRSASARTMFLILLSSSMSTAQS--------------ITLGAEGATDATSA 61

Query: 421 VRKLIILLGRASLIIFILAFMIFVSAISLGGVGISNMIGKFH-RHEYMGFE 470
           +RK     GRASLI+  +A ++ ++A  +   G+  +  ++    EYMGF+
Sbjct: 62  IRK----SGRASLIVLAVATVMALTAAVVLRSGVPRVWAQYTGGREYMGFK 108


>gi|298674473|ref|YP_003726223.1| hypothetical protein Metev_0515 [Methanohalobium evestigatum
           Z-7303]
 gi|298287461|gb|ADI73427.1| protein of unknown function DUF81 [Methanohalobium evestigatum
           Z-7303]
          Length = 252

 Score = 38.9 bits (89), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 29/45 (64%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
           G + GIV GLLGL GG  + PL + LG   + +SAT+ F + F+S
Sbjct: 141 GFMIGIVAGLLGLSGGVFIVPLLVYLGYNIKTASATSLFIIVFTS 185


>gi|418023649|ref|ZP_12662634.1| protein of unknown function DUF81 [Shewanella baltica OS625]
 gi|353537532|gb|EHC07089.1| protein of unknown function DUF81 [Shewanella baltica OS625]
          Length = 257

 Score = 38.5 bits (88), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG +  ++G GGG +  P  L LG+ P     T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAAWTYYRQHLFK- 74

Query: 401 PYALYFVAVAT-IAAFVGQHVV 421
           P   Y   +AT I A +G  +V
Sbjct: 75  PSFWYMAFIATFIGAVLGSFLV 96


>gi|126173862|ref|YP_001050011.1| hypothetical protein Sbal_1628 [Shewanella baltica OS155]
 gi|386340622|ref|YP_006036988.1| hypothetical protein [Shewanella baltica OS117]
 gi|125997067|gb|ABN61142.1| protein of unknown function DUF81 [Shewanella baltica OS155]
 gi|334863023|gb|AEH13494.1| protein of unknown function DUF81 [Shewanella baltica OS117]
          Length = 257

 Score = 38.5 bits (88), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG +  ++G GGG +  P  L LG+ P     T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAAWTYYRQHLFK- 74

Query: 401 PYALYFVAVAT-IAAFVGQHVV 421
           P   Y   +AT I A +G  +V
Sbjct: 75  PSFWYMAFIATFIGAVLGSFLV 96


>gi|221487433|gb|EEE25665.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 1659

 Score = 38.5 bits (88), Expect = 6.7,   Method: Composition-based stats.
 Identities = 38/109 (34%), Positives = 61/109 (55%), Gaps = 5/109 (4%)

Query: 341  GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
            G L G+  GL+G+GGG +  P  L +G  P  + ATA+  + F+SS + +++ L+ R P+
Sbjct: 1538 GFLTGVFAGLVGIGGGVVFSPFLLLMGNDPVSAVATASACVVFTSSSTSLQFLLIGRLPI 1597

Query: 401  PYALYFVAVATIAAF---VGQHVVRKLIILLGRASLIIFILAFMIFVSA 446
             YA  F  VA  AA     G H +R+ +   GR S+I   +A  + V++
Sbjct: 1598 LYASLFGLVAAAAAACATCGIHRLRRAV--GGRMSVIAGCVASAVTVAS 1644


>gi|237830133|ref|XP_002364364.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
 gi|211962028|gb|EEA97223.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
 gi|221507234|gb|EEE32838.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1659

 Score = 38.5 bits (88), Expect = 6.8,   Method: Composition-based stats.
 Identities = 38/109 (34%), Positives = 61/109 (55%), Gaps = 5/109 (4%)

Query: 341  GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
            G L G+  GL+G+GGG +  P  L +G  P  + ATA+  + F+SS + +++ L+ R P+
Sbjct: 1538 GFLTGVFAGLVGIGGGVVFSPFLLLMGNDPVSAVATASACVVFTSSSTSLQFLLIGRLPI 1597

Query: 401  PYALYFVAVATIAAF---VGQHVVRKLIILLGRASLIIFILAFMIFVSA 446
             YA  F  VA  AA     G H +R+ +   GR S+I   +A  + V++
Sbjct: 1598 LYASLFGLVAAAAAACATCGIHRLRRAV--GGRMSVIAGCVASAVTVAS 1644


>gi|386042949|ref|YP_005961754.1| hypothetical protein [Listeria monocytogenes 10403S]
 gi|404409854|ref|YP_006695442.1| hypothetical protein LMOSLCC5850_0615 [Listeria monocytogenes
           SLCC5850]
 gi|345536183|gb|AEO05623.1| hypothetical protein LMRG_00304 [Listeria monocytogenes 10403S]
 gi|404229680|emb|CBY51084.1| hypothetical protein LMOSLCC5850_0615 [Listeria monocytogenes
           SLCC5850]
          Length = 246

 Score = 38.5 bits (88), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 130 YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 188

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y ++        +Y +  A I A +G  + + L
Sbjct: 189 SYAIIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 222


>gi|160874768|ref|YP_001554084.1| hypothetical protein Sbal195_1651 [Shewanella baltica OS195]
 gi|378708021|ref|YP_005272915.1| hypothetical protein [Shewanella baltica OS678]
 gi|160860290|gb|ABX48824.1| protein of unknown function DUF81 [Shewanella baltica OS195]
 gi|315267010|gb|ADT93863.1| protein of unknown function DUF81 [Shewanella baltica OS678]
          Length = 257

 Score = 38.5 bits (88), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG +  ++G GGG +  P  L LG+ P     T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAAWTYYRQHLFK- 74

Query: 401 PYALYFVAVAT-IAAFVGQHVV 421
           P   Y   +AT I A +G  +V
Sbjct: 75  PSFWYMAFIATFIGAVLGSFLV 96


>gi|119385456|ref|YP_916512.1| hypothetical protein Pden_2732 [Paracoccus denitrificans PD1222]
 gi|119375223|gb|ABL70816.1| protein of unknown function DUF81 [Paracoccus denitrificans PD1222]
          Length = 307

 Score = 38.5 bits (88), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSMSVVEYYLLKR-F 398
           G   G++G L+G+GGGFIM P  +  LG+P +V   T+ F +TF S+ + + + +     
Sbjct: 182 GAAVGVLGALMGVGGGFIMVPAMIYLLGMPTKVVIGTSLFQITFLSAYTTLMHAVSSNTV 241

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKL 424
            V  A+  +      A +G H+  +L
Sbjct: 242 DVMLAVLLIVGGVTGAQIGTHLGARL 267


>gi|261313116|ref|ZP_05952313.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10]
 gi|261302142|gb|EEY05639.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10]
          Length = 306

 Score = 38.5 bits (88), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 1/98 (1%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G   G + GL G+GG F++ PL +   +PP ++ AT    +  SS    + ++  +   +
Sbjct: 21  GAAVGFLSGLFGVGGSFLITPLLIFYNIPPAIAVATGANQVIASSVSGALAHFKRRTLDI 80

Query: 401 PYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFIL 438
              L+ VA   + + +G  V   L   LG+  LI+ IL
Sbjct: 81  KLGLFLVAGGILGSLIGIFVFSWLRD-LGQLDLIVSIL 117


>gi|153000146|ref|YP_001365827.1| hypothetical protein Shew185_1617 [Shewanella baltica OS185]
 gi|373949018|ref|ZP_09608979.1| protein of unknown function DUF81 [Shewanella baltica OS183]
 gi|386325143|ref|YP_006021260.1| hypothetical protein [Shewanella baltica BA175]
 gi|151364764|gb|ABS07764.1| protein of unknown function DUF81 [Shewanella baltica OS185]
 gi|333819288|gb|AEG11954.1| protein of unknown function DUF81 [Shewanella baltica BA175]
 gi|373885618|gb|EHQ14510.1| protein of unknown function DUF81 [Shewanella baltica OS183]
          Length = 257

 Score = 38.5 bits (88), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG +  ++G GGG +  P  L LG+ P     T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAAWTYYRQHLFK- 74

Query: 401 PYALYFVAVAT-IAAFVGQHVV 421
           P   Y   +AT I A +G  +V
Sbjct: 75  PSFWYMAFIATFIGAVLGSFLV 96


>gi|217973886|ref|YP_002358637.1| hypothetical protein Sbal223_2726 [Shewanella baltica OS223]
 gi|217499021|gb|ACK47214.1| protein of unknown function DUF81 [Shewanella baltica OS223]
          Length = 257

 Score = 38.5 bits (88), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G++AG +  ++G GGG +  P  L LG+ P     T   A +F SSM+   YY    F  
Sbjct: 17  GLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAAWTYYRQHLFK- 74

Query: 401 PYALYFVAVAT-IAAFVGQHVV 421
           P   Y   +AT I A +G  +V
Sbjct: 75  PSFWYMAFIATFIGAVLGSFLV 96


>gi|385232877|ref|YP_005794219.1| permease protein [Ketogulonicigenium vulgare WSH-001]
 gi|343461788|gb|AEM40223.1| Permease protein [Ketogulonicigenium vulgare WSH-001]
          Length = 309

 Score = 38.5 bits (88), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 30/47 (63%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSS 385
             G + GI+ G+ G+GGGF++ PL +  G+P  V+ ATAT  +  SS
Sbjct: 24  GLGWIVGILSGMFGVGGGFLITPLLIFSGIPSAVAVATATNQIVASS 70


>gi|418009705|ref|ZP_12649494.1| membrane protein [Lactobacillus casei Lc-10]
 gi|410555155|gb|EKQ29116.1| membrane protein [Lactobacillus casei Lc-10]
          Length = 280

 Score = 38.5 bits (88), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 47/86 (54%), Gaps = 1/86 (1%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVE 391
           Q++     G+L GI+G +LG+GGG I+ P L + +G+P Q +   +  ++  +SS + + 
Sbjct: 3   QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62

Query: 392 YYLLKRFPVPYALYFVAVATIAAFVG 417
           Y   +   +  A++     T+ A +G
Sbjct: 63  YLKDEVLNLRVAMFLEIATTVGAVIG 88


>gi|83949994|ref|ZP_00958727.1| membrane protein [Roseovarius nubinhibens ISM]
 gi|83837893|gb|EAP77189.1| membrane protein [Roseovarius nubinhibens ISM]
          Length = 305

 Score = 38.5 bits (88), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLK 396
           G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ ++  K
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRK 76


>gi|254422238|ref|ZP_05035956.1| conserved domain protein, putative [Synechococcus sp. PCC 7335]
 gi|196189727|gb|EDX84691.1| conserved domain protein, putative [Synechococcus sp. PCC 7335]
          Length = 266

 Score = 38.5 bits (88), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 40/75 (53%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           G+ +GI+ G LG+GGG ++ P+ L+LG     ++AT++ A+  +S+    + + +     
Sbjct: 13  GLFSGILAGFLGIGGGTLLVPVLLQLGFESHAATATSSLAILVTSTTGSAQNWRMGYLDP 72

Query: 401 PYALYFVAVATIAAF 415
              L     A IA F
Sbjct: 73  KQILLLGIPAAIAGF 87


>gi|159045387|ref|YP_001534181.1| hypothetical protein Dshi_2847 [Dinoroseobacter shibae DFL 12]
 gi|157913147|gb|ABV94580.1| conserved hypothetical protein [Dinoroseobacter shibae DFL 12]
          Length = 306

 Score = 38.5 bits (88), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 31/49 (63%), Gaps = 1/49 (2%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G+ G+GGGF+M PL   +G+PP V+ AT    +  +SS S V  +L +R
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIV-ASSFSGVLAHLRRR 76


>gi|404412703|ref|YP_006698290.1| hypothetical protein LMOSLCC7179_0597 [Listeria monocytogenes
           SLCC7179]
 gi|404238402|emb|CBY59803.1| hypothetical protein LMOSLCC7179_0597 [Listeria monocytogenes
           SLCC7179]
          Length = 246

 Score = 38.5 bits (88), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 130 YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 188

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y ++        +Y +  A I A +G  + + L
Sbjct: 189 SYAIIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 222


>gi|336424945|ref|ZP_08604977.1| hypothetical protein HMPREF0994_00983 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336013410|gb|EGN43292.1| hypothetical protein HMPREF0994_00983 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 148

 Score = 38.5 bits (88), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 1/87 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C    LAGIV  + G GGG I  P +L  G+PP ++ AT   + +  +++S   Y   + 
Sbjct: 10  CPMVFLAGIVDSIAG-GGGLISLPAYLMAGLPPHMAVATNKLSSSCGTTVSAFRYLKNRC 68

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
             +P     V  A   + +G H+V  L
Sbjct: 69  VSLPLVPSTVLAALAGSAIGAHLVLAL 95


>gi|16802663|ref|NP_464148.1| hypothetical protein lmo0621 [Listeria monocytogenes EGD-e]
 gi|47096284|ref|ZP_00233881.1| membrane protein, putative [Listeria monocytogenes str. 1/2a F6854]
 gi|254828312|ref|ZP_05232999.1| conserved hypothetical protein [Listeria monocytogenes FSL N3-165]
 gi|254911300|ref|ZP_05261312.1| conserved hypothetical protein [Listeria monocytogenes J2818]
 gi|254935627|ref|ZP_05267324.1| conserved hypothetical protein [Listeria monocytogenes F6900]
 gi|255026854|ref|ZP_05298840.1| hypothetical protein LmonocytFSL_11936 [Listeria monocytogenes FSL
           J2-003]
 gi|386046279|ref|YP_005964611.1| hypothetical protein [Listeria monocytogenes J0161]
 gi|386049548|ref|YP_005967539.1| hypothetical protein [Listeria monocytogenes FSL R2-561]
 gi|386052885|ref|YP_005970443.1| hypothetical protein [Listeria monocytogenes Finland 1998]
 gi|404283060|ref|YP_006683957.1| hypothetical protein LMOSLCC2372_0630 [Listeria monocytogenes
           SLCC2372]
 gi|405757615|ref|YP_006686891.1| hypothetical protein LMOSLCC2479_0628 [Listeria monocytogenes
           SLCC2479]
 gi|16410010|emb|CAC98699.1| lmo0621 [Listeria monocytogenes EGD-e]
 gi|47015329|gb|EAL06265.1| membrane protein, putative [Listeria monocytogenes str. 1/2a F6854]
 gi|258600705|gb|EEW14030.1| conserved hypothetical protein [Listeria monocytogenes FSL N3-165]
 gi|258608208|gb|EEW20816.1| conserved hypothetical protein [Listeria monocytogenes F6900]
 gi|293589231|gb|EFF97565.1| conserved hypothetical protein [Listeria monocytogenes J2818]
 gi|345533270|gb|AEO02711.1| hypothetical protein LMOG_02131 [Listeria monocytogenes J0161]
 gi|346423394|gb|AEO24919.1| conserved hypothetical protein [Listeria monocytogenes FSL R2-561]
 gi|346645536|gb|AEO38161.1| conserved hypothetical protein [Listeria monocytogenes Finland
           1998]
 gi|404232562|emb|CBY53965.1| hypothetical protein LMOSLCC2372_0630 [Listeria monocytogenes
           SLCC2372]
 gi|404235497|emb|CBY56899.1| hypothetical protein LMOSLCC2479_0628 [Listeria monocytogenes
           SLCC2479]
          Length = 246

 Score = 38.5 bits (88), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 130 YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 188

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y ++        +Y +  A I A +G  + + L
Sbjct: 189 SYAIIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 222


>gi|389807021|ref|ZP_10203906.1| putative transmembrane protein [Rhodanobacter thiooxydans LCS2]
 gi|388444811|gb|EIM00906.1| putative transmembrane protein [Rhodanobacter thiooxydans LCS2]
          Length = 247

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 47/101 (46%), Gaps = 8/101 (7%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMS---- 388
           Q   +   G+LA +V G LG+  G +   + L LG+PP V+SAT   A  F++ +S    
Sbjct: 5   QFFVFAGVGLLAQLVDGALGMAYGLVSNSILLALGLPPAVASATVHTAEVFTTGVSGASH 64

Query: 389 ----VVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLI 425
                V + L  +  +P A+     AT  A V    +R  +
Sbjct: 65  AWFGNVRWRLFWQLAIPGAIGGFLGATFLASVPGDAIRPWV 105


>gi|126734854|ref|ZP_01750600.1| integral membrane protein, putative [Roseobacter sp. CCS2]
 gi|126715409|gb|EBA12274.1| integral membrane protein, putative [Roseobacter sp. CCS2]
          Length = 306

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ +  LKR
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 75


>gi|334341163|ref|YP_004546143.1| hypothetical protein [Desulfotomaculum ruminis DSM 2154]
 gi|334092517|gb|AEG60857.1| protein of unknown function DUF81 [Desulfotomaculum ruminis DSM
           2154]
          Length = 430

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 42/86 (48%)

Query: 339 AFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRF 398
           A GV AG++ G +G GGGF++ P  + LGV   ++  T  F +   + M  V +  L   
Sbjct: 79  AVGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMGTVIHKKLGNV 138

Query: 399 PVPYALYFVAVATIAAFVGQHVVRKL 424
            V  A+ F+  + I    G  + R L
Sbjct: 139 NVALAIAFLVGSGIGVTAGGTLNRAL 164


>gi|284800901|ref|YP_003412766.1| hypothetical protein LM5578_0649 [Listeria monocytogenes 08-5578]
 gi|284994087|ref|YP_003415855.1| hypothetical protein LM5923_0648 [Listeria monocytogenes 08-5923]
 gi|284056463|gb|ADB67404.1| hypothetical protein LM5578_0649 [Listeria monocytogenes 08-5578]
 gi|284059554|gb|ADB70493.1| hypothetical protein LM5923_0648 [Listeria monocytogenes 08-5923]
          Length = 246

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 130 YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 188

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y ++        +Y +  A I A +G  + + L
Sbjct: 189 SYAIIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 222


>gi|254440248|ref|ZP_05053742.1| conserved domain protein, putative [Octadecabacter antarcticus 307]
 gi|198255694|gb|EDY80008.1| conserved domain protein, putative [Octadecabacter antarcticus 307]
          Length = 305

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 349 GLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           G+ G+GGGF+M PL   +G+PP V+ AT    +  SS   V+ +  L+R
Sbjct: 29  GMFGVGGGFLMTPLLFFIGIPPAVAVATEAVQIVASSFSGVLAH--LRR 75


>gi|57640988|ref|YP_183466.1| hypothetical protein TK1053 [Thermococcus kodakarensis KOD1]
 gi|57159312|dbj|BAD85242.1| hypothetical membrane protein, conserved, DUF81 family
           [Thermococcus kodakarensis KOD1]
          Length = 243

 Score = 38.5 bits (88), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 63/127 (49%), Gaps = 16/127 (12%)

Query: 303 VVSLYEAISLYKGRRVIASKGD----DGK-SFRVFQLVSYCAFGVLAGIVGGLLGLGGGF 357
           V+ LY A+++ + +     KGD     GK  +R   LV     G+++G+V GLLG+ GG 
Sbjct: 104 VLLLYLAVAMLRSK-----KGDCEMEKGKIEYRNVPLV-----GLVSGLVSGLLGVSGGI 153

Query: 358 IMGPLFLEL-GVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFV 416
           +  PLF  L G+P + +  T++ A+ F++     E++ L        L+      + A +
Sbjct: 154 LNVPLFHTLVGIPIKYAVGTSSLALLFTALAGTFEHWRLGHVQPNIVLFLAPGLIMGARL 213

Query: 417 GQHVVRK 423
           G   V +
Sbjct: 214 GAKTVSR 220


>gi|308272465|emb|CBX29069.1| hypothetical protein N47_J00500 [uncultured Desulfobacterium sp.]
          Length = 320

 Score = 38.5 bits (88), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 30/46 (65%)

Query: 350 LLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLL 395
           + G+GGGF+M PL + +G+PP V++A+ +  +  +S+     +Y L
Sbjct: 41  IFGVGGGFLMTPLLIMMGIPPTVAAASDSNQIVAASASGTYAHYRL 86


>gi|126461650|ref|YP_001042764.1| hypothetical protein Rsph17029_0881 [Rhodobacter sphaeroides ATCC
           17029]
 gi|221638621|ref|YP_002524883.1| hypothetical protein RSKD131_0522 [Rhodobacter sphaeroides KD131]
 gi|332557643|ref|ZP_08411965.1| hypothetical protein RSWS8N_01290 [Rhodobacter sphaeroides WS8N]
 gi|429206978|ref|ZP_19198238.1| Sulfite exporter TauE/SafE [Rhodobacter sp. AKP1]
 gi|126103314|gb|ABN75992.1| protein of unknown function DUF81 [Rhodobacter sphaeroides ATCC
           17029]
 gi|221159402|gb|ACM00382.1| Hypothetical Protein RSKD131_0522 [Rhodobacter sphaeroides KD131]
 gi|332275355|gb|EGJ20670.1| hypothetical protein RSWS8N_01290 [Rhodobacter sphaeroides WS8N]
 gi|428189973|gb|EKX58525.1| Sulfite exporter TauE/SafE [Rhodobacter sp. AKP1]
          Length = 306

 Score = 38.5 bits (88), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPY 402
           + G + G+ G+GGGF++ PL   +G+PP V+ AT    +  SS   V+    LKR  V +
Sbjct: 23  IVGFMSGMFGVGGGFLITPLLFFIGIPPAVAVATGANQVVASSVSGVLAQ--LKRKTVDF 80

Query: 403 ALYFVAVA 410
            +  V +A
Sbjct: 81  RMGLVLLA 88


>gi|89894741|ref|YP_518228.1| hypothetical protein DSY1995 [Desulfitobacterium hafniense Y51]
 gi|89334189|dbj|BAE83784.1| hypothetical protein [Desulfitobacterium hafniense Y51]
          Length = 312

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTF 383
            G+ F    +V + A G+L  I+   LG+GGGF++ P     LG P  + + T+T ++  
Sbjct: 183 SGEEFTYNAIVVFIA-GLLVAILSASLGVGGGFLLVPFMTSILGFPMYIVAGTSTLSILV 241

Query: 384 SSSMSVVEY 392
           SSS S++ Y
Sbjct: 242 SSSTSILNY 250


>gi|423074614|ref|ZP_17063339.1| hypothetical protein HMPREF0322_02770 [Desulfitobacterium hafniense
           DP7]
 gi|361854434|gb|EHL06500.1| hypothetical protein HMPREF0322_02770 [Desulfitobacterium hafniense
           DP7]
          Length = 312

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTF 383
            G+ F    +V + A G+L  I+   LG+GGGF++ P     LG P  + + T+T ++  
Sbjct: 183 SGEEFTYNAIVVFIA-GLLVAILSASLGVGGGFLLVPFMTSILGFPMYIVAGTSTLSILV 241

Query: 384 SSSMSVVEY 392
           SSS S++ Y
Sbjct: 242 SSSTSILNY 250


>gi|217965284|ref|YP_002350962.1| hypothetical protein LMHCC_2009 [Listeria monocytogenes HCC23]
 gi|290892820|ref|ZP_06555811.1| conserved hypothetical protein [Listeria monocytogenes FSL J2-071]
 gi|386007350|ref|YP_005925628.1| hypothetical protein lmo4a_0637 [Listeria monocytogenes L99]
 gi|386025940|ref|YP_005946716.1| hypothetical protein LMM7_0652 [Listeria monocytogenes M7]
 gi|404407083|ref|YP_006689798.1| hypothetical protein LMOSLCC2376_0600 [Listeria monocytogenes
           SLCC2376]
 gi|217334554|gb|ACK40348.1| domain of unknown function, putative [Listeria monocytogenes HCC23]
 gi|290557632|gb|EFD91155.1| conserved hypothetical protein [Listeria monocytogenes FSL J2-071]
 gi|307570160|emb|CAR83339.1| conserved hypothetical protein [Listeria monocytogenes L99]
 gi|336022521|gb|AEH91658.1| hypothetical protein LMM7_0652 [Listeria monocytogenes M7]
 gi|404241232|emb|CBY62632.1| hypothetical protein LMOSLCC2376_0600 [Listeria monocytogenes
           SLCC2376]
          Length = 246

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 130 YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 188

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y ++        +Y +  A I A +G  + + L
Sbjct: 189 SYAIIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 222


>gi|46906866|ref|YP_013255.1| hypothetical protein LMOf2365_0650 [Listeria monocytogenes serotype
           4b str. F2365]
 gi|47091582|ref|ZP_00229378.1| membrane protein, putative [Listeria monocytogenes str. 4b H7858]
 gi|226223250|ref|YP_002757357.1| hypothetical protein Lm4b_00647 [Listeria monocytogenes serotype 4b
           str. CLIP 80459]
 gi|254824106|ref|ZP_05229107.1| conserved hypothetical protein [Listeria monocytogenes FSL J1-194]
 gi|254853181|ref|ZP_05242529.1| conserved hypothetical protein [Listeria monocytogenes FSL R2-503]
 gi|254932104|ref|ZP_05265463.1| conserved hypothetical protein [Listeria monocytogenes HPB2262]
 gi|255521133|ref|ZP_05388370.1| hypothetical protein LmonocFSL_07891 [Listeria monocytogenes FSL
           J1-175]
 gi|300764423|ref|ZP_07074416.1| hypothetical protein LMHG_10294 [Listeria monocytogenes FSL N1-017]
 gi|386731388|ref|YP_006204884.1| hypothetical protein MUO_03355 [Listeria monocytogenes 07PF0776]
 gi|404280179|ref|YP_006681077.1| hypothetical protein LMOSLCC2755_0627 [Listeria monocytogenes
           SLCC2755]
 gi|404285990|ref|YP_006692576.1| hypothetical protein LMOSLCC2482_0623 [Listeria monocytogenes
           serotype 7 str. SLCC2482]
 gi|405748986|ref|YP_006672452.1| hypothetical protein LMOATCC19117_0651 [Listeria monocytogenes ATCC
           19117]
 gi|405751849|ref|YP_006675314.1| hypothetical protein LMOSLCC2378_0645 [Listeria monocytogenes
           SLCC2378]
 gi|405754705|ref|YP_006678169.1| hypothetical protein LMOSLCC2540_0626 [Listeria monocytogenes
           SLCC2540]
 gi|406703404|ref|YP_006753758.1| hypothetical protein LMOL312_0629 [Listeria monocytogenes L312]
 gi|417314705|ref|ZP_12101399.1| hypothetical protein LM1816_10167 [Listeria monocytogenes J1816]
 gi|424713508|ref|YP_007014223.1| Putative membrane protein [Listeria monocytogenes serotype 4b str.
           LL195]
 gi|424822361|ref|ZP_18247374.1| Membrane protein [Listeria monocytogenes str. Scott A]
 gi|46880132|gb|AAT03432.1| putative membrane protein [Listeria monocytogenes serotype 4b str.
           F2365]
 gi|47019901|gb|EAL10638.1| membrane protein, putative [Listeria monocytogenes str. 4b H7858]
 gi|225875712|emb|CAS04415.1| unnamed protein product [Listeria monocytogenes serotype 4b str.
           CLIP 80459]
 gi|258606534|gb|EEW19142.1| conserved hypothetical protein [Listeria monocytogenes FSL R2-503]
 gi|293583660|gb|EFF95692.1| conserved hypothetical protein [Listeria monocytogenes HPB2262]
 gi|293593338|gb|EFG01099.1| conserved hypothetical protein [Listeria monocytogenes FSL J1-194]
 gi|300514777|gb|EFK41831.1| hypothetical protein LMHG_10294 [Listeria monocytogenes FSL N1-017]
 gi|328467449|gb|EGF38525.1| hypothetical protein LM1816_10167 [Listeria monocytogenes J1816]
 gi|332311041|gb|EGJ24136.1| Membrane protein [Listeria monocytogenes str. Scott A]
 gi|384390146|gb|AFH79216.1| hypothetical protein MUO_03355 [Listeria monocytogenes 07PF0776]
 gi|404218186|emb|CBY69550.1| hypothetical protein LMOATCC19117_0651 [Listeria monocytogenes ATCC
           19117]
 gi|404221049|emb|CBY72412.1| hypothetical protein LMOSLCC2378_0645 [Listeria monocytogenes
           SLCC2378]
 gi|404223905|emb|CBY75267.1| hypothetical protein LMOSLCC2540_0626 [Listeria monocytogenes
           SLCC2540]
 gi|404226814|emb|CBY48219.1| hypothetical protein LMOSLCC2755_0627 [Listeria monocytogenes
           SLCC2755]
 gi|404244919|emb|CBY03144.1| hypothetical protein LMOSLCC2482_0623 [Listeria monocytogenes
           serotype 7 str. SLCC2482]
 gi|406360434|emb|CBY66707.1| hypothetical protein LMOL312_0629 [Listeria monocytogenes L312]
 gi|424012692|emb|CCO63232.1| Putative membrane protein [Listeria monocytogenes serotype 4b str.
           LL195]
          Length = 246

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 130 YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 188

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y ++        +Y +  A I A +G  + + L
Sbjct: 189 SYAIIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 222


>gi|392955880|ref|ZP_10321410.1| hypothetical protein A374_04039 [Bacillus macauensis ZFHKF-1]
 gi|391878122|gb|EIT86712.1| hypothetical protein A374_04039 [Bacillus macauensis ZFHKF-1]
          Length = 254

 Score = 38.5 bits (88), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 63/112 (56%), Gaps = 8/112 (7%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTFSSSM-SVVEYYLLKR 397
           F  + G+V G+LG GG FI+ P+ L  L +P + + AT + A+TF SS+ S +   +  +
Sbjct: 144 FSFVIGLVAGVLGAGGAFILVPVMLAVLKIPTKTTIAT-SLAITFLSSIGSTIGKLVTHQ 202

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
            P   A+  + V+ IA+ +G  V ++L     +A  +  +LA  I ++AI +
Sbjct: 203 VPYIPAIVLMIVSLIASPIGAKVGQRL-----QAKWLQRMLATFIIITAIKI 249


>gi|111223643|ref|YP_714437.1| hypothetical protein FRAAL4244 [Frankia alni ACN14a]
 gi|111151175|emb|CAJ62886.1| hypothetical integral membrane protein [Frankia alni ACN14a]
          Length = 301

 Score = 38.5 bits (88), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 63/122 (51%), Gaps = 5/122 (4%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTF 383
           D + +RV +  +       AG V GLLG+G G +  P + L + +P +VS+AT+   +  
Sbjct: 175 DVRPYRVTRTATGLTMMTAAGAVSGLLGIGSGAMKVPAMDLAMRLPLKVSTATSNLMIGV 234

Query: 384 SSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIF 443
           + + S V  YL +   VP     VA+ T+   VG H+  +L+ ++   +L +  L  ++ 
Sbjct: 235 TGAASAV-VYLQRGDVVPALAGAVALGTV---VGAHLGSRLLAVVPVGALRVIFLVVLLV 290

Query: 444 VS 445
           +S
Sbjct: 291 IS 292


>gi|77462758|ref|YP_352262.1| hypothetical protein RSP_2206 [Rhodobacter sphaeroides 2.4.1]
 gi|77387176|gb|ABA78361.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1]
          Length = 306

 Score = 38.5 bits (88), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 343 LAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPY 402
           + G + G+ G+GGGF++ PL   +G+PP V+ AT    +  SS   V+    LKR  V +
Sbjct: 23  IVGFMSGMFGVGGGFLITPLLFFIGIPPAVAVATGANQVVASSVSGVLAQ--LKRKTVDF 80

Query: 403 ALYFVAVA 410
            +  V +A
Sbjct: 81  RMGLVLLA 88


>gi|414161061|ref|ZP_11417324.1| hypothetical protein HMPREF9310_01698 [Staphylococcus simulans
           ACS-120-V-Sch1]
 gi|410876740|gb|EKS24638.1| hypothetical protein HMPREF9310_01698 [Staphylococcus simulans
           ACS-120-V-Sch1]
          Length = 257

 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 51/98 (52%), Gaps = 11/98 (11%)

Query: 340 FGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY------- 392
           FG LA  +  ++G GGG I  P  L +G+PP  +  T   A +F S  S +++       
Sbjct: 14  FGFLAAFIDAVVG-GGGLISTPALLAIGLPPATALGTNKLASSFGSLTSAIKFIRSGKVD 72

Query: 393 --YLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILL 428
              + K FP  + L  V  A++A+F+   V++ L+I++
Sbjct: 73  LKIVGKLFPFVF-LAAVGGASLASFLPASVLKPLVIII 109


>gi|254796626|ref|YP_003081462.1| hypothetical protein NRI_0233 [Neorickettsia risticii str.
           Illinois]
 gi|254589863|gb|ACT69225.1| domain of unknown function, putative [Neorickettsia risticii str.
           Illinois]
          Length = 311

 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 65/143 (45%), Gaps = 12/143 (8%)

Query: 45  ERQDSNYLLKAVNFLWQPEGSGYHHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGI 104
           E+   N  +K VN L  P    +     ++      + GT++G       S+ GVGGG I
Sbjct: 153 EKPYKNKFIKLVNGL--PLRYYFRSAKGDISLVALFLFGTVVG----VILSISGVGGGFI 206

Query: 105 FVPMLSLIIGFDPKSATAISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPML 164
            VP+L  +     + A   S    +  +V+TV+++      T+ +  +D  L  L+    
Sbjct: 207 LVPVLMYVFNLPVRIAIGTSVAQSVLVSVATVFFH------TITLGTVDMLLGFLLSIGA 260

Query: 165 MLGISIGVAFNVIFADWMVTVLL 187
           + G++ G   N+IF   ++ +LL
Sbjct: 261 ICGVTFGAKLNLIFHPVVIRLLL 283


>gi|138895146|ref|YP_001125599.1| hypothetical protein GTNG_1484 [Geobacillus thermodenitrificans
           NG80-2]
 gi|134266659|gb|ABO66854.1| Conserved hypothetical protein [Geobacillus thermodenitrificans
           NG80-2]
          Length = 305

 Score = 38.5 bits (88), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 27/115 (23%), Positives = 54/115 (46%)

Query: 330 RVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSV 389
           R+ +L+ +   G +A ++ G LG+  G     L L  G+ P V+SA+   A   +++ S 
Sbjct: 10  RMKKLIVFVFVGFIAQLIDGSLGMAYGVTSTTLLLTFGIAPAVASASVHLAEVVTTAASG 69

Query: 390 VEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFV 444
           V ++          +  +   ++ AFVG   +  L   L +  + +F+LA   ++
Sbjct: 70  VSHWKFGNIDRDMVVKLIIPGSVGAFVGACFLSNLPGDLIKPCISLFLLALGFYI 124


>gi|398383529|ref|ZP_10541597.1| putative permease [Sphingobium sp. AP49]
 gi|397724545|gb|EJK85010.1| putative permease [Sphingobium sp. AP49]
          Length = 257

 Score = 38.5 bits (88), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 36/65 (55%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           +++ +   G  A I+ G LG+  G I   L L +GVPP  +SA+   A TF++ +S + +
Sbjct: 11  EILPFILIGFGAQIIDGALGMAYGVISSTLLLAMGVPPSRASASVHAAETFTTGVSAISH 70

Query: 393 YLLKR 397
            L + 
Sbjct: 71  ILHRN 75


>gi|289577560|ref|YP_003476187.1| hypothetical protein Thit_0308 [Thermoanaerobacter italicus Ab9]
 gi|289527273|gb|ADD01625.1| protein of unknown function DUF81 [Thermoanaerobacter italicus Ab9]
          Length = 253

 Score = 38.5 bits (88), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 11/119 (9%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFA---MTFSSSMSVV 390
           +V+ C  G  A  +  + G GGG I  P  L LGVPP  +  T  FA    +F+SS++ +
Sbjct: 6   IVALCGIGFAAAFIDSIAG-GGGIISLPGLLVLGVPPAFALGTNKFAASCASFTSSLTFI 64

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISL 449
           +Y +     + Y ++          +G  +  K ++LL  + L I I+  MIFV+  +L
Sbjct: 65  KYRVYDGDLLKYLVF-------GTLLGAILGVKTVLLLDSSQLRIIIIILMIFVAIYTL 116


>gi|352518651|ref|YP_004887968.1| hypothetical protein TEH_24770 [Tetragenococcus halophilus NBRC
           12172]
 gi|348602758|dbj|BAK95804.1| hypothetical protein TEH_24770 [Tetragenococcus halophilus NBRC
           12172]
          Length = 260

 Score = 38.5 bits (88), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 17/122 (13%)

Query: 78  WEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSTVY 137
           W ++ G I+GF      S+ G+GGG I V +L L+ G   K AT  S C I  + +S   
Sbjct: 139 WYLICGLILGFLA----SLLGIGGGPINVALLILMFGMTIKEATVYSICTIFFSQLS--- 191

Query: 138 YNLKLRHPTLDMPIIDYDLALL--IQPMLMLGISIGVAFNVIFADWMVT-----VLLIVL 190
              KL    L      YDL++L  + P  ++G  +G   + + +   V+     V+LIVL
Sbjct: 192 ---KLGTIALTTGFALYDLSILFYVIPAAIIGGLLGAKVSRLISAKQVSRVFEAVILIVL 248

Query: 191 FI 192
           FI
Sbjct: 249 FI 250


>gi|219669180|ref|YP_002459615.1| hypothetical protein Dhaf_3156 [Desulfitobacterium hafniense DCB-2]
 gi|219539440|gb|ACL21179.1| protein of unknown function DUF81 [Desulfitobacterium hafniense
           DCB-2]
          Length = 312

 Score = 38.5 bits (88), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 325 DGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLE-LGVPPQVSSATATFAMTF 383
            G+ F    +V + A G+L  I+   LG+GGGF++ P     LG P  + + T+T ++  
Sbjct: 183 SGEEFTYNAIVVFIA-GLLVAILSASLGVGGGFLLVPFMTSILGFPMYIVAGTSTLSILV 241

Query: 384 SSSMSVVEY 392
           SSS S++ Y
Sbjct: 242 SSSTSILNY 250


>gi|335423623|ref|ZP_08552644.1| hypothetical protein SSPSH_13047 [Salinisphaera shabanensis E1L3A]
 gi|334891448|gb|EGM29696.1| hypothetical protein SSPSH_13047 [Salinisphaera shabanensis E1L3A]
          Length = 287

 Score = 38.5 bits (88), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 8/72 (11%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY------- 393
           G +A +V G LG+G G     + + LG+ P V+SA+  FA   +++ S   +Y       
Sbjct: 6   GFVAQLVDGSLGMGYGLTSSTMLIALGISPAVASASIHFAEIATTAASGTAHYRFGNVDV 65

Query: 394 -LLKRFPVPYAL 404
            LLKR  VP AL
Sbjct: 66  ALLKRMVVPGAL 77


>gi|242242213|ref|ZP_04796658.1| hypothetical membrane protein [Staphylococcus epidermidis W23144]
 gi|416123952|ref|ZP_11595138.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
 gi|418327602|ref|ZP_12938754.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
 gi|418613538|ref|ZP_13176544.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
 gi|418632622|ref|ZP_13195052.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
 gi|418633694|ref|ZP_13196100.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
 gi|420175320|ref|ZP_14681760.1| hypothetical protein HMPREF9990_06284 [Staphylococcus epidermidis
           NIHLM061]
 gi|420189924|ref|ZP_14695872.1| hypothetical protein HMPREF9984_04365 [Staphylococcus epidermidis
           NIHLM037]
 gi|420193318|ref|ZP_14699172.1| hypothetical protein HMPREF9983_09537 [Staphylococcus epidermidis
           NIHLM023]
 gi|420203830|ref|ZP_14709391.1| hypothetical protein HMPREF9978_01579 [Staphylococcus epidermidis
           NIHLM015]
 gi|242234332|gb|EES36644.1| hypothetical membrane protein [Staphylococcus epidermidis W23144]
 gi|319401800|gb|EFV90008.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
 gi|365232855|gb|EHM73831.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
 gi|374823394|gb|EHR87391.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
 gi|374832192|gb|EHR95912.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
 gi|374838641|gb|EHS02179.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
 gi|394243782|gb|EJD89143.1| hypothetical protein HMPREF9990_06284 [Staphylococcus epidermidis
           NIHLM061]
 gi|394259217|gb|EJE04075.1| hypothetical protein HMPREF9984_04365 [Staphylococcus epidermidis
           NIHLM037]
 gi|394260170|gb|EJE04990.1| hypothetical protein HMPREF9983_09537 [Staphylococcus epidermidis
           NIHLM023]
 gi|394274412|gb|EJE18833.1| hypothetical protein HMPREF9978_01579 [Staphylococcus epidermidis
           NIHLM015]
          Length = 275

 Score = 38.5 bits (88), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 61/118 (51%), Gaps = 8/118 (6%)

Query: 325 DGKSFRVFQLVSYCAF--GVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAM 381
           +GK++R + +    AF   +  GI+ GL G+GGG +M PL L     PP V+  T+   +
Sbjct: 148 EGKTYR-YSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL----IILLGRASLII 435
            FSS MS + + L       Y++  +  + I A +G  V R +    +++L R  ++I
Sbjct: 207 FFSSVMSSIGHILQGHVAWGYSIILIISSVIGAQIGVRVNRSMKSDTVVMLLRTVMLI 264


>gi|71279539|ref|YP_269177.1| hypothetical protein CPS_2461 [Colwellia psychrerythraea 34H]
 gi|71145279|gb|AAZ25752.1| putative membrane protein [Colwellia psychrerythraea 34H]
          Length = 261

 Score = 38.5 bits (88), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 1/50 (2%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFA 380
           +F+LV     G+L G++  + G GG FI  P  + +G+PP ++SAT TFA
Sbjct: 6   LFELVFLFFAGILGGVINSIAG-GGSFITFPALMFVGIPPILASATNTFA 54


>gi|420199664|ref|ZP_14705335.1| hypothetical protein HMPREF9980_05334 [Staphylococcus epidermidis
           NIHLM031]
 gi|394271414|gb|EJE15907.1| hypothetical protein HMPREF9980_05334 [Staphylococcus epidermidis
           NIHLM031]
          Length = 275

 Score = 38.5 bits (88), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 61/118 (51%), Gaps = 8/118 (6%)

Query: 325 DGKSFRVFQLVSYCAF--GVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAM 381
           +GK++R + +    AF   +  GI+ GL G+GGG +M PL L     PP V+  T+   +
Sbjct: 148 EGKTYR-YSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL----IILLGRASLII 435
            FSS MS + + L       Y++  +  + I A +G  V R +    +++L R  ++I
Sbjct: 207 FFSSVMSSIGHILQGHVAWGYSIILIISSVIGAQIGVRVNRSMKSDSVVMLLRTVMLI 264


>gi|420178833|ref|ZP_14685159.1| hypothetical protein HMPREF9989_11288 [Staphylococcus epidermidis
           NIHLM057]
 gi|420181508|ref|ZP_14687706.1| hypothetical protein HMPREF9988_11921 [Staphylococcus epidermidis
           NIHLM053]
 gi|394245744|gb|EJD91018.1| hypothetical protein HMPREF9989_11288 [Staphylococcus epidermidis
           NIHLM057]
 gi|394245936|gb|EJD91205.1| hypothetical protein HMPREF9988_11921 [Staphylococcus epidermidis
           NIHLM053]
          Length = 275

 Score = 38.5 bits (88), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 62/118 (52%), Gaps = 8/118 (6%)

Query: 325 DGKSFRVFQLVSYCAF--GVLAGIVGGLLGLGGGFIMGPLFL-ELGVPPQVSSATATFAM 381
           +GK++R + +    AF   +  GI+ GL G+GGG +M PL L     PP V+  T+   +
Sbjct: 148 EGKTYR-YSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206

Query: 382 TFSSSMSVVEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL----IILLGRASLII 435
            FSS MS + + L       Y++ F+  + I A +G  V R +    +++L R  ++I
Sbjct: 207 FFSSVMSSIGHILQGHVAWGYSIIFIISSVIGAQIGVRVNRSMKSDSVVMLLRTVMLI 264


>gi|332186970|ref|ZP_08388711.1| hypothetical protein SUS17_2059 [Sphingomonas sp. S17]
 gi|332012980|gb|EGI55044.1| hypothetical protein SUS17_2059 [Sphingomonas sp. S17]
          Length = 304

 Score = 38.1 bits (87), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 45/87 (51%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A G   G++ G+ G+GGGF+  PL +  G+PP V++A+A   +T +S   V  ++    
Sbjct: 18  IALGAGVGLLSGMFGVGGGFLTTPLLIVYGIPPTVAAASAASQVTGASVSGVFAHWQRGG 77

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKL 424
             V      VA   + +F+G  + R L
Sbjct: 78  VDVKMGGVLVAGGIVGSFIGAWIFRLL 104


>gi|14521729|ref|NP_127205.1| hypothetical protein PAB1010 [Pyrococcus abyssi GE5]
 gi|5458948|emb|CAB50435.1| Predicted permease [Pyrococcus abyssi GE5]
 gi|380742350|tpe|CCE70984.1| TPA: hypothetical protein PAB1010 [Pyrococcus abyssi GE5]
          Length = 253

 Score = 38.1 bits (87), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 41/77 (53%)

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPV 400
           GV+ G +  + GLGGGF++ P+   +GV    +  T++ ++ F++  S   Y+  KR   
Sbjct: 14  GVVTGTLAAMFGLGGGFLLVPILNLMGVEIHHAVGTSSASIIFTALSSSYAYHRQKRILY 73

Query: 401 PYALYFVAVATIAAFVG 417
              +   + A + A++G
Sbjct: 74  RVGILLASTAVVGAYLG 90


>gi|422408731|ref|ZP_16485692.1| integral membrane protein, putative, partial [Listeria
           monocytogenes FSL F2-208]
 gi|313610283|gb|EFR85539.1| integral membrane protein, putative [Listeria monocytogenes FSL
           F2-208]
          Length = 244

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 332 FQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQ-VSSATATFAMTFSSSMSVV 390
           ++++ Y  FGV+ G + GL G+GGG I+ P+ L + +  Q  +SAT+++    +S  S+ 
Sbjct: 130 YRIIPYV-FGVIIGFLAGLFGIGGGPIVIPILLLIFMLNQKTASATSSYVTLLTSLASIG 188

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
            Y ++        +Y +  A I A +G  + + L
Sbjct: 189 SYAIIGGSDFSIGIYMIPGAIIGALIGTRLNKLL 222


>gi|205375197|ref|ZP_03227988.1| hypothetical protein Bcoam_19517 [Bacillus coahuilensis m4-4]
          Length = 273

 Score = 38.1 bits (87), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 61/130 (46%), Gaps = 17/130 (13%)

Query: 63  EGSGY-HHVWPELKFGWEIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSAT 121
           +G+ Y +H+ P +      ++G   G FG        +GGG + VP + L+  F P  A 
Sbjct: 145 DGNTYEYHIQPVIAIASAFIVGLTSGLFG--------IGGGSLMVPAMILLFLFPPHVAV 196

Query: 122 AISKCMIMGAAVSTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNV-IFAD 180
           A S  MI  +A+ +   ++   +       +D+  AL + P   LG  +G   N  + +D
Sbjct: 197 ATSMFMIFLSAIVSSVTHIAFGN-------VDWLYALALIPGAWLGAKLGAYLNSRLQSD 249

Query: 181 WMVTVLLIVL 190
            +V VL I+L
Sbjct: 250 SLVMVLRIIL 259


>gi|381202936|ref|ZP_09910045.1| putative permease [Sphingobium yanoikuyae XLDN2-5]
 gi|427411758|ref|ZP_18901960.1| hypothetical protein HMPREF9718_04434 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710048|gb|EKU73071.1| hypothetical protein HMPREF9718_04434 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 257

 Score = 38.1 bits (87), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 36/65 (55%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           +++ +   G  A I+ G LG+  G I   L L +GVPP  +SA+   A TF++ +S + +
Sbjct: 11  EILPFILIGFGAQIIDGALGMAYGVISSTLLLAMGVPPSRASASVHAAETFTTGVSAISH 70

Query: 393 YLLKR 397
            L + 
Sbjct: 71  ILHRN 75


>gi|283770792|ref|ZP_06343684.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus
           H19]
 gi|283460939|gb|EFC08029.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus
           H19]
          Length = 256

 Score = 38.1 bits (87), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 58/117 (49%), Gaps = 16/117 (13%)

Query: 333 QLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
            +++   FG +A  +  ++G GGG I  P  L +G+PP V+  T   A +F S  S +++
Sbjct: 7   MIITIIVFGFIAAFIDAVVG-GGGLISTPALLAIGLPPSVALGTNKLASSFGSLTSTIKF 65

Query: 393 --------YLLKRFPVPYALYFVAVAT---IAAFVGQHVVRKLIILLGRASLIIFIL 438
                   Y++ +    +   F+A A    IA  V   +++ LII +  +S+ IF L
Sbjct: 66  IRSGKVDLYVVAKL---FGFVFLASACGAYIATMVPSQILKPLII-IALSSVFIFTL 118


>gi|120402827|ref|YP_952656.1| hypothetical protein Mvan_1828 [Mycobacterium vanbaalenii PYR-1]
 gi|119955645|gb|ABM12650.1| protein of unknown function DUF81 [Mycobacterium vanbaalenii PYR-1]
          Length = 291

 Score = 38.1 bits (87), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 51/91 (56%), Gaps = 5/91 (5%)

Query: 303 VVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP- 361
           V+ +  AI++ +GR+ I + G+ G    V ++V   A G++ G+V GL+G GGGF++ P 
Sbjct: 104 VMMIATAIAMLRGRKTIET-GETGHRLPVPKIV---AEGLIVGLVTGLVGAGGGFLVVPA 159

Query: 362 LFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           L L  G+P  ++  T+   +   S   +  Y
Sbjct: 160 LALLGGLPMPIAVGTSLIVIAMKSFAGLAGY 190


>gi|334882870|emb|CCB83957.1| permease [Lactobacillus pentosus MP-10]
 gi|339637128|emb|CCC16007.1| permease [Lactobacillus pentosus IG1]
          Length = 286

 Score = 38.1 bits (87), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 1/91 (1%)

Query: 328 SFRVFQLVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSS 386
           S  ++ L      GV+AGI G +LG+GGG I+ P L L LG+  + +   +  A+  +SS
Sbjct: 4   SRMIYSLALMLGVGVIAGIFGAVLGIGGGMIVTPILTLGLGLDIKYAIGASIIAVIATSS 63

Query: 387 MSVVEYYLLKRFPVPYALYFVAVATIAAFVG 417
            S + Y   +   +  A++     T+ A +G
Sbjct: 64  GSTIAYLRDEMLNLRVAMFLEIATTVGAVLG 94


>gi|163868912|ref|YP_001610139.1| hypothetical protein Btr_1874 [Bartonella tribocorum CIP 105476]
 gi|161018586|emb|CAK02144.1| putative membrane protein [Bartonella tribocorum CIP 105476]
          Length = 308

 Score = 38.1 bits (87), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 1/58 (1%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVV 390
           +V     G++ G++  ++G+GGGF M P L   L VP  V   T+ F +TF SS + V
Sbjct: 177 IVPVLGIGLIVGLLSSIMGIGGGFFMIPALIYLLRVPTNVVIGTSLFQITFVSSFTTV 234


>gi|347527269|ref|YP_004834016.1| hypothetical protein SLG_08840 [Sphingobium sp. SYK-6]
 gi|345135950|dbj|BAK65559.1| conserved hypothetical membrane protein [Sphingobium sp. SYK-6]
          Length = 260

 Score = 38.1 bits (87), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 8/101 (7%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSV---- 389
           L+ +   G  A I+ G LG+  G I   L L LGVPP+ +SA+   A T ++++S     
Sbjct: 14  LLPFILVGFGAQIIDGALGMAYGVISSSLLLALGVPPRAASASVHAAETCTTAVSAISHV 73

Query: 390 ----VEYYLLKRFPVPYALYFVAVATIAAFVGQHVVRKLII 426
               V++ L  +  VP  L  VA A+  + +   ++R  ++
Sbjct: 74  MHRNVDWKLFWKLLVPGVLGGVAGASFLSNIDGELIRPYVL 114


>gi|187933776|ref|YP_001887651.1| transporter [Clostridium botulinum B str. Eklund 17B]
 gi|187721929|gb|ACD23150.1| transporter [Clostridium botulinum B str. Eklund 17B]
          Length = 249

 Score = 38.1 bits (87), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 1/84 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C    LAGI+  + G GGG I  P ++  G+P  ++  T  FA    +S+S ++++    
Sbjct: 8   CPLVFLAGIIDAVAG-GGGIISLPAYIFAGIPIHIAYGTNKFASCIGTSISSIKFFRSGN 66

Query: 398 FPVPYALYFVAVATIAAFVGQHVV 421
             +  AL   A A I ++ G  +V
Sbjct: 67  IKIKSALLSAAGALIGSWFGAQIV 90


>gi|399887998|ref|ZP_10773875.1| hypothetical protein CarbS_05635 [Clostridium arbusti SL206]
          Length = 274

 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 1/91 (1%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           L+  C FG +A  V  + G GGG I  P FL  G+P +++  T  F  T  + MS  EYY
Sbjct: 28  LLLLCIFGFIAAAVDSIAG-GGGLISLPAFLLAGLPARLALGTNKFCSTSGTLMSSFEYY 86

Query: 394 LLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
              +       Y      I A +G   V  +
Sbjct: 87  KNGKMNFKLLKYLFPFTLIGAVIGVFTVLNI 117


>gi|395646309|ref|ZP_10434169.1| protein of unknown function DUF81 [Methanofollis liminatans DSM
           4140]
 gi|395443049|gb|EJG07806.1| protein of unknown function DUF81 [Methanofollis liminatans DSM
           4140]
          Length = 275

 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 49/100 (49%), Gaps = 5/100 (5%)

Query: 298 IPVSLVVSLYEAISLYKGRRVIASKGDDGKSFRVFQLVSYCAFGVLAGIVGGLLGLGGGF 357
           +P  +V   +  + L  G RV+  + D G+         Y  +G+  G+V G+ G+GGG 
Sbjct: 105 LPAEMVRIFFGLVVLAAGLRVLLPQTDAGEDVPEIDGSRYLLWGLPVGVVSGMAGIGGGV 164

Query: 358 IMGPLF---LELGVPPQVSSATATFAMTFSSSMSVVEYYL 394
           I+ P+    L  G+   V  AT+T  M F+++   + Y +
Sbjct: 165 ILVPILTAVLRFGMLRAV--ATSTVVMIFAATAGTLSYMI 202


>gi|188589382|ref|YP_001922636.1| transporter [Clostridium botulinum E3 str. Alaska E43]
 gi|188499663|gb|ACD52799.1| transporter [Clostridium botulinum E3 str. Alaska E43]
          Length = 249

 Score = 38.1 bits (87), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 1/84 (1%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
           C    LAGI+  + G GGG I  P ++  G+P  ++  T  FA    +S+S ++++    
Sbjct: 8   CPLVFLAGIIDAVAG-GGGIISLPAYIFAGIPIHIAYGTNKFASCIGTSISSIKFFRSGN 66

Query: 398 FPVPYALYFVAVATIAAFVGQHVV 421
             +  AL   A A I ++ G  +V
Sbjct: 67  IKIKSALLSAAGALIGSWFGAQIV 90


>gi|118587476|ref|ZP_01544901.1| permease [Oenococcus oeni ATCC BAA-1163]
 gi|421186553|ref|ZP_15643944.1| permease [Oenococcus oeni AWRIB418]
 gi|118432126|gb|EAV38867.1| permease [Oenococcus oeni ATCC BAA-1163]
 gi|399966895|gb|EJO01396.1| permease [Oenococcus oeni AWRIB418]
          Length = 283

 Score = 38.1 bits (87), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           L+ Y + G+LAG+ G +LGLGGG I+ P L L   +P   + A +  A+  +SS + V Y
Sbjct: 6   LIVYISIGILAGVFGAVLGLGGGMIVTPILVLGFNLPIHYAIAASIIAVIGTSSGASVAY 65

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVV 421
                  +  A++     TI A +G  +V
Sbjct: 66  LKDDLLNIRVAMFLEIFTTIGALIGAVLV 94


>gi|333373756|ref|ZP_08465660.1| protein of hypothetical function DUF81 [Desmospora sp. 8437]
 gi|332969423|gb|EGK08447.1| protein of hypothetical function DUF81 [Desmospora sp. 8437]
          Length = 273

 Score = 38.1 bits (87), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 57/106 (53%), Gaps = 10/106 (9%)

Query: 287 SIVYWVLDLLQIPVSLVVSLYEAISLYKGRRVIAS-----KGDDGKSFRV-FQLVSYCAF 340
           S+ + V  +L I ++L+++L + +   K RRV  S     K  +G  +R  +        
Sbjct: 106 SLFFIVFGVLMIGIALLLTLGDRL---KVRRVKWSVQREFKDAEGNIYRYGYHRTIALTC 162

Query: 341 GVLAGIVGGLLGLGGGFIMGPLFLEL-GVPPQVSSATATFAMTFSS 385
           G + G++ GL G+GGG ++ PL + L   PP V++AT+ F +  SS
Sbjct: 163 GFVVGLLSGLFGIGGGALLVPLMVLLFRFPPHVATATSMFVIFLSS 208


>gi|255065394|ref|ZP_05317249.1| putative membrane protein [Neisseria sicca ATCC 29256]
 gi|340363398|ref|ZP_08685734.1| membrane protein [Neisseria macacae ATCC 33926]
 gi|419797635|ref|ZP_14323104.1| sulfite exporter TauE/SafE [Neisseria sicca VK64]
 gi|255050219|gb|EET45683.1| putative membrane protein [Neisseria sicca ATCC 29256]
 gi|339885989|gb|EGQ75674.1| membrane protein [Neisseria macacae ATCC 33926]
 gi|385697717|gb|EIG28128.1| sulfite exporter TauE/SafE [Neisseria sicca VK64]
          Length = 266

 Score = 38.1 bits (87), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 5/104 (4%)

Query: 338 CAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKR 397
            A  ++AG V  + G G G I+ P FL +G+PPQV+ A      T  +  ++  +  +K 
Sbjct: 23  VAASLIAGYVDAIAG-GAGLILIPAFLMVGLPPQVALAQEKLVSTIGTVAAIKNF--MKS 79

Query: 398 FPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFM 441
             + + +  V V  +AA +G  V  K+I++L   ++   ILAF+
Sbjct: 80  SSIVWRI--VPVGIVAALIGAFVGAKVILILPVETINYIILAFL 121


>gi|116491798|ref|YP_811342.1| permease [Oenococcus oeni PSU-1]
 gi|290891448|ref|ZP_06554507.1| hypothetical protein AWRIB429_1897 [Oenococcus oeni AWRIB429]
 gi|419759133|ref|ZP_14285439.1| permease [Oenococcus oeni AWRIB304]
 gi|419857308|ref|ZP_14380018.1| permease [Oenococcus oeni AWRIB202]
 gi|419858814|ref|ZP_14381473.1| permease [Oenococcus oeni DSM 20252 = AWRIB129]
 gi|421183971|ref|ZP_15641400.1| permease [Oenococcus oeni AWRIB318]
 gi|421188559|ref|ZP_15645896.1| permease [Oenococcus oeni AWRIB419]
 gi|421190397|ref|ZP_15647699.1| permease [Oenococcus oeni AWRIB422]
 gi|421191271|ref|ZP_15648549.1| permease [Oenococcus oeni AWRIB548]
 gi|421192953|ref|ZP_15650205.1| permease [Oenococcus oeni AWRIB553]
 gi|421195754|ref|ZP_15652957.1| permease [Oenococcus oeni AWRIB568]
 gi|421196231|ref|ZP_15653420.1| permease [Oenococcus oeni AWRIB576]
 gi|116092523|gb|ABJ57677.1| Predicted permease [Oenococcus oeni PSU-1]
 gi|290478890|gb|EFD87555.1| hypothetical protein AWRIB429_1897 [Oenococcus oeni AWRIB429]
 gi|399904055|gb|EJN91517.1| permease [Oenococcus oeni AWRIB304]
 gi|399965039|gb|EJN99666.1| permease [Oenococcus oeni AWRIB419]
 gi|399968608|gb|EJO03041.1| permease [Oenococcus oeni AWRIB318]
 gi|399970300|gb|EJO04605.1| permease [Oenococcus oeni AWRIB422]
 gi|399972398|gb|EJO06599.1| permease [Oenococcus oeni AWRIB548]
 gi|399973363|gb|EJO07538.1| permease [Oenococcus oeni AWRIB553]
 gi|399974880|gb|EJO08949.1| permease [Oenococcus oeni AWRIB568]
 gi|399977431|gb|EJO11411.1| permease [Oenococcus oeni AWRIB576]
 gi|410497692|gb|EKP89162.1| permease [Oenococcus oeni DSM 20252 = AWRIB129]
 gi|410498373|gb|EKP89829.1| permease [Oenococcus oeni AWRIB202]
          Length = 283

 Score = 38.1 bits (87), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGP-LFLELGVPPQVSSATATFAMTFSSSMSVVEY 392
           L+ Y + G+LAG+ G +LGLGGG I+ P L L   +P   + A +  A+  +SS + V Y
Sbjct: 6   LIVYISIGILAGVFGAVLGLGGGMIVTPILVLGFNLPIHYAIAASIIAVIGTSSGASVAY 65

Query: 393 YLLKRFPVPYALYFVAVATIAAFVGQHVV 421
                  +  A++     TI A +G  +V
Sbjct: 66  LKDDLLNIRVAMFLEIFTTIGALIGAVLV 94


>gi|392970998|ref|ZP_10336396.1| putative uncharacterized protein [Staphylococcus equorum subsp.
           equorum Mu2]
 gi|403045070|ref|ZP_10900548.1| hypothetical protein SOJ_01570 [Staphylococcus sp. OJ82]
 gi|392511000|emb|CCI59658.1| putative uncharacterized protein [Staphylococcus equorum subsp.
           equorum Mu2]
 gi|402765134|gb|EJX19218.1| hypothetical protein SOJ_01570 [Staphylococcus sp. OJ82]
          Length = 301

 Score = 38.1 bits (87), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 26/99 (26%), Positives = 46/99 (46%), Gaps = 4/99 (4%)

Query: 331 VFQLVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVV 390
           +++L  +   G LA ++ G LG+G G     + L  G+ P ++SAT  FA   +++ S  
Sbjct: 1   MYKLFIFALAGFLAQLIDGSLGMGFGASSSSILLTFGIAPAIASATIHFAEVATTAASGT 60

Query: 391 EYYLLKRFPVPYALYFVAVATIAAFVG----QHVVRKLI 425
            ++  +    P  +       I AF+G     H+   LI
Sbjct: 61  AHWKFENVHKPTMIKLAIPGAITAFIGAAFLSHIHSNLI 99


>gi|390935637|ref|YP_006393142.1| hypothetical protein Tsac_2547 [Thermoanaerobacterium
           saccharolyticum JW/SL-YS485]
 gi|389571138|gb|AFK87543.1| protein of unknown function DUF81 [Thermoanaerobacterium
           saccharolyticum JW/SL-YS485]
          Length = 252

 Score = 38.1 bits (87), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 9/132 (6%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           ++S C  G  A  +  + G GGG I  P  + LGVPP  +  T  FA T +S  S + + 
Sbjct: 6   ILSLCVIGFTAAFIDSIAG-GGGIISLPGLMVLGVPPAYALGTNKFASTCASFTSSITFM 64

Query: 394 LLKRFPVPYALYFVAVATIAAFVGQHVVRKLIILLGRASLIIFILAFMIFVSAISLGGVG 453
             K + +    Y        A +G     K ++LL  + L I I+  MIFV+  +L    
Sbjct: 65  KYKVYDIDLLKYLAFGTLFGAILGV----KAVLLLDSSKLRIIIIILMIFVAMYTL---- 116

Query: 454 ISNMIGKFHRHE 465
           +S  +G  ++ +
Sbjct: 117 LSKNVGNVNKFK 128


>gi|383935848|ref|ZP_09989281.1| UPF0721 transmembrane protein [Rheinheimera nanhaiensis E407-8]
 gi|383703167|dbj|GAB59372.1| UPF0721 transmembrane protein [Rheinheimera nanhaiensis E407-8]
          Length = 260

 Score = 38.1 bits (87), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 39/91 (42%), Gaps = 1/91 (1%)

Query: 334 LVSYCAFGVLAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTFSSSMSVVEYY 393
           L+  C     AG +  + G GGG +  P  L  G+PP V   T   A TF S  + + YY
Sbjct: 12  LLILCLVAFSAGFIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLTYY 70

Query: 394 LLKRFPVPYALYFVAVATIAAFVGQHVVRKL 424
             K F   Y    +    + A +G  VV  L
Sbjct: 71  RKKLFNPLYWRRSLGYTALGAVLGTLVVDHL 101


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.327    0.143    0.436 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,481,186,779
Number of Sequences: 23463169
Number of extensions: 322124935
Number of successful extensions: 1388241
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1480
Number of HSP's successfully gapped in prelim test: 2838
Number of HSP's that attempted gapping in prelim test: 1375784
Number of HSP's gapped (non-prelim): 14300
length of query: 477
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 331
effective length of database: 8,933,572,693
effective search space: 2957012561383
effective search space used: 2957012561383
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 79 (35.0 bits)