BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011782
(477 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255549968|ref|XP_002516035.1| conserved hypothetical protein [Ricinus communis]
gi|223544940|gb|EEF46455.1| conserved hypothetical protein [Ricinus communis]
Length = 483
Score = 593 bits (1529), Expect = e-167, Method: Compositional matrix adjust.
Identities = 328/480 (68%), Positives = 393/480 (81%), Gaps = 6/480 (1%)
Query: 1 MSRVGSE--CFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA 58
M+ +GSE R + A+ I + V S+++ +++ + E +S ++ VE +++V
Sbjct: 1 MAMIGSEWKSLSLRYVKASLICFIGVVSMTIVSEARQQQEGSSDRNSEDVESAFLMRVVN 60
Query: 59 ---PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDA 115
K + GY HVWPD+KFGW+IVVG+IIGFFG A GSVGGVGGGGIFVPML LI+GFDA
Sbjct: 61 FLWQKGQLGYTHVWPDMKFGWQIVVGSIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDA 120
Query: 116 KSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVI 175
KSS A+SKCM+TG A S V YNL+ RHPTLD+P+IDYDLALLFQP+LVLGISIGVAFNVI
Sbjct: 121 KSSTAISKCMITGAAASTVYYNLKLRHPTLDMPIIDYDLALLFQPVLVLGISIGVAFNVI 180
Query: 176 FADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPE 235
FADWMITVLLI+LF+ ST+AFLKGVETWKKETI K+EAA++L G + E K P
Sbjct: 181 FADWMITVLLIILFIGTSTKAFLKGVETWKKETIIKKEAARRLASNGGASEEVEYKPLPG 240
Query: 236 YLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVL 295
+N T PE + EPK+ +VSIIEN+ WKE G+L AVW IILAL+I KNYTTTCSV+YWV
Sbjct: 241 GPTNGTGPE-SNEPKREEVSIIENVRWKELGILFAVWLIILALEITKNYTTTCSVVYWVC 299
Query: 296 NLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGL 355
NLLQIPVA GVS+Y+AV+LYKG+R+IASKG+ GT WR QLV YCACG+ AGMVGGLLGL
Sbjct: 300 NLLQIPVALGVSSYQAVSLYKGKRRIASKGEAGTNWRVHQLVLYCACGVLAGMVGGLLGL 359
Query: 356 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 415
GGGFILGPLFLELG+PPQVSSATA FA+TFS+SMSV+EYYLLKRFPVPYALYF A+S IA
Sbjct: 360 GGGFILGPLFLELGVPPQVSSATATFAMTFSASMSVIEYYLLKRFPVPYALYFVAVSTIA 419
Query: 416 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
AFVGQHV+++LI ILGRAS+IIF LSF IFVSA+SLGGVG+A MI++IE+ EYMGFDNIC
Sbjct: 420 AFVGQHVVRRLISILGRASLIIFILSFTIFVSAISLGGVGIANMIEKIENHEYMGFDNIC 479
>gi|225442593|ref|XP_002284423.1| PREDICTED: uncharacterized protein LOC100266912 [Vitis vinifera]
gi|297743252|emb|CBI36119.3| unnamed protein product [Vitis vinifera]
Length = 481
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 327/478 (68%), Positives = 386/478 (80%), Gaps = 4/478 (0%)
Query: 1 MSRVGSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA-- 58
M+ +GS+ + R +A + L VASV VSA+ LK E++S + + VEP ++ K+
Sbjct: 1 MAGIGSKWWGLRRVALILVTFLGVASVLVSAERTLKQESSSHDGSREVEPGLMTKIVNFL 60
Query: 59 -PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKS 117
+ S Y+HVWPD+KFGW+I+VGTIIGFFG A GSVGGVGGGGIFVPML L++GFDAKS
Sbjct: 61 WDPNLSAYEHVWPDMKFGWQIIVGTIIGFFGAAFGSVGGVGGGGIFVPMLTLVIGFDAKS 120
Query: 118 SAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFA 177
S A+SKCM+ G A S V YNL+ RHPTLD+P+IDYDLALLFQP+LVLGISIGVAFNVIFA
Sbjct: 121 STAISKCMIMGAAASTVYYNLKLRHPTLDMPIIDYDLALLFQPMLVLGISIGVAFNVIFA 180
Query: 178 DWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYL 237
DWM+TVLLIVLF+ ST+AF KGVETWKKETI K+EAAK+LE GNG + E K P
Sbjct: 181 DWMVTVLLIVLFIGTSTKAFFKGVETWKKETIIKKEAAKRLESNGNGSEDGEYKALPAG- 239
Query: 238 SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 297
ND T +T PK +VSIIEN+ WKE GLL AVW IIL LQI KNYT+TCS YW LNL
Sbjct: 240 PNDGTQRDTNAPKDKEVSIIENVCWKELGLLFAVWVIILGLQIGKNYTSTCSAGYWTLNL 299
Query: 298 LQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 357
LQ+PVA GVS YEAV LYKGRR IASKG+EGT W+ L+FYC G+ AG+VGGLLGLGG
Sbjct: 300 LQVPVAVGVSGYEAVCLYKGRRVIASKGEEGTNWKVHNLIFYCGVGVLAGIVGGLLGLGG 359
Query: 358 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 417
GFILGPLFLELGIPPQVSSATA FA+TFS+SMSVVEYYLLKRFPVPYALYF A++ +AAF
Sbjct: 360 GFILGPLFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFVAVATVAAF 419
Query: 418 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+GQHV++KLI ILGRAS+IIF LSF IFVSA+SLGGVG+ MI++IE K+YMGF+N+C
Sbjct: 420 LGQHVVRKLINILGRASLIIFILSFTIFVSAISLGGVGIKDMIEKIEQKDYMGFENLC 477
>gi|224070901|ref|XP_002303288.1| predicted protein [Populus trichocarpa]
gi|222840720|gb|EEE78267.1| predicted protein [Populus trichocarpa]
Length = 426
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 313/417 (75%), Positives = 354/417 (84%), Gaps = 3/417 (0%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
K GY HVWPD+KFGW+IVVGTIIGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS
Sbjct: 10 KGLYGYTHVWPDMKFGWKIVVGTIIGFFGAALGSVGGVGGGGIFVPMLTLIIGFDAKSST 69
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+TG A S V YNL+ RHPTLD+PVIDYDLALLFQP+LVLGISIGVAFNVIFADW
Sbjct: 70 AISKCMITGAAASTVYYNLKLRHPTLDMPVIDYDLALLFQPMLVLGISIGVAFNVIFADW 129
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSN 239
MITVLLI+LF+ ST+AFLKGVETWKKETI K+EAA++LE N EE + P
Sbjct: 130 MITVLLIILFIGTSTKAFLKGVETWKKETILKQEAARRLE--SNDDDNEEVEYQPLPGGP 187
Query: 240 DTTPE-ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 298
E E +EPKK VSIIEN+YWKE GLL AVW ILAL+I KNYTTTCS+ YW LNLL
Sbjct: 188 SGGGEAENKEPKKEAVSIIENVYWKELGLLFAVWGAILALEIGKNYTTTCSMAYWALNLL 247
Query: 299 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 358
QIPVA GVS+YEAV+LYKG RKIASKG+ GT WRA QLV YCACG+ AG+VGGLLGLGGG
Sbjct: 248 QIPVAVGVSSYEAVSLYKGTRKIASKGETGTNWRAHQLVLYCACGVLAGIVGGLLGLGGG 307
Query: 359 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 418
FILGPLFLELGIPPQVSSATA FA+TFS+SMSVVEYYLLKRFPVPYALYF A++ ++AFV
Sbjct: 308 FILGPLFLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFVAVATVSAFV 367
Query: 419 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GQ V++KLI +LGRAS+IIF L+F IFVSA+SLGGVG+ MI++IEH +YMGF+NIC
Sbjct: 368 GQFVVRKLINLLGRASLIIFILAFTIFVSAISLGGVGIVNMIEKIEHHDYMGFENIC 424
>gi|337733644|gb|AEI72271.1| hypothetical protein [Citrus trifoliata]
Length = 353
Score = 557 bits (1435), Expect = e-156, Method: Compositional matrix adjust.
Identities = 294/351 (83%), Positives = 320/351 (91%), Gaps = 1/351 (0%)
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M+TG AV+ VVYNLRQRHPTLD+PVIDYDLALLFQP+LVLGISIGVAFNVIFADWMITVL
Sbjct: 1 MITGTAVATVVYNLRQRHPTLDMPVIDYDLALLFQPMLVLGISIGVAFNVIFADWMITVL 60
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE 244
LI+LF+ +S +AFLKGVETWKKETITK EAAK+LEL NG Q EE KC+PE LSN TTP+
Sbjct: 61 LIILFIGMSAKAFLKGVETWKKETITKVEAAKRLELDRNGDQAEEFKCEPENLSN-TTPK 119
Query: 245 ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAG 304
ET+EP KS+ SI++NIYWKE GLLVAVW I+LALQIAKNYT TCSVLYWVLN LQIPVAG
Sbjct: 120 ETKEPVKSEGSILKNIYWKELGLLVAVWVIVLALQIAKNYTATCSVLYWVLNFLQIPVAG 179
Query: 305 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 364
GVSAYEAVALYKGRR IASKGD GT WR QLVFYCACGITAG+VGGLLGLGGGFILGPL
Sbjct: 180 GVSAYEAVALYKGRRTIASKGDVGTNWRPHQLVFYCACGITAGVVGGLLGLGGGFILGPL 239
Query: 365 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
FLELGIPPQVSSATA FA+TFS+SMSVVEYYLLKRFPVPYALYFFA+S IAAFVGQHV++
Sbjct: 240 FLELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYFFAVSTIAAFVGQHVVR 299
Query: 425 KLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
KLI ILGRASIIIFTLS IFVSALSL G+GLAKM+ RI+ KEYMGFD+IC
Sbjct: 300 KLINILGRASIIIFTLSLTIFVSALSLSGIGLAKMVPRIDRKEYMGFDSIC 350
>gi|225430021|ref|XP_002284175.1| PREDICTED: uncharacterized protein LOC100267889 [Vitis vinifera]
gi|296081886|emb|CBI20891.3| unnamed protein product [Vitis vinifera]
Length = 478
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 309/459 (67%), Positives = 370/459 (80%), Gaps = 6/459 (1%)
Query: 22 LVVASVSVSAQSNLKPEATSKN--KNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGW 76
++AS VS + ++K EA++ N K N + KV RSGY+HVWP+++FGW
Sbjct: 17 FMLASAFVSGERSIKHEASTFNVTKEAGFNSNYLSKVVNFLWQSDRSGYQHVWPEMEFGW 76
Query: 77 RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVY 136
+IVVG++IGFFG A GSVGGVGGGGIFVPML+L++GFD KS+ A+SKCM+ G A S V Y
Sbjct: 77 QIVVGSVIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATALSKCMIMGAAGSTVYY 136
Query: 137 NLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRA 196
NL+ RHPTLD+P+IDYDLALLFQP+L++GISIGVAFNV+FADWM+TVLLIVLFL ST+A
Sbjct: 137 NLKLRHPTLDMPIIDYDLALLFQPMLMMGISIGVAFNVLFADWMVTVLLIVLFLGTSTKA 196
Query: 197 FLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 256
FLKGVETWKKETI KREAAK+L GNG + E K P SN T T + K+ +VSI
Sbjct: 197 FLKGVETWKKETIMKREAAKRLGTNGNGTEEVEYKPLPSGPSNGTQ-NATNKSKEPEVSI 255
Query: 257 IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYK 316
IEN+YWKE GLLV VW LALQIAKN T TCS+ YWVLN +QIPV+ GVS YEAV+LYK
Sbjct: 256 IENVYWKELGLLVFVWVAFLALQIAKNNTATCSMAYWVLNFMQIPVSVGVSLYEAVSLYK 315
Query: 317 GRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS 376
GRR IASKGD GT +R QL+ YC CG+ AG+VGGLLGLGGGFILGPLFLELG+PPQVSS
Sbjct: 316 GRRIIASKGDAGTNFRVHQLILYCFCGVLAGIVGGLLGLGGGFILGPLFLELGVPPQVSS 375
Query: 377 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASII 436
ATA FA+TFSSSMSVVEYYLLKRFPVPYA+Y ++ IAAF+GQHV+++LI ILGRAS+I
Sbjct: 376 ATATFAMTFSSSMSVVEYYLLKRFPVPYAVYLVLVATIAAFIGQHVVRRLISILGRASLI 435
Query: 437 IFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
IF L+F IF+SA+SLGGVGL+ MI +IE EYMGF+N+C
Sbjct: 436 IFILAFTIFISAISLGGVGLSNMIGKIERHEYMGFENLC 474
>gi|224092328|ref|XP_002309560.1| predicted protein [Populus trichocarpa]
gi|222855536|gb|EEE93083.1| predicted protein [Populus trichocarpa]
Length = 479
Score = 543 bits (1399), Expect = e-152, Method: Compositional matrix adjust.
Identities = 303/474 (63%), Positives = 369/474 (77%), Gaps = 13/474 (2%)
Query: 10 RWRLLAAAFIGLL--VVASVSVSAQSNLKPEATSKNKN---DHVEPNIVIKVS---APKS 61
+WR L + + LL ++A V VSA+ LK EATS N D V ++ V+ P
Sbjct: 7 KWRGLRSVLMVLLNFLLAFVFVSAERGLKHEATSTRMNETGDSVSSYVLKAVNFLWQPDH 66
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
+ GY+HVWP++KFGW+IV+G+IIGFFG A GSVGGVGGGGIFVPML+L++GFD KS+ A+
Sbjct: 67 K-GYQHVWPEMKFGWQIVLGSIIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATAI 125
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+ G A+S V YNL+ RHPTLD+P+IDYDLALL QP+L+LGISIGVAFNV+FADWM+
Sbjct: 126 SKCMIMGAAISTVYYNLKLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVVFADWMV 185
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
TVLLI+LFL ST+AFLKGVETWKKETI KREAAK+LE +G E + P
Sbjct: 186 TVLLIILFLGTSTKAFLKGVETWKKETIMKREAAKRLE--SDGASAGEVEYKPLPGGPSN 243
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
+P+ + K+ +VSI+EN+YWKE GLLV VW L LQI K T+TCS YWVLNLLQIP
Sbjct: 244 SPQ--KADKEEEVSILENVYWKELGLLVFVWVAFLVLQITKEGTSTCSTTYWVLNLLQIP 301
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
V+ GVS YEAV+LYKG R IASKG EGT + QLV YC GI AG+VGGLLGLGGGFI+
Sbjct: 302 VSVGVSMYEAVSLYKGHRIIASKGTEGTNFTILQLVIYCLFGILAGVVGGLLGLGGGFIM 361
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELGIPPQVSSATA FA+TFSSSMSVVEYYLLKRFPV YA+YF A++ AAF+GQH
Sbjct: 362 GPLFLELGIPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVSYAVYFVAVATFAAFIGQH 421
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++++LI + GRAS+IIF L+ IF+SA+SLGGVG+A MI +I EYMGF+N+C
Sbjct: 422 IVRRLIIVFGRASLIIFILASTIFISAISLGGVGVANMIGKIHRHEYMGFENLC 475
>gi|356556244|ref|XP_003546436.1| PREDICTED: uncharacterized protein LOC100816528 [Glycine max]
Length = 476
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 303/469 (64%), Positives = 370/469 (78%), Gaps = 8/469 (1%)
Query: 10 RWRL-LAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHV 68
RW + L F LL+ ASVS N K + + + V+ S SGY+H
Sbjct: 9 RWDVRLVLTFSLLLISASVSALEHVNHKKIDPTTAETSFLGK--VMNFLWSSSGSGYEHT 66
Query: 69 WPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTG 128
WP+I+FGWRI+ GTIIGF G+A G+VGGVGGGGIFV ML+LI+GFDAKS+ A+SKCM+TG
Sbjct: 67 WPEIEFGWRIITGTIIGFVGSAFGTVGGVGGGGIFVTMLSLIIGFDAKSATAISKCMITG 126
Query: 129 GAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
GA + V YNLRQ+HPTLD+PVIDYDLALLFQP+LVLGISIGVAFNVIFADWMITVLL+++
Sbjct: 127 GAAATVFYNLRQKHPTLDMPVIDYDLALLFQPVLVLGISIGVAFNVIFADWMITVLLLII 186
Query: 189 FLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETRE 248
F+ ++T+AFLKGVETWKKETI K+E A+Q + G +T E +P L +E
Sbjct: 187 FVGIATKAFLKGVETWKKETIIKKETARQSQFNGTE-RTAEVAYEP--LPGGPNTSNHKE 243
Query: 249 PKKSKV--SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGV 306
PKKSK SI+EN+ WK G+L VW +ILA +IAK++TTTCSV YWVLNLLQ+PVA G
Sbjct: 244 PKKSKETGSILENVRWKALGVLFTVWVLILASEIAKSHTTTCSVEYWVLNLLQVPVALGA 303
Query: 307 SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
++Y+AV LY G+R IASKGD+ T+WRA QL+ YC+CGI AG+VGGLLGLGGGFILGPLFL
Sbjct: 304 TSYQAVLLYTGKRVIASKGDQRTQWRAHQLIMYCSCGICAGIVGGLLGLGGGFILGPLFL 363
Query: 367 ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
ELGIPPQVSSATA FA+TFS+SMSVVEYYLLKRFP+PY LYF A+S AAFVGQ +++KL
Sbjct: 364 ELGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPIPYTLYFVAVSTFAAFVGQVLVRKL 423
Query: 427 IKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ ILGRAS+IIF LS IFVSA+SLGGVG++ MI+RI +KEYMGF+N+C
Sbjct: 424 VAILGRASLIIFILSSTIFVSAISLGGVGISNMIQRIANKEYMGFENLC 472
>gi|255551046|ref|XP_002516571.1| conserved hypothetical protein [Ricinus communis]
gi|223544391|gb|EEF45912.1| conserved hypothetical protein [Ricinus communis]
Length = 476
Score = 536 bits (1381), Expect = e-150, Method: Compositional matrix adjust.
Identities = 296/472 (62%), Positives = 371/472 (78%), Gaps = 13/472 (2%)
Query: 10 RWRLLAAAFIGLL--VVASVSVSAQSNLKPE--ATSKNKNDHVEPNIVIKVSA--PKSRS 63
+WR L + + L ++A V VSA+ ++ E T + + + + V V ++
Sbjct: 8 KWRGLRSVLMVSLNFLLAFVLVSAERGMRREFTGTPQTEGEGLSTYFVKAVDFLWQPDQT 67
Query: 64 GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
GY+HVWP+++FGW+IV+GTIIGFFG A GSVGGVGGGGIFVPML+LI+GFD KS+ A+SK
Sbjct: 68 GYRHVWPEMRFGWQIVLGTIIGFFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISK 127
Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
CM+ G A S V YNL+ RHPTLD+P+IDYDL LL QP+L+LGISIGVAFNVIFADWM+TV
Sbjct: 128 CMIMGAAASTVYYNLKLRHPTLDMPIIDYDLTLLIQPMLMLGISIGVAFNVIFADWMVTV 187
Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTP 243
LLI+LF+ ST+AF KGVETWKKETI K+EAAK+LE NG E + P T P
Sbjct: 188 LLIILFIGTSTKAFFKGVETWKKETIMKKEAAKRLE--ANG---AEVEYKPLPSGPSTGP 242
Query: 244 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVA 303
E +E K+ +V+++EN+YWKE GLLV VW L L IAKN+ CS+ YWV+NLLQIPV+
Sbjct: 243 E--KEAKEPEVTLLENVYWKELGLLVFVWVAFLILHIAKNHLPACSIEYWVVNLLQIPVS 300
Query: 304 GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
GVS YEAV+LYKGRR IASKG EGT ++ QLV YC+CG+ AG+VGGLLGLGGGFI+GP
Sbjct: 301 LGVSGYEAVSLYKGRRIIASKGKEGTNFKVHQLVLYCSCGVLAGVVGGLLGLGGGFIMGP 360
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
LFLELGIPPQVSSATA FA+TFSSSMSVVEYYLLKRFPVPYALYF +++ IAA +GQH++
Sbjct: 361 LFLELGIPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYFVSVATIAALIGQHIV 420
Query: 424 KKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+++I + GRAS+IIF L+F IFVSA+SLGGVG++ MI +I+H EYMGF+N+C
Sbjct: 421 RRMIIMFGRASLIIFILAFTIFVSAISLGGVGISNMIWKIQHNEYMGFENLC 472
>gi|356547396|ref|XP_003542098.1| PREDICTED: uncharacterized protein LOC100789733 [Glycine max]
Length = 487
Score = 534 bits (1375), Expect = e-149, Method: Compositional matrix adjust.
Identities = 295/463 (63%), Positives = 367/463 (79%), Gaps = 16/463 (3%)
Query: 26 SVSVSA-----QSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVV 80
SVSVSA N+ P + + V+ S SGY+H WPDI+FGWRI+
Sbjct: 24 SVSVSALQHVSHKNINPTTVETTQTSFLGK--VVNFLWSSSGSGYQHTWPDIEFGWRIIT 81
Query: 81 GTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQ 140
GTIIGF G+A G+VGGVGGGGIFV ML+LI+GFDAKS+ A+SKCM+TGGA + V YNL+Q
Sbjct: 82 GTIIGFLGSAFGTVGGVGGGGIFVTMLSLIIGFDAKSATAISKCMITGGAAATVFYNLKQ 141
Query: 141 RHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKG 200
+HPTLD+PVIDYDLALLFQP+LVLGISIGVAFNVIFADWMITVLL+++F+ ++T+AFLKG
Sbjct: 142 KHPTLDMPVIDYDLALLFQPVLVLGISIGVAFNVIFADWMITVLLLIIFVGIATKAFLKG 201
Query: 201 VETWKKETITKREAAKQLELIGNGYQTEECKCDP----EYLSNDTTPEETREP----KKS 252
VETWKKETI K+E A+Q + G ++EE +P SN P++++E +
Sbjct: 202 VETWKKETIIKKETARQSQFNGTE-RSEEVAYEPLPGGPNTSNHNEPKKSKETTVRLRHH 260
Query: 253 KVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAV 312
K S++EN+ WK G+L VW +ILA +IAK++TTTCSV YW+LNLLQ+PVA G ++Y+AV
Sbjct: 261 KGSVLENVRWKALGVLFTVWVLILASEIAKSHTTTCSVEYWILNLLQVPVALGATSYQAV 320
Query: 313 ALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPP 372
LY G+R IASKGD+ T+WRA QLV YC+CGI AG+VGGLLGLGGGFILGPLFLELGIPP
Sbjct: 321 LLYTGKRVIASKGDQRTQWRAHQLVLYCSCGICAGIVGGLLGLGGGFILGPLFLELGIPP 380
Query: 373 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 432
QVSSATA FA+TFS+SMSVVEYYLLKRFP+PY LYF A+S AAFVGQ +++KL+ ILGR
Sbjct: 381 QVSSATATFAMTFSASMSVVEYYLLKRFPIPYTLYFVAVSTFAAFVGQVLVRKLVAILGR 440
Query: 433 ASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
AS+IIF LS IFVSA+SLGGVG++ MI++I +KEYMGF+N+C
Sbjct: 441 ASLIIFILSGTIFVSAISLGGVGISNMIQKIANKEYMGFENLC 483
>gi|224059076|ref|XP_002299704.1| predicted protein [Populus trichocarpa]
gi|222846962|gb|EEE84509.1| predicted protein [Populus trichocarpa]
Length = 424
Score = 533 bits (1372), Expect = e-149, Method: Compositional matrix adjust.
Identities = 301/417 (72%), Positives = 356/417 (85%), Gaps = 3/417 (0%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
K R GY HVWP+++FGW+IV+G+IIGFFG ACGSVGGVGGGGIFVPML+LI+GFD KS+
Sbjct: 10 KGRVGYTHVWPEMRFGWKIVLGSIIGFFGAACGSVGGVGGGGIFVPMLSLIIGFDTKSAT 69
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+TGGA + V YNL+ RHPTL++PVIDYDLALLFQP+LVLGISIGVA NVIFA+W
Sbjct: 70 AMSKCMITGGAAATVYYNLKLRHPTLELPVIDYDLALLFQPMLVLGISIGVALNVIFANW 129
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSN 239
MIT+LLIVLF+V ST+AF KGVETWKKET TK+EA + LE N + EE + +P +
Sbjct: 130 MITILLIVLFIVTSTKAFFKGVETWKKETKTKKEATRSLE--SNDDRNEEVEGEPPPVGP 187
Query: 240 DTTPE-ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 298
+ ET+E KK +VSII+N+YW E LLVAVW IILALQI KNY+TTCS+ YW+LN+L
Sbjct: 188 TVGAQTETKEYKKEQVSIIKNVYWWELSLLVAVWLIILALQIGKNYSTTCSMAYWLLNIL 247
Query: 299 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 358
QIPVA GVS+YEA LYKGRRKIASKGD T W+ QLV YC G+ AG+VGG+LGLGGG
Sbjct: 248 QIPVAFGVSSYEAFNLYKGRRKIASKGDAVTNWKIRQLVLYCIIGLLAGVVGGMLGLGGG 307
Query: 359 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 418
FILGPLFLE+GIPPQVSSATA FA+ FS+SMSVVEYYLLKRFPVPYALYFFA++ +AA V
Sbjct: 308 FILGPLFLEMGIPPQVSSATATFAMMFSASMSVVEYYLLKRFPVPYALYFFAVATVAALV 367
Query: 419 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GQHV++KLI ILGRAS+IIFTL+F IFVSA+ LGGVG+A+M+KRIE KEYMGF+NIC
Sbjct: 368 GQHVIRKLISILGRASLIIFTLAFTIFVSAILLGGVGIARMVKRIERKEYMGFENIC 424
>gi|224143013|ref|XP_002324818.1| predicted protein [Populus trichocarpa]
gi|222866252|gb|EEF03383.1| predicted protein [Populus trichocarpa]
Length = 448
Score = 526 bits (1356), Expect = e-147, Method: Compositional matrix adjust.
Identities = 289/450 (64%), Positives = 352/450 (78%), Gaps = 11/450 (2%)
Query: 32 QSNLKPEATSKNKN---DHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIG 85
+ L EATS + D V ++ VS P R GY+HVWP++KFGW+IV+G+IIG
Sbjct: 1 ERGLTREATSARTDETGDSVSSYVLKAVSCLWQPDQR-GYQHVWPEMKFGWQIVLGSIIG 59
Query: 86 FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL 145
F G A GSVGGVGGGGIFVPML+LI+GFD KS+ A+SKCM+ G AVS+V YNL+ RHPT+
Sbjct: 60 FLGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGAAVSSVYYNLKLRHPTI 119
Query: 146 DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK 205
D+P+IDYDLALL QP+L+LGIS+GVA NVIFADWM+TVLLIVLF V ST+AF KGVETWK
Sbjct: 120 DMPIIDYDLALLIQPMLMLGISLGVALNVIFADWMVTVLLIVLFTVTSTKAFFKGVETWK 179
Query: 206 KETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEF 265
KETI KREAAK LE NG E + P +P+ +E K+ ++++ EN+YWKE
Sbjct: 180 KETIMKREAAKHLE--SNGTDAGEVEYKPLRGGPSNSPQ--KETKELEITVFENVYWKEL 235
Query: 266 GLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG 325
GLLV VW L LQIAK T TCS+ YWVLNLLQIPV+ GV+ YE V+LYKGRR +ASKG
Sbjct: 236 GLLVFVWVAFLVLQIAKESTYTCSIGYWVLNLLQIPVSVGVTLYEVVSLYKGRRVVASKG 295
Query: 326 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITF 385
DEGT +R QL+ YCA G+ AG+VGGLLGLGGGFI+GPLFLELG+PPQVSSATA FA+TF
Sbjct: 296 DEGTNFRVLQLMTYCAFGVLAGVVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTF 355
Query: 386 SSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIF 445
SSSMSVVEYYLLKRFPVPYA+YF ++ AAF GQH++++LI + GRAS+IIF L+F IF
Sbjct: 356 SSSMSVVEYYLLKRFPVPYAVYFVVVATFAAFTGQHIVRRLIIVFGRASLIIFILAFTIF 415
Query: 446 VSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
VSA+ LGG G++ MI +I +EYMGF+N C
Sbjct: 416 VSAILLGGFGISNMIGKIHRQEYMGFENFC 445
>gi|297821995|ref|XP_002878880.1| hypothetical protein ARALYDRAFT_481426 [Arabidopsis lyrata subsp.
lyrata]
gi|297324719|gb|EFH55139.1| hypothetical protein ARALYDRAFT_481426 [Arabidopsis lyrata subsp.
lyrata]
Length = 479
Score = 526 bits (1356), Expect = e-147, Method: Compositional matrix adjust.
Identities = 280/412 (67%), Positives = 333/412 (80%), Gaps = 5/412 (1%)
Query: 64 GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
GY+HVWP+ +F W+IV+GT++GFFG A GSVGGVGGGGIFVPML+L++GFD KS+ A+SK
Sbjct: 69 GYRHVWPEFEFNWQIVLGTLVGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATAISK 128
Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
CM+ G +VS V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGVAFNVIF DW++TV
Sbjct: 129 CMIMGASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFPDWLVTV 188
Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTP 243
LLI+LFL ST+AFLKG ETW KETI K+EAAK+LE NG E + P + T P
Sbjct: 189 LLIILFLGTSTKAFLKGSETWNKETIEKKEAAKRLE--SNGVSGAEVEYVPLPAAPSTNP 246
Query: 244 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVA 303
KK +VSIIEN+YWKE GLLV VW + LALQI+K CSV YWV+NLLQIPVA
Sbjct: 247 GNK---KKEEVSIIENVYWKELGLLVFVWVVFLALQISKQNLANCSVAYWVINLLQIPVA 303
Query: 304 GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
GVS YEAVALY+GRR IASKG + + QLV YC GI AG+VGGLLGLGGGFI+GP
Sbjct: 304 VGVSGYEAVALYQGRRIIASKGQGDSNFTVGQLVMYCTFGILAGIVGGLLGLGGGFIMGP 363
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
LFLELG+PPQVSSATA FA+TFSSSMSVVEYYLLKRFPVPYALY ++ IAAFVGQHV+
Sbjct: 364 LFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLVGVATIAAFVGQHVV 423
Query: 424 KKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++LI LGRAS+IIF L+ +IF+SA+SLGGVG+ MI +I+ EYMGF+N+C
Sbjct: 424 RRLIAALGRASLIIFILASMIFISAISLGGVGIVNMIGKIQRHEYMGFENLC 475
>gi|356518316|ref|XP_003527825.1| PREDICTED: uncharacterized protein LOC100810784 [Glycine max]
Length = 477
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 284/458 (62%), Positives = 353/458 (77%), Gaps = 11/458 (2%)
Query: 24 VASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA-----PKSRSGYKHVWPDIKFGWRI 78
V ++ S +S +K E N EP A SGY+HVWPD++FGW+I
Sbjct: 21 VVAIVASERSIMKMEVARFNVTTSREPQQSFLTKALNFLWQSGESGYQHVWPDMEFGWQI 80
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
VVGT +GF G A GSVGGVGGGGIFVPML+LIVGFD KSS A+SKCM+ G AVS V YNL
Sbjct: 81 VVGTFVGFCGAAFGSVGGVGGGGIFVPMLSLIVGFDQKSSTAISKCMIMGAAVSTVYYNL 140
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
+ RHPTL++P+IDYDLALL QP+L+LGISIGV FNV+F DW++T+LLIVLFL ST+AF
Sbjct: 141 KLRHPTLNMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWIVTILLIVLFLGTSTKAFF 200
Query: 199 KGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIE 258
KG+ETWKKETI K+EAAK+ E G+G E + P L + ++ K+ +V+IIE
Sbjct: 201 KGIETWKKETIMKKEAAKRQESNGSG---AEVEYKP--LPSGPNGANEKDTKEQEVTIIE 255
Query: 259 NIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKG 317
N+YWKEFGLLV VW LALQIAK NYTTTCS YWVLNLLQ+PV+ GV+AYEA AL+ G
Sbjct: 256 NVYWKEFGLLVFVWVSFLALQIAKENYTTTCSTFYWVLNLLQVPVSVGVTAYEAAALFSG 315
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 377
RR IAS G++G + QL+ YC G+ AG+VGG+LGLGGGF++GPLFLELG+PPQVSSA
Sbjct: 316 RRVIASTGEQGKDFTVLQLIIYCVFGVLAGVVGGMLGLGGGFVMGPLFLELGVPPQVSSA 375
Query: 378 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 437
TA FA+TFSSSMSV+EYYLLKRFP+PYALYF ++ IAAFVGQH+++KLI + GRAS+II
Sbjct: 376 TATFAMTFSSSMSVIEYYLLKRFPIPYALYFVLVATIAAFVGQHIVRKLIILFGRASLII 435
Query: 438 FTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
F L+ IFVSA+SLGGVG+ M+ +I++ EYMGF+++C
Sbjct: 436 FILASTIFVSAVSLGGVGIVNMVHKIDNHEYMGFEDLC 473
>gi|255549972|ref|XP_002516037.1| conserved hypothetical protein [Ricinus communis]
gi|223544942|gb|EEF46457.1| conserved hypothetical protein [Ricinus communis]
Length = 481
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 305/471 (64%), Positives = 371/471 (78%), Gaps = 6/471 (1%)
Query: 10 RWRL--LAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSG 64
RW L L A LVV S+ +++ LK + + VE + V + S K R G
Sbjct: 8 RWSLIMLGAILSSFLVVGSLLALSETRLKLLGSDYKRPKEVEFHHVTRASNFFYQKGRLG 67
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
YKHVWP+++FGW+IV+G +IG FG ACGSVGGVGGGGIFVPML LIVGFDAKSS A+SKC
Sbjct: 68 YKHVWPEMRFGWKIVLGGVIGVFGAACGSVGGVGGGGIFVPMLTLIVGFDAKSSTALSKC 127
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M+TG A + V YNL+ RHPTL++P+IDYDLALLFQP+LVLGIS+GVA NVIFADW IT+L
Sbjct: 128 MITGAAAATVYYNLKLRHPTLELPIIDYDLALLFQPMLVLGISLGVALNVIFADWAITIL 187
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE 244
LI+LF+V S++AF KGVETWK+ET K+EAA+ LE+ + + E K P S
Sbjct: 188 LIILFIVTSSKAFFKGVETWKQETKLKKEAARHLEIDEDTTEEMEEKSPPTG-STIVAQA 246
Query: 245 ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAG 304
+ E K+ KVSI EN+ WKE GLLVAVW+IILALQI KNY+TTCSV YWVL+ LQIPVA
Sbjct: 247 QFSESKREKVSIGENVCWKELGLLVAVWSIILALQIGKNYSTTCSVTYWVLSTLQIPVAV 306
Query: 305 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 364
GV+ YEA++LYKG+RKIAS+G+ T W+ QLV Y A GI AG+VGG+LGLGGGFILGPL
Sbjct: 307 GVTGYEAISLYKGQRKIASRGEVSTNWKIHQLVSYSAIGILAGVVGGMLGLGGGFILGPL 366
Query: 365 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
FLE+GIPPQVSSATA F + FS+SMSV+EYYLLKRFPVPYALY FA++IIAAFVGQHV++
Sbjct: 367 FLEMGIPPQVSSATATFIMAFSASMSVIEYYLLKRFPVPYALYLFAVAIIAAFVGQHVVR 426
Query: 425 KLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
KLI +LGRAS+IIF L+F IF SA+SLGGVG+A+ IK+IE KEYMGF++IC
Sbjct: 427 KLISLLGRASLIIFILAFTIFASAISLGGVGIARAIKKIERKEYMGFEDIC 477
>gi|30682879|ref|NP_850068.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
gi|22531098|gb|AAM97053.1| unknown protein [Arabidopsis thaliana]
gi|25083770|gb|AAN72117.1| unknown protein [Arabidopsis thaliana]
gi|26450521|dbj|BAC42374.1| unknown protein [Arabidopsis thaliana]
gi|28951043|gb|AAO63445.1| At2g25737 [Arabidopsis thaliana]
gi|330252652|gb|AEC07746.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
Length = 476
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 287/450 (63%), Positives = 346/450 (76%), Gaps = 8/450 (1%)
Query: 29 VSAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIG 85
VSA+ K S ++ E + +K + GY+HVWP+ +F W+IV+GT++G
Sbjct: 28 VSAERRGKSLRLSTDETRENESSFFLKAINFLWESDQIGYRHVWPEFEFNWQIVLGTLVG 87
Query: 86 FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL 145
FFG A GSVGGVGGGGIFVPML+LI+GFD KS+ A+SKCM+ G +VS V YNLR RHPTL
Sbjct: 88 FFGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSATAISKCMIMGASVSTVYYNLRLRHPTL 147
Query: 146 DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK 205
D+P+IDYDLALL QP+L+LGISIGVAFNVIF DW++TVLLIVLFL ST+AFLKG ETW
Sbjct: 148 DMPIIDYDLALLIQPMLMLGISIGVAFNVIFPDWLVTVLLIVLFLGTSTKAFLKGSETWN 207
Query: 206 KETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEF 265
KETI K+EAAK+LE NG E + P + T P KK +VSIIEN+YWKE
Sbjct: 208 KETIEKKEAAKRLE--SNGVSGTEVEYVPLPAAPSTNPGNK---KKEEVSIIENVYWKEL 262
Query: 266 GLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG 325
GLLV VW + LALQI+K CSV YWV+NLLQIPVA GVS YEAVALY+GRR IASKG
Sbjct: 263 GLLVFVWIVFLALQISKQNLANCSVAYWVINLLQIPVAVGVSGYEAVALYQGRRIIASKG 322
Query: 326 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITF 385
+ + QLV YC GI AG+VGGLLGLGGGFI+GPLFLELG+PPQVSSATA FA+TF
Sbjct: 323 QGDSNFTVGQLVMYCTFGIIAGIVGGLLGLGGGFIMGPLFLELGVPPQVSSATATFAMTF 382
Query: 386 SSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIF 445
SSSMSVVEYYLLKRFPVPYALY ++ IAA+VGQHV+++LI +GRAS+IIF L+ +IF
Sbjct: 383 SSSMSVVEYYLLKRFPVPYALYLVGVATIAAWVGQHVVRRLIAAIGRASLIIFILASMIF 442
Query: 446 VSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+SA+SLGGVG+ MI +I+ EYMGF+N+C
Sbjct: 443 ISAISLGGVGIVNMIGKIQRHEYMGFENLC 472
>gi|255549970|ref|XP_002516036.1| conserved hypothetical protein [Ricinus communis]
gi|223544941|gb|EEF46456.1| conserved hypothetical protein [Ricinus communis]
Length = 466
Score = 523 bits (1346), Expect = e-146, Method: Compositional matrix adjust.
Identities = 301/475 (63%), Positives = 366/475 (77%), Gaps = 13/475 (2%)
Query: 1 MSRVGSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPK 60
M G + + RL +GL+VVAS+ + A+S T + H K
Sbjct: 2 MDWSGLKEWNLRLRKKYLVGLIVVASMVIVAESASSYNQTQQAGYLH---------HGNK 52
Query: 61 SRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAA 120
S YKHVWP+++ GWRIVVG IIGFFG A GS GGVGGGGI+VPML LI+GFDAKSS A
Sbjct: 53 GHSDYKHVWPEMRIGWRIVVGAIIGFFGAAFGSAGGVGGGGIYVPMLTLIIGFDAKSSIA 112
Query: 121 VSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWM 180
+SKCM+TG A S V YNL+QRHPTL++PVIDYDLALLFQP+LVLG+SIGV FNVIFADWM
Sbjct: 113 ISKCMITGAAASTVYYNLKQRHPTLEMPVIDYDLALLFQPMLVLGVSIGVTFNVIFADWM 172
Query: 181 ITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSND 240
ITVLLI++F+ +ST+AF KG++TWKKETI K+ A L+G+ + PE LS D
Sbjct: 173 ITVLLIIIFIFMSTKAFSKGIQTWKKETIKKKIAPL---LLGDNAEVVIPTPPPEILS-D 228
Query: 241 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 300
ET+ PKK KVS+IEN+YWK GLL+ +W +ILALQIAKNYT TC+V YW+L+ QI
Sbjct: 229 NAQIETKGPKKEKVSLIENVYWKALGLLLVIWFMILALQIAKNYTKTCTVPYWILDFAQI 288
Query: 301 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 360
PVA ++ Y+AV LYKGRRKIAS+G+ + WR +L+ YC G+ AG++GGLLGLGGGFI
Sbjct: 289 PVAASLTIYQAVRLYKGRRKIASRGEAESNWRVHKLLLYCFSGLLAGIIGGLLGLGGGFI 348
Query: 361 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 420
LGPLFLE+GIPPQVSSATA FA+TFS+SMSVVEYYLLKRFPVPYALY FA++ +A FVGQ
Sbjct: 349 LGPLFLEMGIPPQVSSATATFAMTFSASMSVVEYYLLKRFPVPYALYLFAVTTLAGFVGQ 408
Query: 421 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+V+KK+I +LGRASIIIF LSF I VSA+SLGGVGLA MIK+IE KEYMGF++IC
Sbjct: 409 YVVKKVIDMLGRASIIIFILSFTILVSAISLGGVGLADMIKKIERKEYMGFEDIC 463
>gi|356510003|ref|XP_003523730.1| PREDICTED: uncharacterized protein LOC100809318 [Glycine max]
Length = 472
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 277/415 (66%), Positives = 340/415 (81%), Gaps = 10/415 (2%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
SGY+HVWPD++FGW IV+GT +GF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+
Sbjct: 63 ESGYQHVWPDMEFGWEIVLGTFVGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAI 122
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+ G AVS V YNL+ RHPTL++P+IDYDLALL QP+L+LGISIGV FNV+F DW++
Sbjct: 123 SKCMIMGAAVSTVYYNLKLRHPTLNMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWIV 182
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
T+LLIVLFL ST+AF KGVETWKKETI K+EAAK+ E G+G E Y +
Sbjct: 183 TILLIVLFLGTSTKAFFKGVETWKKETIMKKEAAKRQESNGSGAVVE-------YKPLPS 235
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQI 300
PE ++ K+ ++SIIEN+YWKEFGLLV VW LALQIAK NYTTTCS LYWVLNLLQ+
Sbjct: 236 GPE--KDTKEQEMSIIENVYWKEFGLLVFVWVSFLALQIAKENYTTTCSTLYWVLNLLQV 293
Query: 301 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 360
PV+ GV+AYEA AL+ GRR IAS G++G + QL+ YC G+ AG+VGG+LGLGGGF+
Sbjct: 294 PVSVGVTAYEAAALFSGRRVIASTGEQGKDFTVLQLMIYCVFGVLAGVVGGMLGLGGGFV 353
Query: 361 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 420
+GPLFLELG+PPQVSSATA FA+TFSSSMSV+EYYLLKRFPVPYALYF ++ IAAFVGQ
Sbjct: 354 MGPLFLELGVPPQVSSATATFAMTFSSSMSVIEYYLLKRFPVPYALYFILVATIAAFVGQ 413
Query: 421 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
H+++KLI + GRAS+IIF L+ IFVSA+SLGGVG+ M+ +I++ EYMGF+++C
Sbjct: 414 HIVRKLIILFGRASLIIFILASTIFVSAVSLGGVGIVNMVHKIQNHEYMGFEDLC 468
>gi|358344801|ref|XP_003636475.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
gi|355502410|gb|AES83613.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
Length = 470
Score = 520 bits (1338), Expect = e-145, Method: Compositional matrix adjust.
Identities = 277/414 (66%), Positives = 336/414 (81%), Gaps = 12/414 (2%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
SGY+HVWP+++FGW+IV+G+ IGF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+S
Sbjct: 64 SGYQHVWPEMEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAIS 123
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G A+S V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 124 KCMIMGAALSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVT 183
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
+LLIVLFL ST+AF KG+ETW KETI K+EAA++ E G+G EY + T
Sbjct: 184 ILLIVLFLGTSTKAFFKGLETWNKETIMKKEAARRQESNGSG----------EYKALPTG 233
Query: 243 PEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQIP 301
P E K++KVSIIEN+YWKEFGLL VW LALQIAK NYTTTCS YW+LNLLQIP
Sbjct: 234 PNVAIE-KENKVSIIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQIP 292
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
+A GV+AYEA AL+ GRR IAS GD+G + QLV YC G+ AG+VGGLLGLGGGF++
Sbjct: 293 IAVGVTAYEATALFTGRRVIASTGDQGKAFTVGQLVIYCVFGVLAGIVGGLLGLGGGFVM 352
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELG+PPQVSSATA FA+TFSSSMSVVEYYLLKRFPVPYALY ++ IAA VGQH
Sbjct: 353 GPLFLELGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLSLVATIAALVGQH 412
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++++LI + GRAS+IIF L+ IF+SA+SLGGVG++ M+ +I + EYMGF+NIC
Sbjct: 413 IVRRLIILFGRASLIIFILAGTIFISAISLGGVGISNMVHKIANHEYMGFENIC 466
>gi|224143011|ref|XP_002324817.1| predicted protein [Populus trichocarpa]
gi|222866251|gb|EEF03382.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 516 bits (1328), Expect = e-143, Method: Compositional matrix adjust.
Identities = 279/478 (58%), Positives = 358/478 (74%), Gaps = 12/478 (2%)
Query: 1 MSRVGSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA-- 58
M+ +G++ R A F ++ AS +SA+ ++K + ++ + +EPN ++K+ +
Sbjct: 1 MAFLGAKFLSLRPAAILFFTFML-ASAFISAERSIKHGVSRLSEAEELEPNYLVKIISFL 59
Query: 59 -PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKS 117
SGY+HVWP++ F W+ V+GTIIGFFG A GSVGGVGGGGIFVPML LI+GFD KS
Sbjct: 60 WQSDESGYQHVWPELSFNWQAVIGTIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKS 119
Query: 118 SAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFA 177
+ A+SKCM+ G AVS V +NLR RHPTLD+PVIDYDL LL QP+L+LGISIGV FNVIF
Sbjct: 120 ATAISKCMIMGAAVSTVYHNLRLRHPTLDLPVIDYDLVLLIQPMLMLGISIGVTFNVIFP 179
Query: 178 DWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYL 237
DW+ITV LI+L + S +AF KG++TWK ETI K AAK+ E G G + E K P+
Sbjct: 180 DWVITVTLIILCIGTSVKAFFKGLDTWKIETILKEVAAKRFESTG-GSEEVEYKPLPDGQ 238
Query: 238 SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 297
ND PK SKV+I++NIYWKE GLLV VW LA+QIAKNYT CS +WVLNL
Sbjct: 239 GND-------PPKASKVTILQNIYWKELGLLVFVWVSYLAVQIAKNYTAPCSTTFWVLNL 291
Query: 298 LQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 357
LQIP++ GV YEA+ LYKGRR+I+SKGDE W+ +L+ + CG+ AG+VGGLLG+GG
Sbjct: 292 LQIPISIGVFLYEAIGLYKGRRRISSKGDEIMDWQVHRLLMFSVCGVVAGIVGGLLGIGG 351
Query: 358 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 417
GF++GPLFLE+GIPPQVSS TA F + FSSSMSVVEYYLL RFPVPYALYF A+++IAAF
Sbjct: 352 GFVMGPLFLEMGIPPQVSSGTATFGMLFSSSMSVVEYYLLDRFPVPYALYFIAVAVIAAF 411
Query: 418 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+GQ+++ KLI I GRAS+IIF L+F IF+SA++LGGVG+ + I +E EYMGF+N+C
Sbjct: 412 IGQNIITKLITISGRASLIIFVLAFTIFISAIALGGVGITRTIGMVERHEYMGFENLC 469
>gi|115478679|ref|NP_001062933.1| Os09g0344800 [Oryza sativa Japonica Group]
gi|50252864|dbj|BAD29095.1| membrane protein-like [Oryza sativa Japonica Group]
gi|113631166|dbj|BAF24847.1| Os09g0344800 [Oryza sativa Japonica Group]
gi|215692830|dbj|BAG88213.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 481
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 276/416 (66%), Positives = 331/416 (79%), Gaps = 12/416 (2%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y HVWP+++ GW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 71 NSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 130
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 131 KCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVLFPDWLVT 190
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
VLLI+LFL ST+AFLKGVETWKKETI KREAAK+LE QT E +PEY T
Sbjct: 191 VLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLE------QTSE---EPEYAPLPTG 241
Query: 243 P---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 299
P + + P S+++NIYWKEFGLL VW L LQ+ KNYT TCS YW+LNLLQ
Sbjct: 242 PGAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYTATCSSWYWILNLLQ 301
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
IPV+ GV+ YEA+ L G+R ++SKG+E T + QL YC GITAG+VGGLLGLGGGF
Sbjct: 302 IPVSVGVTMYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGITAGVVGGLLGLGGGF 361
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
I+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYALYF ++ IAA +G
Sbjct: 362 IMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYALYFVIVAFIAAIIG 421
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
QHV+++LI LGRAS+IIF L+F+IFVSA+SLGGVG++ MI RI EYMGF+N+C
Sbjct: 422 QHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHRINQHEYMGFENLC 477
>gi|222641386|gb|EEE69518.1| hypothetical protein OsJ_28979 [Oryza sativa Japonica Group]
Length = 481
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 276/416 (66%), Positives = 331/416 (79%), Gaps = 12/416 (2%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y HVWP+++ GW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 71 NSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 130
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 131 KCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVLFPDWLVT 190
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
VLLI+LFL ST+AFLKGVETWKKETI KREAAK+LE QT E +PEY T
Sbjct: 191 VLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLE------QTSE---EPEYAPLPTG 241
Query: 243 P---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 299
P + + P S+++NIYWKEFGLL VW L LQ+ KNYT TCS YW+LNLLQ
Sbjct: 242 PGAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYTATCSSWYWILNLLQ 301
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
IPV+ GV+ YEA+ L G+R ++SKG+E T + QL YC GITAG+VGGLLGLGGGF
Sbjct: 302 IPVSVGVTMYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGITAGVVGGLLGLGGGF 361
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
I+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYALYF ++ IAA +G
Sbjct: 362 IMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYALYFVIVAFIAAIIG 421
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
QHV+++LI LGRAS+IIF L+F+IFVSA+SLGGVG++ MI RI EYMGF+N+C
Sbjct: 422 QHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHRINQHEYMGFENLC 477
>gi|218201973|gb|EEC84400.1| hypothetical protein OsI_30977 [Oryza sativa Indica Group]
Length = 465
Score = 513 bits (1321), Expect = e-143, Method: Compositional matrix adjust.
Identities = 275/416 (66%), Positives = 331/416 (79%), Gaps = 12/416 (2%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y HVWP+++ GW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 55 NSYHHVWPEMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 114
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 115 KCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVLFPDWLVT 174
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
VLLI+LFL ST+AFLKGVETWKKETI KREAAK+LE QT E +PEY T
Sbjct: 175 VLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLE------QTSE---EPEYAPLPTG 225
Query: 243 P---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 299
P + + P S+++NIYWKEFGLL VW L LQ+ KNYT TCS YW+LNLLQ
Sbjct: 226 PGAVADAKRPSDEAASLMKNIYWKEFGLLAFVWMAFLVLQVTKNYTATCSSWYWILNLLQ 285
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
IPV+ GV+ YEA+ L G+R ++SKG+E T + QL YC GITAG+VGGLLGLGGGF
Sbjct: 286 IPVSVGVTLYEALGLMSGKRVLSSKGNEQTTLKFHQLCIYCFFGITAGVVGGLLGLGGGF 345
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
I+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYALYF ++ IAA +G
Sbjct: 346 IMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYALYFVIVAFIAAIIG 405
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
QHV+++LI LGRAS+IIF L+F+IFVSA+SLGGVG++ MI +I EYMGF+N+C
Sbjct: 406 QHVVRRLINWLGRASLIIFILAFMIFVSAISLGGVGISNMIHKINQHEYMGFENLC 461
>gi|326489334|dbj|BAK01650.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 513 bits (1320), Expect = e-143, Method: Compositional matrix adjust.
Identities = 275/416 (66%), Positives = 332/416 (79%), Gaps = 12/416 (2%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y+HVWP ++FGW++V+G++IGFFG + GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 61 NSYQHVWPPMEFGWKMVLGSLIGFFGASFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 120
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+TG AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW++T
Sbjct: 121 KCMITGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVIFPDWLVT 180
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
VLLIVLFL ST+AFLKGVETWKKETI KREAAK+LE QT E +PEY T
Sbjct: 181 VLLIVLFLGTSTKAFLKGVETWKKETIIKREAAKRLE------QTSE---EPEYAPLPTG 231
Query: 243 P---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 299
P + + S+++NIYWKEFGLL VW LA+QI KNY TCS YWVLN LQ
Sbjct: 232 PGAVADVKPQSDEAPSLMKNIYWKEFGLLTFVWLAFLAIQITKNYAPTCSTWYWVLNFLQ 291
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
IPV+ GV+ YEA+ L G+R ++SKGDE T + QL YC G+TAG+VGGLLGLGGGF
Sbjct: 292 IPVSVGVTMYEALGLMNGKRVLSSKGDEQTTLKFHQLCIYCFFGVTAGLVGGLLGLGGGF 351
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
I+GPLFLELGIPPQVSSATA FA+ FS+SMSVVEYYLL RFPVPYA YF A++ +AA +G
Sbjct: 352 IMGPLFLELGIPPQVSSATATFAMMFSASMSVVEYYLLNRFPVPYAAYFVAVAFVAAIIG 411
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
QHV+++LI LGRAS+IIF L+F+IFVSA+SLGGVG++ MI +IE EYMGF+N+C
Sbjct: 412 QHVVRRLISWLGRASLIIFILAFMIFVSAISLGGVGISNMIHKIEQHEYMGFENLC 467
>gi|115476246|ref|NP_001061719.1| Os08g0389700 [Oryza sativa Japonica Group]
gi|40253412|dbj|BAD05341.1| membrane protein-like [Oryza sativa Japonica Group]
gi|40253788|dbj|BAD05726.1| membrane protein-like [Oryza sativa Japonica Group]
gi|113623688|dbj|BAF23633.1| Os08g0389700 [Oryza sativa Japonica Group]
Length = 465
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 276/417 (66%), Positives = 326/417 (78%), Gaps = 12/417 (2%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
+ Y HVWP ++FGW+IV+G +IGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+
Sbjct: 54 EASYHHVWPPMEFGWKIVLGILIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAI 113
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW+I
Sbjct: 114 SKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLI 173
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
TVLLI+LFL ST+AFLKGVETWKKETI KREAAK+LE I +PEY T
Sbjct: 174 TVLLIILFLGTSTKAFLKGVETWKKETILKREAAKRLEQIAE---------EPEYSPLPT 224
Query: 242 TPE---ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 298
P E++ P S+ +N+YWKEFGLL VW L LQ+ KNY TCS YWVLN L
Sbjct: 225 GPNATAESKAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLNFL 284
Query: 299 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 358
QIPV+ GV+ YE + L +GRR I+SKGDE T + QL+ YC G+ AG+VGGLLGLGGG
Sbjct: 285 QIPVSVGVTMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLGGG 344
Query: 359 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 418
FI+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYAL+F ++ AA +
Sbjct: 345 FIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTVVAFFAAII 404
Query: 419 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GQH+++KLI LGRAS+IIF LSF+IF+SA+SLGGVG++ MI +I EYMGFDNIC
Sbjct: 405 GQHIVRKLINWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNIC 461
>gi|218201092|gb|EEC83519.1| hypothetical protein OsI_29108 [Oryza sativa Indica Group]
Length = 466
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 278/417 (66%), Positives = 328/417 (78%), Gaps = 12/417 (2%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
+ Y HVWP ++FGW+I++G +IGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+
Sbjct: 55 EASYHHVWPPMEFGWKIILGILIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAI 114
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW+I
Sbjct: 115 SKCMIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLI 174
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
TVLLI+LFL ST+AFLKGVETWKKETI KREAAK+LE I TEE PEY T
Sbjct: 175 TVLLIILFLGTSTKAFLKGVETWKKETIIKREAAKRLEQI-----TEE----PEYSPLPT 225
Query: 242 TPE---ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLL 298
P E++ P S+ +N+YWKEFGLL VW L LQ+ KNY TCS YWVLN L
Sbjct: 226 GPNATAESKAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLNFL 285
Query: 299 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGG 358
QIPV+ GV+ YE + L +GRR I+SKGDE T + QL+ YC G+ AG+VGGLLGLGGG
Sbjct: 286 QIPVSVGVTMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLGGG 345
Query: 359 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 418
FI+GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYAL+F ++ AA +
Sbjct: 346 FIMGPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTTVAFFAAII 405
Query: 419 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GQH+++KLI LGRAS+IIF LSF+IF+SA+SLGGVG++ MI +I EYMGFDNIC
Sbjct: 406 GQHIVRKLINWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNIC 462
>gi|358344803|ref|XP_003636476.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
gi|355502411|gb|AES83614.1| hypothetical protein MTR_042s0018 [Medicago truncatula]
Length = 480
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 277/424 (65%), Positives = 336/424 (79%), Gaps = 22/424 (5%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
SGY+HVWP+++FGW+IV+G+ IGF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+S
Sbjct: 64 SGYQHVWPEMEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAIS 123
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G A+S V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGV FNV+F DW++T
Sbjct: 124 KCMIMGAALSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVT 183
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
+LLIVLFL ST+AF KG+ETW KETI K+EAA++ E G+G EY + T
Sbjct: 184 ILLIVLFLGTSTKAFFKGLETWNKETIMKKEAARRQESNGSG----------EYKALPTG 233
Query: 243 PEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQIP 301
P E K++KVSIIEN+YWKEFGLL VW LALQIAK NYTTTCS YW+LNLLQIP
Sbjct: 234 PNVAIE-KENKVSIIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQIP 292
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
+A GV+AYEA AL+ GRR IAS GD+G + QLV YC G+ AG+VGGLLGLGGGF++
Sbjct: 293 IAVGVTAYEATALFTGRRVIASTGDQGKAFTVGQLVIYCVFGVLAGIVGGLLGLGGGFVM 352
Query: 362 GPLFLELGIPPQ----------VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFAL 411
GPLFLELG+PPQ VSSATA FA+TFSSSMSVVEYYLLKRFPVPYALY +
Sbjct: 353 GPLFLELGVPPQVSFKLQYINHVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLSLV 412
Query: 412 SIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGF 471
+ IAA VGQH++++LI + GRAS+IIF L+ IF+SA+SLGGVG++ M+ +I + EYMGF
Sbjct: 413 ATIAALVGQHIVRRLIILFGRASLIIFILAGTIFISAISLGGVGISNMVHKIANHEYMGF 472
Query: 472 DNIC 475
+NIC
Sbjct: 473 ENIC 476
>gi|357158060|ref|XP_003578003.1| PREDICTED: uncharacterized protein LOC100827421 [Brachypodium
distachyon]
Length = 469
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 273/416 (65%), Positives = 334/416 (80%), Gaps = 12/416 (2%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y HVWP ++FGW++V+GT+IGFFG + GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 59 NSYHHVWPPMEFGWKVVLGTLIGFFGASFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 118
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+TG AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW++T
Sbjct: 119 KCMITGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVIFPDWLVT 178
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
VLLI+LFL ST+AFLKG+ETWKKETI KREAAK+LE QT E +PEY T
Sbjct: 179 VLLIILFLGTSTKAFLKGLETWKKETILKREAAKRLE------QTSE---EPEYAPLPTG 229
Query: 243 P---EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 299
P + + P S+++NIYW+EFGLL VW LA+QI KNYT+TCS YW+LN LQ
Sbjct: 230 PGAAADVKIPSDEAPSLMKNIYWREFGLLTLVWLSFLAIQITKNYTSTCSTWYWILNFLQ 289
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
IPV+ GV+ YEA+ L G+R ++SKG E T + QL YC G+TAG+VGGLLGLGGGF
Sbjct: 290 IPVSVGVTLYEALGLMNGKRVLSSKGGEQTTLKFHQLCIYCFFGVTAGLVGGLLGLGGGF 349
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
I+GPLFLELGIPPQVSSATA FA+ FS+SMSVVEYYLL RFPVPYA+YF A++ IAA +G
Sbjct: 350 IMGPLFLELGIPPQVSSATATFAMMFSASMSVVEYYLLNRFPVPYAVYFVAVAFIAAIIG 409
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
QH ++KLI LGRAS+IIF L+F+IFVSA+SLGGVG++ MI++I+ EYMGF+N+C
Sbjct: 410 QHAVRKLISWLGRASLIIFILAFMIFVSAISLGGVGVSNMIQKIQRHEYMGFENLC 465
>gi|449437030|ref|XP_004136295.1| PREDICTED: uncharacterized protein LOC101214783 [Cucumis sativus]
Length = 471
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 280/473 (59%), Positives = 353/473 (74%), Gaps = 13/473 (2%)
Query: 5 GSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSG 64
G C +R +A VV ++ VS + +K E +S++ N + ++G
Sbjct: 6 GGICAGFRSMALTMFNFAVVLAL-VSGERKMKLEGSSEDLNLF---QGALNFLWQSDKTG 61
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
Y HVWP++ FGW+IVVG++IGF G A GSVGGVGGGGI+VPML+LI+GFD KSS A+SKC
Sbjct: 62 YHHVWPEMAFGWQIVVGSVIGFMGAAFGSVGGVGGGGIYVPMLSLIIGFDPKSSTAISKC 121
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M+ G A S V YNL+ RHPTL++P+IDYDL LL P+L+LGIS+GV FNVIFADWM+TVL
Sbjct: 122 MIMGAAASTVYYNLKLRHPTLELPIIDYDLVLLIMPMLMLGISVGVVFNVIFADWMVTVL 181
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE 244
LIVLFL ST++FLKGVETWKKETI K EAA++ E + + + D Y + P
Sbjct: 182 LIVLFLATSTKSFLKGVETWKKETIMKTEAARRAE-------SNDIQADTAYAPLPSGPS 234
Query: 245 ETREPKKS--KVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPV 302
E + +V I+EN+YWKE GLL+ VW LA+QI K ++ CS YW+LN LQ+P+
Sbjct: 235 HRPETNNTDREVPILENVYWKEMGLLLLVWFAFLAIQITKKHSPNCSWEYWLLNFLQVPI 294
Query: 303 AGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 362
A GVS YEAV LYKG+RKIAS G++ T +R QLV YC GI AG+VGGLLGLGGGFI+G
Sbjct: 295 AFGVSGYEAVGLYKGKRKIASLGNQKTDFRIYQLVLYCFMGIIAGVVGGLLGLGGGFIMG 354
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
PLFLELGIPPQVSSA+A F +TFSSSMSV++YYLL RFPVPYALYF ++ +AAFVGQHV
Sbjct: 355 PLFLELGIPPQVSSASATFGMTFSSSMSVIQYYLLNRFPVPYALYFTIVAAVAAFVGQHV 414
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++KLI ++GRAS+IIF LSF IFVSALSLGGVG++KMI +I+ EYMGF+N+C
Sbjct: 415 IRKLILLIGRASLIIFILSFTIFVSALSLGGVGISKMIGQIQRHEYMGFENLC 467
>gi|449523247|ref|XP_004168635.1| PREDICTED: uncharacterized LOC101214783 [Cucumis sativus]
Length = 471
Score = 510 bits (1314), Expect = e-142, Method: Compositional matrix adjust.
Identities = 280/473 (59%), Positives = 353/473 (74%), Gaps = 13/473 (2%)
Query: 5 GSECFRWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSG 64
G C +R +A VV ++ VS + +K E +S++ N + ++G
Sbjct: 6 GGICAGFRSMALTMFNFAVVLAL-VSGERKMKLEGSSEDLNLF---QGALNFLWQSDKTG 61
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
Y HVWP++ FGW+IVVG++IGF G A GSVGGVGGGGI+VPML+LI+GFD KSS A+SKC
Sbjct: 62 YHHVWPEMAFGWQIVVGSVIGFMGAAFGSVGGVGGGGIYVPMLSLIIGFDPKSSTAISKC 121
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M+ G A S V YNL+ RHPTL++P+IDYDL LL P+L+LGIS+GV FNVIFADWM+TVL
Sbjct: 122 MIMGAAASTVYYNLKLRHPTLELPIIDYDLVLLIMPMLMLGISVGVVFNVIFADWMVTVL 181
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE 244
LIVLFL ST++FLKGVETWKKETI K EAA++ E + + + D Y + P
Sbjct: 182 LIVLFLATSTKSFLKGVETWKKETIMKTEAARRAE-------SNDIQADTAYAPLPSGPS 234
Query: 245 ETREPKKS--KVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPV 302
E + +V I+EN+YWKE GLL+ VW LA+QI K ++ CS YW+LN LQ+P+
Sbjct: 235 HRPETNYTDREVPILENVYWKEMGLLLLVWFAFLAIQITKKHSPNCSWEYWLLNFLQVPI 294
Query: 303 AGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 362
A GVS YEAV LYKG+RKIAS G++ T +R QLV YC GI AG+VGGLLGLGGGFI+G
Sbjct: 295 AFGVSGYEAVGLYKGKRKIASLGNQKTDFRIYQLVLYCFMGIIAGVVGGLLGLGGGFIMG 354
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
PLFLELGIPPQVSSA+A F +TFSSSMSV++YYLL RFPVPYALYF ++ +AAFVGQHV
Sbjct: 355 PLFLELGIPPQVSSASATFGMTFSSSMSVIQYYLLNRFPVPYALYFTIVAAVAAFVGQHV 414
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++KLI ++GRAS+IIF LSF IFVSALSLGGVG++KMI +I+ EYMGF+N+C
Sbjct: 415 IRKLILLIGRASLIIFILSFTIFVSALSLGGVGISKMIGQIQRHEYMGFENLC 467
>gi|356511003|ref|XP_003524221.1| PREDICTED: uncharacterized protein LOC100818240 [Glycine max]
Length = 473
Score = 509 bits (1311), Expect = e-141, Method: Compositional matrix adjust.
Identities = 270/440 (61%), Positives = 347/440 (78%), Gaps = 5/440 (1%)
Query: 40 TSKNKNDHVEPNIVIKV-SAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVG 98
T KN E N + V + ++ Y+H WP +K+GW+I+VGTI+GF G+A G+VGGVG
Sbjct: 29 TVSAKNTTSESNEITGVGTGYHAKVFYQHTWPSMKYGWKIIVGTIVGFLGSAFGNVGGVG 88
Query: 99 GGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLF 158
GGGIFVPML LI+GFDAKS+ A+SKCM+TGGA + V YNLRQRHPTLD+PVIDYDLALLF
Sbjct: 89 GGGIFVPMLTLIIGFDAKSAIAISKCMITGGATATVFYNLRQRHPTLDLPVIDYDLALLF 148
Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
QP+L+LGISIGV+FNVIF DWM+T LLI+ F +S ++F KGV+TWK+ET+ +EA+K
Sbjct: 149 QPMLMLGISIGVSFNVIFPDWMLTTLLIISFTGISVKSFFKGVDTWKQETLMMKEASKNS 208
Query: 219 ELIGNGYQTE--ECKCDPEYLSN-DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAII 275
G G T + D Y+ D + T + +K KVS+I+N++WKE GLL AVW +I
Sbjct: 209 LTDGRGGSTYIGNPEDDAHYIKTIDPVKDNTNQSRK-KVSVIDNVHWKELGLLFAVWIMI 267
Query: 276 LALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQ 335
LAL+I KNYTTTCS +YWV+NLLQ+P+A G+S+Y+A+ LYKG+R IASKGD+ T WR Q
Sbjct: 268 LALEIGKNYTTTCSGVYWVINLLQVPIAVGMSSYQAMRLYKGQRIIASKGDQQTHWRVLQ 327
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
L+ +CACGI AG + GLLGLGGGFIL PLFL LGIPPQV+SAT++ A+ FS+S++VVEYY
Sbjct: 328 LIVFCACGILAGTIAGLLGLGGGFILAPLFLGLGIPPQVASATSILAMAFSASIAVVEYY 387
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVG 455
LLKRFP+ YALYF A++ AA VGQH+++K+I +LGRAS+IIF L+ + VSA+ LGGVG
Sbjct: 388 LLKRFPISYALYFVAVATAAALVGQHLVRKVIAMLGRASVIIFILTLTLCVSAVLLGGVG 447
Query: 456 LAKMIKRIEHKEYMGFDNIC 475
+A MIKRIE+KEYMGF N+C
Sbjct: 448 VANMIKRIENKEYMGFGNLC 467
>gi|356528375|ref|XP_003532779.1| PREDICTED: uncharacterized protein LOC100801788 [Glycine max]
Length = 469
Score = 509 bits (1311), Expect = e-141, Method: Compositional matrix adjust.
Identities = 285/475 (60%), Positives = 357/475 (75%), Gaps = 26/475 (5%)
Query: 10 RWRL---LAAAFIGLLVVASVS-VSAQSNLKPEATSKNKN----DHVEPNIVIKVSAPKS 61
RW + + A I L++ SVS S+ N + TS + DH +
Sbjct: 8 RWDVKWVVGKALIAFLLLVSVSATSSHQNTDNKTTSPSNGTIGVDH------------HA 55
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
++ YKH WP +KFGWRI+VG ++GF G+A G+VGGVGGGGIFVPML LIVGFD KS+ A+
Sbjct: 56 KAFYKHHWPSMKFGWRIIVGAVVGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDQKSATAI 115
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+TGGA + V YNLRQRHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNVIF +WM+
Sbjct: 116 SKCMITGGATATVFYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVIFPEWML 175
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYL-SND 240
TVLLI+ F+ +S ++F KGV+TWKKETI K+EA K + G + D Y+ + D
Sbjct: 176 TVLLIIFFVGISVKSFFKGVDTWKKETIMKKEAKKNSRIDDIGSPED----DAHYIQTGD 231
Query: 241 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 300
T ++T + +K KVSIIENI WKE GLL A W +ILAL+I K +TTTCS LYW+ NLLQ+
Sbjct: 232 PTKDDTNQSRK-KVSIIENIRWKELGLLFAGWIMILALEIGKKHTTTCSRLYWLSNLLQV 290
Query: 301 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 360
P+A G+S+YEAV LYKG+R IASKGD+ T W QLV +CACG AGM+ GLLGLGGGFI
Sbjct: 291 PIAVGMSSYEAVRLYKGKRIIASKGDKQTHWCVLQLVLFCACGTLAGMIAGLLGLGGGFI 350
Query: 361 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 420
LGPLFL LGIPPQV+SAT+ + FS+SM+VVEYYLLKRFPVPYALYF A++ AA VGQ
Sbjct: 351 LGPLFLGLGIPPQVASATSTLVMAFSASMAVVEYYLLKRFPVPYALYFVAIATAAALVGQ 410
Query: 421 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
H+++K I ILGRAS+IIF L+ + VSA+ LGGVG+A MI++IE+KEYMGF ++C
Sbjct: 411 HLVRKAIAILGRASVIIFILTLTLSVSAVLLGGVGIAHMIQKIENKEYMGFGDLC 465
>gi|356511005|ref|XP_003524222.1| PREDICTED: uncharacterized protein LOC100818778 [Glycine max]
Length = 474
Score = 507 bits (1305), Expect = e-141, Method: Compositional matrix adjust.
Identities = 279/464 (60%), Positives = 351/464 (75%), Gaps = 16/464 (3%)
Query: 17 AFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGW 76
A I L++ SVS ++ T+ N + + +++ YKH WP +KFGW
Sbjct: 18 ALIAFLLLVSVSATSSHQNTDNKTTSPSNGTIGVDY-------HAKAFYKHHWPSMKFGW 70
Query: 77 RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVY 136
RI+VG I+GF G+A G+VGGVGGGGIFVPML LIVGFD KS+ A+SKCM+TGGA + V Y
Sbjct: 71 RIIVGAIVGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDQKSATAISKCMITGGATATVFY 130
Query: 137 NLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRA 196
NLRQRHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNVIF +WM+TVLLI+ F+ +S ++
Sbjct: 131 NLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVIFPEWMLTVLLIIFFVGISVKS 190
Query: 197 FLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYL-SNDTTPEETREPKKSK-- 253
F KGV+TWKKETI K+EA K + G + D Y+ + D ++T + +K K
Sbjct: 191 FFKGVDTWKKETIMKKEAKKNSRIDDIGSPED----DAHYIQTGDPAKDDTNQSRKKKTV 246
Query: 254 --VSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEA 311
VSIIENI WKE GLL A W +ILAL+I K +TTTCS L+W+LNLLQ+P+A G+S+YEA
Sbjct: 247 MQVSIIENIRWKELGLLFAGWIMILALEIGKKHTTTCSRLFWLLNLLQVPIAVGMSSYEA 306
Query: 312 VALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 371
V LYKG+R IASKGD+ T W QLV +CACG AGM+ GLLGLGGGFILGPLFL LGIP
Sbjct: 307 VRLYKGKRIIASKGDQQTHWCVLQLVLFCACGTLAGMIAGLLGLGGGFILGPLFLGLGIP 366
Query: 372 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 431
PQV+SAT+ + FS+SM+VVEYYLLKRFPVPYALYF A++ AA VGQH+++K I ILG
Sbjct: 367 PQVASATSTLVMAFSASMAVVEYYLLKRFPVPYALYFVAIATAAALVGQHLVRKAIAILG 426
Query: 432 RASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
RAS+IIF L+ + VSA+ LGGVG+A MI++IE+KEYMGF ++C
Sbjct: 427 RASVIIFILTLTLSVSAVLLGGVGIAHMIQKIENKEYMGFGDLC 470
>gi|357519217|ref|XP_003629897.1| hypothetical protein MTR_8g088080 [Medicago truncatula]
gi|355523919|gb|AET04373.1| hypothetical protein MTR_8g088080 [Medicago truncatula]
Length = 479
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 276/412 (66%), Positives = 341/412 (82%), Gaps = 4/412 (0%)
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
YKH WP +KFGWRI+VGTIIGF G++ G+VGGVGGGGIFVPML LIVGFDAKS+ A+SK
Sbjct: 65 YKHTWPGMKFGWRIIVGTIIGFLGSSFGTVGGVGGGGIFVPMLTLIVGFDAKSATAISKF 124
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M+TGGA + V+YNL++RHPTLD+PVIDYDLALLFQP+L+LGIS+GVAFN+IF DWM+T L
Sbjct: 125 MITGGAGATVLYNLKKRHPTLDLPVIDYDLALLFQPMLMLGISLGVAFNLIFPDWMLTTL 184
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEY-LSNDTTP 243
LI+ F +S AFLKGV TWKKET+TK+EA +L N +TE+ D + S +
Sbjct: 185 LIIFFTGISVNAFLKGVNTWKKETLTKKEAKDNSQL--NDIRTEDATLDLQIGESVNENQ 242
Query: 244 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVA 303
T P+K KVS+IEN+YWKE GLL +VW +ILALQI KNYTTTCS++YW+LNLLQ+P+A
Sbjct: 243 TNTNLPRK-KVSVIENVYWKELGLLFSVWIMILALQIGKNYTTTCSLIYWILNLLQVPIA 301
Query: 304 GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
GVS YEA+ LYKG+R IAS GD+G W QL+ YC+CGI AG++GGLLGLGGGFILGP
Sbjct: 302 IGVSGYEAILLYKGKRVIASNGDQGINWSVKQLILYCSCGIIAGIIGGLLGLGGGFILGP 361
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
LF+ LGIPPQV+SAT+ FA+TFS+SMSVVEYYLLKRFP+PYALYF ++ +AA VGQH++
Sbjct: 362 LFIGLGIPPQVASATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQHLV 421
Query: 424 KKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+KLI +LGRASIIIF L+ +FVS +SLGG G+A +IKR+E+KEYMGF N+C
Sbjct: 422 RKLIALLGRASIIIFILALTVFVSGISLGGTGIANLIKRLENKEYMGFGNLC 473
>gi|356528373|ref|XP_003532778.1| PREDICTED: uncharacterized protein LOC100801244 [Glycine max]
Length = 466
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 272/470 (57%), Positives = 352/470 (74%), Gaps = 22/470 (4%)
Query: 10 RWRLLAAA---FIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKV-SAPKSRSGY 65
RW + A FI LL++ +VS KN +E N V + ++ Y
Sbjct: 9 RWDVKWVAGKVFIALLLLVTVSA--------------KNTTLESNETTGVGTGYYAKVFY 54
Query: 66 KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
KH WP +++GW+I+VGT++GF G+A G+VGGVGGGGIFVPML LI+GFDAKS+ A+SKCM
Sbjct: 55 KHTWPSMEYGWKIIVGTMVGFLGSAFGNVGGVGGGGIFVPMLTLIIGFDAKSAIAISKCM 114
Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLL 185
+TGGA + V YNLRQRHPTLD+PVIDYDLALLFQP+L+LGISIGV+FNVIF DWM+T LL
Sbjct: 115 ITGGATATVFYNLRQRHPTLDLPVIDYDLALLFQPMLMLGISIGVSFNVIFPDWMLTTLL 174
Query: 186 IVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEE 245
I+ F +S ++F KGV+TWK+ET+ +EA K ++ G + Y+ ++
Sbjct: 175 IIFFTGISVKSFFKGVDTWKQETLIVKEARKNSQIDDIGSPEDAA----HYIQTGDPVKD 230
Query: 246 TREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGG 305
+ KVS+IENI+W E GLL AVW +ILAL+I KNYTTTCS +YWV+NLLQ+P+A G
Sbjct: 231 NTNQSRKKVSVIENIHWNELGLLFAVWIMILALEIGKNYTTTCSGVYWVINLLQVPIAVG 290
Query: 306 VSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 365
+S+Y+A+ LYKG+R I SKGD+ T WR QL+ +CACGI AG + GLLGLGGGFIL PLF
Sbjct: 291 MSSYQAMRLYKGQRIIGSKGDQQTNWRVLQLILFCACGILAGTIAGLLGLGGGFILAPLF 350
Query: 366 LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 425
L +GIPPQV+SAT++ A+ FS+SM+VVEYYLLKRFP+ YALYF A++ AA VGQH+++K
Sbjct: 351 LGIGIPPQVASATSILAMAFSASMAVVEYYLLKRFPISYALYFVAVATAAALVGQHLVRK 410
Query: 426 LIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+I ILGRAS+IIF L+ + VSA+ LGGVG+A MIKRIE+KEYMGF N+C
Sbjct: 411 VIAILGRASVIIFILTLTLCVSAVLLGGVGVANMIKRIENKEYMGFGNLC 460
>gi|217075809|gb|ACJ86264.1| unknown [Medicago truncatula]
Length = 481
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 276/412 (66%), Positives = 341/412 (82%), Gaps = 4/412 (0%)
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
YKH WP +KFGWRI+VGTIIGF G++ G+VGGVGGGGIFVPML LIVGFDAKS+ A+SK
Sbjct: 67 YKHTWPGMKFGWRIIVGTIIGFLGSSFGTVGGVGGGGIFVPMLTLIVGFDAKSATAISKF 126
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M+TGGA + V+YNL++RHPTLD+PVIDYDLALLFQP+L+LGIS+GVAFN+IF DWM+T L
Sbjct: 127 MITGGAGATVLYNLKKRHPTLDLPVIDYDLALLFQPMLMLGISLGVAFNLIFPDWMLTTL 186
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEY-LSNDTTP 243
LI+ F +S AFLKGV TWKKET+TK+EA +L N +TE+ D + S +
Sbjct: 187 LIIFFTGISVNAFLKGVNTWKKETLTKKEAKDNSQL--NDIRTEDATLDLQIGESVNENQ 244
Query: 244 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVA 303
T P+K KVS+IEN+YWKE GLL +VW +ILALQI KNYTTTCS++YW+LNLLQ+P+A
Sbjct: 245 TNTNLPRK-KVSVIENVYWKELGLLFSVWIMILALQIGKNYTTTCSLIYWILNLLQVPIA 303
Query: 304 GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
GVS YEA+ LYKG+R IAS GD+G W QL+ YC+CGI AG++GGLLGLGGGFILGP
Sbjct: 304 IGVSGYEAILLYKGKRVIASNGDQGINWSVKQLILYCSCGIIAGIIGGLLGLGGGFILGP 363
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
LF+ LGIPPQV+SAT+ FA+TFS+SMSVVEYYLLKRFP+PYALYF ++ +AA VGQH++
Sbjct: 364 LFIGLGIPPQVASATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQHLV 423
Query: 424 KKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+KLI +LGRASIIIF L+ +FVS +SLGG G+A +IKR+E+KEYMGF N+C
Sbjct: 424 RKLIALLGRASIIIFILALTVFVSGISLGGTGIANLIKRLENKEYMGFGNLC 475
>gi|125603285|gb|EAZ42610.1| hypothetical protein OsJ_27176 [Oryza sativa Japonica Group]
Length = 442
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 271/407 (66%), Positives = 320/407 (78%), Gaps = 12/407 (2%)
Query: 72 IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
++FGW+IV+G +IGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+SKCM+ G AV
Sbjct: 41 MEFGWKIVLGKLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAISKCMIMGAAV 100
Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
S V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW+ITVLLI+LFL
Sbjct: 101 STVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLITVLLIILFLG 160
Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE---ETRE 248
ST+AFLKGVETWKKETI KREAAK+LE I +PEY T P E++
Sbjct: 161 TSTKAFLKGVETWKKETILKREAAKRLEQIAE---------EPEYSPLPTGPNATAESKA 211
Query: 249 PKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSA 308
P S+ +N+YWKEFGLL VW L LQ+ KNY TCS YWVLN LQIPV+ GV+
Sbjct: 212 PSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLNFLQIPVSVGVTM 271
Query: 309 YEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL 368
YE + L +GRR I+SKGDE T + QL+ YC G+ AG+VGGLLGLGGGFI+GPLFLEL
Sbjct: 272 YEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLGGGFIMGPLFLEL 331
Query: 369 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
GIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYAL+F ++ AA +GQH+++KLI
Sbjct: 332 GIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTVVAFFAAIIGQHIVRKLIN 391
Query: 429 ILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
LGRAS+IIF LSF+IF+SA+SLGGVG++ MI +I EYMGFDNIC
Sbjct: 392 WLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNIC 438
>gi|356518101|ref|XP_003527721.1| PREDICTED: uncharacterized protein LOC100789748 [Glycine max]
gi|83853827|gb|ABC47859.1| membrane protein-like protein [Glycine max]
Length = 491
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 267/413 (64%), Positives = 330/413 (79%), Gaps = 10/413 (2%)
Query: 66 KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
VWP++KFGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML L++GFDAKSS A+SKCM
Sbjct: 71 DRVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCM 130
Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLL 185
+ G AVS V YNLR RHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLL
Sbjct: 131 IMGAAVSTVYYNLRFRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLL 190
Query: 186 IVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGN-GYQTEECKCDPEYLSNDTTPE 244
I+LF+ ST+A KG++TWKKETI K+EAAK LE + GY +EE +Y S P
Sbjct: 191 IILFIATSTKALFKGIDTWKKETIMKKEAAKMLESDSSPGYVSEE-----DYKS---LPA 242
Query: 245 ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAG 304
+ +P+ +V +++NIYWKE +LV VW L +QI K YT TCS+LYWVLN LQ+P+A
Sbjct: 243 GSADPRDEEVPLLKNIYWKELLVLVYVWVAFLIVQIIKTYTKTCSILYWVLNSLQVPIAI 302
Query: 305 GVSAYEAVALYKGRRKIASKGDEGTKW-RASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
V+ YEA+ L G R I+SKG E T W + ++ YC+CGI AG+V GLLGLGGGFILGP
Sbjct: 303 SVTLYEAICLCNGTRVISSKGKENTDWMKLHKICLYCSCGIIAGIVSGLLGLGGGFILGP 362
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
LFLELGIPPQV+SAT+ FA+ FSSSMSVV+YYLL+RFPVPYA YF ++ IAA GQHV+
Sbjct: 363 LFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLERFPVPYASYFILVATIAALTGQHVV 422
Query: 424 KKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 476
+K+I I GRASIIIF L+F IF+SA+SLGGVG+ M++++E+ EYMGF NIC+
Sbjct: 423 RKIIAIFGRASIIIFVLAFTIFLSAISLGGVGIENMVEKMENNEYMGFANICH 475
>gi|356509743|ref|XP_003523605.1| PREDICTED: uncharacterized protein LOC100816633 [Glycine max]
Length = 474
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 273/445 (61%), Positives = 339/445 (76%), Gaps = 11/445 (2%)
Query: 34 NLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGS 93
+ KPE+ + V N +I VWP++KFGWRIVVG+I+GFFG A GS
Sbjct: 39 DQKPESFVVKERQGVM-NSIIDFFWNDGEPTNDRVWPEMKFGWRIVVGSIVGFFGAALGS 97
Query: 94 VGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYD 153
VGGVGGGGIF+PML L++GFDAKSS A+SKCM+ G AVS V YNLR RHPTLD+PVIDYD
Sbjct: 98 VGGVGGGGIFIPMLTLVIGFDAKSSTALSKCMIMGAAVSTVYYNLRFRHPTLDLPVIDYD 157
Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
LALLFQP+L+LGISIGVAFNV+FADWM+TVLLI+LF+ ST+A KG++TWKKETI K+E
Sbjct: 158 LALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIATSTKALFKGIDTWKKETIMKKE 217
Query: 214 AAKQLELIGN-GYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVW 272
AAK LE + GY +EE +Y S P + +P+ +V +++NIYWKE +L VW
Sbjct: 218 AAKMLESDSSPGYVSEE-----DYKS---LPAGSADPRDEEVPLLKNIYWKELLVLAYVW 269
Query: 273 AIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKW- 331
L +QI K YT TCS+LYWVLN LQ+P+A V+ YEA+ L G R IASKG E T W
Sbjct: 270 VAFLIVQIIKTYTKTCSILYWVLNSLQVPIAISVTLYEAICLCNGTRVIASKGKENTDWM 329
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
+ ++ YC+CGI AG+V GLLGLGGGFILGPLFLELGIPPQV+SAT+ FA+ FSSSMSV
Sbjct: 330 KLHKICLYCSCGIIAGIVSGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSV 389
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
V+YYLL+RFPVPYA YF ++ IAA GQHV++K+I I GRASIIIF L+F IF+SA+SL
Sbjct: 390 VQYYLLERFPVPYASYFILVATIAALTGQHVVRKIIAIFGRASIIIFVLAFTIFLSAISL 449
Query: 452 GGVGLAKMIKRIEHKEYMGFDNICY 476
GGVG+ M++++E+ EYMGF NIC+
Sbjct: 450 GGVGIENMVEKMENNEYMGFANICH 474
>gi|224054356|ref|XP_002298219.1| predicted protein [Populus trichocarpa]
gi|222845477|gb|EEE83024.1| predicted protein [Populus trichocarpa]
Length = 421
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 279/426 (65%), Positives = 333/426 (78%), Gaps = 17/426 (3%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
+SGYKHVWPD+KFGW+ VVGTII F G ACGSVGGVGGGGIFVPML LIVGFD+KSSAA+
Sbjct: 1 QSGYKHVWPDMKFGWKTVVGTIITFLGAACGSVGGVGGGGIFVPMLTLIVGFDSKSSAAI 60
Query: 122 SKC-----------MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGV 170
SKC M+TG A + V YNLR RHPTL++P+IDYDLALLFQP+L+LGISIGV
Sbjct: 61 SKCNCCRLCLLLSGMITGAAATTVFYNLRLRHPTLELPIIDYDLALLFQPMLILGISIGV 120
Query: 171 AFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEEC 230
NV+FADWMI +LLI+ F+ STR+F KGVETWKKET K + N + E
Sbjct: 121 TLNVLFADWMIIILLIIFFIATSTRSFFKGVETWKKET--KSKQVSHFLYPKNLIEAAEE 178
Query: 231 KCDPEYLSNDTTPEETR-EPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCS 289
K P T+ ++ + EP K KVSIIENI WK+ G+LV VW IILALQI KNYTTTCS
Sbjct: 179 KPLP---GGTTSHDQIKPEPNKEKVSIIENIKWKDLGILVIVWLIILALQIGKNYTTTCS 235
Query: 290 VLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMV 349
+YW+LN+LQIPVA GV++YEAV+LYKGRRKI S G+ W +L+ YC G+ AG V
Sbjct: 236 AIYWLLNILQIPVAAGVTSYEAVSLYKGRRKITSMGEASNNWPVHELILYCFIGVMAGTV 295
Query: 350 GGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFF 409
GG+LGLGGGFILGPLFLE+GIPPQVSSATA FA+ FS+SMSVVE+YLLKRFPVPYALYFF
Sbjct: 296 GGMLGLGGGFILGPLFLEMGIPPQVSSATATFAMMFSASMSVVEFYLLKRFPVPYALYFF 355
Query: 410 ALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYM 469
++ IAA VGQHV++KLI ILGRAS+IIF L+ IFVSA+ +GG G+A MI++ E KEYM
Sbjct: 356 TVATIAAVVGQHVVRKLISILGRASLIIFILASTIFVSAILVGGAGMASMIEKFERKEYM 415
Query: 470 GFDNIC 475
GF+++C
Sbjct: 416 GFESMC 421
>gi|357147621|ref|XP_003574413.1| PREDICTED: uncharacterized protein LOC100846525 [Brachypodium
distachyon]
Length = 461
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 274/474 (57%), Positives = 343/474 (72%), Gaps = 28/474 (5%)
Query: 10 RWRLLAAAFIGLLVVASVSVSAQSNL------KPEATSKNKNDHVEPNIVIKVSAPKSRS 63
+W +AA I VVA+V+ + +L PE S +N R+
Sbjct: 4 KWHAVAALAIAYTVVATVAADDRGHLLAGTVTAPEGRSSLRN-----------VTSLGRT 52
Query: 64 GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
Y HVWP +K GWR+++G+ I FFG A GS+GGVGGGGIFVPML LI+GFD KSSAA+SK
Sbjct: 53 TYHHVWPPMKVGWRVLLGSFIVFFGAAFGSIGGVGGGGIFVPMLTLIIGFDTKSSAAISK 112
Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
CM+ G A+S V NL+ +HP+LD+P+IDYDLALL QP+L+LG+SIGV NVIF DW++TV
Sbjct: 113 CMIMGAALSTVYCNLKLKHPSLDMPLIDYDLALLIQPMLMLGVSIGVICNVIFPDWLVTV 172
Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTP 243
LLI LFLV ST+AFLKGVETWKKET+TKREA KQLE QT E EY +
Sbjct: 173 LLISLFLVTSTKAFLKGVETWKKETLTKREATKQLE------QTSE---QWEYTLPPSGA 223
Query: 244 EE--TREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
++ ++ P VSI +NIYWKEFGLL VW L LQI K+Y TCS YWVLN+LQIP
Sbjct: 224 DDAASKAPSDEAVSIWKNIYWKEFGLLAFVWIAFLVLQITKDYVATCSTWYWVLNILQIP 283
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
V+ GV+ Y+AV L +G+R I+SK + T +A QL YC+ G+TAG +GGLLG+GGG I+
Sbjct: 284 VSVGVTMYQAVGLAQGKRVISSKANNQTSLKAYQLFVYCSLGLTAGSMGGLLGVGGGIIM 343
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELG+PPQVSSAT+ FA+ FS+SMSVVEYYLLKRFPVPYA +F ++ +AA VGQ
Sbjct: 344 GPLFLELGVPPQVSSATSTFAMMFSASMSVVEYYLLKRFPVPYAAFFTIVAFLAAIVGQG 403
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
V++K+I LGRAS+IIF LSF+IF+SALSLGGVG + MI +I EY+GF+NIC
Sbjct: 404 VVRKMISWLGRASLIIFVLSFMIFISALSLGGVGTSNMIHKITQHEYLGFENIC 457
>gi|255551044|ref|XP_002516570.1| conserved hypothetical protein [Ricinus communis]
gi|223544390|gb|EEF45911.1| conserved hypothetical protein [Ricinus communis]
Length = 465
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 258/413 (62%), Positives = 317/413 (76%), Gaps = 14/413 (3%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
SGY+HVWP ++ GW++VVG+IIGFFG A GSVGGVGGGGIFVPML LI+GFD KS+ A+S
Sbjct: 63 SGYQHVWPGMELGWQVVVGSIIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSATAIS 122
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G A+S V YNL+ RHPTLD+P++DYDL LL QP ++LGIS GV NV F DW +T
Sbjct: 123 KCMIMGAAMSTVYYNLKLRHPTLDLPIVDYDLVLLIQPAVLLGISTGVTLNVFFPDWAVT 182
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
VLLI+L + S +A+ KGV+TW KET K EAAKQ +L+ G ++
Sbjct: 183 VLLIILLIGTSVKAYFKGVDTWNKETTLKEEAAKQYKLLPGG-------------PSNVN 229
Query: 243 PEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPV 302
P + + ++ +V ++ENI+WKEF LLV W LALQIAKNYT TCS YWVLNLLQ+PV
Sbjct: 230 PRQ-KHTREREVPVLENIHWKEFALLVFDWVAYLALQIAKNYTATCSTAYWVLNLLQVPV 288
Query: 303 AGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 362
+ GV Y AV LY+GRR+IASKGDEG +A QL+ +CA G+ AG+VGGLLGLGGGFI+G
Sbjct: 289 SVGVFLYAAVGLYRGRRRIASKGDEGMNLKAYQLLAFCAYGVMAGIVGGLLGLGGGFIMG 348
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
PLFLELGIPPQV+SATA F + FSSSMSV EYYLL RFPVPYALYF A++ IAAF GQH+
Sbjct: 349 PLFLELGIPPQVTSATATFGMAFSSSMSVAEYYLLSRFPVPYALYFIAVATIAAFAGQHI 408
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ KLI I GRAS+IIF L+F IFVSAL+LGGVG++ +I +IE E M F N+C
Sbjct: 409 ITKLIAITGRASLIIFVLAFTIFVSALTLGGVGISNIIGKIEKGESMEFQNLC 461
>gi|224094753|ref|XP_002310222.1| predicted protein [Populus trichocarpa]
gi|222853125|gb|EEE90672.1| predicted protein [Populus trichocarpa]
Length = 454
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 272/454 (59%), Positives = 338/454 (74%), Gaps = 20/454 (4%)
Query: 30 SAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIGF 86
SA+ LK + + P ++K+ +S Y+ VWPD+KF WR+VVGTI+GF
Sbjct: 13 SAERLLKDKEAETLVKKEMSPGYIVKMIHFLFQGGKSSYEPVWPDMKFDWRVVVGTIVGF 72
Query: 87 FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
FG A GSVGGVGGGGIFVPML LI+GFD KSS A+SKCM+ G A + V YNLR RHPTLD
Sbjct: 73 FGAALGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGAAGATVYYNLRLRHPTLD 132
Query: 147 IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
+P+IDYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLLI+LF+ ST+A KG++TWKK
Sbjct: 133 MPIIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIGTSTKALFKGIDTWKK 192
Query: 207 ETITKREAAKQLE---LIGNGYQTE--ECKCDPEYLSNDTTPEETREPKKSKVSIIENIY 261
E++ K+EAAKQLE G+G + E P L +D P + ENI
Sbjct: 193 ESVMKKEAAKQLESESKPGDGAEEEYQPLPSGPVVLHDDEVP------------LRENIR 240
Query: 262 WKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKI 321
WKE LLV VW L +QI + TCSV+YW+LN LQ+P+A V+ +EA+ L KG R I
Sbjct: 241 WKELALLVYVWVAFLVVQIVQTNVPTCSVMYWILNSLQVPIAASVTLFEAICLCKGTRVI 300
Query: 322 ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
ASKG E T W+ Q++ YC+CGI AG+VGGLLGLGGGFILGPLFLELGIPPQV+SAT+ F
Sbjct: 301 ASKGKEITNWKLHQVLLYCSCGIIAGIVGGLLGLGGGFILGPLFLELGIPPQVASATSTF 360
Query: 382 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 441
A+ FSSSMSVVEYYLL RFPVPYA YF ++ IAAF GQHV++++I ILGRASIIIF L+
Sbjct: 361 AMAFSSSMSVVEYYLLNRFPVPYAAYFVLVATIAAFAGQHVIRRIIAILGRASIIIFILA 420
Query: 442 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
IFVSA+SLGGVG+A M++++E+ EYMGF+N+C
Sbjct: 421 LTIFVSAISLGGVGIANMVEKLENDEYMGFENLC 454
>gi|217073872|gb|ACJ85296.1| unknown [Medicago truncatula]
Length = 474
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 250/416 (60%), Positives = 319/416 (76%), Gaps = 8/416 (1%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGG-GGIFVPMLNLIVGFDAKSSAA 120
S Y+ VWP++KFGWRIVVG+I+GFFG A GSVGGVGG GIF+PML LI+GFD KSS A
Sbjct: 59 ESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGWKGIFIPMLTLIIGFDPKSSTA 118
Query: 121 VSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWM 180
+SKCM+TG A S V NLR RHPTLD+P+IDYDLALLFQP+L+LGISIGVAFNV+FADWM
Sbjct: 119 LSKCMITGAAGSTVYCNLRLRHPTLDMPLIDYDLALLFQPMLMLGISIGVAFNVMFADWM 178
Query: 181 ITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSND 240
+T+LLI+LF+ ST+A +KG++TWKKE I K+EA ++ Q E P+Y S +
Sbjct: 179 VTILLIILFIGTSTKALVKGIDTWKKEAIMKKEAFEE------AAQMLESGSTPDYASEE 232
Query: 241 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 300
+ + +V +++NI+WKE +L+ VW L +QI K Y+ TCS+ YW+LN LQ+
Sbjct: 233 DYKSLPADLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQILKTYSKTCSIEYWLLNSLQV 292
Query: 301 PVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 360
P+A V+ +EA+ L KG R IAS+G E T W+ ++ YC CGI AGMV GLLGLGGGFI
Sbjct: 293 PIAISVTLFEAICLCKGTRVIASRGKEIT-WKFHKICLYCFCGIIAGMVSGLLGLGGGFI 351
Query: 361 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 420
LGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YY L RFP+PYA Y ++ IAA GQ
Sbjct: 352 LGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRFPIPYASYLVLVATIAALTGQ 411
Query: 421 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 476
HV++K+I I GRASII+F L+F IFVSA+SLGGVG+ M++++E+ EYMGFDN C+
Sbjct: 412 HVVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMVEKMENAEYMGFDNFCH 467
>gi|326510405|dbj|BAJ87419.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 474
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 259/416 (62%), Positives = 319/416 (76%), Gaps = 3/416 (0%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y HVWP +K GW+IV+G+ IGFFG A GS+GGVGGGG+FVPML LI+GFD KSSAA+S
Sbjct: 55 TSYHHVWPPLKLGWKIVLGSFIGFFGAAFGSIGGVGGGGVFVPMLTLIIGFDPKSSAAMS 114
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G AVS V N++ +HPTLD+PVIDYDLALL QP+L+LG+SIGV NVIF DW++T
Sbjct: 115 KCMIMGAAVSTVYCNVKLKHPTLDMPVIDYDLALLIQPMLMLGVSIGVICNVIFPDWLVT 174
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSND-- 240
VLLI+LF+V ST+AFLK VETWKKETIT A K+ E + + + S
Sbjct: 175 VLLIILFIVTSTKAFLKAVETWKKETITIAVATKRSEETSEEQEYMQLSAGLDAASQSEI 234
Query: 241 -TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 299
+ ++ P VSI +N+YWKE GLL VW LALQ+AKNY TCS+ YWVLNLLQ
Sbjct: 235 LSDHDQNETPPDEAVSIWKNVYWKEVGLLGFVWVAFLALQLAKNYMATCSISYWVLNLLQ 294
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
IPV+ GV+ YEAV L +GRR I+SK +E RA QL+ YC G+TAG+V GLLG+GGG
Sbjct: 295 IPVSVGVAMYEAVGLMQGRRVISSKRNEENSLRAHQLLVYCFLGVTAGVVAGLLGVGGGS 354
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
I+GPLFLELG+PPQV+SATA FA+ FS+SMSVVEYYLL RFPVPYALY L+ +AA +G
Sbjct: 355 IMGPLFLELGVPPQVASATATFAMMFSASMSVVEYYLLNRFPVPYALYLTILAFLAAIIG 414
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
Q V++KLI +LGRASIIIF LSF+IF+SALSLGGVG++ I +I +EYMGF+NIC
Sbjct: 415 QRVVRKLIDLLGRASIIIFILSFMIFISALSLGGVGISNTIHKIARREYMGFENIC 470
>gi|148906357|gb|ABR16333.1| unknown [Picea sitchensis]
Length = 505
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 289/496 (58%), Positives = 363/496 (73%), Gaps = 31/496 (6%)
Query: 10 RWRLLAA--AFIGLLVVASVSVSAQSNLKPEATSKNKN-------DHVEPNIVIKVSAPK 60
+W L+ A GLL+ ++S + ++ L + S D VE N +K K
Sbjct: 7 KWLLVVVMVATFGLLLAVTLSSAERTFLDGNSASSETERSMEKIVDPVEGN-TLKYHLLK 65
Query: 61 SR--------SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVG 112
+R S Y+ VWP++KFGWRIV+G+I+GFFG A GSVGGVGGGGIFVPML LI+G
Sbjct: 66 ARNFLWRSNISDYQRVWPEMKFGWRIVIGSIVGFFGAAVGSVGGVGGGGIFVPMLTLIIG 125
Query: 113 FDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAF 172
FDAKSS A+SKCM+ G A S V YNL+ +HPTLD+P+IDYDLA+LFQP+L+LGISIGVAF
Sbjct: 126 FDAKSSTAISKCMIMGAAGSTVYYNLKLKHPTLDLPIIDYDLAMLFQPMLMLGISIGVAF 185
Query: 173 NVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAK-QLELIGNGYQTEECK 231
NVIFADWM+TVLLI+LFL ST+AFLKGVETWK+ETI K+EA + ++E + + EE +
Sbjct: 186 NVIFADWMVTVLLIILFLGTSTKAFLKGVETWKQETILKKEAERVRVENVTADVEPEEHQ 245
Query: 232 -CDPEYLSNDTTPEET-----------REPKKSKVSIIENIYWKEFGLLVAVWAIILALQ 279
D + L ++ P E R P +I N+ WKE G+LV VW I L LQ
Sbjct: 246 EVDFKPLPSEPVPNEESNGNNEGSNNERLPTDVTNTIWFNVRWKELGILVVVWIIFLVLQ 305
Query: 280 IAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFY 339
+ K +TTCSV YWVLNLLQIPVA VS YEA+ LYKG + +ASKG+ G W+ SQLV Y
Sbjct: 306 VVKAKSTTCSVEYWVLNLLQIPVAMSVSLYEAIGLYKGTKAVASKGEAGINWKISQLVLY 365
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
CGI AG+VGGLLGLGGGFILGPL LELG+PPQVSSATA F +TFSSSMSVVEYY LKR
Sbjct: 366 FFCGILAGVVGGLLGLGGGFILGPLLLELGVPPQVSSATATFVMTFSSSMSVVEYYFLKR 425
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
FPVPYA Y F + +IAAF+GQHV+++L+ +LGRAS+IIF L+F+IF+SA++LGGVG+ KM
Sbjct: 426 FPVPYAAYLFGVCVIAAFMGQHVIRRLVILLGRASLIIFILAFVIFLSAITLGGVGIVKM 485
Query: 460 IKRIEHKEYMGFDNIC 475
I + E+ +YMGF+N+C
Sbjct: 486 IHKFENGDYMGFENLC 501
>gi|225430784|ref|XP_002267318.1| PREDICTED: uncharacterized protein LOC100253061 [Vitis vinifera]
Length = 476
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 280/448 (62%), Positives = 341/448 (76%), Gaps = 16/448 (3%)
Query: 32 QSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIGFFG 88
+++ +P NKN EP + KV +S Y+HVWPD+KFGW++VV +IIGFFG
Sbjct: 40 KADDRPAEHLVNKNK--EPGFLSKVVHFLWQSGKSSYQHVWPDMKFGWKLVVASIIGFFG 97
Query: 89 TACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP 148
A GSVGGVGGGGIFVPML LIVGFD KSS A+SKCM+ G A S V YN+R RHPTLD+P
Sbjct: 98 AALGSVGGVGGGGIFVPMLTLIVGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPTLDMP 157
Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
+IDYDLALLFQP+L+LGISIGVAFNVIFADWM+TVLLI+LFL ST+A KG+ETWKKET
Sbjct: 158 IIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIILFLGTSTKALFKGIETWKKET 217
Query: 209 ITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLL 268
I K+EAA+ LE E K +Y + P R+ V +I NI WKE LL
Sbjct: 218 IMKKEAARLLE--------AETKPTGDYRPLPSGPATIRD---DHVPVIYNICWKELALL 266
Query: 269 VAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEG 328
V VW LA+QI K YT TCS+ YWVLN LQ+P+A VS YEAV LYKG R IASKG E
Sbjct: 267 VYVWVAFLAVQIVKTYTVTCSIEYWVLNCLQVPIAASVSIYEAVCLYKGTRVIASKGKEV 326
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 388
T W+ Q+ YC+CGI AG+VGGLLGLGGGFILGPLFLELGIPPQV+SAT+ F++ FSSS
Sbjct: 327 TNWKIHQIFLYCSCGIVAGLVGGLLGLGGGFILGPLFLELGIPPQVASATSTFSMAFSSS 386
Query: 389 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 448
MSVV+Y+LL+RFPVPYA YF ++ IAA VGQHV++K+IK++GRASIIIF L+ IF+SA
Sbjct: 387 MSVVQYHLLRRFPVPYASYFVLVATIAALVGQHVVRKVIKLVGRASIIIFILALTIFISA 446
Query: 449 LSLGGVGLAKMIKRIEHKEYMGFDNICY 476
+SLGGVG+ M++++ + EYMGFD++CY
Sbjct: 447 ISLGGVGIMTMVEKLANNEYMGFDDLCY 474
>gi|242044426|ref|XP_002460084.1| hypothetical protein SORBIDRAFT_02g022510 [Sorghum bicolor]
gi|241923461|gb|EER96605.1| hypothetical protein SORBIDRAFT_02g022510 [Sorghum bicolor]
Length = 470
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 263/414 (63%), Positives = 320/414 (77%), Gaps = 6/414 (1%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y+HVWP ++ GW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 58 NSYQHVWPTMELGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 117
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G +VS V YNL+ +HPTLD+PVIDYDLA+L QP+L+LGISIGV FNVIF DW++T
Sbjct: 118 KCMIMGASVSTVYYNLKLKHPTLDMPVIDYDLAVLIQPMLMLGISIGVIFNVIFPDWLVT 177
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI-GNGYQTEECKCDPEYLSNDT 241
VLLI+LFL ST+AFLKG+ETWKKETI +REAAK+LE I G + P +N
Sbjct: 178 VLLIILFLGTSTKAFLKGIETWKKETIIQREAAKRLEQIAGEEAEYAPLPTGPGAAANKK 237
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
T P S+I+NI+WKE GLL VW L LQ+ KNYT TCS YWVLNLLQ+P
Sbjct: 238 TLSSDEAP-----SLIKNIHWKEVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQVP 292
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
V+ GV+ YE L G+R ++SKG E T + Q+V Y GI AG+VGGLLGLGGGFI+
Sbjct: 293 VSVGVTLYEGFGLMSGKRVLSSKGSEQTTMKFHQVVVYGLFGIAAGLVGGLLGLGGGFIM 352
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYA+YF ++ AA +GQH
Sbjct: 353 GPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYAVYFTIVAFAAALIGQH 412
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++KLI LGRAS+IIF L+F+IFVSALSLGGVG++ M+ +I EYMGF+N+C
Sbjct: 413 AVRKLINWLGRASLIIFILAFMIFVSALSLGGVGISNMVHKIARHEYMGFENLC 466
>gi|255561190|ref|XP_002521607.1| conserved hypothetical protein [Ricinus communis]
gi|223539285|gb|EEF40878.1| conserved hypothetical protein [Ricinus communis]
Length = 478
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 264/446 (59%), Positives = 339/446 (76%), Gaps = 11/446 (2%)
Query: 35 LKPEATSKNKNDHVEPNIVIKVSA----PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTA 90
K + +K DH + ++ +A +S Y+ VWP+++F ++IVVGTI+GF G A
Sbjct: 40 FKEQNHTKVLVDHEKEKSFLQKAAHFLWQSGKSSYEPVWPEMEFNYKIVVGTIVGFLGAA 99
Query: 91 CGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVI 150
GSVGGVGGGGIFVPML L++GFD KSS A+SKCM+ G A S V YN+R RHPTLD+P+I
Sbjct: 100 LGSVGGVGGGGIFVPMLTLLIGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPTLDMPLI 159
Query: 151 DYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT 210
DYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLLI+LF+ ST+A KG++TWKKE++
Sbjct: 160 DYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLLIILFIGTSTKALFKGIDTWKKESMM 219
Query: 211 KREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVA 270
K+EAA+QLE +E D E P + ++S+ +NIYWKE LLV
Sbjct: 220 KKEAARQLE-------SESKPNDGEGQDYKPLPSGPVALEDEEISLFQNIYWKELALLVY 272
Query: 271 VWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTK 330
VW LA+QI K+Y TCSV YW+LN LQ+P+A V+ +EA+ L KG R IAS+G E T
Sbjct: 273 VWVGFLAVQIVKSYVRTCSVAYWILNALQVPIAASVTLFEAICLRKGTRVIASRGKEITN 332
Query: 331 WRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMS 390
W+ Q++ YC+CGI AGMVGGLLGLGGGFILGPLFLELGIPPQV+SAT+ FA+ FSSSMS
Sbjct: 333 WKYHQILLYCSCGIIAGMVGGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMS 392
Query: 391 VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALS 450
VV+YYLLKRFPVPYA +F ++ +AA VGQHV++K+I ILGRASIIIF L+ IFVSA+S
Sbjct: 393 VVQYYLLKRFPVPYATFFVLVATVAALVGQHVVRKIIAILGRASIIIFILALTIFVSAIS 452
Query: 451 LGGVGLAKMIKRIEHKEYMGFDNICY 476
LGGVG+ M++++ ++EYMGF+N+CY
Sbjct: 453 LGGVGIVDMVEKLTNEEYMGFENLCY 478
>gi|224134334|ref|XP_002327812.1| predicted protein [Populus trichocarpa]
gi|222836897|gb|EEE75290.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 268/449 (59%), Positives = 336/449 (74%), Gaps = 10/449 (2%)
Query: 30 SAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIGF 86
SA+ LK + + + P ++KV +S Y+ VWPD+KF WRIVVGTI+GF
Sbjct: 33 SAERLLKDKEPEASAEKEMSPGYIVKVLHFLFQGGKSSYEPVWPDMKFDWRIVVGTIVGF 92
Query: 87 FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
G A GSVGGVGGGGIFVPML LI+GFD KSS A+SKCM+ G A + V YNLR RHPTLD
Sbjct: 93 LGAALGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGAAGATVYYNLRLRHPTLD 152
Query: 147 IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
+P+IDYDL LLFQP+L+LGISIGV+F+V+FADWM+TVLLI+LF+ ST+A KG++TWKK
Sbjct: 153 MPLIDYDLTLLFQPMLMLGISIGVSFSVMFADWMVTVLLIILFIGTSTKALFKGIDTWKK 212
Query: 207 ETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG 266
ET+ +++AAKQLE E P P++ +V + ENIYWKE
Sbjct: 213 ETMMRKDAAKQLESESKPEDGAEEDYKPLPSGPVVLPDD-------EVPLRENIYWKEVA 265
Query: 267 LLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGD 326
LL VWA L +QI + Y TCSV YW+LN LQ+P+A V+ +EA+ L KG R IASKG
Sbjct: 266 LLFYVWAGFLVVQIVQTYFPTCSVTYWILNSLQVPIAASVTLFEAICLCKGTRVIASKGK 325
Query: 327 EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFS 386
E T W+ Q++ YC+CGI AGMVGGLLGLGGGFILGPLFLELGIPPQV+SAT+ FA+ FS
Sbjct: 326 EITNWKLHQILLYCSCGIIAGMVGGLLGLGGGFILGPLFLELGIPPQVASATSTFAMAFS 385
Query: 387 SSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
SSMSVVEYYLL RFPVPYA YF ++ I+AFVGQHV++K+I LGRAS+IIF L+ IFV
Sbjct: 386 SSMSVVEYYLLNRFPVPYAAYFALVATISAFVGQHVVRKIIAFLGRASLIIFILALTIFV 445
Query: 447 SALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
SA+SLGGVG+A M++++E++EYMGF+N+C
Sbjct: 446 SAVSLGGVGIANMVEKLENEEYMGFENLC 474
>gi|414885094|tpg|DAA61108.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
Length = 470
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 259/414 (62%), Positives = 321/414 (77%), Gaps = 6/414 (1%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y+HVWP ++FGW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 58 NSYQHVWPPMEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 117
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G +VS V YNL+ +HPTLD+PVIDYDLA+L QP+L+LGISIGV FNVIF DW++T
Sbjct: 118 KCMIMGASVSTVYYNLKLKHPTLDMPVIDYDLAVLMQPMLMLGISIGVIFNVIFPDWLVT 177
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI-GNGYQTEECKCDPEYLSNDT 241
VLLI+LFL ST+AFLKG+ETWKKET+ +REAAK LE G + P+ +N
Sbjct: 178 VLLIILFLGTSTKAFLKGIETWKKETVIQREAAKLLEQTAGEEAEYAALPTGPDVAANKK 237
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
P S+I+NI+WK+ GLL VW L LQ+ KNYT TCS YWVLNLLQ+P
Sbjct: 238 ALTSDEAP-----SLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQVP 292
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
V+ GV+ YE L G+R ++SKG T + Q+V Y GI AG+VGGLLGLGGGFI+
Sbjct: 293 VSVGVTLYEGFGLMSGKRVLSSKGSGQTTMKFHQVVVYGLFGIAAGLVGGLLGLGGGFIM 352
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYA+YF ++ +AA +GQH
Sbjct: 353 GPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYAVYFIVVAFVAAIIGQH 412
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
V++KLI LGRAS+IIF L+F+IFVSA+SLGGVG++ M+ +IE EYMGF+++C
Sbjct: 413 VVRKLINWLGRASLIIFILAFMIFVSAISLGGVGVSNMVHKIERHEYMGFEDLC 466
>gi|297735172|emb|CBI17534.3| unnamed protein product [Vitis vinifera]
Length = 481
Score = 479 bits (1234), Expect = e-132, Method: Compositional matrix adjust.
Identities = 278/448 (62%), Positives = 341/448 (76%), Gaps = 11/448 (2%)
Query: 32 QSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGWRIVVGTIIGFFG 88
+++ +P NKN EP + KV +S Y+HVWPD+KFGW++VV +IIGFFG
Sbjct: 40 KADDRPAEHLVNKNK--EPGFLSKVVHFLWQSGKSSYQHVWPDMKFGWKLVVASIIGFFG 97
Query: 89 TACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP 148
A GSVGGVGGGGIFVPML LIVGFD KSS A+SKCM+ G A S V YN+R RHPTLD+P
Sbjct: 98 AALGSVGGVGGGGIFVPMLTLIVGFDPKSSTAISKCMIMGAAGSTVYYNMRLRHPTLDMP 157
Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
+IDYDLALLFQP+L+LGISIGVAFNVIFADWM+TVLLI+LFL ST+A KG+ETWKKET
Sbjct: 158 IIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIILFLGTSTKALFKGIETWKKET 217
Query: 209 ITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLL 268
I K+EAA+ LE + + +Y + P R+ V +I NI WKE LL
Sbjct: 218 IMKKEAARLLE---AETKPTDNTGGGDYRPLPSGPATIRD---DHVPVIYNICWKELALL 271
Query: 269 VAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEG 328
V VW LA+QI K YT TCS+ YWVLN LQ+P+A VS YEAV LYKG R IASKG E
Sbjct: 272 VYVWVAFLAVQIVKTYTVTCSIEYWVLNCLQVPIAASVSIYEAVCLYKGTRVIASKGKEV 331
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 388
T W+ Q+ YC+CGI AG+VGGLLGLGGGFILGPLFLELGIPPQV+SAT+ F++ FSSS
Sbjct: 332 TNWKIHQIFLYCSCGIVAGLVGGLLGLGGGFILGPLFLELGIPPQVASATSTFSMAFSSS 391
Query: 389 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 448
MSVV+Y+LL+RFPVPYA YF ++ IAA VGQHV++K+IK++GRASIIIF L+ IF+SA
Sbjct: 392 MSVVQYHLLRRFPVPYASYFVLVATIAALVGQHVVRKVIKLVGRASIIIFILALTIFISA 451
Query: 449 LSLGGVGLAKMIKRIEHKEYMGFDNICY 476
+SLGGVG+ M++++ + EYMGFD++CY
Sbjct: 452 ISLGGVGIMTMVEKLANNEYMGFDDLCY 479
>gi|125561413|gb|EAZ06861.1| hypothetical protein OsI_29096 [Oryza sativa Indica Group]
Length = 469
Score = 474 bits (1219), Expect = e-131, Method: Compositional matrix adjust.
Identities = 252/414 (60%), Positives = 314/414 (75%), Gaps = 11/414 (2%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
+ Y HVWP ++FGW+IV+G+ +GF G A GS+GGVGGGG FVPML LI+GFDAKSS A+
Sbjct: 63 ENSYHHVWPPMEFGWQIVLGSFVGFIGAAFGSIGGVGGGGFFVPMLTLIIGFDAKSSVAI 122
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+ G AVS V NL+++HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW++
Sbjct: 123 SKCMIMGAAVSTVYCNLKRKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVIFPDWLV 182
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
TVLLI+LFL ST+AFLKG+ETWKKETI KREA K+ E +EE + P
Sbjct: 183 TVLLIILFLGTSTKAFLKGIETWKKETIIKREAEKRSE-----QTSEELEYRP------V 231
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
E++ P VSI+ N+YWKEFGLL VW LALQ+ KNY TCS YWVLNLLQIP
Sbjct: 232 PASESKPPSDEAVSILHNVYWKEFGLLAFVWIAFLALQVTKNYMPTCSTWYWVLNLLQIP 291
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
V+ GV+ YE + L +GRR I+S G+E T + QL+ YC GITAG+V GLLG+GGG IL
Sbjct: 292 VSVGVTMYEGLGLMQGRRVISSNGNEQTNLKFHQLLMYCFFGITAGIVAGLLGVGGGSIL 351
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GP+FL+LG+PPQV+SATA F++ FS+SMS VEYY L RFPVPYALY ++ +A VGQ
Sbjct: 352 GPMFLDLGVPPQVASATATFSMMFSASMSAVEYYFLDRFPVPYALYLTVVAFFSAIVGQR 411
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+++K+I LGRASIIIFTLS +IF+S + LGG+G+ I +IE EYMGF++IC
Sbjct: 412 MVRKVINWLGRASIIIFTLSIMIFLSTIPLGGIGIVNWIGKIERHEYMGFEDIC 465
>gi|356511007|ref|XP_003524223.1| PREDICTED: uncharacterized protein LOC100819316 [Glycine max]
Length = 454
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 275/464 (59%), Positives = 343/464 (73%), Gaps = 32/464 (6%)
Query: 12 RLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPD 71
R+L AF L+ V S + KP ++ N V+ N + YKH WP+
Sbjct: 17 RVLLVAFFLLVSVYVASSHQNTEQKPVSSPSNDTKGVDTN------GHHANGFYKHTWPE 70
Query: 72 IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
+KFGW+I+VG+IIGF G+A G+VGGVGGGGIFVPML LI+GFDA+S+ A+SKCM+TGGA
Sbjct: 71 MKFGWKIIVGSIIGFLGSAFGTVGGVGGGGIFVPMLTLIIGFDARSATAISKCMITGGAG 130
Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
+ V YNL+QRHPTLD+PVIDYDLALLFQP+L+LGIS+GVAFNVIF DWM+T LLI++F+
Sbjct: 131 ATVFYNLKQRHPTLDMPVIDYDLALLFQPMLMLGISVGVAFNVIFPDWMLTALLIIVFIG 190
Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKK 251
LS AF KGV+TWKKETI K+ Q +L T +
Sbjct: 191 LSVNAFFKGVKTWKKETIIKKCFHFQHLF---------------FLMKSKTVLQ------ 229
Query: 252 SKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEA 311
VS+IENIYWKE G+LV+VW +ILALQI KNYTT CS LYW++NLLQ+P+ G + YEA
Sbjct: 230 --VSVIENIYWKELGILVSVWILILALQIGKNYTTNCSALYWIMNLLQVPITVGTTFYEA 287
Query: 312 VALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 371
V LYKG+R IASKGD+ T+WR QL+ YC+CGI AG++GGLLGLGGGFILGPLF+ LGI
Sbjct: 288 VLLYKGQRVIASKGDQQTRWRVQQLILYCSCGIIAGIIGGLLGLGGGFILGPLFIGLGIH 347
Query: 372 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 431
PQVSSAT+ FA+TFS+SMSVVEYYLLKRFP+PYALYF A++ AA VGQH+++K+I ILG
Sbjct: 348 PQVSSATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVAVATAAALVGQHLVRKVIAILG 407
Query: 432 RASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
R S+IIF L+ +FVS +SLGGVG+AK+ IE K M F+N+C
Sbjct: 408 RTSLIIFILALTVFVSGISLGGVGIAKL---IEKKVQMKFENLC 448
>gi|357518359|ref|XP_003629468.1| hypothetical protein MTR_8g077830 [Medicago truncatula]
gi|355523490|gb|AET03944.1| hypothetical protein MTR_8g077830 [Medicago truncatula]
Length = 480
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 253/422 (59%), Positives = 324/422 (76%), Gaps = 14/422 (3%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
S Y+ VWP++KFGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML LI+GFD KSS A+
Sbjct: 59 ESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLIIGFDPKSSTAL 118
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+TG A S V YNLR RHPTLD+P+IDYDLALLFQP+L+LGISIGVAFNV+FADWM+
Sbjct: 119 SKCMITGAAGSTVYYNLRLRHPTLDMPLIDYDLALLFQPMLMLGISIGVAFNVMFADWMV 178
Query: 182 TVLLIVLFL-------VLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDP 234
T+LLI+LF+ ST+A +KG++TWKKETI K+EA ++ Q E P
Sbjct: 179 TILLIILFIGNNHYLFSTSTKALVKGIDTWKKETIMKKEAFEE------AAQMLESGSTP 232
Query: 235 EYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWV 294
+Y S + + + +V +++NI+WKE +L+ VW L +QI K Y+ TCS+ YW+
Sbjct: 233 DYASEEDYKSLPADLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQILKTYSKTCSIEYWL 292
Query: 295 LNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLG 354
LN LQ+P+A V+ +EA+ L KG R IAS+G E T W+ ++ YC CGI AGMV GLLG
Sbjct: 293 LNSLQVPIAISVTLFEAICLCKGTRVIASRGKEIT-WKFHKICLYCFCGIIAGMVSGLLG 351
Query: 355 LGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII 414
LGGGFILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YY L RFP+PYA Y ++ I
Sbjct: 352 LGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRFPIPYASYLVLVATI 411
Query: 415 AAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNI 474
AA GQHV++K+I I GRASII+F L+F IFVSA+SLGGVG+ M++++E++EYMGFDN+
Sbjct: 412 AALTGQHVVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMVEKMENEEYMGFDNL 471
Query: 475 CY 476
C+
Sbjct: 472 CH 473
>gi|338762845|gb|AEI98632.1| hypothetical protein 111018.19 [Coffea canephora]
Length = 465
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 257/415 (61%), Positives = 320/415 (77%), Gaps = 21/415 (5%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
+S Y+HVWP+++FGW++VVGT++GF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+
Sbjct: 70 KSSYQHVWPEMEFGWKLVVGTVVGFLGAALGSVGGVGGGGIFVPMLSLIIGFDPKSSTAI 129
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+ G A S V YNLR RHPTLD+P+IDYDLALLFQP+L+LGISIGVAFNVIFADWM+
Sbjct: 130 SKCMIMGAAGSTVYYNLRLRHPTLDLPLIDYDLALLFQPMLMLGISIGVAFNVIFADWMV 189
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
TVLLI+LF+ S++AF KG+ETW KET K +G G + + P +++
Sbjct: 190 TVLLILLFIGTSSKAFFKGLETWNKETKMK---------LGPGNEYKPLPGGPAAHADE- 239
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
KV ++ NIYWKE LL+ VW LA+QI K YT TCS YW LN LQ+P
Sbjct: 240 -----------KVPLLYNIYWKELCLLLFVWIAFLAIQIMKTYTQTCSAKYWTLNFLQVP 288
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
+A VS YEA LYKG R IASKG E T W+ + + YC GI AG+VGGLLGLGGGF+L
Sbjct: 289 IAASVSLYEATCLYKGTRTIASKGKEITVWKPNLIFLYCCLGIVAGVVGGLLGLGGGFVL 348
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELG+PPQV+SAT+ FA+TFSSSMSVV+YYLL RFPVPYA YF ++ +AA VGQH
Sbjct: 349 GPLFLELGVPPQVASATSTFAMTFSSSMSVVQYYLLNRFPVPYATYFVLVATVAALVGQH 408
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 476
V++++I ILGRAS+IIF L+ IFVSA+SLGGVG+A MI+++++ EYMGFDN+C+
Sbjct: 409 VVRRIIAILGRASLIIFILALTIFVSAISLGGVGIASMIEKLQNHEYMGFDNLCH 463
>gi|388493882|gb|AFK35007.1| unknown [Medicago truncatula]
Length = 473
Score = 469 bits (1208), Expect = e-130, Method: Compositional matrix adjust.
Identities = 251/415 (60%), Positives = 321/415 (77%), Gaps = 7/415 (1%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
S Y+ VWP++KFGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML LI+GFD KSS A+
Sbjct: 59 ESSYERVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLIIGFDPKSSTAL 118
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+TG A S V NLR RHPTLD+P+IDYDLALLFQP+L+LGISIGVAFNV+FADWM+
Sbjct: 119 SKCMITGAAGSTVYCNLRLRHPTLDMPLIDYDLALLFQPMLMLGISIGVAFNVMFADWMV 178
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
T+LLI+LF+ ST+A +KG++TWKKE I K+EA ++ Q E P+Y S +
Sbjct: 179 TILLIILFIGTSTKALVKGIDTWKKEAIMKKEAFEE------AAQMLESGSTPDYASEED 232
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
+ + +V +++NI+WKE +L+ VW L +QI K Y+ TCS+ YW+LN LQ+P
Sbjct: 233 YKSLPADLQDEEVPLLDNIHWKELSVLMYVWVAFLIVQILKTYSKTCSIEYWLLNSLQVP 292
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
+A V+ +EA+ L KG R IAS+G E T W+ ++ YC CGI AGMV GLLGLGGGFIL
Sbjct: 293 IAISVTLFEAICLCKGTRVIASRGKEIT-WKFHKICLYCFCGIIAGMVSGLLGLGGGFIL 351
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YY L RFP+PYA Y ++ IAA GQH
Sbjct: 352 GPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYHLDRFPIPYASYLVLVATIAALTGQH 411
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 476
V++K+I I GRASII+F L+F IFVSA+SLGGVG+ M++++E+ EYMGFDN+C+
Sbjct: 412 VVRKIIAIFGRASIIVFILAFTIFVSAISLGGVGIGNMVEKMENAEYMGFDNLCH 466
>gi|115476232|ref|NP_001061712.1| Os08g0387200 [Oryza sativa Japonica Group]
gi|40253486|dbj|BAD05436.1| unknown protein [Oryza sativa Japonica Group]
gi|113623681|dbj|BAF23626.1| Os08g0387200 [Oryza sativa Japonica Group]
gi|125603275|gb|EAZ42600.1| hypothetical protein OsJ_27163 [Oryza sativa Japonica Group]
Length = 469
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 251/414 (60%), Positives = 314/414 (75%), Gaps = 11/414 (2%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
+ Y HVWP ++FGW+IV+G+ +GF G A GS+GGVGGGG F+PML LI+GFDAKSS A+
Sbjct: 63 ENSYHHVWPPMEFGWQIVLGSFVGFIGAAFGSIGGVGGGGFFMPMLTLIIGFDAKSSVAI 122
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+ G AVS V NL+++HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW++
Sbjct: 123 SKCMIMGAAVSTVYCNLKRKHPTLDMPVIDYDLALLIQPMLMLGISIGVIFNVIFPDWLV 182
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
TVLLI+LFL ST+AFLKG+ETWKKETI KREA K+ E +EE + P
Sbjct: 183 TVLLIILFLGTSTKAFLKGIETWKKETIIKREAEKRSE-----QTSEELEYRP------V 231
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
E++ P VSI+ N+YWKEFGLL VW LALQ+ KNY TCS YWVLNLLQIP
Sbjct: 232 PASESKPPSDEAVSILHNVYWKEFGLLAFVWIAFLALQVTKNYMPTCSTWYWVLNLLQIP 291
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
V+ GV+ YE + L +GRR I+S G+E T + QL+ YC GITAG+V GLLG+GGG IL
Sbjct: 292 VSVGVTMYEGLGLMQGRRVISSNGNEQTNLKFHQLLMYCFFGITAGIVAGLLGVGGGSIL 351
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GP+FL+LG+PPQV+SATA F++ FS+SMS VEYY L RFPVPYALY ++ +A VGQ
Sbjct: 352 GPMFLDLGVPPQVASATATFSMMFSASMSAVEYYFLDRFPVPYALYLTVVAFFSAIVGQR 411
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+++K+I LGRASIIIFTLS +IF+S + LGG+G+ I +IE EYMGF++IC
Sbjct: 412 MVRKVINWLGRASIIIFTLSIMIFLSTIPLGGIGIVNWIGKIERHEYMGFEDIC 465
>gi|356528377|ref|XP_003532780.1| PREDICTED: uncharacterized protein LOC100802321 [Glycine max]
Length = 434
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 274/469 (58%), Positives = 338/469 (72%), Gaps = 51/469 (10%)
Query: 10 RWRLLAAA---FIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYK 66
RW + A + L+V+ S+Q N + S N+ + A + YK
Sbjct: 8 RWAVKWVAEKVLVAFLLVSVYVASSQQNTDQKTISSASNETKG----VDTKAHHAEGFYK 63
Query: 67 HVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMV 126
H WP++KFGW+I+VG+IIGF G+A G+VGGVGGGGIFVPML LI+GFDA+S+ A+SKCM+
Sbjct: 64 HTWPEMKFGWKIIVGSIIGFLGSAFGTVGGVGGGGIFVPMLTLIIGFDARSATAISKCMI 123
Query: 127 TGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLI 186
TGGA + + YNL+QRHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNVIF DWM+T LLI
Sbjct: 124 TGGAGATIFYNLKQRHPTLDMPVIDYDLALLFQPMLMLGISIGVAFNVIFPDWMLTALLI 183
Query: 187 VLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEET 246
V+F+ LS AF KGV+TWKKETI
Sbjct: 184 VVFIGLSFNAFFKGVKTWKKETIV------------------------------------ 207
Query: 247 REPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGV 306
+KVS+IENIYWKE G+LV+VW +ILALQI KNYTT CS LYW LNLLQ+P+ G
Sbjct: 208 -----NKVSVIENIYWKELGILVSVWILILALQIGKNYTTNCSALYWALNLLQVPITVGT 262
Query: 307 SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
+ YEAV LYKG+RKIASKGD+ T+WR QL+ YC CGI AG++GGLLGLGGGFILGPLF+
Sbjct: 263 TVYEAVLLYKGKRKIASKGDQQTRWRVHQLILYCTCGIIAGIIGGLLGLGGGFILGPLFI 322
Query: 367 ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
LGI PQVSSAT+ FA+TFS+SMSVVEYYLLKRFP+PYALYF A++ AA VGQH+++K+
Sbjct: 323 GLGIHPQVSSATSTFAMTFSASMSVVEYYLLKRFPIPYALYFVAVATAAALVGQHLVRKV 382
Query: 427 IKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
I ILGR S+IIF L+ +FVS +SLGGVG+AK+ IE K M F+N+C
Sbjct: 383 IAILGRTSLIIFILALTVFVSGISLGGVGIAKL---IEKKVQMKFENLC 428
>gi|356496755|ref|XP_003517231.1| PREDICTED: uncharacterized protein LOC100801441 [Glycine max]
Length = 470
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 258/416 (62%), Positives = 323/416 (77%), Gaps = 11/416 (2%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
+ +S Y+ VWP++KFGWRI+VG+IIGF G A GSVGGVGGGGIFVPML LI+GFD KSS
Sbjct: 63 EGKSSYEPVWPNMKFGWRIIVGSIIGFLGAALGSVGGVGGGGIFVPMLALIIGFDPKSST 122
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+ G ++S V YNLR RHPTLD+P+IDYDLAL+FQP+L+LGISIGV NV+FADW
Sbjct: 123 AISKCMIMGASISTVYYNLRLRHPTLDMPLIDYDLALIFQPMLMLGISIGVICNVMFADW 182
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSN 239
M+TVLLI+LF+ ST+A KG++TWKKETI K+EA+K LE E K +Y S
Sbjct: 183 MVTVLLIILFIATSTKATYKGIDTWKKETIAKKEASKLLE--------AEPKSGDDYKSL 234
Query: 240 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 299
+ P E+ + +++NIYWKE LL VW +QI K YT CS+ +W+LN LQ
Sbjct: 235 PSGPTESL---FEEAPLLKNIYWKELSLLAYVWVAFFIVQIVKEYTKPCSIQFWLLNFLQ 291
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
+PVA V+ +EA+ LYKG R IASKG E T W+ Q+ YC+ GI AGMVGGLLGLGGGF
Sbjct: 292 VPVAVSVTLFEAIGLYKGTRVIASKGKEVTNWKIHQICLYCSTGIMAGMVGGLLGLGGGF 351
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
ILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YYLL RFPVPYA YF ++ IAAF G
Sbjct: 352 ILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLDRFPVPYASYFALVATIAAFTG 411
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
QHV++K+I +LGRASIIIF L+ IF+SA+SLGGVG+ +I++IE+ EYMGF+++C
Sbjct: 412 QHVVRKVIVVLGRASIIIFILALTIFISAISLGGVGIENIIEKIENHEYMGFEDLC 467
>gi|414885096|tpg|DAA61110.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
Length = 404
Score = 466 bits (1200), Expect = e-129, Method: Compositional matrix adjust.
Identities = 254/405 (62%), Positives = 314/405 (77%), Gaps = 6/405 (1%)
Query: 72 IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
++FGW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+SKCM+ G +V
Sbjct: 1 MEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAISKCMIMGASV 60
Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
S V YNL+ +HPTLD+PVIDYDLA+L QP+L+LGISIGV FNVIF DW++TVLLI+LFL
Sbjct: 61 STVYYNLKLKHPTLDMPVIDYDLAVLMQPMLMLGISIGVIFNVIFPDWLVTVLLIILFLG 120
Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELI-GNGYQTEECKCDPEYLSNDTTPEETREPK 250
ST+AFLKG+ETWKKET+ +REAAK LE G + P+ +N P
Sbjct: 121 TSTKAFLKGIETWKKETVIQREAAKLLEQTAGEEAEYAALPTGPDVAANKKALTSDEAP- 179
Query: 251 KSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYE 310
S+I+NI+WK+ GLL VW L LQ+ KNYT TCS YWVLNLLQ+PV+ GV+ YE
Sbjct: 180 ----SLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQVPVSVGVTLYE 235
Query: 311 AVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGI 370
L G+R ++SKG T + Q+V Y GI AG+VGGLLGLGGGFI+GPLFLELGI
Sbjct: 236 GFGLMSGKRVLSSKGSGQTTMKFHQVVVYGLFGIAAGLVGGLLGLGGGFIMGPLFLELGI 295
Query: 371 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 430
PPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYA+YF ++ +AA +GQHV++KLI L
Sbjct: 296 PPQVSSATATFAMMFSSSMSVVEYYLLNRFPVPYAVYFIVVAFVAAIIGQHVVRKLINWL 355
Query: 431 GRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GRAS+IIF L+F+IFVSA+SLGGVG++ M+ +IE EYMGF+++C
Sbjct: 356 GRASLIIFILAFMIFVSAISLGGVGVSNMVHKIERHEYMGFEDLC 400
>gi|357147619|ref|XP_003574412.1| PREDICTED: uncharacterized protein LOC100846223 [Brachypodium
distachyon]
Length = 468
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 273/469 (58%), Positives = 344/469 (73%), Gaps = 11/469 (2%)
Query: 10 RWRLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSA---PKSRSGYK 66
+W +AA I V+A+ + + L + + + V + + KV++ K + Y
Sbjct: 4 QWHAVAALCISCTVMAAAVGADRGRLFASTAAAPEPEDV--SFLRKVASYMWQKDGTTYH 61
Query: 67 HVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMV 126
HVWP +KFGW+IV+G+ GF G A GS+GGVGGGGIFVPML LI+GFD KS+AA+SKCM+
Sbjct: 62 HVWPPMKFGWKIVLGSFFGFIGGAFGSIGGVGGGGIFVPMLTLIIGFDPKSAAAMSKCMI 121
Query: 127 TGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLI 186
G AVS V NL+ +HPTLD+PVIDYDL LL QP+L+LG+SIGV NVIF +W++TVLLI
Sbjct: 122 MGAAVSTVYCNLKLKHPTLDMPVIDYDLVLLIQPMLMLGVSIGVICNVIFPNWLVTVLLI 181
Query: 187 VLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEET 246
+LFLV ST+AFLKGVETWKKETI +EA K+ E I +EE + P D+ P ET
Sbjct: 182 ILFLVTSTKAFLKGVETWKKETIIIKEAEKRSEQI-----SEETEYTPLPTGPDS-PAET 235
Query: 247 REPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGV 306
P VSI +N++WKEFGLLV VW LALQ+ KNY TCS YWVLNLLQIPV+ GV
Sbjct: 236 ITPSDEAVSIWKNVHWKEFGLLVFVWVAFLALQVTKNYIATCSTWYWVLNLLQIPVSVGV 295
Query: 307 SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
+ YEAV L +G+R I+SK ++ T + QL+ YC G+ AG+V GLLG+GGG I+GPLFL
Sbjct: 296 AMYEAVGLLQGKRVISSKANDQTSLKVHQLLLYCFLGLMAGVVAGLLGVGGGSIMGPLFL 355
Query: 367 ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
ELG+PPQVSSATA FA+ FSSSMSVVEY+LL RFPVPYAL+F L+ AA VGQ V++KL
Sbjct: 356 ELGVPPQVSSATATFAMMFSSSMSVVEYFLLNRFPVPYALFFTVLAFFAAIVGQRVVRKL 415
Query: 427 IKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
I +LGRAS+I+F LS +IFVSA SLGGVG++ MI +I EYMGF+NIC
Sbjct: 416 IGLLGRASLIVFILSMMIFVSAFSLGGVGISNMIHKITRHEYMGFENIC 464
>gi|83853809|gb|ABC47842.1| membrane protein-like protein [Glycine max]
Length = 469
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 260/415 (62%), Positives = 321/415 (77%), Gaps = 19/415 (4%)
Query: 66 KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
VWP++KFGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML L++GFDAKSS A+SKCM
Sbjct: 70 DRVWPEMKFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCM 129
Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLL 185
+ G AVS V YNLR RHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLL
Sbjct: 130 IMGAAVSTVYYNLRFRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLL 189
Query: 186 IVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGN-GYQTEECKCDPEYLSNDTTPE 244
I+LF+ ST+A KG++TWKKETI K+EAAK LE + GY +EE +Y S P
Sbjct: 190 IILFIATSTKALFKGIDTWKKETIMKKEAAKMLESDSSPGYVSEE-----DYKS---LPA 241
Query: 245 ETREPKKSKV--SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPV 302
+ +P+ +V I+ I + F + LAL + YT TCS+LYWVLN LQ+P+
Sbjct: 242 GSADPRDEEVIPMAIQGIPYSNF------IEVFLALHVV-TYTKTCSILYWVLNSLQVPI 294
Query: 303 AGGVSAYEAVALYKGRRKIASKGDEGTKW-RASQLVFYCACGITAGMVGGLLGLGGGFIL 361
A V+ YEA+ L G R IASKG E T W + ++ YC+CGI AG+V GLLGLGGGFIL
Sbjct: 295 AISVTLYEAICLCNGTRVIASKGKENTDWMKLHKICLYCSCGIIAGIVSGLLGLGGGFIL 354
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YYLL+RFPVPYA YF ++ IAA GQH
Sbjct: 355 GPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLERFPVPYASYFILVATIAALTGQH 414
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICY 476
V++K+I I GRASIIIF L+F IF+SA+SLGGVG+ M++++E+ EYMGF NIC+
Sbjct: 415 VVRKIIAIFGRASIIIFVLAFTIFLSAISLGGVGIENMVEKMENNEYMGFANICH 469
>gi|356554517|ref|XP_003545592.1| PREDICTED: uncharacterized protein LOC100781779 [Glycine max]
Length = 462
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 245/454 (53%), Positives = 321/454 (70%), Gaps = 16/454 (3%)
Query: 27 VSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGF 86
V +SA ++ K E +S N+ +I + GY+HVWP+++F WRIVVGT+IG
Sbjct: 16 VFISAITSTKSEVSSTNEEQSFSYHIK-ALEFIWKHLGYQHVWPEMEFSWRIVVGTLIGI 74
Query: 87 FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
G A GSVGGVGGGGIFVPML LI+GFD KS+ A+SKCMVTG A+SAV + ++QRHPTLD
Sbjct: 75 LGAAFGSVGGVGGGGIFVPMLILIIGFDPKSAVAISKCMVTGAAISAVFFCMKQRHPTLD 134
Query: 147 IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
PVIDYDL LL QP L+LGISIGV +VIFADWM+T+LLI+L +V S RAF G +TWKK
Sbjct: 135 EPVIDYDLMLLIQPTLMLGISIGVILSVIFADWMVTILLIILCIVTSIRAFFMGADTWKK 194
Query: 207 ETITKREAAKQLELIGNGYQTEECKCDPEY-----LSNDTTPEETREPKKSKVSIIENIY 261
ET K E+ + ++L T C + Y S++ ++TR+P +VSI+ N+Y
Sbjct: 195 ETKMKEESQETIKL---SESTATCSEEEGYKYLPGCSDEGYEKDTRKP---EVSILGNMY 248
Query: 262 WKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKI 321
WKEF L+ VW + LQIAKNYT +CSV YW+L L QIP+ G Y+A ALY+GR
Sbjct: 249 WKEFVLIFIVWLAFVVLQIAKNYTVSCSVTYWILILSQIPITVGFYLYQARALYQGR--- 305
Query: 322 ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
+ G + T W L C + AG+VGGLLG G GF++GPLFLE+GI PQV+SATA F
Sbjct: 306 -AAGSQHTHWPLHHLFLASICSLLAGIVGGLLGTGSGFVMGPLFLEVGIAPQVASATATF 364
Query: 382 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 441
+ +SSS+SV++YYLL RFPVPYAL+ ++ IAAF+GQ+++ KL+ I RAS+IIF L+
Sbjct: 365 GMMYSSSLSVIQYYLLNRFPVPYALFLTLVAAIAAFLGQYLIDKLVNIFQRASLIIFVLA 424
Query: 442 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
F IFVS+++LGGVG++ MI +I+ EYMGFDN C
Sbjct: 425 FTIFVSSIALGGVGISNMILKIQRNEYMGFDNFC 458
>gi|357519215|ref|XP_003629896.1| hypothetical protein MTR_8g088070 [Medicago truncatula]
gi|355523918|gb|AET04372.1| hypothetical protein MTR_8g088070 [Medicago truncatula]
Length = 448
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 257/430 (59%), Positives = 325/430 (75%), Gaps = 17/430 (3%)
Query: 23 VVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGT 82
V +S + Q P +++ K DH ++ YKH WP +KFGWRI+VGT
Sbjct: 33 VTSSHQYTDQKTTPP--SNETKGDH-------------AKVFYKHTWPGMKFGWRIIVGT 77
Query: 83 IIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH 142
I+ F G++ G+VGGVGGG VPML LI+GFDAKS+ A+SKCM+TGGA + V+YNLRQRH
Sbjct: 78 ILAFLGSSFGTVGGVGGG-GIVPMLTLIIGFDAKSATAISKCMITGGAGATVLYNLRQRH 136
Query: 143 PTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVE 202
PTLD+PVIDYDLALLFQP+L+LGIS+GVAFNVIF DWMIT L+++ F +S +AFLKGV
Sbjct: 137 PTLDLPVIDYDLALLFQPMLMLGISLGVAFNVIFPDWMITSLILIFFTGISVKAFLKGVN 196
Query: 203 TWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYW 262
TWK ET+ K+EA +L + + E+ + D + + ++ KVS+IEN+ W
Sbjct: 197 TWKNETLAKKEARDNSQL-NDIRRIEDVELDLQAGESVNDNHTNTNIQRKKVSVIENVSW 255
Query: 263 KEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIA 322
KE GLL VW +ILALQI K+YT TCSV YW+LNLLQ+P+A GVS+YEAV LYKG++ IA
Sbjct: 256 KEVGLLFFVWIMILALQIGKSYTITCSVAYWILNLLQVPIAVGVSSYEAVLLYKGKKVIA 315
Query: 323 SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFA 382
SKGD+G W QL+ YC+CGI AG++GGLLGLGGGFIL PLFL LGIPPQVSSAT+ FA
Sbjct: 316 SKGDQGINWSVKQLILYCSCGIIAGLIGGLLGLGGGFILAPLFLGLGIPPQVSSATSTFA 375
Query: 383 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 442
+TFS+SMSVVEYYLLKRFP+PYALYF ++ +AA VGQH+++KLI +LGRASIIIF L+
Sbjct: 376 MTFSASMSVVEYYLLKRFPIPYALYFVVVATVAALVGQHLVRKLIVLLGRASIIIFILAL 435
Query: 443 IIFVSALSLG 452
+FVS +SLG
Sbjct: 436 TVFVSGISLG 445
>gi|414589355|tpg|DAA39926.1| TPA: hypothetical protein ZEAMMB73_327447 [Zea mays]
Length = 497
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 256/438 (58%), Positives = 318/438 (72%), Gaps = 28/438 (6%)
Query: 64 GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
Y HVWP + FGWR+VVG++IGFFG ACGSVGGVGGGGIFVPML LIVGFD KSS A+SK
Sbjct: 60 AYHHVWPPMGFGWRVVVGSLIGFFGAACGSVGGVGGGGIFVPMLALIVGFDPKSSTAISK 119
Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
CM+ GG+VS V YNL+++HP+LD+P+IDYDLALL QP+L+LG+SIGV FNVIF DW+IT
Sbjct: 120 CMIMGGSVSTVYYNLKRKHPSLDMPLIDYDLALLMQPMLMLGVSIGVIFNVIFPDWLITA 179
Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKRE-------AAKQ-----LELIGN-------- 223
LLI +FLV ST+A+LKG ETWKKET+ KR+ AA+Q ++ GN
Sbjct: 180 LLITIFLVTSTKAYLKGFETWKKETVKKRQMLLRLHRAARQETRQIMKRYGNLACIAVRT 239
Query: 224 -GYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK 282
Y C Y+ D E S+++NIYWKE GLL VW L LQ+ +
Sbjct: 240 IAYIVRIRTCVHIYI--DELMETACLALFQTSSVLKNIYWKELGLLAFVWIAFLGLQLTR 297
Query: 283 -NYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS-KG---DEGTKWRASQLV 337
NY +CSV YWVLN LQ+PVA GV+ YEA L GRR ++S KG + + R QL+
Sbjct: 298 TNYAASCSVWYWVLNSLQVPVAVGVTVYEACGLRTGRRALSSNKGGSQQQQSALRIRQLL 357
Query: 338 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 397
YC G+ AG+VGGLLG+GGGFI+GPLFLELGIPPQVSSATA F + FSSSMSVVEYYLL
Sbjct: 358 VYCLFGVLAGLVGGLLGMGGGFIMGPLFLELGIPPQVSSATATFTMMFSSSMSVVEYYLL 417
Query: 398 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 457
RFPVPYA YF ++ +AA GQH ++KLI LGRAS+IIF L+ +IFVSAL+LGGVG++
Sbjct: 418 HRFPVPYAAYFTGVAFVAAITGQHCVRKLIAWLGRASLIIFILASMIFVSALTLGGVGIS 477
Query: 458 KMIKRIEHKEYMGFDNIC 475
++ R+E +YMGF+++C
Sbjct: 478 NIVHRVERHQYMGFESLC 495
>gi|242044424|ref|XP_002460083.1| hypothetical protein SORBIDRAFT_02g022500 [Sorghum bicolor]
gi|241923460|gb|EER96604.1| hypothetical protein SORBIDRAFT_02g022500 [Sorghum bicolor]
Length = 455
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 250/417 (59%), Positives = 312/417 (74%), Gaps = 24/417 (5%)
Query: 64 GYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSK 123
Y HVWP ++FGWR+VVG++IGFFG ACGSVGGVGGGGIFVPML LI+GFD KSS A+SK
Sbjct: 56 AYHHVWPPMEFGWRVVVGSLIGFFGAACGSVGGVGGGGIFVPMLALIIGFDPKSSTAISK 115
Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
YNL+ +HP+LD+P+IDYDLALL QP+L+LG+SIGV FNVIF +W+IT
Sbjct: 116 ----------FYYNLKLKHPSLDMPLIDYDLALLMQPMLMLGVSIGVIFNVIFPNWLITA 165
Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEY---LSND 240
LLI +FLV ST+A+LKG ETWKKETI KRE A++ E I C+ +PE+ + +D
Sbjct: 166 LLITIFLVTSTKAYLKGFETWKKETIKKREDARRKEQI--------CQ-EPEHAIPIGSD 216
Query: 241 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 300
++ + P S+++NIYWKEFGLL VW L LQ+ KNY +CSV YWVLN LQI
Sbjct: 217 APADKAKTPTDEATSVLKNIYWKEFGLLAFVWVAFLGLQVTKNYVASCSVWYWVLNSLQI 276
Query: 301 PVAGGVSAYEAVALYKGRRKIASKGDEGT--KWRASQLVFYCACGITAGMVGGLLGLGGG 358
PVA GV+ YEA L G+R ++SKG R QL+ YC GI AG++GGLLG+GGG
Sbjct: 277 PVAVGVTVYEAYGLRTGKRVLSSKGSSQQQSTLRIRQLLVYCLFGILAGLIGGLLGMGGG 336
Query: 359 FILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 418
FI+GPLFLELGIPPQVSSATA F + FSSSMSVVEYYLL RFPVPYA YF ++ +AA
Sbjct: 337 FIMGPLFLELGIPPQVSSATATFTMMFSSSMSVVEYYLLHRFPVPYAAYFTGVAFVAAIT 396
Query: 419 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GQH ++KLI LGRAS+IIF L+ +IFVSAL+LGGVG++ ++ R+E EYMGF+++C
Sbjct: 397 GQHFVRKLIAWLGRASLIIFILASMIFVSALTLGGVGISNIVHRMERHEYMGFESLC 453
>gi|357483379|ref|XP_003611976.1| Membrane protein-like protein [Medicago truncatula]
gi|358344383|ref|XP_003636269.1| Membrane protein-like protein [Medicago truncatula]
gi|355502204|gb|AES83407.1| Membrane protein-like protein [Medicago truncatula]
gi|355513311|gb|AES94934.1| Membrane protein-like protein [Medicago truncatula]
Length = 470
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 254/414 (61%), Positives = 320/414 (77%), Gaps = 11/414 (2%)
Query: 62 RSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
+S Y+ VWP++KF W+I+VG+IIGF G A GSVGGVGGGGIFVPML LI+GFD KSS A+
Sbjct: 66 KSSYEPVWPEMKFDWKIIVGSIIGFLGAALGSVGGVGGGGIFVPMLALIIGFDPKSSTAI 125
Query: 122 SKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMI 181
SKCM+ G A+S V YN+R R+PTLD+P+IDYDLALLFQP+L+LGISIGV NV+FADWM+
Sbjct: 126 SKCMIMGAALSTVYYNMRLRNPTLDMPLIDYDLALLFQPMLMLGISIGVICNVMFADWMV 185
Query: 182 TVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDT 241
TVLLI+LF+ ST+A +KG+ TWKKET+ K+E AKQLE EE K +Y
Sbjct: 186 TVLLIILFIGTSTKALIKGINTWKKETMLKKETAKQLE--------EEPKTGEDY---KP 234
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
P+ E + V +++NIYWKE LLV VW L +QI K YT TCS+ YW+LN LQ+P
Sbjct: 235 LPKGPGEIQDEVVPLLKNIYWKELSLLVYVWVAFLIVQIVKTYTKTCSIEYWILNFLQVP 294
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
+A V+ +EAV +YKG R I SKG E + Q++ YC+ G+ AGMVGGLLGLGGGFIL
Sbjct: 295 IAISVTLFEAVCIYKGTRVIKSKGKEVKNMKIYQILLYCSIGVIAGMVGGLLGLGGGFIL 354
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLE+GIPPQV+SAT+ F++ FSSSMSVV+YY L RFPVPYA YF ++ IAAF GQH
Sbjct: 355 GPLFLEMGIPPQVASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAGQH 414
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
V++++I ILGRASIIIF L+ IF+SA+SLGGVG+ MI ++E+ EYMGF+N+C
Sbjct: 415 VVRRIIAILGRASIIIFILASTIFISAISLGGVGIQNMIVKLENHEYMGFENLC 468
>gi|215694369|dbj|BAG89362.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 230/354 (64%), Positives = 272/354 (76%), Gaps = 12/354 (3%)
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M+ G AVS V YNL+ +HPTLD+PVIDYDLALL QP+L+LGISIGV FNVIF DW+ITVL
Sbjct: 1 MIMGAAVSTVYYNLKLKHPTLDMPVIDYDLALLIQPMLMLGISIGVLFNVIFPDWLITVL 60
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE 244
LI+LFL ST+AFLKGVETWKKETI KREAAK+LE I +PEY T P
Sbjct: 61 LIILFLGTSTKAFLKGVETWKKETILKREAAKRLEQIAE---------EPEYSPLPTGPN 111
Query: 245 ---ETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIP 301
E++ P S+ +N+YWKEFGLL VW L LQ+ KNY TCS YWVLN LQIP
Sbjct: 112 ATAESKAPSDEAASLWQNVYWKEFGLLAFVWIAFLVLQVTKNYMPTCSTWYWVLNFLQIP 171
Query: 302 VAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFIL 361
V+ GV+ YE + L +GRR I+SKGDE T + QL+ YC G+ AG+VGGLLGLGGGFI+
Sbjct: 172 VSVGVTMYEGLGLMQGRRVISSKGDEQTNLKFHQLLVYCFFGMMAGVVGGLLGLGGGFIM 231
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQH 421
GPLFLELGIPPQVSSATA FA+ FSSSMSVVEYYLL RFPVPYAL+F ++ AA +GQH
Sbjct: 232 GPLFLELGIPPQVSSATATFAMMFSSSMSVVEYYLLDRFPVPYALFFTVVAFFAAIIGQH 291
Query: 422 VLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+++KLI LGRAS+IIF LSF+IF+SA+SLGGVG++ MI +I EYMGFDNIC
Sbjct: 292 IVRKLINWLGRASLIIFILSFMIFISAISLGGVGISNMIGKIARHEYMGFDNIC 345
>gi|67043799|gb|AAY63998.1| unknown [Brassica napus]
Length = 348
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 230/348 (66%), Positives = 277/348 (79%), Gaps = 5/348 (1%)
Query: 128 GGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV 187
G +VS V YNLR RHPTLD+P+IDYDLALL QP+L+LGIS+GVAFNV+F DWM+TVLLI+
Sbjct: 2 GASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISVGVAFNVMFPDWMVTVLLII 61
Query: 188 LFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETR 247
LFL ST+AFLKG ETW KETI K EAAK+LE +G E + P + +T P
Sbjct: 62 LFLGTSTKAFLKGRETWNKETIEKMEAAKRLE--SDGVSATEVEYMPLPAAPNTNPGNN- 118
Query: 248 EPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVS 307
KK +VSIIEN+YWKE GLLV VW + LALQIAK TCSV YWV+NLLQIPVA GVS
Sbjct: 119 --KKREVSIIENVYWKELGLLVFVWIVFLALQIAKKNMPTCSVGYWVINLLQIPVAVGVS 176
Query: 308 AYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE 367
YEAVALY+GRR IAS G G+ + QL+ YC+ G+ AG+VGGLLGLGGGFI+GPLFLE
Sbjct: 177 GYEAVALYQGRRIIASNGQGGSNFTIGQLILYCSFGVLAGVVGGLLGLGGGFIMGPLFLE 236
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 427
LG+PPQVSSATA FA+TFSSSMSVVEYYLLKRFP+PYALY ++ IAA VGQHV+++LI
Sbjct: 237 LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPIPYALYLVGVATIAALVGQHVVRRLI 296
Query: 428 KILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+LGRAS+IIF L+ IF+SA+SLGGVG+ M+ +++ EYMGF+N+C
Sbjct: 297 AVLGRASLIIFILASTIFISAISLGGVGIVNMMGKLQRHEYMGFENLC 344
>gi|3643608|gb|AAC42255.1| hypothetical protein [Arabidopsis thaliana]
gi|20197510|gb|AAM15102.1| hypothetical protein [Arabidopsis thaliana]
Length = 902
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 225/348 (64%), Positives = 265/348 (76%), Gaps = 22/348 (6%)
Query: 128 GGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV 187
G +VS V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGVAFNVIF DW++TVLLIV
Sbjct: 2 GASVSTVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVAFNVIFPDWLVTVLLIV 61
Query: 188 LFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETR 247
LFL ST+AFLKG ETW KETI K+EAAK+LE NG E + P + T P
Sbjct: 62 LFLGTSTKAFLKGSETWNKETIEKKEAAKRLE--SNGVSGTEVEYVPLPAAPSTNPGNK- 118
Query: 248 EPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVS 307
KK +VSIIEN+YWKE GLLV VW + LALQI+K IPVA GVS
Sbjct: 119 --KKEEVSIIENVYWKELGLLVFVWIVFLALQISK-----------------IPVAVGVS 159
Query: 308 AYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE 367
YEAVALY+GRR IASKG + + QLV YC GI AG+VGGLLGLGGGFI+GPLFLE
Sbjct: 160 GYEAVALYQGRRIIASKGQGDSNFTVGQLVMYCTFGIIAGIVGGLLGLGGGFIMGPLFLE 219
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 427
LG+PPQVSSATA FA+TFSSSMSVVEYYLLKRFPVPYALY ++ IAA+VGQHV+++LI
Sbjct: 220 LGVPPQVSSATATFAMTFSSSMSVVEYYLLKRFPVPYALYLVGVATIAAWVGQHVVRRLI 279
Query: 428 KILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+GRAS+IIF L+ +IF+SA+SLGGVG+ MI +I+ EYMGF+N+C
Sbjct: 280 AAIGRASLIIFILASMIFISAISLGGVGIVNMIGKIQRHEYMGFENLC 327
>gi|224068935|ref|XP_002326235.1| predicted protein [Populus trichocarpa]
gi|222833428|gb|EEE71905.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 225/440 (51%), Positives = 293/440 (66%), Gaps = 13/440 (2%)
Query: 42 KNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGG 101
+ N+ PN + A S VWP ++F WR V+ T+IG G+ACG+VGGVGGGG
Sbjct: 25 SHPNEKASPNPNSDIFASPYLSTTDKVWPKLEFSWRTVLATVIGLLGSACGTVGGVGGGG 84
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
IFVPML LIVGFD KS+AA+SKCM+ + S+V YNLR HPT ++P+IDYDLALLFQP+
Sbjct: 85 IFVPMLTLIVGFDTKSAAALSKCMIMAASASSVWYNLRVPHPTREVPIIDYDLALLFQPM 144
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
L+LGI++GV+ +V+F W+ITVL+I+LF+ S+R+F KG+E WK+ETI K+E Q E I
Sbjct: 145 LLLGITLGVSLSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWKEETILKKEMVIQQETI 204
Query: 222 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIA 281
N E D EY P RE K + N+ WK +L VW L LQ+
Sbjct: 205 VNS--RGELLIDTEY-----EPLIPREEKSKMQILCFNLKWKRLLILFLVWTSFLLLQVI 257
Query: 282 KNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT------KWRASQ 335
KN CS YWVL LQ P+A GV YEAV LY+ +K S G+ T +W
Sbjct: 258 KNDVAVCSTWYWVLFCLQFPIAFGVFGYEAVKLYRENKKRISTGNTETICEASIEWTPMH 317
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
++F CGI G VGGLLG GGGF+LGPL LE+G+ P V+SAT+ F + FSSS+SVVE+Y
Sbjct: 318 ILFCALCGIIGGTVGGLLGSGGGFVLGPLLLEIGVSPHVASATSTFVMMFSSSLSVVEFY 377
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVG 455
LLKRFP+P+ALY +S++A F GQ ++KL+KILGRAS+I+F LS +IFVSAL++GGVG
Sbjct: 378 LLKRFPIPFALYLMGVSVLAGFWGQFFVRKLVKILGRASLIVFILSGVIFVSALTMGGVG 437
Query: 456 LAKMIKRIEHKEYMGFDNIC 475
+ I I + E+MGF C
Sbjct: 438 IDTSITMIRNHEFMGFLEFC 457
>gi|297827147|ref|XP_002881456.1| hypothetical protein ARALYDRAFT_902780 [Arabidopsis lyrata subsp.
lyrata]
gi|297327295|gb|EFH57715.1| hypothetical protein ARALYDRAFT_902780 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 223/436 (51%), Positives = 299/436 (68%), Gaps = 13/436 (2%)
Query: 46 DHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVP 105
D PN +S+ S + +WPD+KF W++V+ T+I F G+ACG+VGGVGGGGIFVP
Sbjct: 32 DTTNPNHHHHLSSSSPLSATEKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVP 91
Query: 106 MLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLG 165
ML LI+GFD KS+AA+SKCM+ G + S+V YN+R RHPT ++P++DYDLALLFQP+L+LG
Sbjct: 92 MLTLILGFDTKSAAAISKCMIMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLG 151
Query: 166 ISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGY 225
I++GV+ +V+F W+ITVL+I+LF+ S+R+F KG+E WK+ET+ K E A+Q + N
Sbjct: 152 ITVGVSLSVVFPYWLITVLIIILFVGTSSRSFFKGIEMWKEETLLKNEMAQQRANMVNS- 210
Query: 226 QTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT 285
E D EY P RE K I N+ WK +LV VW L +QI KN
Sbjct: 211 -RGELLIDTEY-----EPLYPREEKSELEIIRSNLKWKRLLVLVTVWLAFLLIQIIKNEI 264
Query: 286 TTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGD-----EGT-KWRASQLVFY 339
CS +YW+L +LQ PVA V +EA+ LY +K S G+ E T KW L+F
Sbjct: 265 KVCSTIYWILFILQFPVALAVFGFEAIKLYTENKKRLSSGNTECICEATIKWTPLSLIFC 324
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
CG+ G+VGGLLG GGGF+LGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLLKR
Sbjct: 325 GLCGVIGGIVGGLLGSGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKR 384
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
FP+PYA+Y ++SI+A F GQ ++KL+ IL RASII+F LS +I SAL++G +G+ K
Sbjct: 385 FPIPYAMYLISVSILAGFWGQSFIRKLVAILKRASIIVFVLSGVICASALTMGVIGIEKS 444
Query: 460 IKRIEHKEYMGFDNIC 475
IK I + E+MGF C
Sbjct: 445 IKMIHNHEFMGFLGFC 460
>gi|30686851|ref|NP_850267.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
gi|18700129|gb|AAL77676.1| At2g36630/F1O11.26 [Arabidopsis thaliana]
gi|22137236|gb|AAM91463.1| At2g36630/F1O11.26 [Arabidopsis thaliana]
gi|330254181|gb|AEC09275.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
Length = 459
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 217/421 (51%), Positives = 293/421 (69%), Gaps = 13/421 (3%)
Query: 61 SRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAA 120
S S + +WPD+KF W++V+ T+I F G+ACG+VGGVGGGGIFVPML LI+GFD KS+AA
Sbjct: 43 SLSATEKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVPMLTLILGFDTKSAAA 102
Query: 121 VSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWM 180
+SKCM+ G + S+V YN+R RHPT ++P++DYDLALLFQP+L+LGI++GV+ +V+F W+
Sbjct: 103 ISKCMIMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLGITVGVSLSVVFPYWL 162
Query: 181 ITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSND 240
ITVL+I+LF+ S+R+F KG+E WK+ET+ K E A+Q + N E D EY
Sbjct: 163 ITVLIIILFVGTSSRSFFKGIEMWKEETLLKNEMAQQRANMVN--SRGELLIDTEY---- 216
Query: 241 TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 300
P RE K I N+ WK +LV VW L +QI KN CS +YW+L ++Q
Sbjct: 217 -EPLYPREEKSELEIIRSNLKWKGLLILVTVWLTFLLIQIVKNEIKVCSTIYWILFIVQF 275
Query: 301 PVAGGVSAYEAVALYKGRRKIASKGD-----EGT-KWRASQLVFYCACGITAGMVGGLLG 354
PVA V +EA LY +K + G+ E T +W L+F CG+ G+VGGLLG
Sbjct: 276 PVALAVFGFEASKLYTANKKRLNSGNTECICEATIEWTPLSLIFCGLCGLIGGIVGGLLG 335
Query: 355 LGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII 414
GGGF+LGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLLKRFP+PYA+Y ++SI+
Sbjct: 336 SGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPIPYAMYLISVSIL 395
Query: 415 AAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNI 474
A F GQ ++KL+ IL RASII+F LS +I SAL++G +G+ K IK I + E+MGF
Sbjct: 396 AGFWGQSFIRKLVAILRRASIIVFVLSGVICASALTMGVIGIEKSIKMIHNHEFMGFLGF 455
Query: 475 C 475
C
Sbjct: 456 C 456
>gi|356570810|ref|XP_003553577.1| PREDICTED: uncharacterized protein LOC100807336 [Glycine max]
Length = 459
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 223/431 (51%), Positives = 298/431 (69%), Gaps = 21/431 (4%)
Query: 56 VSAPKSRSGY--KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGF 113
+S P G+ +WP++K GWR+ + T+IGF G+A G+VGGVGGGGIFVPML LIVGF
Sbjct: 36 ISPPSHDYGFGNDKLWPELKVGWRVALATVIGFLGSAFGTVGGVGGGGIFVPMLTLIVGF 95
Query: 114 DAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
D KS+AA+SKCM+ G + ++V YNLR HPT ++P+IDYDLALLFQP+L+LGI++GV +
Sbjct: 96 DTKSAAALSKCMIMGASTASVWYNLRVPHPTKEVPIIDYDLALLFQPMLMLGITVGVVLS 155
Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCD 233
V+F W+ITVL+I+LF+ S+R+F KG + W++ET+ K+E A+Q + N E D
Sbjct: 156 VVFPYWLITVLIIILFIGSSSRSFFKGTQMWREETLLKKEMARQRATLVNF--RGELLID 213
Query: 234 PEYLSNDTTPEETREPKKSKVSI---IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSV 290
EY E PK+ K S+ N+ WK +L+ VW L LQ+ KN CSV
Sbjct: 214 TEY--------EQLFPKEEKSSMQIFCFNLKWKRILILMFVWVSFLLLQVIKNDVKICSV 265
Query: 291 LYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT------KWRASQLVFYCACGI 344
YWVL LQ P+A V YEAV LYKG ++ S G+ + +W ++F CGI
Sbjct: 266 WYWVLFCLQFPIALLVFGYEAVKLYKGHKERVSTGNPESICEASIEWTVLHILFCALCGI 325
Query: 345 TAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 404
G VGGLLG GGGFILGPL +E+G+ PQV+SATA F + FSSS+SVVE+YLLKRFP+PY
Sbjct: 326 LGGTVGGLLGSGGGFILGPLLIEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPIPY 385
Query: 405 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 464
ALY A+S++A F GQ+ ++KL+ IL RASII+F LS +IF SAL++G VG+ K IK I+
Sbjct: 386 ALYLTAVSVLAGFWGQYFVRKLMVILKRASIIVFILSGVIFASALTMGVVGIDKSIKMIQ 445
Query: 465 HKEYMGFDNIC 475
H E+MGF + C
Sbjct: 446 HHEFMGFLDFC 456
>gi|356533415|ref|XP_003535260.1| PREDICTED: uncharacterized protein LOC100788770 [Glycine max]
Length = 464
Score = 369 bits (948), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 224/454 (49%), Positives = 308/454 (67%), Gaps = 13/454 (2%)
Query: 29 VSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFG 88
VSA S A + H + N I + A + K VWPD++ WR+V+ T+IGF G
Sbjct: 14 VSAFSCAVLSALFLTNHIHHDDNKPITLLASRVSETLK-VWPDLEPSWRLVLATVIGFLG 72
Query: 89 TACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP 148
+ACG+VGGVGGGGIFVPMLNL++GFD KS+AA+SKCM+ G + S+V YN+R HPT ++P
Sbjct: 73 SACGTVGGVGGGGIFVPMLNLLLGFDTKSAAALSKCMIMGASASSVWYNVRVPHPTKEVP 132
Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
++DYDLALLFQP+L+LGI++GVA +V+F W+ITVL+I+LF+ S+R+F KG+E W++ET
Sbjct: 133 ILDYDLALLFQPMLMLGITVGVALSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWREET 192
Query: 209 ITKREAAKQLELIGNGYQTEECK-CDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGL 267
I KRE KQ + + + ++ D +Y P +E K + + N+ WK +
Sbjct: 193 IFKREKTKQRATLVDSHGEDKTGIIDTKY-----EPLIPKEEKSTIQILCLNLRWKRILV 247
Query: 268 LVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYK-GRRKIASKGD 326
L+ VW L +Q+ KN CS YWVL LQ P+A V YEAV LYK +R+++++
Sbjct: 248 LIVVWVAFLLVQVIKNDVEACSPWYWVLFGLQFPIALLVFGYEAVKLYKEHKRRMSTRNL 307
Query: 327 E-----GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
E +W A L F CGI G+VGGLLG GGGF+LGPL LE+G+ PQV+SATA F
Sbjct: 308 ECICEASIEWTAMNLAFCSLCGIVGGIVGGLLGSGGGFVLGPLLLEIGVIPQVASATATF 367
Query: 382 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 441
+ FSSS+SVVE+YLLKRFP+PYALY ++S++A F GQ +++LI LGRASII+F LS
Sbjct: 368 VMMFSSSLSVVEFYLLKRFPIPYALYLTSVSVLAGFWGQFFVRRLIACLGRASIIVFILS 427
Query: 442 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+IF SAL++G VG+ I+ I + E+MGF C
Sbjct: 428 GVIFASALTMGVVGIENSIQMINNHEFMGFLGFC 461
>gi|255575461|ref|XP_002528632.1| conserved hypothetical protein [Ricinus communis]
gi|223531921|gb|EEF33735.1| conserved hypothetical protein [Ricinus communis]
Length = 449
Score = 369 bits (948), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 223/419 (53%), Positives = 287/419 (68%), Gaps = 21/419 (5%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
S + VWP+++F WRIV+ T+IGF G+ACG+VGGVGGGGIFVPML LIVGFD KS+AA+S
Sbjct: 43 STTERVWPELEFSWRIVLATVIGFLGSACGTVGGVGGGGIFVPMLTLIVGFDTKSAAAIS 102
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G + S+V YNLR HPT ++P++DYDLALLFQP+L+LGI++GVA +V+F W+IT
Sbjct: 103 KCMIMGASASSVWYNLRVPHPTKEVPILDYDLALLFQPMLMLGITVGVASSVVFPYWLIT 162
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
VL+I+LF+ S+R+F KGVE WK+ETI K+E AKQ E + N E D EY
Sbjct: 163 VLIIILFIGTSSRSFFKGVEMWKEETILKKELAKQQEAVVNS--RGELLIDTEY------ 214
Query: 243 PEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPV 302
E PK+ K ++E + L ++ L LQ+ KN TCS YWVL LQ PV
Sbjct: 215 --EPLVPKEEKSEMVEKAFCAAICL-----SVFLLLQVIKNDVATCSKWYWVLFCLQFPV 267
Query: 303 AGGVSAYEAVALYKGRRKIASKGDEGT------KWRASQLVFYCACGITAGMVGGLLGLG 356
A V YEAV LYK +K S G+ + W + F CGI G VGGLLG G
Sbjct: 268 ALAVFGYEAVKLYKEHKKRISTGNTESICEASIAWTPMHISFCALCGILGGTVGGLLGSG 327
Query: 357 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 416
GGFILGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLLKRFP+PYALY +S++A
Sbjct: 328 GGFILGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLKRFPMPYALYLTGVSVLAG 387
Query: 417 FVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
F GQ ++KLI IL R S+I+F LS +IF SA+++G VG K I+ I + E+MGF C
Sbjct: 388 FWGQFFVRKLITILKRGSLIVFILSGVIFASAITMGVVGTEKSIRMINNHEFMGFLGFC 446
>gi|356548305|ref|XP_003542543.1| PREDICTED: uncharacterized protein LOC100804918 [Glycine max]
Length = 464
Score = 367 bits (942), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 220/437 (50%), Positives = 302/437 (69%), Gaps = 15/437 (3%)
Query: 47 HVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPM 106
H + N + + A + K VWPD++ WR+++ T+IGF G+ACG+VGGVGGGGIFVPM
Sbjct: 32 HDDDNKAVTLLASRVSETLK-VWPDLEPSWRLLLATVIGFLGSACGTVGGVGGGGIFVPM 90
Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
LNLI+GFD KS+AA+SKCM+ G + S+V YN+R HPT ++P++DYDLALLFQP+L+LGI
Sbjct: 91 LNLILGFDTKSAAALSKCMIMGASTSSVWYNVRVPHPTKEVPILDYDLALLFQPMLMLGI 150
Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ-LELIGNGY 225
++GVA +V+F W+ITVL+I+LF+ S+R+F KG+E W++ETI KRE Q L+ +
Sbjct: 151 TVGVALSVVFPYWLITVLIIILFIGTSSRSFFKGIEMWREETIFKREKTMQRATLVDSQG 210
Query: 226 QTEECKCDPEYLSNDTTPEETREPKKSKVSII-ENIYWKEFGLLVAVWAIILALQIAKNY 284
+ + D +Y P+E KKS + I+ N+ WK +L+ VW L +Q+ KN
Sbjct: 211 EDKTGIIDTKY--EPLIPKE----KKSTMEILCLNLRWKRILVLIVVWVGFLLVQVIKND 264
Query: 285 TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE------GTKWRASQLVF 338
CS YWVL LQ+P+A V YEAV LYK ++ + G+ +W A L F
Sbjct: 265 VEACSAWYWVLFGLQLPIALLVFGYEAVKLYKEHKRRMNTGNSECICEASIEWTAINLAF 324
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
CGI G+VGGLLG GGGF+LGPL LE+G+ PQV+SATA F + FSSS+SVVE+YLLK
Sbjct: 325 CALCGIVGGIVGGLLGSGGGFVLGPLLLEIGVIPQVASATATFVMMFSSSLSVVEFYLLK 384
Query: 399 RFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAK 458
RFP+PYALY ++S++A F GQ +++LI LGRASII+F LS +IF SAL++G VG+
Sbjct: 385 RFPIPYALYLTSVSVLAGFWGQFFVRRLITCLGRASIIVFILSGVIFASALTMGVVGIEN 444
Query: 459 MIKRIEHKEYMGFDNIC 475
I+ I + E+MGF C
Sbjct: 445 SIQMINNHEFMGFLGFC 461
>gi|225441056|ref|XP_002283988.1| PREDICTED: uncharacterized protein LOC100240792 isoform 1 [Vitis
vinifera]
gi|297740033|emb|CBI30215.3| unnamed protein product [Vitis vinifera]
Length = 469
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 232/450 (51%), Positives = 302/450 (67%), Gaps = 22/450 (4%)
Query: 40 TSKNKNDHVEPN-------IVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACG 92
T N N P+ +++ P S + VWP++KF WRIV+ T+IGF G+A G
Sbjct: 25 TDPNGNHTYPPSNLSSLNTTLLQNPFPSVSSSSEKVWPELKFSWRIVLATVIGFLGSAFG 84
Query: 93 SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDY 152
+VGGVGGGGIFVPML LIVGFD KS+AA+SKCM+ G + S+V YNLR HPT + P+IDY
Sbjct: 85 TVGGVGGGGIFVPMLTLIVGFDTKSAAALSKCMIMGASTSSVWYNLRVNHPTKEAPIIDY 144
Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
DLALLFQP+L+LGI++GVA +V+F W+IT+L+I+LF+ STR+F K V+ W +ET+ K+
Sbjct: 145 DLALLFQPMLMLGITVGVALSVVFPYWLITILIIILFMGTSTRSFFKAVQMWNEETLFKK 204
Query: 213 EAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSII-ENIYWKEFGLLVAV 271
E +Q + N + E D EY + P E +KS + I+ N+ WK +LV V
Sbjct: 205 ELEEQRRTMVNSHG--ELLIDAEY--DPLIPNE----EKSWIQILCSNLRWKRILVLVTV 256
Query: 272 WAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKG-RRKIASKGDEGT- 329
W I L LQI KN +T CS YWVL LLQ PVA V YEAV L+K +++I S E
Sbjct: 257 WVIFLLLQIFKNDSTVCSAWYWVLFLLQFPVAVVVFGYEAVKLHKEYKKRIISGNTEAVC 316
Query: 330 ----KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITF 385
+W L F CGI G VGGLLG GGGFILGPL LE+G+ PQV+SATA F + F
Sbjct: 317 EANIEWTGLNLAFCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASATATFVMMF 376
Query: 386 SSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIF 445
SSS+SVVE+YLLKRFP+PYALY ++S++A F GQ ++KL+ L RASIIIF LS +IF
Sbjct: 377 SSSLSVVEFYLLKRFPIPYALYLISVSVVAGFWGQLFIRKLVAFLRRASIIIFILSGVIF 436
Query: 446 VSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
SAL++G VG + I I + E+MGF + C
Sbjct: 437 ASALTMGVVGTKESITMIHNHEFMGFLSFC 466
>gi|326524532|dbj|BAK00649.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 474
Score = 360 bits (925), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 213/427 (49%), Positives = 288/427 (67%), Gaps = 11/427 (2%)
Query: 58 APKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKS 117
AP+ S VWPD++F WR+VV T++GF G+A G+VGGVGGGGIFVP+LNL++GFD KS
Sbjct: 49 APRLSSRSARVWPDLEFNWRLVVATVVGFLGSAFGTVGGVGGGGIFVPLLNLVLGFDTKS 108
Query: 118 SAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFA 177
+AA+SKCM+ G + S+V YNL+ HPT + PV+DY LALLFQP+L+LGI+IGV +V+F
Sbjct: 109 AAALSKCMIMGASASSVWYNLQVSHPTKEAPVLDYKLALLFQPMLMLGITIGVELSVVFP 168
Query: 178 DWMITVLLIVLFLVLSTRAFLKGVETWKKET---ITKREAAKQLELIGNGYQTEECKCDP 234
W+ITVL+I+LF+ S+R+F KG+ W++ET + RE Q + G T + DP
Sbjct: 169 YWLITVLIIILFVGTSSRSFYKGILMWREETRILMETRERETQSKSSG-ASATNDVVLDP 227
Query: 235 EYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWV 294
Y + P+ K +++ N+ WK +L+AVW+ L LQI KN + TCS YWV
Sbjct: 228 SY-AEPLLPQSKPTQKSGLETLMFNLRWKHILVLLAVWSSFLVLQILKNNSKTCSTFYWV 286
Query: 295 LNLLQIPVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGM 348
+N+ QIP A V ++AV L + R G + +W +QL+F CG+ G
Sbjct: 287 INIFQIPAAVSVFVWKAVQLCRESRARRMNGNLECVCEANIEWSPTQLIFCAFCGLLGGT 346
Query: 349 VGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYF 408
VGGLLG GGGFILGPL LELG PQV+SATA F + FSSS+SVVE+Y L RFP+PYALY
Sbjct: 347 VGGLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLGRFPIPYALYL 406
Query: 409 FALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEY 468
+SI+A F GQ +++K++ +L RAS+I+F LS +IF SAL++G VG K I I EY
Sbjct: 407 IFISILAGFWGQCLVRKIVLVLKRASLIVFILSSVIFASALTMGVVGTEKSISMINKHEY 466
Query: 469 MGFDNIC 475
MGF + C
Sbjct: 467 MGFLSFC 473
>gi|449451245|ref|XP_004143372.1| PREDICTED: uncharacterized protein LOC101206149 [Cucumis sativus]
gi|449523213|ref|XP_004168618.1| PREDICTED: uncharacterized protein LOC101229265 [Cucumis sativus]
Length = 456
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 216/413 (52%), Positives = 286/413 (69%), Gaps = 13/413 (3%)
Query: 69 WPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTG 128
WPD++ WR+V T+IGF G+ACG+VGGVGGGGIFVPML LI+GFD KS+AA+SKCM+ G
Sbjct: 48 WPDLEPSWRLVAATVIGFLGSACGTVGGVGGGGIFVPMLTLIIGFDTKSAAAISKCMIMG 107
Query: 129 GAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
+ S+V YNLR HPT D+P+ID+DLALLFQP+L+LGI++GV+ +V+F W+IT+L+I+L
Sbjct: 108 ASTSSVWYNLRVAHPTKDVPIIDHDLALLFQPMLMLGITVGVSLSVVFPYWLITILIIIL 167
Query: 189 FLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETRE 248
F+ S+R+F KG+E WK+ETI K+E AK+ E + N + E D EY P +E
Sbjct: 168 FIGTSSRSFFKGIEMWKEETILKKEFAKRCETVVNSHG--ELLIDVEY-----DPLIPKE 220
Query: 249 PKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSA 308
K + N+ WK +L AVW L LQ+ KN CS+ YWV+ LQ P+A V
Sbjct: 221 QKTELELMCFNLRWKRTSILFAVWISFLILQVVKNDVAACSIWYWVVFFLQFPIAIVVFG 280
Query: 309 YEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 362
YEA LYK +K G + W S L F CGI G VGGLLG GGGF+LG
Sbjct: 281 YEARKLYKEHKKRMEAGNLEQICEASIGWTGSHLAFCALCGIVGGTVGGLLGSGGGFVLG 340
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
PL LE+G+ PQV+SATA F + FSSS+SVVE+YLL RFP+PYALY ++S++A F GQ
Sbjct: 341 PLLLEIGVVPQVASATATFVMMFSSSLSVVEFYLLNRFPIPYALYLTSVSVLAGFWGQFF 400
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++KLI IL RAS+I+F LS +IF SA+++G VG+ K I I++ E+MGF + C
Sbjct: 401 VRKLITILRRASLIVFILSGVIFASAITMGIVGVTKSITMIQNHEFMGFLDFC 453
>gi|115455071|ref|NP_001051136.1| Os03g0726500 [Oryza sativa Japonica Group]
gi|41469272|gb|AAS07154.1| expressed protein [Oryza sativa Japonica Group]
gi|50428734|gb|AAT77085.1| expressed protein [Oryza sativa Japonica Group]
gi|108710852|gb|ABF98647.1| expressed protein [Oryza sativa Japonica Group]
gi|113549607|dbj|BAF13050.1| Os03g0726500 [Oryza sativa Japonica Group]
gi|215697621|dbj|BAG91615.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 475
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 215/425 (50%), Positives = 286/425 (67%), Gaps = 15/425 (3%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
SRS VWPD+ F WR+VV T++GF G+A G+VGGVGGGGIFVPMLNL+VGFD KS+A
Sbjct: 55 SSRSA--RVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLLVGFDTKSAA 112
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+ G + S+V YNL+ HPT + PVIDY LALLFQP+L+LGI+IGV +VIF W
Sbjct: 113 ALSKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYW 172
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDP---EY 236
+ITVL+I+LF+ S+R+F KG+ WK ET + E ++ E + + DP E
Sbjct: 173 LITVLIIILFIGTSSRSFYKGILMWKDETRIQMETREREEESKSSCAARDVVIDPSCEEP 232
Query: 237 LSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLN 296
L P+E K + + + N+ WK +L+ VW+ L LQI KN + +CS YWV+N
Sbjct: 233 LLCQPQPKE----KSALETFLFNLRWKNILVLMTVWSSFLVLQIFKNNSQSCSTFYWVIN 288
Query: 297 LLQIPVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGMVG 350
+LQ+PVA V +E V L + R G + +W +QL+F CG+ G VG
Sbjct: 289 ILQVPVALSVFLWEGVQLCRESRARRMDGNWECVCEASIEWSPAQLIFCAFCGLLGGTVG 348
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
GLLG GGGFILGPL LELG PQV+SATA F + FSSS+SVVE+Y L RFP+P+A+Y
Sbjct: 349 GLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLNRFPIPFAVYLIC 408
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMG 470
+SI+A F GQ +++KL+ +L RAS+I+F LS +IF SAL++G VG K I I + EYMG
Sbjct: 409 ISILAGFWGQSLVRKLVHVLKRASLIVFILSSVIFASALTMGVVGTQKSISMINNHEYMG 468
Query: 471 FDNIC 475
F + C
Sbjct: 469 FLDFC 473
>gi|125545587|gb|EAY91726.1| hypothetical protein OsI_13369 [Oryza sativa Indica Group]
Length = 475
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 215/425 (50%), Positives = 286/425 (67%), Gaps = 15/425 (3%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
SRS VWPD+ F WR+VV T++GF G+A G+VGGVGGGGIFVPMLNL+VGFD KS+A
Sbjct: 55 SSRSA--RVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLLVGFDTKSAA 112
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+ G + S+V YNL+ HPT + PVIDY LALLFQP+L+LGI+IGV +VIF W
Sbjct: 113 ALSKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYW 172
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDP---EY 236
+ITVL+I+LF+ S+R+F KG+ WK ET + E ++ E + + DP E
Sbjct: 173 LITVLIIILFIGTSSRSFYKGILMWKDETRIQMETREREEESKSSCAARDVVIDPSCEEP 232
Query: 237 LSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLN 296
L P+E K + + + N+ WK +L+ VW+ L LQI KN + +CS YWV+N
Sbjct: 233 LLCQPQPKE----KSALETFLFNLRWKNILVLMTVWSSFLVLQIFKNNSQSCSTFYWVIN 288
Query: 297 LLQIPVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGMVG 350
+LQ+PVA V +E V L + R G + +W +QL+F CG+ G VG
Sbjct: 289 ILQVPVALSVFLWEGVQLCRESRARRMDGNWECVCEASIEWSPAQLIFCAFCGLLGGTVG 348
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
GLLG GGGFILGPL LELG PQV+SATA F + FSSS+SVVE+Y L RFP+P+A+Y
Sbjct: 349 GLLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLNRFPIPFAVYLIC 408
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMG 470
+SI+A F GQ +++KL+ +L RAS+I+F LS +IF SAL++G VG K I I + EYMG
Sbjct: 409 ISILAGFWGQSLVRKLVHVLKRASLIVFILSSVIFASALTMGVVGTQKSISMINNHEYMG 468
Query: 471 FDNIC 475
F + C
Sbjct: 469 FLDFC 473
>gi|242033121|ref|XP_002463955.1| hypothetical protein SORBIDRAFT_01g009520 [Sorghum bicolor]
gi|241917809|gb|EER90953.1| hypothetical protein SORBIDRAFT_01g009520 [Sorghum bicolor]
Length = 482
Score = 355 bits (910), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 218/453 (48%), Positives = 298/453 (65%), Gaps = 24/453 (5%)
Query: 33 SNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACG 92
S P AT+ +ND V + + +P+ VWPD+ F WR+VV T++GF G+A G
Sbjct: 43 SAADPPATAA-RND-VAARLRLSSRSPR-------VWPDLAFNWRVVVATVVGFLGSAFG 93
Query: 93 SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDY 152
+VGGVGGGGIFVPMLNL+VGFD KS+AA+SKCM+ G + S+V YNL+ HPT + PVIDY
Sbjct: 94 TVGGVGGGGIFVPMLNLVVGFDTKSAAALSKCMIMGASASSVWYNLQVSHPTKEAPVIDY 153
Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
LALLFQP+L+LGI+IGV +VIF W+ITVL+I+LF+ S+R+F KG+ W++ET
Sbjct: 154 KLALLFQPMLMLGITIGVELSVIFPYWLITVLIIILFVGTSSRSFYKGILMWREETRILM 213
Query: 213 EAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIE----NIYWKEFGLL 268
E ++ E + + DP Y T +P+ K S +E N+ WK +L
Sbjct: 214 ETREREEQSKSTCAAGDVIIDPSYEEPLLT-----QPQPKKKSALETFLFNLRWKNILVL 268
Query: 269 VAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG--- 325
++VW+ L LQ+ KN +CS YWV+N+LQ+PVA V +EA+ L + R G
Sbjct: 269 MSVWSSFLVLQVLKNNAKSCSTFYWVVNILQVPVAVSVFLWEAMQLCRESRARRMNGNLE 328
Query: 326 ---DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFA 382
+ +W +QL+F CG+ G VGGLLG GGGFILGPL LELG PQV+SATA F
Sbjct: 329 CVCEASIEWSPAQLIFCAFCGLLGGTVGGLLGSGGGFILGPLLLELGCIPQVASATATFV 388
Query: 383 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 442
+ FSSS+SVVE+Y L RFP+P+A Y +SI+A F GQ +++K++ +L RAS+I+F LS
Sbjct: 389 MMFSSSLSVVEFYFLHRFPLPFAGYLIFISILAGFWGQCLVRKIVHVLKRASVIVFILSS 448
Query: 443 IIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+IF SAL++G VG K I I++ EYMGF + C
Sbjct: 449 VIFASALTMGVVGTQKSISMIKNHEYMGFLDFC 481
>gi|293331891|ref|NP_001168709.1| uncharacterized protein LOC100382501 [Zea mays]
gi|223950337|gb|ACN29252.1| unknown [Zea mays]
gi|238009716|gb|ACR35893.1| unknown [Zea mays]
gi|414872565|tpg|DAA51122.1| TPA: hypothetical protein ZEAMMB73_068409 [Zea mays]
Length = 479
Score = 354 bits (908), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 215/424 (50%), Positives = 291/424 (68%), Gaps = 14/424 (3%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
SRS VWPD+ F WR+VV T++GF G+A G+VGGVGGGGIFVPMLNL+VGFD KS+A
Sbjct: 61 SSRS--PRVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLVVGFDTKSAA 118
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+ G + S+V YNL+ HPT + PVIDY LALLFQP+L+LGI+IGV +VIF W
Sbjct: 119 ALSKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYW 178
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSN 239
+ITVL+I+LF+ S+R+F KG+ W++E+ E + E + + DP Y
Sbjct: 179 LITVLIIILFVGTSSRSFYKGILMWREESRILMETRGREEQSKSTCAASDVVIDPSY--- 235
Query: 240 DTTPEETR-EPKKSK-VSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 297
P T+ +PKKS + + N+ WK +L++VW++ L LQ+ KN + +CS YWV+N+
Sbjct: 236 -GEPLLTQPQPKKSAWETFMFNLRWKNILVLMSVWSLFLVLQVLKNNSKSCSSFYWVINV 294
Query: 298 LQIPVAGGVSAYEAVALYK---GRR---KIASKGDEGTKWRASQLVFYCACGITAGMVGG 351
LQ+PVA V +EA+ L + RR + + +W +QL+F CG+ G VGG
Sbjct: 295 LQVPVAVSVFLWEAMQLCRESHARRMNGNLECVCEASIEWSPAQLIFCAFCGLLGGTVGG 354
Query: 352 LLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFAL 411
LLG GGGFILGPL LELG PQV+SATA F + FSSS+SVVE+Y L RFP+P+A Y +
Sbjct: 355 LLGSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLHRFPLPFAGYLIFI 414
Query: 412 SIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGF 471
SI+A F GQ +++K++ +L RAS+I+F LS +IFVSAL++G VG K I I + EYMGF
Sbjct: 415 SILAGFWGQCLVRKIVHVLKRASLIVFILSSVIFVSALTMGVVGTQKSISMINNHEYMGF 474
Query: 472 DNIC 475
N C
Sbjct: 475 LNFC 478
>gi|357117223|ref|XP_003560372.1| PREDICTED: uncharacterized protein LOC100844938 [Brachypodium
distachyon]
Length = 479
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 206/422 (48%), Positives = 281/422 (66%), Gaps = 9/422 (2%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
SRS VWPD++F WR+VV TI+GF G+A G+VGGVGGGGIFVP+LNL++GFD KS+A
Sbjct: 60 SSRSA--RVWPDLEFNWRLVVATIVGFLGSAFGTVGGVGGGGIFVPLLNLVLGFDTKSAA 117
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+ G + S+V YNL+ HPT + PV+DY LALLFQP+L+LGI+IGV +V+F W
Sbjct: 118 ALSKCMIMGASASSVWYNLQVSHPTKEAPVLDYKLALLFQPMLMLGITIGVELSVVFPYW 177
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSN 239
+IT+L+I+LF+ S+R+F KG+ W++ET E ++ + + Y
Sbjct: 178 LITILIIILFVGTSSRSFYKGILMWREETRILMETQEREAQSKSSSAVSDVVLGTSY-EE 236
Query: 240 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ 299
P+ K +++ N+ WK +L+AVW+ L LQI KN + TCS YWV+N+LQ
Sbjct: 237 PLLPQPKPTEKSGLETLLFNLRWKNILVLMAVWSSFLFLQILKNNSKTCSTFYWVINILQ 296
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGMVGGLL 353
+P A V ++AV L + R G + +W +QLVF CG+ G VGGLL
Sbjct: 297 VPAAVSVFLWKAVQLCRESRARRMNGNLECVCEASIEWSPTQLVFCAFCGLLGGTVGGLL 356
Query: 354 GLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSI 413
G GGGFILGPL LELG PQV+SATA F + FSSS+SVVE+Y L RFP+PYA Y +SI
Sbjct: 357 GSGGGFILGPLLLELGCIPQVASATATFVMMFSSSLSVVEFYFLHRFPIPYAAYLIFISI 416
Query: 414 IAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDN 473
+A F GQ +++K++ +L RAS+I+F LS +IF SA+++G VG K I I + EYMGF
Sbjct: 417 LAGFWGQCLVRKIVLVLKRASLIVFILSSVIFASAVTMGVVGTEKSISMINNHEYMGFLG 476
Query: 474 IC 475
C
Sbjct: 477 FC 478
>gi|242044420|ref|XP_002460081.1| hypothetical protein SORBIDRAFT_02g022480 [Sorghum bicolor]
gi|241923458|gb|EER96602.1| hypothetical protein SORBIDRAFT_02g022480 [Sorghum bicolor]
Length = 383
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 202/404 (50%), Positives = 256/404 (63%), Gaps = 57/404 (14%)
Query: 72 IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
++FGWRIV+G++IGFFG A GSVGGVGGGG+F+PML LI+GFD KSSAA+SKCM+ G +V
Sbjct: 33 MEFGWRIVLGSLIGFFGAAFGSVGGVGGGGVFLPMLALIIGFDPKSSAAMSKCMIMGASV 92
Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
S V NL+ +HP+LDIP+IDYDLA+L QP+L+LGISIGV FNV+F DW++T LLI+LFLV
Sbjct: 93 STVYCNLKLKHPSLDIPLIDYDLAMLIQPMLMLGISIGVIFNVVFPDWLVTALLIILFLV 152
Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKK 251
ST+AFLKGVETWKKETI KR
Sbjct: 153 TSTKAFLKGVETWKKETIKKR--------------------------------------- 173
Query: 252 SKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEA 311
K S+++N+YWKEFGLL VW L LQIA Y+ CS LYW LN LQIPVA GV+ YEA
Sbjct: 174 -KSSLLKNVYWKEFGLLAFVWLAFLGLQIANKYSAYCSALYWALNSLQIPVAVGVTMYEA 232
Query: 312 VALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 371
L G++ ++SKG + ++ QL YC GI AG+VGGLLG+GGGFI+GPLFLELGIP
Sbjct: 233 HGLATGKKVLSSKGSQQNALKSCQLFVYCQFGIIAGLVGGLLGVGGGFIMGPLFLELGIP 292
Query: 372 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 431
PQ AL S+ + ++L L F L F H+L KL +I
Sbjct: 293 PQ----PALCEEVDQSARACFTHHL--------HLGFHDLC-QRNFSWYHIL-KLAEINQ 338
Query: 432 RASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
L I L GGVG++ ++ +IE +YMGF+++C
Sbjct: 339 NP---YGALDSYINNHRLPAGGVGISNIVHKIECHQYMGFESLC 379
>gi|217075755|gb|ACJ86237.1| unknown [Medicago truncatula]
Length = 238
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 160/249 (64%), Positives = 193/249 (77%), Gaps = 12/249 (4%)
Query: 72 IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAV 131
++FGW+IV+G+ IGF G A GSVGGVGGGGIFVPML+LI+GFD KSS A+SKCM+ G A+
Sbjct: 1 MEFGWQIVLGSFIGFCGAAFGSVGGVGGGGIFVPMLSLIIGFDPKSSTAISKCMIMGAAL 60
Query: 132 SAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLV 191
S V YNLR RHPTLD+P+IDYDLALL QP+L+LGISIGV FNV+F DW++T+LLIVLFL
Sbjct: 61 STVYYNLRLRHPTLDMPIIDYDLALLIQPMLMLGISIGVVFNVVFPDWLVTILLIVLFLG 120
Query: 192 LSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKK 251
ST+AF KG+ETW KETI K+EAA++ E G+G EY + T P E K+
Sbjct: 121 TSTKAFFKGLETWNKETIMKKEAARRQESNGSG----------EYKALPTGPNVAIE-KE 169
Query: 252 SKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLNLLQIPVAGGVSAYE 310
+KVSIIEN+YWKEFGLL VW LALQIAK NYTTTCS YW+LNLLQIP+A GV+AYE
Sbjct: 170 NKVSIIENVYWKEFGLLAFVWISFLALQIAKQNYTTTCSAAYWILNLLQIPIAVGVTAYE 229
Query: 311 AVALYKGRR 319
+ +G
Sbjct: 230 QLHCSQGEE 238
>gi|226503253|ref|NP_001145740.1| uncharacterized protein LOC100279247 [Zea mays]
gi|219884241|gb|ACL52495.1| unknown [Zea mays]
gi|414885095|tpg|DAA61109.1| TPA: hypothetical protein ZEAMMB73_890925 [Zea mays]
Length = 291
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 146/239 (61%), Positives = 180/239 (75%), Gaps = 6/239 (2%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y+HVWP ++FGW+IV+G++IGFFG A GSVGGVGGGGIFVPML LI+GFD KSS A+S
Sbjct: 58 NSYQHVWPPMEFGWQIVLGSLIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDPKSSTAIS 117
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G +VS V YNL+ +HPTLD+PVIDYDLA+L QP+L+LGISIGV FNVIF DW++T
Sbjct: 118 KCMIMGASVSTVYYNLKLKHPTLDMPVIDYDLAVLMQPMLMLGISIGVIFNVIFPDWLVT 177
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI-GNGYQTEECKCDPEYLSNDT 241
VLLI+LFL ST+AFLKG+ETWKKET+ +REAAK LE G + P+ +N
Sbjct: 178 VLLIILFLGTSTKAFLKGIETWKKETVIQREAAKLLEQTAGEEAEYAALPTGPDVAANKK 237
Query: 242 TPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQI 300
P S+I+NI+WK+ GLL VW L LQ+ KNYT TCS YWVLNLLQ+
Sbjct: 238 ALTSDEAP-----SLIKNIHWKKVGLLSFVWVAFLVLQVTKNYTATCSPWYWVLNLLQV 291
>gi|388514887|gb|AFK45505.1| unknown [Lotus japonicus]
Length = 290
Score = 277 bits (708), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 153/285 (53%), Positives = 203/285 (71%), Gaps = 8/285 (2%)
Query: 193 STRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE--ETREPK 250
S +AF KGVETWKKETI K E LE T C D +Y +P+ ++ +
Sbjct: 8 SIKAFFKGVETWKKETIIKEETVMLLE------STAPCSGDEQYNLLPGSPDGGPQKKGR 61
Query: 251 KSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYE 310
K +VSI+ NIYWK+FGLL VW L LQIAK YT TCS W++ LLQIPV+ GV Y+
Sbjct: 62 KHEVSILGNIYWKQFGLLSFVWLAYLLLQIAKTYTATCSTTCWMIILLQIPVSVGVFLYK 121
Query: 311 AVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGI 370
A+ LY+G I++KGD+ +W+ L+ AC + AG++GGLLG+GGGF++GPLFLELGI
Sbjct: 122 AIGLYQGWSVISAKGDQAIQWKLHYLILTSACALLAGIIGGLLGIGGGFVMGPLFLELGI 181
Query: 371 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 430
PQV+SATA F +TFS+S+SVV+YYLL RFPVPYALY ++ IAA+ GQ ++ KL+ +
Sbjct: 182 APQVASATATFGMTFSASISVVQYYLLNRFPVPYALYLTLVAAIAAYRGQFIIDKLVNMF 241
Query: 431 GRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
RAS+IIFTL+F IFVSA+ LGG G++ MI +I+ EYMGF+++C
Sbjct: 242 QRASLIIFTLAFTIFVSAIVLGGEGISDMIGQIQRSEYMGFEDLC 286
>gi|302816689|ref|XP_002990023.1| hypothetical protein SELMODRAFT_184956 [Selaginella moellendorffii]
gi|300142334|gb|EFJ09036.1| hypothetical protein SELMODRAFT_184956 [Selaginella moellendorffii]
Length = 356
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 172/378 (45%), Positives = 237/378 (62%), Gaps = 35/378 (9%)
Query: 109 LIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISI 168
L+ GFDAK++ A+SKCM+ S V YNL R + +IDYDLALLFQP+L+LGIS+
Sbjct: 2 LVCGFDAKTAPALSKCMIFAQCASTVAYNLGLRSSQTNSSLIDYDLALLFQPMLLLGISV 61
Query: 169 GVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTE 228
GV FNV+F +W+IT+LLIV+ L +++RAF KG+ETWKKET KR LI G T
Sbjct: 62 GVTFNVLFPNWLITLLLIVVSLAMASRAFSKGLETWKKETNEKR-------LILEGSLTP 114
Query: 229 ECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTC 288
P +N TT + S+ + WK+ LL AVW +I LQ+ K YT C
Sbjct: 115 ----GP---ANFTTLD----------SLWTTVEWKKLSLLFAVWCLITGLQVLKAYTANC 157
Query: 289 SVLYWVLNLLQIPVAGGVSAYEAVAL-----YKGRRKIASKGDEGTKW------RASQLV 337
S +W+ N+LQ PV V+ +A+ + YK R + + E + RA +
Sbjct: 158 STAFWIYNILQAPVTLAVTVTQALRMREHSSYKLRESLLDESSESSSASLKAAGRALDVF 217
Query: 338 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 397
Y G+ AG + GLLG+GGG LGP+ LE G+PPQ +SATA A+ FSSS+SVVE+Y L
Sbjct: 218 RYVFFGVLAGSIAGLLGVGGGATLGPIMLEFGVPPQTASATATLAMLFSSSLSVVEFYFL 277
Query: 398 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 457
R V YALYF A+ I++AF+ Q +++KL+ +LGR S+I FTL F+I VS +SLGGVG+
Sbjct: 278 GRIKVSYALYFGAICIVSAFLSQKLVQKLLNLLGRTSVITFTLVFVIVVSVISLGGVGIV 337
Query: 458 KMIKRIEHKEYMGFDNIC 475
I+ ++ +YMGF ++C
Sbjct: 338 DSIQELKRGKYMGFGSLC 355
>gi|357440251|ref|XP_003590403.1| hypothetical protein MTR_1g061710 [Medicago truncatula]
gi|355479451|gb|AES60654.1| hypothetical protein MTR_1g061710 [Medicago truncatula]
Length = 558
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 212/316 (67%), Gaps = 11/316 (3%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
SG +WP+++ W++V+ T+IGFFG+ACG+VGGVGGGGIFVPMLNL++GFD KS+AA+S
Sbjct: 49 SGTDRIWPELEPSWKLVLATVIGFFGSACGTVGGVGGGGIFVPMLNLLLGFDTKSAAALS 108
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
KCM+ G + S+V YNLR HPT +P++DYDLALLFQP+L+LGI++GV +V+F W+IT
Sbjct: 109 KCMIMGASASSVWYNLRVAHPTKQVPILDYDLALLFQPMLMLGITVGVTLSVVFPFWLIT 168
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT 242
VL+I+LF+ S R+F KG+E W+ ET+ K+E A+Q L+ + + + D ++
Sbjct: 169 VLIIILFIGTSLRSFFKGIEMWRVETLLKKETAEQAALVNSNGEFKAVLVDTKF-----E 223
Query: 243 PEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPV 302
P +E K + + N+ WK +L+++W L +QI K + CSV YWVL LQ P+
Sbjct: 224 PLIPKEQKSTMQILCFNLRWKRILVLLSIWFSFLLIQIIKTHVEDCSVWYWVLFGLQFPI 283
Query: 303 AGGVSAYEAVALYKGRRKIASKG------DEGTKWRASQLVFYCACGITAGMVGGLLGLG 356
A V +EAV LYK R G + +W A+ L F GI G+VGGLLG G
Sbjct: 284 AFLVFGFEAVKLYKDHRMRMGTGSLEYICEASIEWSATSLAFCAMSGIVGGIVGGLLGSG 343
Query: 357 GGFILGPLFLELGIPP 372
GGFILGPL LE+G+ P
Sbjct: 344 GGFILGPLLLEIGVIP 359
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/132 (43%), Positives = 77/132 (58%), Gaps = 28/132 (21%)
Query: 374 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 433
V+SATA F + FSSS+SV E+Y+LKRFP+PYALY +S++A F GQ ++KL+ LGRA
Sbjct: 427 VASATATFVMMFSSSLSVFEFYILKRFPMPYALYLITVSVLAGFWGQFFVRKLVAFLGRA 486
Query: 434 SIIIFTLSFIIFVSALSLGG----------------------------VGLAKMIKRIEH 465
S+I+F LS +IF SAL++G VG K I+ I +
Sbjct: 487 SVIVFILSGVIFASALTMGNNGNLPQFADLYKMILAQISMLYIFISGIVGTEKSIEMIHN 546
Query: 466 KEYMGFDNICYT 477
E+MGF C T
Sbjct: 547 HEFMGFLGFCST 558
>gi|302785269|ref|XP_002974406.1| hypothetical protein SELMODRAFT_414563 [Selaginella moellendorffii]
gi|300158004|gb|EFJ24628.1| hypothetical protein SELMODRAFT_414563 [Selaginella moellendorffii]
Length = 380
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 177/383 (46%), Positives = 230/383 (60%), Gaps = 26/383 (6%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
+FVP+ L++GFD K+SAA+SK M+ GGAV++ +YNL +HP LD P+IDYDLALL QP+
Sbjct: 13 LFVPLFRLLIGFDVKTSAALSKSMIMGGAVASALYNLPSKHPVLDKPLIDYDLALLIQPM 72
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ-LEL 220
L+LGISIGV NV+F DW++TVLL+ + ++ + F KG TW E+ K E
Sbjct: 73 LLLGISIGVMCNVMFPDWILTVLLVAVLTAMAFKTFNKGACTWSTESEQKAHCCDTGEED 132
Query: 221 IGNGYQTEECKCDPEYL---SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILA 277
G+ +TE + L S PE + +I W LL VW + L
Sbjct: 133 FGSNSRTEPSSSLEQGLLAKSEQCAPE----------GLSSSIKWANICLLCTVWTVYLI 182
Query: 278 LQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRAS--- 334
LQ+ K+ TC LYW+LN+LQ PV+ G +A G +I +GD G + A+
Sbjct: 183 LQLLKSGAATCGRLYWILNVLQAPVSIGATAI-------GVWRIYRRGDFGKEKPAAGPT 235
Query: 335 --QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVV 392
QL Y + AG+VGGLLG+GGG IL PLFLELGI PQV+SAT F + FSSSMSVV
Sbjct: 236 CEQLFLYPLYAVLAGVVGGLLGIGGGMILAPLFLELGIIPQVTSATTTFIVVFSSSMSVV 295
Query: 393 EYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLG 452
E+YLL R PV +A YF L IAA VG H K I+ GR S IIF L+ II SA+ LG
Sbjct: 296 EFYLLGRLPVRFAAYFTTLCGIAALVGLHATKLFIRRYGRTSFIIFILAAIIGSSAIILG 355
Query: 453 GVGLAKMIKRIEHKEYMGFDNIC 475
+G I+R EYMGF ++C
Sbjct: 356 IIGGFADIRRYIAGEYMGFHSLC 378
>gi|302808025|ref|XP_002985707.1| hypothetical protein SELMODRAFT_446339 [Selaginella moellendorffii]
gi|300146616|gb|EFJ13285.1| hypothetical protein SELMODRAFT_446339 [Selaginella moellendorffii]
Length = 392
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 185/407 (45%), Positives = 242/407 (59%), Gaps = 26/407 (6%)
Query: 78 IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
+V+G+++ F G + GVGGGG+FVP+ L++GFD K+SAA+SK M+ GGAV++ +YN
Sbjct: 1 MVLGSVVAFLGAMFANAAGVGGGGLFVPLFRLLIGFDVKTSAALSKSMIMGGAVASALYN 60
Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
L +HP LD P+IDYDLALL QP+L+LGISIGV NV+F DW++TVLL+ + ++ + F
Sbjct: 61 LPSKHPVLDKPLIDYDLALLIQPMLLLGISIGVMCNVMFPDWILTVLLVAVLTGMAFKTF 120
Query: 198 LKGVETWKKETITKREAAKQLE--LIGNGYQTEECKCDPEYL--SNDTTPEETREPKKSK 253
KG TW E+ K E + N + L S PE
Sbjct: 121 NKGACTWSTESEQKAHCCDTGEEGFVSNSRTEPSSSLEQGLLAKSEQCAPE--------- 171
Query: 254 VSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVA 313
+ +I W LL VW + L LQ+ K+ TC LYW+L +LQ PV+ G +A
Sbjct: 172 -GLPSSIKWANICLLCTVWTVYLILQLLKSGAATCGRLYWILTVLQAPVSIGATAI---- 226
Query: 314 LYKGRRKIASKGDEGTKWRAS-----QLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL 368
G +I +GD G + AS QL Y + AG+VGGLLG+GGG IL PLFLEL
Sbjct: 227 ---GVWRIYRRGDFGKEKPASGPTCEQLFLYPLYAVLAGVVGGLLGIGGGMILAPLFLEL 283
Query: 369 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
GI PQV+SAT F + FSSSMSVVE+YLL R PV +A YF L IAA VG H K I+
Sbjct: 284 GIIPQVTSATTTFIVVFSSSMSVVEFYLLGRLPVRFAAYFTTLCGIAALVGLHATKLFIR 343
Query: 429 ILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GR S IIF L+ II SA+ LG +G I+R EYMGF ++C
Sbjct: 344 RYGRTSFIIFILAAIIGSSAIILGIIGGFADIRRYIAGEYMGFHSLC 390
>gi|16660290|gb|AAL27555.1|AF420410_1 hypothetical protein [Musa acuminata AAA Group]
Length = 238
Score = 241 bits (614), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 136/196 (69%), Positives = 172/196 (87%), Gaps = 1/196 (0%)
Query: 276 LALQIAK-NYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRAS 334
L LQ+ K NYT+TCS+ YW+LNLLQ+PV+ GVS YEAV+LY+G+R I+SKG EGT +
Sbjct: 2 LILQVLKQNYTSTCSLWYWILNLLQVPVSLGVSGYEAVSLYRGKRIISSKGLEGTDFTVI 61
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
QLVFYC G+ AG+VGGLLGLGGGFILGP+FLELG+PPQVSSATA FA+TFSSSMSVVEY
Sbjct: 62 QLVFYCLIGVLAGVVGGLLGLGGGFILGPVFLELGVPPQVSSATATFAMTFSSSMSVVEY 121
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 454
YLLKRFP+PYALYF +++++AAFVGQH++K+LI+ILGRAS+IIF L+ IF+SA+SLGGV
Sbjct: 122 YLLKRFPIPYALYFVSVALVAAFVGQHLVKRLIEILGRASLIIFILASTIFISAISLGGV 181
Query: 455 GLAKMIKRIEHKEYMG 470
G++ M+++I+H E G
Sbjct: 182 GISNMVQKIQHHESWG 197
>gi|359482297|ref|XP_003632751.1| PREDICTED: uncharacterized protein LOC100240792 isoform 2 [Vitis
vinifera]
Length = 369
Score = 238 bits (607), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 142/269 (52%), Positives = 185/269 (68%), Gaps = 9/269 (3%)
Query: 59 PKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSS 118
P S + VWP++KF WRIV+ T+IGF G+A G+VGGVGGGGIFVPML LIVGFD KS+
Sbjct: 51 PSVSSSSEKVWPELKFSWRIVLATVIGFLGSAFGTVGGVGGGGIFVPMLTLIVGFDTKSA 110
Query: 119 AAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFAD 178
AA+SKCM+ G + S+V YNLR HPT + P+IDYDLALLFQP+L+LGI++GVA +V+F
Sbjct: 111 AALSKCMIMGASTSSVWYNLRVNHPTKEAPIIDYDLALLFQPMLMLGITVGVALSVVFPY 170
Query: 179 WMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLS 238
W+IT+L+I+LF+ STR+F K V+ W +ET+ K+E +Q + N + E D EY
Sbjct: 171 WLITILIIILFMGTSTRSFFKAVQMWNEETLFKKELEEQRRTMVNSHG--ELLIDAEY-- 226
Query: 239 NDTTPEETREPKKSKVSII-ENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNL 297
+ P E +KS + I+ N+ WK +LV VW I L LQI KN +T CS YWVL L
Sbjct: 227 DPLIPNE----EKSWIQILCSNLRWKRILVLVTVWVIFLLLQIFKNDSTVCSAWYWVLFL 282
Query: 298 LQIPVAGGVSAYEAVALYKGRRKIASKGD 326
LQ PVA V YEAV L+K +K G+
Sbjct: 283 LQFPVAVVVFGYEAVKLHKEYKKRIISGN 311
>gi|255645402|gb|ACU23197.1| unknown [Glycine max]
Length = 261
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 125/193 (64%), Positives = 155/193 (80%), Gaps = 9/193 (4%)
Query: 66 KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
VWP+++FGWRIVVG+I+GFFG A GSVGGVGGGGIF+PML L++GFDAKSS A+SKCM
Sbjct: 71 DRVWPEMEFGWRIVVGSIVGFFGAALGSVGGVGGGGIFIPMLTLVIGFDAKSSTALSKCM 130
Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLL 185
+ G AVS V YNLR RHPTLD+PVIDYDLALLFQP+L+LGISIGVAFNV+FADWM+TVLL
Sbjct: 131 IMGAAVSTVYYNLRFRHPTLDLPVIDYDLALLFQPMLMLGISIGVAFNVMFADWMVTVLL 190
Query: 186 IVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGN-GYQTEECKCDPEYLSNDTTPE 244
I+LF+ ST+A KG++TWKKETI K+EAAK LE + GY +EE +Y S P
Sbjct: 191 IILFIATSTKALFKGIDTWKKETIMKKEAAKMLESDSSPGYVSEE-----DYKS---LPA 242
Query: 245 ETREPKKSKVSII 257
+ +P+ +V ++
Sbjct: 243 GSADPRDEEVPLL 255
>gi|356538176|ref|XP_003537580.1| PREDICTED: uncharacterized protein LOC100817312 [Glycine max]
Length = 198
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 125/194 (64%), Positives = 158/194 (81%)
Query: 282 KNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCA 341
+ Y+ CS+ +W+LN LQ+P+A V+ +EA+ LYKG R IASKG E T W+ Q+ YC+
Sbjct: 2 QEYSKPCSIHFWLLNFLQVPIAVSVTLFEAIGLYKGTRVIASKGKEVTNWKIHQICLYCS 61
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
GI AGMVGGLLGLGGGFILGPLFLELGIPPQV+SAT+ FA+ FSSSMSVV+YYLL RFP
Sbjct: 62 TGIMAGMVGGLLGLGGGFILGPLFLELGIPPQVASATSTFAMVFSSSMSVVQYYLLDRFP 121
Query: 402 VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK 461
VPYA YF ++ IAAF GQHV++K+I +LGRASIIIF L+ IF+SA+SLGGVG+ +I+
Sbjct: 122 VPYASYFALVATIAAFTGQHVVRKVIVVLGRASIIIFILALTIFISAISLGGVGIENIIE 181
Query: 462 RIEHKEYMGFDNIC 475
+IE EYMGF+++C
Sbjct: 182 KIESHEYMGFEDLC 195
>gi|147812176|emb|CAN61520.1| hypothetical protein VITISV_033969 [Vitis vinifera]
Length = 382
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 139/304 (45%), Positives = 176/304 (57%), Gaps = 67/304 (22%)
Query: 22 LVVASVSVSAQSNLKPEATSKN--KNDHVEPNIVIKVSA---PKSRSGYKHVWPDIKFGW 76
++AS VS + ++K EA++ N K N + KV RSGY+HVWP+++FGW
Sbjct: 80 FMLASAFVSGERSIKHEASTFNVTKEAGFNSNYLSKVVNFLWQSDRSGYQHVWPEMEFGW 139
Query: 77 RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVY 136
+IVVG+IIGFFG A GSVGGVGGGGIFVPML+L++GFD KS+ A+SKCM+ G A S V Y
Sbjct: 140 QIVVGSIIGFFGAAFGSVGGVGGGGIFVPMLSLVIGFDPKSATALSKCMIMGAAGSTVYY 199
Query: 137 NLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRA 196
NL+ RHPTLD+P+IDYDLALLFQP+L++GISIGVAFNV+FADWM
Sbjct: 200 NLKLRHPTLDMPIIDYDLALLFQPMLMMGISIGVAFNVLFADWM---------------- 243
Query: 197 FLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 256
EAAK+L GNG + E K P SN T + +
Sbjct: 244 ----------------EAAKRLGTNGNGTEEVEYKPLPSGPSNGTQXATNKSKE------ 281
Query: 257 IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYK 316
+N T TCS+ YWVLN +QIPV+ GVS YEAV+L +
Sbjct: 282 ------------------------XENNTATCSMAYWVLNFMQIPVSVGVSLYEAVSLVQ 317
Query: 317 GRRK 320
G
Sbjct: 318 GTEN 321
>gi|302771113|ref|XP_002968975.1| hypothetical protein SELMODRAFT_170159 [Selaginella moellendorffii]
gi|300163480|gb|EFJ30091.1| hypothetical protein SELMODRAFT_170159 [Selaginella moellendorffii]
Length = 309
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 205/330 (62%), Gaps = 35/330 (10%)
Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAK 216
+ QP+L+LGIS+GV FNV+F +W+IT+LLIV+ L +++RAF KG+ETWKKET KR
Sbjct: 3 VLQPMLLLGISVGVTFNVLFPNWLITLLLIVVSLAMASRAFSKGLETWKKETNEKR---- 58
Query: 217 QLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL 276
LI G T P +N TT + S+ + WK+ LL AVW +I
Sbjct: 59 ---LILEGSLTP----GP---ANFTTLD----------SLWTTVEWKKLSLLFAVWCLIT 98
Query: 277 ALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL-----YKGRRKIASKGDEGTKW 331
LQ+ K YT CS +W+ N+LQ PV V+ +A+ + YK R + + E +
Sbjct: 99 GLQVLKAYTANCSTAFWIYNILQAPVTLAVTVTQALRMREHSSYKLRESLLDESSESSSA 158
Query: 332 ------RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITF 385
RA + Y G+ AG + GLLG+GGG LGP+ LE G+PPQ +SATA A+ F
Sbjct: 159 SLKAAGRALDVFRYVFFGVLAGSIAGLLGVGGGATLGPIMLEFGVPPQTASATATLAMLF 218
Query: 386 SSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIF 445
SSS+SVVE+Y L R V YALYF A+ I++AF+ Q +++KL+ +LGR S+I FTL F+I
Sbjct: 219 SSSLSVVEFYFLGRIKVSYALYFGAICIVSAFLSQKLVQKLLNLLGRTSVITFTLVFVIV 278
Query: 446 VSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
VS +SLGGVG+ I+ ++ +YMGF ++C
Sbjct: 279 VSVISLGGVGIVDSIQELKRGKYMGFGSLC 308
>gi|222625716|gb|EEE59848.1| hypothetical protein OsJ_12424 [Oryza sativa Japonica Group]
Length = 465
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 192/339 (56%), Gaps = 23/339 (6%)
Query: 146 DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK 205
PV + A+ P L +G + N + T S+R+F KG+ WK
Sbjct: 139 HCPVFCTNSAIASAPPLKFSTKVGSSRNACVTNSTGT----------SSRSFYKGILMWK 188
Query: 206 KETITKREAAKQLELIGNGYQTEECKCDP---EYLSNDTTPEETREPKKSKVSIIENIYW 262
ET + E ++ E + + DP E L P+E K + + + N+ W
Sbjct: 189 DETRIQMETREREEESKSSCAARDVVIDPSCEEPLLCQPQPKE----KSALETFLFNLRW 244
Query: 263 KEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIA 322
K +L+ VW+ L LQI KN + +CS YWV+N+LQ+PVA V +E V L + R
Sbjct: 245 KNILVLMTVWSSFLVLQIFKNNSQSCSTFYWVINILQVPVALSVFLWEGVQLCRESRARR 304
Query: 323 SKG------DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS 376
G + +W +QL+F CG+ G VGGLLG GGGFILGPL LELG PQV+S
Sbjct: 305 MDGNWECVCEASIEWSPAQLIFCAFCGLLGGTVGGLLGSGGGFILGPLLLELGCIPQVAS 364
Query: 377 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASII 436
ATA F + FSSS+SVVE+Y L RFP+P+A+Y +SI+A F GQ +++KL+ +L RAS+I
Sbjct: 365 ATATFVMMFSSSLSVVEFYFLNRFPIPFAVYLICISILAGFWGQSLVRKLVHVLKRASLI 424
Query: 437 IFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+F LS +IF SAL++G VG K I I + EYMGF + C
Sbjct: 425 VFILSSVIFASALTMGVVGTQKSISMINNHEYMGFLDFC 463
>gi|168019909|ref|XP_001762486.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686219|gb|EDQ72609.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 390
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 140/408 (34%), Positives = 232/408 (56%), Gaps = 34/408 (8%)
Query: 83 IIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH 142
++G G + S GGVGGGG+F+P+ NL++GFDAKSSAA+S M+ GG+++ V +N+++ H
Sbjct: 2 VLGTLGASICSAGGVGGGGLFIPLFNLLIGFDAKSSAALSNFMIFGGSIANVWWNIQRDH 61
Query: 143 PTLD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGV 201
P L P+ID+D+ LL QP ++LGISIGV NV F W IT+ I+ ++ R+F G+
Sbjct: 62 PFLPGHPLIDFDVVLLLQPNMLLGISIGVICNVAFPSWFITLEFIITLGYITARSFRSGL 121
Query: 202 ETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETRE---PKKSKVSIIE 258
W+ ET +R D + + P+E E P +
Sbjct: 122 VRWRNETPLER-------------------LDSKLSWQNLCPKEGDEAIVPLLGESKPPR 162
Query: 259 NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYE 310
N + + +L VW LA+Q+ + ++ S+L YW+L +Q+P+A ++ +
Sbjct: 163 NFPYVKLLMLTLVWTAFLAVQLLRGGKSSDSILTLEPCGGAYWLLTTMQVPLAFLLTGWS 222
Query: 311 AVALYKGRRKIASKGDEGTKWR---ASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE 367
A L+ + + E +W V + + + AG +GG+LG+GGG I+ P+ +E
Sbjct: 223 AWHLHHALEEKNTPELELEEWDVIGPRAFVLFPSMALLAGFLGGMLGIGGGMIINPMLIE 282
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 427
LG+ PQ+++AT F + FSSS+SVV+++LL R + +AL F A+ +++ +G HV+++ I
Sbjct: 283 LGMHPQLTAATTAFMVFFSSSLSVVQFWLLGRLQLDFALLFGAICFVSSLIGLHVVQQAI 342
Query: 428 KILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GR SII+F++S ++ +SA+S G ++ + EYMGF C
Sbjct: 343 AKFGRPSIIVFSVSIVLGISAVSTTICGGFEVWDQFTGGEYMGFHYPC 390
>gi|168044376|ref|XP_001774657.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673957|gb|EDQ60472.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 380
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 134/384 (34%), Positives = 218/384 (56%), Gaps = 27/384 (7%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQP 160
+F+P+ NL++ FD K+SAA+S ++ GG+V+ +++NL QRHP+L IDYD+AL+ QP
Sbjct: 14 LFIPVFNLLLLFDPKTSAALSNFVILGGSVANLIWNLPQRHPSLPHKSAIDYDVALILQP 73
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
++LGISIGV NV+F W+I V L ++ ++TR++ G++ W+ E+ QL
Sbjct: 74 NMLLGISIGVICNVMFPGWLIIVQLALILGFITTRSWKNGIKRWRIES--------QLA- 124
Query: 221 IGNGYQTEECKCDPE-YLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQ 279
+TEE + E L +D P K+ + + + L VW +Q
Sbjct: 125 ---ALKTEEANAESEGPLESDNLHAPLLAPVKTPLE--PSCLGSKVMCLGLVWVAFFVIQ 179
Query: 280 IAKNYTTT--------CSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKW 331
+ + TT C V YW+L L QIP+A V+ + A ++ + + K D+G
Sbjct: 180 LLRGGKTTEGILPLKSCGVGYWLLTLTQIPLACFVTLWTA---FRHTQCSSDKQDQGEIT 236
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
R L + + AG+ GG+LG+GGG + PL +E G+ QV++AT F + FSSS+SV
Sbjct: 237 RHRALTVFPGMALVAGLWGGMLGIGGGMFMNPLLIEAGVHVQVTAATTAFMVFFSSSLSV 296
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
VE++LL R PV +A+ F ++ +A+ +G V+ I GRASII+F+++ + +SA+ +
Sbjct: 297 VEFWLLGRIPVDFAIVFSSICFVASLIGLTVVHHAISKYGRASIIVFSVAIALGISAVMM 356
Query: 452 GGVGLAKMIKRIEHKEYMGFDNIC 475
G G + ++ + YMGF C
Sbjct: 357 AGFGSLNVYRQYKDGAYMGFHTPC 380
>gi|290979999|ref|XP_002672720.1| hypothetical protein NAEGRDRAFT_81115 [Naegleria gruberi]
gi|284086299|gb|EFC39976.1| hypothetical protein NAEGRDRAFT_81115 [Naegleria gruberi]
Length = 610
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 130/460 (28%), Positives = 224/460 (48%), Gaps = 52/460 (11%)
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
+K ++P +F + + T+I F G+ ++ G+GGGG+ VP++ F + + +SK
Sbjct: 154 HKDLFP--QFSYLDAIATVISFLGSVLSTMAGIGGGGLIVPLMETAGQFPPQMAVGISKT 211
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M+ G ++S + +RHP D P+I+YD+ALL QP ++G+ +GV N + +W+I +L
Sbjct: 212 MIFGASISNFIALSLKRHPHADRPLINYDVALLLQPTSLVGVLVGVLLNSMTPNWLIVLL 271
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKR------EAAKQLELIGNGYQTEECKCD----- 233
++ ++S F++ W+ E+ K +A ++ NG + D
Sbjct: 272 SAIILTIVSLTTFVRAGRMWRAESAAKLVGSNNGSSANYHQITDNGVNNDTIISDDEDET 331
Query: 234 ---PEYLSNDTTPE-----------------ETREPKKSKVSIIENIYWKEFG---LLVA 270
P+ +N+ E E E K+ ++ N F +LV
Sbjct: 332 FDRPKNTNNENVLETSLLGEDELEQMEQENIEKEEKKQIARKVLHNEKSTPFTKLFVLVL 391
Query: 271 VWAIILALQIAKN--------YTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL---YKGRR 319
W I+ L I + + CSV +WVL P+ VS V L +K R
Sbjct: 392 CWVIVFILTIVRGGHGAPSVFHIDHCSVWHWVLFAAMFPIMIAVSYMVGVYLVRKHKERS 451
Query: 320 KIASKGDE----GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 375
+ S G W ++ Y +G++ G+LG+GGG I P+ L LG+ P V+
Sbjct: 452 ALVSHGYRFVAGDPHWSVGTVILYPVFSALSGVIAGMLGVGGGMIKSPMLLYLGLDPLVA 511
Query: 376 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASI 435
+ATA F + F+SS+S +++ +L P Y L++F + A VGQ +L + GR S+
Sbjct: 512 AATASFMMLFTSSISSIQFVILGTVPFDYGLWYFVFGLFAGMVGQ-ILMHFVFQKGRKSV 570
Query: 436 IIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++F ++FII VS L + G G+ + +++ YMGF +IC
Sbjct: 571 LVFIVAFIIIVSTLCMTGFGIYNAVIMLQNNMYMGFHSIC 610
>gi|4415908|gb|AAD20139.1| unknown protein [Arabidopsis thaliana]
gi|20197925|gb|AAM15316.1| unknown protein [Arabidopsis thaliana]
Length = 288
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 92/156 (58%), Positives = 128/156 (82%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
S S + +WPD+KF W++V+ T+I F G+ACG+VGGVGGGGIFVPML LI+GFD KS+A
Sbjct: 42 SSLSATEKIWPDLKFSWKLVLATVIAFLGSACGTVGGVGGGGIFVPMLTLILGFDTKSAA 101
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+ G + S+V YN+R RHPT ++P++DYDLALLFQP+L+LGI++GV+ +V+F W
Sbjct: 102 AISKCMIMGASASSVWYNVRVRHPTKEVPILDYDLALLFQPMLLLGITVGVSLSVVFPYW 161
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAA 215
+ITVL+I+LF+ S+R+F KG+E WK+ET+ K A
Sbjct: 162 LITVLIIILFVGTSSRSFFKGIEMWKEETLLKFPVA 197
>gi|356566531|ref|XP_003551484.1| PREDICTED: uncharacterized protein LOC100809802 [Glycine max]
Length = 254
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/186 (53%), Positives = 125/186 (67%), Gaps = 20/186 (10%)
Query: 124 CMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITV 183
C++ VS V YNL+ RHPTL++P+IDYDLALL QP+L+LGI+IGV FNV+F+ W++T+
Sbjct: 3 CLLVIYVVSTVYYNLKLRHPTLNMPIIDYDLALLIQPMLMLGITIGVVFNVVFSYWIVTI 62
Query: 184 LLIVLFLVLSTRAFLKGVETWKKETITK-----REAAKQLELIGNG-YQTEECKCDPEYL 237
LLIVLFL ST++ KG+ETWKKETI K +EA K + NG + K P+
Sbjct: 63 LLIVLFLGTSTKSLFKGIETWKKETIIKKDQFTKEAGKH--PVSNGEFGPNHIKVLPDQF 120
Query: 238 SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK-NYTTTCSVLYWVLN 296
+D + V+IIEN+YWKEFGLLV VW ALQI K NYTTTC YW LN
Sbjct: 121 LSDVS-----------VAIIENVYWKEFGLLVFVWVSFPALQIGKENYTTTCPTFYWALN 169
Query: 297 LLQIPV 302
LLQI V
Sbjct: 170 LLQIMV 175
>gi|290999413|ref|XP_002682274.1| predicted protein [Naegleria gruberi]
gi|284095901|gb|EFC49530.1| predicted protein [Naegleria gruberi]
Length = 399
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 129/411 (31%), Positives = 206/411 (50%), Gaps = 29/411 (7%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++ TI F G + G GGGG+FVP+L++ F + +S M+ G + +
Sbjct: 3 ILATITCFIGGVLSAASGTGGGGVFVPLLHVAGQFPPTLAIPISTLMIFGAGIINIATLS 62
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
+RHP D P+IDYD+AL+ +P +LG IGV FN++F DW+I V +I+ + S F
Sbjct: 63 FKRHPHADRPLIDYDIALMMEPPTLLGTIIGVFFNMMFPDWLIVVFVILTLSITSFVMFK 122
Query: 199 KGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTP-EETREPKKSKVSII 257
G + K+E E K+LE TEE D +N TT +E ++ +S+
Sbjct: 123 NGYKRLKRE----NEERKKLE----ESTTEENHNDN---NNPTTVYDELQQMYESE---- 167
Query: 258 ENIYWKEFGLLVAVWAIILALQIAKN---------YTTTCSVLYWVLNLLQIPVAGGVS- 307
+ + + +L W + L + K T CSV YW+L L P+ G ++
Sbjct: 168 KRTPFGKVIVLFICWMTVFTLSLLKGGHGAPSIIPSVTQCSVGYWILTALSFPLLGAMTL 227
Query: 308 --AYEAVALYKGRRKIASKGDEG-TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 364
+ + ++ + +I + EG W + Y + AG++ +LG+GGG + PL
Sbjct: 228 GIVFYLLKKHERKVQIGYQFVEGDVHWNKYNVTLYPIACVGAGILASMLGIGGGMVKSPL 287
Query: 365 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
L LG P S AT F I F+SS+S V+Y + PV Y L+F A I+ GQ +L
Sbjct: 288 LLILGSDPVSSQATTSFMILFTSSISTVQYLIAGLLPVDYGLWFLACGILCGVFGQLILD 347
Query: 425 KLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ GR SI+IF ++ + + +GG G+ ++K++E YMGF + C
Sbjct: 348 LWLDKSGRRSIMIFIVAIVTLAATFLMGGAGIYDVVKQVEKGVYMGFRSPC 398
>gi|384245840|gb|EIE19332.1| hypothetical protein COCSUDRAFT_44684 [Coccomyxa subellipsoidea
C-169]
Length = 495
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 131/466 (28%), Positives = 211/466 (45%), Gaps = 53/466 (11%)
Query: 57 SAPKSRSGYKHVW---PDIKFGW--RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIV 111
A +S SG + P+ KF W R + + + G+GGG +VP+LN+++
Sbjct: 35 EASRSDSGLFQILGTGPNTKFEWTPRTIFAAALACVCALLANSAGIGGGPFYVPLLNVVL 94
Query: 112 GFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT-LDIPVIDYDLALLFQPLLVLGISIGV 170
GFD K++ +S +V AV++ +Y L Q HP P++D+D+AL F P L+LG+S GV
Sbjct: 95 GFDLKAATGLSHTIVATSAVASSIYGLIQTHPNDPSRPLVDFDVALTFIPALLLGVSFGV 154
Query: 171 AFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE--TITKREAAKQLELIGNGYQTE 228
NV+ DW+ T LL VL L + + KG+ W++E I ++ +A Q +L G+ E
Sbjct: 155 LLNVLVPDWLQTALLTVLLLFVINKTVRKGITQWRQEQKAIKQKRSAAQQDL-GDEDDEE 213
Query: 229 ECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLL-------------------V 269
+ + N P + S V ++ + F L V
Sbjct: 214 GVLHEERFERN---PSKRFSAPHSSVHQLQTTLSQIFHRLPLFKALRKVCETCVQMAAVV 270
Query: 270 AVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE-- 327
A+WA+ LA Q K C+ Y+ + Q+ V+A+ Y+ ++ DE
Sbjct: 271 ALWAVFLAFQQLKARYPNCTWQYFTIFAAQVIFLLSVTAF--CIWYEAKKAAGPHADEMD 328
Query: 328 ------------------GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 369
GT +L AG GLLG+GG I P+ L+LG
Sbjct: 329 PELRTVILGEQSDSETPIGTADTYKRLAKVVGVMAFAGFTAGLLGIGGALIFNPVLLQLG 388
Query: 370 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 429
+ PQV+++T++ I F+SS + +Y YAL L +A+ +G V+ ++I+
Sbjct: 389 VQPQVTASTSVLMILFTSSAIALSFYFQGLLNTSYALVLAPLCFVASLIGVTVVGRIIRA 448
Query: 430 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GR SII+ LS +I + G + + I +GF C
Sbjct: 449 SGRVSIIVLLLSALIIAGTVLTAFFGGIRAVNDIRDGAPIGFKPFC 494
>gi|290977704|ref|XP_002671577.1| predicted protein [Naegleria gruberi]
gi|284085147|gb|EFC38833.1| predicted protein [Naegleria gruberi]
Length = 537
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 124/431 (28%), Positives = 212/431 (49%), Gaps = 40/431 (9%)
Query: 83 IIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH 142
+ F G+ S GG GGGG+FVP+L+++ F+A+ + +SK M+ G AV+ V + H
Sbjct: 107 LTNFIGSVLSSAGGTGGGGVFVPLLHVLGRFNAQEAVPLSKVMIFGAAVTNVFTLFFRSH 166
Query: 143 PTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF---LVLSTRAFLK 199
P D P+IDYD+AL+ +P +LG IGV N+I +W+I + +I++ +L+ R F +
Sbjct: 167 PYADRPLIDYDIALMMEPATLLGTIIGVFLNIICPEWVIVLSVIIVLTITTILTFRKFFQ 226
Query: 200 -----------------GVETWKKET---ITKREAAKQLELIGNGYQTEEC--KCDPEYL 237
E KE +++ + +Q + G+ + EE P ++
Sbjct: 227 RARVEFAFLWKKKKKDEETEPLNKEQEPIVSENKEEEQAQGYGSVNKEEESLVNTQPIFV 286
Query: 238 SNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT---------TTC 288
S ++ +P + + + YWK F +L+ W II L + + C
Sbjct: 287 SQQVAEKDFVKPSAWTI-VKKTPYWKIF-VLIVCWIIIFTLSLLRGGEGAPSVIPGLEMC 344
Query: 289 SVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG----DEGTKWRASQLVFYCACGI 344
S +YW L L P+ G + A L R+ +G KW + FY +
Sbjct: 345 SPVYWTLVGLSFPIIGAIMIIVAAYLLIDYRRKVKRGHVFVQGDVKWNWINVTFYPGACL 404
Query: 345 TAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 404
AG++ +LG+GGG I PL L LG P V +ATA F I F+SS+S ++ ++ R P Y
Sbjct: 405 IAGILAAMLGIGGGMIKSPLLLLLGSDPAVGAATAAFMIFFTSSISSAQFAIVGRIPFDY 464
Query: 405 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 464
+ + ++ FVG + ++ G+ S+II + ++ + + +GGVG+ ++ ++
Sbjct: 465 GMLYGLTGFVSGFVGYFFVTFGVERWGKRSLIILCVGLVLLFATMLMGGVGIYDVVIDLQ 524
Query: 465 HKEYMGFDNIC 475
YMGF + C
Sbjct: 525 QGVYMGFHDPC 535
>gi|407038050|gb|EKE38923.1| hypothetical protein ENU1_147930 [Entamoeba nuttalli P19]
Length = 459
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 129/446 (28%), Positives = 221/446 (49%), Gaps = 50/446 (11%)
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
F W+++VG+I F + G+GGG ++ + LI+ D + +SK G A
Sbjct: 7 FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66
Query: 134 VVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
+ +RHP + P+I Y AL+ +PL + G IGV FN+I W+I ++L++L
Sbjct: 67 FLILWMKRHPNVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVLLLGFT 126
Query: 193 STRAFLKGVETWKKETITKREAAKQLELI--------------------GNG-------- 224
S + F K ++ WK E KR+AAK EL+ GN
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDAAKATELVETSKPDIADNDNDDMKPSENGNNAVIVDERV 185
Query: 225 YQTEECK-CDPEYLSNDTTPEETREPKKS------KVSIIENIYWKEFGLLVAVWAIILA 277
+ EE + P+ L D + E +E KK K II+ I G+L+ VWA++
Sbjct: 186 QEEEEGQGSGPKLLPQDESQEAQKEAKKIEEKTLLKREIIKAIL--SVGILIVVWAVMFF 243
Query: 278 LQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT 329
+ I K S++ YWVL + P+ V+ + L+ +R +G+
Sbjct: 244 IVILKGGEKMDSIVGIECGTPWYWVLTAIGGPLMLAVTIVVGIFLWWRQRGEEVQGE--V 301
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
+W + AG+ LG+GGG ++GP+ LE+G+ PQV++AT+ F I F++S
Sbjct: 302 QWTVKNCLIIPIGAFFAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTASS 361
Query: 390 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
S ++Y + + + Y L++FA+ I A GQ K+++ L R SII F L +I +S L
Sbjct: 362 SSLQYIIDGKLDIFYGLWYFAIGFIGAAFGQFGFSKIVQKLNRQSIIGFFLGVLIVLSTL 421
Query: 450 SLGGVGLAKMIKRIEHKEYMGFDNIC 475
++ + + +++ +++ + +GF ++C
Sbjct: 422 AMIAITVVQLVSDVKN-DNLGFKHLC 446
>gi|67466247|ref|XP_649271.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56465670|gb|EAL43888.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449701601|gb|EMD42390.1| Hypothetical protein EHI5A_207490 [Entamoeba histolytica KU27]
Length = 460
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 126/447 (28%), Positives = 218/447 (48%), Gaps = 51/447 (11%)
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
F W+++VG+I F + G+GGG ++ + LI+ D + +SK G A
Sbjct: 7 FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66
Query: 134 VVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
+ +RHP + P+I Y AL+ +PL + G IGV FN+I W+I ++L++L
Sbjct: 67 FLILWMKRHPNVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVLLLGFT 126
Query: 193 STRAFLKGVETWKKETITKREAAKQLELI---------------------GNGYQTEE-- 229
S + F K ++ WK E KR+AAK EL+ N +E
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDAAKATELVETSKPDIADNDNDDMKPSENGSNAVIVDEKV 185
Query: 230 -------CKCDPEYLSNDTTPEETREPKKS------KVSIIENIYWKEFGLLVAVWAIIL 276
P+ L D + E +E KK K II+ I G+L+ VWA++
Sbjct: 186 QEEEEEGQGTGPKLLPQDESQEAQKEAKKIEEKTLLKREIIKAIL--SVGILIIVWAVMF 243
Query: 277 ALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEG 328
+ I K S++ YWVL + P+ V+ + L+ +R +G+
Sbjct: 244 FIVILKGGEKMDSIVGIECGTPWYWVLTAIGGPLMLAVTIVVGIFLWWRQRGEEVEGE-- 301
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 388
+W + AG+ LG+GGG ++GP+ LE+G+ PQV++AT+ F I F++S
Sbjct: 302 VQWTVKNCLIIPIGAFFAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTAS 361
Query: 389 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 448
S ++Y + + + Y +++FA+ I A GQ K+++ L R SII F L +I +S
Sbjct: 362 SSSLQYIIDGKLDIFYGIWYFAIGFIGAAFGQFGFSKIVQKLNRQSIIGFFLGVLIVLST 421
Query: 449 LSLGGVGLAKMIKRIEHKEYMGFDNIC 475
L++ + + +++ +++ + +GF ++C
Sbjct: 422 LAMIAITVVQLVSDVKN-DNLGFKHLC 447
>gi|168065030|ref|XP_001784459.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663984|gb|EDQ50721.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 405
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 136/413 (32%), Positives = 232/413 (56%), Gaps = 28/413 (6%)
Query: 77 RIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVY 136
R+V I+G G + GGVGGGG+F+P+ NL++ FDAK+SAA+S M+ G+++ + +
Sbjct: 4 RVVAAVILGSLGACICAAGGVGGGGLFIPIFNLLLLFDAKTSAALSNIMILAGSIAVLAW 63
Query: 137 NLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTR 195
N+R+ HP + P+IDYD+ALL P ++LGISIGV N+ F W++ +L V+ ++ R
Sbjct: 64 NIRRTHPLSPGKPLIDYDVALLLNPNMLLGISIGVFCNITFPGWLLISVLTVILFYMTNR 123
Query: 196 AFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVS 255
+ G WKKE+ +A +++ ++ + + E + L + P + K
Sbjct: 124 SIQNGFTRWKKESAAAAKAKEKIAIV-SAHSVETGQSKYPLLGGQSEPSLFAQCPPQK-- 180
Query: 256 IIENIYWKEFGLLVAVWAIILALQIAKNYTTT--------CSVLYWVLNLLQIPVAGGVS 307
+I+ LV +W + A+QI + T C + YW+L+ Q+P+A G++
Sbjct: 181 LIK---------LVMMWLLFFAVQILRGGEGTEGILKVKPCGLAYWLLSASQLPLAIGLT 231
Query: 308 AYEAVALYKGRRKIASKGDEGTK-----WRASQLVFYCACGITAGMVGGLLGLGGGFILG 362
A+ +AL + A+K E + + + + AGM+GG+LG+GGG I+
Sbjct: 232 AW--IALQHSSKSHAAKPSESNEEVDVMLTSRAYTVFPLMALLAGMLGGMLGIGGGMIIN 289
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
P+ E+G+ PQ ++ T+ F I F++SMSV++++LL R P+ +AL F A+ + + VG +
Sbjct: 290 PMLTEVGMHPQATAGTSSFMIFFATSMSVLQFWLLGRIPMDFALLFGAVCLFWSCVGIGL 349
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
L+ I GR S+I+F +S ++ VSAL + G + + +YMGF C
Sbjct: 350 LQAAIVKHGRPSVIVFLVSSVMGVSALLMATFGGFNVWHQYRAGDYMGFHAAC 402
>gi|356549443|ref|XP_003543103.1| PREDICTED: uncharacterized protein LOC100790958 [Glycine max]
Length = 466
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 128/397 (32%), Positives = 218/397 (54%), Gaps = 35/397 (8%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQP 160
+FVP+L+++ G D K+++++S MVTGG+++ V+ N+ P +IDYD+AL +P
Sbjct: 82 LFVPILSIVAGLDLKTASSLSAFMVTGGSIANVMCNMCITSPKFGGKSLIDYDIALSSEP 141
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK-GVETWKKETITKREAAKQLE 219
++LG+S+GV N++F +W+ITVL + FL ST K G+ WK E+ R+
Sbjct: 142 CMLLGVSLGVICNLVFPEWLITVLFAI-FLAWSTSKTCKSGLLFWKAESEVIRK------ 194
Query: 220 LIGNGYQTEECKCDPEYLSNDTTPE-----ETREPKKSKVSIIE-------NIYWKEFGL 267
NG EE + L N+T + E EPK +VS++ I W + +
Sbjct: 195 ---NGLINEEL--EKGLLENETIEQRKVYIENNEPKSIEVSLLAPQGNSKVRIPWFKLAV 249
Query: 268 LVAVWAIILALQIAKNY--------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGR- 318
L+ +W ++ + + C V YW+L+ +Q+P+A +A+ R
Sbjct: 250 LLLIWFSFFSVYLLRGNRYGEGIIPMEPCGVGYWILSSVQVPLAVVFTAWIVFRKESLRD 309
Query: 319 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 378
R + K TK R S ++ + + AG++GG+ G+GGG ++ PL L++G+ P+V++AT
Sbjct: 310 RTLIPKVPGLTKKRPSNILVFPLMALLAGILGGVFGIGGGMLISPLLLQVGVTPEVTAAT 369
Query: 379 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF 438
F + FS++MS ++Y LL V AL + +A+ +G V+++ I+ GRASII+F
Sbjct: 370 CSFMVLFSATMSGLQYLLLGMEHVQAALVLAIMCFVASLLGLLVVQRAIRKYGRASIIVF 429
Query: 439 TLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++S ++F+S + + G K+ E EYMGF C
Sbjct: 430 SVSIVMFISNVLMTSFGAIKVWTDYESGEYMGFKLPC 466
>gi|302758356|ref|XP_002962601.1| hypothetical protein SELMODRAFT_65204 [Selaginella moellendorffii]
gi|300169462|gb|EFJ36064.1| hypothetical protein SELMODRAFT_65204 [Selaginella moellendorffii]
Length = 385
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 126/392 (32%), Positives = 215/392 (54%), Gaps = 38/392 (9%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQP 160
++VP+ NL++ F+ +++A +S M+ GG ++ ++++ QR P D P+ID+D ALL QP
Sbjct: 14 LYVPLFNLLLRFNTRTAAVLSNFMIFGGMIANMMWSAFQRDPFDDERPLIDFDAALLMQP 73
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
++LGIS+GV N++F W+IT+LL + ++ R+F G WK E+ +
Sbjct: 74 NMLLGISLGVLCNLMFPGWLITLLLTITLAFVTFRSFNCGFRLWKAESGSNSSD------ 127
Query: 221 IGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQI 280
G G + + L + P R P K GLL+ VW A+Q+
Sbjct: 128 -GEGKSAKYHDAEAPLLDSAEIPHR-RFPAL-----------KLAGLLL-VWLFFFAVQL 173
Query: 281 AKNYTTT--------CSVLYWVLNLLQIPVAGGVSAY---EAVALYKGRR--KIASKGDE 327
+ T+ C + YW++ Q+P+ + + EA + K +
Sbjct: 174 LRGSKTSEGYFHLDECGLGYWLITGSQLPLTLLFTVWTIREATVSTSCQSDVKFLVSSAK 233
Query: 328 GTKWRASQ----LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 383
KW +S+ + + AG++GGLLG+GGG ++ P+ LE+G+PPQV++AT+ F +
Sbjct: 234 SFKWNSSRSNRAHLTLPLMALLAGILGGLLGIGGGMLISPILLEMGMPPQVTAATSAFMV 293
Query: 384 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 443
FSSS+SV +Y+L+ R PV +AL+F + + + VG V+++ I+ GRASII+F ++ +
Sbjct: 294 FFSSSLSVAQYWLMGRIPVEFALWFSGICFVFSLVGLLVVQRAIQRYGRASIIVFLVTSV 353
Query: 444 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ +SAL + G G + K+ E +YMGF + C
Sbjct: 354 MGLSALLMAGFGGMDVWKQYERGDYMGFRSPC 385
>gi|255555059|ref|XP_002518567.1| conserved hypothetical protein [Ricinus communis]
gi|223542412|gb|EEF43954.1| conserved hypothetical protein [Ricinus communis]
Length = 463
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 140/419 (33%), Positives = 227/419 (54%), Gaps = 24/419 (5%)
Query: 70 PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
P +K +V+ ++ F + S GG+GGGG+FVP+L ++ G D K++++ S MVTGG
Sbjct: 56 PMLKLASPMVLSGVLCFIAASISSAGGIGGGGLFVPILTIVAGLDLKTASSFSAFMVTGG 115
Query: 130 AVSAVVYNLRQRHPTLD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
+++ V+ NL P +IDYD+ALL +P ++LG+S+GV N+IF +W+ITVL VL
Sbjct: 116 SIANVLCNLFS--PKFGGKALIDYDIALLSEPCMLLGVSVGVICNLIFPEWLITVLF-VL 172
Query: 189 FLVLSTRAFLKG-VETWKKETI-TKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEET 246
FLV ST K V W E+ KR LE NG + E + P
Sbjct: 173 FLVWSTFKTCKNAVAHWNLESEEVKRNGHGNLE---NGRVKDRSSIGNEEIKIIKEPLMG 229
Query: 247 REPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT--------TTCSVLYWVLNLL 298
E ++++S W++ G+LV +W L + + C V YWV++ L
Sbjct: 230 IE-MENRMSFT----WEKLGVLVLIWLSFSFLYLLRGNRYGEGIAPLKPCGVGYWVVSSL 284
Query: 299 QIPVAGGVSAYEAVA--LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLG 356
QIP+A +A+ + Y+ + D+ + RA + + + AG++GG+ G+G
Sbjct: 285 QIPLAIIFTAWILLKKRHYQNQTANLQDIDDSMEGRAPNKLTFPIMALLAGILGGVFGIG 344
Query: 357 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 416
GG ++ PL L +GIPP+V++AT F + FSS+MS +Y L AL F ++ +A+
Sbjct: 345 GGMLISPLLLHVGIPPEVTAATCSFMVFFSSTMSAFQYLLSGMEHTDTALMFASICFVAS 404
Query: 417 FVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
VG V++++I+ GRASII+F++S ++ +S + + G + + E MGF C
Sbjct: 405 LVGLLVVQRIIQDYGRASIIVFSVSIVMALSTVLITSFGTIDVWRNYESGTNMGFKLPC 463
>gi|302797450|ref|XP_002980486.1| hypothetical protein SELMODRAFT_55219 [Selaginella moellendorffii]
gi|300152102|gb|EFJ18746.1| hypothetical protein SELMODRAFT_55219 [Selaginella moellendorffii]
Length = 385
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 124/392 (31%), Positives = 212/392 (54%), Gaps = 38/392 (9%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQP 160
++VP+ NL++ F+ +++A +S M+ GG ++ ++++ QR P D P+ID+D ALL QP
Sbjct: 14 LYVPLFNLLLRFNTRTAAVLSNFMIFGGMIANMMWSAFQRDPFDDERPLIDFDAALLMQP 73
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
++LGIS+GV N++F W+IT+LL + ++ R+F G WK E+ +
Sbjct: 74 NMLLGISLGVLCNLMFPGWLITLLLTITLAFVTFRSFNCGFRLWKAESGSNSSD------ 127
Query: 221 IGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQI 280
G G + + L + P R P K GLL+ VW A+Q+
Sbjct: 128 -GEGKSAKYHDAEAPLLDSAEIPHR-RFPAL-----------KLAGLLL-VWLFFFAVQL 173
Query: 281 AKNYTTT--------CSVLYWVLNLLQIPVA-----GGVSAYEAVALYKGRRKIASKGDE 327
+ T+ C + YW++ Q+P+ + + K +
Sbjct: 174 LRGSKTSEGYFHLDECGLGYWLITGSQLPLTLLFTVWTIRETTVSTSCQSDVKFLVSSAK 233
Query: 328 GTKWRASQ----LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 383
KW +S+ + + AG++GGLLG+GGG ++ P+ LE+G+PPQV++AT+ F +
Sbjct: 234 SFKWNSSRSNRAHLTLPLMALLAGILGGLLGIGGGMLISPILLEMGMPPQVTAATSAFMV 293
Query: 384 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 443
FSSS+SV +Y+L+ R PV +AL+F + + + VG V+++ I+ GRASII+F ++ +
Sbjct: 294 FFSSSLSVAQYWLMGRIPVEFALWFSGICFVFSLVGLLVVQRAIQRYGRASIIVFLVTSV 353
Query: 444 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ +SAL + G G + K+ E +YMGF + C
Sbjct: 354 MGLSALLMAGFGGMDVWKQYERGDYMGFRSPC 385
>gi|225469193|ref|XP_002275818.1| PREDICTED: uncharacterized protein LOC100252710 [Vitis vinifera]
gi|297741863|emb|CBI33227.3| unnamed protein product [Vitis vinifera]
Length = 458
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 134/419 (31%), Positives = 227/419 (54%), Gaps = 31/419 (7%)
Query: 71 DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
+KF IV ++ F + S GG+GGGG+FVP+L ++ G D K+++ S MV GG+
Sbjct: 57 QLKFSPPIVRAAVLCFIAASISSAGGIGGGGLFVPILAIVGGLDLKTASTFSAFMVAGGS 116
Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
+ ++ + VID+D+ALL +P L+LG+SIGV N++F +W+IT+L +V
Sbjct: 117 TANILCTMFINCIHGGKSVIDFDIALLSEPCLLLGVSIGVVCNIVFPEWLITILFVVFLS 176
Query: 191 VLSTRAFLKGVETWKKET-ITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREP 249
+++ KGV +WK E+ + +R +LE NG + +E + E + + +EP
Sbjct: 177 WTTSKTCRKGVVSWKLESEVIRRNGFGELE---NGVRRDESNGENEVIKS------LKEP 227
Query: 250 KKSKVSIIE-NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQI 300
+V + +I W +FG LV +W L I + S++ YW+L+ LQ
Sbjct: 228 LMGEVENFKISIPWTKFGALVVIWLSFFLLYILRGDRDGQSIIPMEPCGEGYWILSSLQF 287
Query: 301 PVAGGVSAYEAVALYKGRRKIASK----GDEGTKWRASQLVFYCACGITAGMVGGLLGLG 356
P+A +A+ RR+ +++ G G K L+F + AG++GG+ G+G
Sbjct: 288 PLAITFTAW-----ILHRRETSNQQEILGQTGEK--PPNLIFPIM-ALLAGILGGVFGIG 339
Query: 357 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 416
GG ++ PL L +GIPP+V++AT + FSS+MS +Y L+ AL F + A+
Sbjct: 340 GGMLISPLLLHIGIPPEVTAATCSVMVFFSSTMSSFQYLLIGMEHKEVALIFAIICFFAS 399
Query: 417 FVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+G V+++ I+ GRAS+I+F++S ++ +S + + G + + EYMGF C
Sbjct: 400 ILGVVVVQRAIEKYGRASLIVFSVSTVMALSTVLITSFGAIDVWRDYARGEYMGFKLPC 458
>gi|414872566|tpg|DAA51123.1| TPA: hypothetical protein ZEAMMB73_068409 [Zea mays]
Length = 199
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 90/141 (63%), Positives = 111/141 (78%), Gaps = 5/141 (3%)
Query: 60 KSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSA 119
SRS VWPD+ F WR+VV T++GF G+A G+VGGVGGGGIFVPMLNL+VGFD KS+A
Sbjct: 61 SSRS--PRVWPDLAFNWRVVVATVVGFLGSAFGTVGGVGGGGIFVPMLNLVVGFDTKSAA 118
Query: 120 AVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
A+SKCM+ G + S+V YNL+ HPT + PVIDY LALLFQP+L+LGI+IGV +VIF W
Sbjct: 119 ALSKCMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVIFPYW 178
Query: 180 MITVLLIVLFLVL---STRAF 197
+ITVL+I+LF STRAF
Sbjct: 179 LITVLIIILFAPRRDPSTRAF 199
>gi|125527981|gb|EAY76095.1| hypothetical protein OsI_04019 [Oryza sativa Indica Group]
Length = 461
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 40/394 (10%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR-----QRHPTLDIPVIDYDLAL 156
+F+P+LNL+ G K + A S MVTGGA S V+YNL +R VIDYD+AL
Sbjct: 88 LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAA----VIDYDIAL 143
Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAK 216
LFQP L+LG+SIGV NV+F +W+IT L + +T+ G+ W E ++
Sbjct: 144 LFQPCLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSE--SRGATLA 201
Query: 217 QLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL 276
+G + +P L P T WK+ +LV VW
Sbjct: 202 VAAATAHGRE------EPLLL-----PHGTDAGNGGGARGDAGFPWKDVSVLVMVWLCFF 250
Query: 277 ALQI--------AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK--IASKGD 326
L + C V YW++ L Q+P A +AY +Y R+K + ++ D
Sbjct: 251 VLHVFIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY---IIYAKRKKQVLHNQED 307
Query: 327 -----EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
E TK + + G + GL G+GGG +L P+ L++GIPPQ ++AT+ F
Sbjct: 308 GKANPESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSF 367
Query: 382 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 441
+ F +SMS+V++ LL + A + + +A+ VG V+++ I+ GR S+I+F ++
Sbjct: 368 MVLFCASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFLVT 427
Query: 442 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
I+ VS + + G + + YMGF C
Sbjct: 428 GIMAVSTVIITFFGALDVWAQYTSGAYMGFKLPC 461
>gi|115440389|ref|NP_001044474.1| Os01g0786800 [Oryza sativa Japonica Group]
gi|113534005|dbj|BAF06388.1| Os01g0786800, partial [Oryza sativa Japonica Group]
Length = 434
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 40/394 (10%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR-----QRHPTLDIPVIDYDLAL 156
+F+P+LNL+ G K + A S MVTGGA S V+YNL +R VIDYD+AL
Sbjct: 61 LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAA----VIDYDIAL 116
Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAK 216
LFQP L+LG+SIGV NV+F +W+IT L + +T+ G+ W E ++
Sbjct: 117 LFQPCLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSE--SRGATLA 174
Query: 217 QLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL 276
+G + +P L P T WK+ +LV VW
Sbjct: 175 VAAATAHGRE------EPLLL-----PHGTDAGNGGGARGDAGFPWKDVSVLVMVWLCFF 223
Query: 277 ALQI--------AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK--IASKGD 326
L + C V YW++ L Q+P A +AY +Y R+K + ++ D
Sbjct: 224 VLHVFIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY---IIYAKRKKQVLHNQED 280
Query: 327 -----EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
E TK + + G + GL G+GGG +L P+ L++GIPPQ ++AT+ F
Sbjct: 281 GKANPESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSF 340
Query: 382 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 441
+ F +SMS+V++ LL + A + + +A+ VG V+++ I+ GR S+I+F ++
Sbjct: 341 MVLFCASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFLVT 400
Query: 442 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
I+ VS + + G + + YMGF C
Sbjct: 401 GIMAVSTVIITFFGALDVWAQYTSGAYMGFKLPC 434
>gi|53792419|dbj|BAD53257.1| membrane protein-like [Oryza sativa Japonica Group]
Length = 461
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 40/394 (10%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR-----QRHPTLDIPVIDYDLAL 156
+F+P+LNL+ G K + A S MVTGGA S V+YNL +R VIDYD+AL
Sbjct: 88 LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLLCTGCGRRAAA----VIDYDIAL 143
Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAK 216
LFQP L+LG+SIGV NV+F +W+IT L + +T+ G+ W E ++
Sbjct: 144 LFQPCLLLGVSIGVVCNVMFPEWLITALFALFLAFCTTKTLRAGLRIWSSE--SRGATLA 201
Query: 217 QLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL 276
+G + +P L P T WK+ +LV VW
Sbjct: 202 VAAATAHGRE------EPLLL-----PHGTDAGNGGGARGDAGFPWKDVSVLVMVWLCFF 250
Query: 277 ALQI--------AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK--IASKGD 326
L + C V YW++ L Q+P A +AY +Y R+K + ++ D
Sbjct: 251 VLHVFIGDKHGKGMIRIKPCGVAYWLITLSQVPFAVAFTAY---IIYAKRKKQVLHNQED 307
Query: 327 -----EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
E TK + + G + GL G+GGG +L P+ L++GIPPQ ++AT+ F
Sbjct: 308 GKANPESTKMDTLPTLLFPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSF 367
Query: 382 AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLS 441
+ F +SMS+V++ LL + A + + +A+ VG V+++ I+ GR S+I+F ++
Sbjct: 368 MVLFCASMSMVQFILLGMQGIGEASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFLVT 427
Query: 442 FIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
I+ VS + + G + + YMGF C
Sbjct: 428 GIMAVSTVIITFFGALDVWAQYTSGAYMGFKLPC 461
>gi|357125384|ref|XP_003564374.1| PREDICTED: uncharacterized protein LOC100827655 [Brachypodium
distachyon]
Length = 469
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 121/391 (30%), Positives = 192/391 (49%), Gaps = 33/391 (8%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
+F+P+LNL+ G K +A S MVTGGA S V+YNL + +IDYD+ALLFQP
Sbjct: 96 SLFLPILNLVAGLSLKRAATYSSFMVTGGAASNVLYNLLWNRGRV---LIDYDIALLFQP 152
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
L+LG+SIGV NV+F +W+IT L + + + GV+ W+ E+ A
Sbjct: 153 CLLLGVSIGVVCNVMFPEWLITALFSLFLAFCTVKTCRAGVKIWRSESCAASAAVAVAAA 212
Query: 221 IGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQ- 279
N + P D ++ WK+ +LVAVW L
Sbjct: 213 RHNENKEPLLVLLPAGQDGD----------QAAAGNGAGFPWKDVSVLVAVWLCFFLLHA 262
Query: 280 -IAKNY------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWR 332
I + T C + YW+ + Q+P + +AY +Y R+K + E K
Sbjct: 263 FIGDKHGKGMIRITPCGIAYWLFTISQVPFSVAFTAY---IIYAKRKKQLLRNQEDGKAN 319
Query: 333 A--------SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAIT 384
S L+ A +T G + GL G+GGG +L P+ L++GIPPQ ++AT+ F +
Sbjct: 320 CVETKTETMSSLILPLAAFVT-GSLSGLFGIGGGLLLNPVLLQMGIPPQTAAATSSFMVL 378
Query: 385 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 444
F +SMS+V++ LL + A + + +A+ G +++K ++ GR S+I+F ++ I+
Sbjct: 379 FCASMSMVQFILLGMDGIGEASVYAGICFVASVAGAVLIEKAVRKSGRVSMIVFLVTAIM 438
Query: 445 FVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+S + + G + K+ YMGF C
Sbjct: 439 ALSTVIVTCFGALDVWKQYNGGAYMGFKLPC 469
>gi|3023070|gb|AAC12639.1| hypothetical protein [Mesembryanthemum crystallinum]
Length = 132
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 82/145 (56%), Positives = 102/145 (70%), Gaps = 14/145 (9%)
Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
+IDYDLALLFQP+L+LGISIGVAFNVIFADWM+TVLLIVLF+ ST+AF++G++TWKKET
Sbjct: 1 IIDYDLALLFQPMLMLGISIGVAFNVIFADWMVTVLLIVLFVGTSTKAFMRGIDTWKKET 60
Query: 209 ITKREAAKQLELIG-NGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGL 267
+ ++EAAK+ E G +G + E PE +E RE I+EN+YWKE GL
Sbjct: 61 LMQKEAAKRAESNGADGVEYEPLPAGPE--------KEDRE-----APILENVYWKEVGL 107
Query: 268 LVAVWAIILALQIAKNYTTTCSVLY 292
L VW LA +I T TCSV Y
Sbjct: 108 LCFVWVAFLAFEIINENTATCSVAY 132
>gi|414880188|tpg|DAA57319.1| TPA: hypothetical protein ZEAMMB73_804983 [Zea mays]
Length = 458
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 118/390 (30%), Positives = 184/390 (47%), Gaps = 28/390 (7%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
+F+P+LNL+ G K + A S MVTGGA S V+YNL +IDYD+ALLFQP
Sbjct: 81 LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
L+LG+SIGV NV+F +W+IT+L + + + GV+ W+ E+
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSES------------S 188
Query: 222 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIY-WKEFGLLVAVWAIILALQI 280
G G + + P T + + W + LLV +W AL +
Sbjct: 189 GCGSGARSSRGGHSHSKEPLLPRGTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFALHV 248
Query: 281 AKN--------YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKW- 331
C V YW++ Q+P A + Y A K R + ++G+K
Sbjct: 249 LIGDKHGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSKAD 308
Query: 332 ------RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITF 385
A L G + GL G+GGG +L P+ L++GIPPQ ++AT+ F + F
Sbjct: 309 LADAGVEALPLPTLPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLF 368
Query: 386 SSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIF 445
+SMS+V++ LL V A + + + G V+++ + GR S+I+F ++ I+
Sbjct: 369 CASMSMVQFILLGVEGVGQACVYAGVCFAGSVAGVVVIERAVSKSGRVSLIVFLVTAIMA 428
Query: 446 VSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+SA+ + G + + EYMGF C
Sbjct: 429 LSAVIVTCFGALDVWMQYTTGEYMGFKLPC 458
>gi|224116342|ref|XP_002317274.1| predicted protein [Populus trichocarpa]
gi|222860339|gb|EEE97886.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 136/463 (29%), Positives = 237/463 (51%), Gaps = 34/463 (7%)
Query: 39 ATSKNKNDHVEP---NIVIKVSAPK----SRSGYKHVWPDIKFGWRIVVGTIIGFFGTAC 91
A S+++ + +P N+ I + K S + +K +V+ ++ F +
Sbjct: 19 ALSQSRAEETQPLSNNLKIDLFLDKIGKWSHHQIQSQETGLKLAPSMVIAGVLCFIAASV 78
Query: 92 GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVI 150
S GG+GGGG+++P+L ++ D K++++ S MVTGG+V+ V+ N+ R ++
Sbjct: 79 SSAGGIGGGGLYIPILTIVASLDLKTASSFSAFMVTGGSVANVMCNMFTRSAKFGGQTLV 138
Query: 151 DYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETI- 209
DYD+A+L +P ++LG+S+GV N++F +W++T+L V + + GV WK E+
Sbjct: 139 DYDIAILSEPCMLLGVSVGVICNLVFPEWLVTILFAVFLACSTFKTCQNGVFHWKLESEE 198
Query: 210 TKREAAKQLELIGNG---YQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG 266
R + LE NG Y+T K E +S+ P E S + W + G
Sbjct: 199 VNRNESGNLE---NGLVEYETST-KESEEVISSVKEPLLGVELTSS----VLRFPWMKLG 250
Query: 267 LLVAVWAI--ILALQIAKNY------TTTCSVLYWVLNLLQIPVAGGVSAY-----EAVA 313
+L +W IL L Y +C YWV++ LQIP+A +A+ E+
Sbjct: 251 ILFIIWFSFSILYLLRGNRYGEGIIPMESCGFGYWVVSSLQIPLAIMFTAWILYRKESCQ 310
Query: 314 LYKGRRKIASKGDEG-TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPP 372
++++ KG E T S + + + AGM+GG+ G+GGG ++ PL L +GI P
Sbjct: 311 HQTINQQLSVKGMEDLTGGGTSNKLIFPVMALLAGMLGGVFGIGGGMLISPLLLHVGIAP 370
Query: 373 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 432
++++AT F + FSSSMS ++Y LL V A+ + +A+ +G V+++ I GR
Sbjct: 371 EITAATCSFMVFFSSSMSALQYLLLGMEHVDTAIILSVICFVASLLGLLVVQRAIVKYGR 430
Query: 433 ASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
AS+I+F++S ++ +S + + G + + MGF C
Sbjct: 431 ASMIVFSVSTVMALSTVLMTSFGALNVWRDYNSGRNMGFKLPC 473
>gi|242058941|ref|XP_002458616.1| hypothetical protein SORBIDRAFT_03g036740 [Sorghum bicolor]
gi|241930591|gb|EES03736.1| hypothetical protein SORBIDRAFT_03g036740 [Sorghum bicolor]
Length = 473
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 144/481 (29%), Positives = 232/481 (48%), Gaps = 41/481 (8%)
Query: 14 LAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIK 73
+A +F+ ++ + S+ +L E + + +P+ S+P + + V P+I
Sbjct: 15 IAVSFLSAAAASNTTTSSLQSLLAEVSQWRETQLGDPS-----SSPGGGAAHAGVRPNIV 69
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
W ++ F + S GGVGGG +F+P+LNL+ G K + A S MVTGGA S
Sbjct: 70 ASW------VLSFLAASVSSAGGVGGGSLFLPILNLVAGLGLKRATAYSSFMVTGGAASN 123
Query: 134 VVYNL---RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
V+YNL +IDYD+ALLFQP L+LG+SIGV NV+F +W+IT+L V
Sbjct: 124 VLYNLASRSSTGTGGGGRLIDYDIALLFQPCLLLGVSIGVVCNVVFPEWLITLLFSVFLA 183
Query: 191 VLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPK 250
+ + GV+ W+ E+ A +G E ++D E
Sbjct: 184 SCTAKTCRAGVKIWRSESGGAGTARGDHH---HGIGKEPLLLRLPLGTSDGDAE------ 234
Query: 251 KSKVSIIENIYWKEFGLLVAVWAIILALQIAKN--------YTTTCSVLYWVLNLLQIPV 302
W + LLV VW AL + CSV YW++ L Q+P
Sbjct: 235 GGGRGNGAGFPWADVALLVMVWLCFFALHVLIGDKHGKGVIRIRPCSVAYWLITLSQLPA 294
Query: 303 AGGVSAYEAVALYKGRRKIASKG--------DEGTKWRASQLVFYCACGITAGMVGGLLG 354
A + Y + + +R + S+ D G + L A +T G + GL G
Sbjct: 295 AVAFTGY-IIHSKRKKRVVPSQEDGKQEDLVDTGVETTLPSLTLPLAAFVT-GALSGLFG 352
Query: 355 LGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII 414
+GGG +L P+ L++GIPPQ ++AT+ F + F +SMS+V++ LL + A + + +
Sbjct: 353 IGGGLLLNPVLLQIGIPPQTAAATSSFMVLFCASMSMVQFILLGVEGIGQASIYAGICFV 412
Query: 415 AAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNI 474
A+ VG V+++ I+ GR S+I+F ++ I+ +S + + G + + EYMGF +
Sbjct: 413 ASVVGVVVIQRAIRKSGRVSLIVFLVTAIMALSTVIVTCFGALDVWMQYTSGEYMGFKLL 472
Query: 475 C 475
C
Sbjct: 473 C 473
>gi|328868613|gb|EGG16991.1| hypothetical protein DFA_07972 [Dictyostelium fasciculatum]
Length = 521
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 116/399 (29%), Positives = 196/399 (49%), Gaps = 37/399 (9%)
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
+K ++ D F + +VG + F G A S GGVGGGGI++P+L L+ F K++ +S C
Sbjct: 120 HKSLFHD-HFTFLDIVGLCLLFIGCALSSGGGVGGGGIYIPILILVSKFSPKTAIPLSNC 178
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
+V G A++ + N +RHP + +IDY + LL +PL + G + GV + ++I +L
Sbjct: 179 LVAGCAMANFIQNFPRRHPFANKHLIDYSVVLLIEPLTLGGTTFGVYLHTFLPPFVILIL 238
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAA-KQLELIGNGYQTEECKCDPEYLSNDTTP 243
L+V + F KG+ +KKE TK + K + +G +T++ + P
Sbjct: 239 LVVTLTATAITTFRKGLSIYKKENETKSYSQIKNTSINSDGSETQQ-----------SNP 287
Query: 244 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK----NYTTT----CSVLYWVL 295
+ E W + +V V A+ + K Y+ CS YW L
Sbjct: 288 FKDAE-------------WGKISAIVLVLALSTVFSVLKGGDGEYSMVGIKLCSPTYWTL 334
Query: 296 NLLQIPVAGGVSAYEAVALYKGRRKIASKG---DEGTKWRASQLVFYCACGITAGMVGGL 352
+ PV A+ LY+ +K +G + + ++ + AG++ L
Sbjct: 335 SFAIWPVIIVTWILTALYLYRRWKKNQLQGIKVEGDINYSPQTIILLGFLSVIAGILASL 394
Query: 353 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 412
LG+GGG I GP+ L++G+ P +++AT+ F I F+S+ S ++Y LL + + Y L ++ +
Sbjct: 395 LGIGGGMIKGPVLLQMGLSPDITAATSSFMILFTSASSAIQYVLLGKLRLDYGLVYYFTA 454
Query: 413 IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
IA FVG L +K R S IF + +I +S + L
Sbjct: 455 FIACFVGTQSLLYAVKKSNRKSYFIFLICLVIVISTILL 493
>gi|326504538|dbj|BAJ91101.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 465
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 191/396 (48%), Gaps = 35/396 (8%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
+F+P+LNL+ G K + S MVTGGA S V+YNL + +IDYD+ALLFQP
Sbjct: 83 LFLPILNLVAGLTLKHATTYSSFMVTGGAASNVLYNLWRARGRGRAALIDYDIALLFQPC 142
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET------ITKREAA 215
L+LG+SIGV NV+F +W+IT L + + + G + W+ E+ R
Sbjct: 143 LLLGVSIGVVCNVMFPEWLITALFSLFLAFCTAKTCRAGAKIWRCESAGAGAPAAARHGH 202
Query: 216 KQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAII 275
K L+ G P+ +D + R+ + WK+ +LV VW
Sbjct: 203 KVPLLLDVGGL-------PQPSQDDGGLQAARDGGSGGGAGFP---WKDVAVLVIVWLCF 252
Query: 276 LALQ--IAKNY------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE 327
L I + C + YW+ + Q+P A +AY +Y R+K A+ E
Sbjct: 253 FLLHVFIGDKHGKGVIRIKPCGIAYWLATVSQVPFAVAFTAY---IIYAKRKKQATHHHE 309
Query: 328 G--------TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 379
TK + G + GL G+GGG +L P+ L++GIPPQ ++AT+
Sbjct: 310 DGKAHSSVHTKSETLPALALPLAAFVTGSLSGLFGIGGGLLLNPVLLQIGIPPQTAAATS 369
Query: 380 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 439
F + F +SMS+V++ LL V A + + +A+ G +++++++ GR S+I+F
Sbjct: 370 SFMVLFCASMSMVQFILLGMDGVGEASVYAGICFVASIAGVVLIERVVRKSGRVSMIVFL 429
Query: 440 LSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++ I+ +S + + G + + YMGF C
Sbjct: 430 VTAIMALSTVIVTCFGALDVWTQYTGGAYMGFKLPC 465
>gi|226497914|ref|NP_001144009.1| uncharacterized protein LOC100276828 precursor [Zea mays]
gi|195635291|gb|ACG37114.1| hypothetical protein [Zea mays]
Length = 458
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 118/390 (30%), Positives = 184/390 (47%), Gaps = 28/390 (7%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
+F+P+LNL+ G K + A S MVTGGA S V+YNL +IDYD+ALLFQP
Sbjct: 81 LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
L+LG+SIGV NV+F +W+IT+L + + + GV+ W+ E+
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSES------------S 188
Query: 222 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIY-WKEFGLLVAVWAIILALQI 280
G G + + P T + + W + LLV +W AL +
Sbjct: 189 GCGSGARSSRGGHSHSKEPLLPRGTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFALHV 248
Query: 281 AKN--------YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKW- 331
C V YW++ Q+P A + Y A K R + ++G+K
Sbjct: 249 LIGDKHGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSKAD 308
Query: 332 ------RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITF 385
A L G + GL G+GGG +L P+ L++GIPPQ ++AT+ F + F
Sbjct: 309 LADAGVEALPLPTLPLAAFVTGALXGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLF 368
Query: 386 SSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIF 445
+SMS+V++ LL V A + + + G V+++ + GR S+I+F ++ I+
Sbjct: 369 CASMSMVQFILLGVEGVGQACVYAGVCFAGSVAGVVVIERAVSKSGRVSLIVFLVTAIMA 428
Query: 446 VSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+SA+ + G + + EYMGF C
Sbjct: 429 LSAVIVTCFGALDVWMQYTTGEYMGFKLPC 458
>gi|222640475|gb|EEE68607.1| hypothetical protein OsJ_27142 [Oryza sativa Japonica Group]
Length = 192
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 87/164 (53%), Positives = 119/164 (72%)
Query: 56 VSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDA 115
++A S G + D+ F WR+VV T++GF G+ G+VGGVGGGG F PM+NL+VGF+
Sbjct: 1 MAASASPRGRRASGRDLGFKWRVVVATVVGFPGSGFGTVGGVGGGGKFGPMVNLLVGFET 60
Query: 116 KSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVI 175
KS+A VSK M+ G + S+V YNL+ HPT + PVIDY LALLFQP+L+LGI+IGV +VI
Sbjct: 61 KSAAGVSKGMIMGASASSVWYNLQVSHPTKEAPVIDYKLALLFQPMLMLGITIGVELSVI 120
Query: 176 FADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
F W+ITVL+I+LF+ S+R+F KG+ WK ET + E ++ E
Sbjct: 121 FPYWLITVLIIILFIGTSSRSFYKGILMWKDETRIQMETREREE 164
>gi|125527980|gb|EAY76094.1| hypothetical protein OsI_04018 [Oryza sativa Indica Group]
Length = 397
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 196/392 (50%), Gaps = 41/392 (10%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL---RQRHPTLDIPVIDYDLALLF 158
++VP+LN++ G K++ A S MVTGG +S V+Y L R P+IDYD+A++
Sbjct: 29 LYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTLIVLRGHEKGGHQPLIDYDIAVVS 88
Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
QP L+LG+S+GV NV+F +W+IT L V + + + G++ W+ ET R +
Sbjct: 89 QPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFKTYGTGMKRWRAETAAARRMLEGG 148
Query: 219 ELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILAL 278
+G+G E L + R + +LV +W +
Sbjct: 149 SSLGDG--------AGEALLGQKDGDGHRRQ------------CVDLMVLVTIWLCFFVI 188
Query: 279 QI------AKNY--TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE--- 327
+ AK C V YW++ + QIPVA A+ A +++ R+ A E
Sbjct: 189 HLFIGGEGAKGVFDIEPCGVTYWLITIAQIPVA---VAFTACIVHQKRKSHAQNSQEFDQ 245
Query: 328 ----GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 383
+K + + + + G++ GL G+GGG +L P+ L++G+PP+ +S+T +F +
Sbjct: 246 AISVKSKLESLPVYVFPVAALLTGVMSGLFGIGGGLLLNPVLLQIGVPPKTASSTTMFMV 305
Query: 384 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 443
F +SMS+V++ +L + AL + +A+ VG V++ I+ GR S+I+F ++ I
Sbjct: 306 LFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGAIRKSGRVSLIVFMVAAI 365
Query: 444 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ +S + + G ++ + +YMGF C
Sbjct: 366 LALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 397
>gi|330818743|ref|XP_003291498.1| hypothetical protein DICPUDRAFT_89425 [Dictyostelium purpureum]
gi|325078304|gb|EGC31962.1| hypothetical protein DICPUDRAFT_89425 [Dictyostelium purpureum]
Length = 463
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 119/404 (29%), Positives = 209/404 (51%), Gaps = 32/404 (7%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++G ++ F G A S GGVGGGGI++P+L L+ +D KSS +S C+V G +++ + N
Sbjct: 77 ILGMVLLFIGCALSSGGGVGGGGIYIPILILVSKWDPKSSIPLSNCLVAGCSLANFIQNF 136
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
+RHP + +ID+ +ALL +PL + G GV + F +I +LL++ S +
Sbjct: 137 PRRHPFSNKHLIDFSVALLIEPLTLAGTIFGVYLHTYFPPLVILLLLVITLGFTSFKTIT 196
Query: 199 KGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIE 258
KGVE ++KE K + L+ N + ND+ + P +
Sbjct: 197 KGVEIYRKEIKAK------VSLLNNDHHK----------INDSNGSGSSNPNGDGAN--S 238
Query: 259 NIYWKEFGLLVAVWAIILALQIAKNYT----TTCSVLYWVLNLLQIPVAGGVSAYEAVAL 314
N+ ++ LL+ + + Y+ CS LYWVL+ + +PV + + A L
Sbjct: 239 NV---KYNLLIFSTMFSIFKGGDEEYSLIGVKLCSPLYWVLSFVMVPVIIILWGFTARYL 295
Query: 315 YKG---RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 371
Y+ RR + + K+ ++ I AG++ LLG+GGG I GP+ L++G+
Sbjct: 296 YREYEIRRDEGREIEGEIKYTHKNIIVLGILSIVAGILASLLGIGGGMIKGPVLLQMGLS 355
Query: 372 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 431
P V++AT+ + I F+S+ S ++Y L+ + Y + ++A+ I+ FVG L ++K
Sbjct: 356 PDVTAATSSYMILFTSASSAIQYILVGKLRWDYGIVYYAIGFISCFVGTQTLIWIVKKYQ 415
Query: 432 RASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
R S I+F + +I VS + L + + I ++++ + FD+IC
Sbjct: 416 RRSYIVFLIGAVISVSTILL---VVTESIDFVKYRN-LSFDSIC 455
>gi|357446189|ref|XP_003593372.1| hypothetical protein MTR_2g010740 [Medicago truncatula]
gi|355482420|gb|AES63623.1| hypothetical protein MTR_2g010740 [Medicago truncatula]
Length = 466
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 139/462 (30%), Positives = 244/462 (52%), Gaps = 31/462 (6%)
Query: 30 SAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGT 89
S KP +T N N +++ K+S K+ + H ++ +VV I+ F +
Sbjct: 20 SNAKQTKPFSTILNIN-----HVLDKISQLKNVTQGFHE-SQLQISVPLVVAGILCFIAS 73
Query: 90 ACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVV-YNLRQRHPTLDIP 148
+ S GG+GGGGIF+P+L ++ G D K ++++S MVTGG+++ V+ Y
Sbjct: 74 SISSAGGIGGGGIFIPILTIVAGLDLKVASSISAFMVTGGSIANVICYMFTTSTKFGGKS 133
Query: 149 VIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK-GVETW--K 205
+IDYD+AL +P ++LG+S+GV N++F +W+IT L+ +FL ST K GV W +
Sbjct: 134 LIDYDIALSSEPCMLLGVSVGVICNLVFPEWLIT-LMFAVFLAWSTSKTCKSGVMFWNIE 192
Query: 206 KETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEE---TREPKKSKVSIIENIYW 262
E I K +++E G E + + T E + SK+ I W
Sbjct: 193 SEEIRKNIGVQEIE---KGLLENEITMHKDNDGSKTVEENLVLVPQENSSKLCI----PW 245
Query: 263 KEFGLLVAVWAIILALQIAKNY-------TTTCSVLYWVLNLLQIPVAGGVSAYEAV--A 313
+ G+L+ +W ++ + + C V YW+++ +Q+P+A +A+ +
Sbjct: 246 LKLGVLLLIWFSFFSIYLIRGNGYGQIIPMEPCGVGYWIISSVQVPLAVVFTAWMVLRKE 305
Query: 314 LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQ 373
+ + I + +++LVF + AGM+GG+ G+GGG ++ PL L++GI P+
Sbjct: 306 SIQDQTLIPQVQCQNRNCPSNKLVFPLM-ALLAGMLGGVFGIGGGMLISPLLLQVGIAPE 364
Query: 374 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 433
V++AT F + FSS+MS ++Y LL V AL + +A+ +G V++K+I+ GR
Sbjct: 365 VTAATCSFMVFFSSTMSSLQYLLLGMEHVETALILAIMCFVASLLGLLVVQKVIRKYGRP 424
Query: 434 SIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
SII+F++S ++ +S + + G K+ + + +YMGF C
Sbjct: 425 SIIVFSVSIVMSLSIVLMTSFGTLKVWEDYKSGKYMGFKLPC 466
>gi|53792418|dbj|BAD53256.1| membrane protein-like [Oryza sativa Japonica Group]
gi|53792472|dbj|BAD53380.1| membrane protein-like [Oryza sativa Japonica Group]
gi|125572272|gb|EAZ13787.1| hypothetical protein OsJ_03712 [Oryza sativa Japonica Group]
Length = 397
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 111/392 (28%), Positives = 196/392 (50%), Gaps = 41/392 (10%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL---RQRHPTLDIPVIDYDLALLF 158
++VP+LN++ G K++ A S MVTGG +S V+Y L R P+IDYD+A++
Sbjct: 29 LYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTLIVLRGHEKGGHQPLIDYDIAVVS 88
Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
QP L+LG+S+GV NV+F +W+IT L V + + + G++ W+ ET R +
Sbjct: 89 QPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFKTYGTGMKRWRAETAAARRMLEGG 148
Query: 219 ELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILAL 278
+G+G E L + R + +LV +W +
Sbjct: 149 SSLGDG--------AGEALLGQKDGDGHRRQ------------CVDLMVLVTIWLCFFVI 188
Query: 279 QI------AKNY--TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE--- 327
+ AK C V YW++ + QIP+A A+ A +++ R+ A E
Sbjct: 189 HLFIGGEGAKGVFDIEPCGVTYWLITIAQIPIA---VAFTACIVHQKRKSHAQNSQEFDQ 245
Query: 328 ----GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 383
+K + + + + G++ GL G+GGG +L P+ L++G+PP+ +S+T +F +
Sbjct: 246 AISVKSKLESLPVYVFPVAALLTGVMSGLFGIGGGLLLNPVLLQIGVPPKTASSTTMFMV 305
Query: 384 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 443
F +SMS+V++ +L + AL + +A+ VG V++ I+ GR S+I+F ++ I
Sbjct: 306 LFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGTIRKSGRVSLIVFMVAAI 365
Query: 444 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ +S + + G ++ + +YMGF C
Sbjct: 366 LALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 397
>gi|223994523|ref|XP_002286945.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220978260|gb|EED96586.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 385
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 183/390 (46%), Gaps = 33/390 (8%)
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
F W +VGT++ F GVGGGGI++P+ +++ F K S +S+ + G +
Sbjct: 3 FLWTDLVGTVVWFLTAGIAVSCGVGGGGIYMPLGMILLRFAPKQSTGLSQACIFGAGLGG 62
Query: 134 VVYNLRQRHPTLDI---------PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
++ N R+RHP I P+IDYDLAL P+ + G +GV + DW+ +
Sbjct: 63 LIINSRKRHPDRHIRDTKGFYTRPIIDYDLALFQAPMELAGAVVGVTVQRLLPDWLFLSI 122
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTP- 243
+V+ + + F K E++KK+ + K K L + + E+ + + + +P
Sbjct: 123 AVVILGLTCFKTFQKFFESYKKDKMQK----KHLAFLAQRHLDEQ---EAQKIPGCPSPG 175
Query: 244 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVA 303
+ E + + V + + +L +W + L + + T + W LN + +
Sbjct: 176 YNSDESEHTTVELCAESVPDDVMILRCLWRLFLRIFRTSHQALTSMMTQWNLNFVDSSLQ 235
Query: 304 GGVSAYEAV----ALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
++++ + LY G W Q+ + AGM+GGL+G+ G+
Sbjct: 236 RMLTSFRGLRLLHCLYFGL------------WDYKQVRDFSLVSFGAGMIGGLVGISAGY 283
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
GP ++ G+ P+VS+AT + +SS V + L P YALYFF + + AFVG
Sbjct: 284 FTGPFMIQRGLHPRVSTATTATTMLLTSSSVAVMFVLSGLLPWEYALYFFLVCVTGAFVG 343
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
+ + +K G AS++I L+ II S L
Sbjct: 344 KTRIDAYVKKTGMASVLIGALATIIGCSTL 373
>gi|356524768|ref|XP_003531000.1| PREDICTED: uncharacterized protein LOC100806202 [Glycine max]
Length = 473
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 209/397 (52%), Gaps = 32/397 (8%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQP 160
+F+P+L ++ D K+++++S MVTGG+++ V+ NLR +P L +IDYD+ALL +P
Sbjct: 86 LFIPILTIVASLDLKTASSLSAFMVTGGSIANVMCNLRATNPKLGGKSLIDYDIALLSEP 145
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
++LG+S+GV N++F +W+IT+L V +++ GV WK E+ +R+
Sbjct: 146 CMLLGVSVGVICNLVFPEWLITMLFAVFLTWSTSKTCNSGVVFWKIESEERRK------- 198
Query: 221 IGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIEN------------IYWKEFGLL 268
+G++ E + S + E +K KV IE I W + +L
Sbjct: 199 -NDGFEGLEKGLLEDESSEEREEGVQVEKEKEKVKSIEEQVMVPEENIRVRIPWLKLVVL 257
Query: 269 VAVWAIILALQIAKNY--------TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK 320
+ VW +L + + C V YW+++ Q+P+A +A+ + +
Sbjct: 258 LLVWFSFFSLYLLRGNKYGQSIIPMEPCGVGYWIISSAQVPLALFFTAWIVYRKESHQDQ 317
Query: 321 IASKGDE--GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 378
+ D + +++L+F + AG++GG+ G+GGG ++ PL L +GI P+V++AT
Sbjct: 318 NLMQEDSCLSSNGPSNKLIFP-MMALLAGILGGVFGIGGGMLISPLLLHVGIAPEVTAAT 376
Query: 379 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF 438
F + FSS+MS ++Y LL + AL + +A+ +G V++K I+ GR S+I+F
Sbjct: 377 CSFMVFFSSTMSALQYLLLGMDHIETALILALICFVASLIGLLVVQKAIQSYGRPSLIVF 436
Query: 439 TLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++S ++ +S + + G + K YMGF C
Sbjct: 437 SVSIVMTLSIVLMTSFGAIRTWKDYTSGRYMGFKLPC 473
>gi|186512097|ref|NP_193857.3| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
gi|332659032|gb|AEE84432.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
Length = 449
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 136/416 (32%), Positives = 220/416 (52%), Gaps = 28/416 (6%)
Query: 71 DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
++K I++ ++ F S GG+GGGG+F+P++ ++ G D K++++ S MVTGG+
Sbjct: 51 ELKLSSAIIMAGVLCFLAALISSAGGIGGGGLFIPIMTIVAGVDLKTASSFSAFMVTGGS 110
Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
++ V+ NL +DYDLALL +P ++LG+SIGV N + +W+ITVL V
Sbjct: 111 IANVISNLFGGKAL-----LDYDLALLLEPCMLLGVSIGVICNRVLPEWLITVLFAVFLA 165
Query: 191 VLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPK 250
S + GV+ WK E+ RE+ G G EE K N P +
Sbjct: 166 WSSLKTCRSGVKFWKLESEIARESGHGRPERGQGQIEEETK-------NLKAPLLEAQAT 218
Query: 251 KSKVSIIENIYWKEFGLLVAVWA---IILALQIAKN-----YTTTCSVLYWVLNLLQIPV 302
K+K I W + G+LV VWA +I L+ K+ C V YW+L LQIP+
Sbjct: 219 KNK----SKIPWTKLGVLVIVWASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQIPL 274
Query: 303 AGGVSAYEAVALYKGRRKIA---SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
A + A++ + R++ + K EGT+ S + + A AG++GG+ G+GGG
Sbjct: 275 AL-IFTKLALSRTESRQEQSPNDQKNQEGTRLDKSTRLKFPAMSFLAGLLGGIFGIGGGM 333
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
++ PL L+ GIPPQ+++AT F + FS++MS V+Y LL A F + +A+ +G
Sbjct: 334 LISPLLLQSGIPPQITAATTSFMVFFSATMSAVQYLLLGMQNTDTAYVFSFICFLASLLG 393
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+++K + GRASII+F++ ++ +S + + G + + MGF C
Sbjct: 394 LVLVQKAVAQFGRASIIVFSVGTVMSLSTVLMTSFGALDVWTDYVAGKDMGFKLPC 449
>gi|297737341|emb|CBI26542.3| unnamed protein product [Vitis vinifera]
Length = 220
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 99/192 (51%), Positives = 120/192 (62%), Gaps = 23/192 (11%)
Query: 72 IKFGWRIVVGTIIGFFGTACGSVGG---VGGGGIFVPMLNLIVGFDAKSSAAVSK----C 124
+KFGW+I +IIGFFG G GG V GG FV +SS ++ C
Sbjct: 1 MKFGWQIDDCSIIGFFGATFGVGGGGSFVTCGGSFVTC--------GRSSEVINSICWSC 52
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
MV G S V YNL+ RHPTLD+P+I YDLALLFQP+L++GISIGV FNV ADWM+T+L
Sbjct: 53 MVMGAVGSTVYYNLKLRHPTLDMPIICYDLALLFQPMLMMGISIGVVFNVAVADWMVTIL 112
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE 244
LIVLFL T+AF+KGVETWKKETI KR NG + E K P LSN T
Sbjct: 113 LIVLFLGTPTKAFIKGVETWKKETIMKRNKC-------NGTKEVEYKPLPSGLSNGTQ-N 164
Query: 245 ETREPKKSKVSI 256
TR+ ++ +VSI
Sbjct: 165 ATRKFEELEVSI 176
>gi|297837261|ref|XP_002886512.1| hypothetical protein ARALYDRAFT_475155 [Arabidopsis lyrata subsp.
lyrata]
gi|297332353|gb|EFH62771.1| hypothetical protein ARALYDRAFT_475155 [Arabidopsis lyrata subsp.
lyrata]
Length = 461
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 129/427 (30%), Positives = 220/427 (51%), Gaps = 40/427 (9%)
Query: 70 PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
P I+ ++ ++ F ++ S GG+GGGG++VP++ ++ G D K++++ S MVTGG
Sbjct: 54 PKIELTTSTIIAGLLSFLASSISSAGGIGGGGLYVPIMTIVAGLDLKTASSFSAFMVTGG 113
Query: 130 AVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
+++ V NL R+P + +ID+DLALL +P ++LG+SIGV N++F +W+IT L V
Sbjct: 114 SIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGVSIGVICNLVFPNWLITSLFAV- 172
Query: 189 FLVLST-RAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETR 247
FL ST + F G+ W+ E+ E K E G EE K + L E+
Sbjct: 173 FLAWSTLKTFGNGLYYWRLES----EMVKIRESNRIGEDDEEDKIESLKLP---LLEDYE 225
Query: 248 EPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT--------TTCSVLYWVLNLLQ 299
PK+ W + G+LV +W A+ + + C YW+++ Q
Sbjct: 226 RPKRFP--------WIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNAYWLISSSQ 277
Query: 300 IPV----------AGGVSAYEAVALYKGRRKIAS-KGDEGTKWRASQLVFYCACGITAGM 348
IP+ + V + + + + + ++G + S + + AG+
Sbjct: 278 IPLTLFFTLWICFSDNVQSQQPSDYNVSIKDVEDLRSNDGAR---SNKCMFPVMALLAGV 334
Query: 349 VGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYF 408
+GG+ G+GGG ++ PL L++GI P+V++AT F + FSS+MS ++Y LL A F
Sbjct: 335 LGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGMEHTGTASIF 394
Query: 409 FALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEY 468
+ +A+ VG V++K+I GRASII+F++ ++ +S + + G + Y
Sbjct: 395 AVICFVASLVGLKVVQKVITEYGRASIIVFSVCIVMALSIVLMTSYGALDVWNDYVAGRY 454
Query: 469 MGFDNIC 475
MGF C
Sbjct: 455 MGFKLPC 461
>gi|440293005|gb|ELP86177.1| hypothetical protein EIN_328840 [Entamoeba invadens IP1]
Length = 492
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 113/461 (24%), Positives = 214/461 (46%), Gaps = 59/461 (12%)
Query: 71 DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
++ F W++ VG+I F + G+GGG ++ + LI+ DA + +SK G A
Sbjct: 2 ELTFDWKLAVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDAHEAVPLSKITTFGVA 61
Query: 131 VSAVVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF 189
+ + HPT+ P+I Y AL+ +PL + G +GV N+ W+I V+L++L
Sbjct: 62 CGGYLVLWMKMHPTIKYKPLISYATALMVEPLTIYGTMLGVILNITSPSWLIIVVLVLLL 121
Query: 190 LVLSTRAFLKGVETWKKETITKREAAKQLEL----------IGNGYQTEECKCDPEYLS- 238
S + F K + +K E ++A +E+ + TE K E
Sbjct: 122 GYTSYKTFTKAWKQYKAENEKMKQAKIAIEMKEEKQPDQDTANDDMTTESLKTGAESDDK 181
Query: 239 -------------NDTTPEETREPK--------KSKVSIIENIYWKEFGLL--------- 268
N++TP++ + + + + +E+ KE +L
Sbjct: 182 VQSGVIVSDKIDLNESTPQDDKNEQGTGPQLLPQDENETVEDTKKKEEKILFKREVLKSI 241
Query: 269 ------VAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVAL 314
+AVWA++ + I + SV+ YW+L + P+ V+ L
Sbjct: 242 LSIIILIAVWAVMFCIVILRGGGKMDSVVGVQCGTPWYWILTGVGAPLMLTVTLIVGGVL 301
Query: 315 YKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQV 374
+ R +G+ +W + + AG+ LG+GGG ++GP+ LE+G+ PQV
Sbjct: 302 WYKHRGEHIEGE--VQWSVKNCIIIPVGALFAGVSAAFLGIGGGMVIGPILLEIGVLPQV 359
Query: 375 SSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRAS 434
++AT+ F I F++S S ++Y + + + L++F + I A GQ K+++ + R S
Sbjct: 360 ATATSAFMIMFTASSSSLQYIIDGKLDLYCGLWYFGIGFIGAAFGQFGFSKIVQKMNRQS 419
Query: 435 IIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+I F L +I +S L++ + + ++I+ +++ +GF ++C
Sbjct: 420 VIGFFLGALIVLSTLAMVAMTVIQLIQDVKNHN-LGFHHLC 459
>gi|15220600|ref|NP_176367.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
gi|13272465|gb|AAK17171.1|AF325103_1 unknown protein [Arabidopsis thaliana]
gi|4508075|gb|AAD21419.1| Unknown protein [Arabidopsis thaliana]
gi|111074464|gb|ABH04605.1| At1g61740 [Arabidopsis thaliana]
gi|332195760|gb|AEE33881.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
Length = 458
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 125/428 (29%), Positives = 222/428 (51%), Gaps = 42/428 (9%)
Query: 70 PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
P I+ ++ ++ F ++ S GG+GGGG++VP++ ++ G D K++++ S MVTGG
Sbjct: 51 PRIELTTSTIIAGLLSFLASSISSAGGIGGGGLYVPIMTIVAGLDLKTASSFSAFMVTGG 110
Query: 130 AVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVL 188
+++ V NL R+P + +ID+DLALL +P ++LG+SIGV N++F +W+IT L V
Sbjct: 111 SIANVGCNLFVRNPKSGGKTLIDFDLALLLEPCMLLGVSIGVICNLVFPNWLITSLFAV- 169
Query: 189 FLVLST-RAFLKGVETWKKET-ITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEET 246
FL ST + F G+ W+ E+ + K + ++E + E K E+
Sbjct: 170 FLAWSTLKTFGNGLYYWRLESEMVKIRESNRIEEDDEEDKIESLKLP--------LLEDY 221
Query: 247 REPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT--------TTCSVLYWVLNLL 298
+ PK+ W + G+LV +W A+ + + C YW+++
Sbjct: 222 QRPKRFP--------WIKLGVLVIIWLSYFAVYLLRGNKYGEGIISIEPCGNAYWLISSS 273
Query: 299 QIPV----------AGGVSAYEAVALYKGRRKIAS-KGDEGTKWRASQLVFYCACGITAG 347
QIP+ + V + + + + + + ++G + S + + AG
Sbjct: 274 QIPLTLFFTLWICFSDNVQSQQQSDYHVSVKDVEDLRSNDGAR---SNKCMFPVMALLAG 330
Query: 348 MVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALY 407
++GG+ G+GGG ++ PL L++GI P+V++AT F + FSS+MS ++Y LL A
Sbjct: 331 VLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLLLGMEHTGTASI 390
Query: 408 FFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 467
F + +A+ VG V++K+I GRASII+F++ ++ +S + + G +
Sbjct: 391 FAVICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTSYGALDVWNDYVSGR 450
Query: 468 YMGFDNIC 475
YMGF C
Sbjct: 451 YMGFKLPC 458
>gi|4079632|emb|CAA10487.1| hypothetical protein [Arabidopsis thaliana]
Length = 389
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 115/396 (29%), Positives = 204/396 (51%), Gaps = 42/396 (10%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQP 160
++VP++ ++ G D K++++ S MVTGG+++ V NL R+P + +ID+DLALL +P
Sbjct: 14 LYVPIMTIVAGLDLKTASSFSAFMVTGGSIANVGCNLFVRNPKSGGKTLIDFDLALLLEP 73
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLST-RAFLKGVETWKKET-ITKREAAKQL 218
++LG+SIGV N++F +W+IT L V FL ST + F G+ W+ E+ + K + ++
Sbjct: 74 CMLLGVSIGVICNLVFPNWLITSLFAV-FLAWSTLKTFGNGLYYWRLESEMVKIRESNRI 132
Query: 219 ELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILAL 278
E + E K E+ + PK+ W + G+LV +W A+
Sbjct: 133 EEDDEEDKIESLKLP--------LLEDYQRPKRFP--------WIKLGVLVIIWLSYFAV 176
Query: 279 QIAKNYT--------TTCSVLYWVLNLLQIPV----------AGGVSAYEAVALYKGRRK 320
+ + C YW+++ QIP+ + V + + + +
Sbjct: 177 YLLRGNKYGEGIISIEPCGNAYWLISSSQIPLTLFFTLWICFSDNVQSQQQSDYHVSVKD 236
Query: 321 IAS-KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 379
+ + ++G + S + + AG++GG+ G+GGG ++ PL L++GI P+V++AT
Sbjct: 237 VEDLRSNDGAR---SNKCMFPVMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATC 293
Query: 380 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 439
F + FSS+MS ++Y LL A F + +A+ VG V++K+I GRASII+F+
Sbjct: 294 SFMVLFSSTMSAIQYLLLGMEHTGTASIFAVICFVASLVGLKVVQKVITEYGRASIIVFS 353
Query: 440 LSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ ++ +S + + G + YMGF C
Sbjct: 354 VGIVMALSIVLMTSYGALDVWNDYVSGRYMGFKLPC 389
>gi|281207304|gb|EFA81487.1| hypothetical protein PPL_05475 [Polysphondylium pallidum PN500]
Length = 500
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/408 (27%), Positives = 194/408 (47%), Gaps = 39/408 (9%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
+VG + F G A S GGVGGGGI++P+L L+ F K++ +S C+V G + + ++ N
Sbjct: 113 IVGMALLFLGCALSSGGGVGGGGIYIPILILVNKFSPKTAIPLSNCLVAGCSFANLIQNF 172
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
+RHP + +IDY + LL +PL + G G+ + + ++I +LL+V S F
Sbjct: 173 PRRHPHANKHLIDYSVVLLIEPLTLGGTVFGIYLHTVLPPYVILILLVVTLTATSATTFK 232
Query: 199 KGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIE 258
KG++ KKE K EYL + + P+K KV+
Sbjct: 233 KGLDLRKKENTKK-----------------------EYLLINNNSDAYLTPEK-KVNPFL 268
Query: 259 NIYWKEFGLLVAVWAIILALQIAKNYTTT--------CSVLYWVLNLLQIPVAGGVSAYE 310
+ W + ++++ + + K + CS YWVL+ P+ +
Sbjct: 269 DADWVKIFAILSILILSTMFSVFKGGDSEVSLIGIKLCSPPYWVLSFAIWPIIIITWIFT 328
Query: 311 AVALYKGRRKIASKGD--EGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE 367
A LY + + G EG ++ ++ + AG++ LLG+GGG I GP+ L
Sbjct: 329 ARYLYGQWLRNQADGTIIEGDIRYSRKTIILLGILSVVAGILASLLGIGGGMIKGPVLLA 388
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 427
+G+ P + +AT+ F I F+S+ S +Y LL + + Y L ++ + A FVG L ++
Sbjct: 389 MGLSPDIVAATSSFMILFTSASSAFQYILLGKLRLDYGLVYYIIGFAACFVGTQTLIWVV 448
Query: 428 KILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ S IIF ++ II +S + L + + +E + F +IC
Sbjct: 449 NKYKKRSYIIFLITAIIVISTILL----VVTEVLDLEKYKNQPFQSIC 492
>gi|297804026|ref|XP_002869897.1| hypothetical protein ARALYDRAFT_492757 [Arabidopsis lyrata subsp.
lyrata]
gi|297315733|gb|EFH46156.1| hypothetical protein ARALYDRAFT_492757 [Arabidopsis lyrata subsp.
lyrata]
Length = 447
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 138/452 (30%), Positives = 232/452 (51%), Gaps = 35/452 (7%)
Query: 39 ATSKNKNDHVEP----NIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSV 94
A + N+ + +P N++ KV ++ S + ++K I+V ++ F S
Sbjct: 16 AINANQEEENQPQSHHNLLHKVQQWRT-SLKESSSAELKLSSAIIVAGVLCFLAALISSA 74
Query: 95 GGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
GG+GGGG+F+P++ ++ G D K++++ S MVTGG+++ V+ NL +DYDL
Sbjct: 75 GGIGGGGLFIPIMTIVAGLDLKTASSFSAFMVTGGSIANVISNLFGGKAL-----LDYDL 129
Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
ALL +P ++LG+SIGV N + +W+IT L V S + GV+ WK E+ R
Sbjct: 130 ALLLEPCMLLGVSIGVICNRVLPEWLITALFAVFLAWSSLKTCRSGVKFWKIESEIARGK 189
Query: 215 AKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWA- 273
+ G G + EE L +++ P W + G+LV VWA
Sbjct: 190 GHERPEKGQG-EIEEDNLKAPLLEAQVNRNKSKIP------------WTKLGVLVIVWAS 236
Query: 274 --IILALQIAKN-----YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIA---S 323
+I L+ K+ C V YW+L LQIP+A + A++ + R++ +
Sbjct: 237 FFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQIPLAL-IFTKLALSRTESRQEQSPNNQ 295
Query: 324 KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAI 383
K EGT+ S + + A AG++GG+ G+GGG ++ PL L+ GIPPQ+++AT F +
Sbjct: 296 KNQEGTRMDQSMRLKFPAMSFLAGLLGGIFGIGGGMLISPLLLQSGIPPQITAATTSFMV 355
Query: 384 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 443
FS++MS V+Y LL A F + +A+ +G +++K + GRASII+F++ +
Sbjct: 356 FFSATMSAVQYLLLGMQNTDTAYVFSFICFLASLLGLVLVQKAVAQFGRASIIVFSVGTV 415
Query: 444 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ +S + + G + + MGF C
Sbjct: 416 MSLSTVLMTSFGALDVWTDYMAGKDMGFKLPC 447
>gi|6554197|gb|AAF16643.1|AC011661_21 T23J18.20 [Arabidopsis thaliana]
Length = 491
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 128/412 (31%), Positives = 211/412 (51%), Gaps = 32/412 (7%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++ ++ FF + S GG+GGGG+F+ ++ +I G + K++++ S MVTG + + V NL
Sbjct: 97 IIAAVLSFFAASISSAGGIGGGGLFLSIMTIIAGLEMKTASSFSAFMVTGVSFANVGCNL 156
Query: 139 RQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLST-RA 196
R+P + D +ID+DLAL QP L+LG+SIGV N +F +W++ + L +FL ST +
Sbjct: 157 FLRNPKSRDKTLIDFDLALTIQPCLLLGVSIGVICNRMFPNWLV-LFLFAVFLAWSTMKT 215
Query: 197 FLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 256
KGV W E ++R K D ++ E RE + + I
Sbjct: 216 CKKGVSYWNLE--SERAKIKSPR-----------DVDGIEVARSPLLSEEREDVRQRGMI 262
Query: 257 IENIYWKEFGLLVAVWAIILALQIAKNYT--------TTCSVLYWVLNLLQIPVAGGVS- 307
W + G+LV +W + ++ + + C LYW L+ LQIP+ +
Sbjct: 263 --RFPWMKLGVLVIIWLLFFSINLFRGNKYGQGIISIKPCGALYWFLSSLQIPLTIFFTL 320
Query: 308 ----AYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
+ + + S+ + G R ++L+ + AG++GGL G+GGG ++ P
Sbjct: 321 CIYFSDNVQSNHTSHSNQNSEQETGVGGRQNKLMLP-VMALLAGVLGGLFGIGGGMLISP 379
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
L L++GI P+V++AT F + FSSSMS ++Y LL A F + +A+ VG V+
Sbjct: 380 LLLQIGIAPEVTAATCSFMVLFSSSMSAIQYLLLGMEHAGTAAIFALVCFVASLVGLMVV 439
Query: 424 KKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
KK+I GRASII+F + ++ +S + + G + YMGF C
Sbjct: 440 KKVIAKYGRASIIVFAVGIVMALSTVLMTTHGAFNVWNDFVSGRYMGFKLPC 491
>gi|348688213|gb|EGZ28027.1| hypothetical protein PHYSODRAFT_469591 [Phytophthora sojae]
Length = 545
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 138/528 (26%), Positives = 237/528 (44%), Gaps = 95/528 (17%)
Query: 39 ATSKNKND----HVEPNIVIKVSAPKSRSGYKH------VWPD---IKFGWRIVVGTIIG 85
A ++N ND VEP + + + G +H V D + G+I
Sbjct: 24 ALAQNVNDSPVLDVEPEVSCEAATDCLAQGLEHSTCRAGVCVDDALFPLSREEIAGSISA 83
Query: 86 FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL 145
F + + G+GGGG+ VP+ + + + + +SK + GGA+++ + N+R+RHP +
Sbjct: 84 FLASVVAAGSGLGGGGLLVPLYIMTMSMSSHEAVPLSKATIFGGAIASFLLNVRKRHPLV 143
Query: 146 -DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETW 204
P+IDY+ LL +P+ + G IGV N +F +W+IT+ ++ L + R + KG W
Sbjct: 144 RSRPLIDYETILLMEPMTLAGTIIGVNMNAVFPEWLITLCIVWLLTKTALRTYSKGKTIW 203
Query: 205 KKETITKREAA----------------KQLELIGNGY---------QTEECK-------- 231
K+E + KQ + Y + EE +
Sbjct: 204 KEEADADTKIVSDIVAYWRLLPYESNFKQFRAVARAYLKWKSYKSPEKEELRLKILAGKA 263
Query: 232 -CDPEYLSNDTTPEETREPKKSKVSIIENIY-W--------------------------K 263
+ ++ S+++T T E S + E++ W
Sbjct: 264 SSEEDHSSSNSTEASTEEEASSDENESESLMSWGLQDKKRPVKFLSVEEIAKARRTVPMA 323
Query: 264 EFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAV--- 312
+ G+L W ++ +AK T SV+ YW L ++ P GV+ Y +
Sbjct: 324 DMGVLFLTWVGLVLFSMAKGGHGTPSVIGLSCGSFGYWSLIVVSFPFFMGVTIYFGMKIS 383
Query: 313 ---ALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 369
A+ + +KGD W ++ Y A AG+ GLLG+GGG + GPL LE+G
Sbjct: 384 RFHAMLQASDYTYAKGD--MVWTKHAVIKYPALCTAAGVAAGLLGIGGGMVKGPLLLEMG 441
Query: 370 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 429
+ PQVSSAT+ I F+SS + +++ +L V +AL+ + IA +GQ + LIK
Sbjct: 442 LIPQVSSATSSSMILFTSSATTIQFIILGTLSVEHALWHGTVGFIAGLIGQLGMSYLIKK 501
Query: 430 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 477
+++++IF ++ I VS +G +G+A RI + GF ++C +
Sbjct: 502 YRKSALVIFLIAIFIGVSGGVMGVLGVA----RISEIGFGGFRSLCLS 545
>gi|222619377|gb|EEE55509.1| hypothetical protein OsJ_03713 [Oryza sativa Japonica Group]
Length = 351
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 178/371 (47%), Gaps = 40/371 (10%)
Query: 125 MVTGGAVSAVVYNLR-----QRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
MVTGGA S V+YNL +R VIDYD+ALLFQP L+LG+SIGV NV+F +W
Sbjct: 1 MVTGGAASNVLYNLLCTGCGRRAAA----VIDYDIALLFQPCLLLGVSIGVVCNVMFPEW 56
Query: 180 MITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSN 239
+IT L + +T+ G+ W E+ + +G + +P L
Sbjct: 57 LITALFALFLAFCTTKTLRAGLRIWSSES--RGATLAVAAATAHGRE------EPLLL-- 106
Query: 240 DTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQI--------AKNYTTTCSVL 291
P T WK+ +LV VW L + C V
Sbjct: 107 ---PHGTDAGNGGGARGDAGFPWKDVSVLVMVWLCFFVLHVFIGDKHGKGMIRIKPCGVA 163
Query: 292 YWVLNLLQIPVAGGVSAYEAVALYKGRRK--IASKGD-----EGTKWRASQLVFYCACGI 344
YW++ L Q+P A A+ A +Y R+K + ++ D E TK + +
Sbjct: 164 YWLITLSQVPFA---VAFTAYIIYAKRKKQVLHNQEDGKANPESTKMDTLPTLLFPLAAF 220
Query: 345 TAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 404
G + GL G+GGG +L P+ L++GIPPQ ++AT+ F + F +SMS+V++ LL +
Sbjct: 221 VTGALSGLFGIGGGLLLNPVLLQIGIPPQTAAATSSFMVLFCASMSMVQFILLGMQGIGE 280
Query: 405 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 464
A + + +A+ VG V+++ I+ GR S+I+F ++ I+ VS + + G + +
Sbjct: 281 ASVYAGICFVASVVGAVVIERAIRKSGRVSLIVFLVTGIMAVSTVIITFFGALDVWAQYT 340
Query: 465 HKEYMGFDNIC 475
YMGF C
Sbjct: 341 SGAYMGFKLPC 351
>gi|301122683|ref|XP_002909068.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099830|gb|EEY57882.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 483
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 185/386 (47%), Gaps = 42/386 (10%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
I VP + LI+GFD K + +S + GGAV+ +N+R+RHP ++ P+ID +LAL P
Sbjct: 84 VIMVPAMVLIMGFDIKRATPISNVAILGGAVANAWFNMRKRHPNVNRPLIDPELALGMIP 143
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
+++ G +G N + +++++L +V+ LV +R KG+ KKE +REA
Sbjct: 144 VVIGGTVLGALINKLIPSYVLSLLFVVVLLVGGSRTMKKGIRLHKKEVAKRREAEAATSE 203
Query: 221 IGNGYQTEECKCDPEYLSNDTTPEETREPKK-------------------------SKVS 255
+ T + P +TP+ T +K S V
Sbjct: 204 V-----TADIPVSPGAYVQVSTPQITGNDEKRLSLSVTGGDAAPVKSAAGDHAGSDSLVQ 258
Query: 256 IIE---NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAV 312
I+E + W ++ + ++A I V YWV+ L++IP +
Sbjct: 259 ILEKERHFAWGPHVAIMVCYLGVVAASIGDASVDCGGVAYWVILLIEIPWVAVFVVLTSH 318
Query: 313 ALYKGR-RKIASK---GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL 368
L+K RK A+ D KW +V++ AG+V G+ G+GGG I GP+ +EL
Sbjct: 319 YLHKVYLRKTAANYQYVDGDIKWTKKMVVYFPLGCAVAGIVAGMFGVGGGIITGPIMIEL 378
Query: 369 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
GI P+V+S+T I +SS+ + ++ + K +AL+ A++ + Q V+ ++
Sbjct: 379 GIVPEVASSTTALMILYSSAAATAKFAVFKMVAWDWALFLCAVAFVVTSASQVVILGFVR 438
Query: 429 ILGRASIIIFTLSFIIFVSALSLGGV 454
GR SII+ ++ +A+ +GGV
Sbjct: 439 RTGRQSIIVLCIA-----TAVLIGGV 459
>gi|363807478|ref|NP_001242649.1| uncharacterized protein LOC100803518 precursor [Glycine max]
gi|255636709|gb|ACU18690.1| unknown [Glycine max]
Length = 473
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 122/422 (28%), Positives = 230/422 (54%), Gaps = 20/422 (4%)
Query: 71 DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
I+ IVV ++ F ++ S GG+GGGG+F+P+L ++ D K+++++S MVTGG+
Sbjct: 55 QIQISGPIVVAGVLCFIASSISSAGGIGGGGLFLPILTIVACLDLKTASSLSAFMVTGGS 114
Query: 131 VSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF 189
++ V+ NL P +IDYD+ALL +P ++LG+S+GV N++F +W+IT+L V
Sbjct: 115 IANVLCNLCATSPKFGGKSLIDYDIALLSEPCMLLGVSVGVICNLVFPEWLITMLFAVFL 174
Query: 190 LVLSTRAFLKGVETWKKETITKR--EAAKQLE--LIGNG---YQTEECKCDPEYLSNDTT 242
+++ GV WK E+ +R + ++LE L+ +G + E + + E +
Sbjct: 175 TWSTSKTCNSGVLFWKIESEERRKNDGFERLEKGLLEDGSSEEREERVQVNNEKAGMKSI 234
Query: 243 PEETREPKKS-KVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTT------CSVLYWVL 295
E+ P+++ ++ I + ++ ++ L L Y + C V YW+L
Sbjct: 235 EEQVMVPEENIRMRIPWLKLVVLLLVWLSFFS--LYLLRGNKYGQSIIPMEPCGVGYWIL 292
Query: 296 NLLQIPVAGGVSAYEAVALYKGRRKIASKGDE--GTKWRASQLVFYCACGITAGMVGGLL 353
+ Q+P+A +A+ + + + D + +++L+F + AG++GG+
Sbjct: 293 SSAQVPLALFFTAWIVYRKESHQDQNLMQEDPCLSSNGPSNKLIFP-MMALLAGILGGVF 351
Query: 354 GLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSI 413
G+GGG ++ PL L +GI P+V++AT F + FSS+MS ++Y LL + AL +
Sbjct: 352 GIGGGMLISPLLLHVGIAPEVTAATCSFMVFFSSTMSALQYLLLGMDHIETALILALICF 411
Query: 414 IAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDN 473
+A+ +G V+++ ++ GR S+I+F++S ++ +S + + G+ + K YMGF
Sbjct: 412 VASLIGLLVVQRAVQSYGRPSLIVFSVSIVMTLSIVLMTSFGVIRTWKDYTSGRYMGFKL 471
Query: 474 IC 475
C
Sbjct: 472 PC 473
>gi|42561921|ref|NP_172621.2| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
gi|332190628|gb|AEE28749.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
Length = 367
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 118/384 (30%), Positives = 192/384 (50%), Gaps = 32/384 (8%)
Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLG 165
+ +I G + K++++ S MVTG + + V NL R+P + D +ID+DLAL QP L+LG
Sbjct: 1 MTIIAGLEMKTASSFSAFMVTGVSFANVGCNLFLRNPKSRDKTLIDFDLALTIQPCLLLG 60
Query: 166 ISIGVAFNVIFADWMITVLLIVLFLVLST-RAFLKGVETWKKETITKREAAKQLELIGNG 224
+SIGV N +F +W++ + L +FL ST + KGV W E ++R K
Sbjct: 61 VSIGVICNRMFPNWLV-LFLFAVFLAWSTMKTCKKGVSYWNLE--SERAKIKSPR----- 112
Query: 225 YQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNY 284
D ++ E RE + + I W + G+LV +W + ++ + +
Sbjct: 113 ------DVDGIEVARSPLLSEEREDVRQRGMI--RFPWMKLGVLVIIWLLFFSINLFRGN 164
Query: 285 T--------TTCSVLYWVLNLLQIPVAGGVS-----AYEAVALYKGRRKIASKGDEGTKW 331
C LYW L+ LQIP+ + + + + S+ + G
Sbjct: 165 KYGQGIISIKPCGALYWFLSSLQIPLTIFFTLCIYFSDNVQSNHTSHSNQNSEQETGVGG 224
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
R ++L+ + AG++GGL G+GGG ++ PL L++GI P+V++AT F + FSSSMS
Sbjct: 225 RQNKLMLP-VMALLAGVLGGLFGIGGGMLISPLLLQIGIAPEVTAATCSFMVLFSSSMSA 283
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
++Y LL A F + +A+ VG V+KK+I GRASII+F + ++ +S + +
Sbjct: 284 IQYLLLGMEHAGTAAIFALVCFVASLVGLMVVKKVIAKYGRASIIVFAVGIVMALSTVLM 343
Query: 452 GGVGLAKMIKRIEHKEYMGFDNIC 475
G + YMGF C
Sbjct: 344 TTHGAFNVWNDFVSGRYMGFKLPC 367
>gi|320170699|gb|EFW47598.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 642
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 122/447 (27%), Positives = 216/447 (48%), Gaps = 75/447 (16%)
Query: 86 FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL 145
FFG++ +GGVGGGG+FVP+L +I F A + +S M+T A+ ++++ +R + P
Sbjct: 199 FFGSSLAVIGGVGGGGLFVPLLMIITHFKADQAVPISSTMITAAAIMSLLFEIRAKRPN- 257
Query: 146 DIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK 205
PVIDYD++ L QP+ + G ++GV NV+ WMI + L+V+ + +T+ KG+ +K
Sbjct: 258 GRPVIDYDVSFLLQPVCLAGTTLGVFLNVLLPGWMIILALLVILVYTTTKTMKKGITMYK 317
Query: 206 KETITKREAA------------------KQLELIGNGYQTEE-----CKCDPEYLS---- 238
KE+ +R A ++L + +Q +E E LS
Sbjct: 318 KESQQRRALANGGTSANVAASAPAAAPLQKLNKRHHKHQDKESLVASTDTSAEQLSVNMD 377
Query: 239 ---------------NDTTPEETREPKKSKV----SIIENIYWKEFGLL--VAVWAIILA 277
+ P E P K +V + + + + +L +A+W I+LA
Sbjct: 378 DSDLSDSEDAGRIQLSQPVPSEAELPSKDQVLYQRQLDQELRFPTTQILGMIAMWLIVLA 437
Query: 278 LQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRR---KIASKGDEGTKWRAS 334
K + + CS +W++ L +P+A V L+ GRR K G ++ +
Sbjct: 438 CSTIKRFVSKCSAEFWIVAFLPLPIA------ILVTLWYGRRLRDAFELKQRCGHQFEPT 491
Query: 335 QLVF-------YCACGITAGMVGGLLGLGGGFILGPLFLELGI---PPQVSSATALFAIT 384
VF Y G+ GG++G+GG ++GPL L + + P V++A + +
Sbjct: 492 DFVFNRRNTIVYPLLSFGGGLAGGMVGVGGAMVIGPLLLNMAVQTPDPSVTTAISNLLVV 551
Query: 385 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 444
F+++ +V+++ +L YAL+ A +++A+ + + VLK GR S ++F L
Sbjct: 552 FTAASTVIQFVILNTLVYDYALFLSAFTLMASVLSKKVLKPWFDNKGRKSFVVFAL---- 607
Query: 445 FVSALSLGGVGLAKMIKRIEHKEYMGF 471
V ++SL G+ A I+ I ++GF
Sbjct: 608 -VLSISLSGILTA--IEGILKVAHVGF 631
>gi|428174477|gb|EKX43372.1| hypothetical protein GUITHDRAFT_95350 [Guillardia theta CCMP2712]
Length = 572
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 197/381 (51%), Gaps = 19/381 (4%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
I VPM I F +SK + GGAV+ YN+++RHP + P++DY+ ++ +P+
Sbjct: 180 ILVPMYLSIGKFSPHYGIPLSKATIFGGAVTNNYYNVQRRHPYANRPLVDYNTCMMLEPV 239
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
L+LG IGV FN + W+IT+LL++ + R +K +ET+ KE + + L+
Sbjct: 240 LLLGTIIGVFFNAVSPGWLITILLVLSLTYTTYRTSVKALETYNKEEKAV-KEEETKHLL 298
Query: 222 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIA 281
G+ E+ P ++ + PE+ RE +++ ++ I W +++AV +I+ +
Sbjct: 299 GSKAGPEQ---HPSFMLDANIPEDLREIYEAESRVL-TISW----IIIAVCSILKGGEGG 350
Query: 282 KNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL---YKGRRKIASKGDEG----TKWRAS 334
+ S+ YW+L +P+ G+ Y L Y+ + ++ + EG T+ AS
Sbjct: 351 QGIVACGSLGYWLLVAAPLPMVLGLVMYCGDILVRGYEDKLRLGYEFAEGDIHWTRKNAS 410
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
YC I+AG G LG+ G ILGP+ LELG+ P V +A++ F + F++S + ++
Sbjct: 411 VYPLYC---ISAGFAAGALGIAAGTILGPILLELGMLPLVGTASSGFMVIFTASSTTFQF 467
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 454
++ + + YAL+F + ++ +G V+ +K + ++ LS ++ S + +G
Sbjct: 468 LIMGQLQIDYALFFCGIGLLGGAIGNTVVSFFVKKYKKTWFVVAILSAVLAASTVLMGYA 527
Query: 455 GLAKMIKRIEHKEYMGFDNIC 475
G + +H + MG +C
Sbjct: 528 GFERAELSYDHGKNMGIRRLC 548
>gi|301117154|ref|XP_002906305.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107654|gb|EEY65706.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 543
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 136/526 (25%), Positives = 237/526 (45%), Gaps = 95/526 (18%)
Query: 41 SKNKND----HVEPNIVIKVSAPKSRSGYKH------VWPD---IKFGWRIVVGTIIGFF 87
++N ND VE ++V + + G KH V D + G+I F
Sbjct: 23 AQNVNDSPVLDVESDVVCEAATDCLAQGLKHSTCRAGVCVDNALFPLSHEEIAGSIGAFV 82
Query: 88 GTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-D 146
T + G+GGGG+ VP+ +++ + + +SK + GGA+++ + N+R+RHP +
Sbjct: 83 ATILAAGSGLGGGGLLVPLYIMLMSMSSHEAVPLSKTTIFGGAIASFLLNVRKRHPLVRS 142
Query: 147 IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
P+IDY+ LL +P+ + G IGV N +F +W+IT+ ++ L + R + KG + WK+
Sbjct: 143 RPLIDYETMLLMEPMTLAGTIIGVNMNAVFPEWLITICIVWLLTKTALRTYSKGKKIWKE 202
Query: 207 ETITKREAA----------------KQLELIGNGY---------QTEECK---------- 231
E + KQ +++ Y + EE +
Sbjct: 203 EVDADNKIIMDIVAYWRLLPYESNFKQFQVVARAYLKWKAYKSPEKEELRLKILADEASS 262
Query: 232 CDPEYLSNDTTPEETREPKKSKVSIIENIY-W-------------------------KEF 265
+ S++ T T E S + EN+ W +
Sbjct: 263 VEERKSSSNITEASTEEETSSDENESENLMSWGLQDKRPVKFLSVEEIAKTRRTVPMADM 322
Query: 266 GLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVALYKG 317
G+L W ++ +AK T SV+ YW+L ++ P V+ Y + + +
Sbjct: 323 GVLFLTWIGLVLFSMAKGGHGTPSVIGLSCGSIGYWLLVIVSFPFFMSVTIYFGMKISRF 382
Query: 318 RRKIAS------KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 371
+ + KGD W +V Y A AG+ GLLG+GGG + GPL +E+G+
Sbjct: 383 HTMLQASDYTYAKGD--MIWTKFAVVKYPALCTAAGVAAGLLGIGGGMVKGPLLIEMGLL 440
Query: 372 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 431
PQVSSAT+ I F+SS + +++ +L V +AL+ A+ +A +GQ + L K
Sbjct: 441 PQVSSATSSSMILFTSSATTIQFIILGTLSVNHALWHGAVGFVAGLIGQLGMSYLFKKYR 500
Query: 432 RASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 477
+++++IF ++ I VS + +G +G RI + GF ++C +
Sbjct: 501 KSALVIFLVAVFIGVSGIVMGVLGAV----RISEIGFGGFRSLCLS 542
>gi|116783346|gb|ABK22903.1| unknown [Picea sitchensis]
Length = 95
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 62/93 (66%), Positives = 77/93 (82%)
Query: 383 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 442
+ FSSSMSVVEYY LKRFPVPYA YFF + IIAAF GQHV++KL+ +LGRASIIIF L+F
Sbjct: 1 MLFSSSMSVVEYYFLKRFPVPYAAYFFGVCIIAAFTGQHVIRKLVLLLGRASIIIFCLAF 60
Query: 443 IIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+IF+SA +GGVG++KM+ I+ YMGF N+C
Sbjct: 61 MIFISAWIMGGVGISKMVHEIKDGAYMGFQNLC 93
>gi|242043762|ref|XP_002459752.1| hypothetical protein SORBIDRAFT_02g009840 [Sorghum bicolor]
gi|241923129|gb|EER96273.1| hypothetical protein SORBIDRAFT_02g009840 [Sorghum bicolor]
Length = 432
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 118/395 (29%), Positives = 197/395 (49%), Gaps = 47/395 (11%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQR---------HPTLDIPVIDY 152
++VP+L+++ G K++ A+S MVTGG +S V+Y L R P+IDY
Sbjct: 64 LYVPILSIVAGLSLKTATALSTFMVTGGTLSNVLYTLFLRGGGSGSGQGQGQGQQPLIDY 123
Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLST-RAFLKGVETWKKETITK 211
D+A++ QP L+LG+S+GV NV+F +W+IT L LFL +T + + GV W+ ET
Sbjct: 124 DIAVVSQPCLLLGVSVGVVCNVVFPEWLITALF-SLFLAFATFKTYGAGVRRWRAET--- 179
Query: 212 REAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAV 271
+L I + E + L + VS W + +LV V
Sbjct: 180 ----AELGRIPDAAGAETAAAEEALLGRN-------------VSGGHRCQWVDLAVLVTV 222
Query: 272 WAIILALQI------AKNY--TTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS 323
W + + AK C +YW++ + Q+PVA + A +RK +
Sbjct: 223 WLCFFVMHLFIGGEGAKGVFDIEPCGTVYWLITVAQVPVAVAFT-----ACIGQKRKSQA 277
Query: 324 KGD---EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 380
G K A + + G++ GL G+GGG +L P+ L++G+PP +SAT +
Sbjct: 278 HGQVISAKRKLDALPAYVFPVAALLTGVMSGLFGIGGGLLLNPVLLQIGVPPTTASATTM 337
Query: 381 FAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTL 440
F + F +SMS+V++ +L + A+ + A +A+ VG ++ ++ GRAS+I+F +
Sbjct: 338 FMVLFCASMSMVQFIILGVDGIASAVLYAATCFVASIVGLVGIQGAVRRSGRASLIVFMV 397
Query: 441 SFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ ++ VSAL + G A++ + +YMGF C
Sbjct: 398 AGVLAVSALVIACSGAARVWEEYMSGQYMGFKMPC 432
>gi|167393930|ref|XP_001733526.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165894999|gb|EDR22813.1| hypothetical protein EDI_269710 [Entamoeba dispar SAW760]
Length = 404
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 175/367 (47%), Gaps = 49/367 (13%)
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
F W+++VG+I F + G+GGG ++ + LI+ D + +SK G A
Sbjct: 7 FDWKLIVGSIGSLFFAVLCAGSGIGGGCFYLVIFVLILQMDPHQAIPLSKITTFGVACGG 66
Query: 134 VVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
+ +RHP + P+I Y AL+ +PL + G IGV FN+I W+I ++L++L
Sbjct: 67 FLILWMKRHPIVRYKPLISYPTALMVEPLTIYGTMIGVIFNIISPSWLIIIVLVILLGFT 126
Query: 193 STRAFLKGVETWKKETITKREAAKQLELI--------------------GNG-------- 224
S + F K ++ WK E KR+A K EL+ GN
Sbjct: 127 SYKTFAKAIKQWKNEN-EKRDATKATELVETSKPDITDNDNDDMKPSENGNNAVIVDERI 185
Query: 225 -YQTEECKCDPEYLSNDTTPEETREPK----KS--KVSIIENIYWKEFGLLVAVWAIILA 277
+ +E P+ L D + E ++ K K+ K II+ + G+L+ VWA++
Sbjct: 186 QEEDDEQGTGPKLLPQDESQEAQQQAKIVEEKTLLKREIIKAVL--SVGILIIVWAVMFF 243
Query: 278 LQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT 329
+ I K S++ YW+L ++ P+ V+ + L+ +R +G+
Sbjct: 244 IVILKGGEKMDSIVGIECGTPWYWILTVIGGPLMLTVTIIVGIFLWWRQRGEEVEGE--V 301
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
+W + AG+ LG+GGG ++GP+ LE+G+ PQV++AT+ F I F++S
Sbjct: 302 QWTVKNCLIIPIGAFFAGVSAAYLGIGGGMVIGPILLEIGVLPQVATATSAFMIMFTASS 361
Query: 390 SVVEYYL 396
S ++Y +
Sbjct: 362 SSLQYII 368
>gi|301122685|ref|XP_002909069.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099831|gb|EEY57883.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 470
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 182/376 (48%), Gaps = 30/376 (7%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
I VP + LI+GFD K + VS + GGA++ +N+R+RHP +D P+ID DL+ P
Sbjct: 74 TILVPAMVLIMGFDIKRATPVSNLAIVGGAIANAWFNIRKRHPAVDRPLIDADLSFSMIP 133
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT-KREAAKQLE 219
L++ G +G + +++++L +V+ ++ TR KG++ ++ E + K + ++ +
Sbjct: 134 LVMGGAVVGTVLAKLLPSYLLSLLFVVVLVLGGTRTVSKGIKMYRAEMKSCKVQTTEEQQ 193
Query: 220 LIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVS------------IIENIYWKE--- 264
C + ++D + K S ++ I +E
Sbjct: 194 AAAYAAVCSPSSCTEDKFADDGGDSTSHSLLKGTGSLSNECGGSTDEEVLTEIVERERHF 253
Query: 265 ----FGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGR-R 319
G ++ + I+A I + + YW+L ++++P G AV LY+ R
Sbjct: 254 SLTKHGAIMLCYMGIVAASIGGAAVSCGGITYWLLLIIELPWIAGFGVCTAVYLYRQHCR 313
Query: 320 KIA-----SKGDEGTKWRASQLV-FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQ 373
K++ + GD W +V F AC AG++ GL G+GGG + GPL +E+GI P+
Sbjct: 314 KVSVNYEFAAGD--IHWTKKTVVRFPLACA-GAGLIAGLFGVGGGIVTGPLMIEMGIVPE 370
Query: 374 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 433
V+SAT + +SS+ + ++ + +AL A++ + V Q ++ ++ GR
Sbjct: 371 VASATTALMVLYSSAAATAKFAVFNMTAWDWALLLSAVAFVVTAVSQVIILGFVRRTGRQ 430
Query: 434 SIIIFTLSFIIFVSAL 449
S+I+ + I + A+
Sbjct: 431 SVIVLCIGATICIGAV 446
>gi|224014684|ref|XP_002297004.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968384|gb|EED86732.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 522
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 116/461 (25%), Positives = 188/461 (40%), Gaps = 99/461 (21%)
Query: 80 VGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR 139
VG I+ F + GVGGGGI+VP+ L++ F KSS+ +S+ + G ++ ++ N+R
Sbjct: 11 VGIIVWFLTAGIATACGVGGGGIYVPLGILLLRFPPKSSSGLSQASIFGASLGGLIVNIR 70
Query: 140 QRHPTLDI-----------------------------------------------PVIDY 152
RHP L + PVIDY
Sbjct: 71 NRHPDLYMRDTRGKPLQEGSGKIVSYEKDKGPAAIEEDRKAYLAGGDGKQKFYTRPVIDY 130
Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK------------- 199
D+AL P+ + G +GV +F DW+ V+ S + + K
Sbjct: 131 DMALFLAPMEMAGAVLGVIIQRLFPDWLFLSFAAVVLGFTSYKTYTKFFSAFKKDKEKRE 190
Query: 200 --------GVETWKKETITKREAAKQLELIG-------NGYQTEECKCDPEYLSNDTTPE 244
E ++ E K + N Y+ + + D +
Sbjct: 191 KDRQLVQAESERNVSTSVVSDEDNKDEDATNGVRDANTNAYEGNAANTSTKSKNGDEEEQ 250
Query: 245 ETREPK---KSKVSIIENI--YWKE-FGLLVAVW---AIILALQIAKNYTT----TC-SV 290
E +PK K ++ + ++ Y KE L+ +W +I L+ K + TC
Sbjct: 251 EQDDPKELEKRRLFLQDDSRQYPKEKIAFLILLWIGLTVITFLKGGKGIDSVIGITCEDP 310
Query: 291 LYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASK-------GDEGTKWRASQLVFYCACG 343
Y VL Q G + AV +K ++ + + W +L FY
Sbjct: 311 AYIVLVAAQFLWTFG---FAAVFGWKNTKRTQERLAVNYPFQEHDVLWNFKKLQFYSFFT 367
Query: 344 ITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVP 403
AG+V GL+G+GGG +LGPL L +GI P VS+AT I +SS V + + P
Sbjct: 368 FVAGIVAGLIGIGGGMVLGPLMLVMGIHPSVSTATTASMILLTSSSVAVMFVMSGLVPWQ 427
Query: 404 YALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 444
YALYFF + ++ A++G+ + ++ G S+++ TL+ II
Sbjct: 428 YALYFFCVCLVGAYIGKSRIDSYVRKTGMTSVLVGTLATII 468
>gi|2911082|emb|CAA17544.1| hypothetical protein [Arabidopsis thaliana]
gi|7268922|emb|CAB79125.1| hypothetical protein [Arabidopsis thaliana]
Length = 431
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 208/416 (50%), Gaps = 46/416 (11%)
Query: 71 DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
++K I++ ++ F S GG+GGGG+F+P++ ++ G D K++++ S MVTGG+
Sbjct: 51 ELKLSSAIIMAGVLCFLAALISSAGGIGGGGLFIPIMTIVAGVDLKTASSFSAFMVTGGS 110
Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
++ V+ NL +DYDLALL +P ++LG+SIGV N + +W+ITVL V
Sbjct: 111 IANVISNLFGGKAL-----LDYDLALLLEPCMLLGVSIGVICNRVLPEWLITVLFAVFLA 165
Query: 191 VLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPK 250
+ GV+ WK E+ RE+ G G EE K N P +
Sbjct: 166 WSILKTCRSGVKFWKLESEIARESGHGRPERGQGQIEEETK-------NLKAPLLEAQAT 218
Query: 251 KSKVSIIENIYWKEFGLLVAVWA---IILALQIAKN-----YTTTCSVLYWVLNLLQIPV 302
K+K I W + G+LV VWA +I L+ K+ C V YW+L LQIP+
Sbjct: 219 KNK----SKIPWTKLGVLVIVWASFFVIYLLRGNKDGKGIITIKPCGVEYWILLSLQIPL 274
Query: 303 AGGVSAYEAVALYKGRRKIA---SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGF 359
A + A++ + R++ + K EGT+ S + + A AG++GG+ G+GGG
Sbjct: 275 AL-IFTKLALSRTESRQEQSPNDQKNQEGTRLDKSTRLKFPAMSFLAGLLGGIFGIGGGM 333
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
++ PL L+ GIPPQ+++AT F + FS++MS V+Y LL
Sbjct: 334 LISPLLLQSGIPPQITAATTSFMVFFSATMSAVQYLLL------------------GMQN 375
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
K + GRASII+F++ ++ +S + + G + + MGF C
Sbjct: 376 TDTAYKAVAQFGRASIIVFSVGTVMSLSTVLMTSFGALDVWTDYVAGKDMGFKLPC 431
>gi|123437805|ref|XP_001309695.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121891432|gb|EAX96765.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 448
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/411 (27%), Positives = 191/411 (46%), Gaps = 45/411 (10%)
Query: 73 KFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVS 132
K+ R+++G I F S G+GGGG+FVP++ L+V F + SK ++ GG+++
Sbjct: 75 KWDTRLIIGIICVFIAGIFVSGAGIGGGGLFVPIMMLLVNFPTSYAIPTSKAIIFGGSLA 134
Query: 133 AVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
++NL +RHP + P+I+Y++A + +P+ LG IGV FN I +W++ + VL
Sbjct: 135 VTLFNLNKRHPYYERPLINYNVAAMIEPISWLGTVIGVIFNSIIPEWLLYSVQFVLLTYT 194
Query: 193 STRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKS 252
+ F KG+ K + K + EL+ G P Y
Sbjct: 195 AWNTFKKGL---KDQRNAKLGISPNNELLVKGTYD-----GPTYSIGLLW---------- 236
Query: 253 KVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSV--------LYWVLNLLQIPVAG 304
LL+ ++ + LA+ + S+ +YW L P+
Sbjct: 237 --------------LLLIIYVVFLAISFLRGGDGADSIIGIKFCSPIYWALTFGPFPIYL 282
Query: 305 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 364
G++A+ V + K R + +E TK + G AGM G LG+GGG I GP+
Sbjct: 283 GITAW-MVHIAK-RYPVLGHKNELTK---KDIFLLMMSGFVAGMAAGFLGIGGGMIKGPM 337
Query: 365 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
L L I + +AT+ F I +SS + ++Y P F ++ ++ +G L+
Sbjct: 338 MLALEIEAEEMAATSSFMILMTSSATSIQYIAEGLMPWLEFGVFTSMGFVSFLIGVIFLR 397
Query: 425 KLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
L+K LG SI ++ L+ II +SA+ + VG+ +I ++ MGF C
Sbjct: 398 WLVKKLGNRSIFLYFLAAIIMISAILMSVVGIEIIILEVKEHASMGFRPFC 448
>gi|449519050|ref|XP_004166548.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101231924 [Cucumis sativus]
Length = 455
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 123/413 (29%), Positives = 219/413 (53%), Gaps = 35/413 (8%)
Query: 78 IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
IVV ++ F + S GG+GGGG+++P+L ++ G D K++++++ CMVTGG+++ V+ N
Sbjct: 63 IVVAGVLCFIAASLSSAGGIGGGGLYLPILTIVAGVDLKTASSLTACMVTGGSIANVLSN 122
Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
+ + +I++D+ALL +P ++LG+SIGV N+ F +W +T +L +FL ST
Sbjct: 123 FFSK--SGGKCLINFDIALLTEPCMLLGVSIGVICNLSFPEW-VTTILFAIFLAWSTLKT 179
Query: 198 LK-GVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 256
K G+ W++E+ +LE +G Q E + + + P T+E +S+
Sbjct: 180 CKSGMVYWERESEGLMNNGCKLE---DGLQNEN---EAKLVEEPLLP--TQENCRSRFPS 231
Query: 257 IENIYWKEFGLLVAVWAIILALQIAKNYTT--------TCSVLYWVLNLLQIPVAGGVSA 308
+ + G LV VW + + + TC YW+L+ +Q+PVA A
Sbjct: 232 M------KLGXLVLVWFCFYLIYLLRGDQDGRGLLPIETCGTGYWILSSVQVPVA---IA 282
Query: 309 YEAVALYKGRRKIASKGDEGTKWRA------SQLVFYCACGITAGMVGGLLGLGGGFILG 362
+ LYK + + + A S+ + + AG++GG+ G+GGG ++
Sbjct: 283 FTLWILYKQKSPQSEDSSQKELEEARPVVEPSKKLIFPVMAFLAGILGGMFGIGGGMLIS 342
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
P L++GI P+ ++AT F + FS++MS +Y LL A+ F + IA+ +G V
Sbjct: 343 PFLLQVGIVPEKTAATCSFMVFFSATMSAGQYLLLGMEHAEIAIIFAIICFIASVLGLVV 402
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++K I+ GRAS+IIF++S ++ +S + + G + + MGF + C
Sbjct: 403 VQKAIRDHGRASVIIFSVSIVMALSTVLMTTFGALNVFRDYIAGNPMGFKSPC 455
>gi|449454696|ref|XP_004145090.1| PREDICTED: uncharacterized protein LOC101208650 [Cucumis sativus]
gi|449470720|ref|XP_004153064.1| PREDICTED: uncharacterized protein LOC101220005 [Cucumis sativus]
Length = 455
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 123/413 (29%), Positives = 219/413 (53%), Gaps = 35/413 (8%)
Query: 78 IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
IVV ++ F + S GG+GGGG+++P+L ++ G D K++++++ CMVTGG+++ V+ N
Sbjct: 63 IVVAGVLCFIAASLSSAGGIGGGGLYLPILTIVAGVDLKTASSLTACMVTGGSIANVLSN 122
Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
+ + +I++D+ALL +P ++LG+SIGV N+ F +W +T +L +FL ST
Sbjct: 123 FFSK--SGGKCLINFDIALLTEPCMLLGVSIGVICNLSFPEW-VTTILFAIFLAWSTLKT 179
Query: 198 LK-GVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI 256
K G+ W++E+ +LE +G Q E + + + P T+E +S+
Sbjct: 180 CKSGMVYWERESEGLMNNGCKLE---DGLQNEN---EAKLVEEPLLP--TQENCRSRFPS 231
Query: 257 IENIYWKEFGLLVAVWAIILALQIAKNYTT--------TCSVLYWVLNLLQIPVAGGVSA 308
+ + G LV VW + + + TC YW+L+ +Q+PVA A
Sbjct: 232 M------KLGALVLVWFCFYLIYLLRGDQDGRGLLPIETCGTGYWILSSVQVPVA---IA 282
Query: 309 YEAVALYKGRRKIASKGDEGTKWRA------SQLVFYCACGITAGMVGGLLGLGGGFILG 362
+ LYK + + + A S+ + + AG++GG+ G+GGG ++
Sbjct: 283 FTLWILYKQKSPQSEDSSQKELEEARPVVEPSKKLIFPVMAFLAGILGGMFGIGGGMLIS 342
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
P L++GI P+ ++AT F + FS++MS +Y LL A+ F + IA+ +G V
Sbjct: 343 PFLLQVGIVPEKTAATCSFMVFFSATMSAGQYLLLGMEHAEIAIIFAIICFIASVLGLVV 402
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++K I+ GRAS+IIF++S ++ +S + + G + + MGF + C
Sbjct: 403 VQKAIRDHGRASVIIFSVSIVMALSTVLMTTFGALNVFRDYIAGNPMGFKSPC 455
>gi|357483383|ref|XP_003611978.1| Membrane protein-like protein [Medicago truncatula]
gi|355513313|gb|AES94936.1| Membrane protein-like protein [Medicago truncatula]
Length = 109
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 67/106 (63%), Positives = 88/106 (83%)
Query: 370 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 429
+ +V+SAT+ F++ FSSSMSVV+YY L RFPVPYA YF ++ IAAF GQHV++K+I I
Sbjct: 1 MKEKVASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAGQHVVRKIIAI 60
Query: 430 LGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
LGRASIIIF L+ IF+SA+SLGGVG+ KMI ++E+ EYMGF+N+C
Sbjct: 61 LGRASIIIFILASTIFISAISLGGVGIEKMIVKMENHEYMGFENLC 106
>gi|348676056|gb|EGZ15874.1| hypothetical protein PHYSODRAFT_316057 [Phytophthora sojae]
Length = 437
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 92/362 (25%), Positives = 167/362 (46%), Gaps = 53/362 (14%)
Query: 100 GGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQ 159
G I VP + LI+GFD K + +S + GGA++ +N+++RHP+ D P+ID DLAL
Sbjct: 77 GVIMVPAMVLIMGFDIKRATPISNVGILGGALANAWFNMQKRHPSADRPLIDADLALGMI 136
Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
P+L +++++L +V+ TR +KG++ ++ E+ K +A
Sbjct: 137 PVL-------------LPSYIVSLLFVVVLAASGTRMMIKGIQLYRAESTKKAQAD---- 179
Query: 220 LIGNGYQTEECKCDPEYLSNDTTPEETREPK--------------KSKVSIIE---NIYW 262
+ T + P+ + TP + + K I+E + W
Sbjct: 180 --ADSKDTADAAMSPDAYAQAFTPNPSIDSDASAAKSASASAQAVKVLAEILEQERHFAW 237
Query: 263 KEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIA 322
++ G ++ + ++A I + V WV+ L +IP VA +
Sbjct: 238 RKHGAILVCYLGVVATSIGDASVSCGGVADWVILLAEIP---------WVARKASVGYLY 288
Query: 323 SKGDEGTKWRASQLVFY---CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 379
+GD +W ++ + CA G G+V G+ G+GGG I GP+ +E+GI P+V+S+T
Sbjct: 289 IEGD--IRWTQKAVICFPLGCALG---GIVAGMFGVGGGIITGPIRIEMGIVPEVASSTM 343
Query: 380 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 439
I +SS+ + +Y + +A A++ Q V+ ++ GR SI++
Sbjct: 344 ALMILYSSAAATAKYTVFNMIAWDWAALLCAVTFAVTSAAQVVILAYVRRSGRQSIVVLC 403
Query: 440 LS 441
+S
Sbjct: 404 IS 405
>gi|358344379|ref|XP_003636267.1| Membrane protein-like protein [Medicago truncatula]
gi|355502202|gb|AES83405.1| Membrane protein-like protein [Medicago truncatula]
Length = 110
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/102 (65%), Positives = 86/102 (84%)
Query: 374 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 433
V+SAT+ F++ FSSSMSVV+YY L RFPVPYA YF ++ IAAF GQHV++K+I ILGRA
Sbjct: 6 VASATSTFSMLFSSSMSVVQYYYLDRFPVPYASYFVLVATIAAFAGQHVVRKIIAILGRA 65
Query: 434 SIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
SIIIF L+ IF+SA+SLGGVG+ KMI ++E+ EYMGF+N+C
Sbjct: 66 SIIIFILASTIFISAISLGGVGIEKMIVKMENHEYMGFENLC 107
>gi|223998204|ref|XP_002288775.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975883|gb|EED94211.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 385
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 187/394 (47%), Gaps = 43/394 (10%)
Query: 84 IGFFGTACGSV----GGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR 139
+GFF A G V GG+GGGGI VP+ LI+ F K + +S V GGAV+ ++N+R
Sbjct: 3 LGFFFAALGLVLAAGGGIGGGGILVPVYILILDFLPKHAIPLSNVTVFGGAVANTIFNVR 62
Query: 140 QRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK 199
+RHP D P+ID+DL L+ +P +LG +G N I ++ +I V+L+VL + K
Sbjct: 63 KRHPLADRPLIDWDLILVMEPSTLLGALVGANLNKILSETLIAVMLVVLLSFTAYGTLKK 122
Query: 200 GVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKK------SK 253
+ + KET E K +G + E+L ND + + E +K +
Sbjct: 123 AGKMYDKET----EDLKNEWSYSDGLR--------EHLVNDYSHMDDEEGRKGANDNNKE 170
Query: 254 VSIIENIYWKEFGLLVA------VWAIILALQIAKN-------YTTTC-SVLYWVLNLLQ 299
++ E ++EFG+ A ++ ++LA+ I K C S +W+ L
Sbjct: 171 DTVTE---YEEFGMHEANSLDRLMFVVVLAINILKGGGGFASPVGIKCGSAAFWISQALL 227
Query: 300 IPVAGGVSAYEAVALYKGRRKIASKG----DEGTKWRASQLVFYCACGITAGMVGGLLGL 355
+ G+S L K G E W + Y AG G+ G+
Sbjct: 228 LVWIIGISLVARRQLIKDTALKMGAGFRYLKEDMIWDDKSTIIYPLFSTVAGFCAGMFGI 287
Query: 356 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 415
GGG + GPL + +G+ P V+SAT+ I F+S + + + YA+ L +A
Sbjct: 288 GGGIVKGPLMIMMGVHPAVASATSACMILFTSFTATTTFAVYGLLVHDYAIACVILGFVA 347
Query: 416 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
VGQ + +L+K R S I F++ F++ +SAL
Sbjct: 348 TAVGQTITTRLLKKSRRNSYIAFSIGFVVLLSAL 381
>gi|414880189|tpg|DAA57320.1| TPA: hypothetical protein ZEAMMB73_804983 [Zea mays]
Length = 407
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 131/289 (45%), Gaps = 28/289 (9%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
+F+P+LNL+ G K + A S MVTGGA S V+YNL +IDYD+ALLFQP
Sbjct: 81 LFLPILNLVAGLSLKRATAYSSFMVTGGAASNVLYNLACTGGGGRGRLIDYDIALLFQPC 140
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
L+LG+SIGV NV+F +W+IT+L + + + GV+ W+ E+
Sbjct: 141 LLLGVSIGVVCNVMFPEWLITLLFSLFLAFCTAKTCRAGVKIWRSES------------S 188
Query: 222 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIY-WKEFGLLVAVWAIILALQI 280
G G + + P T + + W + LLV +W AL +
Sbjct: 189 GCGSGARSSRGGHSHSKEPLLPRGTSDGDAEGGGGNGAGFPWGDVALLVIIWLCFFALHV 248
Query: 281 AKN--------YTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKW- 331
C V YW++ Q+P A + Y A K R + ++G+K
Sbjct: 249 LIGDKHGKGVIRIRPCGVAYWLITFFQLPAAVAFTGYIVYAKRKKRAVHSESQEDGSKAD 308
Query: 332 ------RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQV 374
A L G + GL G+GGG +L P+ L++GIPPQV
Sbjct: 309 LADAGVEALPLPTLPLAAFVTGALSGLFGIGGGLLLNPVLLQIGIPPQV 357
>gi|224014498|ref|XP_002296911.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968291|gb|EED86639.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 516
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 114/451 (25%), Positives = 189/451 (41%), Gaps = 99/451 (21%)
Query: 33 SNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFG------WRIVVGTIIGF 86
++ K E K HVE N + P + Y +W D+ R VGT + F
Sbjct: 66 ADYKHELCYDRKLFHVEGN---EEGDPDNH--YPFLWNDVSQSSSLSVIHRNSVGTFLWF 120
Query: 87 FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
A GVGGGGI+VP+ L++ F KSS+ +S+ + G A+ ++ NLR RHP
Sbjct: 121 LTAALSITCGVGGGGIYVPVGILLLRFPPKSSSGLSQASIFGAALGGLIVNLRNRHPYTF 180
Query: 147 I-----------------------------------------------PVIDYDLALLFQ 159
+ PVIDYD+AL
Sbjct: 181 VRDTKGTPLEDHPGKIVSYEKDKGPAAIEKDREAYLAGGDGKRKFYTRPVIDYDMALFLA 240
Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
P+ + G +GV +F +W+ V+ S + + K ++KK+ + RE A +L
Sbjct: 241 PMEMAGAVLGVIIQKLFPNWLFLSFATVILGFTSYKTYKKFFSSYKKDKLN-RETAMRLS 299
Query: 220 L-------------IGNGYQTEECKCDPEY--LSNDTTPEETREPKK--SKVSIIENI-- 260
+ GN + + + E ++ ++ +PK+ + +EN
Sbjct: 300 MAESMNVSASAADATGNEEPSNDADANAEENGTADTAIKDDLDDPKELEKRRMFLENDSR 359
Query: 261 -YWKE-FGLLVAVW---AIILALQIAKNYTT----TC-SVLYWVLNLLQIPVAGGVSAYE 310
+ KE L+ +W A+I L+ K + TC Y+VL Q G +A+
Sbjct: 360 QFPKEKVAYLILLWIGLAVITFLKGGKGVDSLIGITCEDSAYYVLVAAQFLWTMGFAAFF 419
Query: 311 AVALYKGRRKIASK-------GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
+K ++ + ++ W +L FY AG+V GL+G+GGG +LGP
Sbjct: 420 G---WKNTKRTQERLAVNYPFQEQDVLWDLKKLQFYSFFTFVAGIVAGLIGIGGGMVLGP 476
Query: 364 LFLELGIPPQVSSA-TALFAITFSSSMSVVE 393
L + +G+ P V++A TA + SSS++V+
Sbjct: 477 LMMVMGVHPSVATATTATMVVLTSSSVAVIR 507
>gi|219112381|ref|XP_002177942.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410827|gb|EEC50756.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 644
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 118/432 (27%), Positives = 199/432 (46%), Gaps = 52/432 (12%)
Query: 84 IGFF----GTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR 139
IGFF G + GG+GGGGI VP+ L++GF K + +S V GGAV+ + N R
Sbjct: 207 IGFFLATLGLMVAAGGGIGGGGILVPVYILVMGFTPKHAIPLSNVTVLGGAVANTILNAR 266
Query: 140 QRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK 199
+RHP D P++D+DL L+ +PL + G +G N + + ++TVLL++L V + + K
Sbjct: 267 KRHPLADRPLVDWDLILVMEPLTIAGALLGAFLNKVLPELLLTVLLVLLLSVTAYTSLTK 326
Query: 200 GVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPE------ETRE----- 248
++ + +E ++ AA Q + +G + E D E E E
Sbjct: 327 ALKLYARE--SRAMAAAQGLVRVDGTKESELTVMARLEDQDDHDEAAEVLLENMERDDDD 384
Query: 249 --------------PKKSKVSIIENIYWKE-------FGLLVAVWAIILALQIAKNYTTT 287
P S + ++ + +E +LV ++ ++L + + K
Sbjct: 385 DESSSDDDMKSVELPASSLQAELDQLLEEECTTPMANISILVTMFIVVLTINVLKGGGAF 444
Query: 288 CSVL--------YWVLNLLQIPVAG----GVSAYEAVALYKGRRKIASKGDEGT-KWRAS 334
S L +W+ NL+ + G G+ AY V ++ +R+++ EG +W A
Sbjct: 445 PSPLGIRCGSRAFWIANLVMLAWIGIISVGIRAY-LVRRFEQKRRLSFPYVEGDIRWDAR 503
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+ Y AG G+ G+GGG + GPL L +G+ P VSSA++ I F+S + +
Sbjct: 504 ATIVYPVVCCMAGFFAGMFGVGGGIVKGPLMLAMGVHPAVSSASSACMILFTSFTATTSF 563
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 454
+ YA A+ +A F GQ L L++ R S I F++ ++ +SA +
Sbjct: 564 VVFGLLVWDYAYVCMAIGFVATFAGQVGLSYLMRRAQRNSYIAFSIGAVVLLSAFLMTIQ 623
Query: 455 GLAKMIKRIEHK 466
L M +H
Sbjct: 624 SLLSMAAGEKHH 635
>gi|348675995|gb|EGZ15813.1| hypothetical protein PHYSODRAFT_263156 [Phytophthora sojae]
Length = 421
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 167/363 (46%), Gaps = 36/363 (9%)
Query: 104 VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLV 163
VP L+ + +DA S A M+ G AVS+ T D P+ID +LAL P+++
Sbjct: 38 VPSLSEMDTYDALSIA----FMIIGLAVSSA-------GATADRPLIDPELALGLIPVVI 86
Query: 164 LGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGN 223
G +G N + +++++L +V+ R L+G+ +K+E KR + E +
Sbjct: 87 GGTVLGAVINKLIPSYIVSLLFVVVLTFSDARMTLRGIRLFKQEVAQKRAESSANETKAD 146
Query: 224 GYQTEEC---KCDPEYLSNDTTPEETREP--KKSKVSIIE----------------NIYW 262
++ P+ S+D EE R + S S+++ + W
Sbjct: 147 DPESPSVYIKASTPQPSSDDIMAEEHRLSISRSSLKSVLDEDEDGAIRSQILGKERHFAW 206
Query: 263 KEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYK--GRRK 320
+ + ++A I V+YW+L L+++P + + L+K R++
Sbjct: 207 GSHSATLVCFLGVVATSIGDASVDCGGVVYWILLLIEVPWVVAFVFFTSHYLHKIYLRKE 266
Query: 321 IASKG--DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 378
S D +W +V++ AG+V G+ G+GGG I GP+ +ELGI P+V+S+T
Sbjct: 267 AVSYQYVDGDIQWTKKTVVYFPLGCAVAGIVAGMFGVGGGIITGPIMIELGIVPEVASST 326
Query: 379 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF 438
I +SS+ + ++ + K +AL A++ + Q ++ ++ GR SIII
Sbjct: 327 TALMILYSSAAATAKFAVFKMIAWDWALLLCAVAFVVTSASQVMILGFVRRTGRQSIIIL 386
Query: 439 TLS 441
+S
Sbjct: 387 CIS 389
>gi|223992571|ref|XP_002285969.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977284|gb|EED95610.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 564
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 106/419 (25%), Positives = 172/419 (41%), Gaps = 70/419 (16%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
G+ VP+ LI+GF K + +S V GGAV+ + N+R+RHP D P++D+DL L+ +P
Sbjct: 119 GVLVPIYILIMGFSPKHAIPLSNITVFGGAVANTILNVRKRHPLADRPLVDWDLILVMEP 178
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
L + G IG N + + ++ + L+ L S K + +K E+ ++
Sbjct: 179 LTIAGALIGAFLNKLLPEAVLVLSLVALLSFTSYTTLKKAIRMYKAESKALEAQRGVRKV 238
Query: 221 IGNGYQTEE-------------------CKC----------DPEYLSNDTTPEETREP-- 249
G+G E C D + +D EE +P
Sbjct: 239 RGDGTVESELTRLAREMEEDEEDEEEEGCTVGLLDAAITTDDQDETESDANDEENGQPTS 298
Query: 250 -----------KKSKVSIIENIYWKE----------------FGLLVAVWAIILALQIAK 282
+S S ++ KE +L+ + +IL + I K
Sbjct: 299 SNTDTTNDNNDDESTFSTATDLKNKEELSKILSEERTVPKGNVQVLLVTFTVILFINIMK 358
Query: 283 NYTTTCSVL--------YWVLNLLQIPVAGGVSAYEA---VALYKGRRKIASKGDEGT-K 330
S L +WV N + I VS + V Y+ + ++ EG +
Sbjct: 359 GGGAFPSPLGIRCGSPSFWVSNGIMIGWILLVSVFARSYLVRRYEIKERVGFPYVEGDIR 418
Query: 331 WRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMS 390
W V Y AG G+ G+GGG + GPL L +G+ P+VSSA++ I F+S +
Sbjct: 419 WDGRATVVYPLVCTAAGFFAGMFGVGGGIVKGPLMLAMGVHPKVSSASSACMILFTSFTA 478
Query: 391 VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
+ + + YA L +A VGQ L L++ R S I F++ I+ +SA
Sbjct: 479 TTSFVVFGLLDMDYATVCMTLGFVATLVGQIGLFYLMEKFQRNSYIAFSIGGIVLLSAF 537
>gi|15233581|ref|NP_193858.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
gi|2911083|emb|CAA17545.1| hypothetical protein [Arabidopsis thaliana]
gi|7268923|emb|CAB79126.1| hypothetical protein [Arabidopsis thaliana]
gi|332659033|gb|AEE84433.1| Sulfite exporter TauE/SafE family protein [Arabidopsis thaliana]
Length = 393
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 123/461 (26%), Positives = 201/461 (43%), Gaps = 104/461 (22%)
Query: 12 RLLAAAFIGLLVVASVSVSAQSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWP- 70
+ L FI +L V +++ + Q + E+T + N++ KV K R+ K
Sbjct: 2 KTLFVLFILILCVFAINANQQEEINQESTQQTHQ-----NLLYKVQ--KWRTSLKDSSDA 54
Query: 71 DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
++K +VV ++ F S G+ D K++++ S MVTGG+
Sbjct: 55 ELKLSPALVVAGVLCFTAALISSASGI----------------DLKAASSFSAFMVTGGS 98
Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
++ NL H +IDYDLALL +P ++LG+S+GV N +F +W+IT L +V +
Sbjct: 99 IA----NLINNHFGCK-KLIDYDLALLLEPCMLLGVSVGVICNKVFPEWLITGLFVVFLM 153
Query: 191 VLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPK 250
S G +WK I + + E+ R+ +
Sbjct: 154 WSSMETCENGHTSWKLSLILREK------------------------------EDMRDSR 183
Query: 251 KSKVSIIENI-YWKEFGLLVAVWAIILALQIAKNYTTT--------------CSVLYWVL 295
++V I ++K L L+I K T CSV YW+L
Sbjct: 184 LAEVKRRRTIIFFKH-----------LYLKIKKTETKQSFLGRNLGIISIKPCSVEYWIL 232
Query: 296 NLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGL 355
LQIP+A + +AL R L AG++GG+ G+
Sbjct: 233 LSLQIPLA---LVFTILALS----------------RTESLQEQSISNQEAGLLGGIFGI 273
Query: 356 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 415
GGG I+ PL L GIPPQV++AT F + FS++MS V+Y LL A F + A
Sbjct: 274 GGGMIISPLLLRAGIPPQVTAATTSFMVFFSATMSGVQYLLLGMQNTEAAYVFSVICFFA 333
Query: 416 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 456
+ +G +K++ RASII+F + +++++ + + G+
Sbjct: 334 STLGLVFAQKVVPHFRRASIIVFLVGTMMYLTTIVMASFGI 374
>gi|303279591|ref|XP_003059088.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458924|gb|EEH56220.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 461
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 89/142 (62%), Gaps = 2/142 (1%)
Query: 78 IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
I + ++ FF + + GGVGGGG+FVP+L L+VG K + VS CM+ GA+ A +N
Sbjct: 16 IAIASVACFFVSILANAGGVGGGGVFVPLLMLVVGLSGKWAIPVSNCMILAGAIPATFFN 75
Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
L +RHPT D P++D + ALL P + G + GV NV+F +W+++ +LI L ST+ F
Sbjct: 76 LMKRHPTRDRPLLDTNAALLLIPATLAGTTPGVMLNVLFPEWLVSAMLICLLTYTSTQTF 135
Query: 198 LKGVETWKKETITKREAAKQLE 219
KG W+KE KR+ K++E
Sbjct: 136 QKGKREWRKEGEIKRK--KRME 155
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 90/172 (52%), Gaps = 14/172 (8%)
Query: 286 TTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGIT 345
T C YW L+ P+A G+ A LY A + G + A L GI
Sbjct: 267 TPCVNEYWA--LVAAPIALGLCASYLAGLYL-HADCAERLRVGYAYDAGDLRMETTDGIA 323
Query: 346 --------AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 397
AGM GLLG+GGG ILGPL L+LG+ P+VS+ FA+ F+SSMSV+++ LL
Sbjct: 324 KWTLWAFGAGMGSGLLGIGGGMILGPLLLDLGMSPKVSAPITHFAVLFTSSMSVIQFALL 383
Query: 398 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR-ASIIIFTLSFIIFVSA 448
+ +A L++I + + VLK+ + G ASII+ L+ I+ +SA
Sbjct: 384 GQLLPAHA--GCCLTLIGSVMSDKVLKREMAKRGYGASIIVLCLAGIMSLSA 433
>gi|146183328|ref|XP_001025893.2| hypothetical protein TTHERM_00713420 [Tetrahymena thermophila]
gi|146143638|gb|EAS05648.2| hypothetical protein TTHERM_00713420 [Tetrahymena thermophila
SB210]
Length = 505
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 94/407 (23%), Positives = 180/407 (44%), Gaps = 68/407 (16%)
Query: 68 VWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVT 127
V D+ FG ++ ++ +VGG+GGG + VP+L +++ F K + +S ++
Sbjct: 34 VHQDLTFGVVEIISYVLISIIVGLANVGGLGGGIVKVPILVILLNFSVKEATFLSYPILL 93
Query: 128 GGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV 187
GG +S + + QRHP D P+ID+DL L+ P ++LG +G+ NVI ++ ++T + ++
Sbjct: 94 GGVLSNAILLISQRHPRKDKPIIDFDLVLILVPTVLLGTVVGILMNVIISEIILTSVFML 153
Query: 188 LFLVLSTRAFLKGVETWKKETITKREAAKQLE-----------------LIGNGYQTE-- 228
++ F+K + +K+ K E ++ + N Y +E
Sbjct: 154 FMCLVCVYLFMKARDIQQKQQEDKEEQDSSIQDNNEQNQNKLSKSTIQLVKQNSYLSEVD 213
Query: 229 -------ECKCDPEY-------LSNDTTPEETREP------KKSKVSIIENIYWKEF--- 265
+ C E +S D+ + + E K+ + + IEN EF
Sbjct: 214 QKKIEQIDENCQKELGIQYYQVVSEDSQEQNSEEDSKGVSNKEKQKNKIENQLLAEFLEQ 273
Query: 266 ----------GLLVAVWAIILALQIAKNYT--------TTCSVLYWVLNLLQIPVAGGVS 307
LV ++ + + I+K C LY++L LQ+ ++ +
Sbjct: 274 EKKMLPLDKLFYLVLIFLVFTFIGISKGGKGFQSIFGIQKCDNLYYLLTALQL-ISSIIF 332
Query: 308 AY----EAVALYKGRRKIASKGDEGTKWRASQLVFYCA-CGITAGMVGGLLGLGGGFILG 362
+ + L++ + I + D + ++ F + G+ AG + G+LG+G G I+
Sbjct: 333 MFFIYLQQKRLHEYKISINYQFDREDFYFSNYNFFILSLSGLAAGSITGMLGMGSGLIIL 392
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFF 409
P+ L LG +V S+T+ F F S++ Y+L + Y + F
Sbjct: 393 PVLLSLGCHTRVCSSTSGFMYLFIGGTSII--YVLTEGILSYKMILF 437
>gi|294944275|ref|XP_002784174.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239897208|gb|EER15970.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 393
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 169/361 (46%), Gaps = 35/361 (9%)
Query: 93 SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDY 152
S G+GGG +F P+L+L+ D+K ++A S+ ++ +++ + N + P+I
Sbjct: 24 SPAGIGGGILFTPVLHLVGELDSKEASATSQALIAASQLASCIINFWTQWHEPKKPLIIL 83
Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
++ P +V G IGV N+I +I +L +++ + + LKGV+ ++ E TK+
Sbjct: 84 PYVIIMLPCIVAGAVIGVYLNMILPQLIILILYVLVAAFSTIKTTLKGVKQYRSENATKK 143
Query: 213 EAAKQLELIGNGYQTEECKCDPEYLSNDT--TPEETREPKKSKVSIIEN-----IYWKEF 265
A+K+ E P S T T EE +E K ++ + YW
Sbjct: 144 -ASKEHE-------------SPSSASQKTIVTLEEAKEKKVDPFLVMPSRKVLFFYWTTA 189
Query: 266 GLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKG 325
L +W + L + + +T S+ +P GGV + A I+S G
Sbjct: 190 FL---IWVLCLIFPLLRGSSTAKSIA-------PVPYCGGVYWFLAALEIALLLGISS-G 238
Query: 326 DEGTKWRASQLVFYCACGIT---AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFA 382
K + +LV C G+ G++ ++G+GGG ++ P+ L+ G+ PQ +AT
Sbjct: 239 FIFAKRKVCRLVQCCILGVEIVFTGIISSMVGIGGGILMNPIILDFGLNPQQGTATNAIN 298
Query: 383 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 442
I S+ + + Y + FP ++ L + +G+ VLK+++ GR S+++F L+
Sbjct: 299 IFAMSTSTALSYGMSGYFPGGSDMWIVVLPFVGGIIGKLVLKQIVAKTGRMSVLVFLLAG 358
Query: 443 I 443
I
Sbjct: 359 I 359
>gi|348689886|gb|EGZ29700.1| hypothetical protein PHYSODRAFT_309925 [Phytophthora sojae]
Length = 580
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 132/483 (27%), Positives = 210/483 (43%), Gaps = 102/483 (21%)
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
F + ++ T+I F TA G+ GG+GGGG+ VPM + G + K + +SK + G AV+
Sbjct: 116 FTFNDLLTTLIAFSCTALGAGGGIGGGGLLVPMY-IFAGLNPKHAIPLSKVTIFGSAVA- 173
Query: 134 VVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLS 193
++ P+ID+ L L +P+ ++G GV N I +W+I VLL+ L ++
Sbjct: 174 -----------MNRPLIDFALVALMEPMTLVGTVFGVMLNHISPNWLILVLLVTLMSFIT 222
Query: 194 TRAFLKG---------------------------------------VETWKKETITKREA 214
LKG VE + + K
Sbjct: 223 YNTVLKGNKIQDKESKLQHALVKSTLIGGPNGRGRGRTWSIYRRFDVEVAARHWLEKTRR 282
Query: 215 AKQLELIGNGYQTEECKCDP--------------------EYLSNDTTPEETR---EPKK 251
A++ L+ + + P + S+D + R E ++
Sbjct: 283 ARKARLVREEDEEDFSSLPPLIARKAIGLDPLVGDNRRFGTFPSDDDKKAQRRGTIERRE 342
Query: 252 SKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVA 303
+V E I W LV W IIL I + S + YWVL L+ + +
Sbjct: 343 GRVLPFEYI-WP----LVVSWFIILVQSILRGGHGAPSAIGVGCNSSDYWVLTLMPLSIL 397
Query: 304 GGVSAYEAVALY-KGRRKIAS-----KGDEGTKW-RASQLVFYCACGITAGMVGGLLGLG 356
G+S Y L R K+ S +GD W + LVF C I AG+ GLLG+G
Sbjct: 398 VGISLYVGYRLRLTNRLKVVSGYFFVEGD--MHWVKRRTLVFPAVCTI-AGVAAGLLGIG 454
Query: 357 GGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA----LS 412
GG + GP+ LE G+ P V SATA F I F++S + +++ + +FP + F A +
Sbjct: 455 GGMVKGPIMLEAGVLPAVQSATASFMILFTASSTTLQFAINGQFPGEFQFDFMAWLAFVG 514
Query: 413 IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFD 472
+ F G + +K R SI+++TL+ I +SA+++ +GL + IE ++GF
Sbjct: 515 FVGGFCGLKCVGYFVKKYRRESIMVYTLAATIGLSAVAMRFIGLQSTLSDIESGVHLGFH 574
Query: 473 NIC 475
IC
Sbjct: 575 GIC 577
>gi|297737351|emb|CBI26552.3| unnamed protein product [Vitis vinifera]
Length = 100
Score = 103 bits (257), Expect = 2e-19, Method: Composition-based stats.
Identities = 49/76 (64%), Positives = 62/76 (81%)
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
M G A S + NL+ +HPTLD+P+I+YDLALLFQP+L++GISI VAFNV+F D M+T+L
Sbjct: 24 MNMGAASSTIYCNLKLKHPTLDMPIIEYDLALLFQPMLMMGISIEVAFNVVFVDSMVTIL 83
Query: 185 LIVLFLVLSTRAFLKG 200
LIVLFL ST+ FLKG
Sbjct: 84 LIVLFLGTSTKTFLKG 99
>gi|297849536|ref|XP_002892649.1| hypothetical protein ARALYDRAFT_334446 [Arabidopsis lyrata subsp.
lyrata]
gi|297338491|gb|EFH68908.1| hypothetical protein ARALYDRAFT_334446 [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 122/258 (47%), Gaps = 32/258 (12%)
Query: 60 KSRSGYKHVWPD-----IKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFD 114
+ S + H P+ IKF ++ ++ FF S+ GGG +F+ ++ I G +
Sbjct: 29 NATSQWLHFPPNLNESTIKFSIPTIIAAVLSFFA---ASISSAGGGALFLSIMTTISGLE 85
Query: 115 AKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
K++++ S M+TG +++ V NL R+P + D +ID+DL+L QP L+LG+SIGV N
Sbjct: 86 MKTASSFSAFMITGVSIANVGCNLFARNPKSRDKTLIDFDLSLTLQPCLLLGVSIGVICN 145
Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCD 233
+F +W++ L V + + KGV W E ++RE + + D
Sbjct: 146 RMFPNWLVLSLFAVFLAWSTMKTCKKGVSYWNLE--SEREKIRS-------------RRD 190
Query: 234 PEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYT-------- 285
+ + +P E + + W + G+LV +W + ++ + +
Sbjct: 191 DDRIKVARSPLLANEGEAEVERGMIRFPWMKLGVLVIIWLVFFSINLFRGNKYGQGIISI 250
Query: 286 TTCSVLYWVLNLLQIPVA 303
C LYW L+ LQIP+
Sbjct: 251 KPCGGLYWFLSSLQIPLT 268
>gi|358344827|ref|XP_003636488.1| hypothetical protein MTR_042s0030 [Medicago truncatula]
gi|355502423|gb|AES83626.1| hypothetical protein MTR_042s0030 [Medicago truncatula]
Length = 150
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/119 (54%), Positives = 77/119 (64%), Gaps = 22/119 (18%)
Query: 310 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 369
+A AL+ RR IAS GD+G + L+ YC G+ AG+VGGLLGLGGGF++GPLFLELG
Sbjct: 27 KATALFTQRRVIASAGDQGKAFTVGLLIIYCVFGVLAGIVGGLLGLGGGFVMGPLFLELG 86
Query: 370 IPPQ----------------------VSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
+P Q V SAT FA+TFSSSMSVVEYYLLKRFP+PY L
Sbjct: 87 VPSQSSESFKLQYINRHSLILIIKQLVPSATTTFAMTFSSSMSVVEYYLLKRFPIPYGL 145
>gi|268083363|gb|ACY95275.1| unknown [Zea mays]
Length = 94
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 69/93 (74%)
Query: 383 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 442
+ FSSS+SVVE+Y L RFP+P+A Y +SI+A F GQ +++K++ +L RAS+I+F LS
Sbjct: 1 MMFSSSLSVVEFYFLHRFPLPFAGYLIFISILAGFWGQCLVRKIVHVLKRASLIVFILSS 60
Query: 443 IIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+IFVSAL++G VG K I I + EYMGF N C
Sbjct: 61 VIFVSALTMGVVGTQKSISMINNHEYMGFLNFC 93
>gi|401403532|ref|XP_003881498.1| Os03g0726500 protein, related [Neospora caninum Liverpool]
gi|325115911|emb|CBZ51465.1| Os03g0726500 protein, related [Neospora caninum Liverpool]
Length = 494
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 117/243 (48%), Gaps = 26/243 (10%)
Query: 3 RVGSECFRWRLLAAAFIGLLVVASVSVSAQSNLKP----EATSKN--------KNDHVEP 50
R+ E R RL A L ++ V Q P + T +N D +P
Sbjct: 110 RLQQEAIRVRLQQEAIRVGLQQEAIRVGLQQEAIPVGLQQETPRNTVGLYRSLSTD--DP 167
Query: 51 NIVIKVSAP-KSRSG---YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPM 106
+V+ +P + SG + ++ G IV +I G A G GGG IFVP+
Sbjct: 168 LLVLNTGSPVRKLSGLVDERRTLSELNNGPTIVCIVLIAIVG-AVSVTAGTGGGAIFVPL 226
Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
+ LI+ F+ + A S+C++TG A++ + N +R+P +D+P+ID D+ LL P+ + G
Sbjct: 227 MQLIMHFNTFEATATSQCLMTGSALAGLCLNFVRRNPVVDMPLIDMDMVLLLGPMQMCGS 286
Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQ 226
S+GV N + W+ITVLL+V L + R ++ +REA K +L + +
Sbjct: 287 SVGVIVNRVLPAWLITVLLVVCLLYETVRLM-------RRLRDKQREAKKVTQLTASEHA 339
Query: 227 TEE 229
+E
Sbjct: 340 HKE 342
>gi|242083980|ref|XP_002442415.1| hypothetical protein SORBIDRAFT_08g019640 [Sorghum bicolor]
gi|241943108|gb|EES16253.1| hypothetical protein SORBIDRAFT_08g019640 [Sorghum bicolor]
Length = 170
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 65/89 (73%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
I+ + + + + K+ M+ GG+VS V YNL+ +HP+LD+P+I+YDLALL QP+
Sbjct: 43 IYSTAIYVYLNLNEKAQVCWFCSMIMGGSVSTVYYNLKLKHPSLDMPLIEYDLALLMQPM 102
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFL 190
L+LG+SIGV FNVIF +W+IT LLI +FL
Sbjct: 103 LMLGVSIGVIFNVIFPNWLITALLITIFL 131
>gi|219112005|ref|XP_002177754.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410639|gb|EEC50568.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 629
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 177/385 (45%), Gaps = 47/385 (12%)
Query: 109 LIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISI 168
L++G + ++ V GGA+++ + N+++RHP D P+ID+DL L+ +PL ++G +
Sbjct: 220 LVMGLHPHYAIPIASVTVFGGALASTIVNMQRRHPLADRPIIDWDLVLMMEPLTLIGTLL 279
Query: 169 GVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE--AAKQLELIGN--- 223
G F+ I ++ ++ VLL++L + + K + ++ E R AA+ GN
Sbjct: 280 GTLFHRILSEKILIVLLVLLLSITAHSTLSKAMRMYEAEKRYIRHLIAAQADSPRGNPSL 339
Query: 224 -GY-----QTEECKCDPEYLSNDTTPEETREP--------KKSKVSIIENIYWKEFGLLV 269
GY ++ + D E R+ + K ++E ++
Sbjct: 340 GGYVLPFGDEDDSRADTGCKEEARMAAEERQRILILNPDFRTMKTDLLEQEKVTPRSKII 399
Query: 270 A---VWAIILALQI-------AKNYTTTC-SVLYWVLNLLQIPVAGGVSAYEAVALYKGR 318
A ++++++ L + + C S +WV++++ I SA+ A R
Sbjct: 400 ALCCMFSVLIFLNLMVGGGSFDSPWDIKCGSTAFWVVHVVMIAFLMS-SAWMAQTYLIAR 458
Query: 319 RKIAS-------KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 371
+I GD KW + Y A AG+ G+ G+GGG ++ PL L G+
Sbjct: 459 HEIKDMVRFDYVHGD--IKWDTRTSIIYPAVFTIAGVFAGMFGIGGGVVIVPLLLHSGVH 516
Query: 372 PQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI-- 429
P V+SAT+ I F+S SV Y++ +A+ F + I++ +GQ +++++ +
Sbjct: 517 PGVASATSSAMILFTSLASVSTYFVFGLIVADFAMAGFVIGFISSTLGQILMRRVRQAKS 576
Query: 430 -----LGRASIIIFTLSFIIFVSAL 449
R S + F + ++ VSAL
Sbjct: 577 ASGRKFERNSYLAFVIGGVVLVSAL 601
>gi|356533449|ref|XP_003535276.1| PREDICTED: uncharacterized protein LOC100806242 [Glycine max]
Length = 408
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 113/204 (55%), Gaps = 15/204 (7%)
Query: 277 ALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQL 336
A+ + +NYTT CS LYW++NLLQ+P+ G + YE V LYKG+ IASKGD+ T+W QL
Sbjct: 201 AVIVKQNYTTNCSALYWIMNLLQVPITIGTTFYEVVLLYKGQSVIASKGDQQTRWHVQQL 260
Query: 337 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE--Y 394
+ YC CGI +G++GGLLGLGGGFILGPLF+ LGI PQ+ + T SS+ +
Sbjct: 261 ILYCMCGIISGIIGGLLGLGGGFILGPLFIGLGIHPQLKTNTKPHPWLLSSTHTQPHPPK 320
Query: 395 YLLKRFPVPYALYFFAL-----SIIAAFVGQHVLKKLIKILGRASIIIFTLSF------- 442
+LL + P P + + I A K+ +ILG L F
Sbjct: 321 FLLYQSPNPKSSRLITVHAEVKHTIVADTTTSSNDKIREILGNHDYNKLGLPFNEQKHLA 380
Query: 443 IIFVSALSLGGVGLAKMIKRIEHK 466
I V A+ L V +A ++ + K
Sbjct: 381 NIIVDAV-LDNVSIATTYRKTQEK 403
>gi|325193669|emb|CCA27930.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 591
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 91/158 (57%), Gaps = 8/158 (5%)
Query: 323 SKGD-EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
S+GD TK + ++L FYC + AG+ GLLG+GGG + GP+ LE+G+ P V ATA F
Sbjct: 434 SEGDLHWTKKKTTRLPFYC---VIAGITAGLLGIGGGMVKGPIMLEMGVLPMVQVATANF 490
Query: 382 AITFSSSMSVVEYYLLKRFP----VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 437
I F+S+ + +++ + +FP Y +F + A+ GQ V+ L+K R S+++
Sbjct: 491 MILFTSASTTLQFAIAGQFPGSLQYDYVGWFACAGFVGAYCGQSVVAFLLKKYNRESMLV 550
Query: 438 FTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
+ L+ +I VSA +G VG + I + ++GF C
Sbjct: 551 YILAVMIGVSAFCMGIVGFQIVENEIALRMHLGFSGSC 588
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 85/146 (58%), Gaps = 2/146 (1%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++ TI+ TA + G+GGGG+ VP L++G K + +SK + G +V+ +N
Sbjct: 123 LIATIMTLLITALSAGCGIGGGGLLVPSFILVIGLSPKHAIPLSKATILGTSVANFWFNY 182
Query: 139 RQRHPTLD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
+RHP + +P+I+Y +A + +P ++G IGV N + +W+I +LLI L ++ R F
Sbjct: 183 HRRHPMKNNVPIINYAMAAIMEPPTLIGAVIGVMLNHVVPNWLIFLLLISLLTSITLRTF 242
Query: 198 LKGVETWKKETITKREAAKQLELIGN 223
+KG +KET +R+A + +GN
Sbjct: 243 IKGNRLREKET-KRRQALIKNVFVGN 267
>gi|403339320|gb|EJY68918.1| hypothetical protein OXYTRI_10465 [Oxytricha trifallax]
Length = 539
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/426 (25%), Positives = 190/426 (44%), Gaps = 68/426 (15%)
Query: 113 FDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP---VIDYDLALLFQPLLVLGISIG 169
FD K++ A+S + +V+ +Y L ++HP D +I+Y+LA++ P +++G G
Sbjct: 118 FDTKNAIAISNFAIFTCSVTRYIYTLDKKHPHPDKKQNVIIEYNLAIVMLPTVMMGSLTG 177
Query: 170 VAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT-KREAAK------------ 216
V N+IF + +L L + LS ++ +KG + ++KETI ++EA K
Sbjct: 178 VFLNIIFPAIALQAILTALLIFLSLQSLMKGKDMYRKETIKFQQEAEKLKKKQDEERLMM 237
Query: 217 ----QLELIGNGYQTEECKCDPEYLSND-TTPE-----ETREPKKSKVSIIENIYWKEFG 266
+L+ G E D E L + T P+ E E + +I N+ +E
Sbjct: 238 EKILKLKTDSQGEIIRESPIDEEQLKKEGTKPQQIQNFEIEEGQPLTKGLINNVNQEELQ 297
Query: 267 LLVAV---------WA------IILALQIAKNY------------TTTCSVLYWVLNLLQ 299
L + W +I +L ++ N CS+L W++
Sbjct: 298 QLERILEKEKGHKQWDKHLTCLLIFSLLVSTNLLRGNKSLDSIIGIERCSLLDWMVLFTF 357
Query: 300 IPVAGGVS-------AYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGL 352
+ ++ E + K +R +A T +L + G G + G
Sbjct: 358 YILCSSITFLGIRRVRKEQILKEKYKRGLADCDIRLTPRNTLRLQIFSFVG---GWISGA 414
Query: 353 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 412
LGLGGG I P+ + LG PP V++AT+++ I+FSS+ S Y + +P++++ +
Sbjct: 415 LGLGGGAIFNPILIGLGTPPAVATATSMYMISFSSAGSTATYIIYGLINLPFSIWVGVIG 474
Query: 413 IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL---GGVGLAKMIKRIEHKEYM 469
A G + + K R S I+F L+ ++ SAL + GG+ L K+I R E +
Sbjct: 475 CFGATGGLALFNVVTKKYNRQSFIVFVLAGVMGASALLVPIFGGLDLMKLIDRGE--DIF 532
Query: 470 GFDNIC 475
++IC
Sbjct: 533 KMNSIC 538
>gi|440800424|gb|ELR21463.1| hypothetical protein ACA1_184090 [Acanthamoeba castellanii str.
Neff]
Length = 512
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 118/228 (51%), Gaps = 23/228 (10%)
Query: 264 EFGLLVAVWAIILAL-------QIAKNY--TTTCSVLYWVLNLLQIPVAGG---VSAYEA 311
+F L VW I+ A+ + ++Y CS+ +W+L + P+ G +S A
Sbjct: 275 KFLYLAIVWCIMFAIVFVRGGGESDESYLGIVKCSMYFWILWGVMFPIMLGFMVLSCIIA 334
Query: 312 VALYKGRRKIASKGDEG-TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGI 370
+Y R++ EG +W L G G G LG+G G + GP+ LE+G+
Sbjct: 335 RLIYSYRKRNGWPFIEGDVQWSVKSLFLIPFAGTIGGTAAGFLGIGSGMVNGPVMLEIGM 394
Query: 371 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 430
P+V++AT+ F I F++ +V +Y+++ AL+FF L ++AA VGQ+ ++ ++K
Sbjct: 395 TPEVATATSSFIIVFTALSTVSQYFIIGALNWQPALWFFVLGVLAAVVGQYGVQYVVKRF 454
Query: 431 GRASIIIFTLSFIIFVSALSLGGVGLAKMIK---RIEHKEYMGFDNIC 475
++SII F L+F+I G G+A ++ +I + GF ++C
Sbjct: 455 NKSSIISFLLAFVI-------AGSGVAMIVTGALQIADEGITGFADLC 495
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/148 (40%), Positives = 96/148 (64%), Gaps = 2/148 (1%)
Query: 71 DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
++ FGWR+ +GT + F G A G+GGGGI+VP+L LIVG+ AK + +SK V G A
Sbjct: 3 ELLFGWRLYLGTGLAFVGGALSGAVGIGGGGIYVPVLILIVGYTAKEAIPLSKVTVAGVA 62
Query: 131 VSAVVYNLRQRHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF 189
+S+ + N+ +RHP ++DYD+A++ P +LG ++GV VI +W+I +LLI++
Sbjct: 63 ISSFLVNVLRRHPRAPWRALVDYDVAMVMTPTTLLGTTVGVLVYVILPEWLILILLILVL 122
Query: 190 LVLSTRAFLKGVETWKKETITKREAAKQ 217
++ R F+ ++ WKKE + K EA +Q
Sbjct: 123 GLVDYRTFVAAIKLWKKEKVAK-EAERQ 149
>gi|340508148|gb|EGR33919.1| hypothetical protein IMG5_031030 [Ichthyophthirius multifiliis]
Length = 434
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 86/398 (21%), Positives = 169/398 (42%), Gaps = 55/398 (13%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
P+L +++ + + + +S C+V GG ++ + LR++HP P+IDY++ ++ P+++L
Sbjct: 55 PLLMILMNYQQRKATFISYCIVLGGCLANSLLILRKQHPLKKKPLIDYNIIMIINPMVIL 114
Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNG 224
G +IG+ NVIF + + VL I+ +S F KKE+ R+ LE + N
Sbjct: 115 GTNIGIILNVIFPEIVSGVLFIIFLCTVSPYLF-------KKESQLSRD--NDLEKVNNS 165
Query: 225 YQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNY 284
Y + K D + + + +P + K +++ L+ + + +++Q
Sbjct: 166 YIISDVKVD----NIAQSQIKNNDPGELKCFLMQEERQYPLNKLLILMFVFVSIQFLIFL 221
Query: 285 -----------TTTCSVLYWVLNLLQIPVAGGVSAYEAVA-----LYKGRRKIASK---- 324
CS YW+L + G+ Y V ++ R +I K
Sbjct: 222 RGGKGVGSFIGIKICSNSYWLL-------SAGILVYSLVVSYFIKIFISRNEIQKKMIFQ 274
Query: 325 --------GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS 376
D+ + A G+ +G + G G G +L P+F+ +PP + S
Sbjct: 275 KYGLEEYFKDDFDISDNKKYFIIWASGLLSGCLSGTFGSGAALLLMPVFISYQLPPIIGS 334
Query: 377 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASII 436
A F F + S++ + + + + L+ + FV +L +++ I+
Sbjct: 335 AVCGFNYFFIACASIISVFSEQYLTAYEVIIYSFLAFLGGFVCARILYGIVERKKAQHIV 394
Query: 437 IFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNI 474
+F I VS L +G ++ I+ K G D++
Sbjct: 395 VF-----IVVSLAVLNIIG--NIVYLIKKKNDYGIDSL 425
>gi|297597753|ref|NP_001044473.2| Os01g0786700 [Oryza sativa Japonica Group]
gi|255673761|dbj|BAF06387.2| Os01g0786700 [Oryza sativa Japonica Group]
Length = 300
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 75/126 (59%), Gaps = 3/126 (2%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL---RQRHPTLDIPVIDYDLALLF 158
++VP+LN++ G K++ A S MVTGG +S V+Y L R P+IDYD+A++
Sbjct: 29 LYVPILNIVAGLSLKTATAFSTFMVTGGTLSNVLYTLIVLRGHEKGGHQPLIDYDIAVVS 88
Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
QP L+LG+S+GV NV+F +W+IT L V + + + G++ W+ ET R +
Sbjct: 89 QPCLLLGVSVGVICNVMFPEWLITALFAVFLASATFKTYGTGMKRWRAETAAARRMLEGG 148
Query: 219 ELIGNG 224
+G+G
Sbjct: 149 SSLGDG 154
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 59/107 (55%)
Query: 369 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
G + +S+T +F + F +SMS+V++ +L + AL + +A+ VG V++ I+
Sbjct: 194 GEGAKTASSTTMFMVLFCASMSMVQFIILGVDGIVTALVYAITCFVASIVGLVVIQGTIR 253
Query: 429 ILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
GR S+I+F ++ I+ +S + + G ++ + +YMGF C
Sbjct: 254 KSGRVSLIVFMVAAILALSVVVIACSGAVRVWVQYTSGQYMGFKMPC 300
>gi|340501283|gb|EGR28084.1| hypothetical protein IMG5_183410 [Ichthyophthirius multifiliis]
Length = 470
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 88/375 (23%), Positives = 165/375 (44%), Gaps = 61/375 (16%)
Query: 65 YKHVWP--DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+K++WP I+ I++ +IG +VGG GG + VP+L L++ + +S +S
Sbjct: 24 HKNLWPPNTIEICAYILIPILIGI-----SNVGGQGGSIVRVPLLMLMLNYSQSTSVFIS 78
Query: 123 KCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
++ G + + L +RHP DIP+I++DL L+ P L++G G+ ++ +++
Sbjct: 79 FIILFGSCLPNSLLLLTKRHPFKDIPLINFDLVLILLPNLIVGNIYGILLTLVVPEFITI 138
Query: 183 VLLIVLFLVLSTRAFLKGVETWK-KETITKREAAKQLELIGNGYQTEECKCDPEYLSND- 240
+L I+ ++ + KG++ +K K+ ++E Q+ L Q + +D
Sbjct: 139 ILFILYLFAITPYFYRKGMKLYKEKKHKDQKEVYLQINL-NKTIQRHNINENVNTYQDDN 197
Query: 241 -------TTPEETREPKKSKVSIIENIYWKEFGL-----------LVAVWAI---ILALQ 279
E + P+ K + IY ++ L ++A + I IL L+
Sbjct: 198 NSNISNYNNNIEIQSPQSQKN---KQIYIRKKKLKSILPIKKILAIIATFLIIQTILMLR 254
Query: 280 IAKNYT----TTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS------------ 323
++ + T ++ Y+++NL V ++ Y +K R K S
Sbjct: 255 CSQKFDYLGIKTYNLYYYLINLFLFIV--NIAMY---LFFKDRFKTKSLIIQIKKQQQID 309
Query: 324 ------KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 377
K +E + + + G +G+ GL G+G G + P L L I P V++A
Sbjct: 310 LNESQYKDNEFSLQSYKCFLQIISLGFISGVFAGLFGIGSGLTIVPALLYLKIEPTVAAA 369
Query: 378 TALFAITFSSSMSVV 392
T F F S SV+
Sbjct: 370 TNGFITFFLSLNSVI 384
>gi|325181581|emb|CCA16031.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 421
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/330 (24%), Positives = 147/330 (44%), Gaps = 41/330 (12%)
Query: 81 GTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQ 140
G I F A S G+GGGG+ VP+ L + + +SK ++ G ++S ++ LR+
Sbjct: 97 GIIGTFLSAAIASGCGLGGGGLLVPLYILTQHLSPQKAIPLSKAVIFGASISGLIVTLRR 156
Query: 141 RHPTLDI-PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK 199
+HP + +I++D LL +P+ + G IGV I D+++T+LL
Sbjct: 157 KHPFVSSRSIINFDAVLLMEPMTLAGTVIGVTMITILPDYIVTILL-------------- 202
Query: 200 GVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIEN 259
A+ L + + +++C+ D + + K+ +
Sbjct: 203 -------------GQAQILRYSHHQFASDDCQ-DQSLTDQSLNLSDGGIMNRQKIQTSQR 248
Query: 260 IYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSAYEA 311
+ ++ W ++ + + K S++ YW + V ++AY A
Sbjct: 249 NLLCDMAVISLTWLSLIVISLVKGGHGAASIVGIGCGSYSYWAVITFMPYVFCAITAYFA 308
Query: 312 VALYKGRRKIAS---KGDEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE 367
+ K R + S + EG +W +V Y AG+ G+LG+GGG + GP+ LE
Sbjct: 309 QRILKQRAFLESYDYRYAEGEMRWDRRGVVRYPIFCSLAGVAAGMLGIGGGMVKGPILLE 368
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLL 397
+ PQV+SAT+ I F+SS +V++ L+
Sbjct: 369 IRFHPQVASATSSTMIFFTSSTTVIQEALI 398
>gi|294944337|ref|XP_002784205.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239897239|gb|EER16001.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 392
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 162/366 (44%), Gaps = 55/366 (15%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL------RQRHPTLDIPV 149
GVGGG +FVP+LNL+ +K++ A+S+ ++ V+ V+++L R+R +++P
Sbjct: 27 GVGGGILFVPLLNLVGRLPSKNATAMSQGLIMSATVAKVLFSLHAQYTSRRRGRVINLPY 86
Query: 150 IDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWK 205
+ +L P +++G IG+ W+ ++ ++L+++ S + LKG W+
Sbjct: 87 V-----VLMLPCMIVGGLIGIYIY----SWLPVLIQLILYVITALFGSAMSLLKGFHLWR 137
Query: 206 KETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSK-VSIIENIYWKE 264
ET K +A + E+ G K P R + K V+ + I+
Sbjct: 138 AETSEKEKAKRDAEVSAGG----TLKVPASSTVETVVPPLLRPITRRKAVACVITIF--- 190
Query: 265 FGLLVAVWAIILALQIAKNYTTT--------CSVLYWV-LNLLQIPVAGGVSAYEAVALY 315
AVW +++ ++ ++T C LYW ++ + + AY +
Sbjct: 191 -----AVWILVILSRLILGSSSTRSIVGISYCEGLYWALSVVVVVVLLMVPLAYALIDRS 245
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 375
G K A S L GI G + ++G+ GG I+ PL + G+ P +
Sbjct: 246 PGSSKAA------LTLSGSLL------GI--GFLAAVVGISGGIIITPLVMFTGLTPPQA 291
Query: 376 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASI 435
S T I +SS + + L P AL+ AL A G +L ++++ GR SI
Sbjct: 292 SGTGSVVILVNSSSLALSFGLGGYLPDASALWIIALPFCGALTGDIILTRIMRRTGRTSI 351
Query: 436 IIFTLS 441
+ F LS
Sbjct: 352 LAFLLS 357
>gi|428163170|gb|EKX32257.1| hypothetical protein GUITHDRAFT_148761 [Guillardia theta CCMP2712]
Length = 538
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 105/409 (25%), Positives = 183/409 (44%), Gaps = 39/409 (9%)
Query: 82 TIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQR 141
T+I F A + G GGGGI VPM +I F S+ +SK + GGA++ + N+++R
Sbjct: 136 TVITFVTIALAAPTGTGGGGILVPMYMIIGHFSPHSAIPLSKATILGGAIANNLINIQRR 195
Query: 142 HPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV----LFLVLSTRAF 197
HP + P++DYD + P L++G +GV N + W++T+ L+V F + + +A+
Sbjct: 196 HPFANRPLVDYDSLQILVPSLLIGTILGVFLNAVSPAWLVTLGLVVSLGYSFAIAAKKAW 255
Query: 198 LKGVETWKKETITKREA--AKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVS 255
VE K ++ +RE ++ E Y +E K +P+ + E+R K+
Sbjct: 256 AIYVEEVLK-SLPEREPLLGERKEQPAQHYSFDEDKLEPQL--REIIKAESRHDFKAIGM 312
Query: 256 IIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVAL- 314
I+ I W +LVAV ++I + S +W++ LL P+ +S +L
Sbjct: 313 IV--ISW----ILVAVCSLIKGGSGPNQFVACGSWSFWMVALLPFPIVMILSWRVGTSLN 366
Query: 315 --YKGRRKIASKGDEGTK-WRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIP 371
++ ++ + EG W + + I G++ G LGG G E G
Sbjct: 367 EKFESKKACGYRFAEGDAVWDVQHVRIFPFVSIIVGILAG--ALGGVEPCG----ERGDD 420
Query: 372 PQVSSATALF---AITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
S L A++ S +LL +AA +G + + +
Sbjct: 421 GAHGSLHVLLDHHAVSLPRSAQARLRHLLH-----------PRRTVAASIGNTAIHHVSR 469
Query: 429 ILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 477
+ ++ L+ I +SA+ LG VG + I+ E MGF +IC++
Sbjct: 470 KYRKTWFVVAILAITIGLSAVLLGYVGYYRAIRSWLEGEDMGFRDICHS 518
>gi|85002083|gb|ABC68446.1| unknown [Tragopogon pratensis]
Length = 134
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 68/132 (51%), Positives = 80/132 (60%), Gaps = 6/132 (4%)
Query: 267 LLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGD 326
LL AVWA L LQI KN CS YWVL + Q P V +E V LYK +K G+
Sbjct: 3 LLFAVWAAFLVLQIIKNELVVCSTWYWVLTVAQFPATLVVFGFECVKLYKESKKRKRAGN 62
Query: 327 EGT------KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 380
+ +W L+F CGI G VGGLLG GGGFILGPL LE+G+ PQV+SATA
Sbjct: 63 TESVCEASIEWTIPSLIFCALCGILGGTVGGLLGSGGGFILGPLLLEIGVIPQVASATAT 122
Query: 381 FAITFSSSMSVV 392
F + FSSS+SVV
Sbjct: 123 FVMMFSSSLSVV 134
>gi|148906476|gb|ABR16391.1| unknown [Picea sitchensis]
Length = 157
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 63/126 (50%), Positives = 80/126 (63%), Gaps = 16/126 (12%)
Query: 17 AFIGLLVVASVSVSAQSNLKPEATSKNKN-------DHVEPNIVIKVSAPKSR------- 62
A GLL+ ++S + ++ L + S D VE N +K K+R
Sbjct: 16 ATFGLLLAVTLSSAERTFLDGNSASSETERSMEKIVDPVEGN-TLKYHVLKARNFLWRSN 74
Query: 63 -SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAV 121
S Y+ VWP++KFGWRIV+G+I+GFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+
Sbjct: 75 ISDYQRVWPEMKFGWRIVIGSIVGFFGAAVGSVGGVGGGGIFVPMLTLIIGFDAKSSTAI 134
Query: 122 SKCMVT 127
SKC T
Sbjct: 135 SKCKQT 140
>gi|301093159|ref|XP_002997428.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110684|gb|EEY68736.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 591
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 58/144 (40%), Positives = 88/144 (61%), Gaps = 5/144 (3%)
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
LVF C + AG+ GLLG+GGG + GP+ LE+GI P V SATA F I F+SS + +++
Sbjct: 446 LVFPALCSM-AGVAAGLLGIGGGMVKGPIMLEMGILPPVQSATANFMILFTSSSTTLQFA 504
Query: 396 LLKRFP----VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
+ +FP Y +F + I F GQ V+ L+K R SI+++ L+ I +SAL++
Sbjct: 505 INGQFPGQLQYDYMAWFALMGCIGGFCGQKVVAYLVKKYRRESIMVYLLAVTIGLSALAM 564
Query: 452 GGVGLAKMIKRIEHKEYMGFDNIC 475
G +GL ++ IE ++GF+ IC
Sbjct: 565 GIIGLKSTLRDIEKGVHLGFNGIC 588
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 82/146 (56%), Gaps = 1/146 (0%)
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
F W+ +V TII TA GS GVGGGG+ VPM G K + +SK + G AVSA
Sbjct: 109 FTWKDIVATIIALVSTALGSGCGVGGGGLLVPMYIFFYGLSPKHAIPLSKATIFGNAVSA 168
Query: 134 VVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
+N ++HP +P+I+Y +A + +P ++G GV N +F DW+I VLL+ L +
Sbjct: 169 YFFNFNRKHPMNAKLPLINYQVAGVMEPTTLIGAIFGVMMNHMFPDWLILVLLVSLLSYI 228
Query: 193 STRAFLKGVETWKKETITKREAAKQL 218
+ + LKG +KE+ +R K +
Sbjct: 229 TYKTVLKGNTIREKESRYQRAVVKSV 254
>gi|384249010|gb|EIE22493.1| hypothetical protein COCSUDRAFT_47913 [Coccomyxa subellipsoidea
C-169]
Length = 628
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 105/216 (48%), Gaps = 4/216 (1%)
Query: 264 EFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS 323
+ LL ++A I+A K+ T + YW++ L +PV ++ + L + +
Sbjct: 409 KLALLFLMFACIVASDSFKDRTVCGTWQYWLVVLSVLPVILIITLFVRAYLVRDFNAKQA 468
Query: 324 KG---DEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 379
G EG +W + + A AG++ G+ G+GGG + GPL LE+G+ P V++AT+
Sbjct: 469 SGYVWTEGDVEWSRRNTLLFPALSSLAGLIAGMFGVGGGIVKGPLMLEMGVLPDVAAATS 528
Query: 380 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT 439
I F+++ + V Y P Y L F + +I VGQ L+K L R S+++
Sbjct: 529 ATMIMFTAASASVVYLSFGGIPFDYGLATFLVGLIFTMVGQVTCYWLMKALDRRSVVVIA 588
Query: 440 LSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
++ ++ +S + + + I ++ + F IC
Sbjct: 589 MALLMVISMVIIYYEAVVSTIAAVQDHRLLDFGTIC 624
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 102/196 (52%), Gaps = 9/196 (4%)
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
+K +WP WR VVG I + GG+GGGGI VP+ L++GF ++ A+S
Sbjct: 29 HKGLWP---LDWRDVVGYIFAAVSLFIAAGGGIGGGGILVPLYILVMGFPTNTAVALSNI 85
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
+ GGA+S V+N+ +RH + P+ID+++ L +P +LG +G N +WM T+L
Sbjct: 86 TIVGGAISNFVFNVGRRHAFFNRPLIDWNIILAMEPATILGALLGGYLNKATPNWMTTIL 145
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDP----EYLSND 240
L L +L+ + + V TW+KE + + AA G+ + P E L+
Sbjct: 146 LAALLTLLTYKLVDRAVVTWRKENLEFKRAAAGSSQDGSDPSEPLLRKGPQEQQEILNEA 205
Query: 241 TTPEETREPKKSKVSI 256
PE++ P++S I
Sbjct: 206 FAPEQS--PRQSTQPI 219
>gi|452823720|gb|EME30728.1| hypothetical protein Gasu_19670 [Galdieria sulphuraria]
Length = 711
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 92/178 (51%), Gaps = 6/178 (3%)
Query: 70 PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
P F WR + ++ F + GGVGGG +FVP+L L G+ A ++AA+S+ +VTG
Sbjct: 140 PLTDFTWRDGLTFVLVFIIAGLSNAGGVGGGFLFVPVLVLATGYRASTAAAISQALVTGA 199
Query: 130 AVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLF 189
+ + Y L +RHP + P IDY + + F P ++ G SIGV N +F ++ L L
Sbjct: 200 SGANTFYGLIRRHPKRERPRIDYGVVIHFIPSVLCGTSIGVLLNELFPNFFTLFALSALV 259
Query: 190 LVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETR 247
L + + KG+ WK+E +A K+ E ++ + + E ND +E +
Sbjct: 260 LYVFYVSLKKGISLWKQERKEAEDAKKKSE------TSQSQQTNSELTQNDQVAQEQQ 311
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/147 (22%), Positives = 71/147 (48%), Gaps = 1/147 (0%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSS 388
W +++++ I G +G +G GG F+ P+ + +G+ P V +TA F S
Sbjct: 562 HWTKLRVIYFSPLMIILGAIGAWVGAGGSFMSTPILVAGIGMDPVVVQSTAGFMNFTSGF 621
Query: 389 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSA 448
S ++Y + + Y L A +++ +F G ++L L+ +I++ +S ++F +
Sbjct: 622 SSALQYIFDHQMKIDYGLSLGATTLVGSFTGLYILNGLVARYNLQAILVIVMSIVMFGAF 681
Query: 449 LSLGGVGLAKMIKRIEHKEYMGFDNIC 475
G+ ++I ++ E+ +IC
Sbjct: 682 AVDLYAGVQELIGVLDLNEHFPIHSIC 708
>gi|294944269|ref|XP_002784171.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239897205|gb|EER15967.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 409
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/416 (22%), Positives = 176/416 (42%), Gaps = 69/416 (16%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP--VIDYD 153
G+GGG +FVP+L LI G K S A+S+ ++ +++A ++N +++ + P +I +
Sbjct: 27 GIGGGVLFVPVLRLIGGLQLKESTALSQALIASASLAATLFNCFEQYSARNDPKALIVWP 86
Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWKKETI 209
+L P V+G IGV W+ ++ +++L+ S A+ KG+ WK E
Sbjct: 87 FVILTLPCTVIGSLIGVYLY----SWLPSLFILILYFCFVCLGSFMAYRKGIRLWKAENG 142
Query: 210 TKREAAKQLELIGNGYQTEECKCDPEYLSNDTT---PEETREPKKSKVSIIENIYWKEFG 266
KR A +G T+ + S++ T P R P K++ +I
Sbjct: 143 AKRRAV-------DGDSTDMSR------SSEVTVEVPSLLRMPNMKKLAAYTSIA----- 184
Query: 267 LLVAVWAIILALQIAKNYTTTCSVLYWVLNLL----QIPVAGGVSAYEAVAL-------- 314
+WA+ L + T V++ + L Q VA S V
Sbjct: 185 --ALIWAVCLIFPLLTG-THPLQVVFDLSPSLCDEQQDDVAATPSGVIGVPFCEEAFWGL 241
Query: 315 --------------YKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 360
Y ++ A G S +V G++ ++G+ GG
Sbjct: 242 AALQALILLLIPTGYVVAKRTAETARVGLVLMTSMIVI--------GLISSIVGISGGLF 293
Query: 361 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQ 420
+ P+ L LG+ P+ ++AT I +S+ + + + L FP L+ + I A +G+
Sbjct: 294 MIPVVLSLGLDPKQATATTSIVIFATSTSTALSFALGGYFPPASDLWIVVMPFIGALLGK 353
Query: 421 HVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK-RIEHKEYMGFDNIC 475
++ +LI GR SI++ L ++ + ++ G+ ++ + E + F + C
Sbjct: 354 TIVARLIAKTGRMSILVLLLGTVVIIGGITTISTGIVSVVNGALNGDEVVQFGSFC 409
>gi|298710140|emb|CBJ31852.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 378
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 66/107 (61%), Gaps = 2/107 (1%)
Query: 103 FVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLL 162
VP+ L++GFD K + +S + GGA++ V NL +RHP D P++D+DL L+ +PL
Sbjct: 146 LVPLYILVLGFDPKHAIPLSNITIFGGAITNTVLNLSKRHPAADRPLVDWDLILVMEPLT 205
Query: 163 VLGISIGVAFNVIFADWMITVLLIVL-FLVLSTRAFLKGVETWKKET 208
+ G +G N + DW++ ++LIVL + R KG++++ KET
Sbjct: 206 IGGALVGSFINKVLPDWILAIMLIVLL-AATANRTLRKGIKSYNKET 251
>gi|326433498|gb|EGD79068.1| hypothetical protein PTSG_02036 [Salpingoeca sp. ATCC 50818]
Length = 607
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 75/132 (56%)
Query: 344 ITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVP 403
+ AGM+ G LG+GGG +L PL L+ + P VSSAT + F+S+ S ++ +L R P
Sbjct: 475 LVAGMLAGFLGIGGGMVLAPLLLQFNMHPLVSSATTAYMTLFTSAGSFTQFVILNRVPYD 534
Query: 404 YALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRI 463
Y + F L+ A+ VGQ +L ++ G +S+I F L F+I ++A+ L G ++
Sbjct: 535 YGIALFLLAAAASVVGQILLHSYVRRTGNSSVIAFILGFVIGLAAIMLLVSGSLQLKAAH 594
Query: 464 EHKEYMGFDNIC 475
+ E GF +C
Sbjct: 595 DRGESFGFKPLC 606
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 66/138 (47%), Gaps = 15/138 (10%)
Query: 121 VSKCMVTGGAVSAVVYNLRQRHPTLDIP------VIDYDLALLFQPLLVLGISIGVAFNV 174
+S + G ++ + +R+RHP +P VID+D L+ PL + G +GV FN
Sbjct: 126 LSSTAIVGASIVQFFFQIRRRHP---LPGAQHRRVIDFDTILMLLPLALAGTVVGVIFNT 182
Query: 175 IFADWMITVLLIVLFLVLSTRAFLKGVETWKKE----TITKREAAKQLELIGNGYQTEEC 230
+ DW++ V++I++ + + + +KG E ++E + +R K N EE
Sbjct: 183 VSPDWLVLVVVIIVLVFTTFKTLVKGRELRRQEQEARALPRRSIVKHGINDPNSVNGEED 242
Query: 231 KCDPEYLSNDTTPEETRE 248
E + DT E+ +
Sbjct: 243 SGTTETI--DTVGEDAKR 258
>gi|449018085|dbj|BAM81487.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 607
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 87/163 (53%), Gaps = 3/163 (1%)
Query: 51 NIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLI 110
NI + A +R +K P +F WR + + F S GGVGGG +FVP+L L+
Sbjct: 108 NIQLDCDANSNRCVHK---PLSRFTWRDGLTFGLIFVIAGLSSTGGVGGGFLFVPVLVLL 164
Query: 111 VGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGV 170
GF A+ +AA+S+ +VTGG+ + Y L RHP + P IDY + +F ++ G S+GV
Sbjct: 165 TGFQARRAAALSQALVTGGSGANAFYGLITRHPFRERPRIDYYVVTVFMATILCGTSVGV 224
Query: 171 AFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
N++F ++ +L VL + + K ++ WK E R
Sbjct: 225 ILNILFPNFFTLFMLAVLVAYVFYISIKKAIQLWKDERAASRN 267
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 33/151 (21%), Positives = 70/151 (46%), Gaps = 1/151 (0%)
Query: 326 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAIT 384
+ +W +++++ I G VG +G GG F+ P+ + +G+ P V ATA F
Sbjct: 456 ERDMEWTRNRILWIAPLMIFLGAVGAWVGAGGSFMSTPILIAGVGMDPLVVQATAGFMNF 515
Query: 385 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 444
++ S ++YY+ P+ Y L + + + +L+ +I++F ++ ++
Sbjct: 516 VAAFSSAIQYYVNHELPLDYGLALGGTAFLGSLSFVVFFNRLVYKFKLQAILVFIMAGVM 575
Query: 445 FVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
F +A+ G ++ + + F NIC
Sbjct: 576 FGAAVLNIYAGALELKTTLNQGKPFPFGNIC 606
>gi|118348628|ref|XP_001007789.1| hypothetical protein TTHERM_00069510 [Tetrahymena thermophila]
gi|89289556|gb|EAR87544.1| hypothetical protein TTHERM_00069510 [Tetrahymena thermophila
SB210]
Length = 549
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 95/171 (55%), Gaps = 7/171 (4%)
Query: 65 YKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKC 124
Y +WP F +++V +I F +VGG+GGG I VP++ +++ + +K + +S C
Sbjct: 25 YNDLWPPSAF--QVIVYILIPFI-IGVSNVGGLGGGIIKVPLITVMLNYPSKVATFISYC 81
Query: 125 MVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVL 184
++ G +V + ++HP + P+IDY++ L+ P+++LG + G+ NV+ + + V+
Sbjct: 82 ILFGSSVVHSTIIIFKKHPLFNKPIIDYNIVLVINPMVLLGTNAGILLNVLMPEIVAGVI 141
Query: 185 LIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPE 235
+ V +++ K + +K ITK++ +Q+E +T E K + E
Sbjct: 142 ICVYLSLIAPYILFKAISLYK---ITKKQ-QQQIEPEAKALETVERKNEGE 188
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 106/251 (42%), Gaps = 41/251 (16%)
Query: 208 TITKREAAKQLE------LIGNGYQTEECKCDPE-----YLSNDTTPEETREPKKSKVSI 256
+I K E+A QLE + + Y +E D +S ++ E EPK+ ++
Sbjct: 247 SIQKMESAAQLEDQLLMQRVNSYYVPQEPPKDKTKRGSIIVSQNSLKETPEEPKQEQIMT 306
Query: 257 IE--NIYWKEFG--------LLVAVWAIILALQIAKNYT--------TTCSVLYWVLN-- 296
E Y +E+ LLV ++ II + + TCS YWV N
Sbjct: 307 EELKAFYDEEYQQFPKKKILLLVIIFCIIQYIVFQRGGKGLQSFVGIKTCSASYWVSNGA 366
Query: 297 LLQIPVAG-GVSAYEAVALYKGRRKIASK--------GDEGTKWRASQLVFYCACGITAG 347
+L + VA V Y + K + +I K GD + V A G+ AG
Sbjct: 367 ILVLCVAAIFVIRYYLLKWTKNKNEIIKKYNLQKEFEGDFNVLNKTHYFVVLLA-GLAAG 425
Query: 348 MVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALY 407
+V G +G+G G L PL L +G+ PQV +AT F F ++ ++++ + +
Sbjct: 426 LVAGTVGVGAGLTLVPLLLSIGVHPQVVAATCGFNYLFIATTTIIQVFTSHYLSYAQIVL 485
Query: 408 FFALSIIAAFV 418
F LS + F+
Sbjct: 486 FSLLSFVGGFI 496
>gi|348690321|gb|EGZ30135.1| hypothetical protein PHYSODRAFT_353731 [Phytophthora sojae]
Length = 449
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 103/194 (53%), Gaps = 12/194 (6%)
Query: 292 YWVLNLLQIPVAGGVSAYEAVALYK-GRRKIAS-----KGDEGTKWRASQLVFYCACGIT 345
YW+L L + + G++ + L R K+ +GD W +++ +
Sbjct: 255 YWILTFLPLLILVGITLWVGYHLRLLNRSKVLCDYPFVEGD--IHWIKRRILVFPTLCTM 312
Query: 346 AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP---- 401
AG+ GLLG+GGG + GP+ LE+GI P V SATA F I F+SS + +++ + +FP
Sbjct: 313 AGVAAGLLGIGGGMVKGPIMLEMGILPPVQSATANFMILFTSSSTTLQFAINGQFPGERQ 372
Query: 402 VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK 461
Y +F + I GQ V+ L+K R SI+++ L+ I +SAL++G +GL +
Sbjct: 373 YDYIAWFALIGCIGGLCGQKVVAYLVKKYKRESIMVYLLAVTIGLSALAMGFIGLKSTLS 432
Query: 462 RIEHKEYMGFDNIC 475
IE ++GF IC
Sbjct: 433 DIEKGVHLGFHGIC 446
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 64/111 (57%), Gaps = 7/111 (6%)
Query: 112 GFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT-LDIPVIDYDLALLFQPLLVLGISIGV 170
G K + +SK + G AVSA ++N ++HPT +P+I+Y +A + +P ++G GV
Sbjct: 7 GLSPKHAIPLSKATIFGNAVSAYLFNFNRKHPTNAKLPLINYQVAGIMEPTTLIGAIFGV 66
Query: 171 AFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
N +F DW+I VLL+ L ++ + LKG I+++E+ QL ++
Sbjct: 67 MMNHMFPDWLILVLLVSLLSYITYKTILKG------NKISEKESKHQLSVV 111
>gi|403353250|gb|EJY76161.1| hypothetical protein OXYTRI_02333 [Oxytricha trifallax]
Length = 548
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 88/175 (50%), Gaps = 12/175 (6%)
Query: 84 IGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP 143
I T C +V G+GGGG VP+L F+AK + A+S + AV+ ++N RQRHP
Sbjct: 70 ISILMTLC-TVAGIGGGGDLVPLLMSFFEFEAKEAVAISGFAIFLCAVTRYIFNFRQRHP 128
Query: 144 TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVET 203
+ D IDY LA + P + +G +GV N++F + + L +L L K V
Sbjct: 129 SKDSVQIDYGLASVMMPTVFIGSFLGVLINIVFPVLYLQIALTLLLCFLFYECTKKAVVI 188
Query: 204 WKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEE------TREPKKS 252
++KE ++ Q++ G + +E + Y +N+ E+ T++ KK+
Sbjct: 189 FRKENQAQQPTLSQVQ----GIKVDENQI-TSYQNNENINEKLLTKSFTKDSKKN 238
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 93/195 (47%), Gaps = 7/195 (3%)
Query: 287 TCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEG-----TKWRASQLVFYCA 341
TCSV W+ I +S AV K ++++ K G K+ ++
Sbjct: 355 TCSVWDWLCFAFFIVFCFFMSMV-AVNNLKKQQQLKMKSGRGLHPSDIKFEGKKVTKLIL 413
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G+V G G+GGG I PL L +G PP V+S+T ++ + FS+ S + Y +
Sbjct: 414 VSSIGGLVSGAFGIGGGTIYNPLLLSMGAPPSVASSTGMYMVMFSNFGSSITYIVYGTLN 473
Query: 402 VPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK 461
V + + ++ +G +L K++K L R S I+ L+ ++ +SAL + G M+K
Sbjct: 474 VKFGFWIGGFCCASSILGLFLLNKIVKKLNRQSPIVIILAVVMGISALLVPIFGSIDMVK 533
Query: 462 RIEHKEYM-GFDNIC 475
++ + + F ++C
Sbjct: 534 QVRDGQSITQFSSLC 548
>gi|302834698|ref|XP_002948911.1| hypothetical protein VOLCADRAFT_89258 [Volvox carteri f.
nagariensis]
gi|300265656|gb|EFJ49846.1| hypothetical protein VOLCADRAFT_89258 [Volvox carteri f.
nagariensis]
Length = 1005
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 70/108 (64%), Gaps = 1/108 (0%)
Query: 93 SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT-LDIPVID 151
S GVGGG IF+P+ N +VGF+ KSS A+S+ +T GA + ++ NL HP+ + ++D
Sbjct: 32 SSSGVGGGAIFIPLFNALVGFNLKSSTALSQACITFGAAATLLANLMHHHPSDPTVALVD 91
Query: 152 YDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLK 199
+ L + P+L++G+ IGV NV+ W++T LL++L + L ++ K
Sbjct: 92 FTLVAVLTPMLLMGVGIGVLLNVMLPSWLLTTLLLLLLIALVAQSLSK 139
>gi|40253413|dbj|BAD05342.1| unknown protein [Oryza sativa Japonica Group]
gi|40253789|dbj|BAD05727.1| unknown protein [Oryza sativa Japonica Group]
Length = 132
Score = 78.6 bits (192), Expect = 6e-12, Method: Composition-based stats.
Identities = 46/65 (70%), Positives = 56/65 (86%)
Query: 63 SGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVS 122
+ Y HVWP ++FGW+IV+G +IGFFG A GSVGGVGGGGIFVPML LI+GFDAKSS A+S
Sbjct: 55 ASYHHVWPPMEFGWKIVLGILIGFFGAAFGSVGGVGGGGIFVPMLTLIIGFDAKSSTAIS 114
Query: 123 KCMVT 127
K +V+
Sbjct: 115 KFIVS 119
>gi|307108116|gb|EFN56357.1| hypothetical protein CHLNCDRAFT_144844 [Chlorella variabilis]
Length = 670
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 70/125 (56%)
Query: 103 FVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLL 162
VP+ +++GF + A+S + GG ++ + +N+++RHP P+ID+DL ++ +P
Sbjct: 149 LVPLNLMVLGFGTAPAVALSNITIVGGTLANLAFNVQRRHPARHGPLIDWDLIMVMEPST 208
Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIG 222
+LG +G N I W+ TV L VL ++S + F KG E +E + + ++L ++
Sbjct: 209 ILGALVGGYLNKILPGWLTTVSLSVLLALISYKTFNKGREIHARERDSLHHSLQRLSVVN 268
Query: 223 NGYQT 227
+ T
Sbjct: 269 SADYT 273
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 97/214 (45%), Gaps = 15/214 (7%)
Query: 279 QIAKNYTTTCSVLYWVLNLLQIPVA-GGVSAYEAVALYKGRRKIASKGD--EGT-KWRAS 334
+ K+ S+ YW+ +PV +SA L KG + AS + EG +W
Sbjct: 453 DVLKSQAACPSLPYWLAATAMVPVTLATLSAVRTYLLKKGAAQRASGHELLEGDVEWTPG 512
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFI-----------LGPLFLELGIPPQVSSATALFAI 383
+ Y A AG+ G+ G+GGG I GPL LE+G+ P+V++AT+ I
Sbjct: 513 STLLYPALCSFAGLAAGIFGVGGGIIKARWLAVRENPAGPLMLEMGVLPEVAAATSATMI 572
Query: 384 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFI 443
F+S+ + V + YA F + +I GQ ++ + + + S+++F ++ +
Sbjct: 573 FFTSAFASVVFISFGAVQWDYAAMLFTMGLICTAAGQLLVLWVNQHIKSRSLLVFVMATV 632
Query: 444 IFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 477
+ VS+++L G + F IC T
Sbjct: 633 LSVSSVALAVQGAQSTAAAAAAHDLWHFHGICGT 666
>gi|307109812|gb|EFN58049.1| hypothetical protein CHLNCDRAFT_50737 [Chlorella variabilis]
Length = 752
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 77/125 (61%), Gaps = 1/125 (0%)
Query: 89 TACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDI 147
+A + GVGGG +FVP+ ++++ K++ A+S+ ++TGGA+ +V ++L ++HP
Sbjct: 339 SALSNAAGVGGGAVFVPLFSVLLSISIKAATALSQAVITGGAIGSVAFSLTRQHPLRPGA 398
Query: 148 PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE 207
P+ID+ LAL P L+LG++ GV N+ W++T+LLI L + + R G+ + E
Sbjct: 399 PLIDFSLALTLLPPLLLGVATGVLLNLALPAWLVTILLIPLLITFAIRTAATGLSMRRAE 458
Query: 208 TITKR 212
++
Sbjct: 459 KQAQQ 463
>gi|118395416|ref|XP_001030058.1| hypothetical protein TTHERM_01164150 [Tetrahymena thermophila]
gi|89284345|gb|EAR82395.1| hypothetical protein TTHERM_01164150 [Tetrahymena thermophila
SB210]
Length = 503
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/359 (23%), Positives = 159/359 (44%), Gaps = 55/359 (15%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
I VP+L G++ K S A+ + ++ ++ ++Q+ PVIDY + +L P
Sbjct: 135 IVVPVLMSFFGYETKKSIALVFITIFSASLGNLMSFMKQKSKDGG-PVIDYRIVVLSLPT 193
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
+++G GVA N F ++ + F++ S +T+K KRE AK+++
Sbjct: 194 IMVGSIYGVALNK-FIPQIVLAFALAFFILQSL------TKTYKS---YKREKAKEVQEN 243
Query: 222 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEF---GLLVAVWAIILAL 278
N ++++ P Y E ++P ++ + I KE LL ++ I L
Sbjct: 244 QNNNKSDQSS--PLY--------ELKQPNENGLPPISQSSKKEQYPKSLLSKIFCITLGF 293
Query: 279 QI------AKNYTT-----TCSVLYWVLNLLQIPVAGGVSAYEAVALYKG-------RRK 320
+ + + C LY + NL SAY A L KG + K
Sbjct: 294 AVFSLLRGGSKFDSLLGIPPCGFLYQISNL--------ASAYVAYLLVKGIIAGLVIQNK 345
Query: 321 IASK-----GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 375
I K + T+ A ++ + AG++G GLGGG +L P +LE GIP +
Sbjct: 346 IEEKLVVNTSSDDTQLTAQEMQGFAVTAFLAGLIGSTFGLGGGMVLVPKWLEQGIPSYKT 405
Query: 376 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRAS 434
+ ++ + ++ S +++ L + +YF ++++++F+ +++ +K +AS
Sbjct: 406 TPCSISLLFLTAFNSAIQFALGGVYETEEIIYFSVIALLSSFIVSSSIQQYVKKTNQAS 464
>gi|294944335|ref|XP_002784204.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239897238|gb|EER16000.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 354
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 83/356 (23%), Positives = 155/356 (43%), Gaps = 55/356 (15%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL------RQRHPTLDIPV 149
GVGGG +FVP+LNL+ +K++ A+S+ ++ V+ V+++L R+R +++P
Sbjct: 27 GVGGGILFVPLLNLVGRLPSKNATAMSQGLIMSATVAKVLFSLHAQYTSRRRGRVINLPY 86
Query: 150 IDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWK 205
+ +L P +++G IG+ W+ ++ ++L+++ S + LKG W+
Sbjct: 87 V-----VLMLPCMIVGGLIGIYIY----SWLPVLIQLILYVITALFGSAMSLLKGFHLWR 137
Query: 206 KETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSK-VSIIENIYWKE 264
ET K +A + E+ G K P R + K V+ + I+
Sbjct: 138 AETSEKEKAKRDAEVSAGG----TLKVPASSTVETVVPPLLRPITRRKAVACVITIF--- 190
Query: 265 FGLLVAVWAIILALQIAKNYTTT--------CSVLYWV-LNLLQIPVAGGVSAYEAVALY 315
AVW +++ ++ ++T C LYW ++ + + AY +
Sbjct: 191 -----AVWILVILSRLILGSSSTRSIVGISYCEGLYWALSVVVVVVLLMVPLAYALIDRS 245
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 375
G K A+ + GI G + ++G+ GG I+ PL + G+ P +
Sbjct: 246 PGSSK------------AALTLSGSLLGI--GFLAAVVGISGGIIITPLVMFTGLTPPQA 291
Query: 376 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 431
S T I +SS + + L P AL+ AL A G +L ++++ G
Sbjct: 292 SGTGSVVILVNSSSLALSFGLGGYLPDASALWIIALPFCGALTGDIILTRIMRRTG 347
>gi|326433239|gb|EGD78809.1| hypothetical protein PTSG_01784 [Salpingoeca sp. ATCC 50818]
Length = 589
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 97/210 (46%), Gaps = 12/210 (5%)
Query: 258 ENIYWKEFGLLVAVW----AIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVA 313
+ W++ LL+ VW I + L A + CS + VL L IP ++ +
Sbjct: 358 RTVPWEKLVLLILVWLGYTTITMLLYEANDVIKPCSAGWIVLLLCAIPYVIAITYFAGRM 417
Query: 314 LYKG--RRKIAS----KGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE 367
L + R++ + GD W + L + A AG+ ++G+GGG I P+ L
Sbjct: 418 LKRQTVRKRKCNYPFLPGD--VMWEGANLNKFPALAFFAGVAAAMMGIGGGMIKSPIMLA 475
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 427
+G+ PQV + T+ F I F+SS + ++Y +L + ++ A VGQ V+ +I
Sbjct: 476 MGLQPQVVTTTSSFMIIFTSSATTLQYLILGKLKPQQLGIVMSMGFAGAVVGQRVVNYII 535
Query: 428 KILGRASIIIFTLSFIIFVSALSLGGVGLA 457
+ S +IF L + VS + + LA
Sbjct: 536 AKYKKQSFLIFLLGGLTIVSGIIIVATSLA 565
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 92/171 (53%), Gaps = 7/171 (4%)
Query: 50 PNIVIKVSAPKSRSGYK----HVWPDIK--FGWRIVVGTIIGFFGTACGSVGGVGGGGIF 103
PN + P + + HV IK W+ V +I F + G+GGG +F
Sbjct: 30 PNTSSTQTTPHRITSHHISSHHVTALIKMVLAWQWAVASICVFLCASLAVGAGIGGGALF 89
Query: 104 VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLL 162
V + +I+G DA ++ +SK + G A++A NL +RHP + P+IDYD AL+ +P+
Sbjct: 90 VGIYMIILGMDAHAAVPLSKATIFGLAIAAYSVNLWKRHPHSQQRPLIDYDTALMLEPMT 149
Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
+LG +GV NV+F +W++ + L +L +V+S R KG+ KE T +
Sbjct: 150 LLGAIVGVLLNVLFPNWLVLLPLCLLLMVVSYRTIRKGLRLRAKEKGTPHQ 200
>gi|428174009|gb|EKX42907.1| hypothetical protein GUITHDRAFT_110958 [Guillardia theta CCMP2712]
Length = 197
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/127 (40%), Positives = 75/127 (59%), Gaps = 4/127 (3%)
Query: 86 FFGTACGSV----GGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQR 141
FF T C S GG+GGGGI VP+L + GF + +++ V G ++ ++YN+R+R
Sbjct: 17 FFLTFCASALGAAGGIGGGGIMVPLLVSVGGFSVHHAIPLTQATVLGASIMNLIYNVRKR 76
Query: 142 HPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGV 201
+P LD P+IDY+ AL+ + +LG IGV N I W+IT+LLIV + R KG+
Sbjct: 77 NPVLDRPLIDYNTALILEVTTLLGTVIGVDVNKISPVWLITILLIVTLGYTTYRTLRKGL 136
Query: 202 ETWKKET 208
E ET
Sbjct: 137 ELRAIET 143
>gi|301118272|ref|XP_002906864.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108213|gb|EEY66265.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 439
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 86/447 (19%), Positives = 182/447 (40%), Gaps = 55/447 (12%)
Query: 22 LVVASVSVSAQSNLK--PEATSKNKNDHVEPNIVIK-VSAPKSRSGYKHVWPDIKFGWRI 78
LV+A V + +K P T + + EP V++ V P
Sbjct: 20 LVLAQRPVGEDNPVKFGPVPTFPPQGEQYEPRGVLRSVEKPHQ----------------- 62
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++G ++ S +GGG + + L++ + ++ V GGAV NL
Sbjct: 63 LLGILLAALVVLISSGAKIGGGAVLDAVYILVLKLGPDEAIPLASITVFGGAVCDFFLNL 122
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
++ + P+I++D L+ QP+L++G + G + F+ W++T+ LIV + + +AF
Sbjct: 123 WKKPINSNFPLINWDFMLIMQPMLLMGAAFGASMISWFSTWLLTIALIVYLVYVGKKAFK 182
Query: 199 KGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIE 258
K +E +++ + L+G P + +
Sbjct: 183 KARAVGHEEGWRWCSSSETMSLLG-------------------APSMSFQDDDGSFQYKS 223
Query: 259 NIYWKEFGLLVAVWA---IILALQIAKNYTT-----TCSVLYWVLNLLQIPVAGGVSAY- 309
+ W++ G+ ++ ++ ALQ + + + S + ++++L VS Y
Sbjct: 224 GLSWRKLGINFGIFTATVLLTALQGGRYFPSPLGIPPTSFFFLIVSMLPFIFLSVVSHYQ 283
Query: 310 --EAVALYKGR---RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 364
+AVA Y+ + R I + + +W + + G + G G+GG L
Sbjct: 284 MKDAVATYQRQQNPRFILAPNE--VQWSPDAIRKIPLRLLGIGAIAGAFGVGGEGATSSL 341
Query: 365 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
+ P SA + A+ F S M+ ++ L + + A + L + +G+ L
Sbjct: 342 LRGVNFTPAAVSAMSATAVFFVSGMASFDFLLWGKLDLNLAKFLMPLGFMMTLLGRLCLI 401
Query: 425 KLIKILGRASIIIFTLSFIIFVSALSL 451
++++ ++++F ++ +F+S L
Sbjct: 402 RIVRKAKSRTLLLFAIAAAMFISIFPL 428
>gi|167525260|ref|XP_001746965.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774745|gb|EDQ88372.1| predicted protein [Monosiga brevicollis MX1]
Length = 499
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 89/185 (48%), Gaps = 14/185 (7%)
Query: 86 FFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-T 144
F G A G+GGG FV + L++G DA + +SK + G +++A N+ +RHP
Sbjct: 15 FLGAALNVGAGIGGGATFVAIYFLVLGEDAHGAVPLSKATIFGLSLAAFAVNIWKRHPHD 74
Query: 145 LDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETW 204
P+IDYD A++ +P+ +LG +GV NVIF +W++ + L +L +S + K
Sbjct: 75 PKRPLIDYDTAMMLEPMTLLGGILGVILNVIFPNWLVLLPLCLLLGFISYKTLKKAWNMH 134
Query: 205 KKETITK----------REAAKQLELIGNGYQTEECKCD---PEYLSNDTTPEETREPKK 251
KKE + ++ K+ + + + +E E L + E P
Sbjct: 135 KKELAARIGADANKPALEDSDKEQLMPADDAEDDEASTGLVMAEQLEKEDPSLELTMPSD 194
Query: 252 SKVSI 256
++ SI
Sbjct: 195 ARRSI 199
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 62/227 (27%), Positives = 98/227 (43%), Gaps = 37/227 (16%)
Query: 262 WKEFGLLVAVW----AIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKG 317
W +L AVW + L A++ C + VL L IP GV+ A L+
Sbjct: 259 WDRILMLGAVWIGYFVLTFLLYRAEDAVPRCEAGWVVLLLCSIPYVVGVTYLFARRLH-- 316
Query: 318 RRKIASK--------GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 369
R+ I K GD W L +Y AG+ ++G+GGG I PL L +G
Sbjct: 317 RQTIQKKAVGYVFHPGD--VMWEGRNLYYYPEMAFFAGLCASMMGIGGGLIKSPLMLSMG 374
Query: 370 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII--------------A 415
+ PQ ++ T+ F I F+SS S + +L P A ++ L +
Sbjct: 375 LNPQTTTTTSSFMIIFTSSASTFQVWLRALVSTPKAPHYLILGKLHGAELAAVMASGFAG 434
Query: 416 AFVGQHVLKKLIKILGRASIIIFTLS-------FIIFVSALSLGGVG 455
A +GQ V+ L++ + S+++F L I+F A++ G +G
Sbjct: 435 ALMGQKVVNHLVQKYQKQSLLMFLLGGLTVLSVVILFSLAIADGKIG 481
>gi|154342935|ref|XP_001567413.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064745|emb|CAM42850.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 510
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 101/394 (25%), Positives = 181/394 (45%), Gaps = 29/394 (7%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP--TLDIPVIDYDLALLFQPLL 162
P+ I+ + + +S+ + G + ++ ++Q+HP + D P+I+Y L PL
Sbjct: 120 PLFCGIMKIPMQFAVGMSQATICGQSTVSMYLIVQQKHPDSSWDRPLINYQYLSLILPLG 179
Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLST---RAFLKGVETWKKET--------ITK 211
++G IG + +F D + L++LF++LST R +K V+ +K++T
Sbjct: 180 LIGTLIGGILSKLFPD---VLRLLLLFVILSTALYRTLVKVVKQYKEDTNAWQATVEADG 236
Query: 212 REAAKQLELIGN-----GYQ--TEECKCDPEYLSNDTTPEETREPKKSKVSIIENIY-WK 263
AA Q GN GY+ E L T + + + ++ Y +
Sbjct: 237 ANAASQQGNYGNNDERQGYELINRAVDAKSESLQGTTVLATSAHSLPAFENPPQSQYPQQ 296
Query: 264 EFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIAS 323
E + + ++L I + Y +LYW+ L IP+ +S + K R+ + S
Sbjct: 297 ELAMNFTCFLVLLLFNILRMYLVCGGLLYWLCVL--IPLV-FLSTVFYLNYEKLRKLVES 353
Query: 324 KGDEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFA 382
+ T W V Y + AG +LG+GGG +LG + E+G+ PQ +SAT A
Sbjct: 354 DSAQVTFAWTQKNTVMYPMVAVIAGASAAMLGIGGGLVLGFVLNEVGLVPQQASATGGMA 413
Query: 383 ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSF 442
F + SV+E + V + + F + + + +GQ V + IK G +II +L+F
Sbjct: 414 TLFIAFSSVLELLVTGHLVVDFGIVFCIVGLCSGSLGQFVFMEYIKGHGLNYLIIGSLAF 473
Query: 443 IIFVSALSLGGVGLAKMIKRIEH-KEYMGFDNIC 475
++ S +SLGG G+ + + + M F ++C
Sbjct: 474 VLGGSLVSLGGYGIYTTVISVHAGRSLMAFGHLC 507
>gi|301118274|ref|XP_002906865.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108214|gb|EEY66266.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 260
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 115/237 (48%), Gaps = 35/237 (14%)
Query: 229 ECKCDPEYLS----NDTTPEETREPKKSKVSIIE--NIYWKEFGLLVAVWAIILALQI-- 280
+C PE + ++ + + +++ + ++ NI W++ L +++ ++ +++
Sbjct: 35 QCGVSPEATTLLGIASSSADTLKAHREADLKVVSRANIPWRKLATLASLFVVVAGMRVLR 94
Query: 281 -AKNYTT-----TCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRAS 334
KN+ + + S LY VL +GG YE A KW S
Sbjct: 95 GGKNFDSPVGIDSSSTLYPVLQ-----QSGG---YELEA-------------HEIKWTPS 133
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+ ++ C + AG V G+ G+GGG I GPL LE+G+ P +SA + FSS MS + Y
Sbjct: 134 SIRYFPMCSLAAGAVSGMFGIGGGIINGPLLLEVGVDPSAASAMTATTVLFSSGMSSLNY 193
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
+ + + A + ++ ++G L KL++ S+IIF+++ I+ +SA+++
Sbjct: 194 AAMGKMDLHLAQLMLPMGLVTTYIGHLCLLKLVRHYNCPSMIIFSMATIVLISAVAM 250
>gi|221481168|gb|EEE19573.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 299
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 61/100 (61%)
Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
+ L++ F+ + A S+C++TG +++ ++ N +R+P +D+P+ID D+ LL P+ + G
Sbjct: 1 MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60
Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
SIGV N + W+I VLL+V L + R + + K+
Sbjct: 61 SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100
>gi|237844563|ref|XP_002371579.1| hypothetical protein TGME49_020560 [Toxoplasma gondii ME49]
gi|211969243|gb|EEB04439.1| hypothetical protein TGME49_020560 [Toxoplasma gondii ME49]
Length = 299
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 61/100 (61%)
Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
+ L++ F+ + A S+C++TG +++ ++ N +R+P +D+P+ID D+ LL P+ + G
Sbjct: 1 MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60
Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
SIGV N + W+I VLL+V L + R + + K+
Sbjct: 61 SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100
>gi|221501858|gb|EEE27611.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 299
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 61/100 (61%)
Query: 107 LNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGI 166
+ L++ F+ + A S+C++TG +++ ++ N +R+P +D+P+ID D+ LL P+ + G
Sbjct: 1 MQLLMRFNTFEATATSQCLMTGSSLAGLILNFIRRNPVVDMPLIDMDMVLLLGPMQMCGS 60
Query: 167 SIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK 206
SIGV N + W+I VLL+V L + R + + K+
Sbjct: 61 SIGVIVNRVLPAWLIIVLLVVCLLYETIRLMRRLCDQPKR 100
>gi|384252265|gb|EIE25741.1| hypothetical protein COCSUDRAFT_64812 [Coccomyxa subellipsoidea
C-169]
Length = 553
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 75/114 (65%), Gaps = 1/114 (0%)
Query: 103 FVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPL 161
FVP+ N+++ F K +AA+S+ ++ GGA++ V L ++HP P+ID+DLAL+ P+
Sbjct: 91 FVPLFNILLQFSVKGAAALSQAVIAGGALAGVGVTLHKKHPYDPSKPLIDFDLALMLLPV 150
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAA 215
++LG+S+GV N +F +W+ITVLL++L + L+ K + + E + K E A
Sbjct: 151 ILLGVSVGVLANQLFPNWLITVLLLLLLIFLTHMTVKKALSLHRAEVLYKAEQA 204
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 118/260 (45%), Gaps = 10/260 (3%)
Query: 222 GNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIA 281
G +T+ + ++S D + ++ + + W + +L+ W I + Q+
Sbjct: 284 GARARTQIVSTEVGHMSGDEGEHDEDRQQEHRSVRSQPATWLQVVVLLGCWGIFVTFQLL 343
Query: 282 KNYTTTCSVLYWVLNLLQ--IPVAGGVSAYEAVALYKGRRK--------IASKGDEGTKW 331
+ CS YW + +Q + + V+ V+ K R + +AS E W
Sbjct: 344 LSRWPHCSGPYWAIFAVQAGLCLIAEVAFLRLVSARKQRSQESRLEQPMLASVYKEAPAW 403
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
+L+ + AG + GLLG+GGG I+ PL LE G P V++AT+ + FSSS +
Sbjct: 404 TLPRLIRSAIITLLAGFIAGLLGIGGGMIVNPLLLEFGTHPHVAAATSTLMVLFSSSSAA 463
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
+ + +AL F + A+ +G ++ ++++ G ASII+F L+ +I A
Sbjct: 464 LSFGFSHLLNAQFALVFGLCCMAASLIGVLIVSRIVERSGNASIIVFLLALVIATGATLT 523
Query: 452 GGVGLAKMIKRIEHKEYMGF 471
G ++ + H +GF
Sbjct: 524 AAFGGRFAVQDLIHHRSIGF 543
>gi|299115466|emb|CBN75630.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 236
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 103 FVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLL 162
VP+ L++ F K + +S + GGA++ N+++RHP D P++D+DL L+ +PL
Sbjct: 38 LVPIFILVLRFGPKYAVPLSNITIFGGAITNTFLNMKKRHPLADRPLVDWDLILVMEPLT 97
Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
+ G +G + + ++T+ +++L + + R F KG++ +KKE+ ++ AK+L
Sbjct: 98 IGGALVGSFIQKVLPEVVLTLSMVLLLVATADRTFRKGLKAFKKESSLQQ--AKEL 151
>gi|403372026|gb|EJY85900.1| hypothetical protein OXYTRI_16112 [Oxytricha trifallax]
Length = 499
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 97/432 (22%), Positives = 172/432 (39%), Gaps = 62/432 (14%)
Query: 76 WRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVV 135
W ++ T + TA G GGV + PM GF + + A+S + V+
Sbjct: 43 WGLIAITFLMLLCTAGGIGGGV----VVAPMCIAFFGFGTREAVALSGFCILICQVTKWA 98
Query: 136 YNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTR 195
Y+ Q+HP D IDY LA + P ++LG IG ++ ++ +LL L L+ +
Sbjct: 99 YSWSQKHPEKDTVSIDYGLATVMLPTVLLGSFIGTFVTILVPPIVLQILLTALLTFLTVQ 158
Query: 196 AFLKGVETWKKET----------------------ITKREA--------AKQLELIGNGY 225
+ LK E ++KE R++ K+L +
Sbjct: 159 SGLKAKEIYEKENAKIKKLKEAEEAKAAAEADKMAKANRQSVQVVDTVDGKRLSINRGSL 218
Query: 226 QTEECKCDPEYLSNDTTPEETRE----------PKKSKVSII------ENIYW---KEFG 266
+ ++ + NDTT + + + P + ++ + E +W K
Sbjct: 219 REKKPTIASKVHDNDTTDDPSAKLLHGGDYSEGPTQEEIDKVDAMLRREKTHWQWDKHLT 278
Query: 267 LLVAVWA-----IILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVS--AYEAVALYKGRR 319
LV + + II + K+ C V W+L L + VS A + + + +
Sbjct: 279 CLVVLVSQVIINIIRGSKSTKSVVEHCGVADWILVGLYAAICITVSVIAIKRIIAEQALK 338
Query: 320 KIASKG--DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 377
KG ++ + G G LGL GG I PL L G+PP V+S+
Sbjct: 339 TKVGKGLIPSDIRFNKQTVRSVVITAFVGGWASGCLGLSGGAIFNPLLLNQGVPPSVASS 398
Query: 378 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 437
T ++ I FS+ + V Y + + + +A+ G + L K++K GR S ++
Sbjct: 399 TGMYMILFSTIGTCVVYSIQGSLNFAFGGWIGGWCCLASVGGMYALDKVVKKFGRQSPLV 458
Query: 438 FTLSFIIFVSAL 449
L+ ++ +S +
Sbjct: 459 VVLTGVLALSTI 470
>gi|66825573|ref|XP_646141.1| hypothetical protein DDB_G0269644 [Dictyostelium discoideum AX4]
gi|60474235|gb|EAL72172.1| hypothetical protein DDB_G0269644 [Dictyostelium discoideum AX4]
Length = 549
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 79/135 (58%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
+VG I+ F G A S GGVGGGGI++P+L L+ +D K++ +S C+V G A++ + N
Sbjct: 137 IVGFILLFIGCALSSGGGVGGGGIYIPILILVSKWDPKTAIPLSNCLVAGCALANFIQNF 196
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
+RHP + +IDY +ALL +PL + G GV + F +I +LL++ S +
Sbjct: 197 PRRHPFSNKHLIDYSVALLIEPLTLAGTIFGVYLHTYFPPLVILLLLVITLGFTSYKTIS 256
Query: 199 KGVETWKKETITKRE 213
KG++ WK E K
Sbjct: 257 KGIDIWKSEKKKKNS 271
>gi|301093157|ref|XP_002997427.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110683|gb|EEY68735.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 432
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 99/194 (51%), Gaps = 9/194 (4%)
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
F ++ ++ T++ F TA G+ GGVGGGG+ VPM ++ G + K + +SK + G AV+
Sbjct: 116 FTYKDLITTLVAFSSTALGAGGGVGGGGLLVPMY-ILGGLNPKHAIPLSKVTIFGSAVAM 174
Query: 134 VVYNLRQRHP-TLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
N R++HP + P+ID+ L L +P ++G GV N IF +W+I V L+ L +
Sbjct: 175 YTVNFRRKHPLNKNRPLIDFGLVGLMEPTTLIGTVFGVMLNHIFPNWLILVCLVTLLTFI 234
Query: 193 STRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKS 252
+ + LKG + E +R K EE + D + L P R+P S
Sbjct: 235 TYKTILKGNTRFDVEVAARRWLNKTRRRRKVRQIREEDEADFQSLP----PLNDRKPTSS 290
Query: 253 KVSIIENIYWKEFG 266
+I N ++FG
Sbjct: 291 D-PLIRNK--RQFG 301
>gi|397643932|gb|EJK76166.1| hypothetical protein THAOC_02088 [Thalassiosira oceanica]
Length = 534
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 73/141 (51%), Gaps = 4/141 (2%)
Query: 104 VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLV 163
VP+ LI+ F K + +S V GG+++ + N R+RHP D P+ID+DL ++ +P +
Sbjct: 80 VPIYILILNFLPKHAIPLSNVTVFGGSIANTLLNWRKRHPVADRPLIDWDLIVVMEPPTL 139
Query: 164 LGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET--ITKRE-AAKQLEL 220
LG IG N + + I +LL+VL + S K ++KET I R + +E
Sbjct: 140 LGALIGANLNKLLPETAIAILLVVLLVYTSFNTLKKAHSMYQKETSEIKHRNINHRSVEP 199
Query: 221 IGNGYQTEECKCDPEY-LSND 240
++TE EY L ND
Sbjct: 200 TAGLFKTESENGLEEYFLLND 220
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 99/218 (45%), Gaps = 23/218 (10%)
Query: 244 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKN-------YTTTC-SVLYWVL 295
+E + PKK +++I ++ ++L + I K C S +WV
Sbjct: 297 DEEKRPKKKNIALIATLFM-----------VVLTINILKGGGAFESPLGIECGSASFWVA 345
Query: 296 NLLQIPVAGGVSAYEAVALYKGRRKIASKG----DEGTKWRASQLVFYCACGITAGMVGG 351
+L + VS L K K G DE +W + + Y AG+ G
Sbjct: 346 QILLLIWICVVSWIGRKMLLKSTAKKTDAGFAYLDEDIRWNGKKTIIYPMISTLAGVAAG 405
Query: 352 LLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFAL 411
L G+GGG I GPL + LG+ P V+SAT+ I F+S + + + YA+ L
Sbjct: 406 LFGIGGGIIKGPLMIALGVHPAVASATSACMILFTSFTATTTFSVYGLMVRDYAIACSIL 465
Query: 412 SIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
++ VGQ V+ +++ R+S I +++ F++ +SA+
Sbjct: 466 GFVSTLVGQKVMNSILRKTNRSSYIAYSIGFVVLLSAI 503
>gi|118348622|ref|XP_001007786.1| hypothetical protein TTHERM_00069480 [Tetrahymena thermophila]
gi|89289553|gb|EAR87541.1| hypothetical protein TTHERM_00069480 [Tetrahymena thermophila
SB210]
Length = 575
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 68/115 (59%), Gaps = 1/115 (0%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
PML L++ + K + +S C++ G ++ + ++HP LD P+IDY++ L+ P+++L
Sbjct: 63 PMLMLMLNYQTKVATFISYCILFGSCLANSTLLIFKKHPLLDKPIIDYNIVLMINPMVLL 122
Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWK-KETITKREAAKQL 218
G +IG+ N++ + +L I +++S F KG+ ++ K+ K + ++QL
Sbjct: 123 GTNIGIFLNILLPEIAAGILFICFLILISPYMFKKGLNLYRLKKEQQKCQLSEQL 177
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 85/185 (45%), Gaps = 16/185 (8%)
Query: 286 TTCSVLYWVLN---LLQIPVAGGVSAYEAVALYKGRRKIASK---GDE-GTKWRASQLVF 338
TTCS YWV N +L VA V + K +R K DE + + +
Sbjct: 374 TTCSTAYWVTNGGIVLLAFVAAFVIRFFLQKWEKNKRIFIEKYHLEDEFASDLDVNNNMN 433
Query: 339 YCA---CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
Y G+TAGM+ G G+G G L P+ L G+ PQV++AT F F S+ ++++ +
Sbjct: 434 YIKISLAGLTAGMLAGTFGVGAGLALVPILLASGVHPQVAAATCGFNYFFISTTTIIQVF 493
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF------TLSFIIFVSAL 449
+ + F LS I F+ ++ + ++ + +I+F L+ + F+ L
Sbjct: 494 TNDYLNISQIILFSILSFIGGFLCAKLIYRYVEKKKASYLIVFIVFGLAILNILAFIVYL 553
Query: 450 SLGGV 454
+ GV
Sbjct: 554 TKKGV 558
>gi|159469083|ref|XP_001692697.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277950|gb|EDP03716.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1854
Score = 70.9 bits (172), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/84 (36%), Positives = 56/84 (66%), Gaps = 2/84 (2%)
Query: 87 FGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD 146
F +A + GVGGG I++P+ N ++GF+ K+S A+S+ +T G+++A+ NL +RHP
Sbjct: 1663 FASAVATASGVGGGAIYIPLFNALIGFELKASTALSQACITAGSLAALGSNLHRRHPLRP 1722
Query: 147 IP--VIDYDLALLFQPLLVLGISI 168
++D+ L L+ P+L++G S+
Sbjct: 1723 EAWHLVDFRLMLVLTPVLLVGSSL 1746
>gi|221505087|gb|EEE30741.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 665
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 65/118 (55%), Gaps = 4/118 (3%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
I+VP++ L++GF + A S+ ++ GG+++ NL +RHP D P ID DL LL P+
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
+ G + G+ N + ++I LL+VL + + F + + K E R KQLE
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTF-RQMMRLKHEGAAAR---KQLE 258
>gi|237836929|ref|XP_002367762.1| hypothetical protein TGME49_005300 [Toxoplasma gondii ME49]
gi|211965426|gb|EEB00622.1| hypothetical protein TGME49_005300 [Toxoplasma gondii ME49]
Length = 665
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 65/118 (55%), Gaps = 4/118 (3%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
I+VP++ L++GF + A S+ ++ GG+++ NL +RHP D P ID DL LL P+
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
+ G + G+ N + ++I LL+VL + + F + + K E R KQLE
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTF-RQMMRLKHEGAAAR---KQLE 258
>gi|145528512|ref|XP_001450050.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417650|emb|CAK82653.1| unnamed protein product [Paramecium tetraurelia]
Length = 400
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 182/382 (47%), Gaps = 45/382 (11%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++G+++G GG+GGG I P++ +++G +K + + M+ GG++
Sbjct: 19 ILGSLVGL-----AQAGGIGGGPIVSPVMMVLLGCSSKQAIWNTYIMLFGGSIGNFARLG 73
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
R++ P+I+Y L + PLL+ G +GVA W+ +++++ + FL
Sbjct: 74 REKIQDGSSPLINYQLVQITLPLLLAGAILGVA----SGKWLPKLVIVIFLFAILLNVFL 129
Query: 199 KGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPK--KSKVSI 256
K +KK RE + LI Q E + + + P+ ++ K +SK+
Sbjct: 130 KTKNVYKKV----REKERNDLLI----QVEMKEININ--DQNILPQNLQQLKDNESKLYP 179
Query: 257 IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGVSA 308
EN+ KE L V I++AL + K T S+L Y +N + GV
Sbjct: 180 TENL--KEIALSV---FIVVALTLLKGAATIPSILGIGYCGYGYHFINF----IIFGVGF 230
Query: 309 YEAVALYKGRRK-----IASKGDEGTKWRASQLV-FYCACGITAGMVGGLLGLGGGFILG 362
Y V Y+ + K S G + + + S++ + AG +GGL+GLGGG +L
Sbjct: 231 YN-VQRYRQQIKKDEVLKESIGYDFSGGKISEVYDITVKSSMKAGFLGGLVGLGGGVVLT 289
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
P++LE GI P ++A+A F + F+S +SV L + + L ++S +++ +
Sbjct: 290 PVWLETGINPARAAASATFTVMFTSFISVFIIALSGGYQLSQFLILASVSGCGSYLVAGI 349
Query: 423 LKKLIKILGRASIIIFTLSFII 444
LKKL+K R SIII L +I
Sbjct: 350 LKKLVKKYKRESIIIQVLLVVI 371
>gi|221482011|gb|EEE20377.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 665
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 65/118 (55%), Gaps = 4/118 (3%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
I+VP++ L++GF + A S+ ++ GG+++ NL +RHP D P ID DL LL P+
Sbjct: 145 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTSLNLFRRHPYADRPAIDLDLVLLMGPM 204
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
+ G + G+ N + ++I LL+VL + + F + + K E R KQLE
Sbjct: 205 QIAGATFGLVINRCWPVYLIMALLVVLLFATAYKTF-RQMMRLKHEGAAAR---KQLE 258
>gi|146096258|ref|XP_001467747.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398020624|ref|XP_003863475.1| hypothetical protein, conserved [Leishmania donovani]
gi|134072113|emb|CAM70812.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501708|emb|CBZ36789.1| hypothetical protein, conserved [Leishmania donovani]
Length = 511
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 179/384 (46%), Gaps = 48/384 (12%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP--TLDIPVIDYDLALLFQPLL 162
PM ++ +S+ +S+ + G + + +RQ+HP + D P+I+Y L P
Sbjct: 120 PMFCGLMEIPMQSAVGMSQSTICGQSTLNMYLIIRQKHPDSSWDRPLINYQYLSLLLPPG 179
Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE----------TITKR 212
++G IG + + D + +LL+VL V+ R++ ET KK+ T+
Sbjct: 180 LIGTLIGGILSKLCPDVLRLILLLVLLSVVLYRSW----ETMKKQYRQDTDPMHVTVEAN 235
Query: 213 EA---AKQLELIGNGYQ-----TEECKCDPEYLSND----TTPEETREPKKSKVSIIENI 260
+A + + GNG TE + L D TPE++ P +
Sbjct: 236 DANATSHRESYDGNGKSQSRELTETAGGAKKGLCEDIAVLPTPEQSPPPIERPPQ--SQY 293
Query: 261 YWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK 320
+E + +A + ++L I + YT LYW+ L +PV A+ +V Y R K
Sbjct: 294 LQQELSMNIACFLVLLLFNIFRTYTVCGGFLYWLCVL--VPV-----AFLSVVFYFNREK 346
Query: 321 IA--SKGDEGT---KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 375
+ ++ D W V Y + AG +LG+GGG +LG + E+GI PQ +
Sbjct: 347 LRKLTESDPAQMTFTWTQRNSVTYPMVAVLAGGSAAMLGIGGGLVLGFVLNEVGIVPQEA 406
Query: 376 SAT---ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 432
SAT A F I FSS++ ++ + V +++ F + + ++ +GQ V+ IK G
Sbjct: 407 SATGGMATFFIAFSSALHLL---ITGSLVVDFSVVFCIVGLCSSALGQLVIMNYIKRRGL 463
Query: 433 ASIIIFTLSFIIFVSALSLGGVGL 456
+ +II +L F++ S ++LGG G+
Sbjct: 464 SYLIIGSLVFVVGGSLVALGGYGI 487
>gi|145501808|ref|XP_001436884.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404029|emb|CAK69487.1| unnamed protein product [Paramecium tetraurelia]
Length = 424
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/352 (22%), Positives = 152/352 (43%), Gaps = 43/352 (12%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDLALLFQPLLV 163
P+L +++ ++ + +S C++ GG V +++ HP P+++YDL ++ +
Sbjct: 62 PILLMMLDYEQSYATQLSYCLMFGGCVINTFLLMKKSHPYDQKRPLVNYDLVVILNCSIA 121
Query: 164 LGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGN 223
LG +G NV A +I + +FL++ L KKE + K +L+L
Sbjct: 122 LGSYLGSILNVFLAP-IIETMFQQIFLIIVIPFLLNKA---KKEKLRKIRCQSELDL--- 174
Query: 224 GYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAK- 282
+YL N T E + + +N + L +A+ I++ I
Sbjct: 175 ----------EKYLLNQKDSIYTEEQQLLLQNEFQN-FCPSKKLAIALSFFIVSQIIMTG 223
Query: 283 ----------NYTTTCSVLYWV-LNLLQIPVAGGVSAYEAVALYKGRRKIASKGD----- 326
N + W+ L ++ I ++ V Y G +K D
Sbjct: 224 GKYLKPFIPLNKCFDFRYMLWIMLFIVNIFMSRLVYTY-------GLKKEMIFDDYKIYM 276
Query: 327 EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFS 386
+ ++ ++ + G AG++ GLL LG G I+ P+ LELG+ P++++AT+ F F
Sbjct: 277 QERYFQKNRFILIYVSGFFAGLISGLLALGAGLIMVPVLLELGLHPRIATATSAFNYFFI 336
Query: 387 SSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF 438
++V+ + + +FF L+++ + H KLI+ L ++I+
Sbjct: 337 GLTNIVKLITDSQISIAEIAWFFGLALVFGTICCHFSLKLIEKLQLVHLVIY 388
>gi|110740663|dbj|BAE98434.1| hypothetical protein [Arabidopsis thaliana]
Length = 210
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 103/199 (51%), Gaps = 14/199 (7%)
Query: 288 CSVLYWVLNLLQIPV----------AGGVSAYEAVALYKGRRKIAS-KGDEGTKWRASQL 336
C YW+++ QIP+ + V + + + + + + ++G + S
Sbjct: 15 CGNAYWLISSSQIPLTLFFTLWICFSDNVQSQQQSDYHVSVKDVEDLRSNDGAR---SNK 71
Query: 337 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYL 396
+ + AG++GG+ G+GGG ++ PL L++GI P+V++AT F + FSS+MS ++Y L
Sbjct: 72 CMFPVMALLAGVLGGVFGIGGGMLISPLLLQVGIAPEVTAATCSFMVLFSSTMSAIQYLL 131
Query: 397 LKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 456
L A F + +A+ VG V++K+I GRASII+F++ ++ +S + + G
Sbjct: 132 LGMEHTGTASIFAVICFVASLVGLKVVQKVITEYGRASIIVFSVGIVMALSIVLMTSYGA 191
Query: 457 AKMIKRIEHKEYMGFDNIC 475
+ YMGF C
Sbjct: 192 LDVWNDYVSGRYMGFKLPC 210
>gi|403365874|gb|EJY82729.1| hypothetical protein OXYTRI_19655 [Oxytricha trifallax]
Length = 683
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 100/196 (51%), Gaps = 11/196 (5%)
Query: 288 CSVLYWVLNLLQIPVAGGVSAYE---AVALYKGRRK----IASKGDEGTKWRASQLVFYC 340
CS + W + ++ I +SAY AV K + K IAS + K +LV
Sbjct: 491 CSTMDWAILMIFITFCSMLSAYSIFIAVREQKLKTKYSMGIASSDIQFNKPAIFKLVISA 550
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G G V G LGLGGG I P+ L +G+PP+V+SAT ++ I FS+S S V Y + +
Sbjct: 551 FIG---GTVSGALGLGGGAIFNPILLSMGVPPKVASATGMYMIMFSTSGSSVIYVMYRML 607
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMI 460
+ Y + S + +G ++L K++K+ R S ++F L F++ +SA+ + G +
Sbjct: 608 NIQYGFWLGFWSSSGSILGMYLLNKVVKMYNRQSPVVFCLVFVLALSAVLVPIFGYIDLK 667
Query: 461 KRI-EHKEYMGFDNIC 475
RI E + F IC
Sbjct: 668 ARISEGVDITKFSTIC 683
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 63/119 (52%), Gaps = 1/119 (0%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
P F+ K++ A+S + +++ + ++ +RHP + VIDY+LA + P++++
Sbjct: 100 PFCMTFFSFETKNAIALSGLTILSCSITRYIVSINERHPERNSVVIDYNLAAIMLPIVLV 159
Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET-ITKREAAKQLELIG 222
G IGV NV F + ++L ++ L L+ K +K+ET + K +Q EL G
Sbjct: 160 GSMIGVLVNVAFPSLYLQIMLTLVLLSLALHTAYKARFIYKQETEMLKVRQQQQKELQG 218
>gi|118348626|ref|XP_001007788.1| hypothetical protein TTHERM_00069500 [Tetrahymena thermophila]
gi|89289555|gb|EAR87543.1| hypothetical protein TTHERM_00069500 [Tetrahymena thermophila
SB210]
Length = 570
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 70/126 (55%), Gaps = 5/126 (3%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
PM+ L++ ++ K + +S C++ G ++ + ++HP D P+IDY++ L+ P+++L
Sbjct: 63 PMVMLMLNYETKIATFISYCVLFGSCLANSTLLIFKKHPLYDKPIIDYNIVLMINPMVLL 122
Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL--ELIG 222
G +IG+ N++ + +L I +++S F KG+ +K + K + QL L+
Sbjct: 123 GTNIGIFLNILLPEIAAGILFIGFLILVSPYMFKKGLNLYK---LKKEQQKNQLSQSLLE 179
Query: 223 NGYQTE 228
N + E
Sbjct: 180 NDQENE 185
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 83/181 (45%), Gaps = 19/181 (10%)
Query: 286 TTCSVLYWVLN--LLQIPVAGGVSAYEAVALYKGRRKI-----ASKGDEGTKWRASQLVF 338
TTCS+ YW+ N +L + VA + ++ +KI K + + +
Sbjct: 369 TTCSISYWITNAGILVLAVAAVFVIRFFLLKWEKNKKILIEKYNLKEEFADDLDVTNSIM 428
Query: 339 YC---ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
Y G+TAGM+ G G+G G L P+ L G+ PQV++AT F F S+ ++++ +
Sbjct: 429 YAKISVAGLTAGMLAGTFGVGAGLALVPILLASGVNPQVAAATCGFNYFFISTTTIIQVF 488
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFT---------LSFIIFV 446
+ + F LS I FV + ++ + +++F +SFII++
Sbjct: 489 TNSYLDLSQIVLFSILSFIGGFVIAKFIYNFVERKKASYLVVFMVFGLAILNIISFIIYL 548
Query: 447 S 447
+
Sbjct: 549 T 549
>gi|301093155|ref|XP_002997426.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110682|gb|EEY68734.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 124
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 65/118 (55%), Gaps = 4/118 (3%)
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP----VPYALYFFALSIIAAF 417
GP+ LE G+ P V SATA F I F++S + +++ + +FP Y +F + + F
Sbjct: 4 GPIMLEAGVLPAVQSATASFMILFTASSTTLQFAINGQFPGEFQFDYMAWFALVGFVGGF 63
Query: 418 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
G + +K R SI+++ L+ I +SAL++G +GL + IE ++GF IC
Sbjct: 64 CGLKCVGYFVKKYKRESIMVYMLATTIGLSALAMGFIGLKLTLADIESGVHLGFHGIC 121
>gi|401405669|ref|XP_003882284.1| hypothetical protein NCLIV_020390 [Neospora caninum Liverpool]
gi|325116699|emb|CBZ52252.1| hypothetical protein NCLIV_020390 [Neospora caninum Liverpool]
Length = 675
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 65/120 (54%), Gaps = 7/120 (5%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPL 161
I+VP++ L++GF + A S+ ++ GG+++ NL +RHP D P ID DL LL P+
Sbjct: 158 IYVPIMILVMGFTVYEATATSQALMFGGSLAGTCLNLFRRHPFADRPAIDLDLVLLMGPM 217
Query: 162 LVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELI 221
+ G + G+ N + ++I +L+VL F +T ++ KRE A +L+
Sbjct: 218 QIAGATYGLVINRCWPVYLIMAVLVVLL-------FATAYKTSRQMMRLKREGAAARKLL 270
>gi|294948280|ref|XP_002785684.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239899707|gb|EER17480.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 392
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/307 (25%), Positives = 140/307 (45%), Gaps = 43/307 (14%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLR-QRHPTLDI-PVIDYD 153
G+GGG +FVP+L++I A+ +AA S+ +V ++ V+ ++ Q H + +I+
Sbjct: 27 GMGGGTLFVPVLHIIGMLSARDAAATSQVLVASATLAKVLSSVYIQLHGNSSVESIINLP 86
Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
A+L P +V+G +GV + VL + ++ S F KG+ WK ET
Sbjct: 87 YAVLMLPPMVVGGLLGVYLYTWLPEIFQLVLYVFTAILASCMGFKKGIGLWKSET-EGAS 145
Query: 214 AAKQLELIGNGYQTEECKCDPEYLSNDTT-PEETREPKKSKVSIIENIYWKEFGL---LV 269
+A QL ++ P ++ DT P T + SI ++ +K+ L L+
Sbjct: 146 SAGQLGVV-----------PPPCVTEDTVLPSVTSRAR----SISLSLKYKKAILITTLL 190
Query: 270 AVWAIILALQIAKNYTTT--------CSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKI 321
AVW ++ ++ ++T C+ LYW L+++ + V VA+ K
Sbjct: 191 AVWIAVILSRLLLGSSSTPSIIGIPYCTGLYWALSVIVCIMLMAVPGLFVVAI-----KS 245
Query: 322 ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
A+ K + L G + L+G GGG ++ PL L + + PQ ++AT
Sbjct: 246 AAMLKLAVKLSGAMLCI--------GFIAALVGQGGGSLITPLLLYMELNPQQAAATGSV 297
Query: 382 AITFSSS 388
+ +SS
Sbjct: 298 VMLITSS 304
>gi|449016482|dbj|BAM79884.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 649
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/228 (27%), Positives = 106/228 (46%), Gaps = 17/228 (7%)
Query: 37 PEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTACGSVGG 96
P +S N + NI PKS + VW D I+V + F +A + G
Sbjct: 121 PCTSSANCSSLQSCNINTGYCDPKSLF-HPFVWND-----GIMVPLM--FISSAVHNAAG 172
Query: 97 VGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLAL 156
GGG ++ + L++GF +++A S + G ++ V+ NL RHP D+P ID++L
Sbjct: 173 TGGGTSYIALFVLVLGFRVATASANSHAFIFAGMIANVLVNLWARHPFRDMPRIDWNLVA 232
Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVE-TWKKETITKREAA 215
PL + G SIGV N +F ++ +++LL L L+L+ GV TW++ E+
Sbjct: 233 TSVPLFLAGSSIGVFLNQLFPNYFLSLLLAFLMLLLTMLVVWLGVRLTWREMKRHPLESN 292
Query: 216 KQLELIGNGYQTEECKCDPEY--------LSNDTTPEETREPKKSKVS 255
+ + + C PE ++ D + E+ ++S+ S
Sbjct: 293 RDPCPGDVLAEQDSLTCGPETSKAHTKTAIATDLSMEDKTLLERSRTS 340
>gi|348690322|gb|EGZ30136.1| hypothetical protein PHYSODRAFT_475542 [Phytophthora sojae]
Length = 124
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 65/118 (55%), Gaps = 4/118 (3%)
Query: 362 GPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP----VPYALYFFALSIIAAF 417
GP+ LE GI P V SATA F I F++S + +++ + +FP Y +F + + F
Sbjct: 4 GPIMLEAGILPPVQSATASFMILFTASSTTLQFAINGQFPGEFQFDYMAWFAFVGFVGGF 63
Query: 418 VGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
G + +K R SI+++ L+ I +SA+++G +GL + +E ++GF IC
Sbjct: 64 CGLKCVGYFVKKYKRESIMVYMLAATIGLSAVAMGFIGLQSTLGDLESGVHLGFHGIC 121
>gi|397620930|gb|EJK66001.1| hypothetical protein THAOC_13100 [Thalassiosira oceanica]
Length = 344
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 61/120 (50%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
+W + V Y A AG G+ G+GGG + GPL L +G+ P+VSSA++ I F+S
Sbjct: 198 RWDSRATVVYPAVCTAAGFFAGMFGVGGGIVKGPLMLAMGVHPKVSSASSACMILFTSFT 257
Query: 390 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
+ + + + YA L A VGQ L L++ R S I F++ I+ +SA
Sbjct: 258 ATTSFVVFGLLDMEYAPVCLLLGFAATLVGQIGLIYLMERFQRNSYIAFSIGAIVALSAF 317
>gi|294948278|ref|XP_002785683.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|294954883|ref|XP_002788342.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239899706|gb|EER17479.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239903654|gb|EER20138.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 354
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 153/359 (42%), Gaps = 73/359 (20%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD--IPVIDYD 153
G+GGG +FVP+L LI K + A+S+ ++ +++A+++N +++ + +I +
Sbjct: 27 GIGGGVLFVPVLRLIGKVPLKEATALSQALIASASLAAILFNFFEQYRARNESKALIVWP 86
Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWKKETI 209
+L P V+G IGV IF+ W+ ++ +++L+ S A KG++ WK ET
Sbjct: 87 FVILIIPCTVIGSLIGV---YIFS-WLPSLFILILYFCYACLGSFMACKKGIKLWKAETR 142
Query: 210 TKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLV 269
KR+ I G T+ + P P + K L+
Sbjct: 143 AKRQ-------IPVGDSTDSSRSSE---VAQEIPPLLEMPNRKK--------------LI 178
Query: 270 AVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGT 329
A +I+ L W + L+ P+ KG+ T
Sbjct: 179 AYTSIV--------------ALIWAVCLIFPPL---------------------KGNSAT 203
Query: 330 KWRASQLV----FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITF 385
+ R+ V I G++ ++G G + + L LG+ P+ ++ATA I
Sbjct: 204 QKRSPGAVKIGLVLMTSTIVIGLLSSIIGTAGALFIIRVVLSLGLDPKQATATATVVIFA 263
Query: 386 SSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 444
+SS + + + L FP L+ L A +G+ ++ KLI GR SI++ L+ ++
Sbjct: 264 TSSRTALSFALGGYFPPASNLWIVVLPFAGALLGKTIVAKLIAKTGRLSILVLLLAAMV 322
>gi|145495145|ref|XP_001433566.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400684|emb|CAK66169.1| unnamed protein product [Paramecium tetraurelia]
Length = 399
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 177/376 (47%), Gaps = 54/376 (14%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++GT++G GG+GGG I P++ +++G +K + + M+ GG++
Sbjct: 19 ILGTLVGL-----AQAGGIGGGPIVSPVMMVLLGCSSKQAIWNTYIMLFGGSIGNYARLG 73
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
R+R P+I+Y L + PLL+ G +GVA W+ V++++ + FL
Sbjct: 74 RERISNGSSPLINYQLVQITLPLLLAGAILGVA----TGKWLPKVVVVIFLFAILLNVFL 129
Query: 199 KGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT--PEETRE--PKKSKV 254
K + +KK I ++E ++ L Q E K E + D + P++ ++ +SK+
Sbjct: 130 KTKKVYKK--IREKEQSELL-------QQVEMK---ELMLTDYSAVPQDLQQILENESKL 177
Query: 255 SIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVL--------YWVLNLLQIPVAGGV 306
EN+ KE V I++AL + K T S+L + +N + +
Sbjct: 178 YPTENL--KEIAFSV---IIVVALTLLKGAATIPSILGISYCGMEFHFINFIIFGI---- 228
Query: 307 SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGIT------AGMVGGLLGLGGGFI 360
Y V Y+ K + + + S GIT AG +GGL+GLGGG +
Sbjct: 229 -GYYNVQRYRKWIKKDEEFKQSLGYDFSGGKISEVFGITVKSSMKAGFLGGLVGLGGGVV 287
Query: 361 LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA--FV 418
L P++LE GI P ++A+A F + F+S +SV L + + F L++I + ++
Sbjct: 288 LTPIWLETGIHPPRAAASATFTVMFTSFISVFIIALSGGYHLS---QFLILAVINSLYYL 344
Query: 419 GQHVLKKLIKILGRAS 434
+LKKL+K R S
Sbjct: 345 IAGILKKLVKKYKRES 360
>gi|224111746|ref|XP_002332885.1| predicted protein [Populus trichocarpa]
gi|222833707|gb|EEE72184.1| predicted protein [Populus trichocarpa]
Length = 151
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 79/133 (59%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
+ AGM+GG+ G+GGG ++ PL L +GI P++++AT F + FSSSMS ++Y LL V
Sbjct: 19 ALLAGMLGGVFGIGGGMLISPLLLHVGIAPEITAATCSFMVFFSSSMSALQYLLLGMEHV 78
Query: 403 PYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKR 462
A+ + +A+ +G V+++ I GRAS+I+F++S ++ +S + + G + +
Sbjct: 79 DTAIILSVICFVASLLGLLVVQRAIVKYGRASMIVFSVSTVMALSTVLMTSFGALNVWRD 138
Query: 463 IEHKEYMGFDNIC 475
MGF C
Sbjct: 139 YNSGRNMGFKLPC 151
>gi|401426825|ref|XP_003877896.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494143|emb|CBZ29440.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 511
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 96/403 (23%), Positives = 179/403 (44%), Gaps = 44/403 (10%)
Query: 99 GGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP--TLDIPVIDYDLAL 156
GG + +PML S+ +S+ + G + + + ++++HP + D P+I+Y
Sbjct: 123 GGLMEIPML---------SAVGMSQSTICGQSTLNMYFAIQEKHPDSSWDRPLINYQYLG 173
Query: 157 LFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR---- 212
L P ++G +G + + D + +LL+VL V+ R++ + ++++T
Sbjct: 174 LLLPPGLIGTLVGGILSKLCPDVLRLILLLVLLSVVLYRSWGTMKKQYRQDTNPSHVTVE 233
Query: 213 ----EAAKQLELIGNGYQTEECKCDPE------YLSNDT----TPEETREPKKSKVSIIE 258
E N +++ C+ + L +T TPE++ P+ +
Sbjct: 234 TGNANETSHRENHDNNDESQRCELTEKAGGVKRELGENTAILSTPEQS--PQSLRCPPQS 291
Query: 259 NIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGR 318
+E L A + ++L I + Y LYW+ L +PV A+ +V + R
Sbjct: 292 QYPQQELSLNFACFLVLLLFNIFRTYAVCGGFLYWLCVL--VPV-----AFLSVVFFFNR 344
Query: 319 RKI-----ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQ 373
K+ ++ W V Y + AG +LG+GGG +LG + E+GI PQ
Sbjct: 345 EKLRKLAGSNPAQMTFAWTQRNSVAYPMVAVLAGASAAMLGIGGGLVLGFVLNEVGIVPQ 404
Query: 374 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 433
+S T A F + S ++ + V + + F + + ++ +GQ V IK G +
Sbjct: 405 EASVTGGMATFFIAFSSALQLLITGSLVVDFGIVFSIVGLCSSALGQLVFMTYIKSHGLS 464
Query: 434 SIIIFTLSFIIFVSALSLGGVGLAKMIKRIEH-KEYMGFDNIC 475
+II +L F++ S +SLGG G+ + I+ MGF +C
Sbjct: 465 YLIIGSLIFVVGGSLVSLGGFGIYNAVISIQAGGSVMGFGCLC 507
>gi|403340471|gb|EJY69522.1| hypothetical protein OXYTRI_09740 [Oxytricha trifallax]
Length = 715
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 64/117 (54%), Gaps = 1/117 (0%)
Query: 360 ILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
+ P+ L +G+PP VSS+T ++ I FS+S S + Y L +A++ IA+ VG
Sbjct: 599 VFNPVLLSMGVPPSVSSSTGMYMIMFSTSGSSIVYILYGMLNYQFAMWLGFWCSIASLVG 658
Query: 420 QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHK-EYMGFDNIC 475
+L K +K R S I+F L I+ +SAL + + K+I++ M F+++C
Sbjct: 659 LQMLNKFVKKYNRQSPIVFLLGLILGLSALLVPIFAYFDLNKQIQNGVNIMKFNSMC 715
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 72/143 (50%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
G+ +P F K++ A+S + ++ VY L + HP D +IDY LA + P
Sbjct: 46 GVVIPFCMTFFTFQTKNAIAMSGFTILTCSIVKFVYGLNEHHPEKDAVIIDYSLATIMLP 105
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLEL 220
++++G IGV N++F ++ +L ++ L+L+ ++ K +KKE + ++ ++ E
Sbjct: 106 VVMMGSMIGVLLNIMFPSLLLQTILTIVLLLLTWQSLGKARTIYKKENLKLQQIKEKQEQ 165
Query: 221 IGNGYQTEECKCDPEYLSNDTTP 243
+ ++ + D + L P
Sbjct: 166 MSQSPMLKKKESDIKMLRFSQQP 188
>gi|348688914|gb|EGZ28728.1| hypothetical protein PHYSODRAFT_475263 [Phytophthora sojae]
Length = 257
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 113/254 (44%), Gaps = 35/254 (13%)
Query: 207 ETITKREAAKQLELIGNGYQTEECKC-DPEYLSNDTTPEETREPKKSKVSIIENIYWKEF 265
T+ K A+Q E G E + S D T EP+ + ++ W++
Sbjct: 20 RTLQKAIRARQKERWQCGVSPESTSLLGVDSASTDGTKAHAGEPQ-----LRADVPWRKI 74
Query: 266 GLLVAVWAIILALQIAK---NYTT-----TCSVLYWVLNLLQIPVAGGVSAYEAVALYKG 317
L ++ ++ +++ + N+ + + S LY VL Q P AYE A
Sbjct: 75 ATLFGLFVVVAGMRLVRGGQNFDSPIGLDSSSALYPVLQ--QSP------AYELEA---- 122
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 377
+W +S + F+ AG V G+ G+GGG I PL LE+G+ P +SA
Sbjct: 123 ---------HEIQWTSSSIKFFPVFSFAAGTVSGMFGIGGGIINAPLLLEVGVDPSAASA 173
Query: 378 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 437
+ FSS MS Y +L + + A + + ++G L K+++ S+I+
Sbjct: 174 MTAATVLFSSGMSSFNYLMLGKLDLDLARLMLPMGFLTTYIGHICLLKVVRRFQCPSLIV 233
Query: 438 FTLSFIIFVSALSL 451
F+++ I+ +SA+++
Sbjct: 234 FSMAAIVLISAVAM 247
>gi|145531341|ref|XP_001451437.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419092|emb|CAK84040.1| unnamed protein product [Paramecium tetraurelia]
Length = 430
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 73/152 (48%), Gaps = 11/152 (7%)
Query: 93 SVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDY 152
+VGG+GGG +P+L +++ F + + G + + + Q+HP D +IDY
Sbjct: 47 TVGGLGGGIEKIPILIVMLNFSQSKATLYVYVLTFGTNLVNFLMLIYQKHPLADKQIIDY 106
Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE----- 207
+L+L+ P + G + G + I D + +LIV F + + ++K + ++E
Sbjct: 107 ELSLILLPTALFGSAFGNILHQILPDIFLISILIVFFSIFVPKLYIKAKQNREQEMQVDN 166
Query: 208 ---TITKREAAKQLELIGNGYQTEECKCDPEY 236
T+ +E LI Y++E+ + P Y
Sbjct: 167 KQKTVINQEVT---HLIAEQYKSEDQQIIPLY 195
>gi|302828096|ref|XP_002945615.1| hypothetical protein VOLCADRAFT_85767 [Volvox carteri f. nagariensis]
gi|300268430|gb|EFJ52610.1| hypothetical protein VOLCADRAFT_85767 [Volvox carteri f. nagariensis]
Length = 1018
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 99/185 (53%), Gaps = 11/185 (5%)
Query: 299 QIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFY------CACGITAGMVGGL 352
Q P G++A A G +A GT A+ LV CA AG+ GGL
Sbjct: 835 QRPDGEGMAAPAAAVAAAGAVAVAEPALGGT---AALLVLEAPVKTLCAT-FGAGLTGGL 890
Query: 353 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 412
LGLGGG ++GPL L++G+ PQV++A++ + FSSS +++++ LL R YAL F A S
Sbjct: 891 LGLGGGMVMGPLLLQIGVHPQVTAASSGAMVLFSSSAALIQFVLLHRLNTDYALVFGAAS 950
Query: 413 IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFD 472
++A VG + IK GR SI++ L+ ++ + + + GL ++ + +GF
Sbjct: 951 LVAGLVGTQTVSGAIKRSGRPSIVVLALAGVMGIGTVCVAAFGLRNAAGQLRRGD-LGFA 1009
Query: 473 NICYT 477
IC +
Sbjct: 1010 GICSS 1014
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 62/94 (65%), Gaps = 1/94 (1%)
Query: 116 KSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLVLGISIGVAFNV 174
K S A+S+ +T +++AVV NL + HP++ + P+ID+ L LL P+L++G+ IGV NV
Sbjct: 11 KPSTALSQACITASSLAAVVSNLPRTHPSVPEAPLIDFPLILLLTPVLLVGVGIGVLLNV 70
Query: 175 IFADWMITVLLIVLFLVLSTRAFLKGVETWKKET 208
W++ +LL+VL L+L +A KG W +E+
Sbjct: 71 ALPSWLLNLLLLVLLLLLLAQAIAKGKALWAQES 104
>gi|157873729|ref|XP_001685369.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128441|emb|CAJ08542.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 511
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 96/401 (23%), Positives = 176/401 (43%), Gaps = 43/401 (10%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP--TLDIPVIDYDLALLFQPLL 162
P+ ++ +S +S+ + G + + ++QRHP + D P+I+Y L PL
Sbjct: 120 PLFCGLMEVPMQSGVGMSQSAICGQSTLNIYLAIQQRHPDSSWDRPLINYQYLSLLLPLG 179
Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIG 222
++G IG + + D + +LL+VL V+ R++ + ++++T + +
Sbjct: 180 LIGTLIGGVLSKLCPDVLRLILLLVLLSVVLYRSWETMKKQYRQDTDPMHVTVEASDANA 239
Query: 223 NGYQ--------------TEECKCDPEYLSND----TTPEET----REPKKSKVSIIENI 260
+Q TE + L D TPE++ P +S+
Sbjct: 240 TSHQKSYDSTGKSQSRELTETAGGAKKVLGEDIAVLPTPEQSPPSIERPPQSQYP----- 294
Query: 261 YWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK 320
+E + A + ++L I + YT LYW+ L +PV A+ +V Y R K
Sbjct: 295 -QQELSMNFACFLVLLLFNILRTYTVCGGFLYWLCVL--VPV-----AFLSVVFYFNREK 346
Query: 321 IASKGDEGT-----KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVS 375
+ + W V Y + AG+ +LG+GGG +LG + E+GI PQ +
Sbjct: 347 LRKLAESDPAQMTFTWTERNSVTYPMVAVLAGVSAAMLGIGGGLVLGFVLNEVGIAPQEA 406
Query: 376 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASI 435
S T+ F + SV++ + V + + F + + ++ +GQ VL IK G +
Sbjct: 407 SVTSGMTTFFIAFSSVLQLLITGSLVVDFGVVFCIVGLCSSALGQLVLMNYIKRRGLNYL 466
Query: 436 IIFTLSFIIFVSALSLGGVGL-AKMIKRIEHKEYMGFDNIC 475
II +L F++ S ++LGG G+ + +I + F +C
Sbjct: 467 IIGSLVFVVGGSLVALGGYGIYSAVISTQTGGSVLAFGRLC 507
>gi|118401229|ref|XP_001032935.1| hypothetical protein TTHERM_00470450 [Tetrahymena thermophila]
gi|89287281|gb|EAR85272.1| hypothetical protein TTHERM_00470450 [Tetrahymena thermophila SB210]
Length = 1325
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 10/140 (7%)
Query: 69 WPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTG 128
+P+I FG IV+ TI+GF G G+GGG P+L + +D K S A+ +V
Sbjct: 899 FPEI-FGS-IVIITILGF-----GQAAGIGGGTSITPILLALFLYDTKKSVALVILLVFS 951
Query: 129 GAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIV 187
++ + R+R T D +PVI+Y L L+ P L++G GVA N + +LL++
Sbjct: 952 SSLGNTIQISRER--THDGVPVIEYRLILVTLPTLIVGTVYGVAVNKFLPSIAVCILLVI 1009
Query: 188 LFLVLSTRAFLKGVETWKKE 207
L +++L+ K E
Sbjct: 1010 LLAQQIQKSYLRYKNMRKNE 1029
>gi|294944267|ref|XP_002784170.1| cgmp-dependent protein kinase, putative [Perkinsus marinus ATCC
50983]
gi|239897204|gb|EER15966.1| cgmp-dependent protein kinase, putative [Perkinsus marinus ATCC
50983]
Length = 311
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 103/228 (45%), Gaps = 36/228 (15%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLA 155
G+GGG +FVP+L LI G K S A+S+ ++ +++A ++N +++ + P A
Sbjct: 4 GIGGGVLFVPVLRLIGGLQLKESTALSQALIASASLAATLFNCFEQYSARNDPK-----A 58
Query: 156 LLFQPLLVLGIS---IGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWKKET 208
L+ P ++L + IG V W+ ++ +++L+ S A+ KG+ WK E
Sbjct: 59 LIVWPFVILTLPCTVIGSLIGVYLYSWLPSLFILILYFCFVCLGSFMAYRKGIRLWKAEN 118
Query: 209 ITKREAAKQLELIGNGYQTEECKCDPEYLSNDTT---PEETREPKKSKVSIIENIYWKEF 265
KR A +G T+ + S++ T P R P K++ +I
Sbjct: 119 GAKRRAV-------DGDSTDMSR------SSEVTVEVPSLLRMPNMKKLAAYTSIA---- 161
Query: 266 GLLVAVWAIILALQIAKNYTTTC----SVLYWVLNLLQIPVAGGVSAY 309
L+ AV+ + TTC +V +W L +L I + G +
Sbjct: 162 ALVWAVYGHFPGIDHDARVATTCGYGYAVDFWTLGILLIEMLTGKPPF 209
>gi|397638312|gb|EJK73008.1| hypothetical protein THAOC_05398, partial [Thalassiosira oceanica]
Length = 318
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 10/76 (13%)
Query: 313 ALYKGRRKIASKG----------DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 362
A Y+GRR +AS D +W A+ L Y A + AG+V GL+G+GGG +LG
Sbjct: 205 AAYQGRRIVASAAAKVRAGYPFRDRDVRWDAAALRLYGAWTLLAGVVAGLVGIGGGMVLG 264
Query: 363 PLFLELGIPPQVSSAT 378
PL L + I P+VS+AT
Sbjct: 265 PLMLAMNIDPRVSTAT 280
>gi|255635532|gb|ACU18117.1| unknown [Glycine max]
Length = 114
Score = 58.9 bits (141), Expect = 5e-06, Method: Composition-based stats.
Identities = 31/70 (44%), Positives = 39/70 (55%), Gaps = 7/70 (10%)
Query: 26 SVSVSA-----QSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVV 80
SVSVSA N+ P + + V+ S SGY+H WPDI+FGWRI+
Sbjct: 24 SVSVSALQHVSHKNINPTTVETTQTSFL--GKVVNFLWSSSGSGYQHTWPDIEFGWRIIT 81
Query: 81 GTIIGFFGTA 90
GTIIGF G+A
Sbjct: 82 GTIIGFLGSA 91
>gi|407410866|gb|EKF33150.1| hypothetical protein MOQ_002989 [Trypanosoma cruzi marinkellei]
Length = 407
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 149/335 (44%), Gaps = 42/335 (12%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT--LDIPVIDYDLALLFQPLL 162
PM L++G + +S+ + G ++ V +R+R P P+I+Y L PL
Sbjct: 48 PMFCLLMGLPMDFAVGLSQSTICGQSILNVFIAIRKRFPCAGCSRPLINYQYLTLLVPLG 107
Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR-----EAAKQ 217
V+G IG N + D + +LL +L + R+ K + ++K+ +R +A++
Sbjct: 108 VIGTLIGGVLNRLCPDLLRLILLFLLLTAVLYRSVRKMIAQYRKDQSERRGTNTVSSAEE 167
Query: 218 LELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILA 277
+ +G T PE + + T P+ W E + + + L+
Sbjct: 168 V----SGTPTLN---SPEEVFHVTQPQYP---------------WIEISCVFFSFIVNLS 205
Query: 278 LQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLV 337
++ T Y V L PVA ++ + Y+ R K W S +
Sbjct: 206 FGAWRSRTKCGGGAYIVAYCL--PVALNIAIFFG---YRHRLSHMEKLSLVFHWSNSTTI 260
Query: 338 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT----ALFAITFSSSMSVVE 393
FY + AG+ +LG+GGG +LG + ++G+ P+ +S T LF + FSS++S+V
Sbjct: 261 FYPLVSVVAGIASAMLGIGGGLVLGFILYDVGLIPEEASVTGGVVTLF-LAFSSALSLV- 318
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
+ F + Y + FA I++ GQ VL +LIK
Sbjct: 319 --IEGHFLLDYGGFLFACGIVSTLFGQFVLMRLIK 351
>gi|348677068|gb|EGZ16885.1| hypothetical protein PHYSODRAFT_503702 [Phytophthora sojae]
Length = 254
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 110/235 (46%), Gaps = 35/235 (14%)
Query: 229 ECKCDPE---YLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQI---AK 282
+C PE L D++ + + P+K + ++ W++ L +++ +I ++I K
Sbjct: 35 QCCVSPEATALLGIDSSEKVSSTPEKKPQA---DVPWRKLATLSSLFVVIAGMRILRGGK 91
Query: 283 NYTT-----TCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLV 337
++ + + S LY +L Q P YE A KW +
Sbjct: 92 DFDSPLGIDSSSALYPMLQ--QTP------GYELEA-------------HEIKWTPLSIR 130
Query: 338 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 397
F+ + AG V G+ G+GGG I GPL LE+GI +SA + FSS MS Y ++
Sbjct: 131 FFPFFSLAAGAVSGMFGIGGGIINGPLLLEVGIDASAASAMTATTVLFSSGMSAFNYTVM 190
Query: 398 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLG 452
+ + A + + ++GQ L K+++ S+IIF+++ I+ +SA+++
Sbjct: 191 GKTDIHLAQVLLPMGFLMTYIGQLCLLKVVRRFQCPSLIIFSMAVIVLISAIAMS 245
>gi|294944271|ref|XP_002784172.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239897206|gb|EER15968.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 180
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 78/156 (50%), Gaps = 21/156 (13%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP--VIDYD 153
G+GGG +FVP+L LI G K S+++S+ +V A++A ++N ++ + P +I +
Sbjct: 4 GIGGGILFVPVLRLIGGLSQKESSSLSQALVAASALAANLFNFYAQYRAKNEPKALIVWP 63
Query: 154 LALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL----STRAFLKGVETWKKETI 209
+L P V+G IG + W+ ++L ++L+ ++ S A+ KG + WK ET
Sbjct: 64 FVILMLPCAVVGSLIG----IYLYSWLPSLLQLILYFIVACFGSLAAYRKGYKLWKAETD 119
Query: 210 TKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEE 245
K A ++ + T C P PEE
Sbjct: 120 AKESAIREFD------STTTVTCSP-----SVEPEE 144
>gi|403352629|gb|EJY75833.1| hypothetical protein OXYTRI_02776 [Oxytricha trifallax]
Length = 366
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 61/110 (55%), Gaps = 5/110 (4%)
Query: 113 FDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP---VIDYDLALLFQPLLVLGISIG 169
FD K++ A+S + +V+ +Y L ++HP D +I+Y+LA++ P +++G G
Sbjct: 118 FDTKNAIAISNFAIFTCSVTRYIYTLDKKHPHPDKKQNVIIEYNLAIVMLPTVMMGSLTG 177
Query: 170 VAFNVIFADWMITVLLIVLFLVLSTRAFLKG--VETWKKETITKREAAKQ 217
V N+IF + +L L + LS ++ +KG ++ +K ++ K KQ
Sbjct: 178 VFLNIIFPAIALQAILTALLIFLSLQSLMKGKDIQKFKGQSHNKSRILKQ 227
>gi|449016905|dbj|BAM80307.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 804
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 53/105 (50%)
Query: 113 FDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAF 172
F ++A S+ ++ G +++ VYN R+RH D P ID++L + P + G + G
Sbjct: 131 FPIPQASANSQSLILGASIANSVYNFRKRHVIRDAPRIDWNLVISTMPFFLCGTTPGYFL 190
Query: 173 NVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ 217
N+ + +L + L+ ++FL G +++ +RE +Q
Sbjct: 191 NISLPGYFTGFVLAAMLGALTIQSFLSGTRMTRRQWRMRREFLRQ 235
>gi|302828094|ref|XP_002945614.1| hypothetical protein VOLCADRAFT_85766 [Volvox carteri f.
nagariensis]
gi|300268429|gb|EFJ52609.1| hypothetical protein VOLCADRAFT_85766 [Volvox carteri f.
nagariensis]
Length = 136
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 59/103 (57%), Gaps = 1/103 (0%)
Query: 373 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 432
QV++A++ + FSSS +++++ LL R YA+ F A S++A +G + + IK GR
Sbjct: 33 QVTAASSSTMVLFSSSAALIQFILLHRLNTDYAIVFGAASLVAGLLGTQAVSRAIKRSGR 92
Query: 433 ASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNIC 475
S+++ L+ +I ++ L + GL ++ E +GF IC
Sbjct: 93 PSVVVLALAGVIGIATLCVAIFGLRNAAVQLRAGE-LGFLGIC 134
>gi|452819360|gb|EME26420.1| hypothetical protein Gasu_59110 [Galdieria sulphuraria]
Length = 579
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 60/112 (53%), Gaps = 5/112 (4%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
IFVP+L +++ F + VS+ M+TG AV+ +RHP D P + Y++ P
Sbjct: 107 PIFVPVLEVMLNFAVSQAGGVSQSMITGMAVAVSTIAFLKRHPERDRPAVYYEIVTSMVP 166
Query: 161 LLVLGISIGVAFNVI----FADWMITVLLIVLFLVLSTRAF-LKGVETWKKE 207
+ ++G IG N + F +++ ++LI L + S +A L+ +E +++
Sbjct: 167 ICLMGTFIGTYVNQVLPGYFTAFVVVIILIYLVITTSMKAMSLRRIELHERD 218
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 83/205 (40%), Gaps = 22/205 (10%)
Query: 262 WKEFGLLVAVWA---IILALQIAKNYTTT------CSVLYWVLNLLQIPVAGGVSAYEAV 312
W + LLV W+ I +AL+ K + C +YW L + V +S+ +
Sbjct: 351 WGDLLLLVTTWSFVIIFVALRGGKPRIISPLGVHLCGWIYWFLLAILELVLLIISSVTML 410
Query: 313 ALYKGRRKIASKG----DEGTKWRASQLVFY---C-ACGITAGMVG-GLLGLGGGFILGP 363
LY + G +W L+ Y C CG+ A VG + L GF+L
Sbjct: 411 RLYSLHQHRVRLGYLFCRSDVRWTPKTLILYGIFCFVCGLVASWVGISVETLAAGFLL-- 468
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
+ LG+ P V T F+SS E YAL++ + + A VG V+
Sbjct: 469 --VVLGVDPLVVQLTGGVINLFTSSAIAAESAANGSLAWRYALFYAGFTFLGALVGVLVV 526
Query: 424 KKLIKILGRASIIIFTLSFIIFVSA 448
+K SII+F L F + V+
Sbjct: 527 GHFVKKYHLKSIIVFCLVFFLIVAT 551
>gi|71420985|ref|XP_811672.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876359|gb|EAN89821.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 473
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 91/349 (26%), Positives = 152/349 (43%), Gaps = 38/349 (10%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT--LDIPVIDYDLALLFQPLL 162
PM L++G + +S+ + G ++ V+ +R+R P P+I+Y L PL
Sbjct: 114 PMFCLLMGLPMDFAVGLSQTTICGQSILNVLIAIRKRFPCAGCSRPLINYQYLTLLVPLG 173
Query: 163 VLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR--EAAKQLEL 220
V+G IG N + D VLL +L + R+ K + ++K+ +R E
Sbjct: 174 VIGTLIGGVLNRLCPDLFRLVLLFLLLTAVLYRSVRKMIAQYRKDQSERRGTNTVSSTEE 233
Query: 221 IGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQI 280
+ +G T PE + + T P+ W E +V + + L+
Sbjct: 234 V-SGTPTLN---SPEEIFHVTQPQYP---------------WIEISCVVFSFIVNLSFGA 274
Query: 281 AKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYC 340
++ T Y V L PV + + Y+ R K W S + Y
Sbjct: 275 WRSRTKCGGGAYIVAYCL--PVVLNIVIFFC---YRHRLSNMEKFRLVFHWNNSTTILYP 329
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT----ALFAITFSSSMS-VVEYY 395
+ AG+ +LG+GGG +LG + ++G+ P+ +S T LF + FSS++S ++E +
Sbjct: 330 LVSVVAGVASAMLGIGGGLVLGFILYDVGLIPEEASVTGGVVTLF-LAFSSALSLLIESH 388
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFII 444
LL + Y FA I++ +GQ VL +LIK +II L II
Sbjct: 389 LL----IDYGGVLFACGIVSTILGQFVLMRLIKKYKLKFLIIAALVTII 433
>gi|323456207|gb|EGB12074.1| hypothetical protein AURANDRAFT_9279, partial [Aureococcus
anophagefferens]
Length = 89
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 52/90 (57%), Gaps = 4/90 (4%)
Query: 126 VTGGAVSAVVYNLRQRH---PTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT 182
++GG++ A +Y QR+ PTL P+IDYD +LLF P L+ G G F+V+F W++
Sbjct: 1 ISGGSI-ANLYTYTQRYHPNPTLRRPLIDYDASLLFCPALLAGTMFGGLFSVMFPPWLVV 59
Query: 183 VLLIVLFLVLSTRAFLKGVETWKKETITKR 212
+ L+VL R KG+ W E+ ++
Sbjct: 60 ICLVVLLGYSGKRTVKKGIAKWNAESAKQK 89
>gi|159115095|ref|XP_001707771.1| Hypothetical protein GL50803_4181 [Giardia lamblia ATCC 50803]
gi|157435878|gb|EDO80097.1| hypothetical protein GL50803_4181 [Giardia lamblia ATCC 50803]
Length = 520
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 67/159 (42%), Gaps = 16/159 (10%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRH---PTLDIPVIDYDLALLF 158
IFV ML L G +A +SK M+ GG+ N+ Q PT P I +DL +
Sbjct: 39 IFVSMLQLF-GVSPHVAAPLSKAMIFGGSCVLTCMNIFQHEDNEPTK--PSIIWDLVFII 95
Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT-------- 210
+P V G IG NV+ +W++ VL + L + + + T KE I
Sbjct: 96 EPAAVSGALIGALINVVLPEWLLLVLEVAFLLYTTQKMLRSSLATLNKERIAAGKRLLCT 155
Query: 211 --KREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETR 247
R A ++ G+ +Q D S +TT E +
Sbjct: 156 RKSRAPALSIDERGSPHQPSTFIEDQSTRSGNTTSNEIQ 194
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 66/134 (49%), Gaps = 2/134 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG++G +LG+GGG + P+ + GI P+ + + I F+S S++ Y ++
Sbjct: 388 GLFAGILGAMLGIGGGLLKNPILISFGIDPERARTASTVMIAFTSMSSMISYVVIGGLHF 447
Query: 403 PYALYFFALSIIAAFVGQHVLKKL-IKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK 461
YA + L++ A FV + L +L I+ S I F ++ +I V + + I
Sbjct: 448 EYA-WPLMLTVGAFFVSGYYLSELIIRCFKTKSFIPFLITALIVVCTCFIVANMIIVFID 506
Query: 462 RIEHKEYMGFDNIC 475
+ GF ++C
Sbjct: 507 IAKTGHLPGFTSLC 520
>gi|145493138|ref|XP_001432565.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399678|emb|CAK65168.1| unnamed protein product [Paramecium tetraurelia]
Length = 441
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 75/343 (21%), Positives = 153/343 (44%), Gaps = 34/343 (9%)
Query: 94 VGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT-LDIPVIDY 152
+GG+G G I P+LNL++ + + S V+ C + + ++ ++HP ++P+I++
Sbjct: 55 MGGLGSGMIKRPILNLLLNYPSSISTQVADCFLFTTSTLNSLFLFFEKHPDHHNLPLINF 114
Query: 153 DLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
D++++ + L S G ++I ++ + FL+ + K + + ++ I ++
Sbjct: 115 DISIILNQTIPLAWSAGAFLQTRSPKFVIYIIQLC-FLMGAIPFLWKYMLAYLQKRI-EQ 172
Query: 213 EAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSI-------IENIYWKEF 265
+ + + L+ +T+E ++++T +E + + K I I+N+ + F
Sbjct: 173 DKRENVILVNQKIKTQED------MASETNFDEKQLTQYQKFYINDHSKFQIKNLCFI-F 225
Query: 266 GLLVAVWAIILALQIAKNYT----TTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKI 321
G + IIL N + C++ NL+ + + G Y + R +
Sbjct: 226 GSFLVNQTIILMRSNKYNNSIIGLDKCTLEN---NLILLAILGINLTYTFFIYWSKRNEE 282
Query: 322 ASKGDEGTKWRASQ--------LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQ 373
K + ++R +Q FY G AG + G LG+GGG I+ L I +
Sbjct: 283 YYK--DIVQYRPNQRFFKDKKQFFFYYMAGFLAGFITGFLGMGGGLIMVTFLLSQKIIAR 340
Query: 374 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA 416
++ATA F S S++++ L K + FF L I+
Sbjct: 341 EAAATAAFGSFMISLNSLIQFILQKTISDEQMIVFFILGILGV 383
>gi|253746884|gb|EET01870.1| Hypothetical protein GL50581_856 [Giardia intestinalis ATCC 50581]
Length = 511
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 7/113 (6%)
Query: 70 PDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGG 129
P ++ +VV + F S G+GGG IFV ML L G +A +SK M+ GG
Sbjct: 8 PGLEVWMEVVVAIVCALFAML-ASAAGIGGGVIFVSMLQLF-GVSPHVAAPLSKAMIFGG 65
Query: 130 A---VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
+ ++ PT P I +DL + +P V G IG NV+ +W
Sbjct: 66 SCVLTCMTIFQHEDNDPTK--PAIIWDLVFIIEPAAVSGALIGALINVVLPEW 116
Score = 46.6 bits (109), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 2/105 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG++G +LG+GGG + P+ + GI P+ + + I F+S S++ Y ++
Sbjct: 379 GLFAGILGAMLGIGGGLLKNPILISFGIEPERARTASTVMIAFTSMSSMISYVVIGGLHF 438
Query: 403 PYALYFFALSIIAAFVGQHVLKKLI-KILGRASIIIFTLSFIIFV 446
YA + L I FV + L LI + S I F ++ +I V
Sbjct: 439 EYA-WPLMLVIGVFFVSGYYLSGLIMRCFKTKSFIPFIITALIIV 482
>gi|145538953|ref|XP_001455171.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422970|emb|CAK87774.1| unnamed protein product [Paramecium tetraurelia]
Length = 406
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 152/370 (41%), Gaps = 67/370 (18%)
Query: 126 VTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIF-ADWMITVL 184
+ G A+ + +Q P PV+ A++ P +++G +G+ N I A ++I++L
Sbjct: 84 IFGAALGNFLNLAQQLDPQTKTPVVKIRSAIVACPAMIIGSMVGLILNKILPAFFLISIL 143
Query: 185 LIVLFLVLSTRAFLK-GVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTP 243
FL LS + F K ++ WK E Q +L N + + N
Sbjct: 144 --QYFLFLSCQKFYKTALKEWKNE--------NQRKLTQNSVEIP--------ILNKLQE 185
Query: 244 EETREPKKSKVSIIENIYWKEFGLLVAVWAIILA------LQIAKNYTTT-----CSVLY 292
E+T + S+ ENI L+VA+ I++ L+ + N + C Y
Sbjct: 186 EQTLQ----GASLKENI-----DLIVAIGVIVVTVLLGFFLRGSPNVESIIGISYCGFFY 236
Query: 293 WVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGL 352
W++ L + V Y ++ +K K + R S AG+ G
Sbjct: 237 WIITLGLVVVL----YYYFEYIFDRFQKEELKPLKKECLRDS---------FKAGIFNG- 282
Query: 353 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 412
GLGGG IL P++L +G S+ T+ F + SS + +L A F ++
Sbjct: 283 FGLGGGMILIPMYLGMGFTTIQSTGTSSFNVLLSSFQVAAQVIILGYMGTLQAFSLFTMT 342
Query: 413 IIAAFVGQHVLKKLIKILGRASIIIFTL-SFIIFVSALSLGGVGLA-KMIKRIEHKEY-- 468
++ ++ + +K R S+I++ L F+IF V LA + I++ H Y
Sbjct: 343 VVGCYLSSSYIFNNLKKRDRLSLIMWGLVGFVIFAM------VNLAIQFIQKWSHNGYQI 396
Query: 469 ---MGFDNIC 475
F +IC
Sbjct: 397 SELFAFQSIC 406
>gi|408404324|ref|YP_006862307.1| sulfonate transporter [Candidatus Nitrososphaera gargensis Ga9.2]
gi|408364920|gb|AFU58650.1| putative sulfonate transporter [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 256
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 49/86 (56%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
+ +V A G+ AG +G ++G+GGG I+ P LG+PP +++T+L A+T +S S +E
Sbjct: 6 TTIVVLIAVGLGAGTLGSMIGVGGGIIMVPALTFLGLPPAQAASTSLIAVTSTSVSSTIE 65
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVG 419
Y KR L A ++ A +G
Sbjct: 66 YSRQKRIDYRLGLTMSAFAVPGAVLG 91
>gi|401412944|ref|XP_003885919.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325120339|emb|CBZ55893.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 486
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/105 (24%), Positives = 58/105 (55%)
Query: 114 DAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
D + +SK + G + +V++NL ++ P+ ++ +I Y+LA + +P ++G +GV N
Sbjct: 41 DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLMGGVLGVLLN 100
Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQL 218
++ +D I L+V+ + + +G+ ++ E+ E A +L
Sbjct: 101 IVMSDIQIICCLVVVLSFTTYKTTRRGIIQYQTESRLLAERAARL 145
>gi|405984262|ref|ZP_11042565.1| hypothetical protein HMPREF9451_01696 [Slackia piriformis YIT
12062]
gi|404388094|gb|EJZ83178.1| hypothetical protein HMPREF9451_01696 [Slackia piriformis YIT
12062]
Length = 274
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 5/93 (5%)
Query: 339 YCA----CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVE 393
YCA G+ AG++ G +G+GGGFI+ PLF+ L IP +++S T+L A+ S +E
Sbjct: 153 YCAKVALIGVVAGLLSGYVGVGGGFIMVPLFMTLLSIPMRLASGTSLIAVCILSVPGTIE 212
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
LL + A SI AFVG ++KK+
Sbjct: 213 QALLGNIHFLLGIAMAAGSIPGAFVGASMVKKV 245
>gi|308160584|gb|EFO63064.1| Hypothetical protein GLP15_1793 [Giardia lamblia P15]
Length = 520
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 6/111 (5%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQ---RHPTLDIPVIDYDLALLF 158
IFV ML L G +A +SK M+ GG+ N+ Q PT P I +DL +
Sbjct: 39 IFVSMLQLF-GVSPHVAAPLSKAMIFGGSCVLTCINIFQYEDDEPTK--PSIIWDLVFII 95
Query: 159 QPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETI 209
+P V G IG NV+ +W++ VL ++ L + + + T KE I
Sbjct: 96 EPAAVSGALIGALINVVLPEWLLLVLEVIFLLYTTQKMLRNSLTTLNKERI 146
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 57/105 (54%), Gaps = 2/105 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG++G +LG+GGG + P+ + GI P+ + + I F+S S++ Y ++
Sbjct: 388 GLFAGILGAMLGIGGGLLKNPILISFGIDPERARTASTVMIAFTSMSSMISYTVIGGLHF 447
Query: 403 PYALYFFALSIIAAFVGQHVLKKL-IKILGRASIIIFTLSFIIFV 446
YA + L++ FVG + L +L I+ S I F ++ +I V
Sbjct: 448 EYA-WPLMLTVGTFFVGGYYLSELMIRCFRTKSFIPFIITALIVV 491
>gi|297526402|ref|YP_003668426.1| hypothetical protein Shell_0395 [Staphylothermus hellenicus DSM
12710]
gi|297255318|gb|ADI31527.1| protein of unknown function DUF81 [Staphylothermus hellenicus DSM
12710]
Length = 250
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 6/111 (5%)
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSS 376
R++I +K +K R V AG+ G+LG+GGG + PL + LG+P + +
Sbjct: 122 RKRIDAKNHMVSKQRKGLGVLIA---FLAGLTSGMLGIGGGVLKVPLMTIVLGLPIRTAI 178
Query: 377 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII-AAFVGQHVLKKL 426
AT+ F + ++S + YLLK + PYA+ AL II A +G H+LKK+
Sbjct: 179 ATSSFMVGLTASAGSL-VYLLKGYVNPYAVAALALGIIPGATLGAHMLKKI 228
>gi|145549159|ref|XP_001460259.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428088|emb|CAK92862.1| unnamed protein product [Paramecium tetraurelia]
Length = 482
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 78/390 (20%), Positives = 146/390 (37%), Gaps = 92/390 (23%)
Query: 92 GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVI 150
G++G +G G + P L + FD++S+ ++ C+ ++ ++ QRHP + P+I
Sbjct: 77 GALGALGAGVVKRPFLEATLNFDSESAGNMTMCLQFASQLANIIIIFFQRHPEDPERPII 136
Query: 151 DYDLALLFQPLLVLGISIGVAF-------NVIFADWMITVLLIVLFLVLSTRAFLKGVET 203
+Y++AL++ + L S+G F ++++ + V + + + +T+ +
Sbjct: 137 NYEIALIYCLAIPLSQSLGTEFANYLPLGSLLWIQNIFFVAVCPILFLFATKELTLQTQK 196
Query: 204 WKKETITKREAAKQLE-------------LIGNGYQTEECKCDPEY-------------- 236
+ I + A LE ++ + ++C P +
Sbjct: 197 EQDNDILLKSALVPLEGLKDQQQLQQQEAVLYKQFYNDQCNKFPMFPIVVTVGTFAINEA 256
Query: 237 --LSND----TTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQIAKNYTTTCSV 290
LS T+P T E KK + + A W + L + T V
Sbjct: 257 IILSRTTPGLTSPYYTNETKKERT-------------ICAAWNFYMLLLLFAVNTIITLV 303
Query: 291 LYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDEGTK-WRASQLVFYCACGITAGMV 349
+Y++ + +AV + R GT+ W+ A G G+V
Sbjct: 304 VYYIKRKEEY-------TKDAVKFFNCERFYTP----GTRFWKIY------AAGWVTGIV 346
Query: 350 GGLLGLGGGFILGPLFLELGIPPQVSSATA----------LFAITFSSSMSVVEYYLLKR 399
G G+ G I+ E G+ V+ TA LFA S+ +
Sbjct: 347 AGFFGMAAGLIMVTTMSEFGLISAVAGGTANFCYFIICCQLFATVLSNQIEE------NN 400
Query: 400 FPVPYALYFFALSIIAAF----VGQHVLKK 425
F V +F+ L II+ +G + LKK
Sbjct: 401 FGVGDQFFFYGLGIISVLIFTNIGYYYLKK 430
>gi|167521960|ref|XP_001745318.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776276|gb|EDQ89896.1| predicted protein [Monosiga brevicollis MX1]
Length = 512
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 71/156 (45%), Gaps = 11/156 (7%)
Query: 104 VPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP---VIDYDLALLFQP 160
VPML ++ F + +S + GG++ + ++++HP + +I+Y + LL P
Sbjct: 45 VPMLLVVGDFPTYYAIPLSVTAIVGGSIVRFIMQVQRKHPNPKVAHRQLINYPMVLLLLP 104
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKK-ETITKREAAKQLE 219
+ + G IGV N + +W+I L +FLVL+ +F +T KK + + +E +
Sbjct: 105 MALAGTVIGVLLNSVAPNWLI---LATIFLVLTYTSF----KTLKKGKELRAKEKEAHAQ 157
Query: 220 LIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVS 255
+ T E D + + P + S +
Sbjct: 158 MAATELHTMELIVDDNGDNKNKVPHVADDSGDSGID 193
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 44/78 (56%)
Query: 374 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 433
VSSAT + F+S S ++ +L R PV Y + F L+ +A+ +GQ L ++ G+
Sbjct: 371 VSSATTSYMTLFTSISSFTQFLVLNRVPVDYGILLFFLAAVASVLGQLALNSYVRKTGKN 430
Query: 434 SIIIFTLSFIIFVSALSL 451
SII + L II ++ + L
Sbjct: 431 SIIAYILGVIISLATVLL 448
>gi|403374048|gb|EJY86957.1| hypothetical protein OXYTRI_08652 [Oxytricha trifallax]
Length = 426
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 156/377 (41%), Gaps = 62/377 (16%)
Query: 129 GAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLVLGISIGVAFNVIFAD-WMITVLLI 186
++S + N QRHP + IDYD+ L P++ LG GV + + T
Sbjct: 42 SSLSRFIINYTQRHPKQPERLAIDYDIIELSMPIMYLGTLFGVQIGTRLTEVQLATTFAC 101
Query: 187 VLFLVLSTRAFLKGVETWKKET---------------------------------ITKRE 213
VLF V S + K + K+E I RE
Sbjct: 102 VLFFV-SYKTTTKAISMIKEENAKKKMKYLSYTDKNEYKELLLPKSNRTSKQNSPIKDRE 160
Query: 214 AAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWK---------- 263
+ L+ + + + S DT EE + +K++ +IE Y +
Sbjct: 161 SDSALQKLSHQNDQNLSTVQVQLFS-DTPFEEHNQQQKNR-QMIEKQYQQVLKDQSQHFT 218
Query: 264 -----EFGLLVAVWAIILALQIAKNYTTT-CSVLYWVLNLLQIPVAGGVSAYEAVA---- 313
FGL + I L ++ + S +Y +LL + S +
Sbjct: 219 LRRCLTFGLTLGFLMITSTLLNQRDDSQYYVSPMYGYASLLLFLIYAVFSTIKKAQYLQE 278
Query: 314 LYKGRRKIASKGDEGT-KWRASQLVF--YCACGITAGMVGGLLGLGGGFILGPLFLELGI 370
++ + ++ K D+ + +++F C I +G++GG++G+ GG IL PLFL+LG+
Sbjct: 279 MHSVKLQMGYKFDQNDLTYENYRVIFNIILLCHI-SGILGGVVGIAGGIILAPLFLQLGM 337
Query: 371 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 430
P + ++T + S+ +Y+ L Y F L II +++G V+ K+++
Sbjct: 338 LPVIVASTNQYLALISTIAVSSQYWYLGMMNWQYCSVFGFLGIIGSYIGIKVVMKIVQKS 397
Query: 431 GRASIIIFTLSFIIFVS 447
GR SI++ L+F++F S
Sbjct: 398 GRQSIMVVILAFVLFTS 414
>gi|403359664|gb|EJY79493.1| hypothetical protein OXYTRI_23231 [Oxytricha trifallax]
Length = 420
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 86/378 (22%), Positives = 154/378 (40%), Gaps = 70/378 (18%)
Query: 129 GAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLVLG----ISIGVAFNVIFADWMITV 183
++S + N QRHP + IDYD+ L P+L LG + IG N + + T
Sbjct: 42 SSLSRFIINYAQRHPKQPERLAIDYDIIELSMPILYLGTLFGVQIGTRLNEV---QLATT 98
Query: 184 LLIVLFLVLSTRAFLKGVETWKKET---------------------------------IT 210
VLF V S + K + ++E I
Sbjct: 99 FACVLFFV-SYKTTTKAISMIREENAKKKMKYLSYTDKNEYKDLLLPKSNRTSKQNSPIK 157
Query: 211 KREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWK------- 263
RE+ L+ + + + S+ TP E + ++ K ++E Y +
Sbjct: 158 DRESDSALQKVSHQVNQNLSSVQVQLFSD--TPFEKQNQQQEKRQMLEKKYQQVLKDQSQ 215
Query: 264 --------EFGL---LVAVWAIILALQIAKNYTTTCSVLYWVLNLLQ-IPVAGGVSAYEA 311
FGL + + + +L + Y +L+ + + I A + +
Sbjct: 216 HFTLKRSLTFGLTLGFLMITSTLLNYYVNPMYGYASLLLFLIYAVFSTIKKAQYLQETHS 275
Query: 312 VALYKGRRKIASKGDEGTKWRASQLVF--YCACGITAGMVGGLLGLGGGFILGPLFLELG 369
V L G + + +++F C I +G++GG++G+ GG IL PLFL+LG
Sbjct: 276 VKLQMGYQF----DQNDLNYENYKVIFNIILLCHI-SGILGGVVGIAGGIILAPLFLQLG 330
Query: 370 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 429
+ P + ++T + S+ +Y+ L Y F L II +++G V+ K+++
Sbjct: 331 MLPVIVASTNQYLALISTIAVSSQYWYLGMMNWQYCSVFGFLGIIGSYIGIKVVMKIVQK 390
Query: 430 LGRASIIIFTLSFIIFVS 447
GR SI++ L+F++F S
Sbjct: 391 SGRQSIMVVILAFVLFTS 408
>gi|294891216|ref|XP_002773478.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239878631|gb|EER05294.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 385
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 62/107 (57%), Gaps = 1/107 (0%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
+++P ++G DA + +SK + G AVSA ++NLRQRHPT D P+ID D+ LL +P
Sbjct: 34 ALYMPAFVAVMG-DAHWAVPLSKVAINGVAVSATIFNLRQRHPTYDQPLIDLDIGLLLEP 92
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE 207
L +LG +GV NV I L+++ + + F K ++ + E
Sbjct: 93 LTLLGSMLGVYLNVAMTSVEIFSCLVLVLSITAALTFRKAIQRRRLE 139
>gi|449017188|dbj|BAM80590.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 606
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 108/237 (45%), Gaps = 29/237 (12%)
Query: 262 WKEFGLLVAVWAIILALQI----AKNYTTT----CSVLYWVLNLLQIPV---AGGVSAYE 310
W+ L+A W ++ +QI A+ ++ C +YW+L +Q V G + +
Sbjct: 372 WRYMLFLLACWLVLFIIQILSGSAEKHSPVGAPLCGAVYWILFAIQESVLFAMGCFTIWR 431
Query: 311 AVALYKGRRKI-----ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 365
++L + R ++ + G +W + + Y AG+ G +G+GG +L P F
Sbjct: 432 NLSLNRLRDRLGYPWYSKNGLGDIRWTRALIFGYPVWAFIAGVFGSWVGIGGSSLLIP-F 490
Query: 366 LEL---GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
L L P V S+ +L + S+S + V Y R + Y L+F +++ ++ G +
Sbjct: 491 LNLVGRADPATVQSSMSLSNLLGSASGAFV-YLAQGRLNISYGLFFGLFALLGSYTGVWM 549
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMG----FDNIC 475
+ L++ ++ +F L+ I FV+AL G L I+ E G F NIC
Sbjct: 550 VYFLVERYQIRALFVFALT-ICFVTAL---GAILYIAIRNTLGVEAAGVGWVFTNIC 602
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 58/106 (54%)
Query: 74 FGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSA 133
F W V ++ F T + GG+GGG + VP+L+L GF K+++A ++ ++ G +++
Sbjct: 85 FLWSDGVLIVLVFLATGLSNAGGIGGGILLVPLLSLFAGFTLKNASANAQPLIFGASIAN 144
Query: 134 VVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADW 179
+ N+ +RH D P ID++L P ++G + G N F +
Sbjct: 145 AMVNVPRRHAARDTPRIDWNLVACTVPFFLMGTTPGTFMNQAFPSY 190
>gi|159479540|ref|XP_001697848.1| hypothetical protein CHLREDRAFT_151478 [Chlamydomonas reinhardtii]
gi|158273946|gb|EDO99731.1| predicted protein [Chlamydomonas reinhardtii]
Length = 929
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 65/115 (56%), Gaps = 1/115 (0%)
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
PL L LG+ PQV++AT+ + FSSS +++++ L YA F A S +AA G V
Sbjct: 816 PLLLHLGVHPQVTAATSGAMVLFSSSTALLQFALAGELNAQYAAVFAAASAVAALAGTLV 875
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKEYMGFDNICYT 477
+ L++ GR SI++ L+ ++ + +S+ GL + K + + +GF +C T
Sbjct: 876 VAGLVRRSGRPSIVVLALAGVMGLGLVSVAVFGLQRAAKDLGAGD-IGFSQLCAT 929
>gi|294925749|ref|XP_002778995.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239887841|gb|EER10790.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 397
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 62/107 (57%), Gaps = 1/107 (0%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQP 160
+++P ++G DA + +SK + G AVSA ++NLRQRHPT D P+ID D+ LL +P
Sbjct: 34 ALYMPAFVAVMG-DAHWAVPLSKVAINGVAVSATIFNLRQRHPTYDQPLIDLDIGLLLEP 92
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE 207
L +LG +GV NV I L+++ + + F K ++ + E
Sbjct: 93 LTLLGSMLGVYLNVAMTSVEIFSCLVLVLSITAALTFRKAIQRRRLE 139
>gi|126465319|ref|YP_001040428.1| hypothetical protein Smar_0408 [Staphylothermus marinus F1]
gi|126014142|gb|ABN69520.1| protein of unknown function DUF81 [Staphylothermus marinus F1]
Length = 250
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 52/83 (62%), Gaps = 3/83 (3%)
Query: 346 AGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 404
AG+ G+LG+GGG + PL + LG+P + + AT+ F + ++S + YLLK + PY
Sbjct: 147 AGLTSGMLGIGGGVLKVPLMTMVLGLPIRTAIATSSFMVGLTASAGSL-VYLLKGYVNPY 205
Query: 405 ALYFFALSII-AAFVGQHVLKKL 426
A+ AL II A +G H+LKK+
Sbjct: 206 AVASLALGIIPGATLGAHMLKKI 228
>gi|145539706|ref|XP_001455543.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423351|emb|CAK88146.1| unnamed protein product [Paramecium tetraurelia]
Length = 484
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 92 GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVI 150
S+G +GGG + P L I+ FDA + ++ C++ + +V+ Q HP + + PVI
Sbjct: 72 ASLGALGGGIVKRPFLQSILNFDASIAGDITACLMISAQIVNMVFIFLQNHPDVPERPVI 131
Query: 151 DYDLALLFQPLLVLGISIG 169
+YD+A+++ + + + +G
Sbjct: 132 NYDIAIIYTLAIPVSLCLG 150
>gi|145515812|ref|XP_001443800.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411200|emb|CAK76403.1| unnamed protein product [Paramecium tetraurelia]
Length = 439
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 73/332 (21%), Positives = 136/332 (40%), Gaps = 27/332 (8%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTL-DIPVIDYDLALLFQPLLV 163
P+LNL++ + A + V C + + + ++HP ++P+++++++++F +
Sbjct: 66 PILNLMLNYPASIATQVGDCFLFVTTTLNSLLLIFEKHPDHPELPLVNFEISIIFNQTIP 125
Query: 164 LGISIGVAFNVIFADWMITVLLIVLF---LVLSTRAFL-KGVETWKKETITKREAAKQLE 219
L S+G F I I LF +L FL K + K+ I KR+ K++
Sbjct: 126 LAWSVGA-----FLQQRIPQFAIYLFQLCFMLGAIPFLWKFTHSQKQLEIDKRD--KKVL 178
Query: 220 LIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILALQ 279
+I E+ + + + E K I+N+ + FG + I+L
Sbjct: 179 VIEKIKTKEDMANETSLNAKELQQYEIFYILNHKKIQIKNLCFI-FGSFIVNQTIVLMRS 237
Query: 280 IAKNYT----TTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRK------IASKGDEGT 329
N + C++ NL+ I + Y + + R + + + D
Sbjct: 238 NKFNNSIIGIDACTLEN---NLILILILCANCIYTILVYWNKRNEEFYKDLVQYRPDYRY 294
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
FY G AG G +G+GGG I+ + L I + ++ATA F +
Sbjct: 295 FTPKKTFWFYYLGGCLAGFSTGFIGMGGGLIMVSILLHKKIIAREAAATAGFGTFMIALN 354
Query: 390 SVVEYYLLKRFPVPYALYFFALSIIA-AFVGQ 420
S+++ +L K FF L II AF+ +
Sbjct: 355 SLIQLFLQKTITTGQMFTFFGLGIIGLAFITK 386
>gi|407465090|ref|YP_006775972.1| hypothetical protein NSED_06140 [Candidatus Nitrosopumilus sp. AR2]
gi|407048278|gb|AFS83030.1| hypothetical protein NSED_06140 [Candidatus Nitrosopumilus sp. AR2]
Length = 255
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 48/93 (51%), Gaps = 4/93 (4%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G AG++G ++GLGGG I+ P+ LG PP +++ +LFA ++ S + Y KR
Sbjct: 10 LGFAAGILGSMIGLGGGVIVVPVLTFLGFPPTAAASNSLFAALSNAIASTISYSKQKRIE 69
Query: 402 VPYALYFFALSI----IAAFVGQHVLKKLIKIL 430
P L LSI + A + V + KIL
Sbjct: 70 YPLGLKLGLLSIPGTVLGALISSDVAPDVFKIL 102
>gi|387196905|gb|AFJ68782.1| hypothetical protein NGATSA_3034300 [Nannochloropsis gaditana
CCMP526]
Length = 278
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 99/181 (54%), Gaps = 8/181 (4%)
Query: 291 LYWVLNLLQIPVAGGVSAY-EAVALYKGRRKIAS-----KGDEGTKWRASQLVFYCACGI 344
+YW + LL +P AG Y + L + K+ + +GD +W + + + +
Sbjct: 91 MYWFVTLLTLPWAGAFFLYIRRLILKEQEAKVLAHYPFVEGD--IRWTPTSTLKFPLLCV 148
Query: 345 TAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPY 404
AG++ G+ G+GGG + GPL LE+G+ P V++A+A I ++S+ + V +Y+ P Y
Sbjct: 149 MAGVLAGVFGVGGGIVKGPLMLEMGVLPPVAAASAATMILYTSASATVAFYVFGLIPGDY 208
Query: 405 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 464
L+FF + VGQ +L +L++ + S+I+ ++ +I +SA+ + +A ++ +
Sbjct: 209 GLFFFFWGFLCTGVGQILLLRLLQHSHKQSLIVLSIGLVITLSAVMMSFQAVADYLENPD 268
Query: 465 H 465
+
Sbjct: 269 N 269
>gi|294887822|ref|XP_002772241.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239876300|gb|EER04057.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 116
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/114 (23%), Positives = 59/114 (51%), Gaps = 1/114 (0%)
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
P+ L LG+ P+ ++AT I +S+ + + + L FP L+ + I A +G+ +
Sbjct: 3 PVVLSLGLDPKQATATTSIVIFATSTSTALSFALGGYFPPASDLWIVVMPFIGALLGKTI 62
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIK-RIEHKEYMGFDNIC 475
+ +LI GR S+++ L ++ + ++ G+ ++K + +E + F + C
Sbjct: 63 VARLIAKTGRMSMLVLLLGTVVIIGGITTISTGIVSVVKGALNGEEVVQFGSFC 116
>gi|83950218|ref|ZP_00958951.1| membrane protein [Roseovarius nubinhibens ISM]
gi|83838117|gb|EAP77413.1| membrane protein [Roseovarius nubinhibens ISM]
Length = 307
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 61/106 (57%), Gaps = 2/106 (1%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
+ G+ G++ G+ G+GGGF++ PL +GIPP V+ AT+ I +S +++ + +
Sbjct: 19 SIGLMVGLLSGIFGVGGGFLITPLLFFIGIPPAVAVATSANQIVAASFSAILAHLRRRTV 78
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+P L ++ + VG ++ L++ LG+ +++ L +++F+
Sbjct: 79 DIPMGLALMIGGLLGSTVGIYIF-NLLRSLGQVDLLV-NLFYVVFL 122
>gi|403360891|gb|EJY80141.1| hypothetical protein OXYTRI_22577 [Oxytricha trifallax]
Length = 388
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 1/114 (0%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIP-VIDYDLALLFQ 159
GI +P L + + + ++ +V+ VYN Q+HP +IDY++ L
Sbjct: 66 GIVIPFLMIFMKMPIQECIPLANIFAMISSVTRFVYNFNQKHPYRPFRMIIDYEIVTLTM 125
Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKRE 213
P++ G IGV + + +LL ++ + F K ++T+ KET +R+
Sbjct: 126 PMVYFGSLIGVYAGSLMNQLTLVILLQIVLAFTLYKTFQKALQTYIKETNRRRQ 179
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 53/93 (56%), Gaps = 6/93 (6%)
Query: 362 GPLFLELGIPPQVSSATALFA---ITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFV 418
P+FL LG+ P V++AT + TFS S+ ++ + YA A+ + A +
Sbjct: 292 SPVFLSLGLLPSVTAATNQYIGMISTFSVSL---QFIYKGQLNYSYAYVIGAVVLFTAII 348
Query: 419 GQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
G V+++++K GR SII+F +SF++ +S L L
Sbjct: 349 GLSVVERVVKKSGRQSIIVFIISFVLLISFLVL 381
>gi|78042783|ref|YP_361264.1| hypothetical protein CHY_2470 [Carboxydothermus hydrogenoformans
Z-2901]
gi|77994898|gb|ABB13797.1| putative membrane protein [Carboxydothermus hydrogenoformans
Z-2901]
Length = 266
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 1/82 (1%)
Query: 342 CGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
GI GM+G L+G GGGFIL P L L PQ ++ T+LF + F++ + Y KR
Sbjct: 9 IGIFVGMIGTLIGAGGGFILIPYLILVAKFSPQTAAGTSLFMVFFNALSGSIAYIRQKRV 68
Query: 401 PVPYALYFFALSIIAAFVGQHV 422
A YF +I A G ++
Sbjct: 69 DFRTAFYFALATIPGAIFGAYL 90
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 65/152 (42%), Gaps = 11/152 (7%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++ IIG F G++ G GGG I +P L L+ F +++A S MV A+S + +
Sbjct: 4 LLNVIIGIFVGMIGTLIGAGGGFILIPYLILVAKFSPQTAAGTSLFMVFFNALSGSIAYI 63
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
RQ+ +D+ A F + G G N + VL +L L+ R F
Sbjct: 64 RQKR-------VDFRTAFYFALATIPGAIFGAYLNSFLHSRLFNVLFALLLFFLAVRTFF 116
Query: 199 KGVETWKKET----ITKREAAKQLELIGNGYQ 226
KKE I E + ++ GN Y+
Sbjct: 117 HKTSNAKKENKETKIPGHETREIIDAEGNKYE 148
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 327 EGTKWRASQLVFYCACGIT--AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAIT 384
EG K+ S + GI+ G++ LLG+GGG I PL LG P +++AT+ F +
Sbjct: 143 EGNKYEYS-FNLWLGIGISLIVGILSSLLGIGGGIIHVPLMGFLGFPMHIATATSHFILV 201
Query: 385 FSSSMSVVEY 394
+S + V+ +
Sbjct: 202 ITSLIGVISH 211
>gi|358344799|ref|XP_003636474.1| hypothetical protein MTR_042s0017 [Medicago truncatula]
gi|355502409|gb|AES83612.1| hypothetical protein MTR_042s0017 [Medicago truncatula]
Length = 107
Score = 47.0 bits (110), Expect = 0.019, Method: Composition-based stats.
Identities = 31/73 (42%), Positives = 42/73 (57%), Gaps = 6/73 (8%)
Query: 214 AAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWA 273
AA++ E I +G + E + LS + + K KV+IIEN+ KEFGLL VW
Sbjct: 3 AARRQETIDSGAEVEY-----KPLSTGSNGALEKGAKMHKVTIIENVCRKEFGLLAFVWV 57
Query: 274 IILALQIAK-NYT 285
++A QIAK NYT
Sbjct: 58 SLIAAQIAKQNYT 70
>gi|402849112|ref|ZP_10897353.1| membrane protein, putative [Rhodovulum sp. PH10]
gi|402500640|gb|EJW12311.1| membrane protein, putative [Rhodovulum sp. PH10]
Length = 310
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 68/129 (52%), Gaps = 12/129 (9%)
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
LVF A G G + G+ G+GGGF++ PL + +G+ P VS AT IT SS + Y+
Sbjct: 16 LVF--AMGTAVGFISGMFGIGGGFLMTPLLIFIGVSPAVSVATVAAHITASSFSGALTYW 73
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHV---LKKLIKILGRASIIIFTLSFIIFVSALSLG 452
+ +P A +I G + L +++ LG+ + I L++++ + S+G
Sbjct: 74 RRRAIDLPLAFVL----LIGGLTGTAIGVWLFGVLRALGQLDLFI-ALAYVVLLC--SVG 126
Query: 453 GVGLAKMIK 461
G+ +A+ ++
Sbjct: 127 GLMVAESVR 135
>gi|119873364|ref|YP_931371.1| hypothetical protein Pisl_1881 [Pyrobaculum islandicum DSM 4184]
gi|119674772|gb|ABL89028.1| protein of unknown function DUF81 [Pyrobaculum islandicum DSM 4184]
Length = 243
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 40/68 (58%), Gaps = 3/68 (4%)
Query: 66 KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
K+ P W +G ++ F G S+ GVGGG IFVP L LI GFD KS+AA+S +
Sbjct: 112 KNARPRRNIPW---LGLLLIFIGGFASSLFGVGGGTIFVPALILISGFDPKSAAAMSMGI 168
Query: 126 VTGGAVSA 133
+ AVS+
Sbjct: 169 IFPTAVSS 176
>gi|294954881|ref|XP_002788341.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239903653|gb|EER20137.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 318
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 111/246 (45%), Gaps = 35/246 (14%)
Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLE 219
P +V+G +GV + VL + ++ S F KG+ WK ET +A QL
Sbjct: 3 PPMVVGGLLGVYLYTWLPEIFQLVLYVFTAILASCMGFKKGIGLWKSET-EGASSAGQLG 61
Query: 220 LIGNGYQTEECKCDPEYLSNDTT-PEETREPKKSKVSIIENIYWKEFGL---LVAVWAII 275
++ P ++ DT P T + SI ++ +K+ L L+AVW +
Sbjct: 62 VV-----------PPPCVTEDTVLPSVTSRAR----SISLSLKYKKAILITTLLAVWIAV 106
Query: 276 LALQIAKNYTTT--------CSVLYWVLN----LLQIPVAG-GVSAYEAVALYKGRRKIA 322
+ ++ ++T C+ LYW L+ ++ + V G V A ++ A+ K K++
Sbjct: 107 ILSRLLLGSSSTPSIIGIPYCTGLYWALSVIVCIMLMAVPGLFVVAIKSAAMLKLAVKLS 166
Query: 323 SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFA 382
++S LV GI G + L+G GGG ++ PL L + + PQ ++AT
Sbjct: 167 GAMLCNATAKSSALVLGNIIGI--GFIAALVGQGGGSLITPLLLYMELNPQQAAATGSVV 224
Query: 383 ITFSSS 388
+ +SS
Sbjct: 225 MLITSS 230
>gi|444375855|ref|ZP_21175107.1| hypothetical protein D515_1881 [Enterovibrio sp. AK16]
gi|443680083|gb|ELT86731.1| hypothetical protein D515_1881 [Enterovibrio sp. AK16]
Length = 257
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
+ L+ + AG + + G GGG ++ P L +G+PP ++ T + TF+SS +
Sbjct: 8 NTLIILGMVALAAGFIDAVAG-GGGMLMVPALLSIGLPPHIALGTNKLSATFASSTAAFT 66
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
YY K F + ++ F + I A +G V+
Sbjct: 67 YYRKKLFSPSFWIHAFVATFIGAILGTLVVD 97
>gi|75674383|ref|YP_316804.1| hypothetical protein Nwi_0184 [Nitrobacter winogradskyi Nb-255]
gi|74419253|gb|ABA03452.1| Protein of unknown function DUF81 [Nitrobacter winogradskyi Nb-255]
Length = 343
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 68/134 (50%), Gaps = 4/134 (2%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
++ A G G V G+ G+GGGF++ PL + +GI P V+ AT I SS + Y
Sbjct: 48 NVLLILAMGAAVGFVSGMFGIGGGFLMTPLLIFVGIAPAVAVATVTSHIAASSFSGALSY 107
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL--SLG 452
+ + AL + +I +G +L + LG+ ++I LS+++ +S + ++
Sbjct: 108 WRKRAVDPMLALVLLSGGVIGTILGVLTFTRL-RSLGQLDLLI-ALSYLVLLSGVGAAMS 165
Query: 453 GVGLAKMIKRIEHK 466
GL M++ +++
Sbjct: 166 WEGLRAMLRARQNR 179
>gi|85705724|ref|ZP_01036821.1| membrane protein [Roseovarius sp. 217]
gi|85669714|gb|EAQ24578.1| membrane protein [Roseovarius sp. 217]
Length = 304
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 61/117 (52%), Gaps = 4/117 (3%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGFIL PL +GIPP V+ AT I +S +V+ ++ K +
Sbjct: 29 GMFGVGGGFILTPLLFFIGIPPAVAVATGANQIMAASFSAVLAHFRRKTVDLEMGTVLLI 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 467
+I A +G V L K LG+ +++ TL +++F+ +GG+ + ++ I +
Sbjct: 89 GGLIGAAMGVFVFNYL-KSLGQVDLLV-TLCYVVFLGV--IGGMMFIESLRSIRNTR 141
>gi|145483119|ref|XP_001427582.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394664|emb|CAK60184.1| unnamed protein product [Paramecium tetraurelia]
Length = 491
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 69/140 (49%), Gaps = 18/140 (12%)
Query: 92 GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQR-------HPT 144
GS+G +GGG + P L I+ +DA + ++ C++ + +++ Q+ HP
Sbjct: 75 GSLGALGGGTVKRPFLQAILNYDASIAGDITSCLMFSAQLVNMIFIFFQKYIMNQFSHPD 134
Query: 145 LD-IPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLI---VLFLVLSTRAFLKG 200
+D P+I+Y++ L++ LGI I + A+++ + L+ VLF V+ + L
Sbjct: 135 VDERPIINYEIGLIY----TLGIPISMCMGSELANFLPLLPLLSFQVLFFVVISPVLLYY 190
Query: 201 VETWKKETITKREAAKQLEL 220
KK+ I +R +L
Sbjct: 191 A---KKQDIIERSKETNNDL 207
>gi|85713974|ref|ZP_01044963.1| hypothetical protein NB311A_07458 [Nitrobacter sp. Nb-311A]
gi|85699100|gb|EAQ36968.1| hypothetical protein NB311A_07458 [Nitrobacter sp. Nb-311A]
Length = 371
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 58/117 (49%), Gaps = 7/117 (5%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
A G G V G+ G+GGGF++ PL + +GI P V+ AT I SS + Y+ +
Sbjct: 82 AMGAAVGFVSGMFGIGGGFLMTPLLIFIGIAPAVAVATVTSHIAASSFSGALSYWRKRAV 141
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 457
A+ + II G +L + LG+ ++I LS+++ +S GVG A
Sbjct: 142 DPMLAVILLSGGIIGTVFGVSTFTRL-RSLGQLDLLI-ALSYLVLLS-----GVGAA 191
>gi|145500024|ref|XP_001435996.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403133|emb|CAK68599.1| unnamed protein product [Paramecium tetraurelia]
Length = 400
Score = 46.2 bits (108), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 86/348 (24%), Positives = 155/348 (44%), Gaps = 38/348 (10%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVL 164
P++ +++G +K + + M+ GG++ + ++R P+I+Y L + PLL+
Sbjct: 40 PVMMVLLGCPSKKAIWNTYIMLLGGSLGNFIRLGKERTANGSAPLINYQLVQITLPLLLA 99
Query: 165 GISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAA------KQL 218
G +GVA W+ +L+++ + FLK + K + K+L
Sbjct: 100 GAILGVAT----GKWLPKLLIVIFLFGILLTVFLKTKSLYTKTRSKEMNEHLIPVELKEL 155
Query: 219 ELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIILAL 278
+ ++E E E E + + II K G + ++ I
Sbjct: 156 SIQNQSTHSKELNIIKEKDGRLYPTEPLTEISLTVLIIIVVTLLKGSGAVPSILGI---- 211
Query: 279 QIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAVALYKGRRKIASKGDE-----GTKWRA 333
+Y C + + LN+ V G++ Y +Y+ RK SK +E G +
Sbjct: 212 ----DY---CGLGFHFLNI----VIFGIACYN---VYR-YRKFISKDEEYKQSIGYDFSD 256
Query: 334 SQL--VF--YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
++ VF + AG +GGL+GLGGG +L PL+LE GI P ++A+A F + F+SS+
Sbjct: 257 GKMSAVFDITVKSSLYAGFLGGLVGLGGGVVLTPLWLETGINPPRAAASATFTVLFTSSI 316
Query: 390 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 437
SV L + + +S +++ VLK +++ R SI+I
Sbjct: 317 SVFIIALSGGYQFEEFIILGLVSSFGSYLVAGVLKHIVQKYQRESILI 364
>gi|407798212|ref|ZP_11145120.1| hypothetical protein OCGS_0193 [Oceaniovalibus guishaninsula
JLT2003]
gi|407059648|gb|EKE45576.1| hypothetical protein OCGS_0193 [Oceaniovalibus guishaninsula
JLT2003]
Length = 303
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G++ G+ G+GGGF++ PL +GIPP V+ AT+ I SS V+ ++ + + L
Sbjct: 25 GILSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIVASSVSGVLAHFRRRTVDLRMGL 84
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
A ++ + G + L + +G+ +++ L +++F+ A+
Sbjct: 85 VLLAGGLVGSAAGVQIFNVL-RQMGQVDLMV-RLCYVVFLGAI 125
>gi|421872468|ref|ZP_16304086.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
gi|372458441|emb|CCF13635.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
Length = 275
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 7/98 (7%)
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSS 388
++ + G+ AG++G L+G+GGG P L G I PQ++S T+L I ++
Sbjct: 7 ILLFMFIGLFAGVMGSLVGIGGGMFFVPALLYFGNMYEPGSITPQIASGTSLLVIAITAL 66
Query: 389 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
S + ++ K+ V A+ FF S A VG +V K L
Sbjct: 67 SSSISFWKQKKVEVQAAVLFFIGSAPGAIVGVYVNKWL 104
>gi|339007279|ref|ZP_08639854.1| hypothetical membrane protein [Brevibacillus laterosporus LMG
15441]
gi|338776488|gb|EGP36016.1| hypothetical membrane protein [Brevibacillus laterosporus LMG
15441]
Length = 275
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 51/98 (52%), Gaps = 7/98 (7%)
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSS 388
++ + G+ AG++G L+G+GGG P L G I PQ++S T+L I ++
Sbjct: 7 ILLFMFIGLFAGVMGSLVGIGGGMFFVPALLYFGNMYEPGSITPQIASGTSLLVIAITAL 66
Query: 389 MSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
S + ++ K+ V A+ FF S A VG +V K L
Sbjct: 67 SSSISFWKQKKVEVQAAVLFFIGSAPGAIVGVYVNKWL 104
>gi|338211621|ref|YP_004655674.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336305440|gb|AEI48542.1| protein of unknown function DUF81 [Runella slithyformis DSM 19594]
Length = 301
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 44/95 (46%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
S LVFY GI A +V G LG+G G L LG+PP VSS + + F++ S +
Sbjct: 57 SDLVFYLLVGIGAQLVDGALGMGYGATATSFLLTLGVPPAVSSTSVHLSEMFTTGASAIS 116
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
++ K + + +G ++L +I
Sbjct: 117 HFKFKNINKKLFKTLLIPGVTGSIIGAYLLSDVID 151
>gi|149202975|ref|ZP_01879946.1| membrane protein [Roseovarius sp. TM1035]
gi|149143521|gb|EDM31557.1| membrane protein [Roseovarius sp. TM1035]
Length = 304
Score = 45.8 bits (107), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 61/117 (52%), Gaps = 4/117 (3%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGFIL PL +GIPP V+ AT I +S +V+ ++ K +
Sbjct: 29 GMFGVGGGFILTPLLFFIGIPPAVAVATGANQIMAASFSAVLAHFRRKTVDLEMGTVLLI 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 467
+I A +G V L K LG+ +++ TL +++F+ +GG+ + ++ I +
Sbjct: 89 GGLIGAALGVMVFNYL-KSLGQVDLLV-TLCYVVFLGV--IGGMMFIESLRAIRNTR 141
>gi|221485444|gb|EEE23725.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 482
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 154/354 (43%), Gaps = 67/354 (18%)
Query: 114 DAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
D + +SK + G + +V++NL ++ P+ ++ +I Y+LA + +P +LG +GV N
Sbjct: 41 DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLLGGVLGVLLN 100
Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET--ITKREA--AKQLELIGNGYQTEE 229
++ D I L+++ + + +G+ ++ E+ + +R A +++ + + EE
Sbjct: 101 IVMTDIQIISCLVLVLSFTTYKTTRRGLLQYQTESRLLAERAAVLSREHHSLSTRDREEE 160
Query: 230 CK---------------CDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG-----LL- 268
C +L+ + E P+ + E + G LL
Sbjct: 161 TSSLLAGEEPRASSAPACGENFLAVSSLAEMMETPQVLLDPLPEGCEADDLGAKGEALLP 220
Query: 269 --VAVWAIILALQI----AKNYTTTCSVLYWV----LNLLQIPVAGGVSAYE-------- 310
A A+IL + AK++ + S++Y + +N L + AGG A
Sbjct: 221 SSSAESAVILKERREILEAKHHPSWISLVYLIFDVAVNTLCLLAAGGPVAVVCGEAWQQA 280
Query: 311 -----------AVALY-------KGRRKIASKGDE-----GTKWRASQ-LVFYCACGITA 346
A ALY K RR+ DE G W + + +V Y + A
Sbjct: 281 LILFLISFHVFATALYARWLLKSKRRREQLGIQDEDVEHGGLGWVSPRTVVLYPFLSLLA 340
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G+ G LG+GGG I GPL LE+G+ + TA F I F+SS + ++Y L R
Sbjct: 341 GVAAGSLGIGGGLIKGPLLLEIGLNSLSAVTTANFMIFFTSSANALQYATLGRL 394
>gi|212704214|ref|ZP_03312342.1| hypothetical protein DESPIG_02269 [Desulfovibrio piger ATCC 29098]
gi|212672418|gb|EEB32901.1| hypothetical protein DESPIG_02269 [Desulfovibrio piger ATCC 29098]
Length = 420
Score = 45.4 bits (106), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 5/122 (4%)
Query: 305 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 364
G AYEA + G+ A T + AS A G+ AG++ G +G GGGFI+ P
Sbjct: 51 GAEAYEAHEMLGGKSAYAPAFYTTTIFLAS-----IAVGLAAGLITGCIGAGGGFIITPA 105
Query: 365 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
+ G+ ++ T LF I + M + L V A+ F S FVG + K
Sbjct: 106 LMAAGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSVKLAIAFLVGSGAGTFVGGAINK 165
Query: 425 KL 426
L
Sbjct: 166 GL 167
>gi|237835457|ref|XP_002367026.1| hypothetical protein TGME49_046180 [Toxoplasma gondii ME49]
gi|211964690|gb|EEA99885.1| hypothetical protein TGME49_046180 [Toxoplasma gondii ME49]
gi|221506300|gb|EEE31935.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 482
Score = 45.4 bits (106), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 154/354 (43%), Gaps = 67/354 (18%)
Query: 114 DAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFN 173
D + +SK + G + +V++NL ++ P+ ++ +I Y+LA + +P +LG +GV N
Sbjct: 41 DVHKAVPLSKVTIFGACLVSVLFNLGRKQPSGNLLLISYELAAILEPFTLLGGVLGVLLN 100
Query: 174 VIFADWMITVLLIVLFLVLSTRAFLKGVETWKKET--ITKREA--AKQLELIGNGYQTEE 229
++ D I L+++ + + +G+ ++ E+ + +R A +++ + + EE
Sbjct: 101 IVMTDIQIISCLVLVLSFTTYKTTRRGLLQYQTESRLLAERAAVLSREHHSLSTRDREEE 160
Query: 230 CK---------------CDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFG-----LL- 268
C +L+ + E P+ + E + G LL
Sbjct: 161 TSSLLAEEEPRASSAPACGENFLAVSSLAEMMETPQVLLDPLPEGCEADDLGAKGEALLP 220
Query: 269 --VAVWAIILALQI----AKNYTTTCSVLYWV----LNLLQIPVAGGVSAYE-------- 310
A A+IL + AK++ + S++Y + +N L + AGG A
Sbjct: 221 SSSAESAVILKERREILEAKHHPSWISLVYLIFDVAVNTLCLLAAGGPVAVVCGEAWQQA 280
Query: 311 -----------AVALY-------KGRRKIASKGDE-----GTKWRASQ-LVFYCACGITA 346
A ALY K RR+ DE G W + + +V Y + A
Sbjct: 281 LILFLISFHVFATALYARWLLKSKRRREQLGIQDEDVEHGGLGWVSPRTVVLYPFLSLLA 340
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G+ G LG+GGG I GPL LE+G+ + TA F I F+SS + ++Y L R
Sbjct: 341 GVAAGSLGIGGGLIKGPLLLEIGLNSLSAVTTANFMIFFTSSANALQYATLGRL 394
>gi|145499407|ref|XP_001435689.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402823|emb|CAK68292.1| unnamed protein product [Paramecium tetraurelia]
Length = 400
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 169/373 (45%), Gaps = 41/373 (10%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
++G ++G G + P++ +++G +K + + M+ GG++ +
Sbjct: 19 ILGILVGLSQAGGIGGGPIIS-----PVMMVLLGCPSKKAIWNTYIMLLGGSLGNFLRLG 73
Query: 139 RQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFL 198
++R + P+I+Y L + PLL+ G +GVA W+ +++++ + FL
Sbjct: 74 KERTANGNAPLINYQLVQITLPLLLAGAILGVA----TGKWLPKLIIVIFLFGILMTVFL 129
Query: 199 KGVETWKKETITKREAAKQLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIE 258
K + K +E +QL + + ++ E SN + + K +++ IE
Sbjct: 130 KTKSLYAKT--RSKEMNEQLIPV----ELKDLTVQKE--SNHSKELNILKEKDARLYPIE 181
Query: 259 NIYWKEFGLLVAVWAIILALQIAKNYTTT-----CSVLYWVLNLLQIPVAGGVSAYEAVA 313
+ E L + + ++ L+ + + C Y LN V G++ Y
Sbjct: 182 PL--TEISLTILIIIVVTLLKGSGAVPSLLGVDFCGYGYHFLNF----VIFGIAFYNV-- 233
Query: 314 LYKGRRKIASKGDE-----GTKWRASQL--VF--YCACGITAGMVGGLLGLGGGFILGPL 364
+ RK SK +E G + ++ VF + AG +GGL+GLGGG +L PL
Sbjct: 234 --QRYRKQISKDEEYRESIGYDFADGKMSSVFDITVKSSLYAGFLGGLVGLGGGVVLTPL 291
Query: 365 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
+LE GI P ++A+A F + F+SS+SV L + + +S + +++ LK
Sbjct: 292 WLETGINPPRAAASATFTVLFTSSISVFIIALSGGYQFSEFIILGLVSSLGSYLVAGFLK 351
Query: 425 KLIKILGRASIII 437
KL+K R SI+I
Sbjct: 352 KLVKKYKRESILI 364
>gi|145479595|ref|XP_001425820.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392892|emb|CAK58422.1| unnamed protein product [Paramecium tetraurelia]
Length = 494
Score = 45.1 bits (105), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 46/86 (53%), Gaps = 8/86 (9%)
Query: 92 GSVGGVG-------GGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT 144
G++GG GG + P L I+ FDA S ++ C++ + +V+ Q HP
Sbjct: 75 GALGGTSLQIIFLKGGIVKRPFLQTILNFDASISGDITACLMISAQIVNMVFIFLQNHPD 134
Query: 145 L-DIPVIDYDLALLFQPLLVLGISIG 169
+ + PVI+YD+A+++ + + + +G
Sbjct: 135 VQERPVINYDIAIIYTLAIPVSLCLG 160
>gi|452824321|gb|EME31325.1| hypothetical protein Gasu_15590 [Galdieria sulphuraria]
Length = 322
Score = 44.7 bits (104), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 50/91 (54%), Gaps = 1/91 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G +AG + L+G GGG +L PLF+ G+ + + T+L A++F+S ++ Y+L R
Sbjct: 75 GFSAGCIASLVGAGGGVVLTPLFVTFYGLTQKQAQGTSLVAVSFNSLVASFIYFLGGRVL 134
Query: 402 VPYALYFFALSIIAAFVGQHVLKKLIKILGR 432
+ A + ++I A VG V KL + R
Sbjct: 135 LLPAFFLTLTAVICARVGAKVTSKLDNLFLR 165
>gi|298290659|ref|YP_003692598.1| hypothetical protein Snov_0646 [Starkeya novella DSM 506]
gi|296927170|gb|ADH87979.1| protein of unknown function DUF81 [Starkeya novella DSM 506]
Length = 308
Score = 44.7 bits (104), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 3/54 (5%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA---TALFAITFSSSMS 390
A GI G + G+ G+GGGF++ PL + LG+PP V+ A T + A +FS +++
Sbjct: 18 LALGIAVGFISGMFGVGGGFLMTPLLIFLGVPPAVAVASVSTHMAASSFSGTLT 71
>gi|315230493|ref|YP_004070929.1| hypothetical protein TERMP_00729 [Thermococcus barophilus MP]
gi|315183521|gb|ADT83706.1| hypothetical protein TERMP_00729 [Thermococcus barophilus MP]
Length = 254
Score = 44.7 bits (104), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 42/81 (51%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y A G+ G++ L GLGGGF++ P LG+ + T+ A+ F+S S + Y K
Sbjct: 7 YFAVGVVIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSALAYSRQK 66
Query: 399 RFPVPYALYFFALSIIAAFVG 419
R L + +II A++G
Sbjct: 67 RVHYKVGLLLASTAIIGAYIG 87
>gi|303325610|ref|ZP_07356053.1| putative membrane protein [Desulfovibrio sp. 3_1_syn3]
gi|345892217|ref|ZP_08843040.1| hypothetical protein HMPREF1022_01700 [Desulfovibrio sp.
6_1_46AFAA]
gi|302863526|gb|EFL86457.1| putative membrane protein [Desulfovibrio sp. 3_1_syn3]
gi|345047356|gb|EGW51221.1| hypothetical protein HMPREF1022_01700 [Desulfovibrio sp.
6_1_46AFAA]
Length = 421
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 58/129 (44%), Gaps = 7/129 (5%)
Query: 300 IPVAGG--VSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGG 357
+P+ G AYE + G+ A T + AS A G+ AG++ G +G GG
Sbjct: 44 VPILAGCLAEAYEYHEMLGGKSAYAPAFYTTTIFLAS-----IAVGLAAGLITGCIGAGG 98
Query: 358 GFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAF 417
GFI+ P + +G+ ++ T LF I + M + L A+ F SI+ F
Sbjct: 99 GFIITPALMAVGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSAKLAIAFLVGSIVGTF 158
Query: 418 VGQHVLKKL 426
+G + K L
Sbjct: 159 IGGAINKGL 167
>gi|399056281|ref|ZP_10743692.1| putative permease [Brevibacillus sp. CF112]
gi|433543570|ref|ZP_20499974.1| hypothetical protein D478_07713 [Brevibacillus agri BAB-2500]
gi|398046272|gb|EJL38897.1| putative permease [Brevibacillus sp. CF112]
gi|432185127|gb|ELK42624.1| hypothetical protein D478_07713 [Brevibacillus agri BAB-2500]
Length = 274
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 7/91 (7%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSV 391
+ G+ AG++G + GLGGG P + +PPQV++AT+L I ++ S
Sbjct: 9 FLLIGVVAGVIGSIAGLGGGIFFVPALMFFANWYEPGSMPPQVAAATSLLVIAVTALSSS 68
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
+ Y K+ + AL FF S A VG ++
Sbjct: 69 ISYIKQKKVDMQSALLFFIGSAPGAIVGVYL 99
>gi|317130121|ref|YP_004096403.1| hypothetical protein Bcell_3430 [Bacillus cellulosilyticus DSM
2522]
gi|315475069|gb|ADU31672.1| protein of unknown function DUF81 [Bacillus cellulosilyticus DSM
2522]
Length = 274
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 61/132 (46%), Gaps = 7/132 (5%)
Query: 71 DIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGA 130
+I +G+ ++G +IGF C + G+GGG + VP + L+ GF + A S M+ A
Sbjct: 148 EITYGYHPIIGILIGFVVGMCSGLFGIGGGSLMVPAMILLFGFPPHIAVATSMLMIFLSA 207
Query: 131 VSAVVYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFL 190
+ + V ++ + ID+ L P +G +G A N I LL V +
Sbjct: 208 ILSSVSHMALGN-------IDWLYVLALLPGAWIGGQLGAAINRKMQSDAIVNLLRVFLI 260
Query: 191 VLSTRAFLKGVE 202
++ R +G+
Sbjct: 261 IIGIRLIYQGIT 272
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 42/86 (48%), Gaps = 1/86 (1%)
Query: 342 CGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G GM GL G+GGG ++ P + L G PP ++ AT++ I S+ +S V + L
Sbjct: 161 IGFVVGMCSGLFGIGGGSLMVPAMILLFGFPPHIAVATSMLMIFLSAILSSVSHMALGNI 220
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKL 426
Y L + I +G + +K+
Sbjct: 221 DWLYVLALLPGAWIGGQLGAAINRKM 246
>gi|452853551|ref|YP_007495235.1| conserved membrane protein of unknown function [Desulfovibrio
piezophilus]
gi|451897205|emb|CCH50084.1| conserved membrane protein of unknown function [Desulfovibrio
piezophilus]
Length = 309
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 66/129 (51%), Gaps = 9/129 (6%)
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
LVF A G G++ G+ G+GGGF++ PL + GIPP V++A+ I +S+ + +Y
Sbjct: 17 LVF--ALGGFVGLLSGIFGVGGGFLMTPLLIMFGIPPTVAAASDSNQIVGASTSGCLAHY 74
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVG 455
L ++ F G V+K L K +G A +I ++++ L LGGVG
Sbjct: 75 RLGNVDFKMGFLLLIGGVLGGFAGVQVIKVL-KAMGNADFLI-NITYV-----LMLGGVG 127
Query: 456 LAKMIKRIE 464
I+ ++
Sbjct: 128 SYMFIESVQ 136
>gi|256419285|ref|YP_003119938.1| hypothetical protein Cpin_0237 [Chitinophaga pinensis DSM 2588]
gi|256034193|gb|ACU57737.1| protein of unknown function DUF81 [Chitinophaga pinensis DSM 2588]
Length = 335
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 43/92 (46%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+L+FY G+ A V G LG+ G L L LGIPP V+SA+ A F++ S + +
Sbjct: 79 ELLFYILVGLAAQTVDGALGMAYGATSSSLLLGLGIPPSVASASVHVAEVFTTGASGISH 138
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ LY II + G ++ +
Sbjct: 139 FRFGNVNKKLFLYLLVPGIIGSIAGSFLISNM 170
>gi|242279660|ref|YP_002991789.1| hypothetical protein Desal_2192 [Desulfovibrio salexigens DSM 2638]
gi|242122554|gb|ACS80250.1| protein of unknown function DUF81 [Desulfovibrio salexigens DSM
2638]
Length = 426
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 11/130 (8%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
A G+ AG++ G +G GGGFI+ P + GI ++ T LF I + M V + +
Sbjct: 86 AIGLGAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKMGNV 145
Query: 401 PVPYALYFFALSIIAAFVG---QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 457
VP A+ F +I+ A G VL ++ +L A FI V +L LG +G
Sbjct: 146 SVPLAVVFLIGAIVGATAGGIINRVLYEINPVLSDA--------FITTVYSLMLGFLGFY 197
Query: 458 KMIKRIEHKE 467
M ++ ++
Sbjct: 198 AMFDYLKARK 207
>gi|170741286|ref|YP_001769941.1| hypothetical protein M446_3097 [Methylobacterium sp. 4-46]
gi|168195560|gb|ACA17507.1| protein of unknown function DUF81 [Methylobacterium sp. 4-46]
Length = 307
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVV 392
G G V G+ G+GGGF++ P+ + LGIPP ++ AT I SS+ SV+
Sbjct: 20 LGAAVGFVSGVFGIGGGFLMTPILIVLGIPPAIAVATQTAPIVASSTTSVL 70
>gi|268591933|ref|ZP_06126154.1| putative membrane protein [Providencia rettgeri DSM 1131]
gi|291312322|gb|EFE52775.1| putative membrane protein [Providencia rettgeri DSM 1131]
Length = 264
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 71/156 (45%), Gaps = 23/156 (14%)
Query: 309 YEAVALYKGRRKIASKGDEGTK-------------WRASQLVFYCACGITAGMVGGLLGL 355
+ V LY G R ++GDE W + + G+ G++ G+LG+
Sbjct: 101 FSLVMLYVGGRIFFNRGDEQNDAHCLINTETGKFIWNTKTALIFGGIGVITGLLTGMLGV 160
Query: 356 GGGFILGPL---FLELGIPPQVSSATALFAITFSSSMSV-VEYYLLKRFPVPYALYFFAL 411
GGGFI+ P F L + + AT+L I S+S+ + + ++P ++ F A
Sbjct: 161 GGGFIIVPALRKFTNLNMKSII--ATSLMVIFCIGSISIFINVWNGFQYPKEISIVFIAS 218
Query: 412 SIIAAFVGQHVL----KKLIKILGRASIIIFTLSFI 443
++ +G+ ++ K I+IL +++I L I
Sbjct: 219 CVVGLLIGRLIMHYISNKAIQILFSSAVIFVALGLI 254
>gi|332295235|ref|YP_004437158.1| hypothetical protein Thena_0382 [Thermodesulfobium narugense DSM
14796]
gi|332178338|gb|AEE14027.1| protein of unknown function DUF81 [Thermodesulfobium narugense DSM
14796]
Length = 253
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 40/70 (57%)
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 377
++K + G++ + L G AG V GLLG+GGG ++ P+ + +G+P + ++A
Sbjct: 117 KQKKNNSGEDFQINKVQMLTIGVPIGAFAGFVAGLLGIGGGNMIIPILIFIGVPARFAAA 176
Query: 378 TALFAITFSS 387
T F + FSS
Sbjct: 177 TTSFIVLFSS 186
>gi|113970806|ref|YP_734599.1| hypothetical protein Shewmr4_2471 [Shewanella sp. MR-4]
gi|113885490|gb|ABI39542.1| protein of unknown function DUF81 [Shewanella sp. MR-4]
Length = 257
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 11/91 (12%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY---LLKR 399
G+ AG + ++G GGG + P L LGIPP + T FA +F SSM+ YY LLK
Sbjct: 17 GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKIL 430
F+ ++ IA F+G + L+ +L
Sbjct: 76 -------SFWYMAFIATFIGAVLGSVLVYLL 99
>gi|254467336|ref|ZP_05080747.1| permease [Rhodobacterales bacterium Y4I]
gi|206688244|gb|EDZ48726.1| permease [Rhodobacterales bacterium Y4I]
Length = 285
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 55/101 (54%), Gaps = 2/101 (1%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ + + L
Sbjct: 3 GVLSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDIKMGL 62
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 447
A ++ A +G V L K LG+ +++ L +++F+
Sbjct: 63 VLQAGGLMGAALGVVVFNYL-KALGQVDLLV-KLCYVVFLG 101
>gi|145494802|ref|XP_001433395.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400512|emb|CAK65998.1| unnamed protein product [Paramecium tetraurelia]
Length = 466
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 79/378 (20%), Positives = 158/378 (41%), Gaps = 44/378 (11%)
Query: 109 LIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQPLLVLGIS 167
+I+ + + ++ C++ GG + + + +++P P+I+Y ++++F + +
Sbjct: 72 MILNYTQSEATHIAYCLMFGGTLLNTILLMFEKNPEDQRRPIINYRISIIFNLAVPFATN 131
Query: 168 IGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETIT----KREAAKQLELIGN 223
+G + F + T++L LFL K + K E ++ ++ ++ L L G+
Sbjct: 132 LGSSL-ASFLPQLYTLILQELFLFAVAPILWKKAQKAKSEELSTPDKQKNESQALNLDGS 190
Query: 224 GYQT--EECKCDPEYLSNDTTPEETREPKKSKVSIIENIYWKEFGLLVAVWAIIL----- 276
Q E K + + S T K++ ++ + +L + A+ +
Sbjct: 191 NTQQKIELQKIEEQQYSVSLTSFSVENSKQNSNTLYYQFKQETENILPFMPALFILGSFG 250
Query: 277 --ALQIAKNYTTTCSVLY-------WVLNLLQIPVAGGVSAYEAVALYKGRR------KI 321
+ I T Y W + + + + ++ VA G R ++
Sbjct: 251 LNQIFIQMRSTNPSKPSYVGIYDCTWQNDFMILILIIANVLFDYVAWNFGSRQEKYFDQL 310
Query: 322 ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
+E S+ A G AG V G LG+G GF++ P L G+ P+ +SAT+ F
Sbjct: 311 NYLPNERYFTPISRFFKIYAGGFGAGFVSGFLGMGAGFVMVPTLLYSGLIPRCASATSAF 370
Query: 382 AITFSSSMS-----VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASII 436
I F S++ + +YL ++ L F L++I G V+ K+ IL I
Sbjct: 371 -IYFMISLNNLITLLTNHYLDQQM----ILLFTGLAVI----GGSVITKIGYILLSKYKI 421
Query: 437 IFTLSFIIFVSALSLGGV 454
+T+ I+F AL + +
Sbjct: 422 GYTVILIVF--ALDIANI 437
>gi|294500093|ref|YP_003563793.1| hypothetical protein BMQ_3337 [Bacillus megaterium QM B1551]
gi|294350030|gb|ADE70359.1| putative membrane protein [Bacillus megaterium QM B1551]
Length = 258
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 51/82 (62%), Gaps = 3/82 (3%)
Query: 347 GMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPY 404
G V G+LG GG FIL P+ L L IP +++ AT+L A+TF SS+ S V ++ + P
Sbjct: 153 GAVAGVLGAGGAFILVPVMLVVLKIPTRITIATSL-AVTFISSIGSTVGKLIMHQVPFIP 211
Query: 405 ALYFFALSIIAAFVGQHVLKKL 426
AL A S+IA+ +G V +K+
Sbjct: 212 ALILVAASLIASPIGAKVGQKM 233
>gi|254475159|ref|ZP_05088545.1| membrane protein [Ruegeria sp. R11]
gi|214029402|gb|EEB70237.1| membrane protein [Ruegeria sp. R11]
Length = 307
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 54/100 (54%), Gaps = 2/100 (2%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ + + L
Sbjct: 25 GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDIKMGL 84
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+I A +G V L K LG+ +++ L +++F+
Sbjct: 85 VLQVGGLIGAALGVVVFNYL-KALGQVDLLV-KLCYVVFL 122
>gi|259415120|ref|ZP_05739042.1| membrane protein [Silicibacter sp. TrichCH4B]
gi|259349030|gb|EEW60784.1| membrane protein [Silicibacter sp. TrichCH4B]
Length = 305
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 5/104 (4%)
Query: 346 AGMVG---GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
GMVG G+ G+GGGF+L PL +GIPP V+ AT I SS V+ + + +
Sbjct: 21 GGMVGVLSGMFGVGGGFLLTPLLFFIGIPPAVAVATGANQIVASSVSGVLAHLRRRTVDL 80
Query: 403 PYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
A +I A +G + L K LG+ + + TL +++F+
Sbjct: 81 KMGAVLQAGGLIGAALGVAIFNYL-KSLGQIDLFV-TLCYVVFL 122
>gi|452965232|gb|EME70258.1| permease [Magnetospirillum sp. SO-1]
Length = 301
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 67/124 (54%), Gaps = 9/124 (7%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
A G G++ G+ G+GGGF+L PL + LGIPP V+ A+ + SS V ++ +R
Sbjct: 20 AVGGGIGVLSGIFGVGGGFLLTPLLIMLGIPPAVAVASGANQVLGSSVSGVFAHW--RRR 77
Query: 401 PVPYALYFFALSIIAAFVGQHV-LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + F L+ A V L L+K LG+ ++I +LS++ F LG VGL +
Sbjct: 78 NVDIRMAVFLLAGGFAGSAAGVWLFALLKRLGQIDLVI-SLSYVGF-----LGSVGLLML 131
Query: 460 IKRI 463
++ +
Sbjct: 132 VETL 135
>gi|114048031|ref|YP_738581.1| hypothetical protein Shewmr7_2539 [Shewanella sp. MR-7]
gi|113889473|gb|ABI43524.1| protein of unknown function DUF81 [Shewanella sp. MR-7]
Length = 257
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 48/91 (52%), Gaps = 11/91 (12%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY---LLKR 399
G+ AG + ++G GGG + P L LGIPP + T FA +F SSM+ YY LLK
Sbjct: 17 GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKIL 430
F+ ++ IA F+G + L+ +L
Sbjct: 76 -------SFWYMAFIATFIGAVLGSVLVYLL 99
>gi|57640914|ref|YP_183392.1| hypothetical protein TK0979 [Thermococcus kodakarensis KOD1]
gi|57159238|dbj|BAD85168.1| hypothetical membrane protein, conserved, DUF81 family
[Thermococcus kodakarensis KOD1]
Length = 254
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 42/81 (51%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y GI G++ + GLGGGF++ P LG+ + T+ A+ F+S S + Y+ +
Sbjct: 7 YLGVGIVIGILAAMFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSAIAYHRQR 66
Query: 399 RFPVPYALYFFALSIIAAFVG 419
R L + ++I A++G
Sbjct: 67 RIHYKAGLLLASTAVIGAYIG 87
>gi|117921072|ref|YP_870264.1| hypothetical protein Shewana3_2631 [Shewanella sp. ANA-3]
gi|117613404|gb|ABK48858.1| protein of unknown function DUF81 [Shewanella sp. ANA-3]
Length = 257
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 11/80 (13%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY---LLKR 399
G+ AG + ++G GGG + P L LGIPP + T FA +F SSM+ YY LLK
Sbjct: 17 GLIAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWTYYRQHLLKP 75
Query: 400 FPVPYALYFFALSIIAAFVG 419
F+ ++ IA F+G
Sbjct: 76 -------SFWYMAFIATFIG 88
>gi|84685127|ref|ZP_01013026.1| membrane protein [Maritimibacter alkaliphilus HTCC2654]
gi|84666859|gb|EAQ13330.1| membrane protein [Rhodobacterales bacterium HTCC2654]
Length = 304
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 51/96 (53%), Gaps = 2/96 (2%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ + K + +
Sbjct: 31 GMFGVGGGFLMTPLLFMIGIPPAVAVATGTNQIVASSVSGVLAHLKRKTVDLKMGCVLLS 90
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
++ A G + L + +G+ +++ TLS+++F+
Sbjct: 91 GGLVGAAAGVQLFAYL-RSIGQVDLLV-TLSYVVFL 124
>gi|86606127|ref|YP_474890.1| hypothetical protein CYA_1459 [Synechococcus sp. JA-3-3Ab]
gi|86554669|gb|ABC99627.1| putative membrane protein [Synechococcus sp. JA-3-3Ab]
Length = 317
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 35/53 (66%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
G+ AG++ G LG+GGG IL PL + LGIPP + AT+ F+I +S ++ +
Sbjct: 31 GLAAGILAGFLGIGGGTILVPLQVSLGIPPLQAVATSNFSILLTSLAGSIQNW 83
>gi|24373383|ref|NP_717426.1| 4-toluene sulfonate uptake permease family protein [Shewanella
oneidensis MR-1]
gi|24347649|gb|AAN54870.1| 4-toluene sulfonate uptake permease family protein [Shewanella
oneidensis MR-1]
Length = 257
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 50/100 (50%), Gaps = 11/100 (11%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
S A G+ AG + ++G GGG + P L LGIPP + T FA +F SSM+
Sbjct: 8 SNWALLAAIGLVAGFIDAVVG-GGGLLSIPALLTLGIPPHTALGTNKFAASFGSSMAAWT 66
Query: 394 YY---LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 430
+Y LLK F+ ++ +A F+G + L+ +L
Sbjct: 67 FYRQHLLKP-------AFWYMAFVATFIGAVLGSILVYLL 99
>gi|23099829|ref|NP_693295.1| hypothetical protein OB2374 [Oceanobacillus iheyensis HTE831]
gi|22778060|dbj|BAC14330.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
Length = 285
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 48/98 (48%), Gaps = 7/98 (7%)
Query: 314 LYKGRRKIASKGDEGTKWRASQLVFYCACGIT--AGMVGGLLGLGGGFILGPLFLEL-GI 370
L G R KG+ +L F A I GM+ GL G+GGG I+ P+ + L G
Sbjct: 148 LEPGARSFELKGEV----YIYKLSFISAISIALFVGMLSGLFGIGGGAIMVPVMMLLFGF 203
Query: 371 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYF 408
P +++AT++F I F S M + + L Y L+F
Sbjct: 204 PAHLAAATSMFMIIFVSFMGSITHIYLGNVVWEYVLFF 241
Score = 38.1 bits (87), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 54/106 (50%), Gaps = 7/106 (6%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDLA 155
G+GGG I VP++ L+ GF A +AA S M+ V + H L V +Y
Sbjct: 186 GIGGGAIMVPVMMLLFGFPAHLAAATSMFMII-----FVSFMGSITHIYLGNVVWEY--V 238
Query: 156 LLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGV 201
L F P +G +G N + ++ ++ +LL V+ +++ R ++G+
Sbjct: 239 LFFIPGAWIGGKLGAKVNHLLSNNVLELLLKVMLILVGIRMIMQGI 284
>gi|436841460|ref|YP_007325838.1| conserved membrane protein of unknown function [Desulfovibrio
hydrothermalis AM13 = DSM 14728]
gi|432170366|emb|CCO23737.1| conserved membrane protein of unknown function [Desulfovibrio
hydrothermalis AM13 = DSM 14728]
Length = 426
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 12/178 (6%)
Query: 294 VLNLLQIPVA-GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGL 352
VL L+ IPV GG++ +A K + + A G+ AG++ G
Sbjct: 38 VLGLMLIPVILGGIAFASDIAPVLPDLLGGKKAYSPAFYSLGIFIVSIAIGMGAGLITGC 97
Query: 353 LGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALS 412
+G GGGFI+ P + GI ++ T LF I + M V + + VP A F +
Sbjct: 98 IGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAMMGSVIHRKMGNVSVPLAGVFLIGA 157
Query: 413 IIAAFVG---QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE 467
I+ A G VL ++ +L A FI V +L LG +G M+ ++ ++
Sbjct: 158 IVGATAGGVINRVLYEINPVLSDA--------FITTVYSLMLGFLGTYAMMDYLKARK 207
>gi|452973225|gb|EME73047.1| hypothetical protein BSONL12_14964 [Bacillus sonorensis L12]
Length = 272
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 45/91 (49%), Gaps = 8/91 (8%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLEL-GIP-------PQVSSATALFAITFSSSMSVVE 393
G+ AG +G L+GLGGG ++ P L L GIP PQ++ T+L I F+ S +
Sbjct: 9 LGLIAGTIGSLVGLGGGIVIVPSLLFLSGIPGVFDHVTPQMAVGTSLLVIIFTGLSSTIA 68
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
Y K L FF S + +G HV K
Sbjct: 69 YMKYKTVDYKSGLIFFIGSGPGSIIGAHVSK 99
>gi|303246503|ref|ZP_07332782.1| protein of unknown function DUF81 [Desulfovibrio fructosovorans JJ]
gi|302492213|gb|EFL52088.1| protein of unknown function DUF81 [Desulfovibrio fructosovorans JJ]
Length = 420
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 53/119 (44%), Gaps = 5/119 (4%)
Query: 306 VSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF 365
VS EA G+ A T + AS A G+ AG++ G +G GGGFI+ P
Sbjct: 49 VSFAEAGGYLGGKHAYAPAFYTSTIFLAS-----IAVGLAAGLITGCIGAGGGFIITPAL 103
Query: 366 LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
+ +G+ ++ T LF I + M + L V A+ F SI FVG + K
Sbjct: 104 MSVGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSVKLAIAFLVGSISGTFVGGAINK 162
>gi|301102374|ref|XP_002900274.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102015|gb|EEY60067.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 340
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 61/133 (45%), Gaps = 6/133 (4%)
Query: 317 GRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVS 375
GRR + S E T A L A G AGMV L G+GGG IL P + + Q
Sbjct: 21 GRRVLCSAPKERTSSAAIPLGVGVATGGVAGMVSALTGVGGGIILIPAMAKFTSLSQQAI 80
Query: 376 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG---QHVL--KKLIKIL 430
+ T+L A+ +SS+ Y VP AL +I+A G H L KKL ++
Sbjct: 81 NGTSLGAVAIASSVGAWNYLQSGLCNVPLALTTTIPAILATKYGVRTAHRLTSKKLSLVV 140
Query: 431 GRASIIIFTLSFI 443
G A ++ L F+
Sbjct: 141 GSAMLLCSPLIFL 153
>gi|163744253|ref|ZP_02151613.1| integral membrane protein, putative [Oceanibulbus indolifex HEL-45]
gi|161381071|gb|EDQ05480.1| integral membrane protein, putative [Oceanibulbus indolifex HEL-45]
Length = 304
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 57/98 (58%), Gaps = 6/98 (6%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL--YF 408
G+ G+GGGF++ PL +GIPP V+ AT+ I SS +++ + LKR V + +
Sbjct: 29 GMFGVGGGFLITPLLFFVGIPPAVAVATSTNQIVASSFSALLAH--LKRRTVDFRMGWVL 86
Query: 409 FALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
A +I + +G + L K LG+ +++ TL ++IF+
Sbjct: 87 LAGGLIGSGIGMFIFNYL-KSLGQVDLLV-TLCYVIFL 122
>gi|23016247|ref|ZP_00056005.1| COG0730: Predicted permeases [Magnetospirillum magnetotacticum
MS-1]
Length = 301
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 60/112 (53%), Gaps = 13/112 (11%)
Query: 355 LGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSII 414
+GGGF+L PL + LGIPP V+ A+ + SS V ++ + V A++ ++
Sbjct: 34 VGGGFLLTPLLIMLGIPPAVAVASGANQVLGSSVSGVFAHWRRRNVDVKMAVFL----LV 89
Query: 415 AAFVGQHV---LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRI 463
F G V L L+K LG+ ++I +LS++ F LG VGL +I+ +
Sbjct: 90 GGFAGSGVGVWLFALLKRLGQIDLVI-SLSYVGF-----LGSVGLLMLIETL 135
>gi|242398129|ref|YP_002993553.1| permease [Thermococcus sibiricus MM 739]
gi|242264522|gb|ACS89204.1| Predicted permease [Thermococcus sibiricus MM 739]
Length = 254
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 43/81 (53%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y + GI G++ L GLGGGF++ P LG+ + T+ A+ F+S S + Y+ +
Sbjct: 7 YFSIGIFVGILAALFGLGGGFLVVPTLNLLGVEIHHAVGTSSAAVVFTSLSSAIAYHRQR 66
Query: 399 RFPVPYALYFFALSIIAAFVG 419
R L + ++I A++G
Sbjct: 67 RIHYKAGLLLASTAVIGAYIG 87
>gi|340027139|ref|ZP_08663202.1| hypothetical protein PaTRP_00390 [Paracoccus sp. TRP]
Length = 307
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 64/132 (48%), Gaps = 18/132 (13%)
Query: 313 ALYKGRR----KIASKGDEG--------TKWRASQL----VFYCACGITAGMVGGLLGLG 356
AL++ RR K + G K+RAS L + G G++G ++G+G
Sbjct: 136 ALHRARRSGPIKPMRRHQHGWVHKWPLKMKFRASGLYISAIPVLMVGAAVGVLGAIMGVG 195
Query: 357 GGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR-FPVPYALYFFALSII 414
GGFI+ P + LG+P +V T+LF ITF S+ + + + + V A+ +I
Sbjct: 196 GGFIMVPAMIYLLGMPTKVVIGTSLFQITFLSAYTTLMHAVSSNTVDVMLAVLLIVGGVI 255
Query: 415 AAFVGQHVLKKL 426
A VG H+ +L
Sbjct: 256 GAQVGTHLGARL 267
>gi|407787737|ref|ZP_11134876.1| hypothetical protein B30_16868 [Celeribacter baekdonensis B30]
gi|407199016|gb|EKE69040.1| hypothetical protein B30_16868 [Celeribacter baekdonensis B30]
Length = 302
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 53/103 (51%), Gaps = 6/103 (5%)
Query: 330 KWRASQLVFYC----ACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAIT 384
K+RAS L A G+ GM+G ++G+GGGFI+ P + LG+P +V T+LF I
Sbjct: 164 KFRASGLYISAIPPIAVGMMVGMLGAVMGVGGGFIMVPAMIYLLGMPTKVVIGTSLFQII 223
Query: 385 FSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
F++ +++ Y V A +I A +G + K+
Sbjct: 224 FTAGFTTLLHAYTNHTVDVLLAFLLLVGGVIGAQIGTQIGLKM 266
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 64/118 (54%), Gaps = 7/118 (5%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G++ G+ G+GGGF++ PL +GIPP V+ AT+ I SS V+ ++ K + L
Sbjct: 25 GVLSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIVASSVSGVLAHFKRKTVDLKMGL 84
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIE 464
++ + +G + ++K +G+ +++ TL +++F LG +G +I+ I
Sbjct: 85 VLLGGGLVGSAIGIQIF-NMLKAMGQVDLLV-TLCYVVF-----LGIIGFLMLIESIR 135
>gi|154250913|ref|YP_001411737.1| hypothetical protein Plav_0457 [Parvibaculum lavamentivorans DS-1]
gi|154154863|gb|ABS62080.1| protein of unknown function DUF81 [Parvibaculum lavamentivorans
DS-1]
Length = 307
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 33/59 (55%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
A G T G + G+ G+GGGF++ PL + LGIPP V+ T + SS + ++ K
Sbjct: 18 LAMGATVGFLSGMFGIGGGFLMTPLLIFLGIPPAVAVGTQTTQVVASSVTGALAHFTRK 76
>gi|323144141|ref|ZP_08078778.1| hypothetical protein HMPREF9444_01429 [Succinatimonas hippei YIT
12066]
gi|322416088|gb|EFY06785.1| hypothetical protein HMPREF9444_01429 [Succinatimonas hippei YIT
12066]
Length = 251
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 61/120 (50%), Gaps = 7/120 (5%)
Query: 96 GVGGGGIFVPMLNLIV-GFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
G+GGG + VP+L + F + AA S +V G A+ + Y +RQ+ I Y
Sbjct: 6 GIGGGVLMVPLLMTVFPQFSMQMVAATSLSIVIGSALINLTYFIRQKIA------ISYKG 59
Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
+L+ +++G+ G + IF +I + IV L+L+ + FLK ++++ + K A
Sbjct: 60 LVLWSAGMIIGVQGGFELSFIFHPNVIVGIFIVTMLILAVKTFLKLKKSYQVKEDVKVNA 119
>gi|108757215|ref|YP_628460.1| hypothetical protein MXAN_0177 [Myxococcus xanthus DK 1622]
gi|108461095|gb|ABF86280.1| membrane protein [Myxococcus xanthus DK 1622]
Length = 293
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 77/163 (47%), Gaps = 14/163 (8%)
Query: 304 GGVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
G V AVA+ + RR+ + A++++ A GI G + GL+G GGGF++ P
Sbjct: 104 GSVMVAAAVAMLR-RREAGPAATSVSPLPAARVL---AQGIAVGALSGLVGAGGGFLIVP 159
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
G+P V++AT+L I + +V + L +P+ L L +AA G V
Sbjct: 160 ALSLAGLPMPVATATSLVVIALQCAAGLVGH--LGHLDLPWVLTGEVL--LAAMTGSLVG 215
Query: 424 KKLIKILGRASIIIFTLSFIIFV---SALSLGGVGLAKMIKRI 463
+L GR S + F +FV +A LG A + +R+
Sbjct: 216 GRLA---GRVSPAMLRKGFAVFVLTTAAFLLGAQSPAPLRERV 255
>gi|442317193|ref|YP_007357214.1| hypothetical protein MYSTI_00171 [Myxococcus stipitatus DSM 14675]
gi|441484835|gb|AGC41530.1| hypothetical protein MYSTI_00171 [Myxococcus stipitatus DSM 14675]
Length = 300
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 53/98 (54%), Gaps = 4/98 (4%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ G++ GL+G GGGF++ P +G+P V++AT+L I + +V + L +
Sbjct: 145 GVGVGILSGLVGAGGGFLIVPALAMVGLPTPVATATSLVVIALQCAAGLVGH--LGHLDL 202
Query: 403 PYAL--YFFALSIIAAFVGQHVLKKLIKILGRASIIIF 438
P+ L A++++ +FVG + ++ L R +F
Sbjct: 203 PWLLTAQVIAVALVGSFVGGRLAGRVAPGLLRKGFAVF 240
>gi|89099437|ref|ZP_01172313.1| hypothetical protein B14911_19705 [Bacillus sp. NRRL B-14911]
gi|89085823|gb|EAR64948.1| hypothetical protein B14911_19705 [Bacillus sp. NRRL B-14911]
Length = 285
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 68/140 (48%), Gaps = 10/140 (7%)
Query: 257 IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYWVLNLLQIPVAGGVS-AYEAVALY 315
++NI WKE G ++ V + LA Q AK S L+ IP V +Y A ++
Sbjct: 64 LKNILWKE-GAILGVSGM-LATQAAKPLVLFLS--EKGLDATVIPACYIVLLSYFAFTMF 119
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQV 374
+ +K + EG A L+ G + G V LG+GGGFI+ PL + LG+ P+
Sbjct: 120 RQGKKTGEQSREGRPSLAGMLL----IGFSGGFVSAALGVGGGFIMVPLSIAFLGLQPRK 175
Query: 375 SSATALFAITFSSSMSVVEY 394
+ T+LFA+ S + Y
Sbjct: 176 AVGTSLFAVLLIVSTGFLSY 195
>gi|158423157|ref|YP_001524449.1| permease [Azorhizobium caulinodans ORS 571]
gi|158330046|dbj|BAF87531.1| putative permease [Azorhizobium caulinodans ORS 571]
Length = 317
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 34/53 (64%), Gaps = 3/53 (5%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA---TALFAITFSSSMS 390
A G G V G+ G+GGGF++ PL L +G+PP V+ A T + A +FS ++S
Sbjct: 19 ALGAAVGFVSGMFGVGGGFLMTPLLLFIGVPPAVAVASVSTHMAASSFSGALS 71
>gi|138896571|ref|YP_001127024.1| hypothetical protein GTNG_2934 [Geobacillus thermodenitrificans
NG80-2]
gi|196249274|ref|ZP_03147972.1| protein of unknown function DUF81 [Geobacillus sp. G11MC16]
gi|134268084|gb|ABO68279.1| Hypothetical membrane spanning protein [Geobacillus
thermodenitrificans NG80-2]
gi|196211031|gb|EDY05792.1| protein of unknown function DUF81 [Geobacillus sp. G11MC16]
Length = 300
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 14/116 (12%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 395
G AG VG L GLGGG I+ P L G + PQV+ T+L I F+ S + Y
Sbjct: 10 GFVAGTVGSLAGLGGGVIIVPALLFFGSLGWLSAVTPQVAVGTSLVVIIFNGLSSTLSYM 69
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
K L F SI +G V L S+ F+L F +F+ A+SL
Sbjct: 70 KDKMVDYQSGLLFCLGSIPGTVIGAWVNNTL-------SVDRFSLYFGLFLMAMSL 118
>gi|407462768|ref|YP_006774085.1| hypothetical protein NKOR_06310 [Candidatus Nitrosopumilus
koreensis AR1]
gi|407046390|gb|AFS81143.1| hypothetical protein NKOR_06310 [Candidatus Nitrosopumilus
koreensis AR1]
Length = 120
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 35/57 (61%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G AG++G ++GLGGG I+ P+ LG PP +++ +LFA ++ S V Y +R
Sbjct: 11 GFAAGVLGSMIGLGGGIIVVPVLTFLGFPPIAAASNSLFAALSNAVASTVSYSRQRR 67
>gi|256828130|ref|YP_003156858.1| hypothetical protein Dbac_0315 [Desulfomicrobium baculatum DSM
4028]
gi|256577306|gb|ACU88442.1| protein of unknown function DUF81 [Desulfomicrobium baculatum DSM
4028]
Length = 415
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 2/98 (2%)
Query: 334 SQLVFYCA--CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
S +F+ + G+ AG++ G +G GGGFI+ P + G+ ++ T LF I + M
Sbjct: 67 SLYIFFVSIFVGVGAGLISGCIGAGGGFIIAPALMSAGVKGILAVGTDLFHIFAKAIMGS 126
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 429
V + + VP A F +II VG + + L I
Sbjct: 127 VLHRKMGNVSVPLAFVFLIGAIIGTTVGAGINRALYNI 164
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMS 390
R +F GI GM G++G+GGGF+ P+F+ LG+ + T +F I F++
Sbjct: 240 RKISWIFLVLSGILVGMAAGIMGVGGGFLTFPIFVYILGVSSLTTVGTDIFQIVFTAGYG 299
Query: 391 VVEYYLLKRF 400
+ Y + F
Sbjct: 300 AITQYAIYGF 309
>gi|449017054|dbj|BAM80456.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 383
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 333 ASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSV 391
A+++V G+TAG +G L+G+GGG I+ PL + G+ + T+L A++FSS +
Sbjct: 105 AARIVLSGLIGLTAGFLGSLIGVGGGIIMTPLLTSVAGLSQHEAHGTSLVAVSFSSCVGA 164
Query: 392 VEY 394
+ Y
Sbjct: 165 LAY 167
>gi|406672837|ref|ZP_11080062.1| hypothetical protein HMPREF9700_00604 [Bergeyella zoohelcum CCUG
30536]
gi|405587381|gb|EKB61109.1| hypothetical protein HMPREF9700_00604 [Bergeyella zoohelcum CCUG
30536]
Length = 263
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMS 390
+ + + G+ G++ G +G GGGF++ P L + G+ + + AT+LF I +S++
Sbjct: 140 KENNYTLLISQGLLVGIIMGFIGAGGGFLIVPALVMLFGMNMREAVATSLFIIMINSAVG 199
Query: 391 VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ P+ L F LSII F+G + KK+
Sbjct: 200 FISSLDKIAIDWPFLLVFSGLSIIGIFIGMLIAKKM 235
>gi|417646396|ref|ZP_12296255.1| hypothetical protein SEVCU144_0935 [Staphylococcus epidermidis
VCU144]
gi|329727892|gb|EGG64342.1| hypothetical protein SEVCU144_0935 [Staphylococcus epidermidis
VCU144]
Length = 125
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 61/119 (51%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFA---LSIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ LK L F A +IIA ++ ++LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIVGKMLPLIFVASGGAAIIATYIPANILKPLI-IIALSLVLIYTV 118
>gi|423315709|ref|ZP_17293614.1| hypothetical protein HMPREF9699_00185 [Bergeyella zoohelcum ATCC
43767]
gi|405585813|gb|EKB59616.1| hypothetical protein HMPREF9699_00185 [Bergeyella zoohelcum ATCC
43767]
Length = 263
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMS 390
+ + + G+ G++ G +G GGGF++ P L + G+ + + AT+LF I +S++
Sbjct: 140 KENNYTLLISQGLLVGIIMGFIGAGGGFLIVPALVMLFGMNMREAVATSLFIIMINSAVG 199
Query: 391 VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ P+ L F LSII F+G + KK+
Sbjct: 200 FISSLDKIAIDWPFLLVFSGLSIIGIFIGMLIAKKM 235
>gi|255534255|ref|YP_003094626.1| hypothetical protein FIC_00088 [Flavobacteriaceae bacterium
3519-10]
gi|255340451|gb|ACU06564.1| conserved hypothetical transmembrane protein [Flavobacteriaceae
bacterium 3519-10]
Length = 266
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 58/107 (54%), Gaps = 4/107 (3%)
Query: 324 KGDEGTKWRASQLVFY---CACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATA 379
+ +E + R V Y + G+ G+V GL+G GGGF++ P L + LG+ + + AT+
Sbjct: 129 RKNERPRLRKYDEVNYTILVSQGLLVGIVTGLIGAGGGFLIVPALVMLLGLNMKKAVATS 188
Query: 380 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
LF I +S + + + + + L F +LS+I F+G + K++
Sbjct: 189 LFIIAMNSLIGFLSTMKIVKHDWVFLLSFTSLSVIGIFIGLALSKRM 235
>gi|254410195|ref|ZP_05023975.1| conserved domain protein, putative [Coleofasciculus chthonoplastes
PCC 7420]
gi|196183231|gb|EDX78215.1| conserved domain protein, putative [Coleofasciculus chthonoplastes
PCC 7420]
Length = 267
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 39/64 (60%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
+QL+ + + G+ AG++ G LG+GGG +L PL + LG Q + AT+ +I ++ V+
Sbjct: 4 TQLLIFASAGLFAGILAGFLGIGGGTVLVPLLVTLGYDYQQAVATSTLSIVITAISGTVQ 63
Query: 394 YYLL 397
+ L
Sbjct: 64 NWRL 67
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 61/135 (45%), Gaps = 15/135 (11%)
Query: 85 GFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT 144
G F G+GGG + VP+L + +G+D + + A S + A+S V N R +
Sbjct: 13 GLFAGILAGFLGIGGGTVLVPLL-VTLGYDYQQAVATSTLSIVITAISGTVQNWRLGN-- 69
Query: 145 LDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMIT-----VLLIVLFLVLSTRAFLK 199
ID+ + ++ IG +FAD+ + +LLI ++LV R LK
Sbjct: 70 -----IDFKRIIAIGFPAIITAPIGAYLTELFADYWLKAAFGLLLLINIYLV-RLRKRLK 123
Query: 200 GVETWKK-ETITKRE 213
E ET+T R+
Sbjct: 124 AREKQDSLETVTPRQ 138
>gi|16079702|ref|NP_390526.1| hypothetical protein BSU26490 [Bacillus subtilis subsp. subtilis
str. 168]
gi|221310580|ref|ZP_03592427.1| hypothetical protein Bsubs1_14486 [Bacillus subtilis subsp.
subtilis str. 168]
gi|221314904|ref|ZP_03596709.1| hypothetical protein BsubsN3_14407 [Bacillus subtilis subsp.
subtilis str. NCIB 3610]
gi|221319826|ref|ZP_03601120.1| hypothetical protein BsubsJ_14323 [Bacillus subtilis subsp.
subtilis str. JH642]
gi|221324105|ref|ZP_03605399.1| hypothetical protein BsubsS_14457 [Bacillus subtilis subsp.
subtilis str. SMY]
gi|402776802|ref|YP_006630746.1| integral inner membrane protein [Bacillus subtilis QB928]
gi|418032177|ref|ZP_12670660.1| hypothetical protein BSSC8_16040 [Bacillus subtilis subsp. subtilis
str. SC-8]
gi|452915804|ref|ZP_21964430.1| sulfite exporter TauE/SafE family protein [Bacillus subtilis
MB73/2]
gi|1731129|sp|P54437.1|YRKJ_BACSU RecName: Full=UPF0721 transmembrane protein YrkJ
gi|1303709|dbj|BAA12365.1| YrkJ [Bacillus subtilis]
gi|2635094|emb|CAB14590.1| putative integral inner membrane protein [Bacillus subtilis subsp.
subtilis str. 168]
gi|351471040|gb|EHA31161.1| hypothetical protein BSSC8_16040 [Bacillus subtilis subsp. subtilis
str. SC-8]
gi|402481982|gb|AFQ58491.1| Putative integral inner membrane protein [Bacillus subtilis QB928]
gi|407959832|dbj|BAM53072.1| hypothetical protein BEST7613_4141 [Bacillus subtilis BEST7613]
gi|407965475|dbj|BAM58714.1| hypothetical protein BEST7003_2513 [Bacillus subtilis BEST7003]
gi|452116152|gb|EME06548.1| sulfite exporter TauE/SafE family protein [Bacillus subtilis
MB73/2]
Length = 261
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 60/114 (52%), Gaps = 11/114 (9%)
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQV 374
KG+ K KW AS L F G V G+LG GG FIL P+ L L IP +V
Sbjct: 131 KGQEHSEDKEVIFNKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRV 184
Query: 375 SSATALFAITFSSSM--SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ A++L AITF SS+ +V + + VP AL S+IA+ +G V +K+
Sbjct: 185 TVASSL-AITFLSSIGATVGKVITGQVLFVP-ALVLMITSLIASPIGASVGQKV 236
>gi|392410298|ref|YP_006446905.1| putative permease [Desulfomonile tiedjei DSM 6799]
gi|390623434|gb|AFM24641.1| putative permease [Desulfomonile tiedjei DSM 6799]
Length = 279
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 6/98 (6%)
Query: 326 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL---GIPPQVSSATALFA 382
E T + A + V Y A G G+ G L+G GGGF+L P+ L L P Q++S + A
Sbjct: 2 TETTLFSAERFVGYVALGFGTGLYGTLIGAGGGFVLMPILLLLFPKESPEQLTSIS--LA 59
Query: 383 ITFSSSMSVVE-YYLLKRFPVPYALYFFALSIIAAFVG 419
+ F +++ E Y L++R L F A +I A +G
Sbjct: 60 VVFFNALGGSEAYALMRRIDYRSGLMFAAATIPGAILG 97
>gi|333373756|ref|ZP_08465660.1| protein of hypothetical function DUF81 [Desmospora sp. 8437]
gi|332969423|gb|EGK08447.1| protein of hypothetical function DUF81 [Desmospora sp. 8437]
Length = 273
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 324 KGDEGTKWR-ASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALF 381
K EG +R CG G++ GL G+GGG +L PL + L PP V++AT++F
Sbjct: 143 KDAEGNIYRYGYHRTIALTCGFVVGLLSGLFGIGGGALLVPLMVLLFRFPPHVATATSMF 202
Query: 382 AITFSS 387
I SS
Sbjct: 203 VIFLSS 208
>gi|260431503|ref|ZP_05785474.1| putative membrane protein [Silicibacter lacuscaerulensis ITI-1157]
gi|260415331|gb|EEX08590.1| putative membrane protein [Silicibacter lacuscaerulensis ITI-1157]
Length = 306
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 53/101 (52%), Gaps = 2/101 (1%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ K +
Sbjct: 25 GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRKTVDLKMGC 84
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 447
++ A +G V L K LG+ +++ L +++F+
Sbjct: 85 VLLVGGLLGAALGVVVFNYL-KSLGQVDLLV-KLCYVVFLG 123
>gi|240102618|ref|YP_002958927.1| putative permease [Thermococcus gammatolerans EJ3]
gi|239910172|gb|ACS33063.1| Predicted permease [Thermococcus gammatolerans EJ3]
Length = 254
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 42/81 (51%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y A G+ G++ L GLGGGF++ P LG+ + T+ A+ F+S S + Y K
Sbjct: 7 YFAVGVFIGILAALFGLGGGFLVVPTLNLLGVEMHHAVGTSSAAVVFTSLSSTIAYSRQK 66
Query: 399 RFPVPYALYFFALSIIAAFVG 419
R L + ++I A++G
Sbjct: 67 RIHYRVGLLLASTAVIGAYIG 87
>gi|260431501|ref|ZP_05785472.1| membrane protein [Silicibacter lacuscaerulensis ITI-1157]
gi|260415329|gb|EEX08588.1| membrane protein [Silicibacter lacuscaerulensis ITI-1157]
Length = 307
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 36/57 (63%), Gaps = 1/57 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
GI G + G+ G+GGGF++ PL +GIPP V+ AT+ + +SS S V +L +R
Sbjct: 21 GIAVGFLSGMFGVGGGFLITPLLFLIGIPPAVAVATSANQVV-ASSFSGVLAHLKRR 76
>gi|308270305|emb|CBX26917.1| hypothetical protein N47_A09460 [uncultured Desulfobacterium sp.]
Length = 260
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 52/112 (46%), Gaps = 3/112 (2%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
GI AG++ G +G GGGFI+ P + GI ++ T LF I + M V + L V
Sbjct: 88 GICAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVLHRKLGNISV 147
Query: 403 PYALYFFALSIIAAFVG---QHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
L F SII A G V+ + IL A I + + + + A SL
Sbjct: 148 ALGLIFILGSIIGATTGGVINRVIYDINPILSDAFITVIYVFMLGILGAYSL 199
>gi|206598109|gb|ACI15919.1| hypothetical protein [Bodo saltans]
Length = 526
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 88/415 (21%), Positives = 165/415 (39%), Gaps = 47/415 (11%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDI----PVIDYDLALLFQP 160
PM + S+ +S+ + G + ++Y + + PT P+I+Y + P
Sbjct: 112 PMYAALTEMPLISAVGLSQAAICGQSAFNMIYQIPKTMPTQTPNEVRPLINYQYLAILLP 171
Query: 161 LLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKE---------TITK 211
L ++G +G + DW+ LL +L + R K +K++ +TK
Sbjct: 172 LSLIGTLLGSLGGRVVPDWLRLALLFLLLTSVLHRVIEKAKRQYKQDAAEKNEEAAALTK 231
Query: 212 REAAK-QLELIGNGYQTEECKCDPEYLSNDTTPEETREPKKSKVSIIEN----------- 259
+ K Q + N T P TTP R + V + E
Sbjct: 232 KVVQKPQQQTDANATDTGAVSMKPVL---STTPAAPRGQTTTTVVVAEEAVGSEGESHFD 288
Query: 260 ------------IYWKEFGLLVAVWAIILALQIAKNYTTTC-SVLYWVLNLLQIPVAGGV 306
+ +E L+ + ++L I ++ T C S + +L ++ I V G+
Sbjct: 289 DNDGPRSPPSLRMPRREISLMGFSFLVLLISNIVRSNFTVCGSTAHVLLFIIPIVVLAGL 348
Query: 307 ----SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILG 362
AV L S +W ++++ + AG LLG+GGG +L
Sbjct: 349 WWLAKHMAAVTLANVASGFLSPDHMTFQWNQKTMIYFPVAAVVAGAGAALLGIGGGLVLS 408
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
+ E G+ P+ +SAT+ FA +S + + + YA +F +++ +GQ
Sbjct: 409 FVLFEAGLAPEEASATSGFATLLIASEAALLMLFQGQLVPDYATMYFVCGVVSTVLGQSG 468
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE--YMGFDNIC 475
I+ + +I+ L+ II S L L GL +++++ + + + F +IC
Sbjct: 469 FMAYIRWSKKRFLIVTALACIIGGSLLMLTSYGLYEVVEQYKRHDGSLLRFGHIC 523
>gi|422350303|ref|ZP_16431189.1| hypothetical protein HMPREF9465_02079 [Sutterella wadsworthensis
2_1_59BFAA]
gi|404657451|gb|EKB30342.1| hypothetical protein HMPREF9465_02079 [Sutterella wadsworthensis
2_1_59BFAA]
Length = 266
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 8/88 (9%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS--------ATALFAITFSSSMSVVEY 394
GI G G L+G+GGG I+ PLF+ +PP S+ T+LF + ++ Y
Sbjct: 11 GIGVGSFGALVGIGGGLIMVPLFMYFMMPPSGSTFANVQEVVGTSLFGVLLNALSGTWAY 70
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHV 422
+ KR + A+ F ++ AF+G +V
Sbjct: 71 FRQKRIILSVAMPFALATVPGAFLGSYV 98
>gi|339446247|ref|YP_004712251.1| hypothetical protein EGYY_28770 [Eggerthella sp. YY7918]
gi|338905999|dbj|BAK45850.1| hypothetical protein EGYY_28770 [Eggerthella sp. YY7918]
Length = 285
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 3/108 (2%)
Query: 320 KIASKGDEGTK--WRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSS 376
K + D GT + QL+ G+ AG+ G +G+GGGF++ PL L L GI + +S
Sbjct: 147 KKTPEADTGTPATFSRRQLLMGAGIGLLAGVASGYVGVGGGFLMVPLMLSLVGIGMRQAS 206
Query: 377 ATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
T+L A+ + VVE LL + +I A +G +++
Sbjct: 207 GTSLIAVMILAVPGVVEQALLGNIDYMAGIAISIGTIPGALIGAQLVR 254
>gi|384921152|ref|ZP_10021141.1| hypothetical protein C357_18457 [Citreicella sp. 357]
gi|384464952|gb|EIE49508.1| hypothetical protein C357_18457 [Citreicella sp. 357]
Length = 302
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 59/107 (55%), Gaps = 6/107 (5%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ + LKR V
Sbjct: 21 GALVGILSGMFGVGGGFLITPLLFFMGIPPAVAVATGANQIVASSVSGVLAH--LKRRTV 78
Query: 403 PYALYFFALS--IIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 447
+ LS I+ A VG V L+K +G+ +++ L +++F+
Sbjct: 79 DLRMGTVLLSGGIVGAAVGIWVF-NLLKGMGQVDLLV-RLCYVVFLG 123
>gi|317152720|ref|YP_004120768.1| hypothetical protein Daes_1006 [Desulfovibrio aespoeensis Aspo-2]
gi|316942971|gb|ADU62022.1| protein of unknown function DUF81 [Desulfovibrio aespoeensis
Aspo-2]
Length = 427
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 53/111 (47%), Gaps = 6/111 (5%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG++ G +G GGGFI+ P + GI ++ T LF I + M V + L V
Sbjct: 88 GMGAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKLGNVSV 147
Query: 403 PYALYFFALSIIAAFVG---QHVLKKLIKILGRASIII---FTLSFIIFVS 447
P A F +II A G VL ++ IL A I F L F+ F S
Sbjct: 148 PLAAVFLIGAIIGASAGGIINRVLYEINPILSDAFITTVYSFMLGFLGFYS 198
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMS 390
R VF GI G+ G++G+GGGF+ P+F+ LG+ + T +F I F++ +
Sbjct: 251 RQISWVFLVLSGILVGLAAGIMGVGGGFLTFPIFVYVLGVSSMTTVGTDIFQIVFTAGFA 310
Query: 391 VVEYYLLKRF 400
+ Y + F
Sbjct: 311 SISQYAIYGF 320
>gi|295705457|ref|YP_003598532.1| hypothetical protein BMD_3342 [Bacillus megaterium DSM 319]
gi|294803116|gb|ADF40182.1| putative membrane protein [Bacillus megaterium DSM 319]
Length = 258
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 3/82 (3%)
Query: 347 GMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPY 404
G V G+LG GG FIL P+ L L IP +++ AT+L A+TF SS+ S + + + P
Sbjct: 153 GAVAGVLGAGGAFILVPVMLVVLKIPTRITIATSL-AVTFISSIGSTIGKLITHQVPFIP 211
Query: 405 ALYFFALSIIAAFVGQHVLKKL 426
AL A S+IA+ +G V +K+
Sbjct: 212 ALILVAASLIASPIGAKVGQKM 233
>gi|90422261|ref|YP_530631.1| hypothetical protein RPC_0741 [Rhodopseudomonas palustris BisB18]
gi|90104275|gb|ABD86312.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
BisB18]
Length = 308
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 58/115 (50%), Gaps = 2/115 (1%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
++ A G G V G+ G+GGGF++ PL + +GI P V+ A+ + SS + Y
Sbjct: 13 NVLLILAMGAAVGFVSGMFGIGGGFLMTPLLIFIGIAPAVAVASVTSHMAASSLSGALSY 72
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
+ + AL +I +G L++ LG+ ++I LS+++ ++A+
Sbjct: 73 WRRRAIDPALALVLLCGGLIGTALGVWFF-TLMRSLGQLDLVI-ALSYVVLLTAV 125
>gi|154686411|ref|YP_001421572.1| hypothetical protein RBAM_019790 [Bacillus amyloliquefaciens FZB42]
gi|154352262|gb|ABS74341.1| YrkJ1 [Bacillus amyloliquefaciens FZB42]
Length = 175
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 63/107 (58%), Gaps = 8/107 (7%)
Query: 347 GMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPY 404
G+ G++G GG F+L PL L L IP +++ AT+L A+TF SS+ + V ++ + P
Sbjct: 69 GLASGIVGAGGAFLLFPLMLVVLKIPTKITIATSL-AVTFISSIGTTVTKLVIGQVPFMP 127
Query: 405 ALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
AL A S+IA+ +G + KK+ A+I+ L F+I ++ + +
Sbjct: 128 ALVIVAASLIASPIGVLIGKKM-----NANILQGILIFVILITTVKI 169
>gi|171185687|ref|YP_001794606.1| hypothetical protein Tneu_1233 [Pyrobaculum neutrophilum V24Sta]
gi|170934899|gb|ACB40160.1| protein of unknown function DUF81 [Pyrobaculum neutrophilum V24Sta]
Length = 242
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 39/69 (56%), Gaps = 3/69 (4%)
Query: 66 KHVWPDIKFGWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCM 125
K+V P W +G + F G S+ GVGGG +FVP L L G DAK +AA+S +
Sbjct: 111 KNVEPRRNAPW---LGPPLVFIGGFASSLFGVGGGTVFVPTLMLTSGLDAKRAAAMSMGI 167
Query: 126 VTGGAVSAV 134
+ AVS+V
Sbjct: 168 IFPTAVSSV 176
>gi|83312652|ref|YP_422916.1| permease [Magnetospirillum magneticum AMB-1]
gi|82947493|dbj|BAE52357.1| Predicted permease [Magnetospirillum magneticum AMB-1]
Length = 301
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 58/108 (53%), Gaps = 7/108 (6%)
Query: 356 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 415
GGGF+L PL + LGIPP V+ A+ + SS V ++ + + AL+ A I
Sbjct: 35 GGGFLLTPLLIMLGIPPAVAVASGANQVLGSSVSGVFAHWRRRNVDIKMALFLLAGGFIG 94
Query: 416 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRI 463
+ G L ++K LG+ ++I +LS++ F LG VGL +++ +
Sbjct: 95 SAAGVW-LFAVLKRLGQIDLVI-SLSYVGF-----LGSVGLLMLVETL 135
>gi|212557467|gb|ACJ29921.1| Conserved hypothetical protein [Shewanella piezotolerans WP3]
Length = 257
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 41/82 (50%), Gaps = 1/82 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG + + G GGG + P L +G+PP ++ AT A F SSM+ YY + F
Sbjct: 17 GLIAGFIDAVAG-GGGLLSIPALLTMGVPPHMALATNKLAACFGSSMAAYTYYKQRLFSP 75
Query: 403 PYALYFFALSIIAAFVGQHVLK 424
+ F + I A +G ++
Sbjct: 76 SLWYHTFIATFIGAVIGTFIVS 97
>gi|375082782|ref|ZP_09729829.1| permease [Thermococcus litoralis DSM 5473]
gi|374742630|gb|EHR79021.1| permease [Thermococcus litoralis DSM 5473]
Length = 254
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 42/81 (51%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y G+ G++ L GLGGGF++ P LG+ + T+ A+ F+S S + Y+ +
Sbjct: 7 YFGVGVFIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSAIAYHRQR 66
Query: 399 RFPVPYALYFFALSIIAAFVG 419
R L + ++I A++G
Sbjct: 67 RIHYKAGLLLASTAVIGAYIG 87
>gi|456352291|dbj|BAM86736.1| permease [Agromonas oligotrophica S58]
Length = 308
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 63/126 (50%), Gaps = 8/126 (6%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+ +
Sbjct: 18 LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV----G 455
AL + +G L++ LG+ ++I LS+++ +S S+GGV G
Sbjct: 78 IDPSLALVLMIGGTLGTALGVSTF-TLLRSLGQLDLMI-ALSYVVLLS--SVGGVMFWEG 133
Query: 456 LAKMIK 461
L M++
Sbjct: 134 LRAMMR 139
>gi|308159430|gb|EFO61961.1| Hypothetical protein GLP15_502 [Giardia lamblia P15]
Length = 748
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 3/54 (5%)
Query: 141 RHPTLDI--PVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVL 192
RH D P++ +DL +LFQP +LG +G N IF W++++ + LFL+L
Sbjct: 75 RHKKYDALSPLLQWDLLILFQPFSLLGALVGSICNAIFPSWVLSI-FVCLFLIL 127
>gi|223478524|ref|YP_002582988.1| hypothetical protein [Thermococcus sp. AM4]
gi|214033750|gb|EEB74576.1| conserved hypothetical protein [Thermococcus sp. AM4]
Length = 254
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 42/81 (51%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y A G+ G++ L GLGGGF++ P LG+ + T+ A+ F+S S + Y +
Sbjct: 7 YFAVGVFIGILAALFGLGGGFLIVPTLNLLGVEMHHAVGTSSAAVVFTSLSSAIAYSRQR 66
Query: 399 RFPVPYALYFFALSIIAAFVG 419
R L + ++I A++G
Sbjct: 67 RIHYKVGLLLASTAVIGAYIG 87
>gi|215740460|dbj|BAG97116.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 109
Score = 42.4 bits (98), Expect = 0.47, Method: Composition-based stats.
Identities = 20/27 (74%), Positives = 24/27 (88%)
Query: 101 GIFVPMLNLIVGFDAKSSAAVSKCMVT 127
GIFVPML LI+GFDAKSS A+SK +V+
Sbjct: 70 GIFVPMLTLIIGFDAKSSTAISKFIVS 96
>gi|126458964|ref|YP_001055242.1| hypothetical protein Pcal_0341 [Pyrobaculum calidifontis JCM 11548]
gi|126248685|gb|ABO07776.1| protein of unknown function DUF81 [Pyrobaculum calidifontis JCM
11548]
Length = 244
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 38/57 (66%)
Query: 79 VVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVV 135
V+G ++ F G ++ GVGGG +FVP L L +GFDAK++AA S ++ AV++ +
Sbjct: 121 VLGYLLIFAGGLASALFGVGGGTVFVPALVLAMGFDAKAAAASSMGIILPTAVASTL 177
>gi|408381578|ref|ZP_11179127.1| permease [Methanobacterium formicicum DSM 3637]
gi|407816045|gb|EKF86608.1| permease [Methanobacterium formicicum DSM 3637]
Length = 267
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 52/109 (47%), Gaps = 1/109 (0%)
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 377
+++ +SK D R Q + G+ +G++ G+ G+ G + LG+P
Sbjct: 133 KKETSSKNDPVVLTRPRQFL-ASLFGVASGLLAGIFGISGTPPVSAGLYSLGLPAMTVVG 191
Query: 378 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
T +F + F+S + Y+LL R + + + + AF+G +LKK+
Sbjct: 192 TTVFVLIFNSLTGIGGYFLLGRLDITLIILLAGGAAVGAFIGPKLLKKI 240
>gi|119385456|ref|YP_916512.1| hypothetical protein Pden_2732 [Paracoccus denitrificans PD1222]
gi|119375223|gb|ABL70816.1| protein of unknown function DUF81 [Paracoccus denitrificans PD1222]
Length = 307
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 51/99 (51%), Gaps = 17/99 (17%)
Query: 313 ALYKGRR----KIASKGDEG--------TKWRASQL----VFYCACGITAGMVGGLLGLG 356
AL++ RR K + G K+RAS L + G G++G L+G+G
Sbjct: 136 ALHRARRSGPIKPMRRHQHGWVHKWPLKMKFRASGLYISSIPVLLVGAAVGVLGALMGVG 195
Query: 357 GGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEY 394
GGFI+ P + LG+P +V T+LF ITF S+ + + +
Sbjct: 196 GGFIMVPAMIYLLGMPTKVVIGTSLFQITFLSAYTTLMH 234
>gi|11497744|ref|NP_068966.1| hypothetical protein AF0127 [Archaeoglobus fulgidus DSM 4304]
gi|2650522|gb|AAB91106.1| predicted coding region AF_0127 [Archaeoglobus fulgidus DSM 4304]
Length = 475
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 58/118 (49%), Gaps = 1/118 (0%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+ +F A G GM+ G +G GG F+L P + +G P ++ A+ + + +
Sbjct: 24 EAIFLLALGFFGGMLSGFIGSGGAFVLTPGMMSIGTPGPIAVASNMCHKFPKAMIGAYRR 83
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF-TLSFIIFVSALSL 451
Y L++ A+ +I VG V K++ + LG ++ +++F++ + A+SL
Sbjct: 84 YKLRQLDPKLAVIMAVSAIAGVQVGIQVQKQIYEALGETGTNLYVSIAFLVVLPAVSL 141
>gi|25026873|ref|NP_736927.1| hypothetical protein CE0317 [Corynebacterium efficiens YS-314]
gi|259506064|ref|ZP_05748966.1| integral membrane protein [Corynebacterium efficiens YS-314]
gi|23492153|dbj|BAC17127.1| conserved hypothetical protein [Corynebacterium efficiens YS-314]
gi|259166352|gb|EEW50906.1| integral membrane protein [Corynebacterium efficiens YS-314]
Length = 284
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 64/137 (46%), Gaps = 5/137 (3%)
Query: 311 AVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELG 369
A A+ +GR++ A G++ A L A G+ G+V GL+G GGGF++ P L L G
Sbjct: 113 ATAMLRGRKQRADTGEK----TALPLGKILAEGLAVGLVTGLVGAGGGFLVVPALALLGG 168
Query: 370 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 429
+P V+ T+L I+ S + Y P L +++ A +G + + +
Sbjct: 169 LPMPVAVGTSLLVISMKSFAGLAGYMTSVSLDWPLVLAVTGAALVGALIGARLTSVVPEQ 228
Query: 430 LGRASIIIFTLSFIIFV 446
R +F L +F+
Sbjct: 229 ALRKGFGVFVLVMGVFI 245
>gi|218667447|ref|YP_002426656.1| hypothetical protein AFE_2258 [Acidithiobacillus ferrooxidans ATCC
23270]
gi|415968072|ref|ZP_11558253.1| hypothetical protein GGI1_04804 [Acidithiobacillus sp. GGI-221]
gi|218519660|gb|ACK80246.1| membrane protein, putative [Acidithiobacillus ferrooxidans ATCC
23270]
gi|339833297|gb|EGQ61154.1| hypothetical protein GGI1_04804 [Acidithiobacillus sp. GGI-221]
Length = 296
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 44/84 (52%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ G + GL G+GGGF++ PL + +G+PP V+ T I +S++ ++ + +
Sbjct: 12 GLIVGFLSGLTGVGGGFLITPLLIFVGVPPLVAVGTGAAQIVGASAVGSYAHWRMGNVDM 71
Query: 403 PYALYFFALSIIAAFVGQHVLKKL 426
A A S + +G HV + L
Sbjct: 72 RMAFILLAGSWMGGLLGVHVARIL 95
>gi|99078076|ref|YP_611334.1| hypothetical protein TM1040_3098 [Ruegeria sp. TM1040]
gi|99035214|gb|ABF62072.1| protein of unknown function DUF81 [Ruegeria sp. TM1040]
Length = 330
Score = 42.4 bits (98), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 36/56 (64%), Gaps = 4/56 (7%)
Query: 347 GMVG---GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
GMVG G+ G+GGGF+L PL +GIPP V+ AT I +SS+S V +L +R
Sbjct: 47 GMVGVLSGMFGVGGGFLLTPLLFFIGIPPAVAVATGANQIV-ASSVSGVLAHLRRR 101
>gi|408379853|ref|ZP_11177444.1| permease [Agrobacterium albertimagni AOL15]
gi|407746230|gb|EKF57755.1| permease [Agrobacterium albertimagni AOL15]
Length = 305
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 63/126 (50%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSVSGAISHFRRGT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
+ ++ A +G ++ L + LG+ +II +L ++IF+ ++GG+ LA+
Sbjct: 78 LDMKLGTVLLIGGLVGATIGIYIFSWL-RRLGQLDLII-SLLYVIFLG--TVGGLMLAES 133
Query: 460 IKRIEH 465
++ +
Sbjct: 134 LRALRR 139
>gi|198284012|ref|YP_002220333.1| hypothetical protein Lferr_1905 [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|198248533|gb|ACH84126.1| protein of unknown function DUF81 [Acidithiobacillus ferrooxidans
ATCC 53993]
Length = 305
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 44/84 (52%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ G + GL G+GGGF++ PL + +G+PP V+ T I +S++ ++ + +
Sbjct: 21 GLIVGFLSGLTGVGGGFLITPLLIFVGVPPLVAVGTGAAQIVGASAVGSYAHWRMGNVDM 80
Query: 403 PYALYFFALSIIAAFVGQHVLKKL 426
A A S + +G HV + L
Sbjct: 81 RMAFILLAGSWMGGLLGVHVARIL 104
>gi|448239247|ref|YP_007403305.1| DUF81 family protein [Geobacillus sp. GHH01]
gi|445208089|gb|AGE23554.1| DUF81 family protein [Geobacillus sp. GHH01]
Length = 295
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 14/116 (12%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 395
G AG VG L GLGGG I+ P L G + PQV+ T+L I F+ S + Y
Sbjct: 10 GFIAGTVGSLAGLGGGVIIVPALLFFGALGWLSAVTPQVAVGTSLVVIIFNGLSSTLSYM 69
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
K L F S+ A +G V L S F+L F +F+ A+SL
Sbjct: 70 KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118
>gi|392955479|ref|ZP_10321010.1| putative permease with tauE sulfite export domain [Bacillus
macauensis ZFHKF-1]
gi|391878406|gb|EIT86995.1| putative permease with tauE sulfite export domain [Bacillus
macauensis ZFHKF-1]
Length = 290
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 62/131 (47%), Gaps = 12/131 (9%)
Query: 257 IENIYWKEFGLLVAVWAIILALQIAKNYTTTCSVLYW---VLNLLQIPVAGGVSAYEAVA 313
++NIYWK + + + + LA Q A S +W V+ +L I + G Y V
Sbjct: 63 LKNIYWKT-AITLGISGV-LATQAANPLVVYMSNHHWDKIVIPVLYIVLMG----YFCVQ 116
Query: 314 LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPP 372
L RRK + D G A L G +G + LG+GGGF++ PL + L I P
Sbjct: 117 LLVKRRK--KEQDRGATRHAFSLPKTLLIGFVSGFLSSTLGVGGGFVMVPLMISILKIEP 174
Query: 373 QVSSATALFAI 383
+ + T+L ++
Sbjct: 175 RKAVGTSLVSV 185
>gi|134299284|ref|YP_001112780.1| hypothetical protein Dred_1425 [Desulfotomaculum reducens MI-1]
gi|134051984|gb|ABO49955.1| protein of unknown function DUF81 [Desulfotomaculum reducens MI-1]
Length = 426
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 7/139 (5%)
Query: 334 SQLVFY--CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
+ ++FY A GI AG++ G +G GGGF++ P + LG+ ++ T F I + M
Sbjct: 70 NPMMFYGSMAVGICAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMGT 129
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
V + L V A+ F S I G + + L + S I +L +++ L
Sbjct: 130 VIHKKLGNVNVALAIAFLVGSGIGVTAGGTLNRALFNMNPVLSDFIISLVYVVM-----L 184
Query: 452 GGVGLAKMIKRIEHKEYMG 470
G +G M I++K G
Sbjct: 185 GFLGFYSMYDFIKNKNTSG 203
>gi|337285350|ref|YP_004624824.1| membrane protein, conserved [Pyrococcus yayanosii CH1]
gi|334901284|gb|AEH25552.1| hypothetical membrane protein, conserved [Pyrococcus yayanosii CH1]
Length = 254
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 40/81 (49%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y A GI G + L GLGGGF++ P LG+ + T+ AI F+S S Y+
Sbjct: 7 YFAVGIVIGTLAALFGLGGGFLIVPTLNLLGVEIHKAVGTSSAAIVFTSLSSAYAYHRQG 66
Query: 399 RFPVPYALYFFALSIIAAFVG 419
R L + +I+ A++G
Sbjct: 67 RIHYKAGLLLASTAIVGAYIG 87
>gi|386759198|ref|YP_006232414.1| Sulfite exporter TauE/SafE [Bacillus sp. JS]
gi|384932480|gb|AFI29158.1| Sulfite exporter TauE/SafE [Bacillus sp. JS]
Length = 258
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 11/114 (9%)
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQV 374
KG+ + K KW AS L F G V G+LG GG FIL P L L IP ++
Sbjct: 128 KGQEQSEDKEVIFNKWLASSLAFII------GGVSGVLGAGGAFILVPFMLSILNIPVRI 181
Query: 375 SSATALFAITFSSSM--SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ A++L AITF SS+ +V + + VP AL S+IA+ +G V +K+
Sbjct: 182 TVASSL-AITFLSSIGATVGKVITGQVLFVP-ALVLMIASLIASPIGASVGQKV 233
>gi|420205978|ref|ZP_14711489.1| hypothetical protein HMPREF9977_00722 [Staphylococcus epidermidis
NIHLM008]
gi|394278651|gb|EJE22965.1| hypothetical protein HMPREF9977_00722 [Staphylococcus epidermidis
NIHLM008]
Length = 253
Score = 42.0 bits (97), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFA---LSIIAAFVGQHVLKKLIKILGRASIIIFT 439
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGAAIIATYIPANILKPLI-IIALSLVLIYT 117
>gi|242242978|ref|ZP_04797423.1| membrane protein [Staphylococcus epidermidis W23144]
gi|242233579|gb|EES35891.1| membrane protein [Staphylococcus epidermidis W23144]
Length = 254
Score = 42.0 bits (97), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118
>gi|56421519|ref|YP_148837.1| hypothetical protein GK2984 [Geobacillus kaustophilus HTA426]
gi|56381361|dbj|BAD77269.1| hypothetical conserved protein [Geobacillus kaustophilus HTA426]
Length = 300
Score = 42.0 bits (97), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 14/116 (12%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 395
G AG VG L GLGGG I+ P L G + PQV+ T+L I F+ S + Y
Sbjct: 10 GFIAGTVGSLAGLGGGVIIVPSLLFFGALGWLSAVTPQVAVGTSLVVIIFNGLSSTLSYM 69
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
K L F S+ A +G V L S F+L F +F+ A+SL
Sbjct: 70 KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118
>gi|84500291|ref|ZP_00998557.1| membrane protein [Oceanicola batsensis HTCC2597]
gi|84392225|gb|EAQ04493.1| membrane protein [Oceanicola batsensis HTCC2597]
Length = 306
Score = 42.0 bits (97), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G++ G+ G+GGGF++ PL +GIPP V+ AT+ I +SS+S V +L +R
Sbjct: 25 GVLSGMFGVGGGFLITPLLFFVGIPPAVAVATSANQIV-ASSVSAVLAHLRRR 76
>gi|375010109|ref|YP_004983742.1| hypothetical protein [Geobacillus thermoleovorans CCB_US3_UF5]
gi|359288958|gb|AEV20642.1| hypothetical protein GTCCBUS3UF5_33410 [Geobacillus thermoleovorans
CCB_US3_UF5]
Length = 300
Score = 42.0 bits (97), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 14/116 (12%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 395
G AG VG L GLGGG I+ P L G + PQV+ T+L I F+ S + Y
Sbjct: 10 GFIAGTVGSLAGLGGGVIIVPSLLFFGALGWLSAVTPQVAVGTSLVVIIFNGLSSTLSYM 69
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
K L F S+ A +G V L S F+L F +F+ A+SL
Sbjct: 70 KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118
>gi|418614254|ref|ZP_13177232.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
gi|374820914|gb|EHR84988.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
Length = 254
Score = 42.0 bits (97), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118
>gi|255036228|ref|YP_003086849.1| hypothetical protein Dfer_2466 [Dyadobacter fermentans DSM 18053]
gi|254948984|gb|ACT93684.1| protein of unknown function DUF81 [Dyadobacter fermentans DSM
18053]
Length = 293
Score = 42.0 bits (97), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 26/108 (24%), Positives = 47/108 (43%)
Query: 321 IASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 380
I+ ++ T++ Y G+ A +V G LG+ G L +G+ P VSSA+
Sbjct: 37 ISVSSEQVTEFLTGDFALYVLVGLAAQLVDGALGMAYGVTSNSFLLSVGVTPAVSSASVH 96
Query: 381 FAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
FA F++ S + ++ K ++ A G ++L +I
Sbjct: 97 FAEMFTTGASAISHFRFKNINKKLFKSLLIPGVLGAIAGAYLLSDVID 144
>gi|418634984|ref|ZP_13197372.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
gi|420190314|ref|ZP_14696257.1| hypothetical protein HMPREF9984_06342 [Staphylococcus epidermidis
NIHLM037]
gi|420192642|ref|ZP_14698500.1| hypothetical protein HMPREF9983_06044 [Staphylococcus epidermidis
NIHLM023]
gi|420204616|ref|ZP_14710174.1| hypothetical protein HMPREF9978_05542 [Staphylococcus epidermidis
NIHLM015]
gi|374835742|gb|EHR99339.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
gi|394258759|gb|EJE03633.1| hypothetical protein HMPREF9984_06342 [Staphylococcus epidermidis
NIHLM037]
gi|394260815|gb|EJE05619.1| hypothetical protein HMPREF9983_06044 [Staphylococcus epidermidis
NIHLM023]
gi|394273626|gb|EJE18057.1| hypothetical protein HMPREF9978_05542 [Staphylococcus epidermidis
NIHLM015]
Length = 254
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118
>gi|127513412|ref|YP_001094609.1| hypothetical protein Shew_2484 [Shewanella loihica PV-4]
gi|126638707|gb|ABO24350.1| protein of unknown function DUF81 [Shewanella loihica PV-4]
Length = 257
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 41/80 (51%), Gaps = 5/80 (6%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G AG + + G GGG + P L +GIPP + T A +F SSM+ YY K+
Sbjct: 14 AATGFIAGFIDAIAG-GGGLLSIPALLTIGIPPHTALGTNKLAASFGSSMAAFTYY-RKQ 71
Query: 400 FPVPYALYFFALSIIAAFVG 419
F P F+ + IA F+G
Sbjct: 72 FFTP---RFWYHASIATFIG 88
>gi|345020875|ref|ZP_08784488.1| hypothetical protein OTW25_06042 [Ornithinibacillus scapharcae
TW25]
Length = 271
Score = 42.0 bits (97), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 5/88 (5%)
Query: 344 ITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
+ G + GL G+GGG I+ P + L G P +++AT++F I F S M + L
Sbjct: 161 LVVGTLSGLFGIGGGSIMVPAMILLFGFPVHIATATSMFMIFFVSMMGASSHIFLGHIAW 220
Query: 403 PYALYFFALSIIAAFVGQHVLKKLIKIL 430
Y L+F I A++G V K+ ++L
Sbjct: 221 EYVLFF----IPGAWIGGKVGAKISQLL 244
>gi|420174532|ref|ZP_14680982.1| hypothetical protein HMPREF9990_02314 [Staphylococcus epidermidis
NIHLM061]
gi|394245037|gb|EJD90364.1| hypothetical protein HMPREF9990_02314 [Staphylococcus epidermidis
NIHLM061]
Length = 254
Score = 42.0 bits (97), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118
>gi|89053334|ref|YP_508785.1| hypothetical protein Jann_0843 [Jannaschia sp. CCS1]
gi|88862883|gb|ABD53760.1| protein of unknown function DUF81 [Jannaschia sp. CCS1]
Length = 307
Score = 42.0 bits (97), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 49/97 (50%), Gaps = 2/97 (2%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ + K + L
Sbjct: 29 GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHLKRKTVDLKMGLVLLI 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 447
+I A +G V L I G+ +++ L +++F+
Sbjct: 89 GGLIGAAIGVQVFAALTAI-GQVDLLV-RLCYVVFLG 123
>gi|420202259|ref|ZP_14707852.1| hypothetical protein HMPREF9979_05842 [Staphylococcus epidermidis
NIHLM018]
gi|394269667|gb|EJE14197.1| hypothetical protein HMPREF9979_05842 [Staphylococcus epidermidis
NIHLM018]
Length = 254
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSVMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 439
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117
>gi|350266812|ref|YP_004878119.1| hypothetical protein GYO_2879 [Bacillus subtilis subsp. spizizenii
TU-B-10]
gi|349599699|gb|AEP87487.1| YrkJ [Bacillus subtilis subsp. spizizenii TU-B-10]
Length = 259
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 62/115 (53%), Gaps = 8/115 (6%)
Query: 320 KIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSAT 378
K ++G++ R + + G+ G++G GG F+L PL L L IP +++ AT
Sbjct: 127 KTGTEGNQSDHIRFPK-ILTSIMTFVVGLASGIVGAGGAFLLIPLMLVILKIPARITIAT 185
Query: 379 ALFAITFSSSMSVVEYYL-LKRFPVPYALYFFALSIIAA----FVGQHVLKKLIK 428
+L A+TF SS+ L + + P+ AL A S+IAA F+G+ V K+++
Sbjct: 186 SL-AVTFISSIGTTATKLVIGQVPMMPALVVMAASLIAAPLGVFIGKKVNAKVLQ 239
>gi|408375545|ref|ZP_11173209.1| hypothetical protein A11A3_15572 [Alcanivorax hongdengensis A-11-3]
gi|407764566|gb|EKF73039.1| hypothetical protein A11A3_15572 [Alcanivorax hongdengensis A-11-3]
Length = 264
Score = 42.0 bits (97), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF--AITFSSSM 389
++ Y A G G+ GL G+GGG ++ PL L G PPQ + T L AIT SS M
Sbjct: 7 DMLLYIAAGGLVGLAIGLTGVGGGSLMTPLLLLFGFPPQTAIGTDLLYAAITKSSGM 63
>gi|390960289|ref|YP_006424123.1| hypothetical protein CL1_0114 [Thermococcus sp. CL1]
gi|390518597|gb|AFL94329.1| hypothetical protein CL1_0114 [Thermococcus sp. CL1]
Length = 238
Score = 42.0 bits (97), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 46/85 (54%), Gaps = 1/85 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G +G+ GLLG+ GG I PLF EL GIP + + T+ A+ F+S + +Y L +
Sbjct: 132 GAFSGLASGLLGVSGGIINVPLFHELAGIPVKYAVGTSSLALFFTSLTAAYAHYTLGQVD 191
Query: 402 VPYALYFFALSIIAAFVGQHVLKKL 426
+ A + +FVG H++ ++
Sbjct: 192 ITTAAMIVPGLMAGSFVGAHLVSRI 216
>gi|254451969|ref|ZP_05065406.1| membrane protein [Octadecabacter arcticus 238]
gi|198266375|gb|EDY90645.1| membrane protein [Octadecabacter arcticus 238]
Length = 319
Score = 42.0 bits (97), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 49/95 (51%), Gaps = 2/95 (2%)
Query: 352 LLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFAL 411
+ G+GGGF++ PL +GIPP V+ AT I SS V+ + K +
Sbjct: 44 MFGVGGGFLMTPLLFFIGIPPAVAVATGAVQIVASSFSGVLAHLRRKTVDLKMGTVLLVG 103
Query: 412 SIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+I A VG VL +K LG+ +++ L +++F+
Sbjct: 104 GLIGAAVGI-VLFNYLKFLGQVDLLV-NLCYVVFL 136
>gi|86138958|ref|ZP_01057529.1| membrane protein [Roseobacter sp. MED193]
gi|85824189|gb|EAQ44393.1| membrane protein [Roseobacter sp. MED193]
Length = 306
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 52/96 (54%), Gaps = 2/96 (2%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ + + +
Sbjct: 29 GMFGVGGGFLMTPLLFFIGIPPVVAVATEANQIVASSFSGVLAHFRRRTVDIKMGVVLQV 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+I A +G V L K LG+ +++ TL +++F+
Sbjct: 89 GGLIGAGLGVVVFNYL-KSLGQVDLLV-TLCYVVFL 122
>gi|261313116|ref|ZP_05952313.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10]
gi|261302142|gb|EEY05639.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10]
Length = 306
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 30/133 (22%), Positives = 66/133 (49%), Gaps = 4/133 (3%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
++ G G + GL G+GG F++ PL + IPP ++ AT + SS + +
Sbjct: 13 NMLVLLGMGAAVGFLSGLFGVGGSFLITPLLIFYNIPPAIAVATGANQVIASSVSGALAH 72
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 454
+ + + L+ A I+ + +G V L + LG+ +I+ ++ ++ F+ ++GG+
Sbjct: 73 FKRRTLDIKLGLFLVAGGILGSLIGIFVFSWL-RDLGQLDLIV-SILYVFFLG--TIGGL 128
Query: 455 GLAKMIKRIEHKE 467
L + ++ + +
Sbjct: 129 MLVESVQALRRAK 141
>gi|11499103|ref|NP_070337.1| hypothetical protein AF1508 [Archaeoglobus fulgidus DSM 4304]
gi|2649059|gb|AAB89742.1| predicted coding region AF_1508 [Archaeoglobus fulgidus DSM 4304]
Length = 480
Score = 41.6 bits (96), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 32/141 (22%), Positives = 65/141 (46%), Gaps = 7/141 (4%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+ +F A G GM+ G +G GG F+L P + +G P ++ A+ + + +
Sbjct: 24 EALFLLALGFFGGMLSGFIGTGGAFVLTPGMMSIGTPGPIAVASNMCHKFPKAMIGAYRR 83
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIF-TLSFIIFVSALSLGG 453
Y L++ AL +I VG V K + ++LG ++ +++F+I + ++
Sbjct: 84 YKLRQLDPKLALLMATSAIFGVQVGIQVQKHIYELLGPTGTNLYVSVAFLIVLPTVA--- 140
Query: 454 VGLAKMIKRIEHKEYMGFDNI 474
A +I+ + MG ++
Sbjct: 141 ---AVLIRDVVKARRMGISDL 158
>gi|319789078|ref|YP_004150711.1| hypothetical protein Theam_0096 [Thermovibrio ammonificans HB-1]
gi|317113580|gb|ADU96070.1| protein of unknown function DUF81 [Thermovibrio ammonificans HB-1]
Length = 262
Score = 41.6 bits (96), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 8/71 (11%)
Query: 343 GITAGMVGGLLGLGGGFILGPL----FLELGIPPQVS----SATALFAITFSSSMSVVEY 394
GI AG V GL G+GGG IL PL F ++G+P ++S AT+L I S+ S V +
Sbjct: 12 GIAAGFVAGLFGIGGGVILVPLFWFFFQKIGVPQELSFKLAVATSLSVIAVSTLFSTVSH 71
Query: 395 YLLKRFPVPYA 405
L +P+ A
Sbjct: 72 ILKGSYPLKEA 82
>gi|224476019|ref|YP_002633625.1| hypothetical protein Sca_0526 [Staphylococcus carnosus subsp.
carnosus TM300]
gi|222420626|emb|CAL27440.1| putative membrane protein [Staphylococcus carnosus subsp. carnosus
TM300]
Length = 274
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSS 376
R IA+ G F+ GI G++ GL G+GGG ++ PL L PP V+
Sbjct: 141 RTYIANDGTTYHYHVPPVFAFFATFGI--GILTGLFGIGGGALMTPLMLIVFRFPPHVAV 198
Query: 377 ATALFAITFSSSMS 390
T++ I FSS MS
Sbjct: 199 GTSMMMIFFSSVMS 212
>gi|212223719|ref|YP_002306955.1| membrane protein [Thermococcus onnurineus NA1]
gi|212008676|gb|ACJ16058.1| hypothetical membrane protein, conserved [Thermococcus onnurineus
NA1]
Length = 254
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 52/103 (50%), Gaps = 4/103 (3%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y G+ G++ L GLGGGF++ P LG+ + T+ A+ F+S S + Y K
Sbjct: 7 YFTVGVFIGILAALFGLGGGFLIVPTLNLLGVEIHHAVGTSSAAVVFTSLSSALAYSRQK 66
Query: 399 RFPVPYALYFFALSIIAAFVG----QHVLKKLIKILGRASIII 437
R L + ++I A++G + ++K++ A++II
Sbjct: 67 RIHYKIGLLLASTAVIGAYIGAWMTSFISAGMLKVIFGATLII 109
>gi|149909081|ref|ZP_01897739.1| hypothetical protein PE36_17470 [Moritella sp. PE36]
gi|149807832|gb|EDM67777.1| hypothetical protein PE36_17470 [Moritella sp. PE36]
Length = 259
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 53/120 (44%), Gaps = 14/120 (11%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
S + C+ + AG + + G GGG + P L G+PP ++ T A +F S + +
Sbjct: 9 SVFIILCSVALVAGFIDAIAG-GGGLLTVPALLSAGLPPHLALGTNKLAASFGSCAAAIT 67
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHV--------LKKLIKILGRASIIIFTLSFIIF 445
+Y + F + F + I A G V L KL+ + IIIFT + IF
Sbjct: 68 FYKKRLFNPKFWWRSFVFTAIGAICGTLVVNVISTAWLDKLLPV-----IIIFTAIYSIF 122
>gi|443327177|ref|ZP_21055809.1| putative permease [Xenococcus sp. PCC 7305]
gi|442793208|gb|ELS02663.1| putative permease [Xenococcus sp. PCC 7305]
Length = 270
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%)
Query: 337 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYL 396
VF G +G++ G LG+GGG +L PL + LG P + AT+ AIT ++ ++ +
Sbjct: 7 VFIGISGFFSGILAGFLGIGGGTVLVPLLVALGYEPIQAVATSALAITITALSGTLQNWR 66
Query: 397 LKRFPVPYALYFFALSIIAAFVGQHVLKK 425
+ + LY +++AA VG + K
Sbjct: 67 MGYIKLQSILYLGLPALLAAQVGVYFADK 95
>gi|367478122|ref|ZP_09477444.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 285]
gi|365269682|emb|CCD89912.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 285]
Length = 331
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 4/117 (3%)
Query: 338 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLL 397
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 39 LLLAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRR 98
Query: 398 KRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 454
+ AL + +G L++ LG+ ++I LS+++ ++ S+GGV
Sbjct: 99 RAIDPSLALVLMIGGTLGTALGVATF-TLLRSLGQLDLMI-ALSYVVLLT--SVGGV 151
>gi|328952971|ref|YP_004370305.1| hypothetical protein Desac_1265 [Desulfobacca acetoxidans DSM
11109]
gi|328453295|gb|AEB09124.1| protein of unknown function DUF81 [Desulfobacca acetoxidans DSM
11109]
Length = 311
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 27/39 (69%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 378
G+ G + G+ G+GGGF+L PL + +GIPP V++A+
Sbjct: 18 IGLGVLVGFLSGMFGVGGGFLLTPLMMMVGIPPAVAAAS 56
>gi|420173231|ref|ZP_14679726.1| hypothetical protein HMPREF9991_07281 [Staphylococcus epidermidis
NIHLM067]
gi|420183366|ref|ZP_14689497.1| hypothetical protein HMPREF9987_08245 [Staphylococcus epidermidis
NIHLM049]
gi|394240409|gb|EJD85833.1| hypothetical protein HMPREF9991_07281 [Staphylococcus epidermidis
NIHLM067]
gi|394249078|gb|EJD94299.1| hypothetical protein HMPREF9987_08245 [Staphylococcus epidermidis
NIHLM049]
Length = 254
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 439
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117
>gi|27468308|ref|NP_764945.1| hypothetical protein SE1390 [Staphylococcus epidermidis ATCC 12228]
gi|57867226|ref|YP_188850.1| hypothetical protein SERP1278 [Staphylococcus epidermidis RP62A]
gi|251811104|ref|ZP_04825577.1| membrane protein [Staphylococcus epidermidis BCM-HMP0060]
gi|282875864|ref|ZP_06284731.1| putative membrane protein [Staphylococcus epidermidis SK135]
gi|293366339|ref|ZP_06613019.1| conserved hypothetical protein [Staphylococcus epidermidis
M23864:W2(grey)]
gi|417656164|ref|ZP_12305855.1| putative membrane protein [Staphylococcus epidermidis VCU028]
gi|417658659|ref|ZP_12308280.1| putative membrane protein [Staphylococcus epidermidis VCU045]
gi|417908781|ref|ZP_12552538.1| putative membrane protein [Staphylococcus epidermidis VCU037]
gi|417911376|ref|ZP_12555083.1| putative membrane protein [Staphylococcus epidermidis VCU105]
gi|417914293|ref|ZP_12557945.1| putative membrane protein [Staphylococcus epidermidis VCU109]
gi|418603691|ref|ZP_13167072.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
gi|418606180|ref|ZP_13169474.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
gi|418610089|ref|ZP_13173215.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
gi|418612531|ref|ZP_13175566.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
gi|418618135|ref|ZP_13181014.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
gi|418623251|ref|ZP_13185969.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
gi|418625217|ref|ZP_13187873.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
gi|418627260|ref|ZP_13189839.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
gi|418629290|ref|ZP_13191802.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU127]
gi|418664063|ref|ZP_13225560.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
gi|419769208|ref|ZP_14295304.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
IS-250]
gi|419771352|ref|ZP_14297406.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
IS-K]
gi|420166604|ref|ZP_14673287.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM088]
gi|420170403|ref|ZP_14676964.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM070]
gi|420187092|ref|ZP_14693115.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM039]
gi|420195366|ref|ZP_14701159.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM021]
gi|420196947|ref|ZP_14702681.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM020]
gi|420209206|ref|ZP_14714644.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM003]
gi|420215110|ref|ZP_14720382.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIH05005]
gi|420216860|ref|ZP_14722054.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIH05001]
gi|420220645|ref|ZP_14725604.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIH04008]
gi|420221508|ref|ZP_14726438.1| hypothetical protein HMPREF1390_00623 [Staphylococcus epidermidis
NIH08001]
gi|420225897|ref|ZP_14730724.1| hypothetical protein HMPREF1389_10265 [Staphylococcus epidermidis
NIH06004]
gi|420227493|ref|ZP_14732261.1| hypothetical protein HMPREF1388_05408 [Staphylococcus epidermidis
NIH05003]
gi|420229811|ref|ZP_14734513.1| hypothetical protein HMPREF1387_05447 [Staphylococcus epidermidis
NIH04003]
gi|420232217|ref|ZP_14736858.1| hypothetical protein HMPREF1386_05683 [Staphylococcus epidermidis
NIH051668]
gi|420234863|ref|ZP_14739423.1| hypothetical protein HMPREF1385_05672 [Staphylococcus epidermidis
NIH051475]
gi|421606803|ref|ZP_16048057.1| hypothetical protein B440_00608 [Staphylococcus epidermidis
AU12-03]
gi|27315854|gb|AAO04989.1|AE016748_223 conserved hypothetical protein [Staphylococcus epidermidis ATCC
12228]
gi|57637884|gb|AAW54672.1| conserved hypothetical protein [Staphylococcus epidermidis RP62A]
gi|251805324|gb|EES57981.1| membrane protein [Staphylococcus epidermidis BCM-HMP0060]
gi|281294889|gb|EFA87416.1| putative membrane protein [Staphylococcus epidermidis SK135]
gi|291319577|gb|EFE59943.1| conserved hypothetical protein [Staphylococcus epidermidis
M23864:W2(grey)]
gi|329737414|gb|EGG73668.1| putative membrane protein [Staphylococcus epidermidis VCU028]
gi|329737501|gb|EGG73754.1| putative membrane protein [Staphylococcus epidermidis VCU045]
gi|341652991|gb|EGS76765.1| putative membrane protein [Staphylococcus epidermidis VCU109]
gi|341653699|gb|EGS77466.1| putative membrane protein [Staphylococcus epidermidis VCU105]
gi|341656142|gb|EGS79865.1| putative membrane protein [Staphylococcus epidermidis VCU037]
gi|374405505|gb|EHQ76437.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
gi|374407009|gb|EHQ77878.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
gi|374409263|gb|EHQ80062.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
gi|374410953|gb|EHQ81682.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
gi|374816377|gb|EHR80581.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
gi|374818856|gb|EHR82999.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
gi|374824144|gb|EHR88117.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
gi|374825718|gb|EHR89642.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
gi|374829935|gb|EHR93727.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
gi|374834371|gb|EHR98017.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU127]
gi|383358277|gb|EID35736.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
IS-250]
gi|383361578|gb|EID38948.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
IS-K]
gi|394233474|gb|EJD79078.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM088]
gi|394240741|gb|EJD86164.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM070]
gi|394256839|gb|EJE01765.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM039]
gi|394263320|gb|EJE08056.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM021]
gi|394266921|gb|EJE11539.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM020]
gi|394279434|gb|EJE23742.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIHLM003]
gi|394282559|gb|EJE26749.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIH05005]
gi|394285998|gb|EJE30064.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIH04008]
gi|394290606|gb|EJE34460.1| hypothetical protein HMPREF1390_00623 [Staphylococcus epidermidis
NIH08001]
gi|394291222|gb|EJE35046.1| membrane protein, DUF81 family [Staphylococcus epidermidis
NIH05001]
gi|394293331|gb|EJE37054.1| hypothetical protein HMPREF1389_10265 [Staphylococcus epidermidis
NIH06004]
gi|394297117|gb|EJE40729.1| hypothetical protein HMPREF1388_05408 [Staphylococcus epidermidis
NIH05003]
gi|394298610|gb|EJE42175.1| hypothetical protein HMPREF1387_05447 [Staphylococcus epidermidis
NIH04003]
gi|394301540|gb|EJE44996.1| hypothetical protein HMPREF1386_05683 [Staphylococcus epidermidis
NIH051668]
gi|394304106|gb|EJE47516.1| hypothetical protein HMPREF1385_05672 [Staphylococcus epidermidis
NIH051475]
gi|406657581|gb|EKC83966.1| hypothetical protein B440_00608 [Staphylococcus epidermidis
AU12-03]
Length = 254
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 10/118 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 439
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117
>gi|404416492|ref|ZP_10998312.1| hypothetical protein SARL_01356 [Staphylococcus arlettae CVD059]
gi|403491149|gb|EJY96674.1| hypothetical protein SARL_01356 [Staphylococcus arlettae CVD059]
Length = 297
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 53/112 (47%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+L+ + G A M+ G LG+G G + L G P + SAT F+ +++ S V +
Sbjct: 3 KLLIFALAGFFAQMIDGSLGMGFGATSSSILLAAGAAPAIVSATIHFSEIATTAASGVSH 62
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
Y K A+ IAAF+G VL + L + I +F L+ +F+
Sbjct: 63 YKFKNVDKRMAIKLAIPGAIAAFIGSAVLSNIHSDLIKPFISLFLLTMGLFI 114
>gi|416125430|ref|ZP_11596028.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
gi|319401027|gb|EFV89246.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
Length = 254
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A + ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118
>gi|398828136|ref|ZP_10586338.1| putative permease [Phyllobacterium sp. YR531]
gi|398218854|gb|EJN05356.1| putative permease [Phyllobacterium sp. YR531]
Length = 316
Score = 41.6 bits (96), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 64/127 (50%), Gaps = 4/127 (3%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 25 GMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPAVAVATGANQVIASSFSGALTHFKRGTL 84
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMI 460
V I A +G +V L++ LG+ +I+ +L +++F+S ++GG+ L + +
Sbjct: 85 DVKLGSLLVIGGIFGAAIGIYVF-VLLRELGQLDLIV-SLLYVVFLS--TIGGLMLYESV 140
Query: 461 KRIEHKE 467
K + +
Sbjct: 141 KAMRRTK 147
>gi|20093583|ref|NP_613430.1| permease [Methanopyrus kandleri AV19]
gi|19886439|gb|AAM01360.1| Predicted permease [Methanopyrus kandleri AV19]
Length = 252
Score = 41.6 bits (96), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 44/98 (44%), Gaps = 8/98 (8%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
++ Y G G+V G+ G+GGGF+L PL G+P ++ T L AI+ +
Sbjct: 6 SMMLYLGIGGLVGLVSGMFGVGGGFLLVPLLNSTGMPMHLALGTTLLAISLGGFTGAYRH 65
Query: 395 YLLKRFPVPYALYFFALSIIAAFVG--------QHVLK 424
V A F +I+ A VG +HVLK
Sbjct: 66 LQEGNVHVDAAPIFGLSAIVGAQVGSYLACLTPEHVLK 103
>gi|390449540|ref|ZP_10235145.1| hypothetical protein A33O_08366 [Nitratireductor aquibiodomus RA22]
gi|389664037|gb|EIM75548.1| hypothetical protein A33O_08366 [Nitratireductor aquibiodomus RA22]
Length = 306
Score = 41.6 bits (96), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 62/128 (48%), Gaps = 4/128 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G + G+ G+GGGF++ PL + IPP ++ AT + SS + ++ K
Sbjct: 18 LAMGGAVGFLSGMFGVGGGFLITPLLIFYNIPPAIAVATGANQVIASSFSGALAHFKRKS 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
+ A I+ A +G V L + LG+ +I+ +L ++ F+ +GG+ L +
Sbjct: 78 LDLKLGTVLLAGGIVGASLGVSVFAYL-RRLGQLDLIV-SLLYVAFLGV--VGGLMLMES 133
Query: 460 IKRIEHKE 467
++ I
Sbjct: 134 VRAIRRSR 141
>gi|348027031|ref|YP_004766836.1| hypothetical protein MELS_1790 [Megasphaera elsdenii DSM 20460]
gi|341823085|emb|CCC74009.1| putative membrane protein [Megasphaera elsdenii DSM 20460]
Length = 262
Score = 41.6 bits (96), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 41/79 (51%), Gaps = 9/79 (11%)
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 388
T AS L+F A G AG V + G GGG I P+ L G+PP ++ T F+ TF +
Sbjct: 8 TTLSASLLLFIMAGGFLAGFVDSIAG-GGGLISLPVLLAAGLPPHLAIGTNKFSATFGAV 66
Query: 389 MSVVEYY--------LLKR 399
MS +++ LLKR
Sbjct: 67 MSAWQFWRAGKVDMGLLKR 85
>gi|220903637|ref|YP_002478949.1| hypothetical protein Ddes_0357 [Desulfovibrio desulfuricans subsp.
desulfuricans str. ATCC 27774]
gi|219867936|gb|ACL48271.1| protein of unknown function DUF81 [Desulfovibrio desulfuricans
subsp. desulfuricans str. ATCC 27774]
Length = 420
Score = 41.6 bits (96), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 43/86 (50%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
A G+ AG++ G +G GGGFI+ P + +G+ ++ T LF I + M + L
Sbjct: 82 AVGLAAGLITGCIGAGGGFIITPALMAVGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNV 141
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKL 426
A+ F S++ F+G + K L
Sbjct: 142 SGKLAVAFLCGSVVGTFIGGAINKGL 167
>gi|347529542|ref|YP_004836290.1| hypothetical protein SLG_31580 [Sphingobium sp. SYK-6]
gi|345138224|dbj|BAK67833.1| conserved hypothetical membrane protein [Sphingobium sp. SYK-6]
Length = 304
Score = 41.6 bits (96), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 55/106 (51%), Gaps = 7/106 (6%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF+ PL + GIPP V++A+A + +S SV+ + K +P
Sbjct: 29 GMFGVGGGFLTTPLLIFYGIPPTVAAASAATQVMGASVSSVMSHLGRKTVDLPMGGVMVV 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 456
II A +G V+ +++ LG+ +I L ++ LG +GL
Sbjct: 89 GGIIGAAIGT-VIFRILDRLGQIDTVIGLLYVVM------LGSIGL 127
>gi|392955880|ref|ZP_10321410.1| hypothetical protein A374_04039 [Bacillus macauensis ZFHKF-1]
gi|391878122|gb|EIT86712.1| hypothetical protein A374_04039 [Bacillus macauensis ZFHKF-1]
Length = 254
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 65/124 (52%), Gaps = 7/124 (5%)
Query: 305 GVSAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 364
G+ A AV L ++ A + + T R + F G+ AG +LG GG FIL P+
Sbjct: 111 GILALIAVMLMFTPKREALESGKVTFHRGGAIFFSFVIGLVAG----VLGAGGAFILVPV 166
Query: 365 FLE-LGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
L L IP + + AT+L AITF SS+ S + + + P A+ +S+IA+ +G V
Sbjct: 167 MLAVLKIPTKTTIATSL-AITFLSSIGSTIGKLVTHQVPYIPAIVLMIVSLIASPIGAKV 225
Query: 423 LKKL 426
++L
Sbjct: 226 GQRL 229
>gi|161528556|ref|YP_001582382.1| hypothetical protein Nmar_1048 [Nitrosopumilus maritimus SCM1]
gi|160339857|gb|ABX12944.1| protein of unknown function DUF81 [Nitrosopumilus maritimus SCM1]
Length = 257
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 35/58 (60%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G AG++G ++GLGGG I+ P+ LG PP +++ +LFA ++ S + Y KR
Sbjct: 10 LGFAAGVLGSMIGLGGGVIVVPVLTFLGFPPTAAASNSLFAALSNAVASTMSYSKQKR 67
>gi|260430369|ref|ZP_05784342.1| membrane protein [Citreicella sp. SE45]
gi|260418398|gb|EEX11655.1| membrane protein [Citreicella sp. SE45]
Length = 301
Score = 41.6 bits (96), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
L G G++ GL G+GGGFIL PL +GIPP V+ +T I SS + +
Sbjct: 13 NLFLIVGLGGVVGIMSGLFGVGGGFILTPLLFFIGIPPAVAVSTQAVQIVASSCSGALAH 72
Query: 395 Y 395
+
Sbjct: 73 W 73
>gi|205375197|ref|ZP_03227988.1| hypothetical protein Bcoam_19517 [Bacillus coahuilensis m4-4]
Length = 273
Score = 41.6 bits (96), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 43/92 (46%), Gaps = 1/92 (1%)
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQV 374
KG+++ D T Q V A G+ GL G+GGG ++ P + L PP V
Sbjct: 135 KGKKRSFQDADGNTYEYHIQPVIAIASAFIVGLTSGLFGIGGGSLMVPAMILLFLFPPHV 194
Query: 375 SSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
+ AT++F I S+ +S V + YAL
Sbjct: 195 AVATSMFMIFLSAIVSSVTHIAFGNVDWLYAL 226
>gi|420178011|ref|ZP_14684345.1| hypothetical protein HMPREF9989_07051 [Staphylococcus epidermidis
NIHLM057]
gi|420181101|ref|ZP_14687307.1| hypothetical protein HMPREF9988_09900 [Staphylococcus epidermidis
NIHLM053]
gi|394247198|gb|EJD92446.1| hypothetical protein HMPREF9989_07051 [Staphylococcus epidermidis
NIHLM057]
gi|394247337|gb|EJD92583.1| hypothetical protein HMPREF9988_09900 [Staphylococcus epidermidis
NIHLM053]
Length = 254
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A + ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYTI 118
>gi|357419022|ref|YP_004932014.1| hypothetical protein Tlie_0171 [Thermovirga lienii DSM 17291]
gi|355396488|gb|AER65917.1| protein of unknown function DUF81 [Thermovirga lienii DSM 17291]
Length = 274
Score = 41.6 bits (96), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 76/176 (43%), Gaps = 22/176 (12%)
Query: 32 QSNLKPEATSKNKNDHVEPNIVIKVSAPKSRSGYKHVWPDIKFGWRIVVGTIIGFFGTAC 91
+S+ E T K +H + ++ K S + + K R+++G G
Sbjct: 113 RSDDIQEKTPHQKANHTKNTLLDCSGKFKDPSSGETIAYTPK---RLLLGMAFSTLGGII 169
Query: 92 GSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMV-TGGAVSAVVYNLR-QRHPTLDIPV 149
++ G+GGG I VP++ L++G K++ A S MV +VSAV+Y P L IP
Sbjct: 170 SALLGIGGGPIKVPIMRLVMGLPLKAATATSTFMVGITASVSAVIYLFEGMVKPDLAIPA 229
Query: 150 IDYDLALLFQPLLVLGISIGVAFNVIFADWM----ITVLLIVLFLVLSTRAFLKGV 201
VLGI IG + A M I +L ++++ LKG+
Sbjct: 230 -------------VLGIFIGARTGALVAGKMKSHHIRTAFSILLVLIAIEMLLKGL 272
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 2/79 (2%)
Query: 317 GRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVS 375
G+ K S G E + +L+ A G++ LLG+GGG I P+ L +G+P + +
Sbjct: 138 GKFKDPSSG-ETIAYTPKRLLLGMAFSTLGGIISALLGIGGGPIKVPIMRLVMGLPLKAA 196
Query: 376 SATALFAITFSSSMSVVEY 394
+AT+ F + ++S+S V Y
Sbjct: 197 TATSTFMVGITASVSAVIY 215
>gi|376296158|ref|YP_005167388.1| hypothetical protein DND132_1375 [Desulfovibrio desulfuricans
ND132]
gi|323458719|gb|EGB14584.1| protein of unknown function DUF81 [Desulfovibrio desulfuricans
ND132]
Length = 309
Score = 41.6 bits (96), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 38/62 (61%), Gaps = 2/62 (3%)
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
LVF A G G++ G+ G+GGGF++ PL + GIPP V++A+ I +S+ + +Y
Sbjct: 17 LVF--ALGGFVGLLSGIFGVGGGFLMTPLLIMFGIPPTVAAASDSNQIVGASTSGCLAHY 74
Query: 396 LL 397
L
Sbjct: 75 RL 76
>gi|430756311|ref|YP_007208850.1| hypothetical protein A7A1_0819 [Bacillus subtilis subsp. subtilis
str. BSP1]
gi|430020831|gb|AGA21437.1| Hypothetical protein YrkJ [Bacillus subtilis subsp. subtilis str.
BSP1]
Length = 262
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 9/100 (9%)
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 387
KW AS L F G V G+LG GG FIL P+ L L IP +++ A++L AITF S
Sbjct: 145 NKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 197
Query: 388 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
S+ + V + + AL S+IA+ +G V +K+
Sbjct: 198 SIGATVGKVITDQVLFVPALVLIMASLIASPIGASVGQKV 237
>gi|374299305|ref|YP_005050944.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
gi|332552241|gb|EGJ49285.1| protein of unknown function DUF81 [Desulfovibrio africanus str.
Walvis Bay]
Length = 423
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 42/84 (50%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG++ G +G GGGFI+ P + GI ++ T LF I + M + L V
Sbjct: 81 GLIAGLITGCIGAGGGFIITPALMAAGIKGILAVGTDLFHIFAKAIMGTTVHKKLGNVSV 140
Query: 403 PYALYFFALSIIAAFVGQHVLKKL 426
AL F A + I F+G + K L
Sbjct: 141 KLALGFLAGASIGTFIGGAINKGL 164
>gi|145493226|ref|XP_001432609.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399722|emb|CAK65212.1| unnamed protein product [Paramecium tetraurelia]
Length = 454
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/142 (21%), Positives = 66/142 (46%), Gaps = 6/142 (4%)
Query: 78 IVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYN 137
+V+ IG F + G+G G + +L L+ + K ++ +++ M + N
Sbjct: 33 LVIPISIGVFNSI-----GLGAQGFQIFILLLVFNYGKKDASLMNQPMQLAQYFLTSIEN 87
Query: 138 LRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAF 197
L++R + P ++ +L LL P+++ G +IG+ + V+ ++ L+L +
Sbjct: 88 LKKRKQP-NQPAVNENLVLLLGPMMISGCTIGLHSKDYIPTFFTIVITLISLLILMMTTY 146
Query: 198 LKGVETWKKETITKREAAKQLE 219
K + E++ +E K+ E
Sbjct: 147 KKTKQVRYLESLATKEQLKEFE 168
>gi|406888182|gb|EKD34745.1| hypothetical protein ACD_75C02211G0002 [uncultured bacterium]
Length = 381
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 60/120 (50%), Gaps = 2/120 (1%)
Query: 334 SQLVFYCAC--GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
+ L+F+ + G+ AG++ G +G GGGFI+ P + G+ ++ T LF I + M
Sbjct: 14 TNLIFFASVFVGVCAGLITGCIGAGGGFIIAPALMSAGVKGILAVGTDLFHIFAKAIMGS 73
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
V + + + A++F S+I A +G + + L + S + T+ ++ + LS
Sbjct: 74 VLHRKMGNISIGLAVHFLIGSLIGATLGGMLNRYLYDLNPVLSDLFITVVYVFILGFLSF 133
>gi|262277629|ref|ZP_06055422.1| membrane protein [alpha proteobacterium HIMB114]
gi|262224732|gb|EEY75191.1| membrane protein [alpha proteobacterium HIMB114]
Length = 351
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 58/104 (55%), Gaps = 4/104 (3%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G+ G++ LLG+GGGFIL P+ + +G+P ++ T+LFA+ F + + + +
Sbjct: 182 GVLIGLISSLLGVGGGFILVPILIYIIGMPAKLVPGTSLFAMIFVMIIVTLLHAIANYTI 241
Query: 402 VPYALYFFAL-SIIAAFVGQHVLKKLI--KILGRASIIIFTLSF 442
Y ++ AL S++ A +G + KL ++ G SI+I T F
Sbjct: 242 DIYLVFILALGSVVGAQLGSIISSKLAGEELRGLLSILILTFGF 285
>gi|52081145|ref|YP_079936.1| hypothetical protein BL02055 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|319644890|ref|ZP_07999123.1| YrkJ protein [Bacillus sp. BT1B_CT2]
gi|404490024|ref|YP_006714130.1| hypothetical protein BLi02790 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|423683115|ref|ZP_17657954.1| hypothetical protein MUY_02968 [Bacillus licheniformis WX-02]
gi|52004356|gb|AAU24298.1| conserved membrane protein YrkJ [Bacillus licheniformis DSM 13 =
ATCC 14580]
gi|52349027|gb|AAU41661.1| DUF81 transmembrane protein YrkJ [Bacillus licheniformis DSM 13 =
ATCC 14580]
gi|317392699|gb|EFV73493.1| YrkJ protein [Bacillus sp. BT1B_CT2]
gi|383439889|gb|EID47664.1| hypothetical protein MUY_02968 [Bacillus licheniformis WX-02]
Length = 260
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 71/134 (52%), Gaps = 9/134 (6%)
Query: 320 KIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSAT 378
K ++G++ R + + G+ G++G GG F+L PL L + IP +++ AT
Sbjct: 127 KTGTEGNQSDHIRFPK-ILTSIMTFVVGLASGIVGAGGAFLLVPLMLVIIKIPARITIAT 185
Query: 379 ALFAITFSSSMSVVEYYL-LKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 437
+L A+TF SS+ L + + P+ AL A S+IA+ +G + KK+ A ++
Sbjct: 186 SL-AVTFISSIGTTATKLVIGQVPIMPALVVMAASLIASPLGVFIGKKV-----NAKVLQ 239
Query: 438 FTLSFIIFVSALSL 451
L F+IF++ + +
Sbjct: 240 GILIFVIFLTTVKI 253
>gi|254510226|ref|ZP_05122293.1| membrane protein [Rhodobacteraceae bacterium KLH11]
gi|221533937|gb|EEE36925.1| membrane protein [Rhodobacteraceae bacterium KLH11]
Length = 321
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 54/103 (52%), Gaps = 2/103 (1%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ + +
Sbjct: 40 GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDLKMGT 99
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
+ A +G +L +K LG+ +++ L +++F+ +
Sbjct: 100 VLLVGGLAGAALGV-ILFNYLKSLGQVDLLV-KLCYVVFLGVI 140
>gi|262276252|ref|ZP_06054061.1| hypothetical protein VHA_003237 [Grimontia hollisae CIP 101886]
gi|262220060|gb|EEY71376.1| hypothetical protein VHA_003237 [Grimontia hollisae CIP 101886]
Length = 257
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 346 AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 405
AG + + G GGG + P L +G+PP ++ T + TF+SS + + YY K F +
Sbjct: 20 AGFIDAVAG-GGGMLTVPALLSIGLPPHIALGTNKLSATFASSTAALTYYRKKLFSPRFW 78
Query: 406 LYFFALSIIAAFVGQHVLK 424
+ F ++I A G V+
Sbjct: 79 SHAFLATLIGAVFGTLVVD 97
>gi|225017187|ref|ZP_03706379.1| hypothetical protein CLOSTMETH_01113 [Clostridium methylpentosum
DSM 5476]
gi|224950106|gb|EEG31315.1| hypothetical protein CLOSTMETH_01113 [Clostridium methylpentosum
DSM 5476]
Length = 117
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 43/92 (46%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
Q + G AG++ GL G GGG + PL GI + S AT++ I S +S Y
Sbjct: 3 QKTHFIVTGAVAGLLNGLFGAGGGVAVVPLLEHAGIEARKSHATSIAIIAALSVVSAAFY 62
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+F AL + ++ A VG +L K+
Sbjct: 63 LFNGKFDFWQALPYLPFGLVGAVVGAKLLPKI 94
>gi|341581528|ref|YP_004762020.1| putative permease [Thermococcus sp. 4557]
gi|340809186|gb|AEK72343.1| putative permease [Thermococcus sp. 4557]
Length = 254
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 42/81 (51%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
Y A G+ G++ + GLGGGF++ P LG+ + T+ A+ F+S S + Y +
Sbjct: 7 YFAVGVFIGILAAMFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSAIAYSRQR 66
Query: 399 RFPVPYALYFFALSIIAAFVG 419
R L + ++I A++G
Sbjct: 67 RIHYKAGLLLASTAVIGAYIG 87
>gi|410463419|ref|ZP_11316934.1| putative permease [Desulfovibrio magneticus str. Maddingley MBC34]
gi|409983446|gb|EKO39820.1| putative permease [Desulfovibrio magneticus str. Maddingley MBC34]
Length = 313
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 27/36 (75%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 378
G G++ G+ G+GGGF++ PL + +GIPP V++AT
Sbjct: 25 GGAVGLLSGIFGVGGGFLMTPLLIMMGIPPTVAAAT 60
>gi|154246428|ref|YP_001417386.1| hypothetical protein Xaut_2487 [Xanthobacter autotrophicus Py2]
gi|154160513|gb|ABS67729.1| protein of unknown function DUF81 [Xanthobacter autotrophicus Py2]
Length = 307
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 34/53 (64%), Gaps = 3/53 (5%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 390
A G+ G + G+ G+GGGF++ PL + G+PP V+ A+ + A +FS ++S
Sbjct: 19 AMGLAVGFISGMFGVGGGFLMTPLLIFTGVPPAVAVASVSPYMAASSFSGALS 71
>gi|381397639|ref|ZP_09923049.1| protein of unknown function DUF81 [Microbacterium laevaniformans
OR221]
gi|380775207|gb|EIC08501.1| protein of unknown function DUF81 [Microbacterium laevaniformans
OR221]
Length = 269
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 56/108 (51%), Gaps = 4/108 (3%)
Query: 323 SKGDEGTKWRASQLVFYCAC---GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSAT 378
S T R + F +C G+ AG++ GL G+GGG ++ P L L L ++++ T
Sbjct: 2 SPAASATPARRRDIRFVLSCIGVGLAAGLLSGLFGVGGGTVIVPMLVLLLRFDQRLAAGT 61
Query: 379 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+L AI ++S+ VV Y + AL A +++ A +G +L ++
Sbjct: 62 SLAAIVPTASVGVVSYAIHGSVAWIPALILAAGAVVGAQIGSWLLARI 109
>gi|376294959|ref|YP_005166189.1| hypothetical protein DND132_0167 [Desulfovibrio desulfuricans
ND132]
gi|323457520|gb|EGB13385.1| protein of unknown function DUF81 [Desulfovibrio desulfuricans
ND132]
Length = 425
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 11/130 (8%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
A G+ AG++ G +G GGGFI+ P + GI ++ T LF I + M V + L
Sbjct: 86 AIGVGAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKLGNV 145
Query: 401 PVPYALYFFALSIIAAFVG---QHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 457
V A+ F +I+ A G VL ++ IL A FI V +L LG +G
Sbjct: 146 SVSLAVVFLIGAILGATAGGVINRVLYEINPILSDA--------FITTVYSLMLGFLGFY 197
Query: 458 KMIKRIEHKE 467
+ + ++
Sbjct: 198 ALTDFLRSRK 207
>gi|242058939|ref|XP_002458615.1| hypothetical protein SORBIDRAFT_03g036735 [Sorghum bicolor]
gi|241930590|gb|EES03735.1| hypothetical protein SORBIDRAFT_03g036735 [Sorghum bicolor]
Length = 110
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 52/111 (46%), Gaps = 19/111 (17%)
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
P+F +G+PP+ +SA +F I SSSMS + SI G
Sbjct: 16 PIFFHVGVPPRSASARTMFLILLSSSMSTAQ------------------SITLGAEGATD 57
Query: 423 LKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM-IKRIEHKEYMGFD 472
I+ GRAS+I+ ++ ++ ++A + G+ ++ + +EYMGF
Sbjct: 58 ATSAIRKSGRASLIVLAVATVMALTAAVVLRSGVPRVWAQYTGGREYMGFK 108
>gi|375361158|ref|YP_005129197.1| hypothetical protein BACAU_0468 [Bacillus amyloliquefaciens subsp.
plantarum CAU B946]
gi|421732851|ref|ZP_16171967.1| UPF0721 transmembrane protein [Bacillus amyloliquefaciens subsp.
plantarum M27]
gi|371567152|emb|CCF04002.1| UPF0721 transmembrane protein [Bacillus amyloliquefaciens subsp.
plantarum CAU B946]
gi|407073212|gb|EKE46209.1| UPF0721 transmembrane protein [Bacillus amyloliquefaciens subsp.
plantarum M27]
Length = 258
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 53/99 (53%), Gaps = 9/99 (9%)
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 387
KW AS L F G V G+LG GG FIL P+ L L IP +++ A++L AITF S
Sbjct: 141 NKWLASSLAFLI------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 193
Query: 388 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 425
S+ + V + + AL S+IA+ +G V +K
Sbjct: 194 SIGATVGKVITGQVLFTPALVLMIASLIASPIGASVGQK 232
>gi|321312144|ref|YP_004204431.1| YrkJ [Bacillus subtilis BSn5]
gi|320018418|gb|ADV93404.1| YrkJ [Bacillus subtilis BSn5]
Length = 258
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 43/75 (57%), Gaps = 10/75 (13%)
Query: 318 RRKIASKGDE--GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQV 374
R + +G+E KW AS L F G V G+LG GG FIL P+ L L IP ++
Sbjct: 128 RGQAHREGEEVIFNKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRI 181
Query: 375 SSATALFAITFSSSM 389
+ A++L AITF SS+
Sbjct: 182 TVASSL-AITFLSSI 195
>gi|420198663|ref|ZP_14704355.1| hypothetical protein HMPREF9980_00100 [Staphylococcus epidermidis
NIHLM031]
gi|394273839|gb|EJE18266.1| hypothetical protein HMPREF9980_00100 [Staphylococcus epidermidis
NIHLM031]
Length = 254
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 60/119 (50%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYLLKRFPVPYALYFFAL--------SIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ + + A F L +IIA ++ +LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIAGKMFPLIFVASGGGAIIATYIPAIILKPLI-IIALSLVLIYTI 118
>gi|378550344|ref|ZP_09825560.1| hypothetical protein CCH26_09660 [Citricoccus sp. CH26A]
Length = 298
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 7/77 (9%)
Query: 311 AVALYKGRRKI-ASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG 369
A A+ +GRRK +S GDE K ++V G+ G+V GL+G GGGF++ P + LG
Sbjct: 113 AAAMIRGRRKAPSSGGDEAKKLPVVKVVLE---GLVVGLVTGLVGAGGGFLVVPALVLLG 169
Query: 370 ---IPPQVSSATALFAI 383
+P V ++ + A+
Sbjct: 170 GLSMPAAVGTSLVVIAM 186
>gi|154685010|ref|YP_001420171.1| hypothetical protein RBAM_005480 [Bacillus amyloliquefaciens FZB42]
gi|154350861|gb|ABS72940.1| YrkJ [Bacillus amyloliquefaciens FZB42]
Length = 258
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 9/100 (9%)
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 387
KW AS L F G V G+LG GG FIL P+ L L IP +++ A++L AITF S
Sbjct: 141 NKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 193
Query: 388 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
S+ + V + + AL S+IA+ +G V +K+
Sbjct: 194 SIGATVGKVITGQVLFAPALVLMIASLIASPIGASVGQKV 233
>gi|393762657|ref|ZP_10351283.1| hypothetical protein AGRI_06745 [Alishewanella agri BL06]
gi|392606279|gb|EIW89164.1| hypothetical protein AGRI_06745 [Alishewanella agri BL06]
Length = 262
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 39/87 (44%), Gaps = 1/87 (1%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
C+ AG + + G GGG + P L G+PP V T A TF S + V YY K
Sbjct: 16 CSVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSLTASVTYYRKKL 74
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKL 426
F Y + I A +G V+ +L
Sbjct: 75 FNPVYWRNSLLFTAIGALLGTLVVDQL 101
>gi|451348137|ref|YP_007446768.1| hypothetical protein KSO_017010 [Bacillus amyloliquefaciens IT-45]
gi|449851895|gb|AGF28887.1| hypothetical protein KSO_017010 [Bacillus amyloliquefaciens IT-45]
Length = 258
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 53/99 (53%), Gaps = 9/99 (9%)
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 387
KW AS L F G V G+LG GG FIL P+ L L IP +++ A++L AITF S
Sbjct: 141 NKWLASSLAFLI------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 193
Query: 388 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 425
S+ + V + + AL S+IA+ +G V +K
Sbjct: 194 SIGATVGKVITGQVLFTPALVLMIASLIASPIGASVGQK 232
>gi|52081735|ref|YP_080526.1| hypothetical protein BL02149 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|319647651|ref|ZP_08001869.1| YunE protein [Bacillus sp. BT1B_CT2]
gi|404490618|ref|YP_006714724.1| hypothetical protein BLi03429 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|423683731|ref|ZP_17658570.1| hypothetical protein MUY_03584 [Bacillus licheniformis WX-02]
gi|52004946|gb|AAU24888.1| conserved protein YunE [Bacillus licheniformis DSM 13 = ATCC 14580]
gi|52349623|gb|AAU42257.1| DUF81 transmembrane protein YunE [Bacillus licheniformis DSM 13 =
ATCC 14580]
gi|317389992|gb|EFV70801.1| YunE protein [Bacillus sp. BT1B_CT2]
gi|383440505|gb|EID48280.1| hypothetical protein MUY_03584 [Bacillus licheniformis WX-02]
Length = 273
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 43/89 (48%), Gaps = 8/89 (8%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLEL-GIP-------PQVSSATALFAITFSSSMSVVE 393
G AG +G L+GLGGG ++ P L L G+P PQ++ T+L I F+ S +
Sbjct: 9 LGFIAGTIGSLVGLGGGIVIVPSLLFLAGLPTLFDHVTPQMAVGTSLVVIIFTGLSSTIA 68
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHV 422
Y K L FF S + VG HV
Sbjct: 69 YMKYKTVDYKSGLIFFIGSGPGSIVGAHV 97
>gi|384264092|ref|YP_005419799.1| hypothetical protein BANAU_0461 [Bacillus amyloliquefaciens subsp.
plantarum YAU B9601-Y2]
gi|380497445|emb|CCG48483.1| UPF0721 transmembrane protein [Bacillus amyloliquefaciens subsp.
plantarum YAU B9601-Y2]
Length = 258
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 64/128 (50%), Gaps = 13/128 (10%)
Query: 305 GVSAYEAVALY----KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFI 360
G+ A AV L KG+ + KW AS L F G V G+LG GG FI
Sbjct: 113 GILAILAVVLMFIPKKGQEHSEDEEVIFNKWLASSLAFII------GGVSGILGAGGAFI 166
Query: 361 LGPLFLE-LGIPPQVSSATALFAITFSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFV 418
L P+ L L IP +++ A++L AITF SS+ + V + + AL S+IA+ +
Sbjct: 167 LVPIMLSILNIPVRITVASSL-AITFLSSIGATVGKVITGQVLFAPALVLMIASLIASPI 225
Query: 419 GQHVLKKL 426
G V +K+
Sbjct: 226 GASVGQKV 233
>gi|433446826|ref|ZP_20410718.1| sulfite exporter, TauE/SafE family [Anoxybacillus flavithermus
TNO-09.006]
gi|432000333|gb|ELK21233.1| sulfite exporter, TauE/SafE family [Anoxybacillus flavithermus
TNO-09.006]
Length = 258
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 70/138 (50%), Gaps = 14/138 (10%)
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQV 374
KG I+ KW A+ L F G++AG+VG G FIL P+ L L IP ++
Sbjct: 128 KGIDNISLDQVTFNKWLAAALAFIV--GVSAGIVGA----AGAFILVPIMLVVLKIPTRM 181
Query: 375 SSATALFAITFSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRA 433
+ AT+L AITF SS+ + V +F A S+IA+ +G +V KK+
Sbjct: 182 TIATSL-AITFISSIGATVGKITTGQFDFWTAFIMVVASLIASPIGANVGKKV-----NT 235
Query: 434 SIIIFTLSFIIFVSALSL 451
I+ F LS +I +A+ +
Sbjct: 236 KILQFILSALIVATAIKI 253
>gi|288959672|ref|YP_003450013.1| hypothetical protein AZL_028310 [Azospirillum sp. B510]
gi|288911980|dbj|BAI73469.1| hypothetical protein AZL_028310 [Azospirillum sp. B510]
Length = 305
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 63/131 (48%), Gaps = 4/131 (3%)
Query: 337 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYL 396
+ G G + G+ G+GGGF++ PL + +G+PP ++ T + +S V+ ++
Sbjct: 15 LLVLGMGWLVGFLSGMFGVGGGFLMTPLLIFIGVPPAIAVGTQANQLVAASVSGVLAHWR 74
Query: 397 LKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 456
V + ++ VG V L + LG+ I I TLS++ F+ ++GG+ L
Sbjct: 75 RGNVDVKLGVVMLGGGVLGTAVGVWVFGILQR-LGQIDIAI-TLSYVFFLG--TIGGMML 130
Query: 457 AKMIKRIEHKE 467
+ + I +
Sbjct: 131 VESSRAILRRR 141
>gi|387897005|ref|YP_006327301.1| hypothetical protein MUS_0502 [Bacillus amyloliquefaciens Y2]
gi|387171115|gb|AFJ60576.1| conserved hypothetical protein YrkJ [Bacillus amyloliquefaciens Y2]
Length = 276
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 9/100 (9%)
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 387
KW AS L F G V G+LG GG FIL P+ L L IP +++ A++L AITF S
Sbjct: 159 NKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 211
Query: 388 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
S+ + V + + AL S+IA+ +G V +K+
Sbjct: 212 SIGATVGKVITGQVLFAPALVLMIASLIASPIGASVGQKV 251
>gi|27376265|ref|NP_767794.1| hypothetical protein bll1154 [Bradyrhizobium japonicum USDA 110]
gi|27349405|dbj|BAC46419.1| bll1154 [Bradyrhizobium japonicum USDA 110]
Length = 380
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 94 AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGAISYW 148
>gi|206890428|ref|YP_002248551.1| hypothetical protein THEYE_A0709 [Thermodesulfovibrio yellowstonii
DSM 11347]
gi|206742366|gb|ACI21423.1| membrane protein, putative [Thermodesulfovibrio yellowstonii DSM
11347]
Length = 422
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 7/89 (7%)
Query: 326 DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAIT 384
DE R +F CG G++ ++G+GGGF+ PLF+ LG+ + T + I
Sbjct: 242 DEDFGGRRISWLFVAMCGFLVGLLAAVMGVGGGFVTFPLFVYVLGVSTPTTVGTDILQII 301
Query: 385 FSSSMSVVEYYLLKRFPVPYALYFFALSI 413
F++ S + Y + Y F+ L++
Sbjct: 302 FTAGYSSISQYAI------YGYVFYTLAM 324
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 43/86 (50%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
A G+ AG++ G +G GGGFI+ P + G+ ++ T F I + M V + L
Sbjct: 79 AVGLAAGLITGCIGAGGGFIIAPALMAAGVKGILAVGTDQFHIFAKAIMGTVIHKKLGNV 138
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKL 426
V A+ F S++ A G ++ + +
Sbjct: 139 CVGLAVAFVIGSVLGATTGGYIQRTI 164
>gi|359795015|ref|ZP_09297681.1| hypothetical protein MAXJ12_35646 [Mesorhizobium alhagi CCNWXJ12-2]
gi|359248639|gb|EHK52372.1| hypothetical protein MAXJ12_35646 [Mesorhizobium alhagi CCNWXJ12-2]
Length = 307
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 67/130 (51%), Gaps = 8/130 (6%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G + G+ G+GGGF++ PL + IPP ++ AT + SS + ++ KR
Sbjct: 18 LAMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPAIAVATGANQVIASSFSGALAHF--KR 75
Query: 400 FPVPYAL--YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLA 457
+ + L A I+ + +G + L+++LG+ + I +L +++F+ +GG+ L
Sbjct: 76 GTLDFKLGTVLLAGGIVGSTIGIYAF-TLLRVLGQLDLFI-SLLYVVFLGL--VGGLMLV 131
Query: 458 KMIKRIEHKE 467
+ I I +
Sbjct: 132 ESINAIRRAK 141
>gi|408526472|emb|CCK24646.1| hypothetical protein BN159_0267 [Streptomyces davawensis JCM 4913]
Length = 306
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 6/85 (7%)
Query: 311 AVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELG 369
AVA+ +GRRK +K ++ GI G+V GL+G GGGF++ P L L G
Sbjct: 110 AVAMIRGRRKATTKTHH-----ELPVLHVLLDGIVVGLVTGLVGAGGGFLVVPALALLGG 164
Query: 370 IPPQVSSATALFAITFSSSMSVVEY 394
+P V+ T+L I+ S + Y
Sbjct: 165 LPMTVAVGTSLLVISMKSFAGLAGY 189
>gi|297583461|ref|YP_003699241.1| hypothetical protein [Bacillus selenitireducens MLS10]
gi|297141918|gb|ADH98675.1| protein of unknown function DUF81 [Bacillus selenitireducens MLS10]
Length = 272
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 43/90 (47%), Gaps = 2/90 (2%)
Query: 319 RKIASKGDEGTKWR-ASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSS 376
R + +G +W Q V G G + GL G+GGG ++ P + L PP ++
Sbjct: 136 RNLYWTDHQGVEWTYGYQPVIGIMLGFIVGSISGLFGIGGGSLMVPAMILLFAFPPHLAV 195
Query: 377 ATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
AT++F I S+ + V + L+ YAL
Sbjct: 196 ATSMFMIMLSAIVGSVSHILVGNVHWLYAL 225
>gi|157962478|ref|YP_001502512.1| hypothetical protein Spea_2657 [Shewanella pealeana ATCC 700345]
gi|157847478|gb|ABV87977.1| protein of unknown function DUF81 [Shewanella pealeana ATCC 700345]
Length = 257
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G AG + + G GGG + P L +G+PP ++ T A +F SSM+ YY + F
Sbjct: 17 GFIAGFIDAVSG-GGGLLSIPALLTMGVPPHLALGTNKLAASFGSSMAAYTYYKQRLFSP 75
Query: 403 PYALYFFALSIIAAFVGQHVLKKL 426
+ F + I A +G ++ +
Sbjct: 76 SLWYHTFIATFIGAVIGTFIVYHI 99
>gi|423480829|ref|ZP_17457519.1| hypothetical protein IEQ_00607 [Bacillus cereus BAG6X1-2]
gi|401146715|gb|EJQ54226.1| hypothetical protein IEQ_00607 [Bacillus cereus BAG6X1-2]
Length = 255
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 48/86 (55%), Gaps = 12/86 (13%)
Query: 310 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
A+ ++ +R KG EG KW AS L F + G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGSEGIMYNKWLASLLAF------SVGSVSGIIGAGGAFLLVPIML 169
Query: 367 -ELGIPPQVSSATALFAITFSSSMSV 391
L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194
>gi|226314422|ref|YP_002774318.1| hypothetical protein BBR47_48370 [Brevibacillus brevis NBRC 100599]
gi|226097372|dbj|BAH45814.1| conserved hypothetical membrane protein [Brevibacillus brevis NBRC
100599]
Length = 274
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 8/102 (7%)
Query: 337 VFYCACGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSM 389
+ + G+ AG++G + GLGGG P L + PQV++AT+L I ++
Sbjct: 7 ILFVIIGVVAGVIGSIAGLGGGIFFVPALLYFANWYTPGSMNPQVAAATSLIVIAVTALS 66
Query: 390 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 431
S + Y K+ AL FF S A VG + L L+ + G
Sbjct: 67 SSLSYLKQKKVDKQSALLFFIGSAPGAIVGVY-LNTLLAVEG 107
>gi|410030356|ref|ZP_11280186.1| hypothetical protein MaAK2_14171 [Marinilabilia sp. AK2]
Length = 291
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 39/84 (46%), Gaps = 8/84 (9%)
Query: 328 GTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSS 387
G W A +++ G A + G LG+ G + L +GIPP V+SA FA F+S
Sbjct: 40 GENWNAEMFLWFLLVGFIAQTIDGALGMAYGVSSNSILLGIGIPPAVASAWVHFAQVFTS 99
Query: 388 --------SMSVVEYYLLKRFPVP 403
+ V++ L KR +P
Sbjct: 100 LASGISHLKLGNVQWSLAKRLMIP 123
>gi|154489152|ref|ZP_02030001.1| hypothetical protein BIFADO_02467 [Bifidobacterium adolescentis
L2-32]
gi|154083289|gb|EDN82334.1| hypothetical protein BIFADO_02467 [Bifidobacterium adolescentis
L2-32]
Length = 292
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 70/143 (48%), Gaps = 5/143 (3%)
Query: 305 GVSAYEAVALYKGRRKIASK-GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
G +A A A+ +K+ S+ G+ A ++ G+ G++ G G+GGG ++ P
Sbjct: 5 GETADRADAVRTESQKMKSRAGETALDGSARSIIVLIVVGMLVGVLSGTFGIGGGTMIVP 64
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
+ LG+ + ++AT++ AI +S V+ Y A F I +G +L
Sbjct: 65 ALVWLGLSQRNAAATSMLAIVPTSISGVISYATGGHVDWLAAALLFCGMFIGGQIGSWLL 124
Query: 424 KKLIKILGRASIIIFTLSFIIFV 446
+L +++ R IF ++F++FV
Sbjct: 125 SRLPELVLRW---IF-VAFLVFV 143
>gi|429504024|ref|YP_007185208.1| hypothetical protein B938_02510 [Bacillus amyloliquefaciens subsp.
plantarum AS43.3]
gi|429485614|gb|AFZ89538.1| hypothetical protein B938_02510 [Bacillus amyloliquefaciens subsp.
plantarum AS43.3]
Length = 258
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 9/100 (9%)
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSS 387
KW AS L F G V G+LG GG FIL P+ L L IP +++ A++L AITF S
Sbjct: 141 NKWLASSLAFII------GGVSGILGAGGAFILVPIMLSILNIPVRITVASSL-AITFLS 193
Query: 388 SM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
S+ + V + + AL S+IA+ +G V +K+
Sbjct: 194 SIGATVGKVITGQVLFAPALVLMIASLIASPIGATVGQKV 233
>gi|346994770|ref|ZP_08862842.1| membrane protein [Ruegeria sp. TW15]
Length = 306
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ + +
Sbjct: 25 GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRRTVDLRMGT 84
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+ A +G V L K LG+ +++ L ++IF+
Sbjct: 85 VLLVGGLAGAALGVVVFNYL-KSLGQVDLLV-KLCYVIFL 122
>gi|84515899|ref|ZP_01003260.1| membrane protein [Loktanella vestfoldensis SKA53]
gi|84510341|gb|EAQ06797.1| membrane protein [Loktanella vestfoldensis SKA53]
Length = 305
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 53/101 (52%), Gaps = 2/101 (1%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF++ PL +GIPP V+ AT+ I SS V+ ++ K
Sbjct: 29 GMFGVGGGFLITPLLFFIGIPPAVAVATSTNQIVASSISGVLAHFKRKTVDPVMGGVLLV 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
+I + +G + L + LG+ +++ L +++F+ A+ +
Sbjct: 89 GGVIGSAIGIVIFNYL-RALGQVDLLV-QLCYVVFLGAIGM 127
>gi|340503379|gb|EGR29975.1| hypothetical protein IMG5_145240 [Ichthyophthirius multifiliis]
Length = 162
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAA-FVGQH 421
P +LELGI P ++A ++ + F+S S +Y+L + + +YFF LS+I++ FV +
Sbjct: 52 PKWLELGISPTRTTACSICILFFTSFNSFFQYFLANVYNIEEIIYFFILSVISSQFVCNY 111
Query: 422 VLK 424
+L
Sbjct: 112 ILN 114
>gi|126740488|ref|ZP_01756175.1| membrane protein [Roseobacter sp. SK209-2-6]
gi|126718289|gb|EBA15004.1| membrane protein [Roseobacter sp. SK209-2-6]
Length = 306
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 51/96 (53%), Gaps = 2/96 (2%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ + + +
Sbjct: 29 GMFGVGGGFLMTPLLFFIGIPPTVAVATEANQIVASSFSGVLAHFRRRTVDIKMGVVLQI 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+I A +G V L K LG+ +++ L +++F+
Sbjct: 89 GGVIGAALGVVVFNYL-KSLGQVDLLV-KLCYVVFL 122
>gi|332186970|ref|ZP_08388711.1| hypothetical protein SUS17_2059 [Sphingomonas sp. S17]
gi|332012980|gb|EGI55044.1| hypothetical protein SUS17_2059 [Sphingomonas sp. S17]
Length = 304
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 7/117 (5%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G++ G+ G+GGGF+ PL + GIPP V++A+A +T +S V ++
Sbjct: 18 IALGAGVGLLSGMFGVGGGFLTTPLLIVYGIPPTVAAASAASQVTGASVSGVFAHWQRGG 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 456
V A I+ +F+G + + L +AS I T+ II+V L LG +GL
Sbjct: 78 VDVKMGGVLVAGGIVGSFIGAWIFRLL-----QASGQIDTVIAIIYV--LMLGWIGL 127
>gi|220925134|ref|YP_002500436.1| hypothetical protein Mnod_5288 [Methylobacterium nodulans ORS 2060]
gi|219949741|gb|ACL60133.1| protein of unknown function DUF81 [Methylobacterium nodulans ORS
2060]
Length = 307
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 30/44 (68%)
Query: 349 VGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVV 392
+ G+ G+GGGF++ P+ + LGIPP ++ AT I SS+ SV+
Sbjct: 27 ISGVFGIGGGFLMTPVLIVLGIPPAIAVATQTAPIVASSTTSVL 70
>gi|120402827|ref|YP_952656.1| hypothetical protein Mvan_1828 [Mycobacterium vanbaalenii PYR-1]
gi|119955645|gb|ABM12650.1| protein of unknown function DUF81 [Mycobacterium vanbaalenii PYR-1]
Length = 291
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 310 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 368
A+A+ +GR+ I + G+ G + ++V A G+ G+V GL+G GGGF++ P L L
Sbjct: 109 TAIAMLRGRKTIET-GETGHRLPVPKIV---AEGLIVGLVTGLVGAGGGFLVVPALALLG 164
Query: 369 GIPPQVSSATALFAITFSSSMSVVEY 394
G+P ++ T+L I S + Y
Sbjct: 165 GLPMPIAVGTSLIVIAMKSFAGLAGY 190
>gi|225352706|ref|ZP_03743729.1| hypothetical protein BIFPSEUDO_04335 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
gi|225156671|gb|EEG70065.1| hypothetical protein BIFPSEUDO_04335 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
Length = 308
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 47/90 (52%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ G++ G+ G+GGG ++ P + LG+ + ++AT++ AI +S V+ Y
Sbjct: 61 GVAVGLLSGMFGIGGGTVIVPALVWLGLTQRNAAATSMLAIVPTSISGVISYATCGNVDW 120
Query: 403 PYALYFFALSIIAAFVGQHVLKKLIKILGR 432
AL F + +G +L +L +++ R
Sbjct: 121 LAALLLFCGMFVGGQIGSWLLSRLPELVLR 150
>gi|206598110|gb|ACI15920.1| hypothetical protein [Bodo saltans]
Length = 507
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 87/413 (21%), Positives = 166/413 (40%), Gaps = 44/413 (10%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDI----PVIDYDLALL 157
I VPM ++ ++ +S+ + G + ++Y + + PT P+I+Y +
Sbjct: 96 ILVPMFAALIEMPLIAAVGLSQAAICGQSAFNMIYQVTKTMPTQTPNEVRPLINYQYLAI 155
Query: 158 FQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ 217
PL ++G +G + DW+ LL +L + R K + ++ T ++
Sbjct: 156 LLPLSLIGTLLGSLGGRVVPDWLRLALLFLLLTSVLHRVIEKAKRQYMQDASTS-SGSRI 214
Query: 218 LELIGNGYQTEECKCDPEYLSNDTTPEETREP-----KKSKVSIIENIYW---------- 262
E++ P+ D P E ++ S + NI +
Sbjct: 215 AEVVKLTSTRVVVHASPD----DANPNEDDGEGGDGIRQGVASELANINFVNESASLVLA 270
Query: 263 -------------KEFGLLVAVWAIILALQIAKNYTTTC-SVLYWVLNLLQIPVAGGV-- 306
+E LL + ++L I ++ T C SV + +L ++ I V G+
Sbjct: 271 DSFDGPLTIRSSRREMALLAIGFVVLLVCNIVRSSMTVCGSVDHALLFIIPISVLVGLWW 330
Query: 307 --SAYEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPL 364
A+ L + S +W + ++ I AG LLG GGG +L +
Sbjct: 331 LGKEMAAMTLTNVQSGFLSPDHMTFQWNDRTMTWFPFAAIIAGAGTALLGNGGGLVLSFV 390
Query: 365 FLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLK 424
E + P+ +SAT+ FA +S S + + YA +F +++ +GQ
Sbjct: 391 LFEAELTPEEASATSGFANLLIASESALLMLFQGQLVPDYATMYFVCGVVSTILGQSGFM 450
Query: 425 KLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKMIKRIEHKE--YMGFDNIC 475
I+ + +I+ +L+ II S L L GL ++I++++ + F +IC
Sbjct: 451 AYIRWSKKRFLIVTSLACIIGGSLLMLLSYGLFEVIEQVKRHDGSLWAFGHIC 503
>gi|428224218|ref|YP_007108315.1| hypothetical protein GEI7407_0765 [Geitlerinema sp. PCC 7407]
gi|427984119|gb|AFY65263.1| protein of unknown function DUF81 [Geitlerinema sp. PCC 7407]
Length = 275
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK--RF 400
G AG++ G LG+GGG IL PL + LG P + AT+ AI +S ++ + + RF
Sbjct: 25 GFLAGILAGFLGIGGGTILVPLMVTLGFAPVQAVATSSLAILITSLSGSLQNWRMGYLRF 84
Query: 401 PVPYALYFFAL--SIIAAFVGQHV 422
P AL AL + I AF+ ++
Sbjct: 85 PQVLALGLPALLTAQIGAFLANYI 108
>gi|418327069|ref|ZP_12938243.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
gi|365223948|gb|EHM65221.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
Length = 254
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 61/118 (51%), Gaps = 10/118 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A + ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWDLSIILMIVAFGFIASFIDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 439
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASAGGAIIATYIPANILKPLI-IIALSLVLIYT 117
>gi|255263607|ref|ZP_05342949.1| membrane protein, putative [Thalassiobium sp. R2A62]
gi|255105942|gb|EET48616.1| membrane protein, putative [Thalassiobium sp. R2A62]
Length = 304
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 2/96 (2%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ K +
Sbjct: 29 GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRKTVDLKMGTVLML 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
++ A +G + L K LG+ +++ L +++F+
Sbjct: 89 GGLVGAALGVFIFNYL-KSLGQVDLLV-KLCYVVFL 122
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 5/67 (7%)
Query: 329 TKWRASQLVFYC----ACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAI 383
TK+R S L A G+ G++ ++G+GGGFI+ P + LG+P +V T+LF I
Sbjct: 161 TKFRVSGLYISVLPPLAVGVVVGILAAIMGVGGGFIMVPAMIYLLGMPTKVVVGTSLFQI 220
Query: 384 TFSSSMS 390
F ++ +
Sbjct: 221 IFVTAFT 227
>gi|257062790|ref|YP_003142462.1| permease [Slackia heliotrinireducens DSM 20476]
gi|256790443|gb|ACV21113.1| predicted permease [Slackia heliotrinireducens DSM 20476]
Length = 277
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVEY 394
G+ G++ GL G+GGG I+ PLF L G+PP ++AT+LF + +S +V++
Sbjct: 11 GLLVGVLSGLFGIGGGTIMVPLFRLAFGLPPIGATATSLFTVIPTSLSGLVKH 63
>gi|160902447|ref|YP_001568028.1| hypothetical protein Pmob_0983 [Petrotoga mobilis SJ95]
gi|160360091|gb|ABX31705.1| protein of unknown function DUF81 [Petrotoga mobilis SJ95]
Length = 246
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 16/97 (16%)
Query: 338 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMS------- 390
F+ G A +V G LG+ G I L L G+PP +SSA+ FA F++S+S
Sbjct: 6 FFIIVGFLAQVVDGALGMAYGTISNALLLSAGVPPAISSASVHFAEIFTTSISGFSHLKL 65
Query: 391 -VVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
V+ L K+ +P I+ +G ++L +
Sbjct: 66 GNVDKSLFKKLLIP--------GIVGGVLGAYILTNI 94
>gi|326386778|ref|ZP_08208399.1| hypothetical protein Y88_2671 [Novosphingobium nitrogenifigens DSM
19370]
gi|326208831|gb|EGD59627.1| hypothetical protein Y88_2671 [Novosphingobium nitrogenifigens DSM
19370]
Length = 345
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 61/136 (44%), Gaps = 14/136 (10%)
Query: 278 LQIAKNYTTTCSVLYWVL-----NLLQIPVAGGVSAYEAVALYKGRRKIASK--GDEGTK 330
LQ TT +VLY V+ +L+ + G + A + K RR+ +
Sbjct: 145 LQALGQIDTTINVLYVVMLGSIGSLMGVESFGAIRARQTGRAQKARRRRHHPLVASLPLR 204
Query: 331 WRASQLVFYCA------CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAI 383
WR + Y + GI G++ L+G+GGGFIL P L LG+ V T+LF I
Sbjct: 205 WRFYRSGLYISPLAPLLLGIATGILTMLMGIGGGFILVPAMLYILGMSVTVVVGTSLFQI 264
Query: 384 TFSSSMSVVEYYLLKR 399
F + + + + L R
Sbjct: 265 LFVTMATTMMHALTTR 280
>gi|240102617|ref|YP_002958926.1| putative permease [Thermococcus gammatolerans EJ3]
gi|239910171|gb|ACS33062.1| Predicted permease [Thermococcus gammatolerans EJ3]
Length = 243
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 7/82 (8%)
Query: 318 RRKIASKGDEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVS 375
R+K +EG + LV GI +G+V GLLG+ GG + PLF +GIP + +
Sbjct: 116 RKKCRPHDEEGAVHYSRVPLV-----GILSGLVSGLLGISGGVLNVPLFHTFVGIPMRYA 170
Query: 376 SATALFAITFSSSMSVVEYYLL 397
T+ FA+ F++ +E+Y L
Sbjct: 171 VGTSSFALFFTALAGAIEHYRL 192
>gi|399991606|ref|YP_006571846.1| hypothetical protein PGA1_c03930 [Phaeobacter gallaeciensis DSM
17395 = CIP 105210]
gi|398656161|gb|AFO90127.1| hypothetical protein PGA1_c03930 [Phaeobacter gallaeciensis DSM
17395 = CIP 105210]
Length = 307
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 31/49 (63%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++
Sbjct: 25 GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 73
>gi|400753245|ref|YP_006561613.1| hypothetical protein PGA2_c03480 [Phaeobacter gallaeciensis 2.10]
gi|398652398|gb|AFO86368.1| hypothetical protein PGA2_c03480 [Phaeobacter gallaeciensis 2.10]
Length = 307
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 31/49 (63%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++
Sbjct: 25 GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 73
>gi|335423623|ref|ZP_08552644.1| hypothetical protein SSPSH_13047 [Salinisphaera shabanensis E1L3A]
gi|334891448|gb|EGM29696.1| hypothetical protein SSPSH_13047 [Salinisphaera shabanensis E1L3A]
Length = 287
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 60/132 (45%), Gaps = 16/132 (12%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY------ 395
G A +V G LG+G G + + LGI P V+SA+ FA +++ S +Y
Sbjct: 5 AGFVAQLVDGSLGMGYGLTSSTMLIALGISPAVASASIHFAEIATTAASGTAHYRFGNVD 64
Query: 396 --LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGG 453
LLKR VP AL AFVG L + L + + F L ++V+A L G
Sbjct: 65 VALLKRMVVPGAL--------GAFVGALFLGSISGDLIKPFVSAFLLLLGVYVTASVLLG 116
Query: 454 VGLAKMIKRIEH 465
+ + K + H
Sbjct: 117 LRRTREKKLLSH 128
>gi|418323949|ref|ZP_12935206.1| sulfite exporter TauE/SafE [Staphylococcus pettenkoferi VCU012]
gi|365228878|gb|EHM70051.1| sulfite exporter TauE/SafE [Staphylococcus pettenkoferi VCU012]
Length = 274
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 53/109 (48%), Gaps = 2/109 (1%)
Query: 316 KGRRKIASKGDEGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQ 373
K + + + +EG + + +F + G++ GL G+GGG ++ PL L PP
Sbjct: 136 KDKYRRSYTDNEGVTYHYNVPPLFAFVATLFIGILTGLFGIGGGALMTPLMLIVFRFPPH 195
Query: 374 VSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
V+ T++ I FSS MS + + + Y+L SII A +G +
Sbjct: 196 VAVGTSMLMIFFSSLMSSLGHMVQGHVAWGYSLVLIISSIIGAQIGVRI 244
>gi|56695290|ref|YP_165638.1| hypothetical protein SPO0375 [Ruegeria pomeroyi DSS-3]
gi|56677027|gb|AAV93693.1| membrane protein [Ruegeria pomeroyi DSS-3]
Length = 316
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 31/49 (63%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
G++ G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++
Sbjct: 35 GILSGMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHF 83
>gi|77360916|ref|YP_340491.1| hypothetical protein PSHAa1992 [Pseudoalteromonas haloplanktis
TAC125]
gi|76875827|emb|CAI87048.1| conserved protein of unknown functions similar to YfcA
[Pseudoalteromonas haloplanktis TAC125]
Length = 258
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 4/133 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
CA + AG + + G GGG + P L G+PP ++ T A +F S + YY
Sbjct: 15 CAVALAAGFIDAIAG-GGGMLTVPALLTAGLPPHLTLGTNKLAASFGSLTASYTYYKKDL 73
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKL-IKILGR--ASIIIFTLSFIIFVSALSLGGVGL 456
F + L + I A +G ++ L I L + III + +F S + G L
Sbjct: 74 FNPKFWLASIFATAIGALIGTLIVDHLSIDFLNKLLPIIIIVVACYSLFGSLSTTEGSEL 133
Query: 457 AKMIKRIEHKEYM 469
K+ + ++ K+++
Sbjct: 134 PKLTQTMKIKQWL 146
>gi|407777273|ref|ZP_11124543.1| hypothetical protein NA2_04876 [Nitratireductor pacificus pht-3B]
gi|407300973|gb|EKF20095.1| hypothetical protein NA2_04876 [Nitratireductor pacificus pht-3B]
Length = 306
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 62/128 (48%), Gaps = 4/128 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G + G+ G+GGGF++ PL + IPP ++ AT + SS + ++ K
Sbjct: 18 LAMGGAVGFLSGMFGVGGGFLITPLLIFYNIPPAIAVATGANQVIASSFSGALVHFKRKT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V A I+ A +G L + LG+ +I+ +L +++F+ +GG+ L +
Sbjct: 78 LDVKLGTVLLAGGIVGASLGAMAFAYL-RRLGQLDLIV-SLLYVLFLGV--VGGLMLLES 133
Query: 460 IKRIEHKE 467
++ +
Sbjct: 134 VRALRRAR 141
>gi|316932351|ref|YP_004107333.1| hypothetical protein [Rhodopseudomonas palustris DX-1]
gi|315600065|gb|ADU42600.1| protein of unknown function DUF81 [Rhodopseudomonas palustris DX-1]
Length = 320
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 69/134 (51%), Gaps = 6/134 (4%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
++ A G G V G+ G+GGGF++ PL + +GI P V+ A+ + SS + Y
Sbjct: 26 NVLLVLAMGAAVGFVSGMFGVGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALSY 85
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHV-LKKLIKILGRASIIIFTLSFIIFVSALSLGG 453
+ +R + L F L A G V L++ +G+ ++I LS+++ ++A +GG
Sbjct: 86 W--RRRAIDPLLAFILLCGGIAGTGLGVWFFVLMRSVGQLDLVI-ALSYVVLLTA--VGG 140
Query: 454 VGLAKMIKRIEHKE 467
+ +++ I+ I
Sbjct: 141 LMVSEGIRAIRRTS 154
>gi|145514235|ref|XP_001443028.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410389|emb|CAK75631.1| unnamed protein product [Paramecium tetraurelia]
Length = 473
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
A G AG V G LG+G GF++ P L G+ P+ +SAT+ F
Sbjct: 339 AGGFMAGFVSGFLGMGAGFVMVPTLLYSGLIPRCASATSAF 379
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 24/114 (21%), Positives = 58/114 (50%), Gaps = 2/114 (1%)
Query: 105 PMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLD-IPVIDYDLALLFQPLLV 163
P+L +I+ + + ++ C++ GG + V + +++P D P+I+Y +A++F +
Sbjct: 72 PLLEMILNYSQSEATHIAYCLMFGGTLLNTVLLMFEKNPEDDRRPIINYRIAIIFNLAVP 131
Query: 164 LGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQ 217
++G + F + T++L LFL + ++ K+ + ++AK+
Sbjct: 132 FATNLGSSL-ASFLPQLYTLILQELFLFGVAPILWQKAKSAKEAELKISDSAKE 184
>gi|254440248|ref|ZP_05053742.1| conserved domain protein, putative [Octadecabacter antarcticus 307]
gi|198255694|gb|EDY80008.1| conserved domain protein, putative [Octadecabacter antarcticus 307]
Length = 305
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 50/96 (52%), Gaps = 2/96 (2%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ + K +
Sbjct: 29 GMFGVGGGFLMTPLLFFIGIPPAVAVATEAVQIVASSFSGVLAHLRRKTVDLRMGTVLLI 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+I A +G VL +K LG+ +++ L +++F+
Sbjct: 89 GGLIGAAIGV-VLFNYLKSLGQVDLLV-RLCYVVFL 122
>gi|77462758|ref|YP_352262.1| hypothetical protein RSP_2206 [Rhodobacter sphaeroides 2.4.1]
gi|77387176|gb|ABA78361.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1]
Length = 306
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 2/100 (2%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G + G+ G+GGGF++ PL +GIPP V+ AT + SS V+ K L
Sbjct: 25 GFMSGMFGVGGGFLITPLLFFIGIPPAVAVATGANQVVASSVSGVLAQLKRKTVDFRMGL 84
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
A + + G V +L LG+ + + LS+++F+
Sbjct: 85 VLLAGGVAGSAFGVAVFARL-NTLGQVDLFV-QLSYVLFL 122
>gi|423643404|ref|ZP_17619022.1| hypothetical protein IK9_03349 [Bacillus cereus VD166]
gi|401275408|gb|EJR81375.1| hypothetical protein IK9_03349 [Bacillus cereus VD166]
Length = 266
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G+ AG +G L+GLGGG I+ PL + L + PQ++ T++ + F+ S + Y KR
Sbjct: 9 IGLIAGTIGSLVGLGGGIIIVPLLIGLHSLSPQLAVGTSMVTVVFTGLSSTLTYMKHKR 67
>gi|223042876|ref|ZP_03612924.1| putative membrane protein [Staphylococcus capitis SK14]
gi|417905704|ref|ZP_12549503.1| putative membrane protein [Staphylococcus capitis VCU116]
gi|222443730|gb|EEE49827.1| putative membrane protein [Staphylococcus capitis SK14]
gi|341598598|gb|EGS41098.1| putative membrane protein [Staphylococcus capitis VCU116]
Length = 275
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 50/102 (49%), Gaps = 2/102 (1%)
Query: 327 EGTKWRASQLVFYC-ACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG + S F+ + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYHYSVPPFFAFVATLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
FSS MS + + + Y++ SII A +G V + L
Sbjct: 208 FSSVMSSIGHIVQGHVAWGYSVALIISSIIGAQIGVKVNRSL 249
>gi|46201920|ref|ZP_00054109.2| COG0730: Predicted permeases [Magnetospirillum magnetotacticum
MS-1]
Length = 308
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 5/64 (7%)
Query: 330 KWRASQL----VFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAIT 384
++R SQL + A G+ G++ L+G+GGGFI+ P + LG+P V T+LF I
Sbjct: 166 RFRRSQLYISAIVPAAIGLLVGILAALMGVGGGFIMVPAMIYLLGMPTAVVVGTSLFQII 225
Query: 385 FSSS 388
F ++
Sbjct: 226 FVTA 229
>gi|407972997|ref|ZP_11153910.1| hypothetical protein NA8A_01815 [Nitratireductor indicus C115]
gi|407431768|gb|EKF44439.1| hypothetical protein NA8A_01815 [Nitratireductor indicus C115]
Length = 306
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 47/98 (47%), Gaps = 1/98 (1%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G + G+ G+GGGF++ PL + IPP ++ AT + SS + ++ K
Sbjct: 18 LAMGGAVGFLSGMFGVGGGFLITPLLIFYNIPPSIAVATGANQVIASSFSGALAHFKRKS 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 437
V A I+ A +G V L + LG+ +I+
Sbjct: 78 LDVKLGTVLLAGGIVGASLGAVVFAYL-RRLGQLDLIV 114
>gi|404445797|ref|ZP_11010928.1| hypothetical protein MVAC_21158 [Mycobacterium vaccae ATCC 25954]
gi|403651615|gb|EJZ06726.1| hypothetical protein MVAC_21158 [Mycobacterium vaccae ATCC 25954]
Length = 291
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 45/79 (56%), Gaps = 5/79 (6%)
Query: 310 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 368
AVA+ +GRR + + EG + ++V A G+ G+V GL+G GGGF++ P L L
Sbjct: 109 TAVAMLRGRRAVGTS-REGHRLPVPKIV---AEGLVVGLVTGLVGAGGGFLVVPALALLG 164
Query: 369 GIPPQVSSATALFAITFSS 387
G+P ++ T+L I S
Sbjct: 165 GLPMPIAVGTSLIVIAMKS 183
>gi|297528876|ref|YP_003670151.1| hypothetical protein GC56T3_0521 [Geobacillus sp. C56-T3]
gi|297252128|gb|ADI25574.1| protein of unknown function DUF81 [Geobacillus sp. C56-T3]
Length = 300
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 51/116 (43%), Gaps = 14/116 (12%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 395
G AG VG L GLGGG I+ P L G + PQ + T+L I F+ S + Y
Sbjct: 10 GFIAGTVGSLAGLGGGVIIVPSLLFFGALGWLSAVTPQTAVGTSLVVIIFNGLSSTLSYM 69
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
K L F S+ A +G V L S F+L F +F+ A+SL
Sbjct: 70 KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118
>gi|261420421|ref|YP_003254103.1| hypothetical protein GYMC61_3061 [Geobacillus sp. Y412MC61]
gi|319768087|ref|YP_004133588.1| hypothetical protein [Geobacillus sp. Y412MC52]
gi|261376878|gb|ACX79621.1| protein of unknown function DUF81 [Geobacillus sp. Y412MC61]
gi|317112953|gb|ADU95445.1| protein of unknown function DUF81 [Geobacillus sp. Y412MC52]
Length = 300
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 51/116 (43%), Gaps = 14/116 (12%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEYY 395
G AG VG L GLGGG I+ P L G + PQ + T+L I F+ S + Y
Sbjct: 10 GFIAGTVGSLAGLGGGVIIVPSLLFFGALGWLSAVTPQTAVGTSLVVIIFNGLSSTLSYM 69
Query: 396 LLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
K L F S+ A +G V L S F+L F +F+ A+SL
Sbjct: 70 KDKMVDYQSGLLFCLGSVPGAVIGAWVNNTL-------SAAHFSLYFGLFLIAMSL 118
>gi|126461650|ref|YP_001042764.1| hypothetical protein Rsph17029_0881 [Rhodobacter sphaeroides ATCC
17029]
gi|221638621|ref|YP_002524883.1| hypothetical protein RSKD131_0522 [Rhodobacter sphaeroides KD131]
gi|332557643|ref|ZP_08411965.1| hypothetical protein RSWS8N_01290 [Rhodobacter sphaeroides WS8N]
gi|429206978|ref|ZP_19198238.1| Sulfite exporter TauE/SafE [Rhodobacter sp. AKP1]
gi|126103314|gb|ABN75992.1| protein of unknown function DUF81 [Rhodobacter sphaeroides ATCC
17029]
gi|221159402|gb|ACM00382.1| Hypothetical Protein RSKD131_0522 [Rhodobacter sphaeroides KD131]
gi|332275355|gb|EGJ20670.1| hypothetical protein RSWS8N_01290 [Rhodobacter sphaeroides WS8N]
gi|428189973|gb|EKX58525.1| Sulfite exporter TauE/SafE [Rhodobacter sp. AKP1]
Length = 306
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 2/100 (2%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G + G+ G+GGGF++ PL +GIPP V+ AT + SS V+ K L
Sbjct: 25 GFMSGMFGVGGGFLITPLLFFIGIPPAVAVATGANQVVASSVSGVLAQLKRKTVDFRMGL 84
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
A + + G V +L LG+ + + LS+++F+
Sbjct: 85 VLLAGGVAGSAFGVAVFARL-NTLGQVDLFV-QLSYVLFL 122
>gi|229056545|ref|ZP_04195952.1| hypothetical protein bcere0026_6680 [Bacillus cereus AH603]
gi|228720758|gb|EEL72315.1| hypothetical protein bcere0026_6680 [Bacillus cereus AH603]
Length = 257
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 63/122 (51%), Gaps = 13/122 (10%)
Query: 310 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
A+ ++ +R KG EG KW AS L F G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-DRKGSEGIMYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169
Query: 367 -ELGIPPQVSSATALFAITFSSSMSVVEYYLLK-RFPVPYALYFFALSIIAAFVGQHVLK 424
L +P + + AT++ AITF SS+ + ++ + V AL S+ AA +G V K
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGITTGKVITGQIVVIPALIVAIASLFAAPLGVKVGK 228
Query: 425 KL 426
+L
Sbjct: 229 RL 230
>gi|423474379|ref|ZP_17451118.1| hypothetical protein IEM_05680 [Bacillus cereus BAG6O-2]
gi|402423143|gb|EJV55362.1| hypothetical protein IEM_05680 [Bacillus cereus BAG6O-2]
Length = 255
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)
Query: 310 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
A+ ++ +R KG EG KW AS L F G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGIMYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169
Query: 367 -ELGIPPQVSSATALFAITFSSSMSV 391
L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194
>gi|260907045|ref|ZP_05915367.1| hypothetical protein BlinB_17054 [Brevibacterium linens BL2]
Length = 266
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 53/107 (49%), Gaps = 4/107 (3%)
Query: 314 LYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELG-IPP 372
+ +GRR+ + + + S+++ A G G+V G +G GGGF++ P + L +P
Sbjct: 126 MIRGRRQTGDQHERPPELPISKVL---ATGAVVGLVSGFVGAGGGFLIVPALVLLARLPM 182
Query: 373 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
+ AT+L IT S+ + Y L A+ L++ +F+G
Sbjct: 183 PRAVATSLLIITMQSASGLAGYALSTPLDWSLAVAIAGLAVAGSFIG 229
>gi|420163862|ref|ZP_14670596.1| hypothetical protein HMPREF9995_07828 [Staphylococcus epidermidis
NIHLM095]
gi|420168565|ref|ZP_14675173.1| hypothetical protein HMPREF9993_07496 [Staphylococcus epidermidis
NIHLM087]
gi|394232988|gb|EJD78599.1| hypothetical protein HMPREF9995_07828 [Staphylococcus epidermidis
NIHLM095]
gi|394233274|gb|EJD78882.1| hypothetical protein HMPREF9993_07496 [Staphylococcus epidermidis
NIHLM087]
Length = 254
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 60/118 (50%), Gaps = 10/118 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVTFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 439
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117
>gi|86356207|ref|YP_468099.1| permease [Rhizobium etli CFN 42]
gi|86280309|gb|ABC89372.1| putative permease protein [Rhizobium etli CFN 42]
Length = 307
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
+ + A VG + L++ +G+ +II +L ++IF+ ++GG+ L +
Sbjct: 78 LDIKLGTVLLVGGLSGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
IK +
Sbjct: 134 IKAMRR 139
>gi|423455682|ref|ZP_17432535.1| hypothetical protein IEE_04426 [Bacillus cereus BAG5X1-1]
gi|401134319|gb|EJQ41936.1| hypothetical protein IEE_04426 [Bacillus cereus BAG5X1-1]
Length = 255
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)
Query: 310 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
A+ ++ +R KG EG KW AS L F G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGIMYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169
Query: 367 -ELGIPPQVSSATALFAITFSSSMSV 391
L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194
>gi|390960291|ref|YP_006424125.1| putative permease [Thermococcus sp. CL1]
gi|390518599|gb|AFL94331.1| putative permease [Thermococcus sp. CL1]
Length = 254
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 43/85 (50%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+ V Y A G+ G++ L GLGGGF++ P LG+ + T+ A+ F+S S + Y
Sbjct: 3 KYVSYFAVGVFIGILAALFGLGGGFLIVPTLNFLGVEIHHAVGTSSAAVVFTSLSSALAY 62
Query: 395 YLLKRFPVPYALYFFALSIIAAFVG 419
R L + ++I A++G
Sbjct: 63 SRQGRIHYKVGLLLASTAVIGAYIG 87
>gi|418327458|ref|ZP_12938613.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
gi|418630881|ref|ZP_13193353.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
gi|365232948|gb|EHM73921.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
gi|374836191|gb|EHR99779.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
Length = 254
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 61/119 (51%), Gaps = 10/119 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ A G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVAFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFTL 440
S +++ LK + L F A +IIA ++ +LK LI I+ + ++I+T+
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPAIILKPLI-IIALSLVLIYTI 118
>gi|423367325|ref|ZP_17344758.1| hypothetical protein IC3_02427 [Bacillus cereus VD142]
gi|401085435|gb|EJP93677.1| hypothetical protein IC3_02427 [Bacillus cereus VD142]
Length = 257
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)
Query: 310 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
A+ ++ +R KG EG KW AS L F G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169
Query: 367 -ELGIPPQVSSATALFAITFSSSMSV 391
L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194
>gi|358639366|dbj|BAL26663.1| permease, putative [Azoarcus sp. KH32C]
Length = 276
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS--ATALFAITFSSSMSVVEYYLLKRF 400
G G + G+LG+GGGF++ P L +P + S AT+L AI +SS++ V L R
Sbjct: 163 GAATGFLSGMLGVGGGFVIVPA-LRATLPLSMHSAVATSLMAIALTSSITFVTCILQGRI 221
Query: 401 -PVPYALYFFALSIIAAFVGQHVLKKL 426
P+ A+ F A S++ +G+ V ++
Sbjct: 222 VPLLAAVPFVAGSVLGMVLGRRVAPRI 248
>gi|229165721|ref|ZP_04293489.1| hypothetical protein bcere0007_6980 [Bacillus cereus AH621]
gi|423595218|ref|ZP_17571249.1| hypothetical protein IIG_04086 [Bacillus cereus VD048]
gi|228617722|gb|EEK74779.1| hypothetical protein bcere0007_6980 [Bacillus cereus AH621]
gi|401222489|gb|EJR29079.1| hypothetical protein IIG_04086 [Bacillus cereus VD048]
Length = 257
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)
Query: 310 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
A+ ++ +R KG EG KW AS L F G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169
Query: 367 -ELGIPPQVSSATALFAITFSSSMSV 391
L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194
>gi|326332008|ref|ZP_08198293.1| putative membrane protein [Nocardioidaceae bacterium Broad-1]
gi|325950146|gb|EGD42201.1| putative membrane protein [Nocardioidaceae bacterium Broad-1]
Length = 311
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 40/78 (51%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G+V GL G+GGG ++ P + LG+PP + A L A + + S+ ++ + A
Sbjct: 21 GIVVGLTGMGGGALMTPALIFLGVPPTTAVANDLVAASVNKSVGAAVHWRHGSPNIRLAT 80
Query: 407 YFFALSIIAAFVGQHVLK 424
Y S+ AF+G V K
Sbjct: 81 YLIIGSVPCAFLGTFVHK 98
>gi|395790715|ref|ZP_10470175.1| hypothetical protein MEC_00166 [Bartonella alsatica IBS 382]
gi|395409467|gb|EJF76057.1| hypothetical protein MEC_00166 [Bartonella alsatica IBS 382]
Length = 304
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 340 CACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 392
A G+ G++ ++G+GGGF + P L L +P V T+LF ITF SS + V
Sbjct: 179 LAIGLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232
>gi|418412109|ref|ZP_12985374.1| hypothetical protein HMPREF9281_00978 [Staphylococcus epidermidis
BVS058A4]
gi|420184668|ref|ZP_14690777.1| hypothetical protein HMPREF9986_02847 [Staphylococcus epidermidis
NIHLM040]
gi|394257319|gb|EJE02241.1| hypothetical protein HMPREF9986_02847 [Staphylococcus epidermidis
NIHLM040]
gi|410890123|gb|EKS37923.1| hypothetical protein HMPREF9281_00978 [Staphylococcus epidermidis
BVS058A4]
Length = 254
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 60/118 (50%), Gaps = 10/118 (8%)
Query: 330 KWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM 389
W S ++ G A V ++G GGG I P L +G+PP V+ T FA +FS+
Sbjct: 2 DWNLSIMLMIVTFGFIASFVDSVVG-GGGLISTPALLAVGLPPSVALGTNKFASSFSTLT 60
Query: 390 SVVEYYL-----LKRFPVPYALYFFAL---SIIAAFVGQHVLKKLIKILGRASIIIFT 439
S +++ LK + L F A +IIA ++ ++LK LI I+ + ++I+T
Sbjct: 61 SALKFLRSGKVDLKIVGKMFPLIFVASGGGAIIATYIPANILKPLI-IIALSLVLIYT 117
>gi|332185052|ref|ZP_08386801.1| hypothetical protein SUS17_418 [Sphingomonas sp. S17]
gi|332014776|gb|EGI56832.1| hypothetical protein SUS17_418 [Sphingomonas sp. S17]
Length = 253
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 30/129 (23%), Positives = 59/129 (45%), Gaps = 12/129 (9%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
+ A G A +V G LG+ G I L + LG+PP +SA TF++++S + + + +
Sbjct: 14 FVAVGFAAQVVDGALGMAFGVISNTLLMSLGVPPAAASAGVHSVETFTTAVSGISHAVHR 73
Query: 399 RFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAK 458
L +I +G +VL + + + I+++ L G+G+
Sbjct: 74 NVDWKLFLRLMIPGVIGGALGAYVLSNIDAAVVKPFILMY------------LAGIGVYL 121
Query: 459 MIKRIEHKE 467
+I+ + H+
Sbjct: 122 LIRALRHRP 130
>gi|163938698|ref|YP_001643582.1| hypothetical protein BcerKBAB4_0692 [Bacillus weihenstephanensis
KBAB4]
gi|229131719|ref|ZP_04260596.1| hypothetical protein bcere0014_6720 [Bacillus cereus BDRD-ST196]
gi|423485997|ref|ZP_17462679.1| hypothetical protein IEU_00620 [Bacillus cereus BtB2-4]
gi|423491721|ref|ZP_17468365.1| hypothetical protein IEW_00619 [Bacillus cereus CER057]
gi|423501487|ref|ZP_17478104.1| hypothetical protein IEY_04714 [Bacillus cereus CER074]
gi|423515545|ref|ZP_17492026.1| hypothetical protein IG7_00615 [Bacillus cereus HuA2-4]
gi|163860895|gb|ABY41954.1| protein of unknown function DUF81 [Bacillus weihenstephanensis
KBAB4]
gi|228651773|gb|EEL07733.1| hypothetical protein bcere0014_6720 [Bacillus cereus BDRD-ST196]
gi|401153579|gb|EJQ61004.1| hypothetical protein IEY_04714 [Bacillus cereus CER074]
gi|401158654|gb|EJQ66044.1| hypothetical protein IEW_00619 [Bacillus cereus CER057]
gi|401166933|gb|EJQ74231.1| hypothetical protein IG7_00615 [Bacillus cereus HuA2-4]
gi|402440558|gb|EJV72550.1| hypothetical protein IEU_00620 [Bacillus cereus BtB2-4]
Length = 257
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)
Query: 310 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
A+ ++ +R KG EG KW AS L F G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169
Query: 367 -ELGIPPQVSSATALFAITFSSSMSV 391
L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194
>gi|384222369|ref|YP_005613535.1| hypothetical protein BJ6T_87050 [Bradyrhizobium japonicum USDA 6]
gi|354961268|dbj|BAL13947.1| hypothetical protein BJ6T_87050 [Bradyrhizobium japonicum USDA 6]
Length = 305
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 19 AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALSYW 73
>gi|347523588|ref|YP_004781158.1| hypothetical protein Pyrfu_1038 [Pyrolobus fumarii 1A]
gi|343460470|gb|AEM38906.1| protein of unknown function DUF81 [Pyrolobus fumarii 1A]
Length = 241
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 42/81 (51%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G+ AG+VG + GLGGG I+ PL + G+ P + + +LFA+ +S ++ +
Sbjct: 5 VGLLAGIVGSIAGLGGGAIMVPLLILAGVEPHHAVSASLFAVLGTSLGGIIHLHKEGLLD 64
Query: 402 VPYALYFFALSIIAAFVGQHV 422
A+ S+ A +G +V
Sbjct: 65 TRLAVTLATASVTGAVIGAYV 85
>gi|18313308|ref|NP_559975.1| hypothetical protein PAE2388 [Pyrobaculum aerophilum str. IM2]
gi|18160832|gb|AAL64157.1| hypothetical protein PAE2388 [Pyrobaculum aerophilum str. IM2]
Length = 243
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%)
Query: 80 VGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMV 126
+G ++ F G S+ GVGGG IFVP L L+ G DAK +AA+S ++
Sbjct: 123 LGYLLVFIGGFVSSLFGVGGGTIFVPALILLAGLDAKLAAAMSMGII 169
>gi|443325967|ref|ZP_21054637.1| putative permease [Xenococcus sp. PCC 7305]
gi|442794404|gb|ELS03821.1| putative permease [Xenococcus sp. PCC 7305]
Length = 260
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 48/93 (51%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
L+F G +G++ G LG+GGG +L PL + LG P + AT+ AIT ++ ++
Sbjct: 4 DNLLFIAFGGFISGILAGFLGIGGGTVLVPLLVALGYSPIQAVATSALAITITALSGTLQ 63
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ + LY +++ A VG ++ + L
Sbjct: 64 NWRMGYIKPKSLLYLGVPALLFAQVGVYLSEGL 96
>gi|423556322|ref|ZP_17532625.1| hypothetical protein II3_01527 [Bacillus cereus MC67]
gi|401195511|gb|EJR02467.1| hypothetical protein II3_01527 [Bacillus cereus MC67]
Length = 257
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 12/86 (13%)
Query: 310 EAVALYKGRRKIASKGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL 366
A+ ++ +R KG EG KW AS L F G V G++G GG F+L P+ L
Sbjct: 117 AAIMMFIPKRS-NRKGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIML 169
Query: 367 -ELGIPPQVSSATALFAITFSSSMSV 391
L +P + + AT++ AITF SS+ +
Sbjct: 170 VVLKLPIRTTIATSI-AITFISSIGI 194
>gi|254422238|ref|ZP_05035956.1| conserved domain protein, putative [Synechococcus sp. PCC 7335]
gi|196189727|gb|EDX84691.1| conserved domain protein, putative [Synechococcus sp. PCC 7335]
Length = 266
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 10/88 (11%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
S +V G+ +G++ G LG+GGG +L P+ L+LG ++AT+ AI +S+ +
Sbjct: 4 SSVVLLSIAGLFSGILAGFLGIGGGTLLVPVLLQLGFESHAATATSSLAILVTSTTGSAQ 63
Query: 394 YY----------LLKRFPVPYALYFFAL 411
+ LL P A +F AL
Sbjct: 64 NWRMGYLDPKQILLLGIPAAIAGFFAAL 91
>gi|119026567|ref|YP_910412.1| hypothetical protein BAD_1549 [Bifidobacterium adolescentis ATCC
15703]
gi|118766151|dbj|BAF40330.1| widely conserved hypothetical transmembrane protein with dub81
[Bifidobacterium adolescentis ATCC 15703]
Length = 292
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 5/143 (3%)
Query: 305 GVSAYEAVALYKGRRKIASK-GDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP 363
G +A A A+ +K+ S+ G+ A ++ G+ G++ G G+GGG ++ P
Sbjct: 5 GETADRADAVRTESQKMESRAGETALDGSARSIIVLIVVGMLVGVLSGTFGIGGGTVIVP 64
Query: 364 LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVL 423
+ LG+ + ++AT+ AI +S VV Y A F + +G +L
Sbjct: 65 ALVWLGLSQRNAAATSTLAIVPTSISGVVSYATGGHVDWLAAALLFCGMFVGGQIGSWLL 124
Query: 424 KKLIKILGRASIIIFTLSFIIFV 446
+L +++ R IF ++F++FV
Sbjct: 125 SRLPELVLR---WIF-VAFLVFV 143
>gi|152974977|ref|YP_001374494.1| hypothetical protein Bcer98_1177 [Bacillus cytotoxicus NVH 391-98]
gi|152023729|gb|ABS21499.1| protein of unknown function DUF81 [Bacillus cytotoxicus NVH 391-98]
Length = 266
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G+ AG +G L+GLGGG I+ PL + L + PQ++ T++ + F+ S + Y KR
Sbjct: 10 GLIAGTIGSLVGLGGGIIVVPLLIGLHSLSPQIAVGTSIVTVVFTGLSSTLAYVKHKRVD 69
Query: 402 VPYALYFF 409
L F
Sbjct: 70 YKSGLILF 77
>gi|334341160|ref|YP_004546140.1| hypothetical protein [Desulfotomaculum ruminis DSM 2154]
gi|334092514|gb|AEG60854.1| protein of unknown function DUF81 [Desulfotomaculum ruminis DSM
2154]
Length = 426
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 2/98 (2%)
Query: 334 SQLVFY--CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
+ +FY A G+ AG++ G +G GGGF++ P + LG+ ++ T F I + M
Sbjct: 70 TPFMFYGSMAVGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMGT 129
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKI 429
V + L +P A+ F S I G + + L +
Sbjct: 130 VIHKKLGNVNIPLAIAFLCGSGIGVTAGGTLNRALFNM 167
>gi|298674473|ref|YP_003726223.1| hypothetical protein Metev_0515 [Methanohalobium evestigatum
Z-7303]
gi|298287461|gb|ADI73427.1| protein of unknown function DUF81 [Methanohalobium evestigatum
Z-7303]
Length = 252
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 29/46 (63%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSS 387
G G+V GLLGL GG + PL + LG + +SAT+LF I F+S
Sbjct: 140 SGFMIGIVAGLLGLSGGVFIVPLLVYLGYNIKTASATSLFIIVFTS 185
>gi|83313505|ref|YP_423769.1| permease [Magnetospirillum magneticum AMB-1]
gi|82948346|dbj|BAE53210.1| Predicted permease [Magnetospirillum magneticum AMB-1]
Length = 317
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 5/64 (7%)
Query: 330 KWRASQL----VFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAIT 384
++R SQL + A G+ G++ L+G+GGGFI+ P + LG+P V T+LF I
Sbjct: 175 RFRRSQLYISAIVPAAIGLLVGILAALMGVGGGFIMVPAMIYLLGMPTAVVVGTSLFQII 234
Query: 385 FSSS 388
F ++
Sbjct: 235 FVTA 238
>gi|229817013|ref|ZP_04447295.1| hypothetical protein BIFANG_02268 [Bifidobacterium angulatum DSM
20098 = JCM 7096]
gi|229785758|gb|EEP21872.1| hypothetical protein BIFANG_02268 [Bifidobacterium angulatum DSM
20098 = JCM 7096]
Length = 280
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 63/136 (46%), Gaps = 15/136 (11%)
Query: 309 YEAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL 368
YE AL+ R+ L A G+ AG+ G+ G+GGG ++ P + +
Sbjct: 3 YEQAALHGSARR--------------TLCILLAVGLIAGLFSGMFGIGGGTVIVPALVWI 48
Query: 369 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
G+ + ++AT++ AI +S V+ Y AL ++ + +G +L +L +
Sbjct: 49 GLTQRQAAATSICAIIITSISGVISYAHGGDVDWIAALLLAVGMMVGSQIGSWLLSRLSE 108
Query: 429 ILGRASIIIFTLSFII 444
+ R ++F L F+I
Sbjct: 109 VFLRWFYVVF-LCFVI 123
>gi|414153131|ref|ZP_11409458.1| conserved membrane hypothetical protein [Desulfotomaculum
hydrothermale Lam5 = DSM 18033]
gi|411455513|emb|CCO07360.1| conserved membrane hypothetical protein [Desulfotomaculum
hydrothermale Lam5 = DSM 18033]
Length = 428
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 7/138 (5%)
Query: 337 VFY--CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+FY A G+ AG++ G +G GGGF++ P + LG+ ++ T F I + M V +
Sbjct: 73 MFYGSMAVGVCAGLITGCIGAGGGFVITPALMSLGVKGILAVGTDQFHIFAKAIMGTVIH 132
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 454
L VP A+ F S + G + + L S + +L +++ LG +
Sbjct: 133 KKLGNVNVPLAIAFLVGSGLGVTAGGTLNRALFNANPVLSDFVISLVYVVM-----LGFL 187
Query: 455 GLAKMIKRIEHKEYMGFD 472
G + I+ + G D
Sbjct: 188 GFYSLYDFIKTRHQKGGD 205
>gi|119485715|ref|ZP_01619990.1| hypothetical protein L8106_25070 [Lyngbya sp. PCC 8106]
gi|119457040|gb|EAW38167.1| hypothetical protein L8106_25070 [Lyngbya sp. PCC 8106]
Length = 297
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 33/54 (61%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSS 387
S+L+ GI AG++ G LG+GGG +L P+ L LG P + AT+ +I +S
Sbjct: 4 SELLTLAVSGIFAGILAGFLGIGGGTVLVPMMLGLGYEPINAVATSTLSIMITS 57
>gi|371777478|ref|ZP_09483800.1| hypothetical protein AnHS1_08668 [Anaerophaga sp. HS1]
Length = 265
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 56/109 (51%)
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSA 377
RRK + K DE + S+++ G+ AG+V GL G G + L L +P ++ +
Sbjct: 128 RRKRSGKTDEMKLFSRSKVIQSSVYGLLAGVVTGLFGTSGTAPVLAGLLALELPLKIVAG 187
Query: 378 TALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
T+L +T ++ ++ + L+ + L+ A +I+ AF+G + ++
Sbjct: 188 TSLLVVTTNTFSALGAHLLIGEIDMTLILFLTAGTILGAFMGPRRMARV 236
>gi|423677376|ref|ZP_17652315.1| hypothetical protein IKS_04919 [Bacillus cereus VDM062]
gi|401306991|gb|EJS12457.1| hypothetical protein IKS_04919 [Bacillus cereus VDM062]
Length = 255
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 11/72 (15%)
Query: 324 KGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATA 379
KG EG KW AS L F G V G++G GG F+L P+ L L +P + + AT+
Sbjct: 130 KGTEGIMYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIATS 183
Query: 380 LFAITFSSSMSV 391
+ AITF SS+ +
Sbjct: 184 I-AITFISSIGI 194
>gi|423666571|ref|ZP_17641600.1| hypothetical protein IKO_00268 [Bacillus cereus VDM034]
gi|401305297|gb|EJS10838.1| hypothetical protein IKO_00268 [Bacillus cereus VDM034]
Length = 255
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 11/72 (15%)
Query: 324 KGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATA 379
KG EG KW AS L F G V G++G GG F+L P+ L L +P + + AT+
Sbjct: 130 KGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIATS 183
Query: 380 LFAITFSSSMSV 391
+ AITF SS+ +
Sbjct: 184 I-AITFISSIGI 194
>gi|423601769|ref|ZP_17577769.1| hypothetical protein III_04571 [Bacillus cereus VD078]
gi|401228892|gb|EJR35412.1| hypothetical protein III_04571 [Bacillus cereus VD078]
Length = 257
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 41/72 (56%), Gaps = 11/72 (15%)
Query: 324 KGDEG---TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATA 379
KG EG KW AS L F G V G++G GG F+L P+ L L +P + + AT+
Sbjct: 130 KGTEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIATS 183
Query: 380 LFAITFSSSMSV 391
+ AITF SS+ +
Sbjct: 184 I-AITFISSIGI 194
>gi|218887074|ref|YP_002436395.1| hypothetical protein DvMF_1984 [Desulfovibrio vulgaris str.
'Miyazaki F']
gi|218758028|gb|ACL08927.1| protein of unknown function DUF81 [Desulfovibrio vulgaris str.
'Miyazaki F']
Length = 415
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 41/86 (47%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
A G+ AG++ G +G GGGFI+ P + G+ ++ T LF I + M + L
Sbjct: 79 AVGLAAGLITGCIGAGGGFIITPALMAAGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNV 138
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKL 426
V A+ F S F+G + K L
Sbjct: 139 SVKLAIAFLVGSGAGTFIGGAINKGL 164
>gi|374572587|ref|ZP_09645683.1| putative permease [Bradyrhizobium sp. WSM471]
gi|374420908|gb|EHR00441.1| putative permease [Bradyrhizobium sp. WSM471]
Length = 305
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 19 AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73
>gi|314933135|ref|ZP_07840500.1| putative membrane protein [Staphylococcus caprae C87]
gi|313653285|gb|EFS17042.1| putative membrane protein [Staphylococcus caprae C87]
Length = 275
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 2/102 (1%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG + S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYHYSVPPLFAFVATLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
FSS MS V + + Y++ SII A +G V + L
Sbjct: 208 FSSVMSSVGHIVQGHVAWGYSVALIISSIIGAQIGVKVNRSL 249
>gi|110632388|ref|YP_672596.1| hypothetical protein Meso_0026 [Chelativorans sp. BNC1]
gi|110283372|gb|ABG61431.1| protein of unknown function DUF81 [Chelativorans sp. BNC1]
Length = 305
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 66/138 (47%), Gaps = 14/138 (10%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMSV 391
++ A G G + G+ G+GGGF++ PL + +PP ++ AT + A +FS ++S
Sbjct: 13 NMLVLLAMGGAVGFLSGMFGVGGGFLITPLLIFYNVPPAIAVATGANQVIASSFSGTLS- 71
Query: 392 VEYYLLKR--FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
LKR V L II + G +V L ++ ++ T+S + V
Sbjct: 72 ----HLKRGTLDVKLGLVLLVGGIIGSAAGIYVFAYLRQL----GLLDLTVSLLYVVLLG 123
Query: 450 SLGGVGLAKMIKRIEHKE 467
S+GG+ L + ++ I+
Sbjct: 124 SVGGLMLVESVRAIQRSR 141
>gi|407850242|gb|EKG04697.1| hypothetical protein TCSYLVIO_004241 [Trypanosoma cruzi]
Length = 368
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 55/113 (48%), Gaps = 2/113 (1%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT--LDIPVIDYDLALLFQ 159
I VPM L++G + +S+ + G ++ V+ +R+R P P+I+Y L
Sbjct: 105 ILVPMFCLLMGLPMDFAVGLSQSTICGQSILNVLIAIRKRFPCAGCSRPLINYQYLTLLV 164
Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
PL V+G IG N + D VLL +L + R+ K + ++K+ +R
Sbjct: 165 PLGVIGTLIGGVLNRLCPDLFRLVLLFLLLTTVLYRSVRKMIAQYRKDQSERR 217
>gi|384085444|ref|ZP_09996619.1| hypothetical protein AthiA1_08064 [Acidithiobacillus thiooxidans
ATCC 19377]
Length = 304
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 42/87 (48%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
+ G+ G + GL G+GGG ++ PL + +G+PP V+ T I S++ ++ +
Sbjct: 18 VSFGLAVGFLSGLTGVGGGILITPLLIFIGVPPLVAVGTGAAQIVGGSAIGSYAHWRMGN 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKL 426
+ A A S +G H+ + L
Sbjct: 78 VDMRMAFVLLAGSWTGGLIGVHIARIL 104
>gi|383935848|ref|ZP_09989281.1| UPF0721 transmembrane protein [Rheinheimera nanhaiensis E407-8]
gi|383703167|dbj|GAB59372.1| UPF0721 transmembrane protein [Rheinheimera nanhaiensis E407-8]
Length = 260
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 41/93 (44%), Gaps = 1/93 (1%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
S L+ C +AG + + G GGG + P L G+PP V T A TF S + +
Sbjct: 10 SVLLILCLVAFSAGFIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLT 68
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
YY K F Y + + A +G V+ L
Sbjct: 69 YYRKKLFNPLYWRRSLGYTALGAVLGTLVVDHL 101
>gi|83949994|ref|ZP_00958727.1| membrane protein [Roseovarius nubinhibens ISM]
gi|83837893|gb|EAP77189.1| membrane protein [Roseovarius nubinhibens ISM]
Length = 305
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 50/96 (52%), Gaps = 2/96 (2%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ ++ K +
Sbjct: 29 GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAHFRRKTVDLRMGTVLLV 88
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+ A VG VL +K +G+ +++ L +++F+
Sbjct: 89 GGLCGAAVGV-VLFNYLKSMGQVDLLV-KLCYVVFL 122
>gi|146278323|ref|YP_001168482.1| hypothetical protein Rsph17025_2287 [Rhodobacter sphaeroides ATCC
17025]
gi|145556564|gb|ABP71177.1| protein of unknown function DUF81 [Rhodobacter sphaeroides ATCC
17025]
Length = 306
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 49/100 (49%), Gaps = 2/100 (2%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYAL 406
G + G+ G+GGGF++ PL +GIPP V+ AT + SS V+ K +
Sbjct: 25 GFMSGMFGVGGGFLITPLLFFIGIPPAVAVATGANQVVASSVSGVLAQIKRKAVDFRMGM 84
Query: 407 YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
A + + G V +L LG+ + + LS+++F+
Sbjct: 85 VLLAGGVAGSAFGVAVFARL-NTLGQVDLFV-QLSYVLFL 122
>gi|354583034|ref|ZP_09001934.1| protein of unknown function DUF81 [Paenibacillus lactis 154]
gi|353198451|gb|EHB63921.1| protein of unknown function DUF81 [Paenibacillus lactis 154]
Length = 263
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 3/83 (3%)
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSS 376
R I + G E A L F + G + L G+GGG ++ P + L L PP +++
Sbjct: 132 RNFIDASGAE--HRYAFNLWFGSGVALFVGFISSLFGIGGGSVMVPTMILFLAFPPHIAT 189
Query: 377 ATALFAITFSSSMSVVEYYLLKR 399
AT++F+I SS + + + +L
Sbjct: 190 ATSMFSILLSSLVGTISHVVLDH 212
>gi|365896889|ref|ZP_09434939.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3843]
gi|365422354|emb|CCE07481.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3843]
Length = 308
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 62/126 (49%), Gaps = 8/126 (6%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+ +
Sbjct: 18 LAMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPGVAVASVSSHIAASSFSGALSYWRRRA 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV----G 455
A I +G + L++ LG+ ++I LS++ ++ S+GGV G
Sbjct: 78 IDPSLASVLLIGGAIGTALGVYTF-TLLRALGQLDLMI-ALSYVALLT--SVGGVMFWEG 133
Query: 456 LAKMIK 461
L M++
Sbjct: 134 LRAMLR 139
>gi|398812927|ref|ZP_10571633.1| putative permease [Brevibacillus sp. BC25]
gi|398039917|gb|EJL33039.1| putative permease [Brevibacillus sp. BC25]
Length = 274
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 8/102 (7%)
Query: 337 VFYCACGITAGMVGGLLGLGGGFILGPLFLELG---IP----PQVSSATALFAITFSSSM 389
+ + G+ AG++G + GLGGG P L IP PQV++AT+L I ++
Sbjct: 7 ILFVIIGVVAGVIGSIAGLGGGIFFVPALLYFANWYIPGSMNPQVAAATSLIVIAVTALS 66
Query: 390 SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILG 431
S + Y + AL FF S A VG + L L+ + G
Sbjct: 67 SSLSYLKQNKVDKQSALLFFIGSAPGAIVGVY-LNTLLAVEG 107
>gi|386399941|ref|ZP_10084719.1| putative permease [Bradyrhizobium sp. WSM1253]
gi|385740567|gb|EIG60763.1| putative permease [Bradyrhizobium sp. WSM1253]
Length = 305
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 19 AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73
>gi|398824727|ref|ZP_10583048.1| putative permease [Bradyrhizobium sp. YR681]
gi|398224594|gb|EJN10895.1| putative permease [Bradyrhizobium sp. YR681]
Length = 305
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 19 AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73
>gi|325959155|ref|YP_004290621.1| hypothetical protein Metbo_1410 [Methanobacterium sp. AL-21]
gi|325330587|gb|ADZ09649.1| protein of unknown function DUF81 [Methanobacterium sp. AL-21]
Length = 263
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 63/134 (47%), Gaps = 8/134 (5%)
Query: 319 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSAT 378
+++ + E T + + G+ +G++ G+ G+ G + LG+P + T
Sbjct: 129 KELNGENQEKTTLKGPKRAVASLFGVASGLLAGVFGISGTPPIIAGLYSLGLPALMVVGT 188
Query: 379 ALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI-----KILG-- 431
++F + F+S + +YLL R + + + + AF+G +LKK+ KI G
Sbjct: 189 SVFVLIFNSLAGIGGFYLLGRLNLTLIILLGGSAAVGAFIGPLLLKKINPNTFEKIYGPV 248
Query: 432 -RASIIIFTLSFII 444
++IIF L II
Sbjct: 249 IMGTMIIFALGLII 262
>gi|189345900|ref|YP_001942429.1| hypothetical protein Clim_0357 [Chlorobium limicola DSM 245]
gi|189340047|gb|ACD89450.1| conserved hypothetical protein [Chlorobium limicola DSM 245]
Length = 121
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSS-MSVV 392
Q VF GI AG++ G+ G+GGG I+ P L L G+ ++AT+L A+ + V+
Sbjct: 2 QTVFLLIIGIAAGLLSGMFGVGGGIIIVPALVLLFGMSQHSANATSLVALLLPVGILGVM 61
Query: 393 EYY 395
EYY
Sbjct: 62 EYY 64
>gi|251810296|ref|ZP_04824769.1| hypothetical membrane protein [Staphylococcus epidermidis
BCM-HMP0060]
gi|251806178|gb|EES58835.1| hypothetical membrane protein [Staphylococcus epidermidis
BCM-HMP0060]
Length = 275
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 2/77 (2%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG +R S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEYYLLKRFP 401
FSS MS + + P
Sbjct: 208 FSSVMSSIGHIFQGHVP 224
>gi|209547817|ref|YP_002279734.1| hypothetical protein Rleg2_0208 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|424915621|ref|ZP_18338985.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM597]
gi|209533573|gb|ACI53508.1| protein of unknown function DUF81 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|392851797|gb|EJB04318.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM597]
Length = 307
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + A VG + L++ +G+ +II +L ++IF+ ++GG+ L +
Sbjct: 78 LDVKLGTVLLVGGLTGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
I +
Sbjct: 134 INAMRR 139
>gi|120404979|ref|YP_954808.1| hypothetical protein Mvan_4025 [Mycobacterium vanbaalenii PYR-1]
gi|119957797|gb|ABM14802.1| protein of unknown function DUF81 [Mycobacterium vanbaalenii PYR-1]
Length = 291
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 310 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 368
A+A+ +GR+ I + G+ G + +++ A G+ G+V GL+G GGGF++ P L L
Sbjct: 109 TAIAMLRGRKTIET-GETGHRLPVPKIL---AEGLIVGLVTGLVGAGGGFLVVPALALLG 164
Query: 369 GIPPQVSSATALFAITFSSSMSVVEY 394
G+P ++ T+L I S + Y
Sbjct: 165 GLPMPIAVGTSLIVIAMKSFAGLAGY 190
>gi|383769051|ref|YP_005448114.1| hypothetical protein S23_07820 [Bradyrhizobium sp. S23321]
gi|381357172|dbj|BAL74002.1| hypothetical protein S23_07820 [Bradyrhizobium sp. S23321]
Length = 305
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 19 AMGAAVGFVSGMFGIGGGFLMTPLLIFIGITPAVAVASVASHIAASSFSGALAYW 73
>gi|340622187|ref|YP_004740639.1| hypothetical protein Ccan_14160 [Capnocytophaga canimorsus Cc5]
gi|339902453|gb|AEK23532.1| UPF0721 transmembrane protein [Capnocytophaga canimorsus Cc5]
Length = 266
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 19/95 (20%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELG-IPPQVSSATALFAITFSS----------SMSV 391
G+T G++ GL+G GGGF++ P + L +P +++ T+LF I S S ++
Sbjct: 153 GLTVGVLTGLVGAGGGFLIIPALVLLARMPMKLAVGTSLFIIAIKSLLGFLGDLQTSYTL 212
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ + LL LYF LS+I F+G + KK+
Sbjct: 213 INWRLL--------LYFTLLSVIGIFIGMLLAKKI 239
>gi|347732908|ref|ZP_08865979.1| hypothetical protein DA2_2276 [Desulfovibrio sp. A2]
gi|347518384|gb|EGY25558.1| hypothetical protein DA2_2276 [Desulfovibrio sp. A2]
Length = 415
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 41/86 (47%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
A G+ AG++ G +G GGGFI+ P + G+ ++ T LF I + M + L
Sbjct: 79 AVGLAAGLITGCIGAGGGFIITPALMAAGVKGILAVGTDLFHIFAKAIMGTTVHKKLGNV 138
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKL 426
V A+ F S F+G + K L
Sbjct: 139 SVKLAIAFLVGSGAGTFIGGAINKGL 164
>gi|124028506|ref|YP_001013826.1| hypothetical protein Hbut_1668 [Hyperthermus butylicus DSM 5456]
gi|123979200|gb|ABM81481.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
Length = 255
Score = 39.3 bits (90), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 45/85 (52%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G+ AG VG LLG+GGG I+ P + LG+P +V++ +L AI +S + +
Sbjct: 11 VGVVAGFVGTLLGIGGGAIMVPALVLLGVPVKVAAPASLVAILGTSLGGIRRLFRRGLVD 70
Query: 402 VPYALYFFALSIIAAFVGQHVLKKL 426
V A++ S + A VG + +L
Sbjct: 71 VMLAVFLETASGLGALVGVIAVGRL 95
>gi|145514580|ref|XP_001443195.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410573|emb|CAK75798.1| unnamed protein product [Paramecium tetraurelia]
Length = 472
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALF 381
A G AG V G LG+G GF++ P L G+ P+ +SAT+ F
Sbjct: 338 AGGFLAGFVSGFLGMGAGFVMVPTLLFSGLIPRCASATSAF 378
>gi|15616016|ref|NP_244321.1| hypothetical protein BH3454 [Bacillus halodurans C-125]
gi|10176077|dbj|BAB07173.1| BH3454 [Bacillus halodurans C-125]
Length = 273
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 46/93 (49%), Gaps = 13/93 (13%)
Query: 338 FYCAC-GITAGMVGGLLGLGGGFILGPLFLELG--------IPPQVSSATALFAITFSSS 388
F C G+ AG +G L+GLGGG I+ P L LG + PQV+ T+L + F+
Sbjct: 4 FILVCVGLLAGTIGSLMGLGGGIIVVPALLWLGGTALLSEAVTPQVAVGTSLLIMIFTGL 63
Query: 389 MSVVEYYLLKRFPVPY--ALYFFALSIIAAFVG 419
S Y +K+ V Y L FF S A VG
Sbjct: 64 SSTFAY--MKKGTVDYKSGLIFFIGSGPGALVG 94
>gi|395777819|ref|ZP_10458333.1| hypothetical protein MCU_00034 [Bartonella elizabethae Re6043vi]
gi|423715745|ref|ZP_17689966.1| hypothetical protein MEE_01163 [Bartonella elizabethae F9251]
gi|395418604|gb|EJF84926.1| hypothetical protein MCU_00034 [Bartonella elizabethae Re6043vi]
gi|395429259|gb|EJF95328.1| hypothetical protein MEE_01163 [Bartonella elizabethae F9251]
Length = 327
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 330 KWRASQL----VFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAIT 384
++RAS + + G+ G++ ++G+GGGF + P L L +P V T+LF IT
Sbjct: 186 RFRASMIYVSIIPVLGIGLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQIT 245
Query: 385 FSSSMSVV 392
F SS + V
Sbjct: 246 FVSSFTTV 253
>gi|393795138|ref|ZP_10378502.1| hypothetical protein CNitlB_01991 [Candidatus Nitrosoarchaeum
limnia BG20]
Length = 258
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 34/58 (58%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G AG++G ++GLGGG ++ P+ G PP ++++ +LFA ++ S Y KR
Sbjct: 11 LGFAAGVLGSMIGLGGGIVVVPVLTFFGFPPTLAASNSLFAAFSNAVGSTFSYSRQKR 68
>gi|329765383|ref|ZP_08256963.1| hypothetical protein Nlim_0726 [Candidatus Nitrosoarchaeum limnia
SFB1]
gi|329138289|gb|EGG42545.1| hypothetical protein Nlim_0726 [Candidatus Nitrosoarchaeum limnia
SFB1]
Length = 258
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 37/59 (62%), Gaps = 2/59 (3%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSM-SVVEYYLLKR 399
G AG++G ++GLGGG ++ P+ G PP ++++ +LFA FS+++ S Y KR
Sbjct: 11 LGFAAGVLGSMIGLGGGIVVVPVLTFFGFPPTLAASNSLFA-AFSNAIGSTFSYSRQKR 68
>gi|258514257|ref|YP_003190479.1| hypothetical protein Dtox_0967 [Desulfotomaculum acetoxidans DSM
771]
gi|257777962|gb|ACV61856.1| protein of unknown function DUF81 [Desulfotomaculum acetoxidans DSM
771]
Length = 303
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 55/102 (53%), Gaps = 3/102 (2%)
Query: 327 EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITF 385
E + +S +V + A G+ G++ L+G+GGGFI+ P+ + L G+P + T++F I F
Sbjct: 165 EVSGIESSAIVLF-ALGLLVGVLAALMGVGGGFIMLPVMIYLIGMPTHNAVGTSIFVIIF 223
Query: 386 SS-SMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
++ +++V L + A+ S I A G + KKL
Sbjct: 224 TAINVTVANSALNHTVDLVLAMVLLVGSSIGAQFGAKLGKKL 265
>gi|18976740|ref|NP_578097.1| hypothetical protein PF0368 [Pyrococcus furiosus DSM 3638]
gi|397650869|ref|YP_006491450.1| hypothetical protein PFC_00925 [Pyrococcus furiosus COM1]
gi|18892327|gb|AAL80492.1| hypothetical protein PF0368 [Pyrococcus furiosus DSM 3638]
gi|393188460|gb|AFN03158.1| hypothetical protein PFC_00925 [Pyrococcus furiosus COM1]
Length = 251
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 43/84 (51%)
Query: 336 LVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
++F GI G++ + GLGGGF++ P LG+ + T+ +I F+S S + Y
Sbjct: 1 MIFLFFVGIVVGILAAMFGLGGGFLIVPTLNILGVEIHSAIGTSSASIVFTSLSSSIAYS 60
Query: 396 LLKRFPVPYALYFFALSIIAAFVG 419
+R + + +II A++G
Sbjct: 61 RQRRIHYRAGILLASTAIIGAYIG 84
>gi|423525299|ref|ZP_17501772.1| hypothetical protein IGC_04682 [Bacillus cereus HuA4-10]
gi|401167981|gb|EJQ75250.1| hypothetical protein IGC_04682 [Bacillus cereus HuA4-10]
Length = 257
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 10/113 (8%)
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQV 374
+ RK+ ++G KW AS L F G V G++G GG F+L P+ L L +P +
Sbjct: 126 RSNRKV-TEGITYNKWLASLLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRT 178
Query: 375 SSATALFAITFSSSMSVVEYYLLK-RFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ AT++ AITF SS+ + ++ + V AL S+ AA +G V K+L
Sbjct: 179 TIATSI-AITFISSIGITTGKVITGQIVVIPALIVAIASLFAAPLGVKVGKRL 230
>gi|395788323|ref|ZP_10467887.1| hypothetical protein ME7_01222 [Bartonella birtlesii LL-WM9]
gi|395408240|gb|EJF74851.1| hypothetical protein ME7_01222 [Bartonella birtlesii LL-WM9]
Length = 306
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 340 CACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 392
A G+ G++ ++G+GGGF + P L L +P V T+LF ITF SS + +
Sbjct: 179 LAIGLIVGLLSSIMGIGGGFFMIPALIYLLHVPTSVVIGTSLFQITFVSSFTTI 232
>gi|297570449|ref|YP_003691793.1| protein of unknown function DUF81 [Desulfurivibrio alkaliphilus
AHT2]
gi|296926364|gb|ADH87174.1| protein of unknown function DUF81 [Desulfurivibrio alkaliphilus
AHT2]
Length = 435
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 57/117 (48%), Gaps = 8/117 (6%)
Query: 337 VFYCACGIT--AGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+F+ + G+ AG++ G +G GGGFI+ P + G+ ++ T LF I + M V +
Sbjct: 80 IFFASIGVGLGAGLITGCIGAGGGFIIAPALMSAGVKGIMAVGTDLFHIFAKAIMGSVLH 139
Query: 395 YLLKRFPVPYALYFFALSIIAAFVG---QHVLKKLIKILGRA---SIIIFTLSFIIF 445
L V A+ F +I A VG L +L +L A ++ +F L F+ F
Sbjct: 140 RKLGNISVALAVTFLIGAIGGATVGGLINRTLYELNPVLSDAFITTVYVFMLGFLGF 196
>gi|298710141|emb|CBJ31853.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 105
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 43/81 (53%)
Query: 373 QVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGR 432
V+SAT+ I ++S + + + YA+ F + + A VGQ V+ L+K R
Sbjct: 1 MVASATSAVMILYTSFTATTSFMVFGLLEEDYAIALFVVGLAATAVGQVVVNHLVKKYKR 60
Query: 433 ASIIIFTLSFIIFVSALSLGG 453
S I+ ++ ++ +SA+ +GG
Sbjct: 61 TSFIVLSIGAVVALSAVMMGG 81
>gi|224370873|ref|YP_002605037.1| hypothetical protein HRM2_38150 [Desulfobacterium autotrophicum
HRM2]
gi|223693590|gb|ACN16873.1| conserved hypothetical protein [Desulfobacterium autotrophicum
HRM2]
Length = 424
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 26/108 (24%), Positives = 54/108 (50%), Gaps = 10/108 (9%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMS 390
R+ +F G+ G+ G++G+GGGF+ P+F+ +G+ + T +F I F++ +
Sbjct: 247 RSISWLFLVMSGVLVGLAAGIMGVGGGFLTFPIFVYVMGVSSMTTVGTDIFQIIFTAGYA 306
Query: 391 VVEYYLLKRFPVPYALYFFALSI---IAAFVGQHVLKKLIKILGRASI 435
+ Y + Y F+ L+I + + VG + + K++ +I
Sbjct: 307 SITQYAI------YGFIFYTLAIGMLLGSLVGIQIGAMVTKVVSDITI 348
Score = 38.5 bits (88), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 52/108 (48%), Gaps = 6/108 (5%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G+ AG++ G +G GGGFI+ P + GI ++ T LF I + M V + L
Sbjct: 87 VGLLAGLITGCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVIHRKLGNVS 146
Query: 402 VPYALYFFALSIIAAFVG---QHVLKKLIKILGRA---SIIIFTLSFI 443
VP A F +++ A G L +L +L A ++ +F L F+
Sbjct: 147 VPLAFVFVIGALMGATGGGLLNRYLYELNPVLSDAFISTVYVFMLGFL 194
>gi|94498747|ref|ZP_01305296.1| hypothetical protein SKA58_14342 [Sphingomonas sp. SKA58]
gi|94421797|gb|EAT06849.1| hypothetical protein SKA58_14342 [Sphingomonas sp. SKA58]
Length = 304
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 74/168 (44%), Gaps = 23/168 (13%)
Query: 278 LQIAKNYTTTCSVLYWVLNLLQIPVAGGVSAYEAV----ALYKGRRKIASKGDEG----- 328
LQ T +LY +L + GG+ A E++ AL GR+ ASK
Sbjct: 104 LQSLGQIDTVIGILY----VLMLGGIGGLMAKESIQALIALKTGRKPQASKRRHHPLVAA 159
Query: 329 --TKWRASQLVFYCA------CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATA 379
+WR + Y + G+ G++ LLG+GGGFIL P L LG+ Q T+
Sbjct: 160 LPMRWRFYRSGLYISPLAPFLLGMATGILTMLLGVGGGFILVPAMLYLLGMTTQSVVGTS 219
Query: 380 LFAITFSSSMSVVEYYL-LKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
LF I F + + + + + K + A+ S+ A +G + KL
Sbjct: 220 LFQILFVTMATTMMHAMTTKAVDLVLAMLLLIGSVTGAQIGTRLSMKL 267
>gi|423609344|ref|ZP_17585205.1| hypothetical protein IIM_00059 [Bacillus cereus VD107]
gi|401251962|gb|EJR58230.1| hypothetical protein IIM_00059 [Bacillus cereus VD107]
Length = 257
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 8/70 (11%)
Query: 323 SKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALF 381
S+G KW AS L F G V G++G GG F+L P+ L L +P + + AT++
Sbjct: 132 SEGTMYNKWLASMLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIATSI- 184
Query: 382 AITFSSSMSV 391
AITF SS+ +
Sbjct: 185 AITFISSVGI 194
>gi|418626680|ref|ZP_13189277.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
gi|374831454|gb|EHR95194.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU126]
Length = 275
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG +R S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEY 394
FSS MS + +
Sbjct: 208 FSSVMSSIGH 217
>gi|86141473|ref|ZP_01060019.1| membrane protein, putative [Leeuwenhoekiella blandensis MED217]
gi|295133642|ref|YP_003584318.1| hypothetical protein ZPR_1792 [Zunongwangia profunda SM-A87]
gi|85832032|gb|EAQ50487.1| membrane protein, putative [Leeuwenhoekiella blandensis MED217]
gi|294981657|gb|ADF52122.1| membrane protein, putative [Zunongwangia profunda SM-A87]
Length = 267
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQV 374
G R+ S+ E + S + G+ G+V G++G GGGF++ P L L +P +
Sbjct: 133 NGNRQFNSEVIE----KKSNYLLLSVQGLFIGVVTGMVGAGGGFLIIPALVLLAKLPMKK 188
Query: 375 SSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
+ AT+LF I +S + + P+ L F +SI+ F+G
Sbjct: 189 AVATSLFIIAINSLIGFLGDVQNLDIDWPFLLIFTGISIVGIFIG 233
>gi|298208928|ref|YP_003717107.1| hypothetical protein CA2559_11828 [Croceibacter atlanticus
HTCC2559]
gi|83848855|gb|EAP86724.1| membrane protein, putative [Croceibacter atlanticus HTCC2559]
Length = 266
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 53/110 (48%), Gaps = 5/110 (4%)
Query: 311 AVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELG 369
+V++ + +RK + E T A GI G + G++G GGGF++ P L L
Sbjct: 127 SVSMIRNKRKETDEEAEIT----YNYPLIIAEGIIVGAITGIVGAGGGFLIIPALVLLAK 182
Query: 370 IPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
+P + + AT+LF I S + + P+ L F LSII F+G
Sbjct: 183 LPMKKAVATSLFIIAIKSLIGFLGDVQNLDIDWPFLLIFTGLSIIGIFIG 232
>gi|27467533|ref|NP_764170.1| hypothetical protein SE0615 [Staphylococcus epidermidis ATCC 12228]
gi|293368299|ref|ZP_06614927.1| conserved hypothetical protein [Staphylococcus epidermidis
M23864:W2(grey)]
gi|417646541|ref|ZP_12296396.1| putative membrane protein [Staphylococcus epidermidis VCU144]
gi|417658571|ref|ZP_12308195.1| putative membrane protein [Staphylococcus epidermidis VCU045]
gi|417910002|ref|ZP_12553734.1| putative membrane protein [Staphylococcus epidermidis VCU037]
gi|417911750|ref|ZP_12555450.1| putative membrane protein [Staphylococcus epidermidis VCU105]
gi|418604210|ref|ZP_13167570.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
gi|418608065|ref|ZP_13171279.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
gi|418609993|ref|ZP_13173126.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
gi|418617244|ref|ZP_13180148.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
gi|418620984|ref|ZP_13183774.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
gi|418624342|ref|ZP_13187018.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
gi|419770080|ref|ZP_14296166.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
IS-250]
gi|419770736|ref|ZP_14296803.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
IS-K]
gi|420166657|ref|ZP_14673339.1| hypothetical protein HMPREF9994_10576 [Staphylococcus epidermidis
NIHLM088]
gi|420172159|ref|ZP_14678674.1| hypothetical protein HMPREF9991_01925 [Staphylococcus epidermidis
NIHLM067]
gi|420187849|ref|ZP_14693865.1| hypothetical protein HMPREF9985_06315 [Staphylococcus epidermidis
NIHLM039]
gi|420194410|ref|ZP_14700224.1| hypothetical protein HMPREF9982_02468 [Staphylococcus epidermidis
NIHLM021]
gi|420198269|ref|ZP_14703984.1| hypothetical protein HMPREF9981_09791 [Staphylococcus epidermidis
NIHLM020]
gi|420203118|ref|ZP_14708702.1| hypothetical protein HMPREF9979_10294 [Staphylococcus epidermidis
NIHLM018]
gi|420206739|ref|ZP_14712244.1| hypothetical protein HMPREF9977_04533 [Staphylococcus epidermidis
NIHLM008]
gi|420215409|ref|ZP_14720677.1| hypothetical protein HMPREF9974_10136 [Staphylococcus epidermidis
NIH05005]
gi|420216635|ref|ZP_14721838.1| hypothetical protein HMPREF9973_04057 [Staphylococcus epidermidis
NIH05001]
gi|420221146|ref|ZP_14726099.1| hypothetical protein HMPREF9972_12757 [Staphylococcus epidermidis
NIH04008]
gi|420222261|ref|ZP_14727183.1| hypothetical protein HMPREF1390_04404 [Staphylococcus epidermidis
NIH08001]
gi|420225187|ref|ZP_14730022.1| hypothetical protein HMPREF1389_06703 [Staphylococcus epidermidis
NIH06004]
gi|420226740|ref|ZP_14731518.1| hypothetical protein HMPREF1388_01657 [Staphylococcus epidermidis
NIH05003]
gi|420229060|ref|ZP_14733770.1| hypothetical protein HMPREF1387_01666 [Staphylococcus epidermidis
NIH04003]
gi|420231420|ref|ZP_14736071.1| hypothetical protein HMPREF1386_01642 [Staphylococcus epidermidis
NIH051668]
gi|420234086|ref|ZP_14738658.1| hypothetical protein HMPREF1385_01791 [Staphylococcus epidermidis
NIH051475]
gi|27315077|gb|AAO04212.1|AE016746_2 conserved hypothetical protein [Staphylococcus epidermidis ATCC
12228]
gi|291317546|gb|EFE57964.1| conserved hypothetical protein [Staphylococcus epidermidis
M23864:W2(grey)]
gi|329726803|gb|EGG63263.1| putative membrane protein [Staphylococcus epidermidis VCU144]
gi|329737583|gb|EGG73829.1| putative membrane protein [Staphylococcus epidermidis VCU045]
gi|341651884|gb|EGS75675.1| putative membrane protein [Staphylococcus epidermidis VCU037]
gi|341652261|gb|EGS76050.1| putative membrane protein [Staphylococcus epidermidis VCU105]
gi|374402560|gb|EHQ73581.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU057]
gi|374405432|gb|EHQ76366.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU041]
gi|374405979|gb|EHQ76886.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU065]
gi|374819091|gb|EHR83222.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU120]
gi|374827860|gb|EHR91717.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU125]
gi|374830843|gb|EHR94603.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU123]
gi|383357543|gb|EID35012.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
IS-250]
gi|383363082|gb|EID40427.1| sulfite exporter TauE/SafE [Staphylococcus aureus subsp. aureus
IS-K]
gi|394233386|gb|EJD78993.1| hypothetical protein HMPREF9994_10576 [Staphylococcus epidermidis
NIHLM088]
gi|394243630|gb|EJD88992.1| hypothetical protein HMPREF9991_01925 [Staphylococcus epidermidis
NIHLM067]
gi|394255694|gb|EJE00641.1| hypothetical protein HMPREF9985_06315 [Staphylococcus epidermidis
NIHLM039]
gi|394264655|gb|EJE09330.1| hypothetical protein HMPREF9981_09791 [Staphylococcus epidermidis
NIHLM020]
gi|394264800|gb|EJE09471.1| hypothetical protein HMPREF9982_02468 [Staphylococcus epidermidis
NIHLM021]
gi|394268449|gb|EJE13006.1| hypothetical protein HMPREF9979_10294 [Staphylococcus epidermidis
NIHLM018]
gi|394276842|gb|EJE21175.1| hypothetical protein HMPREF9977_04533 [Staphylococcus epidermidis
NIHLM008]
gi|394282277|gb|EJE26480.1| hypothetical protein HMPREF9974_10136 [Staphylococcus epidermidis
NIH05005]
gi|394285105|gb|EJE29191.1| hypothetical protein HMPREF9972_12757 [Staphylococcus epidermidis
NIH04008]
gi|394289497|gb|EJE33378.1| hypothetical protein HMPREF1390_04404 [Staphylococcus epidermidis
NIH08001]
gi|394291602|gb|EJE35399.1| hypothetical protein HMPREF9973_04057 [Staphylococcus epidermidis
NIH05001]
gi|394293931|gb|EJE37628.1| hypothetical protein HMPREF1389_06703 [Staphylococcus epidermidis
NIH06004]
gi|394298189|gb|EJE41769.1| hypothetical protein HMPREF1388_01657 [Staphylococcus epidermidis
NIH05003]
gi|394299585|gb|EJE43124.1| hypothetical protein HMPREF1387_01666 [Staphylococcus epidermidis
NIH04003]
gi|394302741|gb|EJE46177.1| hypothetical protein HMPREF1386_01642 [Staphylococcus epidermidis
NIH051668]
gi|394304583|gb|EJE47981.1| hypothetical protein HMPREF1385_01791 [Staphylococcus epidermidis
NIH051475]
Length = 275
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG +R S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEY 394
FSS MS + +
Sbjct: 208 FSSVMSSIGH 217
>gi|418326457|ref|ZP_12937641.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
gi|420162808|ref|ZP_14669563.1| hypothetical protein HMPREF9995_02315 [Staphylococcus epidermidis
NIHLM095]
gi|420167251|ref|ZP_14673912.1| hypothetical protein HMPREF9993_01067 [Staphylococcus epidermidis
NIHLM087]
gi|420185285|ref|ZP_14691380.1| hypothetical protein HMPREF9986_06026 [Staphylococcus epidermidis
NIHLM040]
gi|365225378|gb|EHM66622.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU071]
gi|394235805|gb|EJD81355.1| hypothetical protein HMPREF9995_02315 [Staphylococcus epidermidis
NIHLM095]
gi|394238880|gb|EJD84337.1| hypothetical protein HMPREF9993_01067 [Staphylococcus epidermidis
NIHLM087]
gi|394255019|gb|EJD99979.1| hypothetical protein HMPREF9986_06026 [Staphylococcus epidermidis
NIHLM040]
Length = 275
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG +R S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEY 394
FSS MS + +
Sbjct: 208 FSSVMSSIGH 217
>gi|90407709|ref|ZP_01215888.1| hypothetical membrane spanning protein [Psychromonas sp. CNPT3]
gi|90311178|gb|EAS39284.1| hypothetical membrane spanning protein [Psychromonas sp. CNPT3]
Length = 274
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
S L + G+ AG + ++G GGG + P L LG+ P ++ T A +F S+ +
Sbjct: 24 SNLFLLASVGLLAGFIDAIVG-GGGLLTVPTLLSLGLSPHLTLGTNKLAASFGSATAAFT 82
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
Y+ F + + F S+I A G V+ +
Sbjct: 83 YFKKNLFDPVFWKHSFYSSLIGALAGTLVINAI 115
>gi|57866438|ref|YP_188101.1| hypothetical protein SERP0509 [Staphylococcus epidermidis RP62A]
gi|282875582|ref|ZP_06284453.1| putative membrane protein [Staphylococcus epidermidis SK135]
gi|417656395|ref|ZP_12306082.1| putative membrane protein [Staphylococcus epidermidis VCU028]
gi|417913194|ref|ZP_12556865.1| putative membrane protein [Staphylococcus epidermidis VCU109]
gi|418611361|ref|ZP_13174451.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
gi|418663817|ref|ZP_13225324.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
gi|420171339|ref|ZP_14677883.1| hypothetical protein HMPREF9992_10312 [Staphylococcus epidermidis
NIHLM070]
gi|420182552|ref|ZP_14688688.1| hypothetical protein HMPREF9987_04152 [Staphylococcus epidermidis
NIHLM049]
gi|420210378|ref|ZP_14715806.1| hypothetical protein HMPREF9976_10849 [Staphylococcus epidermidis
NIHLM003]
gi|420212014|ref|ZP_14717369.1| hypothetical protein HMPREF9975_06216 [Staphylococcus epidermidis
NIHLM001]
gi|421607567|ref|ZP_16048806.1| hypothetical protein B440_04399 [Staphylococcus epidermidis
AU12-03]
gi|57637096|gb|AAW53884.1| conserved hypothetical protein [Staphylococcus epidermidis RP62A]
gi|281295609|gb|EFA88132.1| putative membrane protein [Staphylococcus epidermidis SK135]
gi|329736846|gb|EGG73111.1| putative membrane protein [Staphylococcus epidermidis VCU028]
gi|341656570|gb|EGS80284.1| putative membrane protein [Staphylococcus epidermidis VCU109]
gi|374411142|gb|EHQ81861.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU081]
gi|374823447|gb|EHR87443.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU117]
gi|394238212|gb|EJD83690.1| hypothetical protein HMPREF9992_10312 [Staphylococcus epidermidis
NIHLM070]
gi|394250097|gb|EJD95299.1| hypothetical protein HMPREF9987_04152 [Staphylococcus epidermidis
NIHLM049]
gi|394276430|gb|EJE20770.1| hypothetical protein HMPREF9976_10849 [Staphylococcus epidermidis
NIHLM003]
gi|394280281|gb|EJE24565.1| hypothetical protein HMPREF9975_06216 [Staphylococcus epidermidis
NIHLM001]
gi|406656772|gb|EKC83172.1| hypothetical protein B440_04399 [Staphylococcus epidermidis
AU12-03]
Length = 275
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG +R S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGLLTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEY 394
FSS MS + +
Sbjct: 208 FSSVMSSIGH 217
>gi|242242213|ref|ZP_04796658.1| hypothetical membrane protein [Staphylococcus epidermidis W23144]
gi|416123952|ref|ZP_11595138.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
gi|418327602|ref|ZP_12938754.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
gi|418613538|ref|ZP_13176544.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
gi|418632622|ref|ZP_13195052.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
gi|418633694|ref|ZP_13196100.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
gi|420175320|ref|ZP_14681760.1| hypothetical protein HMPREF9990_06284 [Staphylococcus epidermidis
NIHLM061]
gi|420189924|ref|ZP_14695872.1| hypothetical protein HMPREF9984_04365 [Staphylococcus epidermidis
NIHLM037]
gi|420193318|ref|ZP_14699172.1| hypothetical protein HMPREF9983_09537 [Staphylococcus epidermidis
NIHLM023]
gi|420203830|ref|ZP_14709391.1| hypothetical protein HMPREF9978_01579 [Staphylococcus epidermidis
NIHLM015]
gi|242234332|gb|EES36644.1| hypothetical membrane protein [Staphylococcus epidermidis W23144]
gi|319401800|gb|EFV90008.1| conserved hypothetical protein [Staphylococcus epidermidis FRI909]
gi|365232855|gb|EHM73831.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis 14.1.R1.SE]
gi|374823394|gb|EHR87391.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU118]
gi|374832192|gb|EHR95912.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU128]
gi|374838641|gb|EHS02179.1| sulfite exporter TauE/SafE [Staphylococcus epidermidis VCU129]
gi|394243782|gb|EJD89143.1| hypothetical protein HMPREF9990_06284 [Staphylococcus epidermidis
NIHLM061]
gi|394259217|gb|EJE04075.1| hypothetical protein HMPREF9984_04365 [Staphylococcus epidermidis
NIHLM037]
gi|394260170|gb|EJE04990.1| hypothetical protein HMPREF9983_09537 [Staphylococcus epidermidis
NIHLM023]
gi|394274412|gb|EJE18833.1| hypothetical protein HMPREF9978_01579 [Staphylococcus epidermidis
NIHLM015]
Length = 275
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 2/72 (2%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG +R S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEYYL 396
FSS MS + + L
Sbjct: 208 FSSVMSSIGHIL 219
>gi|148252390|ref|YP_001236975.1| permease [Bradyrhizobium sp. BTAi1]
gi|146404563|gb|ABQ33069.1| putative permease [Bradyrhizobium sp. BTAi1]
Length = 308
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 57/115 (49%), Gaps = 4/115 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+ +
Sbjct: 18 LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV 454
AL + +G L++ LG+ ++I LS++ ++ S+GGV
Sbjct: 78 IDPSLALVLMIGGTLGTALGVSTF-TLLRSLGQLDLMI-ALSYVALLT--SVGGV 128
>gi|410457114|ref|ZP_11310951.1| hypothetical protein BABA_24600 [Bacillus bataviensis LMG 21833]
gi|409926453|gb|EKN63624.1| hypothetical protein BABA_24600 [Bacillus bataviensis LMG 21833]
Length = 273
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 52/113 (46%), Gaps = 11/113 (9%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELG-------IPPQVSSATALFAITFSSSMSVVEY 394
G+ A +G L+G+GGG I+ P L L + PQV T+LF + F+ S + Y
Sbjct: 9 VGLVASSIGSLIGMGGGIIVVPALLYLATLSTFSHLTPQVVVGTSLFTMIFTGLSSTLSY 68
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVS 447
LK L F S + +G V +K L S I+ FI+FVS
Sbjct: 69 MKLKTVDYKSGLIFLIGSGPGSILGAWVTEK----LNLHSFNIYFGIFILFVS 117
>gi|417644318|ref|ZP_12294319.1| putative membrane protein [Staphylococcus warneri VCU121]
gi|330684914|gb|EGG96596.1| putative membrane protein [Staphylococcus epidermidis VCU121]
Length = 275
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 4/99 (4%)
Query: 327 EGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAI 383
EG + S LV + A + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYHYSVPPLVAFVAT-LLIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206
Query: 384 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
FSS MS + + + Y++ S I A +G V
Sbjct: 207 FFSSVMSSIGHIIQGHVAWDYSIVLIISSYIGAKIGVKV 245
>gi|241202991|ref|YP_002974087.1| hypothetical protein Rleg_0237 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|424879924|ref|ZP_18303556.1| putative permease [Rhizobium leguminosarum bv. trifolii WU95]
gi|240856881|gb|ACS54548.1| protein of unknown function DUF81 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|392516287|gb|EIW41019.1| putative permease [Rhizobium leguminosarum bv. trifolii WU95]
Length = 307
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGS 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + A VG + L++ +G+ +II +L +++F+ ++GG+ L +
Sbjct: 78 LDVKLGTVLLVGGLTGATVGIWIF-SLLRAIGQLDLII-SLMYVVFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
I +
Sbjct: 134 INAMRR 139
>gi|158521364|ref|YP_001529234.1| hypothetical protein Dole_1353 [Desulfococcus oleovorans Hxd3]
gi|158510190|gb|ABW67157.1| protein of unknown function DUF81 [Desulfococcus oleovorans Hxd3]
Length = 427
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 73/161 (45%), Gaps = 9/161 (5%)
Query: 295 LNLLQIPVAGGVSAYE---AVALYKGRRKIASKGDEGTK-WRASQLVFYCACGITAGMVG 350
L LL I + G V A E + Y G + S TK + AS L+ CA G++
Sbjct: 41 LMLLPILIGGYVFADEIANGIPQYLGGKHAYSPSFFTTKIFIASILIGLCA-----GLIT 95
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFA 410
G +G GGGFI+ P + GI ++ T LF I + M V + L V A+ F
Sbjct: 96 GCIGAGGGFIIAPALMSAGIKGILAVGTDLFHIFAKAIMGSVLHRKLGNISVVLAVTFLI 155
Query: 411 LSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
SI A +G + + L +I S T +++ + AL L
Sbjct: 156 GSIGGATLGGWINRVLYEINPIISDAFITTIYVVMLGALGL 196
>gi|347734809|ref|ZP_08867798.1| permease [Azospirillum amazonense Y2]
gi|346922127|gb|EGY02616.1| permease [Azospirillum amazonense Y2]
Length = 307
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 58/109 (53%), Gaps = 20/109 (18%)
Query: 304 GGVSA---YEAV-ALYKGRRKIASKGD-------EGTKWRA----SQLVFYCACGITAGM 348
GG+S E+V AL++GRRK A++G G ++ S+L + G
Sbjct: 123 GGISGLMLMESVRALFRGRRKGATRGKLHHHTWLHGLPFKMRFPRSRLYISVLLPVGIGF 182
Query: 349 VGGLL----GLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVV 392
+GG+L G+GGGF+L P + LG+P + + T+L I +++++ V
Sbjct: 183 IGGVLVAIMGIGGGFLLVPAMIYILGMPTTLVAGTSLLQIILTTAVAAV 231
>gi|152996364|ref|YP_001341199.1| hypothetical protein Mmwyl1_2342 [Marinomonas sp. MWYL1]
gi|150837288|gb|ABR71264.1| protein of unknown function DUF81 [Marinomonas sp. MWYL1]
Length = 253
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 4/117 (3%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
++Y + G G+ G+ G+GGG ++ PL L G PP ++ T L + S VV +
Sbjct: 2 DFLWYISAGAAVGLAVGITGVGGGSLMTPLLLMFGFPPHIAIGTDLMYAGIAKSTGVVMH 61
Query: 395 YLLKRFPVPYALYF--FALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
KR V + + F A SI A+ + L + K I+ TL ++ ++A+
Sbjct: 62 A--KRGNVNWKIVFAMAAGSIPASLITVWALSQFEKPDHYQEILTSTLGMMLIITAM 116
>gi|407451351|ref|YP_006723075.1| hypothetical protein B739_0573 [Riemerella anatipestifer RA-CH-1]
gi|403312336|gb|AFR35177.1| hypothetical protein B739_0573 [Riemerella anatipestifer RA-CH-1]
Length = 264
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 56/110 (50%), Gaps = 2/110 (1%)
Query: 318 RRKIASKGDEGT-KWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVS 375
+ + SK + T + S+ + G+ G++ GL+G GGGF++ P L + LG+ + +
Sbjct: 126 KMIVGSKTEPSTLEISPSRNTLLISQGLLVGIITGLVGAGGGFLIIPALVMILGLKMKEA 185
Query: 376 SATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 425
T+LF IT SS++ V + L F LSI+ +G + K+
Sbjct: 186 IGTSLFIITLSSTIGFVSSLDKVAIDWHFLLSFTGLSIVGVLLGLALSKR 235
>gi|120599392|ref|YP_963966.1| hypothetical protein Sputw3181_2588 [Shewanella sp. W3-18-1]
gi|146292612|ref|YP_001183036.1| hypothetical protein Sputcn32_1512 [Shewanella putrefaciens CN-32]
gi|386313288|ref|YP_006009453.1| hypothetical protein [Shewanella putrefaciens 200]
gi|120559485|gb|ABM25412.1| protein of unknown function DUF81 [Shewanella sp. W3-18-1]
gi|145564302|gb|ABP75237.1| protein of unknown function DUF81 [Shewanella putrefaciens CN-32]
gi|319425913|gb|ADV53987.1| protein of unknown function DUF81 [Shewanella putrefaciens 200]
Length = 257
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG + ++G GGG + P L LGI P + T A +F SSM+ YY F
Sbjct: 17 GLIAGFIDAIVG-GGGLLSIPALLTLGIAPHTALGTNKLAASFGSSMAAWTYYRQHLFTP 75
Query: 403 PYALYFFALSIIAAFVG 419
F+ ++ IA F+G
Sbjct: 76 S----FWYMAFIATFIG 88
>gi|420199664|ref|ZP_14705335.1| hypothetical protein HMPREF9980_05334 [Staphylococcus epidermidis
NIHLM031]
gi|394271414|gb|EJE15907.1| hypothetical protein HMPREF9980_05334 [Staphylococcus epidermidis
NIHLM031]
Length = 275
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 2/72 (2%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG +R S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEYYL 396
FSS MS + + L
Sbjct: 208 FSSVMSSIGHIL 219
>gi|420178833|ref|ZP_14685159.1| hypothetical protein HMPREF9989_11288 [Staphylococcus epidermidis
NIHLM057]
gi|420181508|ref|ZP_14687706.1| hypothetical protein HMPREF9988_11921 [Staphylococcus epidermidis
NIHLM053]
gi|394245744|gb|EJD91018.1| hypothetical protein HMPREF9989_11288 [Staphylococcus epidermidis
NIHLM057]
gi|394245936|gb|EJD91205.1| hypothetical protein HMPREF9988_11921 [Staphylococcus epidermidis
NIHLM053]
Length = 275
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 2/72 (2%)
Query: 327 EGTKWRAS-QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAIT 384
EG +R S +F + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYRYSVPPLFAFITTLFIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIF 207
Query: 385 FSSSMSVVEYYL 396
FSS MS + + L
Sbjct: 208 FSSVMSSIGHIL 219
>gi|389807654|ref|ZP_10204191.1| Sulfate transporter, CysZ-type [Rhodanobacter thiooxydans LCS2]
gi|388443779|gb|EIL99914.1| Sulfate transporter, CysZ-type [Rhodanobacter thiooxydans LCS2]
Length = 250
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 60/116 (51%), Gaps = 6/116 (5%)
Query: 334 SQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVE 393
S+ + + A G A +V G LG+ G + + L LG+PP V+SA+ +A TF+ S +
Sbjct: 5 SEFLTFAAVGALAQLVDGALGMAYGVVSAAMLLGLGLPPAVASASVHYAETFTCGASGLS 64
Query: 394 YYLLKRFPVPYALYFFALSI---IAAFVGQHVLKKLIKILGRASIIIFTLSFIIFV 446
+ L V L F+ L+I I A +G +V+ + R ++ I+ L +F+
Sbjct: 65 H--LAAGNVRRQL-FWTLAIPGAIGAVIGAYVVIHVPADGMRLALTIYMLGMGVFL 117
>gi|219851806|ref|YP_002466238.1| hypothetical protein Mpal_1173 [Methanosphaerula palustris E1-9c]
gi|219546065|gb|ACL16515.1| protein of unknown function DUF81 [Methanosphaerula palustris
E1-9c]
Length = 267
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 3/92 (3%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
LV + A G G+V G+ G GG + P + LG+P ++ AT+L I +S +
Sbjct: 152 HLVIWGAMG---GLVSGITGTSGGALFVPALVVLGVPIHLAVATSLLTIIPTSITGAATH 208
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
L +PY + + A +++ AF G + ++
Sbjct: 209 IALGNISLPYVVVYGAGAVLGAFAGASLAPRI 240
>gi|445060181|ref|YP_007385585.1| sulfite exporter TauE/SafE-like protein [Staphylococcus warneri
SG1]
gi|443426238|gb|AGC91141.1| sulfite exporter TauE/SafE-like protein [Staphylococcus warneri
SG1]
Length = 275
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 4/99 (4%)
Query: 327 EGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAI 383
EG + S LV + A + G++ GL G+GGG ++ PL L PP V+ T++ I
Sbjct: 148 EGKTYHYSVPPLVAFVAT-LLIGILTGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMI 206
Query: 384 TFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
FSS MS + + + Y++ S I A +G V
Sbjct: 207 FFSSVMSSIGHIIQGHVAWDYSIVLIISSYIGAKIGVKV 245
>gi|389578235|ref|ZP_10168262.1| putative permease [Desulfobacter postgatei 2ac9]
gi|389399870|gb|EIM62092.1| putative permease [Desulfobacter postgatei 2ac9]
Length = 427
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 51/109 (46%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G+ AG++ G +G GGGFI+ P + +G+ ++ T LF I + M V + L
Sbjct: 86 GVGLCAGLITGCIGAGGGFIIAPALMSVGVKGILAVGTDLFHIFAKAIMGSVLHKKLGNI 145
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
V AL F SI A +G + + L S T ++I + AL
Sbjct: 146 SVSIALIFLIGSIGGATLGGWINRVLYDFNPVISDAFITTIYVIMLGAL 194
>gi|163761304|ref|ZP_02168379.1| hypothetical protein HPDFL43_21639 [Hoeflea phototrophica DFL-43]
gi|162281461|gb|EDQ31757.1| hypothetical protein HPDFL43_21639 [Hoeflea phototrophica DFL-43]
Length = 308
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 58/117 (49%), Gaps = 4/117 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPAVAVATGANQVVASSISGALAHFRRGS 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 456
+ L + A VG + L++ LG+ +II +L +++F+ S+GG+ L
Sbjct: 78 LDIKLGLVLLIGGLGGATVGIQIF-TLLRQLGQLDLII-SLLYVVFLG--SVGGLML 130
>gi|327398711|ref|YP_004339580.1| hypothetical protein Hipma_0550 [Hippea maritima DSM 10411]
gi|327181340|gb|AEA33521.1| protein of unknown function DUF81 [Hippea maritima DSM 10411]
Length = 259
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 30/47 (63%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 388
G+ G V G+LG+GGGF+ P+ + +G ++AT + +TFSS+
Sbjct: 144 SGLLIGFVAGMLGIGGGFLAAPILMSMGYGAIRAAATTAYIVTFSSA 190
>gi|333987091|ref|YP_004519698.1| hypothetical protein MSWAN_0870 [Methanobacterium sp. SWAN-1]
gi|333825235|gb|AEG17897.1| protein of unknown function DUF81 [Methanobacterium sp. SWAN-1]
Length = 265
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 50/107 (46%), Gaps = 4/107 (3%)
Query: 324 KGDEG----TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA 379
KGDE T + V G+ +G++ G+ G+ G + LG+P + T
Sbjct: 132 KGDESKKEFTTLTGPKRVIASFFGVASGLMAGVFGISGTPPVTAGLYSLGLPAIMVVGTT 191
Query: 380 LFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+F + F+S + Y+LL R + + + + AF+G +LKK+
Sbjct: 192 VFVLIFNSIAGIGGYFLLGRLDLTLIILLGGGAAVGAFIGPKLLKKI 238
>gi|424888735|ref|ZP_18312338.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM2012]
gi|393174284|gb|EJC74328.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM2012]
Length = 307
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + A VG + L++ +G+ +II +L ++IF+ ++GG+ L +
Sbjct: 78 LDVKLGTVLLIGGLTGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
I +
Sbjct: 134 INAMRR 139
>gi|336311114|ref|ZP_08566081.1| hypothetical protein SOHN41_01564 [Shewanella sp. HN-41]
gi|335865330|gb|EGM70356.1| hypothetical protein SOHN41_01564 [Shewanella sp. HN-41]
Length = 257
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG + ++G GGG + P L LGI P T A +F SSM+ YY F
Sbjct: 17 GLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAAWTYYRQHLFKP 75
Query: 403 PYALYFFALSIIAAFVGQHVLKKLIKIL 430
F+ ++ IA FVG + L+ +L
Sbjct: 76 A----FWYIAFIATFVGAALGSLLVYLL 99
>gi|427430367|ref|ZP_18920221.1| membrane protein, putative [Caenispirillum salinarum AK4]
gi|425878827|gb|EKV27538.1| membrane protein, putative [Caenispirillum salinarum AK4]
Length = 307
Score = 38.9 bits (89), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 31/49 (63%)
Query: 347 GMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
G++ G+ G+GGGF+L PL + LGIPP V+ T + +S V+ ++
Sbjct: 25 GVLSGMFGVGGGFLLTPLLIFLGIPPTVAVGTGANQVVGASVSGVLAHW 73
>gi|418245013|ref|ZP_12871425.1| hypothetical protein KIQ_05893 [Corynebacterium glutamicum ATCC
14067]
gi|354511028|gb|EHE83945.1| hypothetical protein KIQ_05893 [Corynebacterium glutamicum ATCC
14067]
Length = 248
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 8/143 (5%)
Query: 310 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 368
+ A+ +GR++ + + WR G+ G V GL+G GGGF++ P L L
Sbjct: 112 TSTAMLRGRKE-KRGASKSSLWR------VLVDGLVVGAVTGLVGAGGGFLVVPALALLG 164
Query: 369 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
G+ V+ T+L IT S + Y + L A +I+ + VG + ++ +
Sbjct: 165 GLSMPVAVGTSLVVITMKSFAGLAGYLTSVQLDWGLVLMVTAAAIVGSLVGSRLAGRMPE 224
Query: 429 ILGRASIIIFTLSFIIFVSALSL 451
L R +F L +FV +L L
Sbjct: 225 TLLRKGFGVFVLVMGVFVLSLEL 247
>gi|374995932|ref|YP_004971431.1| permease [Desulfosporosinus orientis DSM 765]
gi|357214298|gb|AET68916.1| putative permease [Desulfosporosinus orientis DSM 765]
Length = 257
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 61/125 (48%), Gaps = 21/125 (16%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFA---ITFSSSMSV 391
LV C+ G A MV + G GGG I P L +GIPP ++ T FA +F+SS++
Sbjct: 11 NLVIICSLGFLAAMVDAIAG-GGGLISLPALLMVGIPPHLALGTNKFAASVASFNSSLTF 69
Query: 392 VE----YYLLKRFPVPYALYFFALSIIAAFVGQ-HVLKKLIKILGRASIIIFTLSFIIFV 446
++ L ++ +P+ L + AF+G VLK + L +A L I+FV
Sbjct: 70 ARSGKVHFPLVKWQIPFTL-------LGAFLGAWAVLKVSSEFLNKA-----VLFLILFV 117
Query: 447 SALSL 451
+L
Sbjct: 118 GVYTL 122
>gi|374703196|ref|ZP_09710066.1| hypothetical protein PseS9_07346 [Pseudomonas sp. S9]
Length = 260
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 44/78 (56%), Gaps = 10/78 (12%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPL----FLELGIPPQV----SSATALFAITFS 386
+L+ Y G AG++ GL G+GGG I+ P+ F G P+V + T+L I F+
Sbjct: 2 ELLLYLVLGAAAGVMAGLFGVGGGLIIVPVLVASFAAHGFAPEVLTHLAVGTSLATIIFT 61
Query: 387 SSMSVVEYYLLK--RFPV 402
S SV+E++ L R+P+
Sbjct: 62 SINSVLEHHRLGAVRWPL 79
>gi|421587991|ref|ZP_16033328.1| hypothetical protein RCCGEPOP_04960 [Rhizobium sp. Pop5]
gi|403707384|gb|EJZ22401.1| hypothetical protein RCCGEPOP_04960 [Rhizobium sp. Pop5]
Length = 307
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + A VG + L++ +G+ +II +L ++IF+ ++GG+ L +
Sbjct: 78 LDVKLGTVLLVGGLTGATVGIWIF-SLLRRVGQLDLII-SLMYVIFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
I +
Sbjct: 134 INAMRR 139
>gi|163868912|ref|YP_001610139.1| hypothetical protein Btr_1874 [Bartonella tribocorum CIP 105476]
gi|161018586|emb|CAK02144.1| putative membrane protein [Bartonella tribocorum CIP 105476]
Length = 308
Score = 38.9 bits (89), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 392
G+ G++ ++G+GGGF + P L L +P V T+LF ITF SS + V
Sbjct: 184 GLIVGLLSSIMGIGGGFFMIPALIYLLRVPTNVVIGTSLFQITFVSSFTTV 234
>gi|452851399|ref|YP_007493083.1| conserved membrane protein of unknown function [Desulfovibrio
piezophilus]
gi|451895053|emb|CCH47932.1| conserved membrane protein of unknown function [Desulfovibrio
piezophilus]
Length = 420
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 338 FYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYL 396
F ACG+ G + ++G+GGGF+ P+F+ LG+ + T + I F++ + + Y
Sbjct: 253 FVAACGLIVGFMAAIMGVGGGFLTFPMFVYVLGVSSFTTVGTDILQIIFTAGFAAISQYA 312
Query: 397 LKRF 400
+ F
Sbjct: 313 IYGF 316
>gi|145479593|ref|XP_001425819.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392891|emb|CAK58421.1| unnamed protein product [Paramecium tetraurelia]
Length = 487
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 42/76 (55%), Gaps = 5/76 (6%)
Query: 96 GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHP-TLDIPVIDYDL 154
+GGG + P L I+ F++ +S ++ C++ G + V Q HP T + P++++++
Sbjct: 79 ALGGGVVKRPFLEAILNFNSSTSGDITACLMFGAQLVNQVIIFFQSHPDTPERPLVNFEI 138
Query: 155 ALLFQPLLVLGISIGV 170
L++ LGI I +
Sbjct: 139 GLVY----ALGIPISM 150
>gi|402828737|ref|ZP_10877622.1| sulfite exporter TauE/SafE [Slackia sp. CM382]
gi|402285895|gb|EJU34375.1| sulfite exporter TauE/SafE [Slackia sp. CM382]
Length = 300
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 40/62 (64%), Gaps = 2/62 (3%)
Query: 343 GITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFP 401
G+ G++ G+LG+GGG ++ PLF L G+ P ++AT+LF+I +S++ + +L
Sbjct: 11 GVAIGILSGMLGIGGGTVMVPLFRLAFGLDPLAATATSLFSI-IPTSLAGLSKHLRNGTC 69
Query: 402 VP 403
+P
Sbjct: 70 IP 71
>gi|209881434|ref|XP_002142155.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209557761|gb|EEA07806.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 525
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 41/180 (22%), Positives = 75/180 (41%), Gaps = 35/180 (19%)
Query: 83 IIGFFGTAC-GSVG---GVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNL 138
II C GS+ G GGG I +P+ + + + A+S ++ GG++ +V+ ++
Sbjct: 61 IIALITVGCLGSIAVSAGAGGGVISIPVFLVFMNMPFYQAIALSTSVILGGSLCSVILDI 120
Query: 139 RQRHPTL-----------------------DIPVIDYDLALLFQPLL----VLGISIGVA 171
++ P L +P++D L + LL +LGI I
Sbjct: 121 VKKKPELPEYVNTLNYFKSALDNKLDVNLNKLPIMDLPLVIFLSSLLSAGTLLGILISKV 180
Query: 172 FNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREAAKQLELIGNGYQTEECK 231
N I + ++ LLI +F + ++K + + E IT+ E ++ Q E K
Sbjct: 181 LNSILSFLILEFLLIYVFY----KTWIKFWKIREMEKITQSENNMEITPFQKFSQESEEK 236
>gi|260914000|ref|ZP_05920474.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325]
gi|260632087|gb|EEX50264.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325]
Length = 266
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 58/122 (47%), Gaps = 8/122 (6%)
Query: 96 GVGGGGIFVPML-NLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPTLDIPVIDYDL 154
G+GGG + VP+L L F + AA S +V G A+ + Y +Q+ I++
Sbjct: 24 GIGGGVLMVPILYTLFPEFPLQMIAATSLTIVIGSALINLTYFYKQKIQ------INFKA 77
Query: 155 ALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKREA 214
LL+ +++G+ +G + D I V+ I + +L+ R T K +++ K E
Sbjct: 78 MLLWSLGMIIGVQLGFEASFYLPDIFIIVVFISVLTILAIRTIFFSKST-KADSVEKNEN 136
Query: 215 AK 216
K
Sbjct: 137 LK 138
>gi|157374704|ref|YP_001473304.1| hypothetical protein Ssed_1565 [Shewanella sediminis HAW-EB3]
gi|157317078|gb|ABV36176.1| protein of unknown function DUF81 [Shewanella sediminis HAW-EB3]
Length = 257
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 5/77 (6%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
G+ AG + + G GGG + P L +GIPP ++ T A F SSM+ YY F
Sbjct: 17 GLIAGFIDAVAG-GGGLLSIPALLTVGIPPHLALGTNKLAACFGSSMAAFTYYRKNLF-T 74
Query: 403 PYALYFFALSIIAAFVG 419
P Y + IA F+G
Sbjct: 75 PNLWYH---TFIATFIG 88
>gi|91975280|ref|YP_567939.1| hypothetical protein RPD_0800 [Rhodopseudomonas palustris BisB5]
gi|91681736|gb|ABE38038.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
BisB5]
Length = 307
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 3/59 (5%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 390
++ A G G V G+ G+GGGF++ PL + +GI P V+ A+ + A +FS ++S
Sbjct: 13 NVLLILAMGAAVGFVSGMFGIGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALS 71
>gi|219669184|ref|YP_002459619.1| hypothetical protein Dhaf_3160 [Desulfitobacterium hafniense DCB-2]
gi|219539444|gb|ACL21183.1| protein of unknown function DUF81 [Desulfitobacterium hafniense
DCB-2]
Length = 443
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 37/67 (55%), Gaps = 3/67 (4%)
Query: 319 RKIASKGDEGTKWRASQLVFY--CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSS 376
I K G + ++++ FY A G+ AG+V G +G GGGF++ P + LG+ +S
Sbjct: 57 EAIGGKAAYGPSYFSAKM-FYGSIAIGVVAGLVTGCIGAGGGFLITPALMSLGVKGILSV 115
Query: 377 ATALFAI 383
T +F I
Sbjct: 116 GTDMFHI 122
>gi|423392833|ref|ZP_17370059.1| hypothetical protein ICG_04681 [Bacillus cereus BAG1X1-3]
gi|423421123|ref|ZP_17398212.1| hypothetical protein IE3_04595 [Bacillus cereus BAG3X2-1]
gi|401099674|gb|EJQ07676.1| hypothetical protein IE3_04595 [Bacillus cereus BAG3X2-1]
gi|401632812|gb|EJS50595.1| hypothetical protein ICG_04681 [Bacillus cereus BAG1X1-3]
Length = 252
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 8/74 (10%)
Query: 319 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSA 377
+K S+ KW AS L F G V G++G GG F+L P+ L L +P + + A
Sbjct: 125 KKNNSRAVNYNKWLASMLAFIV------GSVSGIIGAGGAFLLVPIMLVVLKLPIRTTIA 178
Query: 378 TALFAITFSSSMSV 391
T++ AITF SS+ +
Sbjct: 179 TSI-AITFISSVGI 191
>gi|298527660|ref|ZP_07015064.1| protein of unknown function DUF81 [Desulfonatronospira
thiodismutans ASO3-1]
gi|298511312|gb|EFI35214.1| protein of unknown function DUF81 [Desulfonatronospira
thiodismutans ASO3-1]
Length = 254
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 321 IASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATA 379
+A +G W ++++ G+ G + G+LG+GGG + PL + L + P+ ++AT
Sbjct: 120 VAGQGTRAHVWSLARVLQASFLGLVIGFMAGMLGIGGGVFVVPLLIYMLQVNPRTAAATT 179
Query: 380 LFAITFSS 387
F + FSS
Sbjct: 180 AFIVCFSS 187
>gi|375111110|ref|ZP_09757321.1| hypothetical protein AJE_14130 [Alishewanella jeotgali KCTC 22429]
gi|374568652|gb|EHR39824.1| hypothetical protein AJE_14130 [Alishewanella jeotgali KCTC 22429]
Length = 262
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 1/87 (1%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
C AG + + G GGG + P L G+PP V T A TF S + + YY +
Sbjct: 16 CGVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLTYYRKQL 74
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKL 426
F Y + I A +G V+ +L
Sbjct: 75 FNPHYWRNSLVFTAIGALLGTLVVDQL 101
>gi|385232877|ref|YP_005794219.1| permease protein [Ketogulonicigenium vulgare WSH-001]
gi|343461788|gb|AEM40223.1| Permease protein [Ketogulonicigenium vulgare WSH-001]
Length = 309
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G G++ G+ G+GGGF++ PL + GIP V+ ATA I SS + + K
Sbjct: 24 GLGWIVGILSGMFGVGGGFLITPLLIFSGIPSAVAVATATNQIVASSVSGLFAHLRRKNV 83
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
+ ++ A +G + + + +G+ ++I TLS++I + +
Sbjct: 84 DLRMGTVLMIGGLVGAAIGVGIFNWM-RRMGQVDLMI-TLSYVILLGTI 130
>gi|116250358|ref|YP_766196.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841]
gi|424873568|ref|ZP_18297230.1| putative permease [Rhizobium leguminosarum bv. viciae WSM1455]
gi|115255006|emb|CAK06080.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae
3841]
gi|393169269|gb|EJC69316.1| putative permease [Rhizobium leguminosarum bv. viciae WSM1455]
Length = 307
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGS 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + A VG + L++ +G+ +II +L ++IF+ ++GG+ L +
Sbjct: 78 LDVKLGTVLLVGGLAGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
+ +
Sbjct: 134 VNAMRR 139
>gi|126734854|ref|ZP_01750600.1| integral membrane protein, putative [Roseobacter sp. CCS2]
gi|126715409|gb|EBA12274.1| integral membrane protein, putative [Roseobacter sp. CCS2]
Length = 306
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ + LKR
Sbjct: 29 GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 75
>gi|339504564|ref|YP_004691984.1| hypothetical protein RLO149_c030650 [Roseobacter litoralis Och 149]
gi|338758557|gb|AEI95021.1| hypothetical protein DUF1925 [Roseobacter litoralis Och 149]
Length = 312
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ + LKR
Sbjct: 37 GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 83
>gi|86751708|ref|YP_488204.1| hypothetical protein RPB_4610 [Rhodopseudomonas palustris HaA2]
gi|86574736|gb|ABD09293.1| Protein of unknown function DUF81 [Rhodopseudomonas palustris HaA2]
Length = 307
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 3/59 (5%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 390
++ A G G V G+ G+GGGF+L PL + +GI P V+ A+ + A +FS ++S
Sbjct: 13 NVLLILAMGAAVGFVSGMFGVGGGFLLTPLLIFVGIAPAVAVASVTSHMAASSFSGALS 71
>gi|39933890|ref|NP_946166.1| hypothetical protein RPA0813 [Rhodopseudomonas palustris CGA009]
gi|192289309|ref|YP_001989914.1| hypothetical protein Rpal_0881 [Rhodopseudomonas palustris TIE-1]
gi|39647737|emb|CAE26257.1| possible permease [Rhodopseudomonas palustris CGA009]
gi|192283058|gb|ACE99438.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
TIE-1]
Length = 320
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 3/59 (5%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 390
++ A G G V G+ G+GGGF++ PL + +GI P V+ A+ + A +FS ++S
Sbjct: 26 NVLLVLAMGAAVGFVSGMFGVGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALS 84
>gi|332296379|ref|YP_004438302.1| hypothetical protein Thena_1559 [Thermodesulfobium narugense DSM
14796]
gi|332179482|gb|AEE15171.1| protein of unknown function DUF81 [Thermodesulfobium narugense DSM
14796]
Length = 433
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 7/73 (9%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
CG T G V ++G+GGGF+ PLF+ LG+ + T + I F+++ S + Y +
Sbjct: 259 CGFTVGFVAAIMGVGGGFLTFPLFVYGLGVSTFTTVGTDILQIIFTTAYSSIFNYAI--- 315
Query: 401 PVPYALYFFALSI 413
Y F++++I
Sbjct: 316 ---YGYIFYSIAI 325
>gi|310815340|ref|YP_003963304.1| hypothetical protein EIO_0856 [Ketogulonicigenium vulgare Y25]
gi|308754075|gb|ADO42004.1| integral membrane protein, putative [Ketogulonicigenium vulgare
Y25]
Length = 304
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G G++ G+ G+GGGF++ PL + GIP V+ ATA I SS + + K
Sbjct: 19 GLGWIVGILSGMFGVGGGFLITPLLIFSGIPSAVAVATATNQIVASSVSGLFAHLRRKNV 78
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSAL 449
+ ++ A +G + + + +G+ ++I TLS++I + +
Sbjct: 79 DLRMGTVLMIGGLVGAAIGVGIFNWM-RRMGQVDLMI-TLSYVILLGTI 125
>gi|212640166|ref|YP_002316686.1| permease [Anoxybacillus flavithermus WK1]
gi|212561646|gb|ACJ34701.1| Predicted permease [Anoxybacillus flavithermus WK1]
Length = 258
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 58/113 (51%), Gaps = 9/113 (7%)
Query: 316 KGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQV 374
KG I KW A+ L F G++AG+VG G FIL P+ L L IP ++
Sbjct: 128 KGIDNIPLDQVTFNKWLAAALAFIV--GVSAGIVGA----AGAFILVPIMLVVLKIPTRM 181
Query: 375 SSATALFAITFSSSM-SVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKL 426
+ AT+L AITF SS+ + V +F A S+IA+ +G +V KK+
Sbjct: 182 TIATSL-AITFISSIGATVGKITTGQFDFWTAFIMVVASLIASPIGANVGKKV 233
>gi|403525486|ref|YP_006660373.1| hypothetical protein ARUE_c04000 [Arthrobacter sp. Rue61a]
gi|403227913|gb|AFR27335.1| putative integral membrane protein [Arthrobacter sp. Rue61a]
Length = 299
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 6/139 (4%)
Query: 311 AVALYKGRRKIASKGDEGTKWRASQLVF--YCACGITAGMVGGLLGLGGGFILGP-LFLE 367
+VA+ +GR+K K D+G +L G G++ GL+G GGGF++ P L L
Sbjct: 113 SVAMLRGRKK---KNDDGAAPVKHELPIGRVLLDGAVVGLITGLVGAGGGFLVVPALALL 169
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 427
G+P V+ T+L I S + Y + L A +II + +G + ++
Sbjct: 170 GGLPMSVAVGTSLVVIAMKSFAGLAGYLTTVQLDWGLTLGVTAAAIIGSLIGSKLAGRIP 229
Query: 428 KILGRASIIIFTLSFIIFV 446
+ R + F L+ FV
Sbjct: 230 ESALRKAFGWFVLAMGTFV 248
>gi|395785666|ref|ZP_10465394.1| hypothetical protein ME5_00712 [Bartonella tamiae Th239]
gi|423717442|ref|ZP_17691632.1| hypothetical protein MEG_01172 [Bartonella tamiae Th307]
gi|395424124|gb|EJF90311.1| hypothetical protein ME5_00712 [Bartonella tamiae Th239]
gi|395427657|gb|EJF93748.1| hypothetical protein MEG_01172 [Bartonella tamiae Th307]
Length = 308
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 21/123 (17%)
Query: 291 LYWVLNLLQIPVAGGVSAYEAVALYK-------GRR-KIASKGDEG--------TKWRAS 334
L +++LL + + GGV AV ++ GR+ I G ++RAS
Sbjct: 112 LDLIISLLYVILLGGVGGLMAVESWRAIMRTRSGRQVNIRRPGQHNWIHRLPLKLRFRAS 171
Query: 335 QL----VFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSM 389
+ + A G G++ ++G+GGGFI+ P L L +P V T+LF ITF ++
Sbjct: 172 MIYVSIIPVLAIGFIIGLLSSIMGVGGGFIMVPALIYLLRVPTNVVVGTSLFQITFVTAF 231
Query: 390 SVV 392
+ V
Sbjct: 232 TTV 234
>gi|229149118|ref|ZP_04277359.1| hypothetical protein bcere0011_6830 [Bacillus cereus m1550]
gi|228634317|gb|EEK90905.1| hypothetical protein bcere0011_6830 [Bacillus cereus m1550]
Length = 235
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 9/110 (8%)
Query: 319 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSA 377
+K S + KW AS L F G V G++G GG F+L P+ L L +P +++ A
Sbjct: 106 KKNNSGAVKYNKWLASVLAFIV------GGVSGIIGAGGAFLLVPIMLVILKLPLRMTIA 159
Query: 378 TALFAITFSSSMSVVEYYLLK-RFPVPYALYFFALSIIAAFVGQHVLKKL 426
T++ AITF SS+ + ++ + + AL SI AA VG V K++
Sbjct: 160 TSI-AITFISSVGITTGKVITGQVVIIPALIIAVASIFAAPVGARVGKRI 208
>gi|365879405|ref|ZP_09418829.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 375]
gi|365292656|emb|CCD91360.1| putative permease; membrane protein [Bradyrhizobium sp. ORS 375]
Length = 308
Score = 38.5 bits (88), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 8/126 (6%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+ +
Sbjct: 18 LAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYWRRRA 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGV----G 455
A + +G L++ LG+ ++I LS+++ ++ S+GGV G
Sbjct: 78 IDPSLASVLMIGGALGTALGVSTF-TLLRSLGQLDLMI-ALSYVVLLT--SVGGVMFWEG 133
Query: 456 LAKMIK 461
L M++
Sbjct: 134 LRAMMR 139
>gi|325964992|ref|YP_004242898.1| permease [Arthrobacter phenanthrenivorans Sphe3]
gi|323471079|gb|ADX74764.1| putative permease [Arthrobacter phenanthrenivorans Sphe3]
Length = 299
Score = 38.5 bits (88), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 65/139 (46%), Gaps = 6/139 (4%)
Query: 311 AVALYKGRRKIASKGDEGTKWRASQLVF--YCACGITAGMVGGLLGLGGGFILGP-LFLE 367
+VA+ +GR++ K D+G +L G G+V GL+G GGGF++ P L L
Sbjct: 113 SVAMLRGRKR---KNDDGAAPAKHELPLGRVLLDGAVVGLVTGLVGAGGGFLVVPALALL 169
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 427
G+P V+ T+L I S + Y + L A +++ + +G + ++
Sbjct: 170 GGLPMSVAVGTSLVVIAMKSFAGLAGYLTTVQLDWGVTLGVTAAAVVGSLIGAKLAGRIP 229
Query: 428 KILGRASIIIFTLSFIIFV 446
+ + R + F L+ FV
Sbjct: 230 EAVLRKAFGWFVLAMGTFV 248
>gi|110678948|ref|YP_681955.1| hypothetical protein RD1_1642 [Roseobacter denitrificans OCh 114]
gi|109455064|gb|ABG31269.1| integral membrane protein, putative [Roseobacter denitrificans OCh
114]
Length = 312
Score = 38.5 bits (88), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G+ G+GGGF++ PL +GIPP V+ AT I SS V+ + LKR
Sbjct: 37 GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIVASSFSGVLAH--LKR 83
>gi|325959191|ref|YP_004290657.1| hypothetical protein Metbo_1448 [Methanobacterium sp. AL-21]
gi|325330623|gb|ADZ09685.1| protein of unknown function DUF81 [Methanobacterium sp. AL-21]
Length = 273
Score = 38.5 bits (88), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 46/99 (46%)
Query: 327 EGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFS 386
E K + +F G+ G++ G+ GL G + LG+P + T +F + F+
Sbjct: 148 ETLKLTGPKRIFASLFGVAGGVLAGVFGLSGTPPISAGLYSLGLPTLMVVGTTVFVLVFN 207
Query: 387 SSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKK 425
S + Y L RF V + +++ +F+G +L+K
Sbjct: 208 SLAGIGGYLFLGRFDVVLTVLLGGGAVVGSFLGPKLLEK 246
>gi|409435971|ref|ZP_11263175.1| conserved membrane hypothetical protein [Rhizobium mesoamericanum
STM3625]
gi|408752280|emb|CCM74324.1| conserved membrane hypothetical protein [Rhizobium mesoamericanum
STM3625]
Length = 306
Score = 38.5 bits (88), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + A VG + L++ +G+ +II +L ++IF+ ++GG+ L +
Sbjct: 78 LDVKLGTVLLVGGLTGATVGIWIF-SLLRQIGQLDLII-SLLYVIFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
I +
Sbjct: 134 INALRR 139
>gi|424898188|ref|ZP_18321762.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM2297]
gi|393182415|gb|EJC82454.1| putative permease [Rhizobium leguminosarum bv. trifolii WSM2297]
Length = 307
Score = 38.5 bits (88), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + A VG + L++ +G+ +II +L ++IF+ ++GG+ L +
Sbjct: 78 LDVKLGTVLLIGGLTGATVGIWIF-SLLRSIGQLDLII-SLMYVIFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
I +
Sbjct: 134 INAMRR 139
>gi|397169771|ref|ZP_10493201.1| hypothetical protein AEST_09670 [Alishewanella aestuarii B11]
gi|396088666|gb|EJI86246.1| hypothetical protein AEST_09670 [Alishewanella aestuarii B11]
Length = 260
Score = 38.5 bits (88), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 1/87 (1%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
C AG + + G GGG + P L G+PP V T A TF S + + YY +
Sbjct: 14 CGVAFIAGYIDAIAG-GGGLLTVPALLTAGLPPHVVLGTNKLAATFGSFTASLTYYRKQL 72
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKL 426
F Y + I A +G V+ +L
Sbjct: 73 FNPHYWRNSLVFTAIGALLGTLVVDQL 99
>gi|21322817|dbj|BAB97446.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC
13032]
Length = 263
Score = 38.5 bits (88), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 65/144 (45%), Gaps = 10/144 (6%)
Query: 310 EAVALYKGRRKIASKG-DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLE 367
+ A+ +GR++ KG + + WR G+ G V GL+G GGGF++ P L L
Sbjct: 127 TSTAMLRGRKQ--KKGASKSSLWR------VLVDGLVVGAVTGLVGAGGGFLVVPALALL 178
Query: 368 LGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLI 427
G+ V+ T+L IT S + Y + L A +I+ + G + ++
Sbjct: 179 GGLSMPVAVGTSLVVITMKSFAGLAGYLTSVQLDWGLVLMVTAAAIVGSLAGSRLAGRVP 238
Query: 428 KILGRASIIIFTLSFIIFVSALSL 451
+ L R +F L +FV L L
Sbjct: 239 ETLLRKGFGVFVLVMGVFVLGLEL 262
>gi|414153246|ref|ZP_11409573.1| conserved membrane hypothetical protein [Desulfotomaculum
hydrothermale Lam5 = DSM 18033]
gi|411455628|emb|CCO07476.1| conserved membrane hypothetical protein [Desulfotomaculum
hydrothermale Lam5 = DSM 18033]
Length = 339
Score = 38.5 bits (88), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 5/80 (6%)
Query: 344 ITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPV 402
I +G+V G LG+GGGFI P+ + LG+P ++ T LFAI ++S Y L + +
Sbjct: 186 IFSGLVSGFLGVGGGFIRVPILIYALGLPTVMAVGTDLFAILITNSWGAYIYALAGKVEI 245
Query: 403 PYALYFFALSIIAAFVGQHV 422
AL + ++ A VG +
Sbjct: 246 IGAL----VMVVGAAVGAQI 261
>gi|269216459|ref|ZP_06160313.1| hypothetical membrane protein, conserved, DUF81 family [Slackia
exigua ATCC 700122]
gi|269129988|gb|EEZ61070.1| hypothetical membrane protein, conserved, DUF81 family [Slackia
exigua ATCC 700122]
Length = 297
Score = 38.5 bits (88), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 33/46 (71%), Gaps = 1/46 (2%)
Query: 343 GITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSS 387
G+ G++ G+LG+GGG ++ PLF L G+ P ++AT+LF+I +S
Sbjct: 11 GVAIGILSGMLGIGGGTVMVPLFRLAFGLDPLAATATSLFSIIPTS 56
>gi|395793084|ref|ZP_10472493.1| hypothetical protein MEI_01114 [Bartonella vinsonii subsp.
arupensis Pm136co]
gi|423713627|ref|ZP_17687887.1| hypothetical protein ME1_00633 [Bartonella vinsonii subsp.
arupensis OK-94-513]
gi|395422254|gb|EJF88462.1| hypothetical protein ME1_00633 [Bartonella vinsonii subsp.
arupensis OK-94-513]
gi|395431922|gb|EJF97928.1| hypothetical protein MEI_01114 [Bartonella vinsonii subsp.
arupensis Pm136co]
Length = 306
Score = 38.5 bits (88), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 392
G+ G++ ++G+GGGF + P L L +P V T+LF ITF SS + V
Sbjct: 182 GLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232
>gi|301064479|ref|ZP_07204889.1| conserved hypothetical protein [delta proteobacterium NaphS2]
gi|300441417|gb|EFK05772.1| conserved hypothetical protein [delta proteobacterium NaphS2]
Length = 472
Score = 38.5 bits (88), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 56/126 (44%), Gaps = 8/126 (6%)
Query: 75 GWRIVVGTIIGFFGTACGSVGGVGGGGIFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAV 134
GW +VG G F A S GVGGG +FVP L + G A S V G + ++
Sbjct: 343 GWIPIVG---GLFIAALASFLGVGGGFLFVPFLTAVAGLPMFLVAGTSGLAVLVGMIVSI 399
Query: 135 VYNLRQRHPTLDIPVIDYDLALLFQPLLVLGISIGVAFNVIFADWMITVLLIVLFLVLST 194
+ + + +I +L +F +G IG +D +++L IVL ++
Sbjct: 400 FTYMVAKQIPVHWSLIGAELVGIF-----VGAMIGPRTQKYLSDKALSILFIVLAFLVGI 454
Query: 195 RAFLKG 200
R LKG
Sbjct: 455 RYTLKG 460
>gi|115522742|ref|YP_779653.1| hypothetical protein RPE_0716 [Rhodopseudomonas palustris BisA53]
gi|115516689|gb|ABJ04673.1| protein of unknown function DUF81 [Rhodopseudomonas palustris
BisA53]
Length = 308
Score = 38.5 bits (88), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 3/59 (5%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 390
++ A G G V G+ G+GGGF++ PL + +GI P V+ A+ + A +FS ++S
Sbjct: 13 NVLLILAMGAAVGFVSGMFGIGGGFLMTPLLIFIGISPAVAVASVTSHMAASSFSGALS 71
>gi|223478732|ref|YP_002582987.1| hypothetical protein [Thermococcus sp. AM4]
gi|214033958|gb|EEB74784.1| conserved hypothetical protein [Thermococcus sp. AM4]
Length = 243
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
GI +G+V GLLG+ GG + PLF +GIP + + T+ FA+ F++ E+Y L +
Sbjct: 136 VGILSGLVSGLLGISGGVLNVPLFHTFVGIPMRYAVGTSSFALFFTALAGAFEHYRLGQV 195
Query: 401 PVPYAL 406
+ AL
Sbjct: 196 DLHMAL 201
>gi|407783749|ref|ZP_11130944.1| hypothetical protein P24_15941 [Oceanibaculum indicum P24]
gi|407200184|gb|EKE70195.1| hypothetical protein P24_15941 [Oceanibaculum indicum P24]
Length = 306
Score = 38.5 bits (88), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 1/103 (0%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
++ G G + G+ G+GGGF++ PL + +GIPP V+ T I +S V+
Sbjct: 13 NVILILGMGWVVGFLSGIFGVGGGFLMTPLLIFIGIPPAVAVGTQANQIVAASVSGVIAQ 72
Query: 395 YLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIII 437
+ K A + + +G V L + LG+ +II
Sbjct: 73 WRRKNVDFKMGAVLLAGGLFGSTIGVWVFSWL-QTLGQIDLII 114
>gi|199597888|ref|ZP_03211313.1| Predicted permease [Lactobacillus rhamnosus HN001]
gi|258507211|ref|YP_003169962.1| permease [Lactobacillus rhamnosus GG]
gi|385826935|ref|YP_005864707.1| putative transporter protein [Lactobacillus rhamnosus GG]
gi|199591145|gb|EDY99226.1| Predicted permease [Lactobacillus rhamnosus HN001]
gi|257147138|emb|CAR86111.1| Permease [Lactobacillus rhamnosus GG]
gi|259648580|dbj|BAI40742.1| putative transporter protein [Lactobacillus rhamnosus GG]
Length = 280
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 32/134 (23%), Positives = 70/134 (52%), Gaps = 14/134 (10%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVE 393
Q++ G+ G++G +LG+GGG I+ P+ + +G+P Q + ++ ++ +SS + +
Sbjct: 3 QMLLLMGTGVLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGG 453
Y + + A++ + + A +G V+ L+ G+ I+F + +IF SA +
Sbjct: 63 YLKDEMLNLRVAMFLEIATTVGAVIGA-VITGLVN--GKVLYILFG-ALLIF-SAFN--- 114
Query: 454 VGLAKMIKRIEHKE 467
MI+++ K+
Sbjct: 115 -----MIRKLRMKD 123
>gi|395789707|ref|ZP_10469217.1| hypothetical protein ME9_00934 [Bartonella taylorii 8TBB]
gi|395428545|gb|EJF94621.1| hypothetical protein ME9_00934 [Bartonella taylorii 8TBB]
Length = 306
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVV 392
G+ G++ ++G+GGGF + P + L +P +V T+LF ITF SS + V
Sbjct: 181 IGLIVGLLSSIMGIGGGFFMIPALIYLMRVPTRVVIGTSLFQITFVSSFTTV 232
>gi|333908013|ref|YP_004481599.1| hypothetical protein Mar181_1641 [Marinomonas posidonica
IVIA-Po-181]
gi|333478019|gb|AEF54680.1| protein of unknown function DUF81 [Marinomonas posidonica
IVIA-Po-181]
Length = 253
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY 394
+ ++Y G G+ G+ G+GGG ++ PL L G PP ++ T L + S VV +
Sbjct: 2 EFIWYIFAGAVVGLAVGITGVGGGSLMTPLLLLFGFPPHIAVGTDLMYAGIAKSTGVVMH 61
>gi|395765751|ref|ZP_10446343.1| hypothetical protein MCO_01219 [Bartonella sp. DB5-6]
gi|395410946|gb|EJF77488.1| hypothetical protein MCO_01219 [Bartonella sp. DB5-6]
Length = 306
Score = 38.5 bits (88), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFLEL-GIPPQVSSATALFAITFSSSMSVV 392
G+ G++ ++G+GGGF + P + L +P +V T+LF ITF SS + V
Sbjct: 181 IGLIVGLLSSIMGIGGGFFMIPALIYLMRVPTRVVIGTSLFQITFVSSFTTV 232
>gi|218681756|ref|ZP_03529531.1| hypothetical protein RetlC8_23708 [Rhizobium etli CIAT 894]
Length = 191
Score = 38.5 bits (88), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G + G+ G+GGGF++ PL + IPP V+ AT + SS + ++
Sbjct: 18 LGMGAAVGFLSGMFGVGGGFLITPLLIFYNIPPVVAVATGANQVVASSISGAITHFRRGT 77
Query: 400 FPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGLAKM 459
V + A VG + L++ +G+ +II +L ++IF+ ++GG+ L +
Sbjct: 78 LDVKLGTVLLIGGLSGATVGIWIF-SLLRAIGQLDLII-SLMYVIFLG--TVGGLMLLES 133
Query: 460 IKRIEH 465
I +
Sbjct: 134 INAMRR 139
>gi|414176339|ref|ZP_11430568.1| hypothetical protein HMPREF9695_04214 [Afipia broomeae ATCC 49717]
gi|410886492|gb|EKS34304.1| hypothetical protein HMPREF9695_04214 [Afipia broomeae ATCC 49717]
Length = 308
Score = 38.5 bits (88), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 33/53 (62%), Gaps = 3/53 (5%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATA---LFAITFSSSMS 390
A G G V G+ G+GGGF++ PL + +GI P V+ A+ + A +FS ++S
Sbjct: 19 AMGAAVGFVSGMFGIGGGFLMTPLLIFVGISPAVAVASVSSHMAASSFSGALS 71
>gi|159045387|ref|YP_001534181.1| hypothetical protein Dshi_2847 [Dinoroseobacter shibae DFL 12]
gi|157913147|gb|ABV94580.1| conserved hypothetical protein [Dinoroseobacter shibae DFL 12]
Length = 306
Score = 38.5 bits (88), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 1/49 (2%)
Query: 351 GLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G+ G+GGGF++ PL +GIPP V+ AT I +SS S V +L +R
Sbjct: 29 GMFGVGGGFLMTPLLFFIGIPPAVAVATEANQIV-ASSFSGVLAHLRRR 76
>gi|398307487|ref|ZP_10511073.1| hypothetical protein BvalD_19187 [Bacillus vallismortis DV1-F-3]
Length = 256
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 32/69 (46%), Positives = 41/69 (59%), Gaps = 10/69 (14%)
Query: 324 KGDEGT--KWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATAL 380
KG+E KW AS L F G V G+LG GG FIL P + L L IP +++ A++L
Sbjct: 132 KGEEVIIHKWLASFLAFII------GGVSGILGAGGAFILVPIMLLILKIPVRITVASSL 185
Query: 381 FAITFSSSM 389
AITF SS+
Sbjct: 186 -AITFLSSI 193
>gi|55380318|ref|YP_138167.1| hypothetical protein rrnB0294 [Haloarcula marismortui ATCC 43049]
gi|55233043|gb|AAV48461.1| unknown [Haloarcula marismortui ATCC 43049]
Length = 342
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 347 GMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 405
G++ G LG+GGGFI P LF +G+P ++ T LF I FS + Y L +
Sbjct: 191 GLLSGFLGVGGGFIRMPALFYLIGVPVPIAVGTDLFEIVFSGGLGSFLYALDGGVDLSIV 250
Query: 406 LYFFALSIIAAFVGQ 420
L A S A VG
Sbjct: 251 LPLLAGSAFGARVGS 265
>gi|448651767|ref|ZP_21680806.1| hypothetical protein C435_06850 [Haloarcula californiae ATCC 33799]
gi|445770230|gb|EMA21297.1| hypothetical protein C435_06850 [Haloarcula californiae ATCC 33799]
Length = 342
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 347 GMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 405
G++ G LG+GGGFI P LF +G+P ++ T LF I FS + Y L +
Sbjct: 191 GLLSGFLGVGGGFIRMPALFYLIGVPVPIAVGTDLFEIVFSGGLGSFLYALDGGVDLSIV 250
Query: 406 LYFFALSIIAAFVGQ 420
L A S A VG
Sbjct: 251 LPLLAGSAFGARVGS 265
>gi|448644624|ref|ZP_21679080.1| hypothetical protein C436_20063 [Haloarcula sinaiiensis ATCC 33800]
gi|445757585|gb|EMA08928.1| hypothetical protein C436_20063 [Haloarcula sinaiiensis ATCC 33800]
Length = 342
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 347 GMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYA 405
G++ G LG+GGGFI P LF +G+P ++ T LF I FS + Y L +
Sbjct: 191 GLLSGFLGVGGGFIRMPALFYLIGVPVPIAVGTDLFEIVFSGGLGSFLYALDGGVDLSIV 250
Query: 406 LYFFALSIIAAFVGQ 420
L A S A VG
Sbjct: 251 LPLLAGSAFGARVGS 265
>gi|389852005|ref|YP_006354239.1| permease [Pyrococcus sp. ST04]
gi|388249311|gb|AFK22164.1| putative permease [Pyrococcus sp. ST04]
Length = 254
Score = 38.5 bits (88), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 50/91 (54%), Gaps = 2/91 (2%)
Query: 363 PLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHV 422
PL LG+P + AT+ FAI F+S+ S +++YLL V + II A +G +
Sbjct: 161 PLLTWLGMPIHYAVATSSFAIVFTSTSSAIKHYLLGNVEVQWLPLLVPGLIIGAQIGAKI 220
Query: 423 LKKL-IKILGRA-SIIIFTLSFIIFVSALSL 451
KK K L A ++++ L+F + + AL+L
Sbjct: 221 AKKTRAKSLKNAFAVVMVILAFRMILKALNL 251
>gi|451942306|ref|YP_007462943.1| putative membrane protein [Bartonella vinsonii subsp. berkhoffii
str. Winnie]
gi|451901693|gb|AGF76155.1| putative membrane protein [Bartonella vinsonii subsp. berkhoffii
str. Winnie]
Length = 306
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 392
G+ G++ ++G+GGGF + P L L +P V T+LF ITF SS + V
Sbjct: 182 GLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232
>gi|383828154|ref|ZP_09983243.1| putative permease [Saccharomonospora xinjiangensis XJ-54]
gi|383460807|gb|EID52897.1| putative permease [Saccharomonospora xinjiangensis XJ-54]
Length = 289
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 62/138 (44%), Gaps = 4/138 (2%)
Query: 310 EAVALYKGRRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLEL 368
A A+ +G++K + +++ G+ G+V GL+G GGGF++ P L L
Sbjct: 112 TATAMIRGKKKATGENSGHGDLPLKRIILD---GLVVGLVTGLVGAGGGFLVVPALALLG 168
Query: 369 GIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIK 428
G+P ++ T+L I S + Y P L A +I+ + +G + ++ +
Sbjct: 169 GLPMAIAVGTSLVVIAMKSFAGLAGYLTTVALNWPLVLGVTAAAIVGSILGAMLTSRVPE 228
Query: 429 ILGRASIIIFTLSFIIFV 446
R IF L +FV
Sbjct: 229 AALRKGFGIFVLVMGVFV 246
>gi|418023649|ref|ZP_12662634.1| protein of unknown function DUF81 [Shewanella baltica OS625]
gi|353537532|gb|EHC07089.1| protein of unknown function DUF81 [Shewanella baltica OS625]
Length = 257
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
S G+ AG + ++G GGG + P L LGI P T A +F SSM+
Sbjct: 6 EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVG 419
YY F F+ ++ IA F+G
Sbjct: 65 WTYYRQHLFKPS----FWYMAFIATFIG 88
>gi|240850515|ref|YP_002971914.1| hypothetical membrane protein [Bartonella grahamii as4aup]
gi|240267638|gb|ACS51226.1| hypothetical membrane protein [Bartonella grahamii as4aup]
Length = 306
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 343 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 392
G+ G++ ++G+GGGF + P L L +P V T+LF ITF SS + V
Sbjct: 182 GLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232
>gi|126173862|ref|YP_001050011.1| hypothetical protein Sbal_1628 [Shewanella baltica OS155]
gi|386340622|ref|YP_006036988.1| hypothetical protein [Shewanella baltica OS117]
gi|125997067|gb|ABN61142.1| protein of unknown function DUF81 [Shewanella baltica OS155]
gi|334863023|gb|AEH13494.1| protein of unknown function DUF81 [Shewanella baltica OS117]
Length = 257
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
S G+ AG + ++G GGG + P L LGI P T A +F SSM+
Sbjct: 6 EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVG 419
YY F F+ ++ IA F+G
Sbjct: 65 WTYYRQHLFKPS----FWYMAFIATFIG 88
>gi|328952894|ref|YP_004370228.1| hypothetical protein Desac_1185 [Desulfobacca acetoxidans DSM
11109]
gi|328453218|gb|AEB09047.1| protein of unknown function DUF81 [Desulfobacca acetoxidans DSM
11109]
Length = 427
Score = 38.5 bits (88), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 1/65 (1%)
Query: 337 VFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITFSSSMSVVEYY 395
VF CG G+V ++G+GGGF+ P+F+ LG+ + T + I F++ + + Y
Sbjct: 258 VFVALCGFVVGVVAAIMGVGGGFLAFPMFVYLLGVSSFTTVGTDILQIIFTAGYAAISQY 317
Query: 396 LLKRF 400
+ F
Sbjct: 318 AIYGF 322
>gi|425737948|ref|ZP_18856217.1| permease [Staphylococcus massiliensis S46]
gi|425480853|gb|EKU48016.1| permease [Staphylococcus massiliensis S46]
Length = 273
Score = 38.5 bits (88), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 1/82 (1%)
Query: 342 CGITAGMVGGLLGLGGGFILGPLFL-ELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
C G++ GL G+GGG ++ PL L PP V+ T++ I FSS S + L
Sbjct: 162 CSFLIGIITGLFGIGGGALMTPLMLIVFRFPPHVAVGTSMMMIFFSSLTSSFGHILQGHV 221
Query: 401 PVPYALYFFALSIIAAFVGQHV 422
YAL S I A +G +
Sbjct: 222 DWFYALILVISSYIGAKIGVRI 243
>gi|330813138|ref|YP_004357377.1| transmembrane permease [Candidatus Pelagibacter sp. IMCC9063]
gi|327486233|gb|AEA80638.1| putative transmembrane permease [Candidatus Pelagibacter sp.
IMCC9063]
Length = 352
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 31/44 (70%), Gaps = 1/44 (2%)
Query: 343 GITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSATALFAITF 385
GI G++ +LG+GGGF+L P+ + LG+P ++ + T+LF + F
Sbjct: 182 GIVVGILSSMLGVGGGFLLVPIMIYILGMPARLVAGTSLFVMIF 225
>gi|326795569|ref|YP_004313389.1| hypothetical protein Marme_2312 [Marinomonas mediterranea MMB-1]
gi|326546333|gb|ADZ91553.1| protein of unknown function DUF81 [Marinomonas mediterranea MMB-1]
Length = 253
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATAL 380
++ +Y A G+ G+ G+ G+GGG ++ PL L G PP ++ T L
Sbjct: 2 EIFWYIAAGVGVGLAVGITGVGGGSLMTPLLLLFGFPPHIAIGTDL 47
>gi|160874768|ref|YP_001554084.1| hypothetical protein Sbal195_1651 [Shewanella baltica OS195]
gi|378708021|ref|YP_005272915.1| hypothetical protein [Shewanella baltica OS678]
gi|160860290|gb|ABX48824.1| protein of unknown function DUF81 [Shewanella baltica OS195]
gi|315267010|gb|ADT93863.1| protein of unknown function DUF81 [Shewanella baltica OS678]
Length = 257
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
S G+ AG + ++G GGG + P L LGI P T A +F SSM+
Sbjct: 6 EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVG 419
YY F F+ ++ IA F+G
Sbjct: 65 WTYYRQHLFKPS----FWYMAFIATFIG 88
>gi|71395121|ref|XP_802308.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70861404|gb|EAN80862.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 261
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 2/113 (1%)
Query: 102 IFVPMLNLIVGFDAKSSAAVSKCMVTGGAVSAVVYNLRQRHPT--LDIPVIDYDLALLFQ 159
I VPM L++G + +S+ + G ++ V+ +R+R P P+I+Y L
Sbjct: 111 ILVPMFCLLMGLPMDFAVGLSQSTICGQSILNVLIAIRKRFPCAGCSRPLINYQYLTLLV 170
Query: 160 PLLVLGISIGVAFNVIFADWMITVLLIVLFLVLSTRAFLKGVETWKKETITKR 212
PL V+G IG N + D VLL +L + R+ K + ++++ +R
Sbjct: 171 PLGVIGTLIGGVLNRLCPDLFRLVLLFLLLTAVLYRSVRKMIAQYREDQSERR 223
>gi|395782829|ref|ZP_10463201.1| hypothetical protein MCY_01598 [Bartonella rattimassiliensis 15908]
gi|395416707|gb|EJF83077.1| hypothetical protein MCY_01598 [Bartonella rattimassiliensis 15908]
Length = 306
Score = 38.1 bits (87), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 342 CGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV 392
G+ G++ ++G+GGGF + P L L +P V T+LF ITF SS + V
Sbjct: 181 IGLIVGLLSSIMGIGGGFFMIPALIYLLRVPTSVVIGTSLFQITFVSSFTTV 232
>gi|313206800|ref|YP_004045977.1| hypothetical protein Riean_1314 [Riemerella anatipestifer ATCC
11845 = DSM 15868]
gi|383486108|ref|YP_005395020.1| hypothetical protein RA0C_1590 [Riemerella anatipestifer ATCC 11845
= DSM 15868]
gi|386321217|ref|YP_006017379.1| hypothetical protein RIA_0901 [Riemerella anatipestifer RA-GD]
gi|416109380|ref|ZP_11591339.1| hypothetical protein RAYM_00625 [Riemerella anatipestifer RA-YM]
gi|442313984|ref|YP_007355287.1| hypothetical protein G148_0288 [Riemerella anatipestifer RA-CH-2]
gi|312446116|gb|ADQ82471.1| protein of unknown function DUF81 [Riemerella anatipestifer ATCC
11845 = DSM 15868]
gi|315023873|gb|EFT36875.1| hypothetical protein RAYM_00625 [Riemerella anatipestifer RA-YM]
gi|325335760|gb|ADZ12034.1| Protein of unknown function DUF81 [Riemerella anatipestifer RA-GD]
gi|380460793|gb|AFD56477.1| hypothetical protein RA0C_1590 [Riemerella anatipestifer ATCC 11845
= DSM 15868]
gi|441482907|gb|AGC39593.1| hypothetical protein G148_0288 [Riemerella anatipestifer RA-CH-2]
Length = 264
Score = 38.1 bits (87), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 58/111 (52%), Gaps = 8/111 (7%)
Query: 318 RRKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSS 376
R K E + R + L+ + G+ G++ GL+G GGGF++ P L + LG+ + +
Sbjct: 130 RSKTEPSTLEISPSRNTLLI---SQGLLVGIITGLVGAGGGFLIIPALVMILGLKMKEAI 186
Query: 377 ATALFAITFSSSMSVVEYYLLKRFPVP--YALYFFALSIIAAFVGQHVLKK 425
T+LF IT SS++ V L + + + L F LSI+ +G + K+
Sbjct: 187 GTSLFIITLSSTIGFVSS--LDKVAIDWYFLLSFTGLSILGVLLGLALSKR 235
>gi|153000146|ref|YP_001365827.1| hypothetical protein Shew185_1617 [Shewanella baltica OS185]
gi|373949018|ref|ZP_09608979.1| protein of unknown function DUF81 [Shewanella baltica OS183]
gi|386325143|ref|YP_006021260.1| hypothetical protein [Shewanella baltica BA175]
gi|151364764|gb|ABS07764.1| protein of unknown function DUF81 [Shewanella baltica OS185]
gi|333819288|gb|AEG11954.1| protein of unknown function DUF81 [Shewanella baltica BA175]
gi|373885618|gb|EHQ14510.1| protein of unknown function DUF81 [Shewanella baltica OS183]
Length = 257
Score = 38.1 bits (87), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
S G+ AG + ++G GGG + P L LGI P T A +F SSM+
Sbjct: 6 EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVG 419
YY F F+ ++ IA F+G
Sbjct: 65 WTYYRQHLFKPS----FWYMAFIATFIG 88
>gi|433544184|ref|ZP_20500574.1| hypothetical protein D478_10790 [Brevibacillus agri BAB-2500]
gi|432184540|gb|ELK42051.1| hypothetical protein D478_10790 [Brevibacillus agri BAB-2500]
Length = 262
Score = 38.1 bits (87), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 45/71 (63%), Gaps = 6/71 (8%)
Query: 324 KGDEGTKWRASQLVFYCACGITA----GMVGGLLGLGGGFILGPLFLE-LGIPPQVSSAT 378
K ++GT+ +A ++ + A I A G+V G++G GG FIL P+ L L IP +V+ A+
Sbjct: 130 KKNKGTQEQADEIEYNRAVAIPAAFVVGIVSGIVGAGGAFILIPIMLTVLKIPTRVTIAS 189
Query: 379 ALFAITFSSSM 389
+L AI F S++
Sbjct: 190 SL-AIVFISAI 199
>gi|217973886|ref|YP_002358637.1| hypothetical protein Sbal223_2726 [Shewanella baltica OS223]
gi|217499021|gb|ACK47214.1| protein of unknown function DUF81 [Shewanella baltica OS223]
Length = 257
Score = 38.1 bits (87), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 5/88 (5%)
Query: 332 RASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSV 391
S G+ AG + ++G GGG + P L LGI P T A +F SSM+
Sbjct: 6 EPSHWALLAVIGLIAGFIDAVVG-GGGLLSIPALLTLGIAPHTVLGTNKLAASFGSSMAA 64
Query: 392 VEYYLLKRFPVPYALYFFALSIIAAFVG 419
YY F F+ ++ IA F+G
Sbjct: 65 WTYYRQHLFKPS----FWYMAFIATFIG 88
>gi|289551261|ref|YP_003472165.1| hypothetical protein SLGD_01950 [Staphylococcus lugdunensis
HKU09-01]
gi|385784877|ref|YP_005761050.1| hypothetical protein SLUG_19420 [Staphylococcus lugdunensis
N920143]
gi|418414556|ref|ZP_12987771.1| hypothetical protein HMPREF9308_00936 [Staphylococcus lugdunensis
ACS-027-V-Sch2]
gi|289180793|gb|ADC88038.1| hypothetical protein SLGD_01950 [Staphylococcus lugdunensis
HKU09-01]
gi|339895133|emb|CCB54450.1| putative membrane protein [Staphylococcus lugdunensis N920143]
gi|410877163|gb|EKS25060.1| hypothetical protein HMPREF9308_00936 [Staphylococcus lugdunensis
ACS-027-V-Sch2]
Length = 275
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 316 KGRRKIASKGDEGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPP 372
K + + + EG + S LV + A + G++ GL G+GGG ++ PL L + PP
Sbjct: 137 KPQYRKSYTDSEGVTYHYSVPPLVAFVAT-LFIGVLTGLFGIGGGALMTPLMLIIFRFPP 195
Query: 373 QVSSATALFAITFSSSMSVVEY 394
V+ T++ I FSS MS + +
Sbjct: 196 HVAVGTSMMMIFFSSVMSSIGH 217
>gi|418636065|ref|ZP_13198419.1| sulfite exporter TauE/SafE [Staphylococcus lugdunensis VCU139]
gi|374841224|gb|EHS04701.1| sulfite exporter TauE/SafE [Staphylococcus lugdunensis VCU139]
Length = 275
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 316 KGRRKIASKGDEGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFLEL-GIPP 372
K + + + EG + S LV + A + G++ GL G+GGG ++ PL L + PP
Sbjct: 137 KPQYRKSYTDSEGVTYHYSVPPLVAFVAT-LFIGVLTGLFGIGGGALMTPLMLIIFRFPP 195
Query: 373 QVSSATALFAITFSSSMSVVEY 394
V+ T++ I FSS MS + +
Sbjct: 196 HVAVGTSMMMIFFSSVMSSIGH 217
>gi|365886110|ref|ZP_09425076.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3809]
gi|365338422|emb|CCD97607.1| putative permease; membrane protein [Bradyrhizobium sp. STM 3809]
Length = 319
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%)
Query: 338 FYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 27 LLLAMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYW 84
>gi|146343498|ref|YP_001208546.1| permease membrane protein [Bradyrhizobium sp. ORS 278]
gi|146196304|emb|CAL80331.1| putative permease; putative membrane protein [Bradyrhizobium sp.
ORS 278]
Length = 331
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%)
Query: 341 ACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYY 395
A G G V G+ G+GGGF++ PL + +GI P V+ A+ I SS + Y+
Sbjct: 42 AMGAAVGFVSGMFGIGGGFLMTPLLIFVGITPAVAVASVSSHIAASSFSGALSYW 96
>gi|384135922|ref|YP_005518636.1| hypothetical protein TC41_2211 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
gi|339290007|gb|AEJ44117.1| protein of unknown function DUF81 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
Length = 256
Score = 38.1 bits (87), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 6/115 (5%)
Query: 356 GGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIA 415
GGG I P L +G+PP + T FA T S+ S Y L + + F L ++A
Sbjct: 28 GGGLISLPALLSIGLPPAAALGTNKFAGTISAITSFSSYLLSGKVRLKLVGPLFPLGVMA 87
Query: 416 AFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL--GGVGLAKMIKRIEHKEY 468
+ +G +++ +L R F L ++ V+A +L +GL + I+ + K +
Sbjct: 88 SALGAYIVHQLPSSFLRP----FVLVMLVIVAAYTLWKKDLGLVESIRPLTRKTF 138
>gi|399887998|ref|ZP_10773875.1| hypothetical protein CarbS_05635 [Clostridium arbusti SL206]
Length = 274
Score = 38.1 bits (87), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 1/91 (1%)
Query: 329 TKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSS 388
++ S L+ C G A V + G GGG I P FL G+P +++ T F T +
Sbjct: 21 SQTTPSILLLLCIFGFIAAAVDSIAG-GGGLISLPAFLLAGLPARLALGTNKFCSTSGTL 79
Query: 389 MSVVEYYLLKRFPVPYALYFFALSIIAAFVG 419
MS EYY + Y F ++I A +G
Sbjct: 80 MSSFEYYKNGKMNFKLLKYLFPFTLIGAVIG 110
>gi|227534549|ref|ZP_03964598.1| permease [Lactobacillus paracasei subsp. paracasei ATCC 25302]
gi|227187798|gb|EEI67865.1| permease [Lactobacillus paracasei subsp. paracasei ATCC 25302]
Length = 280
Score = 38.1 bits (87), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 21/89 (23%), Positives = 48/89 (53%), Gaps = 1/89 (1%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVE 393
Q++ G+ G++G +LG+GGG I+ P+ + +G+P Q + ++ ++ +SS + +
Sbjct: 3 QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHV 422
Y + + A++ + + A +G V
Sbjct: 63 YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91
>gi|387128098|ref|YP_006296703.1| permease [Methylophaga sp. JAM1]
gi|386275160|gb|AFI85058.1| putative permease [Methylophaga sp. JAM1]
Length = 269
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 8/67 (11%)
Query: 339 YCACGITAGMVGGLLGLGGGFILGP----LFLELGIPPQ----VSSATALFAITFSSSMS 390
Y A G+ AG + GL G+GGG ++ P LF + G+P Q ++ AT+L I +S S
Sbjct: 13 YAAAGMVAGTLAGLFGIGGGLVIVPVLVWLFSKQGLPDQYLIHMAVATSLMTIVVTSMSS 72
Query: 391 VVEYYLL 397
+ ++ L
Sbjct: 73 IFAHWRL 79
>gi|19551303|ref|NP_599305.1| hypothetical protein NCgl0052 [Corynebacterium glutamicum ATCC
13032]
gi|62388948|ref|YP_224350.1| hypothetical protein cg0072 [Corynebacterium glutamicum ATCC 13032]
gi|41324281|emb|CAF18621.1| conserved hypothetical membrane protein [Corynebacterium glutamicum
ATCC 13032]
Length = 248
Score = 38.1 bits (87), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 64/141 (45%), Gaps = 10/141 (7%)
Query: 313 ALYKGRRKIASKG-DEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGP-LFLELGI 370
A+ +GR++ KG + + WR G+ G V GL+G GGGF++ P L L G+
Sbjct: 115 AMLRGRKQ--KKGASKSSLWR------VLVDGLVVGAVTGLVGAGGGFLVVPALALLGGL 166
Query: 371 PPQVSSATALFAITFSSSMSVVEYYLLKRFPVPYALYFFALSIIAAFVGQHVLKKLIKIL 430
V+ T+L IT S + Y + L A +I+ + G + ++ + L
Sbjct: 167 SMPVAVGTSLVVITMKSFAGLAGYLTSVQLDWGLVLMVTAAAIVGSLAGSRLAGRVPETL 226
Query: 431 GRASIIIFTLSFIIFVSALSL 451
R +F L +FV L L
Sbjct: 227 LRKGFGVFVLVMGVFVLGLEL 247
>gi|448406174|ref|ZP_21572696.1| hypothetical protein C475_00150 [Halosimplex carlsbadense 2-9-1]
gi|445678013|gb|ELZ30508.1| hypothetical protein C475_00150 [Halosimplex carlsbadense 2-9-1]
Length = 343
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 340 CACGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSM 389
A G++ G LG+GGGFI P LF +G+P V+ T LF I FS +
Sbjct: 183 LAVAFATGLLSGFLGVGGGFIRMPALFYLIGVPVPVAVGTDLFEIVFSGGI 233
>gi|417985491|ref|ZP_12626075.1| membrane protein [Lactobacillus casei 32G]
gi|410528519|gb|EKQ03371.1| membrane protein [Lactobacillus casei 32G]
Length = 280
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 21/89 (23%), Positives = 48/89 (53%), Gaps = 1/89 (1%)
Query: 335 QLVFYCACGITAGMVGGLLGLGGGFILGPLF-LELGIPPQVSSATALFAITFSSSMSVVE 393
Q++ G+ G++G +LG+GGG I+ P+ + +G+P Q + ++ ++ +SS + +
Sbjct: 3 QMIMLMVTGLLGGILGAVLGIGGGMIITPILTMLMGLPIQYAIGASIVSVIATSSGATIA 62
Query: 394 YYLLKRFPVPYALYFFALSIIAAFVGQHV 422
Y + + A++ + + A +G V
Sbjct: 63 YLKDEMLNLRVAMFLEIATTVGAVIGAVV 91
>gi|336423425|ref|ZP_08603555.1| hypothetical protein HMPREF0993_02932 [Lachnospiraceae bacterium
5_1_57FAA]
gi|336004677|gb|EGN34737.1| hypothetical protein HMPREF0993_02932 [Lachnospiraceae bacterium
5_1_57FAA]
Length = 253
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 337 VFYCACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEY-- 394
+ C AGMV + G GGG I P +L G+PP + AT + F ++++ +
Sbjct: 9 IIVCPLVFLAGMVDAIAG-GGGLISLPAYLIAGLPPHAAVATNKMSSPFGTALATYRFAR 67
Query: 395 -YLLK-RFPVPYALYFFALSIIAAFVGQHV 422
+L+ R VP + +II +F+G HV
Sbjct: 68 NHLINVRLAVPSVMA----AIIGSFIGSHV 93
>gi|325106350|ref|YP_004276004.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324975198|gb|ADY54182.1| protein of unknown function DUF81 [Pedobacter saltans DSM 12145]
Length = 265
Score = 38.1 bits (87), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 343 GITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVV-EYYLLKRF 400
GI G++ GL+G GGGF++ P + L +G+P ++ T+L I +S + ++ L
Sbjct: 154 GIIVGVLSGLIGAGGGFLIVPAITLFMGVPVHLAVGTSLLIIALNSLVGFTGDFSHLSEI 213
Query: 401 PVPYALYFFALSIIAAFVGQHVLKKL 426
Y L F S + VG ++ KKL
Sbjct: 214 NWSYLLEFSLFSSVGVLVGVYLGKKL 239
>gi|149913247|ref|ZP_01901781.1| integral membrane protein, putative [Roseobacter sp. AzwK-3b]
gi|149813653|gb|EDM73479.1| integral membrane protein, putative [Roseobacter sp. AzwK-3b]
Length = 305
Score = 38.1 bits (87), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 48/88 (54%), Gaps = 4/88 (4%)
Query: 342 CGITAGMVGGLLGLGGGFILGP-LFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKRF 400
G++ G++ ++G+GGGFI+ P + LG+P +V T+LF I F ++ + + + F
Sbjct: 179 VGVSVGILAAIMGVGGGFIMVPAMIYLLGMPTKVVVGTSLFQIIFVTAFTTM-LHATTNF 237
Query: 401 PVPYALYFFAL--SIIAAFVGQHVLKKL 426
V AL L +I A +G + K+
Sbjct: 238 TVDVALAVLLLIGGVIGAQIGTQIGTKM 265
>gi|423434381|ref|ZP_17411362.1| hypothetical protein IE9_00562 [Bacillus cereus BAG4X12-1]
gi|401126850|gb|EJQ34583.1| hypothetical protein IE9_00562 [Bacillus cereus BAG4X12-1]
Length = 254
Score = 38.1 bits (87), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 58/110 (52%), Gaps = 9/110 (8%)
Query: 319 RKIASKGDEGTKWRASQLVFYCACGITAGMVGGLLGLGGGFILGPLFLE-LGIPPQVSSA 377
+K S + KW AS L F G V G++G GG F+L P+ L L +P +++ A
Sbjct: 125 KKNNSGAVKYNKWLASVLAFIV------GGVSGIIGAGGAFLLVPIMLVILKLPLRMTIA 178
Query: 378 TALFAITFSSSMSVVEYYLLK-RFPVPYALYFFALSIIAAFVGQHVLKKL 426
T++ AITF SS+ + ++ + V AL SI AA +G V KK+
Sbjct: 179 TSI-AITFISSVGITTGKVITGQVVVIPALIIAVASIFAAPLGARVGKKI 227
>gi|398385949|ref|ZP_10543964.1| putative permease [Sphingobium sp. AP49]
gi|397719442|gb|EJK80010.1| putative permease [Sphingobium sp. AP49]
Length = 304
Score = 38.1 bits (87), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 65/119 (54%), Gaps = 11/119 (9%)
Query: 340 CACGITAGMVGGLLGLGGGFILGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLKR 399
G G++ G+ G+GGGF+ PL + GIPP V++A+A +T +S VV + + R
Sbjct: 18 IGLGGVVGLLSGMFGVGGGFLTTPLLIFYGIPPTVAAASAASQVTGASVSGVVTH--MSR 75
Query: 400 FPVPYAL--YFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSLGGVGL 456
V + + A I+ A +G + +L++ +G+ +I I++V L LGG+GL
Sbjct: 76 GTVDFRMGGVLIAGGIVGAGLGVLIF-RLLQAIGQIDTVI----GILYV--LMLGGIGL 127
>gi|315658763|ref|ZP_07911632.1| conserved hypothetical protein [Staphylococcus lugdunensis M23590]
gi|315496218|gb|EFU84544.1| conserved hypothetical protein [Staphylococcus lugdunensis M23590]
Length = 275
Score = 38.1 bits (87), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 4/82 (4%)
Query: 316 KGRRKIASKGDEGTKWRAS--QLVFYCACGITAGMVGGLLGLGGGFILGPLFL-ELGIPP 372
K + + + EG + S LV + A + G++ GL G+GGG ++ PL L PP
Sbjct: 137 KPQYRKSYTDSEGVTYHYSVPPLVAFVAT-LFIGVLTGLFGIGGGALMTPLMLIVFRFPP 195
Query: 373 QVSSATALFAITFSSSMSVVEY 394
V+ T++ I FSS MS + +
Sbjct: 196 HVAVGTSMMMIFFSSVMSSIGH 217
>gi|427412871|ref|ZP_18903063.1| hypothetical protein HMPREF9282_00470 [Veillonella ratti
ACS-216-V-Col6b]
gi|425715687|gb|EKU78673.1| hypothetical protein HMPREF9282_00470 [Veillonella ratti
ACS-216-V-Col6b]
Length = 271
Score = 38.1 bits (87), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 54/113 (47%), Gaps = 6/113 (5%)
Query: 340 CACGITAGMVGGLLGLGGGFI-LGPLFLELGIPPQVSSATALFAITFSSSMSVVEYYLLK 398
C + G + + G+GGG I + L LG P +++AT+ F + S+ M V +Y L
Sbjct: 156 CLISVVVGFISSVFGIGGGLIHVAALVYLLGFPTHIATATSQFILFLSTIMGVTTHYFLG 215
Query: 399 RFPVPYALYFFALSIIAAFVGQHVLKKLIKILGRASIIIFTLSFIIFVSALSL 451
A+ ++I A +G + K+L +A+ I+ S + + AL L
Sbjct: 216 HIQWNIAIACGIGAVIGAQLGAAIAKRL-----KATSILMVFSVGVGILALQL 263
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.324 0.140 0.423
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,286,475,836
Number of Sequences: 23463169
Number of extensions: 302890098
Number of successful extensions: 1360197
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1384
Number of HSP's successfully gapped in prelim test: 2802
Number of HSP's that attempted gapping in prelim test: 1348470
Number of HSP's gapped (non-prelim): 13415
length of query: 477
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 331
effective length of database: 8,933,572,693
effective search space: 2957012561383
effective search space used: 2957012561383
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 79 (35.0 bits)