BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 007391
         (605 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|356518975|ref|XP_003528150.1| PREDICTED: uncharacterized protein LOC100782659 [Glycine max]
          Length = 649

 Score = 1029 bits (2660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 483/597 (80%), Positives = 537/597 (89%), Gaps = 2/597 (0%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           MDNSGNPQD VVPPVEGVAGGGTAYGWND  +     + G I+PT IPT DLVHVWCMPS
Sbjct: 1   MDNSGNPQDVVVPPVEGVAGGGTAYGWNDGGTHGLN-VKGPIDPTGIPTRDLVHVWCMPS 59

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           TANVGPQ+MPR LEPINLLAARNERESVQIA+RPKVSWS SS AG VQ+QCSDLCS SGD
Sbjct: 60  TANVGPQDMPRHLEPINLLAARNERESVQIAIRPKVSWSGSSVAGTVQIQCSDLCSTSGD 119

Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
           RL+VGQSL+LRRVVP+LGVPDALVP+DLPV QI+L PGETTA+W+SID P +QPPG YEG
Sbjct: 120 RLIVGQSLLLRRVVPILGVPDALVPVDLPVSQINLFPGETTALWISIDVPSSQPPGQYEG 179

Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
           EI IT+ KAD E   Q L K EKH+L+ +L+ CLD VEPI+GKPL EVVER KS  T+LR
Sbjct: 180 EIAITAIKADAESPVQILSKVEKHQLYRDLKGCLDIVEPIDGKPLDEVVERVKSATTSLR 239

Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
           R++ SP FSEFFSDNGP+D+MDEDAIS+LS+R+KL+LTVW+F+LP TPSLPAV GISDTV
Sbjct: 240 RILLSPSFSEFFSDNGPVDVMDEDAISSLSIRMKLNLTVWEFVLPETPSLPAVFGISDTV 299

Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
           IEDRFGV+ G+ EWYEALDQHFKWLLQYRISP+FC+W + MRVLTYT PWPADHPKSDEY
Sbjct: 300 IEDRFGVQQGTAEWYEALDQHFKWLLQYRISPYFCKWADGMRVLTYTSPWPADHPKSDEY 359

Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
           FSDPRLAAYAVPY  V+S ND AKDY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SV
Sbjct: 360 FSDPRLAAYAVPYKQVVSGNDAAKDYLQKQVEILRTKTHWRKAYFYLWDEPLNLEQYDSV 419

Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
           RNMASE+HAYAPDAR+LTTYYCGP+DAPL PTPFE+FVKVP FLRPH QIYCTSEWVLGN
Sbjct: 420 RNMASEIHAYAPDARILTTYYCGPNDAPLAPTPFEAFVKVPSFLRPHNQIYCTSEWVLGN 479

Query: 480 REDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFL 539
           REDLVKDI+TELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFL
Sbjct: 480 REDLVKDIITELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFL 539

Query: 540 YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           YWGANCYEKATV SAEI+FR GLPPGDGVL+YPGEVFS+S QPVASLRLERIL+GLQ
Sbjct: 540 YWGANCYEKATVASAEIKFRHGLPPGDGVLYYPGEVFSTSHQPVASLRLERILNGLQ 596


>gi|449460114|ref|XP_004147791.1| PREDICTED: uncharacterized protein LOC101205217 [Cucumis sativus]
 gi|449476778|ref|XP_004154831.1| PREDICTED: uncharacterized LOC101205217 [Cucumis sativus]
          Length = 649

 Score = 1014 bits (2621), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 480/597 (80%), Positives = 530/597 (88%), Gaps = 2/597 (0%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           MDN+GNPQ  +VPPVEGVAGGGTAYGWND    +S     SI+PTE+PTADLV VWCMPS
Sbjct: 1   MDNTGNPQGIIVPPVEGVAGGGTAYGWNDGTLHTSTLPKRSIDPTEVPTADLVDVWCMPS 60

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           TANVGPQEMPR LE INLLAARNERESVQIA+RPK+SW +SS AG+VQV   DLCS SGD
Sbjct: 61  TANVGPQEMPRRLETINLLAARNERESVQIAMRPKISWGASSVAGIVQVFSGDLCSTSGD 120

Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
           RLVVGQSL LRRVVP+LGVPDALVPLDLPV QI+L+PGETTAVWVSID P  QPPG YEG
Sbjct: 121 RLVVGQSLRLRRVVPILGVPDALVPLDLPVSQINLLPGETTAVWVSIDVPNMQPPGQYEG 180

Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
           EIIIT+ K D E S+Q LGK EKH ++ ELR+CLD +E ++ KPL EVV+R KS   +L+
Sbjct: 181 EIIITAIKTDAESSTQYLGKAEKHEIYKELRSCLDIMEIVDEKPLEEVVKRVKSATASLK 240

Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
           RVI SP FSEF+S+NG ID+MDEDA SNLSVRVK+ LTVWDF +PATPSLPAVIG+SDTV
Sbjct: 241 RVILSPSFSEFYSENGSIDVMDEDAFSNLSVRVKIMLTVWDFTIPATPSLPAVIGVSDTV 300

Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
           IEDRFGV HG+DEW+EALD HFKWLLQYRISP+FCRWG+ MRVLTYTCPWPADHPKSDEY
Sbjct: 301 IEDRFGVEHGTDEWFEALDDHFKWLLQYRISPYFCRWGDGMRVLTYTCPWPADHPKSDEY 360

Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
           FSDPRL+AYAVPY  V   + G KDY+++E+E+LRTK HWKKAYFYLWDEPLNMEH+ SV
Sbjct: 361 FSDPRLSAYAVPYRAVFGGDTG-KDYLQREVEILRTKTHWKKAYFYLWDEPLNMEHFDSV 419

Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
           R+M+SE+ AYAPDARVLTTYYCGPSDAPL PT FE+FVKVP FLRPHTQIYCTSEWVLGN
Sbjct: 420 RSMSSEIRAYAPDARVLTTYYCGPSDAPLAPTTFEAFVKVPSFLRPHTQIYCTSEWVLGN 479

Query: 480 REDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFL 539
           REDLVKDI+ ELQPENGEEWWTYVCMGP DPHPNWHLGMRG+QHRAVMWRVWKEGGTGFL
Sbjct: 480 REDLVKDIIAELQPENGEEWWTYVCMGPGDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFL 539

Query: 540 YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSS +PVAS+RLER+LSGLQ
Sbjct: 540 YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSHEPVASVRLERLLSGLQ 596


>gi|356509690|ref|XP_003523579.1| PREDICTED: uncharacterized protein LOC100799554 [Glycine max]
          Length = 644

 Score = 1008 bits (2606), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/597 (79%), Positives = 536/597 (89%), Gaps = 7/597 (1%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           M  +GNPQD VVPPVEGVAGGGTAYGWND  +     + G I+PTEIPT DLVHVWCMP+
Sbjct: 1   MQLAGNPQDVVVPPVEGVAGGGTAYGWNDGGTHGLN-VKGPIDPTEIPTKDLVHVWCMPN 59

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           TANVGPQ+MPR LEPINLLAARNERESVQIA+RPKVSW  SS AG VQ+QCSDLCS SGD
Sbjct: 60  TANVGPQDMPRHLEPINLLAARNERESVQIAIRPKVSWGGSSVAGTVQIQCSDLCSTSGD 119

Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
           RL+VGQSL+LRRVVP+LGVPDALVP+DLPV QI+L PGETTA+W+SID P +QPPG YEG
Sbjct: 120 RLIVGQSLLLRRVVPILGVPDALVPVDLPVSQINLFPGETTALWISIDVPSSQPPGQYEG 179

Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
           EI+IT+ K+D ++S     K EKH+L+ +L+ CLD VEPI+GKPL EVVER KST T+LR
Sbjct: 180 EIVITAIKSDADIS-----KVEKHQLYRDLKGCLDIVEPIDGKPLDEVVERVKSTTTSLR 234

Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
           R++ SP FSEFFSDNGP+D+MDEDAIS+LS+R+KL+LTVW+F+LP TPSLPAV GISDTV
Sbjct: 235 RILLSPSFSEFFSDNGPVDVMDEDAISSLSLRMKLNLTVWEFVLPETPSLPAVFGISDTV 294

Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
           IEDRFGV+ G+ EWYEALDQHFKWLLQYRISP+FC+W + MRVLTYT PWPADHPKSDEY
Sbjct: 295 IEDRFGVQQGTAEWYEALDQHFKWLLQYRISPYFCKWADGMRVLTYTSPWPADHPKSDEY 354

Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
           FSDPRLAAYAVPY  V+S N+ A+DY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SV
Sbjct: 355 FSDPRLAAYAVPYKQVVSGNNSAEDYLQKQVEILRTKNHWRKAYFYLWDEPLNLEQYDSV 414

Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
           RNMASE+HAYAPDAR+LTTYYCGP+DAPL PTPF++FVKVP FLRPH QIYCTSEWVLGN
Sbjct: 415 RNMASEIHAYAPDARILTTYYCGPNDAPLAPTPFDAFVKVPSFLRPHNQIYCTSEWVLGN 474

Query: 480 REDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFL 539
           +EDLVKDI+ ELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFL
Sbjct: 475 QEDLVKDIIAELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFL 534

Query: 540 YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           YWGANCYEKATV SAEI+FR GLPPGDGVL+YPGEVFS+S QPVASLRLERIL+GLQ
Sbjct: 535 YWGANCYEKATVASAEIKFRHGLPPGDGVLYYPGEVFSTSHQPVASLRLERILNGLQ 591


>gi|224132110|ref|XP_002321258.1| predicted protein [Populus trichocarpa]
 gi|222862031|gb|EEE99573.1| predicted protein [Populus trichocarpa]
          Length = 652

 Score = 1005 bits (2598), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 484/601 (80%), Positives = 536/601 (89%), Gaps = 7/601 (1%)

Query: 1   MDNSG-NPQDSVVPPVEGVAGGGTAYGWNDN--CSQSSGPLNGSINPTEIPTADLVHVWC 57
           MDN+G NPQ  VVPPVEGVAGGGTAYGWND      S+    GSI+P+E+ T+DLVHVWC
Sbjct: 1   MDNTGANPQGIVVPPVEGVAGGGTAYGWNDGGGVHFSNSSPRGSIDPSEVLTSDLVHVWC 60

Query: 58  MPSTANVGPQEMP-RPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCS 116
           +PSTANVGPQE+P R LEPINLLAARNERESVQIALRPK +W  S +AGVVQVQCSDL S
Sbjct: 61  LPSTANVGPQEIPSRHLEPINLLAARNERESVQIALRPKATWGGSGSAGVVQVQCSDLTS 120

Query: 117 ASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPG 176
            SGDRLVVGQS+ LRRVV +LGVPDALVPLDLPV QI+L PGETTA+WVSID P AQP G
Sbjct: 121 TSGDRLVVGQSITLRRVVSILGVPDALVPLDLPVSQINLAPGETTALWVSIDVPSAQPQG 180

Query: 177 LYEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTA 235
            YEGE  IT+ KA+ E  SQ LGK ++H+L+ ELRNCLD +EP+EGKPL EVVERAKS  
Sbjct: 181 QYEGEFFITAIKAEAESPSQRLGKADRHQLYSELRNCLDIMEPVEGKPLDEVVERAKSVT 240

Query: 236 TTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGI 295
           T+LRRV+ SP+FSEF +DNGP+DMMDEDAISNL+VRVKL+LTVWDF+LPATPSLPAV GI
Sbjct: 241 TSLRRVLLSPVFSEFSTDNGPVDMMDEDAISNLTVRVKLNLTVWDFVLPATPSLPAVFGI 300

Query: 296 SDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPK 355
           SDTVIEDRFGV HGSDEWYEALDQHFKWLL YRISP+FCRWG +MRVLTYTCPWPADHPK
Sbjct: 301 SDTVIEDRFGVEHGSDEWYEALDQHFKWLLHYRISPYFCRWGGNMRVLTYTCPWPADHPK 360

Query: 356 SDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEH 415
           SDEYFSDPRLAAYAVPYS  +     A+DY++KEI++LRTK+HWKKAYFYLWDEPLN+E 
Sbjct: 361 SDEYFSDPRLAAYAVPYSQAVPG--AAQDYLQKEIDILRTKSHWKKAYFYLWDEPLNLEQ 418

Query: 416 YSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEW 475
           Y  VR+MAS++H YAPDARVLTTYYCGPSDAPLGPTPFE+FVKVPKFLRPHTQIYCTSEW
Sbjct: 419 YDMVRSMASKIHTYAPDARVLTTYYCGPSDAPLGPTPFEAFVKVPKFLRPHTQIYCTSEW 478

Query: 476 VLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG 535
           VLG+REDL K+IV+ELQPENGEEWWTYVC+GPSDPHPNWH+GMRG+QHRAV WRVWKEG 
Sbjct: 479 VLGDREDLAKEIVSELQPENGEEWWTYVCLGPSDPHPNWHIGMRGTQHRAVFWRVWKEGA 538

Query: 536 TGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGL 595
           TGFLYWGANCYEKATVPSAEI FRRGLPPGDGVL+YPGEVFSSS QPVAS+RLERILSGL
Sbjct: 539 TGFLYWGANCYEKATVPSAEISFRRGLPPGDGVLYYPGEVFSSSHQPVASVRLERILSGL 598

Query: 596 Q 596
           Q
Sbjct: 599 Q 599


>gi|255538584|ref|XP_002510357.1| conserved hypothetical protein [Ricinus communis]
 gi|223551058|gb|EEF52544.1| conserved hypothetical protein [Ricinus communis]
          Length = 651

 Score = 1002 bits (2590), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 478/595 (80%), Positives = 530/595 (89%), Gaps = 2/595 (0%)

Query: 3   NSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTA 62
           ++GNP+D + PPVEGVAGGGT+YGW D          GSI+P+E+ TA+LVHVWCMPSTA
Sbjct: 5   SAGNPRDGI-PPVEGVAGGGTSYGWTDGGLHGLNLPKGSIDPSEVSTANLVHVWCMPSTA 63

Query: 63  NVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRL 122
           NVGPQE+PR LEP+NLLAARNERESVQIA+RPKVSWSSS +AG V VQC+DL S SGDRL
Sbjct: 64  NVGPQEIPRHLEPVNLLAARNERESVQIAIRPKVSWSSSGSAGAVHVQCTDLSSTSGDRL 123

Query: 123 VVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEI 182
           V GQS+ LR+VV +LGVPDALVPLD PV +ISL+PGETTA+WVSID P AQPPG YEG+ 
Sbjct: 124 VAGQSITLRKVVTILGVPDALVPLDHPVSRISLVPGETTAIWVSIDIPSAQPPGQYEGDF 183

Query: 183 IIT-SKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRV 241
           IIT +K + E  S C  K EKHRL+MELRNCLD VEPIEGKPL+EVVER KS +T+LRRV
Sbjct: 184 IITATKTEAEYQSHCFNKAEKHRLYMELRNCLDIVEPIEGKPLNEVVERVKSASTSLRRV 243

Query: 242 IFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIE 301
           + SP FSEFFSDNG +DMMDEDAISNLSVRVKLSLTVWDFILP TPS PAV GISDTVIE
Sbjct: 244 LLSPSFSEFFSDNGSVDMMDEDAISNLSVRVKLSLTVWDFILPVTPSFPAVFGISDTVIE 303

Query: 302 DRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFS 361
           DRFGV HG+DEWYEAL+QHFKWLLQYRISP+FCRWG SMRV  YTCPWPADHPKSDEY S
Sbjct: 304 DRFGVEHGTDEWYEALEQHFKWLLQYRISPYFCRWGTSMRVFGYTCPWPADHPKSDEYLS 363

Query: 362 DPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRN 421
           DPRLAAYAVPY+  +S ND  KDY++KEIE+LRTK HWKKAYFYLWDEPLN+EHY S+RN
Sbjct: 364 DPRLAAYAVPYNRAVSGNDAGKDYLQKEIEMLRTKPHWKKAYFYLWDEPLNLEHYDSLRN 423

Query: 422 MASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNRE 481
           MA E+H YAPDAR+LTTYYCGP+DAPL PTPFE+FVKVPKF+RPH QIYC SEWVLGNR+
Sbjct: 424 MAGEIHGYAPDARILTTYYCGPNDAPLAPTPFEAFVKVPKFMRPHIQIYCASEWVLGNRD 483

Query: 482 DLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYW 541
           DLVKDI++ELQPENGEEWWTYVC+GPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFLYW
Sbjct: 484 DLVKDIISELQPENGEEWWTYVCLGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFLYW 543

Query: 542 GANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           GANCYEKATVPSAEIRFRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 544 GANCYEKATVPSAEIRFRRGLPPGDGVLYYPGEVFSSSHKPVASLRLERLLSGLQ 598


>gi|225458333|ref|XP_002283035.1| PREDICTED: uncharacterized protein LOC100243809 [Vitis vinifera]
 gi|302142468|emb|CBI19671.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score =  974 bits (2519), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 464/587 (79%), Positives = 513/587 (87%), Gaps = 4/587 (0%)

Query: 11  VVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMP 70
            VPPVEGVAGGGTAYGW+D     S  L GS +PTE+P+ADL+HVWCMPSTANVGPQEMP
Sbjct: 2   TVPPVEGVAGGGTAYGWSDGVVHPSNSLKGSTDPTEVPSADLLHVWCMPSTANVGPQEMP 61

Query: 71  RPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLML 130
           RPLE + LLAARNERESVQIA+RPKVSW  S   G VQVQCSDLCS SGDRLVVG+SL L
Sbjct: 62  RPLEHVTLLAARNERESVQIAMRPKVSWGGS--GGAVQVQCSDLCSPSGDRLVVGESLKL 119

Query: 131 RRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIIT-SKAD 189
           RRVV +LGVPDALVPLDLPV QISL+PGETTA+WVSID P  QPPG YEGE+IIT +KAD
Sbjct: 120 RRVVSILGVPDALVPLDLPVSQISLLPGETTAIWVSIDVPSTQPPGQYEGELIITATKAD 179

Query: 190 TELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSE 249
            E  ++CLGK E+ +L+ EL+N L+ VEPI+GKPL EVVER KS  TTLR +  SP F E
Sbjct: 180 AESRAKCLGKAERRQLYSELKNFLEIVEPIDGKPLDEVVERVKSATTTLRSIFQSPSFCE 239

Query: 250 FFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHG 309
           FFSD  P+DMMDEDAIS+LSVR+KLSLTVW+F+LP TPSLPAV GISDTVIEDRFGV HG
Sbjct: 240 FFSDGHPVDMMDEDAISDLSVRMKLSLTVWNFVLPLTPSLPAVFGISDTVIEDRFGVEHG 299

Query: 310 SDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYA 369
           +DEWYEALD HFKWLLQYRISP+FCRWG+ MRVLTYTCPWPA HPKSDEYFSDPRLAAYA
Sbjct: 300 TDEWYEALDHHFKWLLQYRISPYFCRWGDGMRVLTYTCPWPAHHPKSDEYFSDPRLAAYA 359

Query: 370 VPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAY 429
           VPYS V+      KDY+++EIE L+TK HWKKAYFYLWDEPLN+EH+ ++RNMA E+ AY
Sbjct: 360 VPYSQVVPGG-AEKDYLQREIETLKTKTHWKKAYFYLWDEPLNLEHFDNIRNMACEVQAY 418

Query: 430 APDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVT 489
           A DAR+LTTYY GPSDAPL    FE+FVKVPKFLRPHTQIYCTSEWV GNREDLVKDI+ 
Sbjct: 419 ARDARILTTYYSGPSDAPLASNNFEAFVKVPKFLRPHTQIYCTSEWVFGNREDLVKDIIA 478

Query: 490 ELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA 549
           ELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFLYWGANCYEKA
Sbjct: 479 ELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGANCYEKA 538

Query: 550 TVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           TVPSAE+ FRRGLPPGDGVLFYPGEV+S+S +PVAS+RLERILSGLQ
Sbjct: 539 TVPSAEVCFRRGLPPGDGVLFYPGEVYSTSHEPVASVRLERILSGLQ 585


>gi|297852256|ref|XP_002894009.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339851|gb|EFH70268.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 643

 Score =  961 bits (2485), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 456/596 (76%), Positives = 518/596 (86%), Gaps = 7/596 (1%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           MDN+G  + +V  PVEGVAGGGTAYG+ND     + PL  S +P+E+PTADLV+VWCMP+
Sbjct: 1   MDNNGLQEMTV--PVEGVAGGGTAYGFND-----AEPLKQSTDPSEVPTADLVNVWCMPN 53

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           T NVG QE PRPLEPINLLAARNERES QIA+RPKVSW++SS +G VQVQCSDLCS++GD
Sbjct: 54  TVNVGSQETPRPLEPINLLAARNERESFQIAMRPKVSWAASSPSGSVQVQCSDLCSSAGD 113

Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
           RLVVGQSL LRRVVP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P  QPPG YEG
Sbjct: 114 RLVVGQSLNLRRVVPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPNRQPPGQYEG 173

Query: 181 EIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRR 240
           EII+++       S  LGK EK +L +EL NCLD +EPIEGKP+ EVVER K  +++LRR
Sbjct: 174 EIIVSAMKTDGGGSAHLGKHEKDQLCVELNNCLDIMEPIEGKPMDEVVERIKCASSSLRR 233

Query: 241 VIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVI 300
           ++FSP FSEF S NG  DMM+ED +SNLS+R+KL LTVW+FI+P TPSLP+VIG+SDTVI
Sbjct: 234 ILFSPSFSEFISTNGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPSVIGVSDTVI 293

Query: 301 EDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYF 360
           EDRFGV  GS+EWYE LD HFKWLLQYRISP+FC+WGE MRVLTYT PWPADHPKSDEY 
Sbjct: 294 EDRFGVERGSEEWYEKLDLHFKWLLQYRISPYFCKWGEGMRVLTYTSPWPADHPKSDEYL 353

Query: 361 SDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVR 420
           SDPRLAAYAVPY  V++ +D  + Y+RKE+E+LR+K HWKKAYFYLWDEPLNMEH+ SVR
Sbjct: 354 SDPRLAAYAVPYRQVIAGDDIRESYLRKEVEILRSKPHWKKAYFYLWDEPLNMEHFDSVR 413

Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
            MASE++AYAPDARVLTTYYCGP DAPL PTPFESFVKVP  LRPHTQIYCTSEWVLGNR
Sbjct: 414 KMASEIYAYAPDARVLTTYYCGPGDAPLAPTPFESFVKVPNLLRPHTQIYCTSEWVLGNR 473

Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
           EDLVKDIV ELQ ENGEEWWTY+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTGFLY
Sbjct: 474 EDLVKDIVEELQTENGEEWWTYICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTGFLY 533

Query: 541 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           WGANCYEKATVPSAE++FRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 534 WGANCYEKATVPSAEVKFRRGLPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQ 589


>gi|42562571|ref|NP_175129.3| uncharacterized protein [Arabidopsis thaliana]
 gi|30725314|gb|AAP37679.1| At1g45150 [Arabidopsis thaliana]
 gi|110742869|dbj|BAE99332.1| hypothetical protein [Arabidopsis thaliana]
 gi|332193963|gb|AEE32084.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 643

 Score =  949 bits (2452), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 447/596 (75%), Positives = 515/596 (86%), Gaps = 7/596 (1%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           MDN+ + + +V  PVEGVAGGGTAYG+ND     + PL  S +P+E+PTADLV+VWCMP+
Sbjct: 1   MDNNVSQEMTV--PVEGVAGGGTAYGFND-----AEPLKQSTDPSEVPTADLVNVWCMPN 53

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           T NVG QE PR LEPINLLAARNERES QIA+RPKVSW++SS +G+VQVQCSDLCS++GD
Sbjct: 54  TVNVGSQETPRALEPINLLAARNERESFQIAMRPKVSWAASSPSGIVQVQCSDLCSSAGD 113

Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
           RLVVGQSL LRRVVP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P  QPPG YEG
Sbjct: 114 RLVVGQSLKLRRVVPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPTGQPPGQYEG 173

Query: 181 EIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRR 240
           EIII++       S  L K EK +L +EL  CLD +EPIEGKP+ EVVER K  +++LRR
Sbjct: 174 EIIISAMKTDGGGSSHLAKHEKDQLCVELNTCLDIMEPIEGKPMDEVVERIKCASSSLRR 233

Query: 241 VIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVI 300
           ++FSP FSEF S NG  DMM+ED +SNLS+R+KL LTVW+FI+P TPSLPAVIG+SDTVI
Sbjct: 234 ILFSPSFSEFISTNGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPAVIGVSDTVI 293

Query: 301 EDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYF 360
           EDRF V HGS++WY+ LD HFKWLLQYRISP+FC+WGESMRVLTYT PWPADHPKSDEY 
Sbjct: 294 EDRFAVEHGSEDWYKKLDLHFKWLLQYRISPYFCKWGESMRVLTYTSPWPADHPKSDEYL 353

Query: 361 SDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVR 420
           SD RLAAYAVPY  V++ +D  + Y+RKE+E+LR+K HW KAYFYLWDEPLNMEH+ +VR
Sbjct: 354 SDSRLAAYAVPYRQVIAGDDSRESYLRKEVEILRSKPHWNKAYFYLWDEPLNMEHFDNVR 413

Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
            MASE++AYAPD+RVLTTYYCGP DAPL PTPFESFVKVP  LRP+TQIYCTSEWVLGNR
Sbjct: 414 KMASEIYAYAPDSRVLTTYYCGPGDAPLAPTPFESFVKVPNLLRPYTQIYCTSEWVLGNR 473

Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
           EDLVKDI+ ELQ ENGEEWWTY+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTGFLY
Sbjct: 474 EDLVKDILDELQTENGEEWWTYICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTGFLY 533

Query: 541 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           WGANCYEKATVPSAE++FRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 534 WGANCYEKATVPSAEVKFRRGLPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQ 589


>gi|7767671|gb|AAF69168.1|AC007915_20 F27F5.22 [Arabidopsis thaliana]
          Length = 687

 Score =  918 bits (2372), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 443/635 (69%), Positives = 507/635 (79%), Gaps = 57/635 (8%)

Query: 14  PVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPL 73
           PVEGVAGGGTAYG+ND     + PL  S +P+E+PTADLV+VWCMP+T NVG QE PR L
Sbjct: 4   PVEGVAGGGTAYGFND-----AEPLKQSTDPSEVPTADLVNVWCMPNTVNVGSQETPRAL 58

Query: 74  EPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRV 133
           EPINLLAARNERES QIA+RPKVSW++SS +G+VQVQCSDLCS++GDRLVVGQSL LRRV
Sbjct: 59  EPINLLAARNERESFQIAMRPKVSWAASSPSGIVQVQCSDLCSSAGDRLVVGQSLKLRRV 118

Query: 134 VPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELS 193
           VP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P  QPPG YEGEIII++       
Sbjct: 119 VPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPTGQPPGQYEGEIIISAMKTDGGG 178

Query: 194 SQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSD 253
           S  L K EK +L +EL  CLD +EPIEGKP+ EVVER K  +++LRR++FSP FSEF S 
Sbjct: 179 SSHLAKHEKDQLCVELNTCLDIMEPIEGKPMDEVVERIKCASSSLRRILFSPSFSEFIST 238

Query: 254 NGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEW 313
           NG  DMM+ED +SNLS+R+KL LTVW+FI+P TPSLPAVIG+SDTVIEDRF V HGS++W
Sbjct: 239 NGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPAVIGVSDTVIEDRFAVEHGSEDW 298

Query: 314 YEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWP--------------------ADH 353
           Y+ LD HFKWLLQYRISP+FC+WGESMRVLTYT PWP                    ADH
Sbjct: 299 YKKLDLHFKWLLQYRISPYFCKWGESMRVLTYTSPWPANRFASRSELSICVPLFGFTADH 358

Query: 354 PKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNM 413
           PKSDEY SD RLAAYAVPY  V++ +D  + Y+RKE+E+LR+K HW KAYFYLWDEPLNM
Sbjct: 359 PKSDEYLSDSRLAAYAVPYRQVIAGDDSRESYLRKEVEILRSKPHWNKAYFYLWDEPLNM 418

Query: 414 EHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCT- 472
           EH+ +VR MASE++AYAPD+RVLTTYYCGP DAPL PTPFESFVKVP  LRP+TQIYCT 
Sbjct: 419 EHFDNVRKMASEIYAYAPDSRVLTTYYCGPGDAPLAPTPFESFVKVPNLLRPYTQIYCTS 478

Query: 473 -------------------------------SEWVLGNREDLVKDIVTELQPENGEEWWT 501
                                          SEWVLGNREDLVKDI+ ELQ ENGEEWWT
Sbjct: 479 KYVFGLKFSLFRHSPTWIDMEAVLLNHGLIFSEWVLGNREDLVKDILDELQTENGEEWWT 538

Query: 502 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 561
           Y+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTGFLYWGANCYEKATVPSAE++FRRG
Sbjct: 539 YICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEVKFRRG 598

Query: 562 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           LPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 599 LPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQ 633


>gi|222637345|gb|EEE67477.1| hypothetical protein OsJ_24889 [Oryza sativa Japonica Group]
          Length = 709

 Score =  889 bits (2296), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/590 (70%), Positives = 496/590 (84%), Gaps = 1/590 (0%)

Query: 8   QDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQ 67
           Q+S VPPVEGVAGGGT+YGW D   Q+S   NG+I+PT+I +ADL+HVW MPSTANV  Q
Sbjct: 16  QNSSVPPVEGVAGGGTSYGWVDGGLQASSLGNGAIDPTKIHSADLLHVWSMPSTANVSQQ 75

Query: 68  EMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQS 127
           E PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQVQC+DLCS++GDRLVVGQS
Sbjct: 76  EAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSSAGDRLVVGQS 135

Query: 128 LMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS- 186
           + LRRVVPMLGVPDALVP+D    QI+L+PGET+A+WVS++ P  Q PGLYEGEI I++ 
Sbjct: 136 VTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGLYEGEIFISAV 195

Query: 187 KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPL 246
           +A+ E   + L K E+++L+ ELRNC+D  EP +     E+V+R  S +TTLRR++  P 
Sbjct: 196 RAEAESRGESLTKSERYQLYKELRNCIDITEPRDYSSSEEMVQRLTSASTTLRRMLALPS 255

Query: 247 FSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV 306
           F +   +NG  DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS+TVIEDRF +
Sbjct: 256 FQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGISETVIEDRFCL 315

Query: 307 RHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLA 366
            HG+  WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+ EY+SDPRLA
Sbjct: 316 EHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKAKEYYSDPRLA 375

Query: 367 AYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASEL 426
           AYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y  + ++++EL
Sbjct: 376 AYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQYDVICSISNEL 435

Query: 427 HAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKD 486
            +YA D R+LTTYYCGPS + L P+ FE+FVKVP  LRPHTQI+CTSEWVLG REDLVKD
Sbjct: 436 RSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWVLGTREDLVKD 495

Query: 487 IVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY 546
           IV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG NCY
Sbjct: 496 IVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGTNCY 555

Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           EKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 556 EKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605


>gi|115473013|ref|NP_001060105.1| Os07g0581300 [Oryza sativa Japonica Group]
 gi|33146840|dbj|BAC79829.1| unknown protein [Oryza sativa Japonica Group]
 gi|50509223|dbj|BAD30493.1| unknown protein [Oryza sativa Japonica Group]
 gi|113611641|dbj|BAF22019.1| Os07g0581300 [Oryza sativa Japonica Group]
 gi|215737152|dbj|BAG96081.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 658

 Score =  887 bits (2292), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/590 (70%), Positives = 496/590 (84%), Gaps = 1/590 (0%)

Query: 8   QDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQ 67
           Q+S VPPVEGVAGGGT+YGW D   Q+S   NG+I+PT+I +ADL+HVW MPSTANV  Q
Sbjct: 16  QNSSVPPVEGVAGGGTSYGWVDGGLQASSLGNGAIDPTKIHSADLLHVWSMPSTANVSQQ 75

Query: 68  EMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQS 127
           E PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQVQC+DLCS++GDRLVVGQS
Sbjct: 76  EAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSSAGDRLVVGQS 135

Query: 128 LMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS- 186
           + LRRVVPMLGVPDALVP+D    QI+L+PGET+A+WVS++ P  Q PGLYEGEI I++ 
Sbjct: 136 VTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGLYEGEIFISAV 195

Query: 187 KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPL 246
           +A+ E   + L K E+++L+ ELRNC+D  EP +     E+V+R  S +TTLRR++  P 
Sbjct: 196 RAEAESRGESLTKSERYQLYKELRNCIDITEPRDYSSSEEMVQRLTSASTTLRRMLALPS 255

Query: 247 FSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV 306
           F +   +NG  DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS+TVIEDRF +
Sbjct: 256 FQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGISETVIEDRFCL 315

Query: 307 RHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLA 366
            HG+  WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+ EY+SDPRLA
Sbjct: 316 EHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKAKEYYSDPRLA 375

Query: 367 AYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASEL 426
           AYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y  + ++++EL
Sbjct: 376 AYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQYDVICSISNEL 435

Query: 427 HAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKD 486
            +YA D R+LTTYYCGPS + L P+ FE+FVKVP  LRPHTQI+CTSEWVLG REDLVKD
Sbjct: 436 RSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWVLGTREDLVKD 495

Query: 487 IVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY 546
           IV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG NCY
Sbjct: 496 IVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGTNCY 555

Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           EKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 556 EKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605


>gi|218199904|gb|EEC82331.1| hypothetical protein OsI_26624 [Oryza sativa Indica Group]
          Length = 709

 Score =  886 bits (2290), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 412/590 (69%), Positives = 496/590 (84%), Gaps = 1/590 (0%)

Query: 8   QDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQ 67
           Q+S VPPVEGVAGGGT+YGW D   Q+S   NG+I+PT+I +ADL+HVW MPSTANV  Q
Sbjct: 16  QNSSVPPVEGVAGGGTSYGWVDGGLQASSLGNGAIDPTKIHSADLLHVWSMPSTANVSQQ 75

Query: 68  EMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQS 127
           E PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQVQC+DLCS++GDRLVVGQS
Sbjct: 76  EAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSSAGDRLVVGQS 135

Query: 128 LMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS- 186
           + LRRVVPMLGVPDALVP+D    QI+L+PGET+A+WVS++ P  Q PGLYEGEI +++ 
Sbjct: 136 VTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGLYEGEIFLSAV 195

Query: 187 KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPL 246
           +A++E   + L K E+++L+ ELRNC+D  E  +     E+V+R  S +TTLRR++  P 
Sbjct: 196 RAESESRGESLTKSERYQLYKELRNCIDITETRDYSSSEEMVQRLTSASTTLRRMLALPS 255

Query: 247 FSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV 306
           F +   +NG  DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS+TVIEDRF +
Sbjct: 256 FQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGISETVIEDRFCL 315

Query: 307 RHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLA 366
            HG+  WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+ EY+SDPRLA
Sbjct: 316 EHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKAKEYYSDPRLA 375

Query: 367 AYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASEL 426
           AYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y  + ++++EL
Sbjct: 376 AYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQYDVICSISNEL 435

Query: 427 HAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKD 486
            +YA D R+LTTYYCGPS + L P+ FE+FVKVP  LRPHTQI+CTSEWVLG REDLVKD
Sbjct: 436 RSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWVLGTREDLVKD 495

Query: 487 IVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY 546
           IV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG NCY
Sbjct: 496 IVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGTNCY 555

Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           EKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 556 EKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605


>gi|357122237|ref|XP_003562822.1| PREDICTED: uncharacterized protein LOC100840095 [Brachypodium
           distachyon]
          Length = 657

 Score =  875 bits (2261), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 411/594 (69%), Positives = 491/594 (82%), Gaps = 3/594 (0%)

Query: 5   GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
           G  Q+  VPPVEGVAGGGT+YGW D   Q S      I+P ++ + DL+HVW MPSTANV
Sbjct: 12  GKTQEISVPPVEGVAGGGTSYGWVDGGLQGSSLGTSVIDPAKVHSTDLLHVWSMPSTANV 71

Query: 65  GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVV 124
             QE PRPLE +NLLAARNERES QIALRPKVSW SS  AG VQ+QC+DLCS+SGDRLVV
Sbjct: 72  SQQEAPRPLEHVNLLAARNERESFQIALRPKVSWISSGIAGPVQIQCTDLCSSSGDRLVV 131

Query: 125 GQSLMLRRVVPMLGVPDALVPLDLPVC-QISLIPGETTAVWVSIDAPYAQPPGLYEGEII 183
           GQS+ LRRVVPMLGVPDALVP+D P+C QI+L+PGET+A+WVS++ P  Q PGLYEGEI 
Sbjct: 132 GQSVTLRRVVPMLGVPDALVPID-PLCPQINLLPGETSAIWVSLNVPCGQQPGLYEGEIF 190

Query: 184 IT-SKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVI 242
           IT ++A+T+  ++ L K E+++L+ ELR CLD  E  +     E+V+R  ST+TTL+R++
Sbjct: 191 ITATRAETDSRAESLPKSERYQLYRELRTCLDITESRDCSTPEEMVQRLTSTSTTLKRML 250

Query: 243 FSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIED 302
             P F +   +NG  DMMDED ++N++VRVKLSLTVWDF LP TPSLPAV GIS+TVIED
Sbjct: 251 VLPAFQDCQENNGLGDMMDEDVMNNVAVRVKLSLTVWDFTLPLTPSLPAVFGISETVIED 310

Query: 303 RFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSD 362
           RF + HG+  WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+ EY+SD
Sbjct: 311 RFCLEHGTKGWYDALDDHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKAKEYYSD 370

Query: 363 PRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNM 422
           PRLAAYAVPY+P+LS  D A++ +R+E+++L+T+AHW KAYFYLWDEPLNME Y  +RN+
Sbjct: 371 PRLAAYAVPYAPILSCTDAARNSLRREVDILKTEAHWSKAYFYLWDEPLNMEQYEVIRNI 430

Query: 423 ASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNRED 482
           ++EL  Y PD R+LTTYY GPS + L P+ FE+F KVP  LRPHTQI+CTSEWVLG RED
Sbjct: 431 SNELRTYTPDVRILTTYYAGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEWVLGTRED 490

Query: 483 LVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 542
           LVKDI+ EL+PE GEEWWTYVC+GP+DP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG
Sbjct: 491 LVKDIIAELRPELGEEWWTYVCLGPTDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWG 550

Query: 543 ANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
            NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVASLRLERILSG+Q
Sbjct: 551 TNCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASLRLERILSGMQ 604


>gi|293331693|ref|NP_001169555.1| uncharacterized protein LOC100383434 [Zea mays]
 gi|224030081|gb|ACN34116.1| unknown [Zea mays]
          Length = 657

 Score =  872 bits (2253), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 412/593 (69%), Positives = 488/593 (82%), Gaps = 1/593 (0%)

Query: 5   GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
           G  Q+  VPPVEGVAGGGT+YGW D   + +    G I+PT++ + DL+HVW MPSTANV
Sbjct: 12  GKTQNVSVPPVEGVAGGGTSYGWVDGGLRGTNIGAGVIDPTKVHSDDLLHVWSMPSTANV 71

Query: 65  GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVV 124
             QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQ+QC+DLCS+SGDRLVV
Sbjct: 72  SQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSSSGDRLVV 131

Query: 125 GQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIII 184
           GQS+ LRRVVP+LGVPDALVP+D    QIS+ PGET AVWVS++ P  QPPGLYEGEI I
Sbjct: 132 GQSITLRRVVPILGVPDALVPIDPLSPQISIQPGETAAVWVSVNVPCGQPPGLYEGEIFI 191

Query: 185 TS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIF 243
           T+ K + +  ++ L K EK RL+ ELR+CLD   P +     E+V+R  S +T LRRV+ 
Sbjct: 192 TAVKTELDSRTESLPKSEKCRLYRELRSCLDLTGPRDYSSPEEMVQRLTSASTVLRRVLD 251

Query: 244 SPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDR 303
           +P   +   +NG  DMMDED I+N+SVR+KLSLTVWDF LP TPSLPAV GIS+TVIEDR
Sbjct: 252 NPALQDCQENNGFGDMMDEDVINNISVRLKLSLTVWDFTLPVTPSLPAVFGISETVIEDR 311

Query: 304 FGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDP 363
           F + HG++ WY ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK++EY+SDP
Sbjct: 312 FCLEHGTEGWYSALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKANEYYSDP 371

Query: 364 RLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMA 423
           RLAAYAVPY+P+LS  D AK+ +R+E+E+L++K HW KAYFYLWDEPLN+E Y  + N++
Sbjct: 372 RLAAYAVPYAPILSCTDAAKNSLRREVEILKSKPHWSKAYFYLWDEPLNVEQYDMICNIS 431

Query: 424 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 483
           +EL +YAPD R+LTTYYCGPS + L P+ FE+F KVP  LRPHTQI+CTSEWVLG REDL
Sbjct: 432 NELRSYAPDVRILTTYYCGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEWVLGTREDL 491

Query: 484 VKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGA 543
           VKDIV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG+
Sbjct: 492 VKDIVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGS 551

Query: 544 NCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 552 NCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 604


>gi|242046098|ref|XP_002460920.1| hypothetical protein SORBIDRAFT_02g037540 [Sorghum bicolor]
 gi|241924297|gb|EER97441.1| hypothetical protein SORBIDRAFT_02g037540 [Sorghum bicolor]
          Length = 657

 Score =  870 bits (2249), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 408/593 (68%), Positives = 490/593 (82%), Gaps = 1/593 (0%)

Query: 5   GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
           G  Q+  VPPVEGVAGGGT+YGW D   + +    G I+PT++ + DL+HVW MPSTANV
Sbjct: 12  GKTQNVSVPPVEGVAGGGTSYGWVDGGLRGTNLGAGVIDPTKVHSEDLLHVWSMPSTANV 71

Query: 65  GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVV 124
             QE+PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQ+QC+DLCS+SGDRLVV
Sbjct: 72  SQQEVPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSSSGDRLVV 131

Query: 125 GQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIII 184
           GQS+ LRRVVP+LGVPDALVP+D    Q++L PGET AVWVS++ P  QPPGLYEGEI I
Sbjct: 132 GQSITLRRVVPILGVPDALVPIDPLSPQVTLQPGETAAVWVSLNVPCGQPPGLYEGEIFI 191

Query: 185 TS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIF 243
           T+ K + +  ++ L K EK+RL+ ELR+CLD   P +     E+V+R  S ++ LRRV+ 
Sbjct: 192 TAVKTELDSRTESLPKSEKYRLYRELRSCLDLTGPRDYSSPEEMVQRLTSASSALRRVLD 251

Query: 244 SPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDR 303
           +P   +   +NG  DMMDED ++N+SVR+KLSLTVWDF LP TPSLPAV GIS+TVIEDR
Sbjct: 252 NPALQDCQENNGFGDMMDEDVMNNVSVRLKLSLTVWDFTLPVTPSLPAVFGISETVIEDR 311

Query: 304 FGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDP 363
           F + HG++ WY ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK++EY+SDP
Sbjct: 312 FCLEHGTEGWYSALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKANEYYSDP 371

Query: 364 RLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMA 423
           RLAAYAVPY+P+LS  D AK+ +R+E+E+L++K HW KAYFYLWDEPLN+E Y  + N++
Sbjct: 372 RLAAYAVPYAPILSCTDAAKNSLRREVEILKSKPHWSKAYFYLWDEPLNVEQYDMICNIS 431

Query: 424 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 483
           +EL +YAPD R+LTTYYCGPS + L P+ FE+FVKVP  LRPHTQI+CTSEWVLG REDL
Sbjct: 432 NELRSYAPDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWVLGTREDL 491

Query: 484 VKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGA 543
           VKDI+ EL+P+ GEEWWTYVCMGPSDP PNWH+GMRG+QHRAVMWRVWKEGGTGFLYWG 
Sbjct: 492 VKDIIAELRPDLGEEWWTYVCMGPSDPQPNWHIGMRGTQHRAVMWRVWKEGGTGFLYWGT 551

Query: 544 NCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 552 NCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 604


>gi|168042677|ref|XP_001773814.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674929|gb|EDQ61431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 651

 Score =  745 bits (1924), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/596 (61%), Positives = 458/596 (76%), Gaps = 15/596 (2%)

Query: 13  PPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRP 72
           PP+EGV GGGT YGWND     +  L   I+ +  PT+DLVHVWCMPSTA +G QE PRP
Sbjct: 4   PPIEGVGGGGTGYGWNDGSHTGTTILASEIDVSRQPTSDLVHVWCMPSTAIIGHQEPPRP 63

Query: 73  LEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRR 132
           LE ++LLAARNERES QIALRPK+SW+S    G +Q+ CSD CS SGDRL  G+ + +RR
Sbjct: 64  LERVSLLAARNERESAQIALRPKMSWTSGDMVGYLQIHCSDFCSPSGDRLNAGKEVTIRR 123

Query: 133 VVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS-KADTE 191
           VVP+LGVPDALVP+DLP  +I L+PGET A+WVS D P  QPPG+Y GEI IT+ + +TE
Sbjct: 124 VVPILGVPDALVPIDLP-SRIGLLPGETCALWVSFDVPVTQPPGVYIGEIWITAVRGETE 182

Query: 192 LSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFF 251
            +++ + + EK ++  +L+  L   E    +    + E  +S    L +V+ SPL S   
Sbjct: 183 FAAEKV-ESEKLQMKKDLQGFLAQAEAASNESAEVLTEALRSICEGLHQVLQSPLLSAGC 241

Query: 252 SDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSD 311
            D G +++ DE+  ++ SV+V+ S+TVWDF+LP TPSLPAV GIS+TVIEDR+ ++HGS 
Sbjct: 242 EDFGKMEI-DEEFQASPSVQVQFSITVWDFVLPITPSLPAVFGISETVIEDRYNLKHGSK 300

Query: 312 EWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVP 371
           EW+++L+ HF WLLQYR+SP+FCRWG++MRVLTYTCP+PA HPKS++Y+SDPRLAAYAVP
Sbjct: 301 EWFKSLNMHFDWLLQYRLSPYFCRWGDNMRVLTYTCPYPATHPKSEDYYSDPRLAAYAVP 360

Query: 372 YSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDE----------PLNMEHYSSVRN 421
           Y PVLSS+D AKD V+ E+E+L+TK HWKKAYFYLWDE          P+  E Y  +R+
Sbjct: 361 YIPVLSSSDTAKDVVKSELEILKTKPHWKKAYFYLWDEARISTRSQHGPVGFEQYEVIRS 420

Query: 422 MASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNRE 481
           +A E+   APDAR+LTTYYCGPSD  +    FESF+KVP FLRPHTQI+CTSEWVLG RE
Sbjct: 421 IAEEIRNTAPDARILTTYYCGPSDPSMKLDGFESFLKVPTFLRPHTQIFCTSEWVLGGRE 480

Query: 482 DLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYW 541
           DLVK I  E+Q +  EEWWTYVCMGP + HPNWHLGMRG+QHRAV+WRVWKEGGTGFLYW
Sbjct: 481 DLVKQITDEIQFDRSEEWWTYVCMGPGELHPNWHLGMRGTQHRAVIWRVWKEGGTGFLYW 540

Query: 542 GANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVASLRLERILSGLQ 596
           G NCYEKA+ PSAEIRFRRGLPPGDGVLFYPGEVF+  +  PVAS+RLER+LSG+Q
Sbjct: 541 GVNCYEKASSPSAEIRFRRGLPPGDGVLFYPGEVFNIGATLPVASVRLERLLSGMQ 596


>gi|302755140|ref|XP_002960994.1| hypothetical protein SELMODRAFT_74245 [Selaginella moellendorffii]
 gi|300171933|gb|EFJ38533.1| hypothetical protein SELMODRAFT_74245 [Selaginella moellendorffii]
          Length = 633

 Score =  720 bits (1859), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/591 (60%), Positives = 451/591 (76%), Gaps = 20/591 (3%)

Query: 9   DSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQE 68
           D   PPVEG++GGGT YGW D     S P  GS++  + P +DL  VWCMPSTA VG QE
Sbjct: 5   DLGAPPVEGLSGGGTGYGWGDCGIAVSRP--GSVDIAKNPASDLFSVWCMPSTATVGHQE 62

Query: 69  MPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQ-S 127
            PR L+ +NLL ARNERES QIALRPK+SW+     G VQV C D  SASGDR  +   S
Sbjct: 63  PPRALDQLNLLIARNERESAQIALRPKISWACGGAVGHVQVHCRDFVSASGDRWAIELLS 122

Query: 128 LMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS- 186
           + LRRVVP+LGVPDALVP+ +P CQ+SL+PGET+A+W+S+  P +Q PG+YEGE+  ++ 
Sbjct: 123 VSLRRVVPILGVPDALVPVSMPTCQVSLLPGETSALWLSVHVPSSQTPGVYEGEMTFSAV 182

Query: 187 KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPL 246
           KAD E S   + +G+K    +ELR  ++NV           +E  +     L+ ++  P 
Sbjct: 183 KADAEFS---VDEGDK----LELRKMVENVAAKMDDTRQNPMELLEEVRQDLQHLLDHPA 235

Query: 247 FSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV 306
            +     NG +++ +E    +LS+++K+S+TVWDF+LP TP+LPAV G+S+TVIEDRF V
Sbjct: 236 LAH----NGKMEIDEE----SLSLKLKISITVWDFVLPVTPTLPAVFGVSETVIEDRFNV 287

Query: 307 RHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLA 366
            HGS  WY ALD+H++WLLQ+RISP+FCRWG++MR+L YTCPWPADH K++EY+SDPRLA
Sbjct: 288 EHGSSGWYNALDRHYQWLLQFRISPYFCRWGDNMRILAYTCPWPADHVKAEEYYSDPRLA 347

Query: 367 AYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASEL 426
           AYAVPY+PVLS+++  KD V +EIE+L TK HW+K+YFYLWDEPL+ + Y  +R M+ E+
Sbjct: 348 AYAVPYAPVLSNSNAVKDLVTREIEILSTKEHWRKSYFYLWDEPLSSDQYDFIRTMSEEI 407

Query: 427 HAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKD 486
            + AP++R+LTTYY GPSD    P  FE+F+KVP FLRPHTQI+CTSEWVLG REDLVK+
Sbjct: 408 RSIAPNSRILTTYYSGPSDVQYPPGSFEAFIKVPSFLRPHTQIFCTSEWVLGGREDLVKE 467

Query: 487 IVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY 546
           IV ELQP+  EEWWTYVCMGPSDPHPNWHLGMRG+QHR V+WR WKEGG+GFLYWG NCY
Sbjct: 468 IVAELQPDQREEWWTYVCMGPSDPHPNWHLGMRGTQHRGVLWRAWKEGGSGFLYWGTNCY 527

Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVASLRLERILSGLQ 596
           EK+  P+AEIRFRRGLPPGDGVLFYPGEVF+  S +PV+S+RLER+LSGLQ
Sbjct: 528 EKSLCPAAEIRFRRGLPPGDGVLFYPGEVFTPGSSEPVSSVRLERVLSGLQ 578


>gi|302767188|ref|XP_002967014.1| hypothetical protein SELMODRAFT_144564 [Selaginella moellendorffii]
 gi|300165005|gb|EFJ31613.1| hypothetical protein SELMODRAFT_144564 [Selaginella moellendorffii]
          Length = 582

 Score =  697 bits (1800), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/542 (61%), Positives = 418/542 (77%), Gaps = 18/542 (3%)

Query: 58  MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 117
           MPSTA VG QE PR L+ +NLL ARNERES QIALRPK+SW+     G VQV C D  S 
Sbjct: 1   MPSTATVGHQEPPRALDQLNLLIARNERESAQIALRPKISWACGGAVGHVQVHCRDFVSV 60

Query: 118 SGDRLVVGQ-SLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPG 176
           SGDR  +   S+ LRRVVP+LGVPDALVP+ +P CQ+SL+PGET+A+W+S+  P +Q PG
Sbjct: 61  SGDRWAIELLSVSLRRVVPILGVPDALVPVSMPTCQVSLLPGETSALWLSVHVPSSQTPG 120

Query: 177 LYEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTA 235
           +YEGE+  ++ KAD E     + +GEK    +ELR  ++ V           +E  +   
Sbjct: 121 VYEGEMTFSAAKADAEF---FVDEGEK----LELRKMVETVAAKMDDTRQNPMELLEEVR 173

Query: 236 TTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGI 295
             LR ++  P  +     NG +++ +E    +LS+++K+S+TVWDF+LP TP+LPAV G+
Sbjct: 174 QDLRHLLDHPALAH----NGKMEIDEE----SLSLKLKISITVWDFVLPVTPTLPAVFGV 225

Query: 296 SDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPK 355
           S+TVIEDRF V HGS +WY ALD+H++WLLQ+RISP+FCRWG++MR+L YTCPWPADH K
Sbjct: 226 SETVIEDRFNVEHGSSDWYNALDRHYQWLLQFRISPYFCRWGDNMRILAYTCPWPADHVK 285

Query: 356 SDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEH 415
           ++EY+SDPRLAAYAVPY+PVLS+++  KD V +EIE+L TK HW+K+YFYLWDEPL+ + 
Sbjct: 286 AEEYYSDPRLAAYAVPYAPVLSNSNAVKDLVTREIEILSTKEHWRKSYFYLWDEPLSSDQ 345

Query: 416 YSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEW 475
           Y  +R M+ E+ + AP+ R+LTTYY GPSD    P  FE+F+KVP FLRPHTQI+CTSEW
Sbjct: 346 YDFIRTMSEEIRSIAPNTRILTTYYSGPSDVQYPPGSFEAFIKVPSFLRPHTQIFCTSEW 405

Query: 476 VLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG 535
           VLG REDLVK+IV ELQP+  EEWWTYVCMGPSDPHPNWHLGMRG+Q R V+WRVWKEGG
Sbjct: 406 VLGGREDLVKEIVAELQPDQREEWWTYVCMGPSDPHPNWHLGMRGTQQRGVLWRVWKEGG 465

Query: 536 TGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVASLRLERILSG 594
           +GFLYWG NCYEK+  P+AEIRFRRGLPPGDGVLFYPGEVF+  S +PV+S+RLER+LSG
Sbjct: 466 SGFLYWGTNCYEKSLCPAAEIRFRRGLPPGDGVLFYPGEVFTPGSSEPVSSVRLERVLSG 525

Query: 595 LQ 596
           LQ
Sbjct: 526 LQ 527


>gi|414590657|tpg|DAA41228.1| TPA: hypothetical protein ZEAMMB73_917393 [Zea mays]
          Length = 721

 Score =  686 bits (1770), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/682 (54%), Positives = 444/682 (65%), Gaps = 115/682 (16%)

Query: 5   GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
           G  Q+  VPPVEGVAGGGT+YGW D   + +    G I+PT++ + DL+HVW MPSTANV
Sbjct: 12  GKTQNVSVPPVEGVAGGGTSYGWVDGGLRGTNIGAGVIDPTKVHSDDLLHVWSMPSTANV 71

Query: 65  GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVV 124
             QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQ+QC+DLCS+SGDR   
Sbjct: 72  SQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSSSGDR--- 128

Query: 125 GQSLMLRRVVPML-----GVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYE 179
            +       +PM+     GVPDALVP+D    QIS+ PGET AVWVS++ P  QPPGLYE
Sbjct: 129 -EDQHSHSSIPMVINVVPGVPDALVPIDPLSPQISIQPGETAAVWVSVNVPCGQPPGLYE 187

Query: 180 GEIIITS-KADTELSS-------------------------QCLGKGEKHRLFMELRNCL 213
           GEI IT+ K + E+ S                         + L K EK RL+ ELR+CL
Sbjct: 188 GEIFITAVKTELEILSNLVTLALISGLYFFADLISGSSSRTESLPKSEKCRLYRELRSCL 247

Query: 214 DNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVK 273
           D   P +     E+V+R  S +T LRRV+ +P   +   +NG  DMMDED I+N+SVR+K
Sbjct: 248 DLTGPRDYSSPEEMVQRLTSASTVLRRVLDNPALQDCQENNGFGDMMDEDVINNISVRLK 307

Query: 274 LSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFF 333
           LSLTVWDF LP TPSLPAV G+S                W     Q+   L    IS   
Sbjct: 308 LSLTVWDFTLPVTPSLPAVFGVS----------------W-----QYSFLLCSISISVLC 346

Query: 334 CRWG---ESMRVLTYTCP---------------------WPADHPKSDEYFSDPRLAAYA 369
             +G     +R++    P                        DHPK++EY+SDPRLAAYA
Sbjct: 347 NCYGTGCNGLRLMGLLGPAELAATEGSVGIAAALAKPAAGTPDHPKANEYYSDPRLAAYA 406

Query: 370 VPYSPVLS------------SNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYS 417
           VPY+P+LS            S D AK+ +R+E+E+L++K HW KAYFYLWDEPLN+E Y 
Sbjct: 407 VPYAPILSCLLLYLIWLLVNSTDAAKNSLRREVEILKSKPHWSKAYFYLWDEPLNVEQYD 466

Query: 418 SVRNMASELHAYAPDARVLTTYYCG-----------------------PSDAPLGPTPFE 454
            + N+++EL +YAPD R+LTTYYCG                       PS + L P+ FE
Sbjct: 467 MICNISNELRSYAPDVRILTTYYCGATCADLEHPVGVPGCPLSSRAAGPSGSELAPSTFE 526

Query: 455 SFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNW 514
           +F KVP  LRPHTQI+CTSEWVLG REDLVKDIV EL+P+ GEEWWTYVCMGPSDP PNW
Sbjct: 527 AFAKVPNVLRPHTQIFCTSEWVLGTREDLVKDIVAELRPDLGEEWWTYVCMGPSDPQPNW 586

Query: 515 HLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGE 574
           HLGMRG+QHRAVMWRVWKEGGTGFLYWG+NCYEKA +PSAEI FRRGLPPGDGVLFYPGE
Sbjct: 587 HLGMRGTQHRAVMWRVWKEGGTGFLYWGSNCYEKAMIPSAEICFRRGLPPGDGVLFYPGE 646

Query: 575 VFSSSRQPVASLRLERILSGLQ 596
           VFSSS +PVAS RLERILSG+Q
Sbjct: 647 VFSSSHEPVASTRLERILSGMQ 668


>gi|388508256|gb|AFK42194.1| unknown [Medicago truncatula]
          Length = 388

 Score =  631 bits (1628), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 287/337 (85%), Positives = 318/337 (94%), Gaps = 1/337 (0%)

Query: 260 MDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQ 319
           M+EDAISNLS+R+KL+LTVW+F+LP TPSLPAV GISDTVIEDRFGV+HG+ EWYEALDQ
Sbjct: 1   MEEDAISNLSLRLKLNLTVWEFVLPETPSLPAVFGISDTVIEDRFGVKHGTAEWYEALDQ 60

Query: 320 HFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSN 379
           HFKWLLQYRISP+FC+W + MRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPY  V+S N
Sbjct: 61  HFKWLLQYRISPYFCKWADGMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYKQVVSGN 120

Query: 380 DGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTY 439
           D AKDY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SVRNMAS++HAYAPDAR+LTTY
Sbjct: 121 DAAKDYLQKQVEILRTKNHWRKAYFYLWDEPLNLEQYDSVRNMASDIHAYAPDARILTTY 180

Query: 440 YCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEW 499
           YCGP+DAPL PTPFE+FVKVP FLRPH QIYCTSEWVLGNREDLVKDI+ ELQPENGEEW
Sbjct: 181 YCGPNDAPLAPTPFEAFVKVPSFLRPHNQIYCTSEWVLGNREDLVKDIIAELQPENGEEW 240

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR 559
           WTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFLYWGANCYEKATV SAEI+FR
Sbjct: 241 WTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGANCYEKATVASAEIKFR 300

Query: 560 RGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
            GLPPGDGVL+YPGEVFS++ QPVASLRLER+LSGLQ
Sbjct: 301 HGLPPGDGVLYYPGEVFSTN-QPVASLRLERLLSGLQ 336


>gi|326494652|dbj|BAJ94445.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 408

 Score =  516 bits (1330), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 240/378 (63%), Positives = 301/378 (79%), Gaps = 1/378 (0%)

Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
           RLVVGQS+ LRRVVP+LGVPDALVP+D    QI+L+PGETTAVW+S++ P  Q PGLYEG
Sbjct: 5   RLVVGQSITLRRVVPILGVPDALVPIDPSSPQINLLPGETTAVWISLNVPCGQQPGLYEG 64

Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
           EI IT+ +AD++  +  L K E+++L+  L+ CLD  E  +     E++ R  ST+TTLR
Sbjct: 65  EIFITAVRADSDSRADSLLKSERYQLYKGLKTCLDITESRDHLSSEEMILRLSSTSTTLR 124

Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
           R++  P F ++  +NG  DMMDED ++N++VRVKLSLTVWDF LP TPSLPAV GIS+TV
Sbjct: 125 RMLVLPAFQDYHENNGLGDMMDEDVLNNVAVRVKLSLTVWDFTLPLTPSLPAVFGISETV 184

Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
           IEDRF + HG+  WY+ALD HF WLLQYRISPFFCRWG+SMR+L YTCPWP DHPK++EY
Sbjct: 185 IEDRFCLEHGTKGWYDALDHHFGWLLQYRISPFFCRWGDSMRILAYTCPWPTDHPKANEY 244

Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
           +SDPRLAAYAVPY+P+LS  D AK+ +R+E+E+L+T+ HW KAYFYLWDEPLNME Y  +
Sbjct: 245 YSDPRLAAYAVPYAPILSCTDAAKNSLRREVEILKTEPHWSKAYFYLWDEPLNMEQYEVI 304

Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
            N+++EL  Y PD R+LTTYY GPS + L P+ FE+F KVP  LRPHTQI+CTSEWVLG 
Sbjct: 305 CNISNELRTYTPDVRILTTYYAGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEWVLGT 364

Query: 480 REDLVKDIVTELQPENGE 497
           REDLVKDI+ EL+P+ GE
Sbjct: 365 REDLVKDIIAELRPDLGE 382


>gi|404484502|ref|ZP_11019706.1| hypothetical protein HMPREF9448_00112 [Barnesiella intestinihominis
           YIT 11860]
 gi|404339507|gb|EJZ65938.1| hypothetical protein HMPREF9448_00112 [Barnesiella intestinihominis
           YIT 11860]
          Length = 503

 Score =  225 bits (573), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 132/325 (40%), Positives = 174/325 (53%), Gaps = 30/325 (9%)

Query: 271 RVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKW---LLQY 327
           ++K+ L V+D  LP+TPSLPA  GI +  + D       S    + L    +W    L Y
Sbjct: 147 KIKIDLQVYDTALPSTPSLPAAFGIIEKNLID-------STSKEQTLQNKLEWAELCLDY 199

Query: 328 RISPFFCRW-GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYV 386
           R++P+F  W   SM+    + PW  +  ++  + SD R   +AVPY   LS N+      
Sbjct: 200 RMNPYFSTWLANSMKHEASSSPWKWNDKRTVPFLSDKRFNRFAVPYHS-LSHNELDSLLQ 258

Query: 387 R-KEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 445
           R K+ +LL       K+YFYLWDEP  M+ Y  +   + E+H   P+A+VLTT+YCGP D
Sbjct: 259 RLKQTDLL------DKSYFYLWDEPAYMKEYHLIGQYSQEIHKLMPEAKVLTTFYCGPKD 312

Query: 446 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 505
                  F  F       R  TQI+  S W L   E       + L+    EEWWTYVCM
Sbjct: 313 GKYKDRLFSVF----DLWRGDTQIFSMSAWALQANEANADTCRSLLR--GNEEWWTYVCM 366

Query: 506 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPG 565
           GP +  PN  L M G QHRAV+WR WKE  TGFLYW  N Y      S  + FR+ LP G
Sbjct: 367 GPGEEQPNLLLTMDGYQHRAVLWRSWKERTTGFLYWAVNAY----AESDTLAFRKDLPEG 422

Query: 566 DGVLFYPGEVFSSSRQPVASLRLER 590
           DGVL YPG+ F+S+  PV S+R+ER
Sbjct: 423 DGVLIYPGQYFNST-SPVVSIRMER 446


>gi|404484501|ref|ZP_11019705.1| hypothetical protein HMPREF9448_00111 [Barnesiella intestinihominis
           YIT 11860]
 gi|404339506|gb|EJZ65937.1| hypothetical protein HMPREF9448_00111 [Barnesiella intestinihominis
           YIT 11860]
          Length = 513

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 113/330 (34%), Positives = 172/330 (52%), Gaps = 32/330 (9%)

Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 331
           V +S+ V +  LP TPS+ +V GI+    ++        ++  E        LL+YRISP
Sbjct: 160 VAISINVVNASLPETPSIASVFGINP---QNFIFTGLSEEQKIEKRKAASDLLLEYRISP 216

Query: 332 FFCRW-GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEI 390
           +F  W   +M+   ++ P+  +  ++ EY +D R +  A+P S  LS +         E+
Sbjct: 217 YFSTWLSGTMKTECFSSPYAWNDDRTWEYLADKRFSRIALP-SHGLSDD---------EL 266

Query: 391 ELLRTKAH----WKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDA 446
           E++  KA       KA+FY+WDEP     Y  ++ ++  +H YAP+A+VLTT+YCGP+D 
Sbjct: 267 EMMLNKARETGLLNKAFFYVWDEPTKTNEYEQIKTLSDRIHRYAPEAKVLTTFYCGPTDG 326

Query: 447 PLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMG 506
                 F  F      L   T IYCT  W L + E+  +    +L+  +G+EWW+YVCM 
Sbjct: 327 EHKDDLFAVF----DILNGATSIYCTGVWALQDNENRSEQCKAKLK--SGQEWWSYVCMS 380

Query: 507 PSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 566
            +   P          +RA MWR +KE  +GFLYW  N +  +  P   +R R  LP GD
Sbjct: 381 NT---PGLASNSTAIGNRATMWRNYKEQNSGFLYWVVNGF-ASVYP---LRPRPELPEGD 433

Query: 567 GVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           G+L YPGE F +++    S+RLER   G +
Sbjct: 434 GILIYPGESFGTNK-ICTSVRLERWRDGAE 462


>gi|319640384|ref|ZP_07995108.1| hypothetical protein HMPREF9011_00705 [Bacteroides sp. 3_1_40A]
 gi|345517443|ref|ZP_08796919.1| hypothetical protein BSFG_04467 [Bacteroides sp. 4_3_47FAA]
 gi|254838009|gb|EET18318.1| hypothetical protein BSFG_04467 [Bacteroides sp. 4_3_47FAA]
 gi|317387987|gb|EFV68842.1| hypothetical protein HMPREF9011_00705 [Bacteroides sp. 3_1_40A]
          Length = 514

 Score =  168 bits (425), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 105/331 (31%), Positives = 160/331 (48%), Gaps = 31/331 (9%)

Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 331
           ++L   V    +P   S+P  +G+ +  + +    +    E    +D    ++L YR++P
Sbjct: 144 IQLDYNVHHTTIPLKSSIPITVGVENRCMTECLNDKEADKERQRWVD----FVLSYRMTP 199

Query: 332 FFC------RWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDY 385
            F       RW           PW  +  +S    +D R + Y +P+   LS N+ A   
Sbjct: 200 VFGTQITPERWQYEHSF----SPWAWNDKRSIRLLNDRRYSCYMLPFF-TLSENELASLL 254

Query: 386 VRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 445
                  ++ K   K++ FY+WDEP  ME Y  ++   + +  YA DAR+LTT++CGP +
Sbjct: 255 CN-----IQKKGKLKESLFYIWDEPAYMEDYEQIKRKVNIIRKYASDARILTTFFCGPRN 309

Query: 446 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 505
            P     +  F     +L+ H  +   S       E+ V+ I  ++ PE G +WW+YVC 
Sbjct: 310 GPRKGDLYAVF----DYLKHHIHVATISLAPCKGNEEEVQHIRYKV-PE-GIDWWSYVCW 363

Query: 506 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPG 565
            P    PN+ L M+G Q RA+MWR WK G  GFLYW  N Y K   P   I     +P G
Sbjct: 364 QPGGNEPNFLLQMKGIQQRAIMWRTWKNGSQGFLYWNCNIYHKRN-PFTYI---TDMPHG 419

Query: 566 DGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           DG+L YPG++    + P+AS RLER   G +
Sbjct: 420 DGILIYPGDIL-GCKGPIASARLERWRDGAE 449


>gi|294777879|ref|ZP_06743323.1| hypothetical protein CUU_2186 [Bacteroides vulgatus PC510]
 gi|294448333|gb|EFG16889.1| hypothetical protein CUU_2186 [Bacteroides vulgatus PC510]
          Length = 318

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 90/249 (36%), Positives = 130/249 (52%), Gaps = 17/249 (6%)

Query: 348 PWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLW 407
           PW  +  +S    +D R + Y +P+   LS N+ A          ++ K   K++ FY+W
Sbjct: 22  PWAWNDKRSIRLLNDRRYSCYMLPFF-TLSENELASLLCN-----IQKKGKLKESLFYIW 75

Query: 408 DEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHT 467
           DEP  ME Y  ++   + +  YA DAR+LTT++CGP + P     +  F     +L+ H 
Sbjct: 76  DEPAYMEDYEQIKRKVNIIRKYASDARILTTFFCGPRNGPRKGDLYAVF----DYLKHHI 131

Query: 468 QIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVM 527
            +   S       E+ V+ I  ++ PE G +WW+YVC  P    PN+ L M+G Q RA+M
Sbjct: 132 HVATISLAPCKGNEEEVQHIRYKV-PE-GIDWWSYVCWQPGGNEPNFLLQMKGIQQRAIM 189

Query: 528 WRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLR 587
           WR WK G  GFLYW  N Y K   P   I     +P GDG+L YPG++    + P+AS R
Sbjct: 190 WRTWKNGSQGFLYWNCNIYHKRN-PFTYI---TDMPHGDGILIYPGDIL-GCKGPIASAR 244

Query: 588 LERILSGLQ 596
           LER   G +
Sbjct: 245 LERWRDGAE 253


>gi|413951106|gb|AFW83755.1| hypothetical protein ZEAMMB73_317062 [Zea mays]
          Length = 1594

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 73/121 (60%), Positives = 89/121 (73%), Gaps = 1/121 (0%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           + N G  Q+  VP VEGVA G T+YGW D   + +    G I+PT + + +L+HVW MPS
Sbjct: 372 LGNGGKTQNVSVPTVEGVARG-TSYGWVDGGLRGTNLGAGVIDPTNVHSDNLLHVWSMPS 430

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           TANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG V +QC+DLCS+SGD
Sbjct: 431 TANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSSSGD 490

Query: 121 R 121
           R
Sbjct: 491 R 491


>gi|413953324|gb|AFW85973.1| putative DUF1692 domain containing protein [Zea mays]
          Length = 1070

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/121 (60%), Positives = 89/121 (73%), Gaps = 1/121 (0%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           + N G  Q+  VP VEGVA G T+YGW D   + +    G I+PT + + +L+HVW MPS
Sbjct: 372 LGNGGKTQNVSVPTVEGVARG-TSYGWVDGGLRGTNLGAGVIDPTNVHSDNLLHVWSMPS 430

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           TANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG V +QC+DLCS+SGD
Sbjct: 431 TANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSSSGD 490

Query: 121 R 121
           R
Sbjct: 491 R 491


>gi|413949740|gb|AFW82389.1| putative DUF1692 domain containing protein [Zea mays]
          Length = 1061

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/117 (61%), Positives = 87/117 (74%), Gaps = 1/117 (0%)

Query: 5   GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
           G  Q+  VP VEGVA G T+YGW D   + +    G I+PT + + +L+HVW MPSTANV
Sbjct: 362 GKTQNVSVPTVEGVARG-TSYGWVDGGLRGTNLGAGVIDPTNVHSDNLLHVWSMPSTANV 420

Query: 65  GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDR 121
             QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG V +QC+DLCS+SGDR
Sbjct: 421 SQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSSSGDR 477


>gi|297822901|ref|XP_002879333.1| hypothetical protein ARALYDRAFT_902189 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325172|gb|EFH55592.1| hypothetical protein ARALYDRAFT_902189 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 113

 Score =  111 bits (278), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 71/174 (40%), Positives = 85/174 (48%), Gaps = 67/174 (38%)

Query: 265 ISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL 324
           +SNL+V +KL LTVW+FI+  T SL AVI +SDTVIEDRF V HGS+EWY+ L  HFKWL
Sbjct: 7   VSNLAVSIKLRLTVWEFIILVTLSLSAVICVSDTVIEDRFDVEHGSEEWYKKLGLHFKWL 66

Query: 325 LQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKD 384
           L +RI+                                           P  SSN+    
Sbjct: 67  LHHRIN-------------------------------------------PYFSSNNN--- 80

Query: 385 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTT 438
                              + LW +PLNMEH+ SV  MASE  AYA DARVLTT
Sbjct: 81  -------------------YNLW-QPLNMEHFDSVSKMASENFAYA-DARVLTT 113


>gi|354583721|ref|ZP_09002619.1| hypothetical protein PaelaDRAFT_3720 [Paenibacillus lactis 154]
 gi|353197601|gb|EHB63082.1| hypothetical protein PaelaDRAFT_3720 [Paenibacillus lactis 154]
          Length = 786

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/339 (25%), Positives = 138/339 (40%), Gaps = 51/339 (15%)

Query: 270 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 329
           VR+ + LTVWDF L          G+    +++  G   G + W + +++++   +++R+
Sbjct: 175 VRIPIELTVWDFELTDESHAKTNFGVWGGPVQEAHGNVVGEEAW-KYIEKYYYASVEHRL 233

Query: 330 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKE 389
           +P +    +S  + +Y    P       +Y +DPR++AY +PY     + DG  D  R +
Sbjct: 234 TPGYLPIPDS-DINSYVERAP-------KYVNDPRISAYRLPY---YRTADGQPDIQRNK 282

Query: 390 --IELLRTKAHWKKAYFYL--WDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 445
             ++ LR      KAY+Y+   DEP   + Y+ V+ +   L   APD   L T    P D
Sbjct: 283 QLVDRLREAGLLSKAYYYVSEIDEP-TRDKYARVKQINDALEQAAPDVPHLVT--IQPVD 339

Query: 446 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 505
             +G                         WV  + E   +    E Q      WW Y  +
Sbjct: 340 ELVGDVDI---------------------WV-ADIEKFDEAFAKERQAAGDSVWW-YTYV 376

Query: 506 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR----- 560
            P  P P++HL       R + W     G  G LYW    ++K      +   R      
Sbjct: 377 KPKHPFPSYHLDDDLVGTRLLTWMQRDHGVEGALYWATTQFQKYDAAQKKYVSRDVWTDP 436

Query: 561 -GLP--PGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
              P   GDG LFYPG        P+ ++RLE +   ++
Sbjct: 437 LAFPGANGDGYLFYPGTEVGVD-GPIGTIRLEVLRESME 474



 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 61/149 (40%), Gaps = 23/149 (15%)

Query: 50  ADLVHVWCMPSTANV-GPQEMPRPLEP-INLLAARNERESVQIALRPKVSWSSSSTAGVV 107
            DL  VW   +T  V   Q  P      I + AARNE ES Q+ ++      ++     +
Sbjct: 29  GDLFDVWVPTNTEKVMRDQAFPGETNSSIRIGAARNEYESGQVIVK------ANQPLRKL 82

Query: 108 QVQCSDLCSASGDRLVVGQSLMLRR------------VVPMLGVPDALVPLDLPVCQISL 155
           QV  SDL    G   +  + + L +              P    PDAL+PL+    Q+ +
Sbjct: 83  QVSMSDLKLTDGSAKIGREHIQLFKQHYIEVKTSTTPAYPKGWYPDALIPLN---QQLEV 139

Query: 156 IPGETTAVWVSIDAPYAQPPGLYEGEIII 184
             G    +W  +  P  Q PG Y GE+ +
Sbjct: 140 AEGHNQGIWFKVYVPKGQHPGTYTGEMTL 168


>gi|403382311|ref|ZP_10924368.1| hypothetical protein PJC66_21061 [Paenibacillus sp. JC66]
          Length = 796

 Score = 75.9 bits (185), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 86/343 (25%), Positives = 134/343 (39%), Gaps = 54/343 (15%)

Query: 270 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 329
           VR+ + LTVWDF L          GI    I++  G   G + W E +++++   +++R+
Sbjct: 177 VRIPVELTVWDFELTDENHSKTAFGIWGGPIQEAHGNVQGMEAW-EYIEKYYWASVEHRL 235

Query: 330 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVR-K 388
           +P +      + +      +  +H  +  + +DPR++AY +PY        G  D  R K
Sbjct: 236 TPGY------LPIPDTDIDYYVEH--APRFINDPRVSAYRLPY---YRDAQGEPDIERIK 284

Query: 389 EI-ELLRTKAHWKKAYFYL--WDEPL----NMEHYSSVRNMASELHAYAPDARVLTTYYC 441
           E+ + LR +   +K YFY+   DEP+       +Y  V+ +   L   APD   L T   
Sbjct: 285 ELADKLRDRGMLEKGYFYISEIDEPVPHPNAANNYDRVKVINDALKQAAPDVPHLVTI-- 342

Query: 442 GPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWT 501
                     P E  +       P    Y               D   E Q E    WW 
Sbjct: 343 ---------QPLEELLGDVDIWSPEIDKYDY-------------DFARERQAEGEPVWW- 379

Query: 502 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR- 560
           Y  + P  P P++H        R + W     G  G LYW    ++K      +   R  
Sbjct: 380 YTSVFPKHPFPSYHTDDDLVGARLLTWMQHDYGVEGTLYWATTQFQKYDSAQRKYVSRDV 439

Query: 561 -----GLP--PGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                  P   GDG LFYPG        PV ++RLE +   ++
Sbjct: 440 WTDPLAFPGANGDGYLFYPGTEIGID-GPVGTIRLEVLRESME 481



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 64/148 (43%), Gaps = 24/148 (16%)

Query: 52  LVHVWCMPSTANVGPQEMPRPLEP---INLLAARNERESVQIALRPKVSWSSSSTAGVVQ 108
           L   W   ++  V   E P P +    + L AARNE ES Q+ +R     + +     +Q
Sbjct: 32  LFTAWVASNSQKVMRDE-PMPADSARTMQLAAARNEYESGQVIVR-----AGNHPLRKLQ 85

Query: 109 VQCSDLCSASGDRLVVGQSLMLRR------------VVPMLGVPDALVPLDLPVCQISLI 156
           V  SDL   +G   +  + + L +              P    PDAL+PL     ++ + 
Sbjct: 86  VSISDLKQENGAAKIHRRDIELFQQHYIEVTTSTTPAYPQGWYPDALIPLK---GKLEVG 142

Query: 157 PGETTAVWVSIDAPYAQPPGLYEGEIII 184
            G    +WV +  P  QP G+Y+GEI +
Sbjct: 143 AGHNQGIWVKVYVPKGQPAGVYKGEITL 170


>gi|374849806|dbj|BAL52811.1| hypothetical protein HGMM_F03C06C16 [uncultured prokaryote]
          Length = 994

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 85/303 (28%), Positives = 129/303 (42%), Gaps = 61/303 (20%)

Query: 270 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEAL-DQHFKWLLQYR 328
            +V L LTV+DF LP TP+L +  GI    IE    V+   D+   AL D++ +   ++R
Sbjct: 573 AQVPLMLTVYDFDLPRTPTLRSGFGIDARRIEQYHRVQSEQDK--RALWDRYMRNFREHR 630

Query: 329 ISPF-----------FCRWGESMR-VLTYTCPWPADHPKSDEY-FSDPRLAAYAVP---- 371
           ++P+           F   G + R VL +T    A     DE+ F+   L  + +P    
Sbjct: 631 LAPYNFYAYDHYEVRFEGEGANKRVVLDFTRFDRAAQRYLDEFGFNAFVLPIHGLPSGRH 690

Query: 372 --YSPVL--SSNDGA-------KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVR 420
             YSP +     +G         DY+R+    LR +   KKAY Y +DEP   + Y  V+
Sbjct: 691 PNYSPGVFGGFREGTPEYERLWSDYLRQLTTHLRERGWLKKAYVYWFDEPEEAD-YPFVK 749

Query: 421 NMASELHAYAPD-ARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHT-QIYCTSEWVLG 478
            +   L   APD  R+LT     P +  +G    + +  +  F+ P   Q  C +     
Sbjct: 750 RVNERLKQVAPDLTRMLTEQ---PEEPLIGAV--DLWCPLTAFVSPEAIQARCQA----- 799

Query: 479 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGF 538
                            GEE W YVC GP  P+    +   G++ R  +W+ W+ G  G 
Sbjct: 800 -----------------GEEIWWYVCTGPRAPYATLFIDHPGTEMRVWLWQTWQYGVQGI 842

Query: 539 LYW 541
           L W
Sbjct: 843 LIW 845



 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 56/131 (42%), Gaps = 26/131 (19%)

Query: 67  QEMPRP---LEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLV 123
           +E P P   +  + L AAR E E VQI LRP+          + QV+ SDL    G   +
Sbjct: 445 RERPLPETTMHTVTLSAARGEYEPVQIVLRPQ------RNTTLRQVEISDLT--QGKHRL 496

Query: 124 VGQSLMLRRVVPM--------LG----VPDALVPLDLPVCQISLIPGETTAVWVSIDAPY 171
             + + LR V  +        LG     PD L PL  P   + L       +W+++  PY
Sbjct: 497 PAKHITLREVAYVRVAHPTDWLGEPGDYPDPLPPLKTP---LRLQAERNQPLWLTVYVPY 553

Query: 172 AQPPGLYEGEI 182
             P G Y G I
Sbjct: 554 GTPAGKYTGTI 564


>gi|392373328|ref|YP_003205161.1| hypothetical protein DAMO_0214 [Candidatus Methylomirabilis
           oxyfera]
 gi|258591021|emb|CBE67316.1| conserved exported protein of unknown function [Candidatus
           Methylomirabilis oxyfera]
          Length = 676

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 86/333 (25%), Positives = 125/333 (37%), Gaps = 34/333 (10%)

Query: 269 SVRVKLSLTVWDFILPATPSLPAVIG-ISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQY 327
           S+ + +SLTVW+F LP TP+L    G          FG    +D     +D+    LL++
Sbjct: 308 SIPIPISLTVWNFSLPTTPALRTNFGHFRSQQFAAAFGTSRYTDIHNTLMDKFDHELLRH 367

Query: 328 RISPFFCRWGE-SMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPY-------SPVLSSN 379
           R+SP      E S    T T    ++      +F    L +Y +P         P  +  
Sbjct: 368 RLSPARPSGTEPSYNAATGTID-SSNVQARMAHFISLGLTSYDLPLFDDWPWADPFGADR 426

Query: 380 DGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTY 439
           D A+ Y+   ++ L        AY    DEP     Y +VR+ A+  H   P A++L T 
Sbjct: 427 DKAQRYLSGILDWLGANDWLTLAYHDGIDEPEEASGYQAVRDEATNWHGLDPRAKMLITE 486

Query: 440 YCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEW 499
              P D   G       +  P F R     +     V                   GE+ 
Sbjct: 487 QTRPWDPTWGTLYGSVDIWTPYFSRFDPVTWAERRAV-------------------GEQS 527

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVP---SAEI 556
           W Y   G  +  P   L     + R   W  ++ G TG L W    +++ T P    A  
Sbjct: 528 WMYGAWG-DNGTPGDLLDRPIYEIRVPAWIGFQYGITGLLKWNTVYWDQVTDPWTNPATY 586

Query: 557 RFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 589
                +  GDG  FYPG        P+ASLRL+
Sbjct: 587 TLSGDIFNGDGAFFYPGTKV-GYEGPIASLRLK 618



 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 59/161 (36%), Gaps = 25/161 (15%)

Query: 44  PTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSST 103
           P   PTA  +  W   S A + P +         + AARNE E  Q+ ++      S + 
Sbjct: 148 PISTPTAAQITAWVTDSLARIQPTDPAGISTEATIKAARNEYEGFQVIVKAP----SDTA 203

Query: 104 AGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGV-------------PDALVPLDLPV 150
              V    SDL   +G  ++   ++ L R   +L               PD L+P   P 
Sbjct: 204 LSNVTATASDLTGPTG--VIASSNITLYREAYILVTTSSPASPYPTGWWPDPLIPFKHPE 261

Query: 151 CQISL------IPGETTAVWVSIDAPYAQPPGLYEGEIIIT 185
              +L        G    ++V +  P   P G Y G I ++
Sbjct: 262 TGANLGQPFTVDAGRNVPIYVEVYVPAGTPAGTYTGGIQVS 302


>gi|297791959|ref|XP_002863864.1| hypothetical protein ARALYDRAFT_917687 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309699|gb|EFH40123.1| hypothetical protein ARALYDRAFT_917687 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 460

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 28/51 (54%), Positives = 33/51 (64%), Gaps = 3/51 (5%)

Query: 299 VIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPW 349
           + E RF V HG +E Y+  D HFKWLLQY ISP+FC+W E   V  Y  PW
Sbjct: 14  LTESRFDVEHGIEECYKTFDLHFKWLLQYWISPYFCKWFE---VSKYVQPW 61


>gi|414591642|tpg|DAA42213.1| TPA: hypothetical protein ZEAMMB73_799052 [Zea mays]
          Length = 583

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 12/58 (20%)

Query: 351 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 396
           ADHPK++EY+SDPRLAAY VPY+P+LS            S D AK  +R+E+E +  K
Sbjct: 337 ADHPKANEYYSDPRLAAYVVPYAPILSCLLLYLIWLLVNSTDAAKSSLRREVEGVSKK 394


>gi|153004484|ref|YP_001378809.1| hypothetical protein Anae109_1621 [Anaeromyxobacter sp. Fw109-5]
 gi|152028057|gb|ABS25825.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
          Length = 561

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 84/351 (23%), Positives = 134/351 (38%), Gaps = 53/351 (15%)

Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQYRIS 330
           V + LTVWDF LP+T +L +  G++   +    G+       +  L   +  L L +R+S
Sbjct: 180 VPVELTVWDFELPSTATLRSAFGLAWGALPSGHGISSSDLAAFATLRARYGQLALDHRVS 239

Query: 331 ------------PFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSS 378
                         F R+   +   T     P     + EY           P + V + 
Sbjct: 240 LSHHDDGMWNDLEHFDRYYGPLMDGTAATRLPGARLTAVEYLG---------PLADVANL 290

Query: 379 NDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEP-LNMEHYSSVRNMASELHAYAPDARVL 436
              A+ Y        R++  W +  F Y  DEP      +  +   AS      P+ R L
Sbjct: 291 ARWAQRY--------RSRPGWFERLFQYTCDEPPYQGCGWGDIALRASAAKKADPEFRTL 342

Query: 437 TTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG 496
            T     ++A       +  V V  F+   +  Y          +    D     +P+N 
Sbjct: 343 VTTTIQEAEANGATGLLDLVVPVVNFIDDKSGGYA-------GDQRPKYDAFLAAEPQN- 394

Query: 497 EEWWTYVCM----GPSDPH----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEK 548
           E W    CM    G S  +    P++ +     ++RA+ W  +K   TG LYW       
Sbjct: 395 EVWLYQSCMSHGCGGSSAYGTGWPSYMVDASAVRNRAMQWLAFKYRATGELYWDTTYAYL 454

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
           +  P A +    G   GDG LFYPG   ++  ++  PVAS+RL+ I  G++
Sbjct: 455 SGDPWASVWEFDG--NGDGTLFYPGTPAKIGGTTHVPVASIRLKMIREGME 503


>gi|291514616|emb|CBK63826.1| hypothetical protein AL1_13720 [Alistipes shahii WAL 8301]
          Length = 573

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/239 (23%), Positives = 95/239 (39%), Gaps = 40/239 (16%)

Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSS 418
           + DPR+  Y   Y P L      ++++R +     +   W   Y  ++ DEPL+ E+ +S
Sbjct: 321 YDDPRVQQYIAAYFPAL------QEHLRSKTINDGSGRSWLDIYTQHIADEPLD-ENKTS 373

Query: 419 VRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLG 478
              +A ++   APD R++  Y     D  L        + VP+      +IY T      
Sbjct: 374 WEGLAHQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFAWEIYRTM----- 422

Query: 479 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGF 538
                            G   W Y CM P     N ++ +   + R + W  +K G  G+
Sbjct: 423 ---------------PAGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYGSPGY 467

Query: 539 LYWGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           L+WG N +     P  ++       P GD  + YPG      R+   S+RL  +  G++
Sbjct: 468 LHWGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLTAMRDGIR 521


>gi|414591657|tpg|DAA42228.1| TPA: hypothetical protein ZEAMMB73_522235 [Zea mays]
          Length = 446

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/58 (48%), Positives = 37/58 (63%), Gaps = 12/58 (20%)

Query: 351 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 396
           ADHPK++EY+SDPRLA Y VPY+P+LS            S D AK  +R+E+E +  K
Sbjct: 200 ADHPKANEYYSDPRLATYVVPYAPILSCLLLYLIWLLVNSTDAAKSSLRREVEGVSKK 257


>gi|224024189|ref|ZP_03642555.1| hypothetical protein BACCOPRO_00912 [Bacteroides coprophilus DSM
           18228]
 gi|224017411|gb|EEF75423.1| hypothetical protein BACCOPRO_00912 [Bacteroides coprophilus DSM
           18228]
          Length = 566

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 51/207 (24%), Positives = 92/207 (44%), Gaps = 29/207 (14%)

Query: 410 PLNMEHYSSV-RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRP--- 465
           P+N E  S+  R     L ++     +   Y    +D P+  + F+S+V++ +F++    
Sbjct: 316 PINSEKASNFYRQFLPSLMSHLQKRGLKDIYVQHIADEPI-ESNFKSYVEIARFVKDICP 374

Query: 466 --------HTQIYCTSEWVLGNREDLVKDIVTELQPEN--GEEWWTYVCMGPSDPHPNWH 515
                   HT     +  +   + +  KD  +  Q     G+E W Y C+ P     N  
Sbjct: 375 DLRIIEACHTHNLENTVDIWVPQLNFYKDGYSFYQERQKAGDEVWFYTCLAPQGNFANRF 434

Query: 516 LGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAE---IRFRRG--LPPGDGVLF 570
           L +   + R + W  ++ G TG+L+WG N +++ + P  E   +    G  LP GD  + 
Sbjct: 435 LELPSIKTRLIHWLNFRYGATGYLHWGFNFWKENSDPYGETTTMNLESGNTLPGGDSWIV 494

Query: 571 YP--GEVFSSSRQPVASLRLERILSGL 595
           YP  G+++S       S+RLE +  G+
Sbjct: 495 YPKNGKLYS-------SIRLEAMRDGI 514


>gi|390946349|ref|YP_006410109.1| hypothetical protein Alfi_1070 [Alistipes finegoldii DSM 17242]
 gi|390422918|gb|AFL77424.1| hypothetical protein Alfi_1070 [Alistipes finegoldii DSM 17242]
          Length = 573

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 93/237 (39%), Gaps = 40/237 (16%)

Query: 362 DPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSSVR 420
           DPR+  Y   Y P L      ++++R  +    +   W   Y  ++ DEPLN E+ +S  
Sbjct: 323 DPRVQRYIAAYFPAL------QEHLRSRMIDDGSGRSWLDIYTQHIADEPLN-ENKTSWE 375

Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
            +A ++   APD R++  Y     D  L        + VP+      +IY T        
Sbjct: 376 GLARQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFVWEIYRTMP------ 423

Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
                          G   W Y CM P     N ++ +   + R + W  +K    G+L+
Sbjct: 424 --------------AGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYDSPGYLH 469

Query: 541 WGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           WG N +     P  ++       P GD  + YPG      R+   S+RL  +  G++
Sbjct: 470 WGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLAAMRDGIR 521


>gi|218779590|ref|YP_002430908.1| hypothetical protein Dalk_1743 [Desulfatibacillum alkenivorans
           AK-01]
 gi|218760974|gb|ACL03440.1| hypothetical protein Dalk_1743 [Desulfatibacillum alkenivorans
           AK-01]
          Length = 844

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/232 (24%), Positives = 94/232 (40%), Gaps = 33/232 (14%)

Query: 383 KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCG 442
           KDY+    E LR   +  +AY+Y+ +EP + E Y +V   A+ L + APD +++ +    
Sbjct: 353 KDYMHATQEYLRGLGYLDRAYYYMANEPQDGEDYKAVAWYANLLKSAAPDLKLMVS---- 408

Query: 443 PSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY 502
                    P E       +      I+     VL N +    D+  + +  + EE W Y
Sbjct: 409 -------EEPKEEIYNNETYSGAKIDIWLP---VLNNYD---PDVSHDREKNHQEETWVY 455

Query: 503 VCMGPSDPHPN-WHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA--TVPSAEIRFR 559
              G   P+ N   L   G + +   W +WK    G  Y+  N + K   T P  +    
Sbjct: 456 FLHGTRPPYYNPITLDHPGIESKFTGWLLWKYRIRGIAYYSMNGWSKNPWTSPMTDGH-- 513

Query: 560 RGLPPGDGVLFYPG-------EVFSSSRQPVASLRLERILSGLQVRWICYYL 604
                GD  +FYP        +  +++ + V S+RLE +   L+     Y L
Sbjct: 514 ----NGDTFMFYPPSEDNSAIDYAANNHRLVPSIRLELMRDSLEDYEYLYLL 561


>gi|410456409|ref|ZP_11310270.1| hypothetical protein BABA_21191 [Bacillus bataviensis LMG 21833]
 gi|409928078|gb|EKN65201.1| hypothetical protein BABA_21191 [Bacillus bataviensis LMG 21833]
          Length = 548

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/214 (25%), Positives = 82/214 (38%), Gaps = 43/214 (20%)

Query: 390 IELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLG 449
           I+ ++      + +F++ DEP +++   S +N +  L  Y  D  V+        DA   
Sbjct: 315 IDFIKQNGLEHRVFFHVSDEP-HLDQVESYQNASEILQTYVKDFPVI--------DALSD 365

Query: 450 PTPFES-FVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG-EEWWTYVCMGP 507
            T +E   VK P     H Q +                       +NG E  WTY C   
Sbjct: 366 YTFYEKGLVKTPIPSNDHIQPFL----------------------DNGVENLWTYHCCVQ 403

Query: 508 SDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGL 562
                N    M   ++R +  +++K    GFL+WG N +     +K   P          
Sbjct: 404 YKKVANRFFNMPSFRNRVLGMQLYKFNIAGFLHWGYNFWYSQYSKKPIDPFRNTDAHYAF 463

Query: 563 PPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           P GD  L YPGE       P+ S+RLE +   LQ
Sbjct: 464 PSGDAFLVYPGE-----EGPIESIRLEVLHEALQ 492


>gi|334364721|ref|ZP_08513701.1| hypothetical protein HMPREF9720_1375 [Alistipes sp. HGB5]
 gi|313159097|gb|EFR58472.1| hypothetical protein HMPREF9720_1375 [Alistipes sp. HGB5]
          Length = 514

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 93/237 (39%), Gaps = 40/237 (16%)

Query: 362 DPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSSVR 420
           DPR+  Y   Y P L      ++++R  +    +   W   Y  ++ DEPLN E+ +S  
Sbjct: 264 DPRVQRYIAAYFPAL------QEHLRSRMIDDGSGRSWLDIYTQHIADEPLN-ENKTSWE 316

Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
            +A ++   APD R++  Y     D  L        + VP+      +IY T        
Sbjct: 317 GLARQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFVWEIYRTMP------ 364

Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
                          G   W Y CM P     N ++ +   + R + W  +K    G+L+
Sbjct: 365 --------------AGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYDSPGYLH 410

Query: 541 WGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           WG N +     P  ++       P GD  + YPG      R+   S+RL  +  G++
Sbjct: 411 WGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLAAMRDGIR 462


>gi|414879440|tpg|DAA56571.1| TPA: hypothetical protein ZEAMMB73_699847 [Zea mays]
          Length = 659

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 37/58 (63%), Gaps = 12/58 (20%)

Query: 351 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 396
           ADHPK++EY+S+PRLAAY  PY+P+LS            S + AK  +R+E+E +  K
Sbjct: 173 ADHPKANEYYSNPRLAAYVAPYAPILSCLLLYLIWLLVNSTNAAKSSLRREVEGVSKK 230


>gi|444913149|ref|ZP_21233303.1| hypothetical protein D187_05240 [Cystobacter fuscus DSM 2262]
 gi|444716152|gb|ELW57007.1| hypothetical protein D187_05240 [Cystobacter fuscus DSM 2262]
          Length = 546

 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 100/374 (26%), Positives = 136/374 (36%), Gaps = 60/374 (16%)

Query: 257 IDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEA 316
           +DM    A S     V  +  V  F+LPAT SLP   GIS   I    G++  S E    
Sbjct: 132 LDMEGAPAAS-----VPFTAEVQPFVLPATSSLPNSFGISLYSIAKGHGLKPESPEAQTL 186

Query: 317 LDQHFKWLLQYRIS-------PFFCRWGESMRVLTYTCPWPADHPKSDEYF--SDPRLAA 367
           L  +   LL +R+S       P   R+ E   VL +        P  D     S  R   
Sbjct: 187 LRDYVTALLAHRVSAHGMSMEPPPVRFEEGRAVLDFRAYDAEVGPFLDGSALPSGARFTT 246

Query: 368 YAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELH 427
             V  S    +++    Y R   E  + K    + +FY  DEP   E    VR  A  + 
Sbjct: 247 VDVRDSKAARTDEQKAAYYRAFAEHAKDKGWPAQLFFYAKDEP-KPEDVPLVRAQALRVR 305

Query: 428 AYAPDARVLTT-----YYCGPSD--APL--------GPTPFESFVKVPKF---LRPHTQI 469
               D  VL T        G +D  AP         GP    + V +      L P+ ++
Sbjct: 306 TAGKDVPVLVTSPLDEALRGSADILAPTLNCFFPRPGPQTCRNVVPLQTLRGKLAPNVKV 365

Query: 470 Y----CTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRA 525
           +    C S    G      KD  TE   +    W +Y+   P+              +RA
Sbjct: 366 WWYQSCNSHGCTGGP---AKDSATE---KAYSGWASYMVDHPA------------PLNRA 407

Query: 526 VMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQP 582
           +    +  G  G LY+          P  E+ F  G   GDG  FYPG       S  QP
Sbjct: 408 MGPLAFLSGVDGELYFDTVFAYNTKDPWKEV-FEFG-GNGDGTFFYPGTPAHTGLSRHQP 465

Query: 583 VASLRLERILSGLQ 596
           V SLRL+ +  GL+
Sbjct: 466 VVSLRLKHLRDGLE 479


>gi|430747974|ref|YP_007207103.1| hypothetical protein Sinac_7369 [Singulisphaera acidiphila DSM
           18658]
 gi|430019694|gb|AGA31408.1| hypothetical protein Sinac_7369 [Singulisphaera acidiphila DSM
           18658]
          Length = 577

 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 67/285 (23%), Positives = 104/285 (36%), Gaps = 61/285 (21%)

Query: 326 QYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFS--DPRLAAYAVPYSPVLSSNDGAK 383
           Q+  S  +  WG    +  Y         + D+Y    +P ++ ++  Y   L      K
Sbjct: 295 QFEWSHLWIYWGVENPMRIYK-------KEGDQYVMLWEPTISGFSDTYVNFL------K 341

Query: 384 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGP 443
            ++ +  + L  +   + +YF+L DEP   +H  + R     L   AP  +V+       
Sbjct: 342 QFLPEFKKFLTEEKMLETSYFHLSDEPGPGQHVQNYRRARQILREIAPWMKVMDAL---- 397

Query: 444 SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYV 503
           SD   G    E    +P  L    Q Y  ++                  P      W Y 
Sbjct: 398 SDIEYGK---EGLTDIPIPLVSAAQAYIDAK-----------------IPH-----WVYY 432

Query: 504 CMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR--- 560
           C GP+ P  N  +    S+ R   W  ++    GFL+WG N ++K     A   F     
Sbjct: 433 CCGPTGPWLNRFMDTPLSKIRMSGWLFYRHEAKGFLHWGFNYWDKMEREEAGDPFHDGSN 492

Query: 561 ----GLPPGDGVLFYPG----------EVFSSSRQPVASLRLERI 591
               G+P GD  + YPG          EVF+ S Q  A L+   I
Sbjct: 493 ASYPGIPFGDPFVIYPGPDGPIDSIRWEVFAESLQDYAILQTAGI 537


>gi|157370810|ref|YP_001478799.1| hypothetical protein Spro_2570 [Serratia proteamaculans 568]
 gi|157322574|gb|ABV41671.1| conserved hypothetical protein [Serratia proteamaculans 568]
          Length = 556

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 45/102 (44%), Gaps = 9/102 (8%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
           W Y C        N       +++R +  ++++   TGFL+WG N Y      +   P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGIQLYRYNITGFLHWGFNFYNSGHSREQLNPYA 459

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
               R   P GD  + YPGE  +    PV SLRL  +  GLQ
Sbjct: 460 VTDCRNAFPSGDAFVVYPGEDLT----PVESLRLRVLHQGLQ 497


>gi|442320014|ref|YP_007360035.1| hypothetical protein MYSTI_03035 [Myxococcus stipitatus DSM 14675]
 gi|441487656|gb|AGC44351.1| hypothetical protein MYSTI_03035 [Myxococcus stipitatus DSM 14675]
          Length = 561

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 96/369 (26%), Positives = 133/369 (36%), Gaps = 63/369 (17%)

Query: 266 SNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLL 325
           S   V V  ++ V  F+LPAT SLP   GIS   I    G+   S E    L  + + LL
Sbjct: 147 SREHVSVPFTVEVQPFVLPATASLPTSFGISQLSIARGHGLNAESSEAKALLRAYARMLL 206

Query: 326 QYRI-------SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSS 378
           ++R+       SP   R+ +   V+     W     +   +     L + A   +  L  
Sbjct: 207 EHRVSAHGMSMSPPPVRFEDGRAVVD----WREYDAEMAPFLDGSLLPSGARFTTTDLRD 262

Query: 379 NDGAKD------YVRKEIELLRTKAHWKKAYFYLWDE------PLNMEHYSSVRN----- 421
           N  A        Y R  +E  R K    + +FY  DE      PL +     VR      
Sbjct: 263 NKKAHTEAERVAYYRAFVEHFRKKDWPTQLFFYAKDEPKPQDVPLVLTQSRRVREAGGAR 322

Query: 422 ------MASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIY----C 471
                 M  EL A A D    T     P   P       +  ++ K LR  TQ++    C
Sbjct: 323 VLITTPMEGELPA-AADILAPTLNCFFPRPGPATCRAIHTVTELRKQLRSGTQVWWYQSC 381

Query: 472 TSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVW 531
            S    G          TE   E     W            ++ +      +RA+    +
Sbjct: 382 NSHGCNGG-------ASTEAAQERAYSGWA-----------SYMVDHSAMLNRAMGPLAF 423

Query: 532 KEGGTGFLYWGA-NCYEKATVPSAEIRFRRGLPPGDGVLFYPG--EVFSSSR-QPVASLR 587
             G  G LY+     Y     P  ++ F  G   GDG  FYPG  E    SR QPV SLR
Sbjct: 424 VNGVDGELYFDTVFAYNTKKDPWKDL-FEFG-GNGDGTFFYPGTPERLGDSRHQPVPSLR 481

Query: 588 LERILSGLQ 596
           L+ +  GL+
Sbjct: 482 LKHLRDGLE 490


>gi|86160501|ref|YP_467286.1| hypothetical protein Adeh_4085 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85777012|gb|ABC83849.1| hypothetical protein Adeh_4085 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 539

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 91/348 (26%), Positives = 138/348 (39%), Gaps = 47/348 (13%)

Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQYRIS 330
           V ++LTVW F LP+T SL +  G+S   +    GV   S +   AL   +  L L +RI+
Sbjct: 109 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGV---SGDALSALRARYGQLALDHRIT 165

Query: 331 PFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEI 390
               R  +  R L +   +           + P   A +V Y   L  + G   +     
Sbjct: 166 --LSRIDDGNRDLAHFASFFGPLFDGASAATLPGAQATSVEY---LGGSSGYASWA---- 216

Query: 391 ELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGP 450
              +++    + + Y  DEP     +S +   A+   A +P  R L T     +DA  G 
Sbjct: 217 SFFQSRGWDDRLFQYTCDEPPLQCAWSDIPARAASARAVSPALRTLVTTTIQQADAA-GV 275

Query: 451 TP-FESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY------- 502
           TP  +  V V  FL          E   G +     D      P    E WTY       
Sbjct: 276 TPAIDVLVPVVNFLDDR-----AGERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHG 327

Query: 503 ----VCMG-PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV 551
               V MG PSD        P++ +     ++RA+ W  +    TG LY+          
Sbjct: 328 CGGTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDP 387

Query: 552 PSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
            + +  F      GDG LFYPG   +V  +++ PVAS+RL+ I  G++
Sbjct: 388 WANQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGME 432


>gi|383454162|ref|YP_005368151.1| hypothetical protein COCOR_02162 [Corallococcus coralloides DSM
           2259]
 gi|380728520|gb|AFE04522.1| hypothetical protein COCOR_02162 [Corallococcus coralloides DSM
           2259]
          Length = 631

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 86/373 (23%), Positives = 133/373 (35%), Gaps = 77/373 (20%)

Query: 263 DAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDT-VIEDRFGVRHGSDEWYEALDQHF 321
           +A      +V   LTV D ++P+T SL +   +  T V     G    +    + L   +
Sbjct: 169 EAEGGFQRQVTARLTVVDAVMPSTSSLASAFPLLPTQVCRAHLGRNDCTPAELQPLLVRY 228

Query: 322 KWL-LQYRI---------------SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRL 365
           + L L++R+               S F+  WG S   L  T P            S  R+
Sbjct: 229 QQLSLEHRLTQPRLFLSGSGAQAWSDFYATWGPS---LDGTAP---------SRLSGARM 276

Query: 366 AA--YAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMA 423
            +  Y  P++       G  D+     E    +    +A+  + DEP +   +  V+   
Sbjct: 277 TSVEYTGPFT-----AGGLADFAGHMSE----RGWLARAHAKIGDEPFDATTFQQVQATG 327

Query: 424 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 483
           + +   AP  R + T     +   L     E  V +   L  H +         G   D 
Sbjct: 328 TLVRQAAPGLRTMLTV----NSMQLKLNGLEPLVDIAVPLVNHLE---------GTTPDF 374

Query: 484 VKD---IVTELQPENGEEWWTYVCMG--------------PSDPHPNWHLGMRGSQHRAV 526
           V D            G E W Y                  P    P++ +    ++ RA+
Sbjct: 375 VGDQSPTYAGFLSRPGTELWMYQSCASHGCAPGSLMPENQPGSGWPSYMVDRSSAKARAM 434

Query: 527 MWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGE---VFSSSRQPV 583
            W  ++ G  G LY+ A     A +P+A          GDG LFYPG    +   +  PV
Sbjct: 435 EWLAFRFGAKGELYYEAG----AMLPTAWTDQYHFGGNGDGTLFYPGTPAVIGGQTDVPV 490

Query: 584 ASLRLERILSGLQ 596
           ASLRL+ I  GLQ
Sbjct: 491 ASLRLKLIRQGLQ 503


>gi|197124590|ref|YP_002136541.1| hypothetical protein AnaeK_4209 [Anaeromyxobacter sp. K]
 gi|196174439|gb|ACG75412.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
          Length = 618

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 84/346 (24%), Positives = 134/346 (38%), Gaps = 43/346 (12%)

Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 331
           V ++LTVW F LP+T SL +  G+S   +    GV    D       ++ +  L +R++ 
Sbjct: 186 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGVS--GDALSTLRGRYGQLALDHRVT- 242

Query: 332 FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIE 391
              R  +  R L +   +           + P   A +V Y   L  + G   +      
Sbjct: 243 -LSRIDDGNRDLAHFASFFGPLFDGGAATALPGAQATSVEY---LGGSSGYASWA----S 294

Query: 392 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 451
             +++    + + Y  DEP     +S +   A+   A +P  R L T     +DA    +
Sbjct: 295 FFQSRGWDDRLFQYTCDEPPLQCAWSDIPARAASARAVSPALRTLVTTTVQQADAAGVTS 354

Query: 452 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY--------- 502
             +  V V  FL          E   G +     D      P    E WTY         
Sbjct: 355 SIDVLVPVVNFLDDRA-----GERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHGCG 406

Query: 503 --VCMG-PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPS 553
             V MG PSD        P++ +     ++RA+ W  +    TG LY+           +
Sbjct: 407 GTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDPWN 466

Query: 554 AEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
            +  F      GDG LFYPG   +V  +++ PVAS+RL+ I  G++
Sbjct: 467 NQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGME 509



 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 59/142 (41%), Gaps = 17/142 (11%)

Query: 55  VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDL 114
           VW   ST  + P    R      L AARNE E+ Q+ +    +   ++TAG+       +
Sbjct: 43  VWTATSTEKIRPAATARAPGGAALTAARNEFEAFQVVITGAATGVRATTAGLTGPASLPV 102

Query: 115 CSASGDRLVVGQSLMLRRVVPMLGV----PDALVP--LDLPVCQISLIP-----GETTAV 163
                 RL     + L     + G     PDALVP   +L   + +  P     GE+ AV
Sbjct: 103 ------RLYREAIINLSNPSALDGGTGPWPDALVPDVDELAGERRNAFPFTVPAGESRAV 156

Query: 164 WVSIDAPYAQPPGLYEGEIIIT 185
           WV +  P   P G Y G + +T
Sbjct: 157 WVEVHVPPDAPAGEYAGSVQVT 178


>gi|153004261|ref|YP_001378586.1| hypothetical protein Anae109_1395 [Anaeromyxobacter sp. Fw109-5]
 gi|152027834|gb|ABS25602.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
          Length = 604

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 66/151 (43%), Gaps = 20/151 (13%)

Query: 50  ADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALR-PKVSWSSSST----A 104
           A    VW   +T  + P   PR      + AARNE E+ Q+ +  P    S+ +T    A
Sbjct: 19  AAAADVWVAGATEKIRPDAQPRQTTEARIAAARNEFEAFQVVVTGPARGVSARATSLEGA 78

Query: 105 GVV------QVQCSDLCSASGDRLVVGQ--SLMLRRVVPMLGVPDALVPLDLPVCQISLI 156
           GVV      +V   D+ +AS      G+    ++  V  ++G      P D+P       
Sbjct: 79  GVVDDVKLYRVDAIDVHTASALDGATGRWPDALVPDVDDVVGEKRNAFPFDVPA------ 132

Query: 157 PGETTAVWVSIDAPYAQPPGLYEGEIIITSK 187
            GE+ A+WV +  P    PG + GE+ I S+
Sbjct: 133 -GESRAIWVEVRVPPDAKPGTHFGEVTIASE 162



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 82/358 (22%), Positives = 127/358 (35%), Gaps = 58/358 (16%)

Query: 270 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 329
            ++ + LTVWDF LP+T SL    G+S  +I    GV   +D                 I
Sbjct: 166 AKIPVMLTVWDFELPSTASLKTHFGLSWGLIPSGHGVSPETDA---------------SI 210

Query: 330 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSD-----PRLAAYAVPYSPVLS----SND 380
              +   G   RV          H   D +  +        A   +P + + +     N 
Sbjct: 211 RARYAALGLDHRVSLSGVADDGYHGDFDHFERNYAPLVDGTAKTRLPGAKLTTVKYVGNQ 270

Query: 381 GAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYY 440
            + D  R+  E  R K  + + + Y  DEP     +  +      +H   P+ R L T  
Sbjct: 271 TSVDEHRRWAEHFRAKGWFDRLFDYTCDEPPLTCSWDELPQRTKAVHEADPEFRTLVTTQ 330

Query: 441 CGPSDAPLGPTPFESFVKVPKFL--RPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEE 498
              ++        +  V V  ++  RP           LG         +   + E  E 
Sbjct: 331 IWDAEEHGVADEIDIMVPVVNWMDDRPGAG-------SLGQNRAKYDGFLA--KSEKKEL 381

Query: 499 WWTYVCM-----------GPSD------PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYW 541
           W    CM            PS+        P++ +     ++RA+ W  + E  TG LYW
Sbjct: 382 WLYQSCMSHGCGGTVNIGNPSEWDRYNTGWPSYMIDSSAVRNRAMEWISFLEDATGELYW 441

Query: 542 GANCYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
                      S +  F      GDG LFYPG    +  S+  PVAS+RL+ I  G++
Sbjct: 442 ETAFAFTHDAWSNQWDFSGN---GDGTLFYPGTPARIGGSTDIPVASIRLKMIREGME 496


>gi|414879441|tpg|DAA56572.1| TPA: hypothetical protein ZEAMMB73_699847 [Zea mays]
          Length = 57

 Score = 50.1 bits (118), Expect = 0.003,   Method: Composition-based stats.
 Identities = 19/27 (70%), Positives = 25/27 (92%)

Query: 351 ADHPKSDEYFSDPRLAAYAVPYSPVLS 377
           ADHPK++EY+S+PRLAAY  PY+P+LS
Sbjct: 28  ADHPKANEYYSNPRLAAYVAPYAPILS 54


>gi|315648741|ref|ZP_07901837.1| hypothetical protein PVOR_25998 [Paenibacillus vortex V453]
 gi|315275943|gb|EFU39294.1| hypothetical protein PVOR_25998 [Paenibacillus vortex V453]
          Length = 558

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/222 (24%), Positives = 83/222 (37%), Gaps = 49/222 (22%)

Query: 385 YVRKEIELLRTKAHWKKAYFYLWDEPL--NMEHYSSVRNMASELHAYAPDARVLTTYYCG 442
           ++ K ++ +R     K+ YF+L DEP   ++E Y +   +   +    P    L+ Y   
Sbjct: 314 FLNKLVQFVRWNGLEKRVYFHLSDEPKLDDLETYRAASELVRPILKDFPIIDALSDYEFY 373

Query: 443 PSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP--ENG-EEW 499
            S     P P  +                                  ++QP  ++G E  
Sbjct: 374 KSGLIEHPIPASN----------------------------------DIQPFLDHGLEGL 399

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
           WTY C        N    M  S++R +  +++     GFL+WG N +     + A  P  
Sbjct: 400 WTYYCCAQYKQVSNRFFHMPSSRNRVLGIQLYTLKLRGFLHWGYNFWYAQFSKYAINPYQ 459

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                 G P GD  L YPGE       PV S+RLE +   LQ
Sbjct: 460 VTDAGGGFPAGDAFLVYPGE-----EGPVESIRLEVLTEALQ 496


>gi|220919313|ref|YP_002494617.1| hypothetical protein A2cp1_4234 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219957167|gb|ACL67551.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 618

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 84/346 (24%), Positives = 133/346 (38%), Gaps = 43/346 (12%)

Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 331
           V ++LTVW F LP+T SL +  G+S   +    GV    D       ++ +  L +R++ 
Sbjct: 186 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGV--SGDALSTLRGRYGQLALDHRVT- 242

Query: 332 FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIE 391
              R  +  R L +   +           S P   A +V Y   L  + G   +      
Sbjct: 243 -LSRIDDGNRDLAHFASFFGPLFDGGAATSLPGAQATSVEY---LGGSSGYASWA----S 294

Query: 392 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 451
             +++    + + Y  DEP     +  +   A+   A +P  R L T     +DA    +
Sbjct: 295 FFQSRGWDDRLFQYTCDEPPLQCAWGDIPARAASARAVSPALRTLVTTTVQQADAAGVTS 354

Query: 452 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY--------- 502
             +  V V  FL          E   G +     D      P    E WTY         
Sbjct: 355 SIDVLVPVVNFLDDR-----AGERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHGCG 406

Query: 503 --VCMG-PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPS 553
             V MG PSD        P++ +     ++RA+ W  +    TG LY+           +
Sbjct: 407 GTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDPWN 466

Query: 554 AEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
            +  F      GDG LFYPG   +V  +++ PVAS+RL+ I  G++
Sbjct: 467 NQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGME 509


>gi|115376571|ref|ZP_01463803.1| hypothetical protein STIAU_2875 [Stigmatella aurantiaca DW4/3-1]
 gi|115366439|gb|EAU65442.1| hypothetical protein STIAU_2875 [Stigmatella aurantiaca DW4/3-1]
          Length = 620

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 90/213 (42%), Gaps = 31/213 (14%)

Query: 401 KAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVP 460
           +AY ++ DEP     + ++R  A      AP+ R L T     +   L     E  + V 
Sbjct: 270 RAYDFVGDEPPYGISFEALRQNAELTRQVAPELRTLVTT----NSRELDKYALEDLMDVA 325

Query: 461 KFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCM------GPSDPH-- 511
             +  H     T+    G++     D ++      G E W Y  CM      G + P   
Sbjct: 326 APVVNHMD--GTAPPFQGDQRATYHDFLSL----PGRELWLYQSCMSHGCAYGTNAPENQ 379

Query: 512 -----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 566
                P++ +    ++ RA+ W  + EG +G LY+       A+  + + RF      GD
Sbjct: 380 PGAGWPSYMVDRSAAKARAMEWVTFLEGASGELYY-QTVGMLASAWTDQFRFNGN---GD 435

Query: 567 GVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
           G LFYPG    +  ++  PVAS+RL+ I  G+Q
Sbjct: 436 GTLFYPGTPAAIGGATDVPVASIRLKLIRLGVQ 468


>gi|304440737|ref|ZP_07400621.1| conserved hypothetical protein [Peptoniphilus duerdenii ATCC
           BAA-1640]
 gi|304370924|gb|EFM24546.1| conserved hypothetical protein [Peptoniphilus duerdenii ATCC
           BAA-1640]
          Length = 568

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 43/187 (22%), Positives = 74/187 (39%), Gaps = 13/187 (6%)

Query: 400 KKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLG---PTPFESF 456
           KK   + WD P   E+   +     +L A+  +  VL   Y   SD P      +  E++
Sbjct: 305 KKKKIFGWDTPSVGEYTKFLEKFIPDLVAHLKEWGVLDKTYFHISDVPREEHIKSYKEAY 364

Query: 457 VKVPKFLRPHTQIYCTSEWVLGNREDL-----VKDIVTELQPENGEEWWTYVCMGPSDPH 511
           + V    +        + +    +  +        ++ +   ++ +E WTY C+G     
Sbjct: 365 MSVNDLFKDLKTFEAVAHYDFFKKGLIELPVAASSVIHDFLDDDLDELWTYYCVGQFTEV 424

Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPPGD 566
            N  + M  +++R    +++K   +GFL+WG N Y      K   P        G P GD
Sbjct: 425 ANRFMSMPSARNRIFGIQMYKFHISGFLHWGYNFYNSVLSYKKIDPYKVTDADDGFPAGD 484

Query: 567 GVLFYPG 573
             L YPG
Sbjct: 485 AFLVYPG 491


>gi|310817408|ref|YP_003949766.1| hypothetical protein STAUR_0130 [Stigmatella aurantiaca DW4/3-1]
 gi|309390480|gb|ADO67939.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 650

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 90/213 (42%), Gaps = 31/213 (14%)

Query: 401 KAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVP 460
           +AY ++ DEP     + ++R  A      AP+ R L T     +   L     E  + V 
Sbjct: 300 RAYDFVGDEPPYGISFEALRQNAELTRQVAPELRTLVTT----NSRELDKYALEDLMDVA 355

Query: 461 KFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCM------GPSDPH-- 511
             +  H     T+    G++     D ++      G E W Y  CM      G + P   
Sbjct: 356 APVVNHMD--GTAPPFQGDQRATYHDFLSL----PGRELWLYQSCMSHGCAYGTNAPENQ 409

Query: 512 -----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 566
                P++ +    ++ RA+ W  + EG +G LY+       A+  + + RF      GD
Sbjct: 410 PGAGWPSYMVDRSAAKARAMEWVTFLEGASGELYY-QTVGMLASAWTDQFRFNGN---GD 465

Query: 567 GVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
           G LFYPG    +  ++  PVAS+RL+ I  G+Q
Sbjct: 466 GTLFYPGTPAAIGGATDVPVASIRLKLIRLGVQ 498



 Score = 38.9 bits (89), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 57/159 (35%), Gaps = 41/159 (25%)

Query: 55  VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDL 114
           VW   +   V P   PR    + L AARNE  S Q+AL          + G+  V+ + L
Sbjct: 25  VWGESAMVKVRPNLAPRARPELQLTAARNEFVSFQVALH-------GGSTGLSGVR-AKL 76

Query: 115 CSASGDRLVVGQSLMLRRVVPMLGV------------PDALV--------------PLDL 148
               G   + G  + L RV  +  V            PD LV              P D+
Sbjct: 77  NGFVGPTSISGPDVTLYRVAYLTTVRPSVPGTPVGRWPDGLVPDVDEIAGEGRRAFPFDV 136

Query: 149 PVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSK 187
           P         E  A+WV +  P   P G Y G + + S 
Sbjct: 137 PA-------NEARAIWVDVHVPMDAPAGQYRGTVEVLSS 168


>gi|325662024|ref|ZP_08150643.1| hypothetical protein HMPREF0490_01381 [Lachnospiraceae bacterium
           4_1_37FAA]
 gi|325471687|gb|EGC74906.1| hypothetical protein HMPREF0490_01381 [Lachnospiraceae bacterium
           4_1_37FAA]
          Length = 562

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 42/101 (41%), Gaps = 13/101 (12%)

Query: 496 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-------YEK 548
           GE  WTYVC GP     N  L     + R + W   K   +GFL+WG N        YE 
Sbjct: 399 GETVWTYVCCGPEGHWLNRFLDFALLKGRMLFWGCAKNRISGFLHWGLNQFPGEMNPYEG 458

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 589
            + P+         P GD  L YPGE       P   +RLE
Sbjct: 459 TSCPN-HTGIGTNFPCGDSFLIYPGE-----EGPRMGMRLE 493


>gi|295798170|emb|CAX69035.1| Putative uncharacterized protein precursor [uncultured bacterium]
          Length = 563

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 80/352 (22%), Positives = 123/352 (34%), Gaps = 46/352 (13%)

Query: 269 SVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFG--VRHGSD----EWYEALDQHFK 322
           S  V   L VWDF LP TPS+    G     ++  +   V+ G D    +W    +Q  +
Sbjct: 182 SKTVSARLKVWDFALPQTPSMQTSFGSPAGRMKSWYANHVKVGKDAPIKDWTAVEEQCAQ 241

Query: 323 WLLQYRISPFF-CRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSP-----VL 376
            L ++RI+        E  +    T     +   +   F D R    A+  S      ++
Sbjct: 242 LLAEHRINATPPDELLEPQKQGDGTWRISEEKLNALGQFID-RYHVNALDVSKNFIFGII 300

Query: 377 SSNDGAKDYVRKEIELLRTKAHWKKA-----YFYLWDEPLNMEHYSSVRNMASELHAYAP 431
              D A+D +R  ++     A          Y YL DEP + E Y  VR     +     
Sbjct: 301 KDPDAARDEIRTRLKAFEMAAKQLNRPNLLFYVYLTDEPNDPEAYDYVRKWGKAIKEANS 360

Query: 432 DARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTEL 491
             +V+ T    P     G       +  P F                    L +      
Sbjct: 361 VVKVMITEQSTPQKTEWGDLYGAVDIWCPLF-------------------PLFEQGNAAR 401

Query: 492 QPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV 551
           +   GE  W Y  +   +P P WH+      +R   W  W+    G LYWG   +   T 
Sbjct: 402 RQALGETVWAYTALCQRNPTPWWHIDYPLLNYRVPAWISWRYRIRGLLYWGGMSFWNETG 461

Query: 552 PSAEIRFRRG------LPPGDGVLFYPGEVFSSSRQPVA-SLRLERILSGLQ 596
                 +  G      +  G+G L YPG    +    +A SLRL+ +  G++
Sbjct: 462 DPWRDAWTYGHKKSMLVYNGEGTLVYPGR--KAGYDGIAPSLRLKALRDGIE 511


>gi|410098375|ref|ZP_11293353.1| hypothetical protein HMPREF1076_02531 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222249|gb|EKN15194.1| hypothetical protein HMPREF1076_02531 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 562

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 48/108 (44%), Gaps = 14/108 (12%)

Query: 495 NGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-------YE 547
            G+E W Y C+ P     N  L     + R + W  +K G TG+L+WG N        Y+
Sbjct: 411 KGDEVWFYTCLAPQGDFANRFLEQPLIKTRLIHWLNYKYGATGYLHWGFNQWFSDNDPYK 470

Query: 548 KATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGL 595
           + T  + E      LP GD  + YP      + +   S+RLE +  G+
Sbjct: 471 ETTTMNTES--GNTLPGGDSWIVYP-----DNGKLYGSIRLEAMRDGI 511


>gi|375308735|ref|ZP_09774018.1| hypothetical protein WG8_2543 [Paenibacillus sp. Aloe-11]
 gi|375079362|gb|EHS57587.1| hypothetical protein WG8_2543 [Paenibacillus sp. Aloe-11]
          Length = 570

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 69/310 (22%), Positives = 106/310 (34%), Gaps = 70/310 (22%)

Query: 286 TPSLPAVIGISDTVIE------DRFGVRHGSDEWYEALDQHFKW--------LLQYRISP 331
           TP L   IG   T I+      D  G R G D+    LD   KW        +  + ++ 
Sbjct: 233 TPPLDTFIGNERTTIQLVDVSYDVNGYRFGFDK----LD---KWVQISEAVGITHFEMAH 285

Query: 332 FFCRWGESM--RVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKE 389
            F +WG     +++      P          +DP   ++   + P L+            
Sbjct: 286 LFSQWGAKYAPKIIVEVGGVPEQRFGWHTPANDPEFRSFLAAFLPALT------------ 333

Query: 390 IELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLG 449
            E L      +++ F++ DEP           +A  LH Y    + +  Y  G       
Sbjct: 334 -ERLHQLGIAERSLFHISDEP-----------VAGNLHTYLEAKQFVAPYLEG------- 374

Query: 450 PTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSD 509
             P    +   +F R         + V G+      D +     E     W Y C G + 
Sbjct: 375 -FPIIDAISDVEFYRRG----IIDQPVAGS------DTIHNFIDEGASNLWVYYCCGQNL 423

Query: 510 PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPP 564
              N  L M  S++R +  +++K    GFL+WG N Y      K   P  +       P 
Sbjct: 424 HVSNRFLAMPSSRNRILGVQMYKYRIKGFLHWGFNFYNSQYSLKKLNPYVDTAALDTFPS 483

Query: 565 GDGVLFYPGE 574
           GD  L YP E
Sbjct: 484 GDSFLVYPSE 493


>gi|383452115|ref|YP_005366104.1| hypothetical protein COCOR_00091 [Corallococcus coralloides DSM
           2259]
 gi|380727263|gb|AFE03265.1| hypothetical protein COCOR_00091 [Corallococcus coralloides DSM
           2259]
          Length = 645

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 58/221 (26%), Positives = 87/221 (39%), Gaps = 31/221 (14%)

Query: 393 LRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTP 452
           ++ K    +AY  L DEP     ++ V      +   AP  R + T     +   L    
Sbjct: 296 MKAKGWLDRAYVQLGDEPPYGTPFAQVHATGELVRQAAPGLRTMLTT----NSRELKANG 351

Query: 453 FESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCM------ 505
            E  V     L  H  +  T     G++    +   T      G   W Y  CM      
Sbjct: 352 LEDAVDTAVPLVNH--LDGTDANFRGDQ----RGTYTRFLERPGTALWMYQSCMSHGCAY 405

Query: 506 GPSDPH-------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRF 558
           G + P        P++ L    ++ RA+ W  + +G TG LY+  +    +T  + + RF
Sbjct: 406 GTNAPENKPGAGWPSYMLDRSAAKARAMEWVTFLQGATGELYY-QSVGMLSTAWTDQYRF 464

Query: 559 RRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
                 GDG LFYPG    +   +  PVASLRL+ I  G+Q
Sbjct: 465 NGN---GDGTLFYPGTPEAIGGKTDVPVASLRLKLIRQGMQ 502


>gi|354580446|ref|ZP_08999351.1| hypothetical protein PaelaDRAFT_0452 [Paenibacillus lactis 154]
 gi|353202877|gb|EHB68326.1| hypothetical protein PaelaDRAFT_0452 [Paenibacillus lactis 154]
          Length = 554

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 45/105 (42%), Gaps = 10/105 (9%)

Query: 497 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATV 551
           E  WTY C        N    +  +++R +  +++K    GFL+WG N +      +A  
Sbjct: 396 EGLWTYYCCSQYKEVSNRFFNLPSARNRILGMQLYKYNIEGFLHWGYNFWYSQYSRRAID 455

Query: 552 PSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           P        G P GD  L YPGE       PV S+RL+     LQ
Sbjct: 456 PYRVTDADSGFPSGDAFLVYPGE-----DGPVESIRLKVFHEALQ 495


>gi|395204636|ref|ZP_10395576.1| hypothetical protein PA08_1303 [Propionibacterium humerusii P08]
 gi|328907298|gb|EGG27064.1| hypothetical protein PA08_1303 [Propionibacterium humerusii P08]
          Length = 590

 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 41/96 (42%), Gaps = 5/96 (5%)

Query: 483 LVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 542
           +  D V + + +  E  W Y C+       N  +     + RA+ W++WK G  GFL+WG
Sbjct: 390 VATDAVLDFRRDGIEPAWVYHCVAQDVGVSNRFIAQESVRTRALGWQLWKFGVKGFLHWG 449

Query: 543 ANCYEKATV-----PSAEIRFRRGLPPGDGVLFYPG 573
            N Y          P A+     G   GD  + YPG
Sbjct: 450 FNFYYGQLSVCPIDPFADTSAGGGFISGDAFIVYPG 485


>gi|422439957|ref|ZP_16516771.1| conserved hypothetical protein [Propionibacterium acnes HL037PA3]
 gi|422471082|ref|ZP_16547582.1| conserved hypothetical protein [Propionibacterium acnes HL037PA2]
 gi|422573949|ref|ZP_16649509.1| conserved hypothetical protein [Propionibacterium acnes HL044PA1]
 gi|313837143|gb|EFS74857.1| conserved hypothetical protein [Propionibacterium acnes HL037PA2]
 gi|314927836|gb|EFS91667.1| conserved hypothetical protein [Propionibacterium acnes HL044PA1]
 gi|314971914|gb|EFT16012.1| conserved hypothetical protein [Propionibacterium acnes HL037PA3]
          Length = 493

 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 41/96 (42%), Gaps = 5/96 (5%)

Query: 483 LVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 542
           +  D V + + +  E  W Y C+       N  +     + RA+ W++WK G  GFL+WG
Sbjct: 293 VATDAVLDFRRDGIEPAWVYHCVAQDVGVSNRFIAQESVRTRALGWQLWKFGVKGFLHWG 352

Query: 543 ANCYEKATV-----PSAEIRFRRGLPPGDGVLFYPG 573
            N Y          P A+     G   GD  + YPG
Sbjct: 353 FNFYYGQLSVCPIDPFADTSAGGGFISGDAFIVYPG 388


>gi|223940729|ref|ZP_03632566.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223890585|gb|EEF57109.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 359

 Score = 47.8 bits (112), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 77/193 (39%), Gaps = 20/193 (10%)

Query: 419 VRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLR---PHTQIYCTSEW 475
           ++    E H +     +L   Y   SD P G +  E++ +  + L    P  ++      
Sbjct: 120 LKQFLPEFHDFLAKENILEDSYFHLSDEP-GASHVENYKRARQVLHELAPWMKVMDALSD 178

Query: 476 VLGNRE---DLVKDIVTELQPENGEE--WWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRV 530
           +   R+   D+   +V   Q    E+   W Y C  P  P  N  +     + R   W  
Sbjct: 179 IQYGRQGLTDMPIPMVNSAQAYIDEKIPHWVYYCCAPQGPWLNRFMDTPLPKVRMAGWTF 238

Query: 531 WKEGGTGFLYWGANCYEK-----ATVPSAE--IRFRRGLPPGDGVLFYPGEVFSSSRQPV 583
           ++ G  GFL+WG N + K      T P  +  +    G+P GD  + YPG    +  QP+
Sbjct: 239 YRLGAKGFLHWGFNYWHKIEQEVVTDPLTDGCVSAWPGIPYGDPFVIYPG----ADGQPM 294

Query: 584 ASLRLERILSGLQ 596
            S+R E     LQ
Sbjct: 295 DSIRWEVFAESLQ 307


>gi|317048122|ref|YP_004115770.1| hypothetical protein Pat9b_1898 [Pantoea sp. At-9b]
 gi|316949739|gb|ADU69214.1| conserved hypothetical protein [Pantoea sp. At-9b]
          Length = 556

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 9/102 (8%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
           WTY C        N  +     ++R +  ++W     GFL+WG N Y      +A  P A
Sbjct: 402 WTYYCCAQYLDVANRFMAQPSVRNRILGVQLWLYRIEGFLHWGFNFYNSELSREAIDPFA 461

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                +  P GD  L YPG+ F+    P+ S+RL+ +L  +Q
Sbjct: 462 VTDGLQAFPAGDPFLVYPGKDFT----PLPSIRLKVLLEAMQ 499


>gi|153003376|ref|YP_001377701.1| hypothetical protein Anae109_0503 [Anaeromyxobacter sp. Fw109-5]
 gi|152026949|gb|ABS24717.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
          Length = 608

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 87/350 (24%), Positives = 133/350 (38%), Gaps = 50/350 (14%)

Query: 269 SVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQY 327
           S  V ++LTVW F LP+T SL +  G +   I    GV  G+ + + AL + +  L L +
Sbjct: 172 SATVPVTLTVWPFTLPSTASLKSAFGFTYGAIPGGHGV--GAADAFAALRERYGRLALDH 229

Query: 328 RISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVR 387
           RI+      G +      +   PA    +       R+ +Y + Y     S      Y  
Sbjct: 230 RITLSHVDDGSAAIDHAASLYGPAMDGAAPTALRGARMTSYELLYDAKSWST-----YFD 284

Query: 388 KEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAP 447
            E  L R        + Y  DEP     +S +   A+   A   + R L T     +DA 
Sbjct: 285 GEGWLDRL-------FQYTCDEPPLTCAWSDIPARAATARAA--NVRTLVTTSIQEADAQ 335

Query: 448 LGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCMG 506
                 +  V V  +L      Y       GN+     D      P    E W Y  CM 
Sbjct: 336 GVTGSIDVIVPVINYLDDREGTYA------GNQR-AKYDAFLAGSPR--RELWAYQSCMS 386

Query: 507 P-------------SDPH----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA 549
                         SD +    P++ +     ++RA+ W  ++   TG LYW        
Sbjct: 387 HGCGGTVNFGSPSWSDRYFTGWPSYMIDASAVRNRAMEWLSFRYRVTGELYWETAYAYSH 446

Query: 550 TVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
              + +  F      GDG LFYPG   ++  ++  PVAS+RL+ I  G++
Sbjct: 447 DAWTNQWDFNGN---GDGTLFYPGTPAKIGGTTHVPVASIRLKMIREGME 493


>gi|374604784|ref|ZP_09677736.1| hypothetical protein PDENDC454_17493 [Paenibacillus dendritiformis
           C454]
 gi|374389614|gb|EHQ60984.1| hypothetical protein PDENDC454_17493 [Paenibacillus dendritiformis
           C454]
          Length = 536

 Score = 47.8 bits (112), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 14/130 (10%)

Query: 475 WVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEG 534
           WV  N++  +     E   ++G+  W Y C  P   + N  L     + R + W  +  G
Sbjct: 358 WVPTNKDYELNRDAYEAYRQSGDALWFYTCWNPGGEYLNRFLDFPLLKTRYLHWGNYLYG 417

Query: 535 GTGFLYWGANCYEKATVP--------SAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASL 586
             G+L+WG N Y     P        + ++  RR +P GD  + YPG+       P+ S+
Sbjct: 418 LDGYLHWGFNYYFPDQDPMELTNPLLAPDVHDRR-VPAGDTHIVYPGD-----GGPMLSM 471

Query: 587 RLERILSGLQ 596
           RLE + +G++
Sbjct: 472 RLEAMRAGVE 481


>gi|331085877|ref|ZP_08334960.1| hypothetical protein HMPREF0987_01263 [Lachnospiraceae bacterium
           9_1_43BFAA]
 gi|330406800|gb|EGG86305.1| hypothetical protein HMPREF0987_01263 [Lachnospiraceae bacterium
           9_1_43BFAA]
          Length = 562

 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 42/101 (41%), Gaps = 13/101 (12%)

Query: 496 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-------YEK 548
           GE  WTYVC GP     N  L     + R + W   K   +GFL+WG N        YE 
Sbjct: 399 GETVWTYVCCGPEGHWLNRFLDFALLKGRMLFWGCAKNRISGFLHWGLNQFPGGMNPYEG 458

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 589
            + P+         P GD  L YPG+       P   +RLE
Sbjct: 459 TSCPN-HTGIGTNFPCGDSFLIYPGK-----EGPRMGMRLE 493


>gi|374324118|ref|YP_005077247.1| hypothetical protein HPL003_21470 [Paenibacillus terrae HPL-003]
 gi|357203127|gb|AET61024.1| hypothetical protein HPL003_21470 [Paenibacillus terrae HPL-003]
          Length = 570

 Score = 47.0 bits (110), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 38/94 (40%), Gaps = 5/94 (5%)

Query: 486 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 545
           D +     E     W Y C G +    N  L M  S++R +  +++K    GFL+WG N 
Sbjct: 400 DTIHNFIDEGASNLWVYYCCGQNLHVSNRFLAMPSSRNRILGVQMYKYRIKGFLHWGFNF 459

Query: 546 YE-----KATVPSAEIRFRRGLPPGDGVLFYPGE 574
           Y      K   P  +       P GD  L YP E
Sbjct: 460 YNSQYSLKKLNPYVDTAALDTFPSGDSFLVYPSE 493


>gi|336426499|ref|ZP_08606509.1| hypothetical protein HMPREF0994_02515 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336010934|gb|EGN40914.1| hypothetical protein HMPREF0994_02515 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 562

 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 43/102 (42%), Gaps = 10/102 (9%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV-----PSA 554
           WTY C G      N    M   ++R +  +++K    GFL WG N +          P  
Sbjct: 404 WTYYCCGQFREVSNRFFCMPSQRNRILGVQLYKYQIHGFLQWGFNFWNSMLSRYPINPYC 463

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                   P GD  L YPGE       P+AS+R E ++ GLQ
Sbjct: 464 VTDAACAFPSGDASLVYPGE-----DGPIASIRAEVLMEGLQ 500


>gi|329927960|ref|ZP_08281988.1| hypothetical protein HMPREF9412_1819 [Paenibacillus sp. HGF5]
 gi|328938179|gb|EGG34575.1| hypothetical protein HMPREF9412_1819 [Paenibacillus sp. HGF5]
          Length = 554

 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 10/102 (9%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
           WTY C        N    +  +++R +  +++K    GFL+WG N +     ++A  P  
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRIIGIQLYKFNIEGFLHWGYNFWNSQYSKRAIDPFK 458

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                 G P GD  + YPGE       P+ S+RL+     LQ
Sbjct: 459 VTDADCGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQ 495


>gi|261408450|ref|YP_003244691.1| hypothetical protein GYMC10_4664 [Paenibacillus sp. Y412MC10]
 gi|261284913|gb|ACX66884.1| conserved hypothetical protein [Paenibacillus sp. Y412MC10]
          Length = 554

 Score = 47.0 bits (110), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 10/102 (9%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
           WTY C        N    +  +++R +  +++K    GFL+WG N +     ++A  P  
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRIIGIQLYKFNIEGFLHWGYNFWNSQYSKRAIDPFK 458

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                 G P GD  + YPGE       P+ S+RL+     LQ
Sbjct: 459 VTDADCGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQ 495


>gi|365133220|ref|ZP_09342604.1| hypothetical protein HMPREF1032_00400 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363616030|gb|EHL67484.1| hypothetical protein HMPREF1032_00400 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 526

 Score = 46.2 bits (108), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 7/114 (6%)

Query: 485 KDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGAN 544
           ++  T LQ   GEE W Y C  P+ P  N  + +  +  R V+W       +GFL+WG N
Sbjct: 363 REEYTALQAA-GEEMWFYTCAFPAGPAMNRSMDLPLAVSRTVLWMGALYRLSGFLHWGFN 421

Query: 545 CYEKATVPSAEIRFRRG--LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
            Y    +  +     +G  LP GD  + YPG+      +P  S+R E   +G +
Sbjct: 422 YYIGDDIWHSACCPHKGALLPAGDAHIVYPGK----DGRPWRSMRFEAQRAGAE 471


>gi|116623984|ref|YP_826140.1| hypothetical protein Acid_4896 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116227146|gb|ABJ85855.1| hypothetical protein Acid_4896 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 543

 Score = 46.2 bits (108), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 62/165 (37%), Gaps = 34/165 (20%)

Query: 55  VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDL 114
           VW  PS   VG  +    +  ++L AAR E ES QI     V+  +S   G V +  SDL
Sbjct: 34  VWTAPSMQRVGMTDPAGSVSDVSLAAARGEYESFQI-----VANGASKGLGNVNLTVSDL 88

Query: 115 CSASGDRLVVGQSLMLRRVV---------------PMLG--VPDALVPLD-------LPV 150
               G  +  G   + R                  PM     PDAL+P         L  
Sbjct: 89  EGPDGKVIPHGNFTLYREKYMHVTSPSPNWKGSNQPMGAGWYPDALIPFTDPDTGKPLSG 148

Query: 151 CQISLIP-----GETTAVWVSIDAPYAQPPGLYEGEIIITSKADT 190
            +IS +P     G    VWV +  P     G Y+G   +TS   T
Sbjct: 149 AKISAVPFDVKAGNNQPVWVDLLVPQTAQAGTYKGTYTVTSNEGT 193


>gi|315648573|ref|ZP_07901671.1| hypothetical protein PVOR_25096 [Paenibacillus vortex V453]
 gi|315276052|gb|EFU39399.1| hypothetical protein PVOR_25096 [Paenibacillus vortex V453]
          Length = 554

 Score = 46.2 bits (108), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 44/102 (43%), Gaps = 10/102 (9%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSA 554
           WTY C        N    +  +++R +  +++K    GFL+WG N +      +A  P  
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRILGIQLYKYNIEGFLHWGYNFWNSQYSRRAIDPFQ 458

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                 G P GD  + YPGE       P+ S+RL+     LQ
Sbjct: 459 VTDADGGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQ 495


>gi|227495338|ref|ZP_03925654.1| conserved hypothetical protein [Actinomyces coleocanis DSM 15436]
 gi|226831208|gb|EEH63591.1| conserved hypothetical protein [Actinomyces coleocanis DSM 15436]
          Length = 532

 Score = 45.8 bits (107), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 46/198 (23%), Positives = 75/198 (37%), Gaps = 39/198 (19%)

Query: 392 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 451
            L T+   + A+F++ DEP N +H  + R         A  A+V+           L   
Sbjct: 308 FLETEIGLEHAWFHVSDEP-NADHLEAYR---------AAKAQVVDLLAGTQVIDALSEP 357

Query: 452 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPH 511
            F+  V +P               V  N+ D  + +  E  P      W Y C+      
Sbjct: 358 EFQEVVDIPV--------------VATNKVDGFRAVGVE--PT-----WVYNCVAQDRLV 396

Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEK-----ATVPSAEIRFRRGLPPGD 566
            N  +  RG++HR + ++++K    G L+W  N Y +        P  +     G   GD
Sbjct: 397 ANRFIAQRGTRHREIGFQLFKFNAKGILHWAFNFYNRQFSLGVLDPYKDTAAGGGFLSGD 456

Query: 567 GVLFYP---GEVFSSSRQ 581
             + YP   G+V+ S R 
Sbjct: 457 SFVVYPVADGKVYESLRH 474


>gi|331085437|ref|ZP_08334522.1| hypothetical protein HMPREF0987_00825 [Lachnospiraceae bacterium
           9_1_43BFAA]
 gi|330407675|gb|EGG87173.1| hypothetical protein HMPREF0987_00825 [Lachnospiraceae bacterium
           9_1_43BFAA]
          Length = 552

 Score = 45.8 bits (107), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 9/97 (9%)

Query: 497 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATVPSA 554
           E+ W Y C G  +   N  +   G + R +  +++K    GFL+WG N Y  EK+  P  
Sbjct: 392 EKLWGYYCTGQYEDVSNRFIVQPGYRTRILGVQMYKYQLDGFLHWGYNFYNSEKSLYPID 451

Query: 555 EIRFRR---GLPPGDGVLFYPGEVFSSSRQPVASLRL 588
             R        P GD  L YPG    + R+P  S+RL
Sbjct: 452 PYRCTDASGAFPSGDPFLVYPG----ADRKPEESIRL 484


>gi|186681830|ref|YP_001865026.1| hypothetical protein Npun_F1375 [Nostoc punctiforme PCC 73102]
 gi|186464282|gb|ACC80083.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 543

 Score = 45.4 bits (106), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 73/172 (42%), Gaps = 40/172 (23%)

Query: 55  VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDL 114
           ++ +PS   +G  E         + AA+ E ESVQ+ ++     + SS    V +  SDL
Sbjct: 42  LYMVPSLKRIGQTEKITNTSLSKIYAAKGEYESVQLVIK-----APSSGLTNVNISVSDL 96

Query: 115 CSASGDRLVVGQSLMLRR----------------VVPMLGV---PDALVPLDLPVCQISL 155
              S ++++   ++ L R                + P LGV   PD L+P   PV Q   
Sbjct: 97  L-GSNNQIIPKNNITLYREHYVYVSHSSPNMRDNLNPPLGVGWYPDGLIPFLDPVTQKPP 155

Query: 156 IPGETTAV------------WVSIDAPYAQPPGLYEGEIIITS---KADTEL 192
           + GE  AV            WV +  P     G Y G+ I+TS   KA++++
Sbjct: 156 LTGELKAVPFRLQSQYNQPIWVDVFVPRNAKSGEYTGKFIVTSDQGKAESKI 207


>gi|197121796|ref|YP_002133747.1| hypothetical protein AnaeK_1387 [Anaeromyxobacter sp. K]
 gi|196171645|gb|ACG72618.1| Myxococcales GC_trans_RRR domain protein [Anaeromyxobacter sp. K]
          Length = 609

 Score = 45.4 bits (106), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 6/88 (6%)

Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 571
           P++ +    S++RA+ W  + E  +G LYW      +A   S +  F      GDG LFY
Sbjct: 412 PSYMVDASASRNRAMEWITFLERASGELYWETAYSFRADPWSRQWDFSGN---GDGTLFY 468

Query: 572 PGE---VFSSSRQPVASLRLERILSGLQ 596
           PG+   +   +  PVAS+RL+ I +G+Q
Sbjct: 469 PGKPARIGGKTDVPVASVRLKMIRAGMQ 496



 Score = 39.3 bits (90), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 58/141 (41%), Gaps = 25/141 (17%)

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALR-------PKVSWSSSSTAGVVQVQCSD 113
           T  + P    RP    +L AARNE  + Q+ +         +V       A + +V   D
Sbjct: 35  TEKIRPDAKARPQTEAHLSAARNEFAAFQVVVTGPAKRVTARVEGLDGMDATLFRVDTLD 94

Query: 114 LCSASGDRLVVGQSLMLRRVVPMLGVPDALVPL--DLPVCQISLIP----GETTAVWVSI 167
           + S S      G+             PDALVP   D+   Q +  P     E+ AVWV +
Sbjct: 95  VTSPSAVDGGTGR------------WPDALVPDVDDVVGEQRNAFPFDVGTESRAVWVDV 142

Query: 168 DAPYAQPPGLYEGEIIITSKA 188
             P     G+Y+G ++I+S A
Sbjct: 143 HVPADARSGVYQGAVVISSDA 163


>gi|220916589|ref|YP_002491893.1| hypothetical protein A2cp1_1483 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219954443|gb|ACL64827.1| Myxococcales GC_trans_RRR domain protein [Anaeromyxobacter
           dehalogenans 2CP-1]
          Length = 609

 Score = 45.4 bits (106), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 6/88 (6%)

Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 571
           P++ +    S++RA+ W  + E  +G LYW      +A   S +  F      GDG LFY
Sbjct: 412 PSYMVDASASRNRAMEWITFLERASGELYWETAYSFRADPWSRQWDFSGN---GDGTLFY 468

Query: 572 PGE---VFSSSRQPVASLRLERILSGLQ 596
           PG+   +   +  PVAS+RL+ I +G+Q
Sbjct: 469 PGKPARIGGKTDVPVASVRLKMIRAGMQ 496



 Score = 43.5 bits (101), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 60/147 (40%), Gaps = 25/147 (17%)

Query: 55  VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALR-------PKVSWSSSSTAGVV 107
            W   +T  + P    RP    +L AARNE  + Q+ +         +V       A + 
Sbjct: 29  AWVASATEKIRPDAKARPQTEAHLSAARNEFAAFQVVVTGPAKRVTARVEGLDGMDATLF 88

Query: 108 QVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPL--DLPVCQISLIP----GETT 161
           +V   D+ S S      G+             PDALVP   D+   Q +  P     E+ 
Sbjct: 89  RVDTLDVTSPSAVDGGTGR------------WPDALVPDVDDVVGEQRNAFPFDVGAESR 136

Query: 162 AVWVSIDAPYAQPPGLYEGEIIITSKA 188
           AVWV +  P     G+Y+G ++I+S A
Sbjct: 137 AVWVDVHVPADARSGVYQGAVVISSDA 163


>gi|335045263|ref|ZP_08538286.1| hypothetical protein HMPREF9124_2064 [Oribacterium sp. oral taxon
           108 str. F0425]
 gi|333759049|gb|EGL36606.1| hypothetical protein HMPREF9124_2064 [Oribacterium sp. oral taxon
           108 str. F0425]
          Length = 556

 Score = 45.4 bits (106), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 53/213 (24%), Positives = 92/213 (43%), Gaps = 17/213 (7%)

Query: 400 KKAYFYLWDEP-LNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESF-- 456
           ++   + WD P +  E+   ++    EL A+  +  +L   Y   SD P      +SF  
Sbjct: 292 RECKLFSWDSPAVGGEYTEFLKVFLPELKAFLKEENILENSYFHISDEP-NEDNMDSFGA 350

Query: 457 --VKVPKFLRPHTQIYCTSEWVLGNREDLVKDIV--TELQP--ENG-EEWWTYVCMGPSD 509
               V + L     +   S + +  R  + + +V    ++P  E G +  W Y C G + 
Sbjct: 351 AVESVRELLADCKVMDALSSFEIYRRGYVQRPVVAVNHIEPFVEAGVKNLWAYYCTGQAV 410

Query: 510 PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATV---PSAEIRFRRGLPP 564
             PN  + M  +++R +    +     GFL+WG N Y  EK+     P A        P 
Sbjct: 411 DVPNRFIVMPSARNRILGVLCYIYQVEGFLHWGFNFYNSEKSIEHIDPYAVTDAGEAFPS 470

Query: 565 GDGVLFYPGEVFSSSRQPVASLRLERILSGLQV 597
           GD  + YPG+   ++ + + S+ LE  LS ++V
Sbjct: 471 GDPFIVYPGKD-GTAYESMRSVVLEEALSDIRV 502


>gi|453063163|gb|EMF04147.1| hypothetical protein F518_19198 [Serratia marcescens VGH107]
          Length = 556

 Score = 45.4 bits (106), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 42/102 (41%), Gaps = 9/102 (8%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 554
           W Y C        N       +++R +  +++     GFL+WG N Y  A       P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGVQLYLYRIAGFLHWGFNFYNSAHSRERINPYA 459

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                   P GD  + YPGE      QPV SLRL  +  GLQ
Sbjct: 460 VTDSGHAFPSGDPFVVYPGE----DLQPVESLRLRVLHQGLQ 497


>gi|448242314|ref|YP_007406367.1| hypothetical protein SMWW4_v1c25510 [Serratia marcescens WW4]
 gi|445212678|gb|AGE18348.1| hypothetical protein SMWW4_v1c25510 [Serratia marcescens WW4]
          Length = 556

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 42/102 (41%), Gaps = 9/102 (8%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 554
           W Y C        N       +++R +  +++     GFL+WG N Y  A       P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGVQLYLYRIAGFLHWGFNFYNSAHSRERINPYA 459

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                   P GD  + YPGE      QPV SLRL  +  GLQ
Sbjct: 460 VTDSGHAFPSGDPFVVYPGE----DLQPVESLRLRVLHQGLQ 497


>gi|363898123|ref|ZP_09324658.1| hypothetical protein HMPREF9624_01220 [Oribacterium sp. ACB7]
 gi|361956490|gb|EHL09805.1| hypothetical protein HMPREF9624_01220 [Oribacterium sp. ACB7]
          Length = 557

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 93/212 (43%), Gaps = 15/212 (7%)

Query: 400 KKAYFYLWDEP-LNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT--PFESF 456
           ++   + WD P +  E+   ++    EL A+  +  +L   Y   SD P       F + 
Sbjct: 293 RECKLFSWDSPAVGGEYTEFLKIFLPELKAFLKEENILGNSYFHISDEPNEDNMDSFGAA 352

Query: 457 VKVPKFLRPHTQIY-CTSEWVLGNREDLVKDIV--TELQP--ENG-EEWWTYVCMGPSDP 510
           V+  + L    ++    S + +  R  + + +V    ++P  E G +  W Y C G +  
Sbjct: 353 VESVRALLADCKVMDALSSFEIYRRGYVQRPVVAVNHIEPFVEAGVKNLWAYYCTGQAVD 412

Query: 511 HPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATV---PSAEIRFRRGLPPG 565
            PN  + M  +++R +    +     GFL+WG N Y  EK+     P A        P G
Sbjct: 413 VPNRFIVMPSARNRILGVLCYIYQVEGFLHWGFNFYNSEKSIEHIDPYAVTDAGEAFPSG 472

Query: 566 DGVLFYPGEVFSSSRQPVASLRLERILSGLQV 597
           D  + YPG+   ++ + + S+ LE  LS ++V
Sbjct: 473 DPFIVYPGKD-GTAYESMRSVVLEEALSDIRV 503


>gi|333994386|ref|YP_004526999.1| hypothetical protein TREAZ_0869 [Treponema azotonutricium ZAS-9]
 gi|333734735|gb|AEF80684.1| conserved hypothetical protein [Treponema azotonutricium ZAS-9]
          Length = 604

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 51/116 (43%), Gaps = 9/116 (7%)

Query: 486 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 545
           D +T       +  W Y C+G S   PN  + +   + RA+   ++     GFL WG N 
Sbjct: 436 DAITPFLEAGIKNLWVYYCVGQSRRVPNRFIALPSPRTRAMGVLMYLYNIAGFLQWGYNY 495

Query: 546 YEKATVPSAEIRFRR--GL---PPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           Y  A   S    + +  GL   P GD  L YPG    +  +PV+S+  E    GL+
Sbjct: 496 YYSALSKSLVDPYLKTGGLKDWPGGDPFLVYPG----ADGKPVSSIHAEAHREGLE 547


>gi|383814347|ref|ZP_09969768.1| hypothetical protein SPM24T3_08339 [Serratia sp. M24T3]
 gi|383296757|gb|EIC85070.1| hypothetical protein SPM24T3_08339 [Serratia sp. M24T3]
          Length = 517

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 44/102 (43%), Gaps = 9/102 (8%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV-----PSA 554
           WTY C        N    +   ++R +  +++    TGFL+WG N Y          P A
Sbjct: 354 WTYYCCVQKLEVSNRFFALPSYRNRIIGVQLYLYSITGFLHWGFNFYNSGHSREHLDPFA 413

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
               +   P GD  + YPG+ +    QP+ SLRL  +   LQ
Sbjct: 414 ITDGQGAFPSGDLFVVYPGQDY----QPIESLRLMVLREALQ 451


>gi|354580346|ref|ZP_08999251.1| hypothetical protein PaelaDRAFT_0352 [Paenibacillus lactis 154]
 gi|353202777|gb|EHB68226.1| hypothetical protein PaelaDRAFT_0352 [Paenibacillus lactis 154]
          Length = 552

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 56/223 (25%), Positives = 89/223 (39%), Gaps = 51/223 (22%)

Query: 385 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPS 444
           ++++ ++L+R      + +F++ DEP ++EH  + R  A  +                  
Sbjct: 311 FLKELVQLIRGLGIEDRIFFHVSDEP-HLEHLETYRKAAEIV------------------ 351

Query: 445 DAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIV---TELQP--ENG-EE 498
           D  +G  P               +I   S++    +E LV + +    +LQP  E+G   
Sbjct: 352 DVAVGDYP---------------RIDALSDYAF-YKEGLVPNPIPATDKLQPFLESGVAP 395

Query: 499 WWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC----YEKATV-PS 553
            WTY C        N        ++R +  +++K    GFL+WG N     Y K  V P 
Sbjct: 396 LWTYYCCSQYKQVANRFFSFPSERNRILGLQLYKYRIKGFLHWGFNFWNSQYSKRPVNPY 455

Query: 554 AEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                  G P GD  L YPGE       PV SLR++     LQ
Sbjct: 456 LTTDADIGYPSGDAFLVYPGE-----DGPVCSLRMKVFREALQ 493


>gi|365132098|ref|ZP_09342072.1| hypothetical protein HMPREF1032_03868 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363617409|gb|EHL68801.1| hypothetical protein HMPREF1032_03868 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 547

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 50/195 (25%), Positives = 67/195 (34%), Gaps = 22/195 (11%)

Query: 385 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPS 444
           +V+   E LR      K  F++ DEP    H+ +  ++ +    Y   A +L  Y     
Sbjct: 297 FVQALAEFLRKYGWQDKVVFHIHDEP--DIHFKNEASLLARKRQYYLAAGILRKY----- 349

Query: 445 DAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVC 504
              L        V  P+F R    I     WV G      +    +     GE  W YVC
Sbjct: 350 ---LPNVRVIEAVASPEF-RGGVDI-----WVPGTPGYEARQADFDALTALGESVWAYVC 400

Query: 505 MGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSA------EIRF 558
            GP     N  L     + R + W        GFL+WG N +     P A          
Sbjct: 401 CGPEGNWLNRFLDFALLKGRLLFWGCAANRLGGFLHWGFNQFPAGMDPFAGTSCPNHTGI 460

Query: 559 RRGLPPGDGVLFYPG 573
               P GD  L YPG
Sbjct: 461 GTNFPCGDSFLVYPG 475


>gi|424868246|ref|ZP_18292005.1| hypothetical protein C75L2_00760029 [Leptospirillum sp. Group II
           'C75']
 gi|124515950|gb|EAY57459.1| protein of unknown function [Leptospirillum rubarum]
 gi|387221464|gb|EIJ76022.1| hypothetical protein C75L2_00760029 [Leptospirillum sp. Group II
           'C75']
          Length = 681

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 96/248 (38%), Gaps = 38/248 (15%)

Query: 373 SPVLSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VR 420
           SPV S   G  D   + +  L  + HWK       K + Y+ DEP++  +Y +     + 
Sbjct: 388 SPV-SDWKGVPDIATQNLAKLIVQ-HWKEKGWPIDKTFAYIADEPVHKLYYYADTYKLIA 445

Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
             A  LH  +P   V+ T      D P     +++ V   K +     +   + W  G  
Sbjct: 446 KNADSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GAS 492

Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
              +   + E Q E G+  W Y   GP     N  L   G   R   W  WK    G  Y
Sbjct: 493 AQFIPSRMQERQKE-GDHVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFY 550

Query: 541 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLERILSGL 595
           W A+ +     P+      +GL  GDG + YPG       +     P+ S+R+ +   G 
Sbjct: 551 W-ADTFWNDNKPNMNPYVNQGL--GDGTIMYPGTELHFIGYPDIHGPIPSIRMAQWRRGY 607

Query: 596 Q-VRWICY 602
           +  R++ Y
Sbjct: 608 EDYRYLTY 615


>gi|410478328|ref|YP_006765965.1| hypothetical protein LFML04_0771 [Leptospirillum ferriphilum ML-04]
 gi|406773580|gb|AFS53005.1| hypothetical protein LFML04_0771 [Leptospirillum ferriphilum ML-04]
          Length = 681

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 96/248 (38%), Gaps = 38/248 (15%)

Query: 373 SPVLSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VR 420
           SPV S   G  D   + +  L  + HWK       K + Y+ DEP++  +Y +     + 
Sbjct: 388 SPV-SDWKGVPDIATQNLAKLIVQ-HWKEKGWPIDKTFAYIADEPVHKLYYYADTYKLIA 445

Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
             A  LH  +P   V+ T      D P     +++ V   K +     +   + W  G  
Sbjct: 446 KNADSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GAS 492

Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
              +   + E Q E G+  W Y   GP     N  L   G   R   W  WK    G  Y
Sbjct: 493 AQFIPSRMQERQKE-GDHVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFY 550

Query: 541 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLERILSGL 595
           W A+ +     P+      +GL  GDG + YPG       +     P+ S+R+ +   G 
Sbjct: 551 W-ADTFWNDNKPNMNPYVNQGL--GDGTIMYPGTELHFIGYPDIHGPIPSIRMAQWRRGY 607

Query: 596 Q-VRWICY 602
           +  R++ Y
Sbjct: 608 EDYRYLTY 615


>gi|403220771|dbj|BAM38904.1| conserved hypothetical protein [Theileria orientalis strain
           Shintoku]
          Length = 210

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 34/149 (22%), Positives = 68/149 (45%), Gaps = 23/149 (15%)

Query: 337 GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTK 396
           G+++++L +    P DH   +E+F D     Y   +  +  S+D +++ V   I +++  
Sbjct: 11  GQNLQILKFLFSIPNDHV--NEHFDDK----YVREFHRLDDSSDNSEELVTARI-IIKLI 63

Query: 397 AHWKKAYFYLWDEPL--NMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFE 454
            H  + +  + D+ +  NME ++ +R+++ ELH Y+            P     G    E
Sbjct: 64  KHEFEKFNLIRDQYITPNMERFTQIRHLSQELHPYSDT----------PCSTQAGCDKLE 113

Query: 455 SFVKVPKFLRPHTQ----IYCTSEWVLGN 479
             + +  ++R  T     I+ T   VLGN
Sbjct: 114 MLMNLCSYIRGGTSFAYDIFATMVHVLGN 142


>gi|225571626|ref|ZP_03780622.1| hypothetical protein CLOHYLEM_07724 [Clostridium hylemonae DSM
           15053]
 gi|225159703|gb|EEG72322.1| hypothetical protein CLOHYLEM_07724 [Clostridium hylemonae DSM
           15053]
          Length = 557

 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 51/211 (24%), Positives = 81/211 (38%), Gaps = 21/211 (9%)

Query: 400 KKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKV 459
           K+   + W  P   E+   +     EL A   +  + T  Y   SD P  P   +++ + 
Sbjct: 291 KEEKIFGWHTPAVGEYTRFLHAFLPELTARLKEWGIDTVTYFHLSDEP-RPDDLDTYRQA 349

Query: 460 PKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGE----------EWWTYVCMGPSD 509
            + +    + Y T +  L + E     +V +  P N E          + WTY C+G   
Sbjct: 350 KESVADLLKGYHTFD-ALSSYEFYRHGLVDKPIPGNNEIDEFLEHGLTDMWTYYCVGQYL 408

Query: 510 PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPP 564
              N  + M   ++R    +++K    G L+WG N Y      +   P        G P 
Sbjct: 409 EVSNRFMSMPSLRNRIYGLQLYKYDIIGILHWGYNFYNSQFSLEHINPYETTDAGGGFPA 468

Query: 565 GDGVLFYPGEVFSSSRQPVASLRLERILSGL 595
           GD  L YPGE      +P  S+R+     GL
Sbjct: 469 GDPFLVYPGE----DGRPEESIRMMVHYEGL 495


>gi|284032103|ref|YP_003382034.1| carbohydrate binding family 6 [Kribbella flavida DSM 17836]
 gi|283811396|gb|ADB33235.1| Carbohydrate binding family 6 [Kribbella flavida DSM 17836]
          Length = 1437

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 90/221 (40%), Gaps = 36/221 (16%)

Query: 382 AKDYVRKEIELLRT----KAHWKKAYFYLWDEPLNMEHYSSVRNMASEL-HAYAPDARVL 436
           A+++ R+ +  L+T    K  + + Y  + DEP +  H  + R    EL   +AP  +  
Sbjct: 602 AQNFARQYLSALKTHLVAKGWFTQWYQSVGDEPGSPAHAETWRRAVDELIKVHAPGMKTS 661

Query: 437 TTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG 496
           T Y   PS    G T FE  + V      H  +  T E          KD     Q   G
Sbjct: 662 TPYIGPPS--TWGAT-FEGRLNV------HVPLLSTHE--------SAKDYFRGRQAL-G 703

Query: 497 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG-ANCYEKATVPSAE 555
           +E WTYVC  P     N  +    S  R + W  +  G TG L+W  +N  E  T+ +  
Sbjct: 704 DEVWTYVCNRPLGAFYNRLIDQPLSAPRFMNWSNFANGVTGTLHWAYSNWKEDPTINAT- 762

Query: 556 IRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
                   PGD  + YP  V   +    ++LR + +  G++
Sbjct: 763 --------PGDTAIVYPDPV---NNDVTSTLRHDAMRDGIE 792


>gi|302388391|ref|YP_003824213.1| hypothetical protein Closa_4081 [Clostridium saccharolyticum WM1]
 gi|302199019|gb|ADL06590.1| conserved hypothetical protein [Clostridium saccharolyticum WM1]
          Length = 552

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 44/102 (43%), Gaps = 9/102 (8%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE----KATVPSAE 555
           W+Y C        N  + M  +++RA   +V+K G  G L+WG N Y     +  +   E
Sbjct: 399 WSYYCTAQCVDVSNRFMAMPSARNRAYGLQVYKYGMEGILHWGFNFYNSEHSRHHINPYE 458

Query: 556 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           +    G  P GD  L YPG    S   P  S+RL  +    Q
Sbjct: 459 VTDCEGSFPSGDAFLVYPG----SDGIPEESIRLMVLCEAKQ 496


>gi|86158894|ref|YP_465679.1| hypothetical protein Adeh_2472 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85775405|gb|ABC82242.1| hypothetical protein Adeh_2472 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 609

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 48/88 (54%), Gaps = 6/88 (6%)

Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 571
           P++ +    S++RA+ W  + E  +G LYW      ++   +++  F      GDG LFY
Sbjct: 412 PSYMIDASASRNRAMEWITFLERASGELYWETAYSFRSDPWTSQWDFSGN---GDGTLFY 468

Query: 572 PGE---VFSSSRQPVASLRLERILSGLQ 596
           PG+   +   +  PVAS+R++ I +G+Q
Sbjct: 469 PGKPSRIGGKTDIPVASVRVKMIRAGMQ 496



 Score = 38.9 bits (89), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 58/135 (42%), Gaps = 13/135 (9%)

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           T  + P    RP    +L AARNE  + Q+ +      +  +     +V+  D  S S  
Sbjct: 35  TEKIRPDAKARPQTEAHLAAARNEFAAFQVVV------TGPAKGVTARVEGLDGLSVSLF 88

Query: 121 RLVVGQSLMLRRVVPMLGV-PDALVPL--DLPVCQISLIP----GETTAVWVSIDAPYAQ 173
           R+          V    G  PDALVP   D+   + +  P     E+ AVWV +  P   
Sbjct: 89  RVETLNVTSPSAVDGGTGRWPDALVPDVDDVVGEKRNAFPFDVGSESRAVWVDVHVPAGA 148

Query: 174 PPGLYEGEIIITSKA 188
             G+Y+G ++I+S A
Sbjct: 149 RSGIYQGAVVISSDA 163


>gi|403380550|ref|ZP_10922607.1| hypothetical protein PJC66_12097 [Paenibacillus sp. JC66]
          Length = 555

 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 47/102 (46%), Gaps = 9/102 (8%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-YEKATV----PSA 554
           WTY C    +   N  + M   ++R + ++++K    GFL+WG N  Y + ++    P  
Sbjct: 399 WTYYCCSQYEEVSNRFIDMPSWRNRILGFQLYKFQIRGFLHWGYNFWYSQYSIRPINPYQ 458

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
           +       P GD  + YPG    S+ + V SLRL+      Q
Sbjct: 459 QTDANYAFPSGDPFVVYPG----SNGEAVLSLRLKVFYDAFQ 496


>gi|374373535|ref|ZP_09631195.1| hypothetical protein NiasoDRAFT_2351 [Niabella soli DSM 19437]
 gi|373234508|gb|EHP54301.1| hypothetical protein NiasoDRAFT_2351 [Niabella soli DSM 19437]
          Length = 576

 Score = 43.1 bits (100), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 29/123 (23%), Positives = 54/123 (43%), Gaps = 14/123 (11%)

Query: 474 EWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKE 533
           ++ + ++     +I+ E Q +     W   C   ++ +PN       ++H  + W    +
Sbjct: 414 DYCIASKHQFPDNILKERQQQGKLSTWYTCC---TEKYPNGFTFSPPAEHVWIGWYTAAK 470

Query: 534 GGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILS 593
              G+L W  N + +   P  + RFR   P GD    YPG        P +S+R E+++ 
Sbjct: 471 NMNGYLRWAYNSWVEH--PETDSRFR-SWPAGDTYQVYPG--------PASSIRFEKLIE 519

Query: 594 GLQ 596
           G+Q
Sbjct: 520 GIQ 522


>gi|256420134|ref|YP_003120787.1| hypothetical protein Cpin_1088 [Chitinophaga pinensis DSM 2588]
 gi|256035042|gb|ACU58586.1| hypothetical protein Cpin_1088 [Chitinophaga pinensis DSM 2588]
          Length = 560

 Score = 42.7 bits (99), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 19/109 (17%)

Query: 492 QPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-----Y 546
           + +N  E W YVC+ P   +PN  L     + R + W  +K   TGF++WG N      +
Sbjct: 415 RAQNKGELWYYVCVSPQYNYPNRFLENPLIKTRFLHWTNYKYDLTGFMHWGYNIWTGYPF 474

Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGL 595
           + +T  S           GD  + YP +      + ++S+RLE +  G+
Sbjct: 475 DFSTSNSV---------GGDAWIVYPKD-----GKIISSVRLEAMRDGI 509


>gi|386814775|ref|ZP_10101993.1| hypothetical protein Thini_0548 [Thiothrix nivea DSM 5205]
 gi|386419351|gb|EIJ33186.1| hypothetical protein Thini_0548 [Thiothrix nivea DSM 5205]
          Length = 604

 Score = 42.7 bits (99), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 86/381 (22%), Positives = 132/381 (34%), Gaps = 63/381 (16%)

Query: 264 AISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV-RHGSDEWYEALDQHFK 322
           A     + + ++LTVWDF LP    L  V G +   + + +G  R G       L + + 
Sbjct: 177 ATGEGQLELPVTLTVWDFSLPERSPLRTVFGTNGYRVAEVYGFERTGKSAADNRLIRAYN 236

Query: 323 -WLLQYRISPF----------------FCRWGESMRVLTYTCPWPADHPKSDEYFSDPRL 365
            +LL + +SP                 F R    +  +T    + A    +  Y     +
Sbjct: 237 DFLLDHHLSPESFWDAAPEANADGLPDFGRQFAGLGTVTDNMRYYAQEKHASAY---TYV 293

Query: 366 AAYAVPYS-PVLSSNDGAKDYVRKEIELLRTKAHWKKAYF--YLWDEPLNMEHYSSVRNM 422
            A + P++ P+      A+ ++R   +     A  ++ Y      DEP   + Y   R  
Sbjct: 294 FADSYPFADPLGEDRQQAQRFMRAYADWCGKHAGAERCYTDPSFVDEPDTRDAYQYARRW 353

Query: 423 ASELHAYA-PDARVLTTYYCGP---SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLG 478
                  + P    +      P    D  LG    +  V VPKF      +    + V G
Sbjct: 354 GEFFDEISLPKGENIHFQVSEPPLNEDPGLGSLVGKVEVWVPKFYDLWRDVDFLGKNVAG 413

Query: 479 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPH----------------PNWHLGMRGSQ 522
            R               GEE W Y  +    P                 P W L      
Sbjct: 414 QRL------------AAGEEVWAYTSLVLDFPEYSKLNPKADVLKGSYPPVWQLDFPAIN 461

Query: 523 HRAVMWRVWKEGGTGFLYWGANC-YEKATVPSAEIRFRRGLPP-----GDGVLFYPG-EV 575
           +R   W   + G TG  YW     +E A V +    F    PP     GDG+L YPG + 
Sbjct: 462 YRIPTWLFHRYGVTGLGYWDTLAWFEGADVWNDAASFVSQNPPGIRFNGDGLLVYPGFKA 521

Query: 576 FSSSRQPVASLRLERILSGLQ 596
            +    P+ASLRL+ I   ++
Sbjct: 522 QTGFDGPLASLRLKWIRESVE 542



 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 39/160 (24%), Positives = 64/160 (40%), Gaps = 27/160 (16%)

Query: 53  VHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCS 112
           + V  + +    G  E+    + + L AARNE E  Q  +       ++   G V VQ S
Sbjct: 31  LQVKSIGALDRFGRFELVTGSDKVELFAARNEYEGFQFVV-------TAGERGAVDVQAS 83

Query: 113 -DLCSASGDRLVVGQSLMLRRVVPMLGV-----------PDALVPLDLPVCQI------- 153
             +  +   +++ G  +   R V +              PD L+P D    +        
Sbjct: 84  ISVLRSVEGQVIDGLKVFRERYVKVSTPSPHSPYAPQYWPDILLPADNAGAEAAAYRAFP 143

Query: 154 -SLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTEL 192
            +L  GE   VWV I  P    PG+Y G+I +T+  + +L
Sbjct: 144 QNLTAGENLPVWVDIHIPADARPGVYTGKISVTATGEGQL 183


>gi|162450247|ref|YP_001612614.1| hypothetical protein sce1975 [Sorangium cellulosum So ce56]
 gi|161160829|emb|CAN92134.1| hypothetical protein predicted by Glimmer/Critica [Sorangium
           cellulosum So ce56]
          Length = 687

 Score = 42.4 bits (98), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 36/63 (57%), Gaps = 4/63 (6%)

Query: 139 VPDALVPLDL--PVCQISLIPG--ETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSS 194
           VPDAL+P++L  P     L  G  ET AVW+ +  P    PG YEG +++ S +  EL+S
Sbjct: 176 VPDALIPVELAPPWAPYPLEVGARETRAVWIDLHVPEGALPGAYEGRVVVGSVSHGELAS 235

Query: 195 QCL 197
             L
Sbjct: 236 LEL 238


>gi|336428440|ref|ZP_08608421.1| hypothetical protein HMPREF0994_04427 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336005693|gb|EGN35737.1| hypothetical protein HMPREF0994_04427 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 522

 Score = 42.0 bits (97), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 42/91 (46%), Gaps = 3/91 (3%)

Query: 485 KDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGAN 544
           KD    LQ + GEE W Y C  P+    N  + +  +  R ++W       TGFL+WG N
Sbjct: 363 KDTFRLLQ-DAGEEIWFYTCAFPAGNIMNRSMDLPLTVSRLLLWMGASCRLTGFLHWGFN 421

Query: 545 CYEKATVPSAEIRFRRG--LPPGDGVLFYPG 573
            Y    + +      +G  LP GD  + YPG
Sbjct: 422 YYIGDDIWNRACCPHKGALLPAGDAHIVYPG 452


>gi|346306833|ref|ZP_08848983.1| hypothetical protein HMPREF9457_00692 [Dorea formicigenerans
           4_6_53AFAA]
 gi|345907730|gb|EGX77437.1| hypothetical protein HMPREF9457_00692 [Dorea formicigenerans
           4_6_53AFAA]
          Length = 558

 Score = 42.0 bits (97), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 42/94 (44%), Gaps = 9/94 (9%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA----TVPSAE 555
           WTY C G      N  + M  +++R    +++K    G L+WG N Y        +   E
Sbjct: 401 WTYYCTGQFYEVSNRFMSMPSARNRIYGVQLYKYKIIGVLHWGYNFYNSQYSIEHINPYE 460

Query: 556 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRL 588
           +    G  P GD  L YPGE    + QP  SLR+
Sbjct: 461 VTDAAGAFPSGDPFLVYPGE----NGQPEESLRM 490


>gi|166033167|ref|ZP_02235996.1| hypothetical protein DORFOR_02889 [Dorea formicigenerans ATCC
           27755]
 gi|166027524|gb|EDR46281.1| hypothetical protein DORFOR_02889 [Dorea formicigenerans ATCC
           27755]
          Length = 558

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 42/94 (44%), Gaps = 9/94 (9%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA----TVPSAE 555
           WTY C G      N  + M  +++R    +++K    G L+WG N Y        +   E
Sbjct: 401 WTYYCTGQFYEVSNRFMSMPSARNRIYGVQLYKYEIIGVLHWGYNFYNSQYSIEHINPYE 460

Query: 556 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRL 588
           +    G  P GD  L YPGE    + QP  SLR+
Sbjct: 461 VTDAAGAFPSGDPFLVYPGE----NGQPEESLRM 490


>gi|427442082|ref|ZP_18925530.1| conserved hypothetical protein [Pediococcus lolii NGRI 0510Q]
 gi|425786839|dbj|GAC46318.1| conserved hypothetical protein [Pediococcus lolii NGRI 0510Q]
          Length = 384

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 75/195 (38%), Gaps = 35/195 (17%)

Query: 384 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGP 443
           +YV+   + L+    W+KA   + DEP         ++  S L    PD +V   +   P
Sbjct: 137 NYVKALCDHLKDLQVWEKARL-IADEP-KQAQLKEFKDALSALKQMVPDLKVKVAFDKEP 194

Query: 444 ---SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWW 500
                APL  T   SF             YCTS++             ++LQ  +  E  
Sbjct: 195 ILNELAPLVDTLATSF-------------YCTSQFG------------SQLQASHPGEVQ 229

Query: 501 TYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR- 559
            Y+C  P  P+   H  +  ++ +  +       G   L W  NC+   +    +IR+  
Sbjct: 230 YYICNYPDHPNTFLHSPLLETRLQGTLTAFLPVNG--LLRWAFNCW--PSNAREDIRYNT 285

Query: 560 RGLPPGDGVLFYPGE 574
             LP GD  L YPGE
Sbjct: 286 SSLPIGDNCLVYPGE 300


>gi|206601987|gb|EDZ38469.1| Protein of unknown function [Leptospirillum sp. Group II '5-way
           CG']
          Length = 681

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 58/245 (23%), Positives = 95/245 (38%), Gaps = 37/245 (15%)

Query: 376 LSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VRNMA 423
           +S   G  D   + +  L  + HWK       + + Y+ DEP++  +Y +     +   A
Sbjct: 390 ISDWKGVPDIATQNLAKLIVR-HWKEKGWPIDQTFAYIADEPVHKLYYYADTYKLIAKDA 448

Query: 424 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 483
             LH  +P   V+ T      D P     +++ V   K +     +   + W  G     
Sbjct: 449 DSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GASAQF 495

Query: 484 VKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGA 543
           +   +   Q E G++ W Y   GP     N  L   G   R   W  WK    G  YW A
Sbjct: 496 IPSRMQARQKE-GDQVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFYW-A 552

Query: 544 NCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLERILSGLQ-V 597
           + +   T  +      +GL  GDG + YPG       F   + P+ S+R+ +   G +  
Sbjct: 553 DTFWNDTKENMNPYVNQGL--GDGTILYPGTELHFIGFPDIQGPIPSIRMAQWRRGYEDY 610

Query: 598 RWICY 602
           R++ Y
Sbjct: 611 RYLTY 615


>gi|365132343|ref|ZP_09342149.1| hypothetical protein HMPREF1032_03945 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363616981|gb|EHL68397.1| hypothetical protein HMPREF1032_03945 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 571

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 9/95 (9%)

Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 554
           W Y C   S   PN    M  +++R +   ++  G  GFL+WG N Y          P  
Sbjct: 416 WVYYCCAQSSLVPNRFFAMESARNRIMGVLMYLYGIKGFLHWGYNFYNSKFSLHPVDPYR 475

Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 589
                   P GD  L YPG        P++S+R E
Sbjct: 476 VTHADYAFPSGDPFLVYPG----PDGAPLSSVRAE 506


>gi|320536341|ref|ZP_08036383.1| PHP domain protein [Treponema phagedenis F0421]
 gi|320146822|gb|EFW38396.1| PHP domain protein [Treponema phagedenis F0421]
          Length = 869

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 31/118 (26%), Positives = 46/118 (38%), Gaps = 10/118 (8%)

Query: 486 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 545
           D +     +N +  W Y C   +    N    M   ++R +   ++K    GFL+WG N 
Sbjct: 709 DSIAPFIAKNVKPLWAYYCSAQAVHVSNRFFAMPSWRNRILGMLLYKFDIDGFLHWGYNF 768

Query: 546 Y-----EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQVR 598
           Y      K   P          P GD    YPG+      +P+ S+RL+     LQ R
Sbjct: 769 YYTQYSRKLIDPFTVTDAGGAFPAGDSFSVYPGK-----DEPLPSIRLKVFYEALQDR 821


>gi|325663334|ref|ZP_08151784.1| hypothetical protein HMPREF0490_02525 [Lachnospiraceae bacterium
           4_1_37FAA]
 gi|325470788|gb|EGC74018.1| hypothetical protein HMPREF0490_02525 [Lachnospiraceae bacterium
           4_1_37FAA]
          Length = 555

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 42/106 (39%), Gaps = 9/106 (8%)

Query: 488 VTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE 547
           + E       + WTY C G      N  + M  +++R    +++K    G L+WG N Y 
Sbjct: 387 IEEFLEHGLTDMWTYYCTGQFYEVSNRFMSMPSARNRIYGIQLYKYDIIGILHWGYNFYN 446

Query: 548 -----KATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRL 588
                +   P        G P GD  L YPG    +   P  S+R+
Sbjct: 447 SQHSYEHINPYQVTDAANGFPAGDPFLVYPG----ADGHPEESIRM 488


>gi|386070149|ref|YP_005985045.1| hypothetical protein TIIST44_02580 [Propionibacterium acnes ATCC
           11828]
 gi|353454516|gb|AER05035.1| hypothetical protein TIIST44_02580 [Propionibacterium acnes ATCC
           11828]
          Length = 550

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 65/180 (36%), Gaps = 37/180 (20%)

Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
           W  A Y+++ DEP     Y+S R     +    P+A V+        DA   P  F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 369

Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 408

Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468


>gi|50843406|ref|YP_056633.1| hypothetical protein PPA1958 [Propionibacterium acnes KPA171202]
 gi|289425612|ref|ZP_06427384.1| conserved hypothetical protein [Propionibacterium acnes SK187]
 gi|335053127|ref|ZP_08545978.1| hypothetical protein HMPREF9948_2283 [Propionibacterium sp.
           434-HC2]
 gi|387504316|ref|YP_005945545.1| hypothetical protein TIB1ST10_09970 [Propionibacterium acnes 6609]
 gi|419419836|ref|ZP_13960069.1| hypothetical protein TICEST70_01370 [Propionibacterium acnes
           PRP-38]
 gi|50841008|gb|AAT83675.1| hypothetical protein PPA1958 [Propionibacterium acnes KPA171202]
 gi|289153913|gb|EFD02606.1| conserved hypothetical protein [Propionibacterium acnes SK187]
 gi|333767978|gb|EGL45192.1| hypothetical protein HMPREF9948_2283 [Propionibacterium sp.
           434-HC2]
 gi|335278361|gb|AEH30266.1| hypothetical protein TIB1ST10_09970 [Propionibacterium acnes 6609]
 gi|379979557|gb|EIA12877.1| hypothetical protein TICEST70_01370 [Propionibacterium acnes
           PRP-38]
          Length = 550

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 78/204 (38%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 296 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 354

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 355 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 384

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC----YEKA 549
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N     + + 
Sbjct: 385 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 444

Query: 550 TV-PSAEIRFRRGLPPGDGVLFYP 572
            V P+A+       P GD  + YP
Sbjct: 445 LVDPNADTSADLAFPSGDSSVIYP 468


>gi|365131949|ref|ZP_09342011.1| hypothetical protein HMPREF1032_03807 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363617740|gb|EHL69113.1| hypothetical protein HMPREF1032_03807 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 679

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 29/54 (53%)

Query: 496 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA 549
           G+E W Y C+ P   + N  +       R++MW V++ G  G+L+WG N +  A
Sbjct: 439 GDEVWFYTCLAPKGNYLNRFIDQPIWIGRSLMWLVYRYGVEGYLHWGWNAWHYA 492


>gi|282855297|ref|ZP_06264629.1| conserved hypothetical protein [Propionibacterium acnes J139]
 gi|282581885|gb|EFB87270.1| conserved hypothetical protein [Propionibacterium acnes J139]
          Length = 499

 Score = 39.7 bits (91), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 65/180 (36%), Gaps = 37/180 (20%)

Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
           W  A Y+++ DEP     Y+S R     +    P+A V+        DA   P  F + V
Sbjct: 269 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 318

Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 319 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 357

Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 358 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 417


>gi|422458853|ref|ZP_16535502.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2]
 gi|422466399|ref|ZP_16542973.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4]
 gi|422468180|ref|ZP_16544715.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3]
 gi|422575224|ref|ZP_16650768.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1]
 gi|314924019|gb|EFS87850.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1]
 gi|314983039|gb|EFT27131.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3]
 gi|315091619|gb|EFT63595.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4]
 gi|315104095|gb|EFT76071.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2]
          Length = 530

 Score = 39.3 bits (90), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 65/180 (36%), Gaps = 37/180 (20%)

Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
           W  A Y+++ DEP     Y+S R     +    P+A V+        DA   P  F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 349

Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 388

Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 448


>gi|422391266|ref|ZP_16471359.1| hypothetical protein HMPREF9341_02296 [Propionibacterium acnes
           HL103PA1]
 gi|422464078|ref|ZP_16540689.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1]
 gi|422564324|ref|ZP_16639979.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2]
 gi|314967153|gb|EFT11252.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2]
 gi|315093876|gb|EFT65852.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1]
 gi|327325812|gb|EGE67604.1| hypothetical protein HMPREF9341_02296 [Propionibacterium acnes
           HL103PA1]
          Length = 530

 Score = 39.3 bits (90), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 65/180 (36%), Gaps = 37/180 (20%)

Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
           W  A Y+++ DEP     Y+S R     +    P+A V+        DA   P  F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 349

Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 388

Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSHYLVDPNADTSADLAFPSGDSSVIYP 448


>gi|365963599|ref|YP_004945165.1| hypothetical protein TIA2EST36_09570 [Propionibacterium acnes
           TypeIA2 P.acn31]
 gi|365974778|ref|YP_004956337.1| hypothetical protein TIA2EST2_09530 [Propionibacterium acnes
           TypeIA2 P.acn33]
 gi|365740280|gb|AEW84482.1| hypothetical protein TIA2EST36_09570 [Propionibacterium acnes
           TypeIA2 P.acn31]
 gi|365744777|gb|AEW79974.1| hypothetical protein TIA2EST2_09530 [Propionibacterium acnes
           TypeIA2 P.acn33]
          Length = 499

 Score = 39.3 bits (90), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 334 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
              P+A+       P GD  + YP
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP 417


>gi|422395623|ref|ZP_16475656.1| hypothetical protein HMPREF9344_01398 [Propionibacterium acnes
           HL097PA1]
 gi|422454993|ref|ZP_16531671.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1]
 gi|315107964|gb|EFT79940.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1]
 gi|327333100|gb|EGE74827.1| hypothetical protein HMPREF9344_01398 [Propionibacterium acnes
           HL097PA1]
          Length = 530

 Score = 39.3 bits (90), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 276 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 334

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 335 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 364

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 365 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 424

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
              P+A+       P GD  + YP
Sbjct: 425 LVDPNADTSADLAFPSGDSSVIYP 448


>gi|422433658|ref|ZP_16510524.1| conserved hypothetical protein [Propionibacterium acnes HL059PA2]
 gi|422436298|ref|ZP_16513148.1| hypothetical protein HMPREF9586_02402 [Propionibacterium acnes
           HL083PA2]
 gi|422441996|ref|ZP_16518802.1| conserved hypothetical protein [Propionibacterium acnes HL002PA1]
 gi|422445323|ref|ZP_16522072.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1]
 gi|422511714|ref|ZP_16587855.1| conserved hypothetical protein [Propionibacterium acnes HL059PA1]
 gi|422540487|ref|ZP_16616353.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1]
 gi|422540934|ref|ZP_16616795.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1]
 gi|422546693|ref|ZP_16622518.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3]
 gi|422548802|ref|ZP_16624611.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1]
 gi|422558679|ref|ZP_16634417.1| hypothetical protein HMPREF9588_02503 [Propionibacterium acnes
           HL025PA2]
 gi|422561617|ref|ZP_16637301.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1]
 gi|422571378|ref|ZP_16646963.1| conserved hypothetical protein [Propionibacterium acnes HL067PA1]
 gi|422577404|ref|ZP_16652937.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4]
 gi|313763344|gb|EFS34708.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1]
 gi|313815003|gb|EFS52717.1| conserved hypothetical protein [Propionibacterium acnes HL059PA1]
 gi|314916711|gb|EFS80542.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4]
 gi|314919163|gb|EFS82994.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1]
 gi|314921243|gb|EFS85074.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3]
 gi|314930329|gb|EFS94160.1| conserved hypothetical protein [Propionibacterium acnes HL067PA1]
 gi|314956112|gb|EFT00508.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1]
 gi|314959730|gb|EFT03832.1| conserved hypothetical protein [Propionibacterium acnes HL002PA1]
 gi|314969811|gb|EFT13909.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1]
 gi|315098131|gb|EFT70107.1| conserved hypothetical protein [Propionibacterium acnes HL059PA2]
 gi|315102739|gb|EFT74715.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1]
 gi|327452257|gb|EGE98911.1| hypothetical protein HMPREF9586_02402 [Propionibacterium acnes
           HL083PA2]
 gi|328752296|gb|EGF65912.1| hypothetical protein HMPREF9588_02503 [Propionibacterium acnes
           HL025PA2]
          Length = 522

 Score = 39.3 bits (90), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
              P+A+       P GD  + YP
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP 440


>gi|422426470|ref|ZP_16503391.1| hypothetical protein HMPREF9579_00231 [Propionibacterium acnes
           HL087PA1]
 gi|422453869|ref|ZP_16530551.1| hypothetical protein HMPREF9581_01536 [Propionibacterium acnes
           HL087PA3]
 gi|327451751|gb|EGE98405.1| hypothetical protein HMPREF9581_01536 [Propionibacterium acnes
           HL087PA3]
 gi|328756982|gb|EGF70598.1| hypothetical protein HMPREF9579_00231 [Propionibacterium acnes
           HL087PA1]
          Length = 522

 Score = 39.3 bits (90), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
              P+A+       P GD  + YP
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP 440


>gi|422451477|ref|ZP_16528179.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2]
 gi|422499630|ref|ZP_16575891.1| conserved hypothetical protein [Propionibacterium acnes HL063PA2]
 gi|313829397|gb|EFS67111.1| conserved hypothetical protein [Propionibacterium acnes HL063PA2]
 gi|315108879|gb|EFT80855.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2]
          Length = 522

 Score = 39.3 bits (90), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
              P+A+       P GD  + YP
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP 440


>gi|365965842|ref|YP_004947407.1| hypothetical protein TIA2EST22_09585 [Propionibacterium acnes
           TypeIA2 P.acn17]
 gi|365742523|gb|AEW82217.1| hypothetical protein TIA2EST22_09585 [Propionibacterium acnes
           TypeIA2 P.acn17]
          Length = 499

 Score = 39.3 bits (90), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 334 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
              P+A+       P GD  + YP
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP 417


>gi|342211509|ref|ZP_08704234.1| hypothetical protein HMPREF9949_0083 [Propionibacterium sp.
           CC003-HC2]
 gi|340767053|gb|EGR89578.1| hypothetical protein HMPREF9949_0083 [Propionibacterium sp.
           CC003-HC2]
          Length = 499

 Score = 38.9 bits (89), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 334 AGLDGMWVYTSCASTFWDPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
              P+A+       P GD  + YP
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP 417


>gi|335050668|ref|ZP_08543624.1| hypothetical protein HMPREF9947_1061 [Propionibacterium sp.
           409-HC1]
 gi|333769177|gb|EGL46316.1| hypothetical protein HMPREF9947_1061 [Propionibacterium sp.
           409-HC1]
          Length = 426

 Score = 38.9 bits (89), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)

Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 172 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 230

Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 231 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 260

Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 261 AGLDGMWVYTSCASTFWDPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 320

Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
              P+A+       P GD  + YP
Sbjct: 321 LVDPNADTSADLAFPSGDSSVIYP 344


>gi|354605511|ref|ZP_09023487.1| hypothetical protein HMPREF1003_00054 [Propionibacterium sp.
           5_U_42AFAA]
 gi|386024898|ref|YP_005943203.1| hypothetical protein PAZ_c20420 [Propionibacterium acnes 266]
 gi|332676356|gb|AEE73172.1| hypothetical protein PAZ_c20420 [Propionibacterium acnes 266]
 gi|353558520|gb|EHC27883.1| hypothetical protein HMPREF1003_00054 [Propionibacterium sp.
           5_U_42AFAA]
          Length = 550

 Score = 38.9 bits (89), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 64/180 (35%), Gaps = 37/180 (20%)

Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 369

Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 408

Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468


>gi|456739043|gb|EMF63610.1| hypothetical protein TIA1EST31_09784 [Propionibacterium acnes
           FZ1/2/0]
          Length = 550

 Score = 38.5 bits (88), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 64/180 (35%), Gaps = 37/180 (20%)

Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPRATVI--------DAVDDPR-FATVV 369

Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 408

Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468


>gi|289426957|ref|ZP_06428676.1| conserved hypothetical protein [Propionibacterium acnes J165]
 gi|417930403|ref|ZP_12573781.1| hypothetical protein HMPREF9205_1262 [Propionibacterium acnes
           SK182]
 gi|289159779|gb|EFD07964.1| conserved hypothetical protein [Propionibacterium acnes J165]
 gi|340772245|gb|EGR94754.1| hypothetical protein HMPREF9205_1262 [Propionibacterium acnes
           SK182]
          Length = 543

 Score = 38.5 bits (88), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 64/180 (35%), Gaps = 37/180 (20%)

Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 313 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 362

Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 363 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 401

Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 402 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 461


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.439 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,760,158,765
Number of Sequences: 23463169
Number of extensions: 494229847
Number of successful extensions: 1011220
Number of sequences better than 100.0: 147
Number of HSP's better than 100.0 without gapping: 47
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 1010923
Number of HSP's gapped (non-prelim): 267
length of query: 605
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 456
effective length of database: 8,863,183,186
effective search space: 4041611532816
effective search space used: 4041611532816
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)