BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 007713
         (592 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|356518975|ref|XP_003528150.1| PREDICTED: uncharacterized protein LOC100782659 [Glycine max]
          Length = 649

 Score = 1038 bits (2683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 483/592 (81%), Positives = 538/592 (90%), Gaps = 1/592 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANVGPQ+MPR LEPINLLAARNERESVQIA+RPKVSWS SS AG VQ+QCSDLCS 
Sbjct: 57  MPSTANVGPQDMPRHLEPINLLAARNERESVQIAIRPKVSWSGSSVAGTVQIQCSDLCST 116

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           SGDRL+VGQSL+LRRVVP+LGVPDALVP+DLPV QI+L PGETTA+W+SID P +QPPG 
Sbjct: 117 SGDRLIVGQSLLLRRVVPILGVPDALVPVDLPVSQINLFPGETTALWISIDVPSSQPPGQ 176

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGEI IT+ KAD E   Q L K EKH+L+ +L+ CLD VEPI+GKPL EVVER KS  T
Sbjct: 177 YEGEIAITAIKADAESPVQILSKVEKHQLYRDLKGCLDIVEPIDGKPLDEVVERVKSATT 236

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
           +LRR++ SP FSEFFSDNGP+D+MDEDAIS+LS+R+KL+LTVW+F+LP TPSLPAV GIS
Sbjct: 237 SLRRILLSPSFSEFFSDNGPVDVMDEDAISSLSIRMKLNLTVWEFVLPETPSLPAVFGIS 296

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           DTVIEDRFGV+ G+ EWYEALDQHFKWLLQYRISP+FC+W + MRVLTYT PWPADHPKS
Sbjct: 297 DTVIEDRFGVQQGTAEWYEALDQHFKWLLQYRISPYFCKWADGMRVLTYTSPWPADHPKS 356

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
           DEYFSDPRLAAYAVPY  V+S ND AKDY++K++E+LRTK HW+KAYFYLWDEPLN+E Y
Sbjct: 357 DEYFSDPRLAAYAVPYKQVVSGNDAAKDYLQKQVEILRTKTHWRKAYFYLWDEPLNLEQY 416

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
            SVRNMASE+HAYAPDAR+LTTYYCGP+DAPL PTPFE+FVKVP FLRPH QIYCTSEWV
Sbjct: 417 DSVRNMASEIHAYAPDARILTTYYCGPNDAPLAPTPFEAFVKVPSFLRPHNQIYCTSEWV 476

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LGNREDLVKDI+TELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 477 LGNREDLVKDIITELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGT 536

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWGANCYEKATV SAEI+FR GLPPGDGVL+YPGEVFS+S QPVASLRLERIL+GLQ
Sbjct: 537 GFLYWGANCYEKATVASAEIKFRHGLPPGDGVLYYPGEVFSTSHQPVASLRLERILNGLQ 596

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           DIEYL LYASRYGRDE +ALLE+TG Y GPERYT EH+PID MRG+IFNACR
Sbjct: 597 DIEYLRLYASRYGRDESIALLERTGVYFGPERYTFEHMPIDAMRGQIFNACR 648


>gi|255538584|ref|XP_002510357.1| conserved hypothetical protein [Ricinus communis]
 gi|223551058|gb|EEF52544.1| conserved hypothetical protein [Ricinus communis]
          Length = 651

 Score = 1030 bits (2663), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/592 (82%), Positives = 537/592 (90%), Gaps = 1/592 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANVGPQE+PR LEP+NLLAARNERESVQIA+RPKVSWSSS +AG V VQC+DL S 
Sbjct: 59  MPSTANVGPQEIPRHLEPVNLLAARNERESVQIAIRPKVSWSSSGSAGAVHVQCTDLSST 118

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           SGDRLV GQS+ LR+VV +LGVPDALVPLD PV +ISL+PGETTA+WVSID P AQPPG 
Sbjct: 119 SGDRLVAGQSITLRKVVTILGVPDALVPLDHPVSRISLVPGETTAIWVSIDIPSAQPPGQ 178

Query: 121 YEGEIIIT-SKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEG+ IIT +K + E  S C  K EKHRL+MELRNCLD VEPIEGKPL+EVVER KS +T
Sbjct: 179 YEGDFIITATKTEAEYQSHCFNKAEKHRLYMELRNCLDIVEPIEGKPLNEVVERVKSAST 238

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
           +LRRV+ SP FSEFFSDNG +DMMDEDAISNLSVRVKLSLTVWDFILP TPS PAV GIS
Sbjct: 239 SLRRVLLSPSFSEFFSDNGSVDMMDEDAISNLSVRVKLSLTVWDFILPVTPSFPAVFGIS 298

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           DTVIEDRFGV HG+DEWYEAL+QHFKWLLQYRISP+FCRWG SMRV  YTCPWPADHPKS
Sbjct: 299 DTVIEDRFGVEHGTDEWYEALEQHFKWLLQYRISPYFCRWGTSMRVFGYTCPWPADHPKS 358

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
           DEY SDPRLAAYAVPY+  +S ND  KDY++KEIE+LRTK HWKKAYFYLWDEPLN+EHY
Sbjct: 359 DEYLSDPRLAAYAVPYNRAVSGNDAGKDYLQKEIEMLRTKPHWKKAYFYLWDEPLNLEHY 418

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
            S+RNMA E+H YAPDAR+LTTYYCGP+DAPL PTPFE+FVKVPKF+RPH QIYC SEWV
Sbjct: 419 DSLRNMAGEIHGYAPDARILTTYYCGPNDAPLAPTPFEAFVKVPKFMRPHIQIYCASEWV 478

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LGNR+DLVKDI++ELQPENGEEWWTYVC+GPSDPHPNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 479 LGNRDDLVKDIISELQPENGEEWWTYVCLGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGT 538

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 539 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLYYPGEVFSSSHKPVASLRLERLLSGLQ 598

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           DIEYL LYASR+GRDEG+ALLEKTG Y GPERYT+EH+PIDVMRGEI+N CR
Sbjct: 599 DIEYLKLYASRHGRDEGVALLEKTGVYVGPERYTLEHMPIDVMRGEIYNTCR 650


>gi|449460114|ref|XP_004147791.1| PREDICTED: uncharacterized protein LOC101205217 [Cucumis sativus]
 gi|449476778|ref|XP_004154831.1| PREDICTED: uncharacterized LOC101205217 [Cucumis sativus]
          Length = 649

 Score = 1025 bits (2649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 482/592 (81%), Positives = 533/592 (90%), Gaps = 2/592 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANVGPQEMPR LE INLLAARNERESVQIA+RPK+SW +SS AG+VQV   DLCS 
Sbjct: 58  MPSTANVGPQEMPRRLETINLLAARNERESVQIAMRPKISWGASSVAGIVQVFSGDLCST 117

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           SGDRLVVGQSL LRRVVP+LGVPDALVPLDLPV QI+L+PGETTAVWVSID P  QPPG 
Sbjct: 118 SGDRLVVGQSLRLRRVVPILGVPDALVPLDLPVSQINLLPGETTAVWVSIDVPNMQPPGQ 177

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGEIIIT+ K D E S+Q LGK EKH ++ ELR+CLD +E ++ KPL EVV+R KS   
Sbjct: 178 YEGEIIITAIKTDAESSTQYLGKAEKHEIYKELRSCLDIMEIVDEKPLEEVVKRVKSATA 237

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
           +L+RVI SP FSEF+S+NG ID+MDEDA SNLSVRVK+ LTVWDF +PATPSLPAVIG+S
Sbjct: 238 SLKRVILSPSFSEFYSENGSIDVMDEDAFSNLSVRVKIMLTVWDFTIPATPSLPAVIGVS 297

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           DTVIEDRFGV HG+DEW+EALD HFKWLLQYRISP+FCRWG+ MRVLTYTCPWPADHPKS
Sbjct: 298 DTVIEDRFGVEHGTDEWFEALDDHFKWLLQYRISPYFCRWGDGMRVLTYTCPWPADHPKS 357

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
           DEYFSDPRL+AYAVPY  V   + G KDY+++E+E+LRTK HWKKAYFYLWDEPLNMEH+
Sbjct: 358 DEYFSDPRLSAYAVPYRAVFGGDTG-KDYLQREVEILRTKTHWKKAYFYLWDEPLNMEHF 416

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
            SVR+M+SE+ AYAPDARVLTTYYCGPSDAPL PT FE+FVKVP FLRPHTQIYCTSEWV
Sbjct: 417 DSVRSMSSEIRAYAPDARVLTTYYCGPSDAPLAPTTFEAFVKVPSFLRPHTQIYCTSEWV 476

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LGNREDLVKDI+ ELQPENGEEWWTYVCMGP DPHPNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 477 LGNREDLVKDIIAELQPENGEEWWTYVCMGPGDPHPNWHLGMRGTQHRAVMWRVWKEGGT 536

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSS +PVAS+RLER+LSGLQ
Sbjct: 537 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSHEPVASVRLERLLSGLQ 596

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           DIEYL LYASRYGRDEG+ALLEKTG YQGPERYT +H+P+D+MRGE+FN CR
Sbjct: 597 DIEYLKLYASRYGRDEGVALLEKTGVYQGPERYTHDHMPVDLMRGEVFNLCR 648


>gi|224132110|ref|XP_002321258.1| predicted protein [Populus trichocarpa]
 gi|222862031|gb|EEE99573.1| predicted protein [Populus trichocarpa]
          Length = 652

 Score = 1024 bits (2647), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 487/593 (82%), Positives = 535/593 (90%), Gaps = 4/593 (0%)

Query: 1   MPSTANVGPQEMP-RPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCS 59
           +PSTANVGPQE+P R LEPINLLAARNERESVQIALRPK +W  S +AGVVQVQCSDL S
Sbjct: 61  LPSTANVGPQEIPSRHLEPINLLAARNERESVQIALRPKATWGGSGSAGVVQVQCSDLTS 120

Query: 60  ASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPG 119
            SGDRLVVGQS+ LRRVV +LGVPDALVPLDLPV QI+L PGETTA+WVSID P AQP G
Sbjct: 121 TSGDRLVVGQSITLRRVVSILGVPDALVPLDLPVSQINLAPGETTALWVSIDVPSAQPQG 180

Query: 120 LYEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTA 178
            YEGE  IT+ KA+ E  SQ LGK ++H+L+ ELRNCLD +EP+EGKPL EVVERAKS  
Sbjct: 181 QYEGEFFITAIKAEAESPSQRLGKADRHQLYSELRNCLDIMEPVEGKPLDEVVERAKSVT 240

Query: 179 TTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGI 238
           T+LRRV+ SP+FSEF +DNGP+DMMDEDAISNL+VRVKL+LTVWDF+LPATPSLPAV GI
Sbjct: 241 TSLRRVLLSPVFSEFSTDNGPVDMMDEDAISNLTVRVKLNLTVWDFVLPATPSLPAVFGI 300

Query: 239 SDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPK 298
           SDTVIEDRFGV HGSDEWYEALDQHFKWLL YRISP+FCRWG +MRVLTYTCPWPADHPK
Sbjct: 301 SDTVIEDRFGVEHGSDEWYEALDQHFKWLLHYRISPYFCRWGGNMRVLTYTCPWPADHPK 360

Query: 299 SDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEH 358
           SDEYFSDPRLAAYAVPYS  +     A+DY++KEI++LRTK+HWKKAYFYLWDEPLN+E 
Sbjct: 361 SDEYFSDPRLAAYAVPYSQAVPG--AAQDYLQKEIDILRTKSHWKKAYFYLWDEPLNLEQ 418

Query: 359 YSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEW 418
           Y  VR+MAS++H YAPDARVLTTYYCGPSDAPLGPTPFE+FVKVPKFLRPHTQIYCTSEW
Sbjct: 419 YDMVRSMASKIHTYAPDARVLTTYYCGPSDAPLGPTPFEAFVKVPKFLRPHTQIYCTSEW 478

Query: 419 VLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG 478
           VLG+REDL K+IV+ELQPENGEEWWTYVC+GPSDPHPNWH+GMRG+QHRAV WRVWKEG 
Sbjct: 479 VLGDREDLAKEIVSELQPENGEEWWTYVCLGPSDPHPNWHIGMRGTQHRAVFWRVWKEGA 538

Query: 479 TGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGL 538
           TGFLYWGANCYEKATVPSAEI FRRGLPPGDGVL+YPGEVFSSS QPVAS+RLERILSGL
Sbjct: 539 TGFLYWGANCYEKATVPSAEISFRRGLPPGDGVLYYPGEVFSSSHQPVASVRLERILSGL 598

Query: 539 QDIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           QD+EYL LY SRYGR EGLALLEKTG Y GPERYT+EH PIDVMRGEIFN CR
Sbjct: 599 QDLEYLQLYTSRYGRGEGLALLEKTGMYLGPERYTLEHTPIDVMRGEIFNTCR 651


>gi|356509690|ref|XP_003523579.1| PREDICTED: uncharacterized protein LOC100799554 [Glycine max]
          Length = 644

 Score = 1018 bits (2631), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/592 (79%), Positives = 536/592 (90%), Gaps = 6/592 (1%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MP+TANVGPQ+MPR LEPINLLAARNERESVQIA+RPKVSW  SS AG VQ+QCSDLCS 
Sbjct: 57  MPNTANVGPQDMPRHLEPINLLAARNERESVQIAIRPKVSWGGSSVAGTVQIQCSDLCST 116

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           SGDRL+VGQSL+LRRVVP+LGVPDALVP+DLPV QI+L PGETTA+W+SID P +QPPG 
Sbjct: 117 SGDRLIVGQSLLLRRVVPILGVPDALVPVDLPVSQINLFPGETTALWISIDVPSSQPPGQ 176

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGEI+IT+ K+D ++S     K EKH+L+ +L+ CLD VEPI+GKPL EVVER KST T
Sbjct: 177 YEGEIVITAIKSDADIS-----KVEKHQLYRDLKGCLDIVEPIDGKPLDEVVERVKSTTT 231

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
           +LRR++ SP FSEFFSDNGP+D+MDEDAIS+LS+R+KL+LTVW+F+LP TPSLPAV GIS
Sbjct: 232 SLRRILLSPSFSEFFSDNGPVDVMDEDAISSLSLRMKLNLTVWEFVLPETPSLPAVFGIS 291

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           DTVIEDRFGV+ G+ EWYEALDQHFKWLLQYRISP+FC+W + MRVLTYT PWPADHPKS
Sbjct: 292 DTVIEDRFGVQQGTAEWYEALDQHFKWLLQYRISPYFCKWADGMRVLTYTSPWPADHPKS 351

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
           DEYFSDPRLAAYAVPY  V+S N+ A+DY++K++E+LRTK HW+KAYFYLWDEPLN+E Y
Sbjct: 352 DEYFSDPRLAAYAVPYKQVVSGNNSAEDYLQKQVEILRTKNHWRKAYFYLWDEPLNLEQY 411

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
            SVRNMASE+HAYAPDAR+LTTYYCGP+DAPL PTPF++FVKVP FLRPH QIYCTSEWV
Sbjct: 412 DSVRNMASEIHAYAPDARILTTYYCGPNDAPLAPTPFDAFVKVPSFLRPHNQIYCTSEWV 471

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LGN+EDLVKDI+ ELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 472 LGNQEDLVKDIIAELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGT 531

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWGANCYEKATV SAEI+FR GLPPGDGVL+YPGEVFS+S QPVASLRLERIL+GLQ
Sbjct: 532 GFLYWGANCYEKATVASAEIKFRHGLPPGDGVLYYPGEVFSTSHQPVASLRLERILNGLQ 591

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           D EYL LYASRYGRDE +ALLE+ G Y GPERYT EH+PID MRG+IFNACR
Sbjct: 592 DTEYLRLYASRYGRDESIALLERMGVYFGPERYTFEHMPIDAMRGQIFNACR 643


>gi|225458333|ref|XP_002283035.1| PREDICTED: uncharacterized protein LOC100243809 [Vitis vinifera]
 gi|302142468|emb|CBI19671.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score =  993 bits (2567), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/592 (79%), Positives = 523/592 (88%), Gaps = 4/592 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANVGPQEMPRPLE + LLAARNERESVQIA+RPKVSW  S   G VQVQCSDLCS 
Sbjct: 49  MPSTANVGPQEMPRPLEHVTLLAARNERESVQIAMRPKVSWGGS--GGAVQVQCSDLCSP 106

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           SGDRLVVG+SL LRRVV +LGVPDALVPLDLPV QISL+PGETTA+WVSID P  QPPG 
Sbjct: 107 SGDRLVVGESLKLRRVVSILGVPDALVPLDLPVSQISLLPGETTAIWVSIDVPSTQPPGQ 166

Query: 121 YEGEIIIT-SKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGE+IIT +KAD E  ++CLGK E+ +L+ EL+N L+ VEPI+GKPL EVVER KS  T
Sbjct: 167 YEGELIITATKADAESRAKCLGKAERRQLYSELKNFLEIVEPIDGKPLDEVVERVKSATT 226

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
           TLR +  SP F EFFSD  P+DMMDEDAIS+LSVR+KLSLTVW+F+LP TPSLPAV GIS
Sbjct: 227 TLRSIFQSPSFCEFFSDGHPVDMMDEDAISDLSVRMKLSLTVWNFVLPLTPSLPAVFGIS 286

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           DTVIEDRFGV HG+DEWYEALD HFKWLLQYRISP+FCRWG+ MRVLTYTCPWPA HPKS
Sbjct: 287 DTVIEDRFGVEHGTDEWYEALDHHFKWLLQYRISPYFCRWGDGMRVLTYTCPWPAHHPKS 346

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
           DEYFSDPRLAAYAVPYS V+      KDY+++EIE L+TK HWKKAYFYLWDEPLN+EH+
Sbjct: 347 DEYFSDPRLAAYAVPYSQVVPGG-AEKDYLQREIETLKTKTHWKKAYFYLWDEPLNLEHF 405

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
            ++RNMA E+ AYA DAR+LTTYY GPSDAPL    FE+FVKVPKFLRPHTQIYCTSEWV
Sbjct: 406 DNIRNMACEVQAYARDARILTTYYSGPSDAPLASNNFEAFVKVPKFLRPHTQIYCTSEWV 465

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
            GNREDLVKDI+ ELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 466 FGNREDLVKDIIAELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGT 525

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWGANCYEKATVPSAE+ FRRGLPPGDGVLFYPGEV+S+S +PVAS+RLERILSGLQ
Sbjct: 526 GFLYWGANCYEKATVPSAEVCFRRGLPPGDGVLFYPGEVYSTSHEPVASVRLERILSGLQ 585

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           DIEYL LYASRYG++EG+ALLE+TG Y GPERYT+EH+ ID+MRGEIFNA R
Sbjct: 586 DIEYLKLYASRYGKEEGVALLERTGVYMGPERYTLEHMSIDMMRGEIFNAFR 637


>gi|297852256|ref|XP_002894009.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339851|gb|EFH70268.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 643

 Score =  983 bits (2541), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 458/591 (77%), Positives = 517/591 (87%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MP+T NVG QE PRPLEPINLLAARNERES QIA+RPKVSW++SS +G VQVQCSDLCS+
Sbjct: 51  MPNTVNVGSQETPRPLEPINLLAARNERESFQIAMRPKVSWAASSPSGSVQVQCSDLCSS 110

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           +GDRLVVGQSL LRRVVP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P  QPPG 
Sbjct: 111 AGDRLVVGQSLNLRRVVPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPNRQPPGQ 170

Query: 121 YEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATT 180
           YEGEII+++       S  LGK EK +L +EL NCLD +EPIEGKP+ EVVER K  +++
Sbjct: 171 YEGEIIVSAMKTDGGGSAHLGKHEKDQLCVELNNCLDIMEPIEGKPMDEVVERIKCASSS 230

Query: 181 LRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISD 240
           LRR++FSP FSEF S NG  DMM+ED +SNLS+R+KL LTVW+FI+P TPSLP+VIG+SD
Sbjct: 231 LRRILFSPSFSEFISTNGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPSVIGVSD 290

Query: 241 TVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSD 300
           TVIEDRFGV  GS+EWYE LD HFKWLLQYRISP+FC+WGE MRVLTYT PWPADHPKSD
Sbjct: 291 TVIEDRFGVERGSEEWYEKLDLHFKWLLQYRISPYFCKWGEGMRVLTYTSPWPADHPKSD 350

Query: 301 EYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYS 360
           EY SDPRLAAYAVPY  V++ +D  + Y+RKE+E+LR+K HWKKAYFYLWDEPLNMEH+ 
Sbjct: 351 EYLSDPRLAAYAVPYRQVIAGDDIRESYLRKEVEILRSKPHWKKAYFYLWDEPLNMEHFD 410

Query: 361 SVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVL 420
           SVR MASE++AYAPDARVLTTYYCGP DAPL PTPFESFVKVP  LRPHTQIYCTSEWVL
Sbjct: 411 SVRKMASEIYAYAPDARVLTTYYCGPGDAPLAPTPFESFVKVPNLLRPHTQIYCTSEWVL 470

Query: 421 GNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTG 480
           GNREDLVKDIV ELQ ENGEEWWTY+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTG
Sbjct: 471 GNREDLVKDIVEELQTENGEEWWTYICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTG 530

Query: 481 FLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQD 540
           FLYWGANCYEKATVPSAE++FRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQD
Sbjct: 531 FLYWGANCYEKATVPSAEVKFRRGLPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQD 590

Query: 541 IEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
            EYL LY S+YGR E + LLEKTG Y GPERYT+EH PIDV+RGE++N CR
Sbjct: 591 YEYLKLYESKYGRAEAMGLLEKTGVYTGPERYTLEHRPIDVLRGEVYNTCR 641


>gi|42562571|ref|NP_175129.3| uncharacterized protein [Arabidopsis thaliana]
 gi|30725314|gb|AAP37679.1| At1g45150 [Arabidopsis thaliana]
 gi|110742869|dbj|BAE99332.1| hypothetical protein [Arabidopsis thaliana]
 gi|332193963|gb|AEE32084.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 643

 Score =  974 bits (2518), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 450/591 (76%), Positives = 515/591 (87%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MP+T NVG QE PR LEPINLLAARNERES QIA+RPKVSW++SS +G+VQVQCSDLCS+
Sbjct: 51  MPNTVNVGSQETPRALEPINLLAARNERESFQIAMRPKVSWAASSPSGIVQVQCSDLCSS 110

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           +GDRLVVGQSL LRRVVP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P  QPPG 
Sbjct: 111 AGDRLVVGQSLKLRRVVPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPTGQPPGQ 170

Query: 121 YEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATT 180
           YEGEIII++       S  L K EK +L +EL  CLD +EPIEGKP+ EVVER K  +++
Sbjct: 171 YEGEIIISAMKTDGGGSSHLAKHEKDQLCVELNTCLDIMEPIEGKPMDEVVERIKCASSS 230

Query: 181 LRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISD 240
           LRR++FSP FSEF S NG  DMM+ED +SNLS+R+KL LTVW+FI+P TPSLPAVIG+SD
Sbjct: 231 LRRILFSPSFSEFISTNGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPAVIGVSD 290

Query: 241 TVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSD 300
           TVIEDRF V HGS++WY+ LD HFKWLLQYRISP+FC+WGESMRVLTYT PWPADHPKSD
Sbjct: 291 TVIEDRFAVEHGSEDWYKKLDLHFKWLLQYRISPYFCKWGESMRVLTYTSPWPADHPKSD 350

Query: 301 EYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYS 360
           EY SD RLAAYAVPY  V++ +D  + Y+RKE+E+LR+K HW KAYFYLWDEPLNMEH+ 
Sbjct: 351 EYLSDSRLAAYAVPYRQVIAGDDSRESYLRKEVEILRSKPHWNKAYFYLWDEPLNMEHFD 410

Query: 361 SVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVL 420
           +VR MASE++AYAPD+RVLTTYYCGP DAPL PTPFESFVKVP  LRP+TQIYCTSEWVL
Sbjct: 411 NVRKMASEIYAYAPDSRVLTTYYCGPGDAPLAPTPFESFVKVPNLLRPYTQIYCTSEWVL 470

Query: 421 GNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTG 480
           GNREDLVKDI+ ELQ ENGEEWWTY+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTG
Sbjct: 471 GNREDLVKDILDELQTENGEEWWTYICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTG 530

Query: 481 FLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQD 540
           FLYWGANCYEKATVPSAE++FRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQD
Sbjct: 531 FLYWGANCYEKATVPSAEVKFRRGLPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQD 590

Query: 541 IEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
            EYL LY S+YGR+E + LLEKTG Y GPERYT+EH PIDV+RGE++N CR
Sbjct: 591 YEYLKLYESKYGREEAMGLLEKTGVYTGPERYTLEHRPIDVLRGEVYNTCR 641


>gi|7767671|gb|AAF69168.1|AC007915_20 F27F5.22 [Arabidopsis thaliana]
          Length = 687

 Score =  945 bits (2443), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 450/643 (69%), Positives = 515/643 (80%), Gaps = 52/643 (8%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MP+T NVG QE PR LEPINLLAARNERES QIA+RPKVSW++SS +G+VQVQCSDLCS+
Sbjct: 43  MPNTVNVGSQETPRALEPINLLAARNERESFQIAMRPKVSWAASSPSGIVQVQCSDLCSS 102

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           +GDRLVVGQSL LRRVVP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P  QPPG 
Sbjct: 103 AGDRLVVGQSLKLRRVVPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPTGQPPGQ 162

Query: 121 YEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATT 180
           YEGEIII++       S  L K EK +L +EL  CLD +EPIEGKP+ EVVER K  +++
Sbjct: 163 YEGEIIISAMKTDGGGSSHLAKHEKDQLCVELNTCLDIMEPIEGKPMDEVVERIKCASSS 222

Query: 181 LRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISD 240
           LRR++FSP FSEF S NG  DMM+ED +SNLS+R+KL LTVW+FI+P TPSLPAVIG+SD
Sbjct: 223 LRRILFSPSFSEFISTNGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPAVIGVSD 282

Query: 241 TVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWP------- 293
           TVIEDRF V HGS++WY+ LD HFKWLLQYRISP+FC+WGESMRVLTYT PWP       
Sbjct: 283 TVIEDRFAVEHGSEDWYKKLDLHFKWLLQYRISPYFCKWGESMRVLTYTSPWPANRFASR 342

Query: 294 -------------ADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKA 340
                        ADHPKSDEY SD RLAAYAVPY  V++ +D  + Y+RKE+E+LR+K 
Sbjct: 343 SELSICVPLFGFTADHPKSDEYLSDSRLAAYAVPYRQVIAGDDSRESYLRKEVEILRSKP 402

Query: 341 HWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           HW KAYFYLWDEPLNMEH+ +VR MASE++AYAPD+RVLTTYYCGP DAPL PTPFESFV
Sbjct: 403 HWNKAYFYLWDEPLNMEHFDNVRKMASEIYAYAPDSRVLTTYYCGPGDAPLAPTPFESFV 462

Query: 401 KVPKFLRPHTQIYCT--------------------------------SEWVLGNREDLVK 428
           KVP  LRP+TQIYCT                                SEWVLGNREDLVK
Sbjct: 463 KVPNLLRPYTQIYCTSKYVFGLKFSLFRHSPTWIDMEAVLLNHGLIFSEWVLGNREDLVK 522

Query: 429 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 488
           DI+ ELQ ENGEEWWTY+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTGFLYWGANC
Sbjct: 523 DILDELQTENGEEWWTYICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTGFLYWGANC 582

Query: 489 YEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYA 548
           YEKATVPSAE++FRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQD EYL LY 
Sbjct: 583 YEKATVPSAEVKFRRGLPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQDYEYLKLYE 642

Query: 549 SRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           S+YGR+E + LLEKTG Y GPERYT+EH PIDV+RGE++N CR
Sbjct: 643 SKYGREEAMGLLEKTGVYTGPERYTLEHRPIDVLRGEVYNTCR 685


>gi|115473013|ref|NP_001060105.1| Os07g0581300 [Oryza sativa Japonica Group]
 gi|33146840|dbj|BAC79829.1| unknown protein [Oryza sativa Japonica Group]
 gi|50509223|dbj|BAD30493.1| unknown protein [Oryza sativa Japonica Group]
 gi|113611641|dbj|BAF22019.1| Os07g0581300 [Oryza sativa Japonica Group]
 gi|215737152|dbj|BAG96081.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 658

 Score =  908 bits (2347), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/592 (70%), Positives = 501/592 (84%), Gaps = 1/592 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQVQC+DLCS+
Sbjct: 66  MPSTANVSQQEAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSS 125

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           +GDRLVVGQS+ LRRVVPMLGVPDALVP+D    QI+L+PGET+A+WVS++ P  Q PGL
Sbjct: 126 AGDRLVVGQSVTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGL 185

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGEI I++ +A+ E   + L K E+++L+ ELRNC+D  EP +     E+V+R  S +T
Sbjct: 186 YEGEIFISAVRAEAESRGESLTKSERYQLYKELRNCIDITEPRDYSSSEEMVQRLTSAST 245

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
           TLRR++  P F +   +NG  DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS
Sbjct: 246 TLRRMLALPSFQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGIS 305

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           +TVIEDRF + HG+  WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+
Sbjct: 306 ETVIEDRFCLEHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKA 365

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
            EY+SDPRLAAYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y
Sbjct: 366 KEYYSDPRLAAYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQY 425

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
             + ++++EL +YA D R+LTTYYCGPS + L P+ FE+FVKVP  LRPHTQI+CTSEWV
Sbjct: 426 DVICSISNELRSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWV 485

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LG REDLVKDIV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 486 LGTREDLVKDIVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGT 545

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWG NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 546 GFLYWGTNCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           DIEYL LY+SRYGR+EGLALLEKTG Y GP+RY ++H PIDVMRGE++  CR
Sbjct: 606 DIEYLKLYSSRYGREEGLALLEKTGVYLGPDRYALDHGPIDVMRGEVYRTCR 657


>gi|293331693|ref|NP_001169555.1| uncharacterized protein LOC100383434 [Zea mays]
 gi|224030081|gb|ACN34116.1| unknown [Zea mays]
          Length = 657

 Score =  904 bits (2335), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 422/592 (71%), Positives = 498/592 (84%), Gaps = 1/592 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQ+QC+DLCS+
Sbjct: 65  MPSTANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSS 124

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           SGDRLVVGQS+ LRRVVP+LGVPDALVP+D    QIS+ PGET AVWVS++ P  QPPGL
Sbjct: 125 SGDRLVVGQSITLRRVVPILGVPDALVPIDPLSPQISIQPGETAAVWVSVNVPCGQPPGL 184

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGEI IT+ K + +  ++ L K EK RL+ ELR+CLD   P +     E+V+R  S +T
Sbjct: 185 YEGEIFITAVKTELDSRTESLPKSEKCRLYRELRSCLDLTGPRDYSSPEEMVQRLTSAST 244

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
            LRRV+ +P   +   +NG  DMMDED I+N+SVR+KLSLTVWDF LP TPSLPAV GIS
Sbjct: 245 VLRRVLDNPALQDCQENNGFGDMMDEDVINNISVRLKLSLTVWDFTLPVTPSLPAVFGIS 304

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           +TVIEDRF + HG++ WY ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+
Sbjct: 305 ETVIEDRFCLEHGTEGWYSALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKA 364

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
           +EY+SDPRLAAYAVPY+P+LS  D AK+ +R+E+E+L++K HW KAYFYLWDEPLN+E Y
Sbjct: 365 NEYYSDPRLAAYAVPYAPILSCTDAAKNSLRREVEILKSKPHWSKAYFYLWDEPLNVEQY 424

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
             + N+++EL +YAPD R+LTTYYCGPS + L P+ FE+F KVP  LRPHTQI+CTSEWV
Sbjct: 425 DMICNISNELRSYAPDVRILTTYYCGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEWV 484

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LG REDLVKDIV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 485 LGTREDLVKDIVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGT 544

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWG+NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 545 GFLYWGSNCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 604

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           DIEYL LY+SRYGR+EGLAL+EKTG Y GP+RYT++H P+DVMRGE++  CR
Sbjct: 605 DIEYLKLYSSRYGREEGLALIEKTGMYLGPDRYTLDHGPVDVMRGEVYRTCR 656


>gi|222637345|gb|EEE67477.1| hypothetical protein OsJ_24889 [Oryza sativa Japonica Group]
          Length = 709

 Score =  904 bits (2335), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/588 (71%), Positives = 499/588 (84%), Gaps = 1/588 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQVQC+DLCS+
Sbjct: 66  MPSTANVSQQEAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSS 125

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           +GDRLVVGQS+ LRRVVPMLGVPDALVP+D    QI+L+PGET+A+WVS++ P  Q PGL
Sbjct: 126 AGDRLVVGQSVTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGL 185

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGEI I++ +A+ E   + L K E+++L+ ELRNC+D  EP +     E+V+R  S +T
Sbjct: 186 YEGEIFISAVRAEAESRGESLTKSERYQLYKELRNCIDITEPRDYSSSEEMVQRLTSAST 245

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
           TLRR++  P F +   +NG  DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS
Sbjct: 246 TLRRMLALPSFQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGIS 305

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           +TVIEDRF + HG+  WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+
Sbjct: 306 ETVIEDRFCLEHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKA 365

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
            EY+SDPRLAAYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y
Sbjct: 366 KEYYSDPRLAAYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQY 425

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
             + ++++EL +YA D R+LTTYYCGPS + L P+ FE+FVKVP  LRPHTQI+CTSEWV
Sbjct: 426 DVICSISNELRSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWV 485

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LG REDLVKDIV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 486 LGTREDLVKDIVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGT 545

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWG NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 546 GFLYWGTNCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIF 587
           DIEYL LY+SRYGR+EGLALLEKTG Y GP+RY ++H PIDVMRGE++
Sbjct: 606 DIEYLKLYSSRYGREEGLALLEKTGVYLGPDRYALDHGPIDVMRGEVY 653


>gi|242046098|ref|XP_002460920.1| hypothetical protein SORBIDRAFT_02g037540 [Sorghum bicolor]
 gi|241924297|gb|EER97441.1| hypothetical protein SORBIDRAFT_02g037540 [Sorghum bicolor]
          Length = 657

 Score =  903 bits (2333), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/592 (70%), Positives = 500/592 (84%), Gaps = 1/592 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE+PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQ+QC+DLCS+
Sbjct: 65  MPSTANVSQQEVPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSS 124

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           SGDRLVVGQS+ LRRVVP+LGVPDALVP+D    Q++L PGET AVWVS++ P  QPPGL
Sbjct: 125 SGDRLVVGQSITLRRVVPILGVPDALVPIDPLSPQVTLQPGETAAVWVSLNVPCGQPPGL 184

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGEI IT+ K + +  ++ L K EK+RL+ ELR+CLD   P +     E+V+R  S ++
Sbjct: 185 YEGEIFITAVKTELDSRTESLPKSEKYRLYRELRSCLDLTGPRDYSSPEEMVQRLTSASS 244

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
            LRRV+ +P   +   +NG  DMMDED ++N+SVR+KLSLTVWDF LP TPSLPAV GIS
Sbjct: 245 ALRRVLDNPALQDCQENNGFGDMMDEDVMNNVSVRLKLSLTVWDFTLPVTPSLPAVFGIS 304

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           +TVIEDRF + HG++ WY ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+
Sbjct: 305 ETVIEDRFCLEHGTEGWYSALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKA 364

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
           +EY+SDPRLAAYAVPY+P+LS  D AK+ +R+E+E+L++K HW KAYFYLWDEPLN+E Y
Sbjct: 365 NEYYSDPRLAAYAVPYAPILSCTDAAKNSLRREVEILKSKPHWSKAYFYLWDEPLNVEQY 424

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
             + N+++EL +YAPD R+LTTYYCGPS + L P+ FE+FVKVP  LRPHTQI+CTSEWV
Sbjct: 425 DMICNISNELRSYAPDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWV 484

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LG REDLVKDI+ EL+P+ GEEWWTYVCMGPSDP PNWH+GMRG+QHRAVMWRVWKEGGT
Sbjct: 485 LGTREDLVKDIIAELRPDLGEEWWTYVCMGPSDPQPNWHIGMRGTQHRAVMWRVWKEGGT 544

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWG NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 545 GFLYWGTNCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 604

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           DIEYL LY+SRYGR+EGLAL+EKTG Y GP+RYT++H P+DVMRGE++  CR
Sbjct: 605 DIEYLKLYSSRYGREEGLALMEKTGMYLGPDRYTLDHGPVDVMRGEVYRTCR 656


>gi|218199904|gb|EEC82331.1| hypothetical protein OsI_26624 [Oryza sativa Indica Group]
          Length = 709

 Score =  901 bits (2329), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 416/588 (70%), Positives = 499/588 (84%), Gaps = 1/588 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQVQC+DLCS+
Sbjct: 66  MPSTANVSQQEAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSS 125

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           +GDRLVVGQS+ LRRVVPMLGVPDALVP+D    QI+L+PGET+A+WVS++ P  Q PGL
Sbjct: 126 AGDRLVVGQSVTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGL 185

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           YEGEI +++ +A++E   + L K E+++L+ ELRNC+D  E  +     E+V+R  S +T
Sbjct: 186 YEGEIFLSAVRAESESRGESLTKSERYQLYKELRNCIDITETRDYSSSEEMVQRLTSAST 245

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
           TLRR++  P F +   +NG  DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS
Sbjct: 246 TLRRMLALPSFQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGIS 305

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           +TVIEDRF + HG+  WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+
Sbjct: 306 ETVIEDRFCLEHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKA 365

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHY 359
            EY+SDPRLAAYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y
Sbjct: 366 KEYYSDPRLAAYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQY 425

Query: 360 SSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWV 419
             + ++++EL +YA D R+LTTYYCGPS + L P+ FE+FVKVP  LRPHTQI+CTSEWV
Sbjct: 426 DVICSISNELRSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWV 485

Query: 420 LGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
           LG REDLVKDIV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGT
Sbjct: 486 LGTREDLVKDIVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGT 545

Query: 480 GFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           GFLYWG NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 546 GFLYWGTNCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605

Query: 540 DIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIF 587
           DIEYL LY+SRYGR+EGLALLEKTG Y GP+RY ++H PIDVMRGE++
Sbjct: 606 DIEYLKLYSSRYGREEGLALLEKTGVYLGPDRYALDHGPIDVMRGEVY 653


>gi|357122237|ref|XP_003562822.1| PREDICTED: uncharacterized protein LOC100840095 [Brachypodium
           distachyon]
          Length = 657

 Score =  901 bits (2328), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 419/592 (70%), Positives = 500/592 (84%), Gaps = 3/592 (0%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW SS  AG VQ+QC+DLCS+
Sbjct: 65  MPSTANVSQQEAPRPLEHVNLLAARNERESFQIALRPKVSWISSGIAGPVQIQCTDLCSS 124

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVC-QISLIPGETTAVWVSIDAPYAQPPG 119
           SGDRLVVGQS+ LRRVVPMLGVPDALVP+D P+C QI+L+PGET+A+WVS++ P  Q PG
Sbjct: 125 SGDRLVVGQSVTLRRVVPMLGVPDALVPID-PLCPQINLLPGETSAIWVSLNVPCGQQPG 183

Query: 120 LYEGEIIIT-SKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTA 178
           LYEGEI IT ++A+T+  ++ L K E+++L+ ELR CLD  E  +     E+V+R  ST+
Sbjct: 184 LYEGEIFITATRAETDSRAESLPKSERYQLYRELRTCLDITESRDCSTPEEMVQRLTSTS 243

Query: 179 TTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGI 238
           TTL+R++  P F +   +NG  DMMDED ++N++VRVKLSLTVWDF LP TPSLPAV GI
Sbjct: 244 TTLKRMLVLPAFQDCQENNGLGDMMDEDVMNNVAVRVKLSLTVWDFTLPLTPSLPAVFGI 303

Query: 239 SDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPK 298
           S+TVIEDRF + HG+  WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK
Sbjct: 304 SETVIEDRFCLEHGTKGWYDALDDHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPK 363

Query: 299 SDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEH 358
           + EY+SDPRLAAYAVPY+P+LS  D A++ +R+E+++L+T+AHW KAYFYLWDEPLNME 
Sbjct: 364 AKEYYSDPRLAAYAVPYAPILSCTDAARNSLRREVDILKTEAHWSKAYFYLWDEPLNMEQ 423

Query: 359 YSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEW 418
           Y  +RN+++EL  Y PD R+LTTYY GPS + L P+ FE+F KVP  LRPHTQI+CTSEW
Sbjct: 424 YEVIRNISNELRTYTPDVRILTTYYAGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEW 483

Query: 419 VLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG 478
           VLG REDLVKDI+ EL+PE GEEWWTYVC+GP+DP PNWHLGMRG+QHRAVMWRVWKEGG
Sbjct: 484 VLGTREDLVKDIIAELRPELGEEWWTYVCLGPTDPQPNWHLGMRGTQHRAVMWRVWKEGG 543

Query: 479 TGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGL 538
           TGFLYWG NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVASLRLERILSG+
Sbjct: 544 TGFLYWGTNCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASLRLERILSGM 603

Query: 539 QDIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNAC 590
           QDIEYLNLY+S+YGR+EGLALL+KTG Y GP+RY  +H PID MRGE++  C
Sbjct: 604 QDIEYLNLYSSKYGREEGLALLDKTGVYLGPDRYAHDHGPIDAMRGEVYRTC 655


>gi|168042677|ref|XP_001773814.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674929|gb|EDQ61431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 651

 Score =  786 bits (2030), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/603 (62%), Positives = 471/603 (78%), Gaps = 15/603 (2%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTA +G QE PRPLE ++LLAARNERES QIALRPK+SW+S    G +Q+ CSD CS 
Sbjct: 49  MPSTAIIGHQEPPRPLERVSLLAARNERESAQIALRPKMSWTSGDMVGYLQIHCSDFCSP 108

Query: 61  SGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGL 120
           SGDRL  G+ + +RRVVP+LGVPDALVP+DLP  +I L+PGET A+WVS D P  QPPG+
Sbjct: 109 SGDRLNAGKEVTIRRVVPILGVPDALVPIDLP-SRIGLLPGETCALWVSFDVPVTQPPGV 167

Query: 121 YEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTAT 179
           Y GEI IT+ + +TE +++ + + EK ++  +L+  L   E    +    + E  +S   
Sbjct: 168 YIGEIWITAVRGETEFAAEKV-ESEKLQMKKDLQGFLAQAEAASNESAEVLTEALRSICE 226

Query: 180 TLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGIS 239
            L +V+ SPL S    D G +++ DE+  ++ SV+V+ S+TVWDF+LP TPSLPAV GIS
Sbjct: 227 GLHQVLQSPLLSAGCEDFGKMEI-DEEFQASPSVQVQFSITVWDFVLPITPSLPAVFGIS 285

Query: 240 DTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKS 299
           +TVIEDR+ ++HGS EW+++L+ HF WLLQYR+SP+FCRWG++MRVLTYTCP+PA HPKS
Sbjct: 286 ETVIEDRYNLKHGSKEWFKSLNMHFDWLLQYRLSPYFCRWGDNMRVLTYTCPYPATHPKS 345

Query: 300 DEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDE------- 352
           ++Y+SDPRLAAYAVPY PVLSS+D AKD V+ E+E+L+TK HWKKAYFYLWDE       
Sbjct: 346 EDYYSDPRLAAYAVPYIPVLSSSDTAKDVVKSELEILKTKPHWKKAYFYLWDEARISTRS 405

Query: 353 ---PLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPH 409
              P+  E Y  +R++A E+   APDAR+LTTYYCGPSD  +    FESF+KVP FLRPH
Sbjct: 406 QHGPVGFEQYEVIRSIAEEIRNTAPDARILTTYYCGPSDPSMKLDGFESFLKVPTFLRPH 465

Query: 410 TQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAV 469
           TQI+CTSEWVLG REDLVK I  E+Q +  EEWWTYVCMGP + HPNWHLGMRG+QHRAV
Sbjct: 466 TQIFCTSEWVLGGREDLVKQITDEIQFDRSEEWWTYVCMGPGELHPNWHLGMRGTQHRAV 525

Query: 470 MWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVAS 528
           +WRVWKEGGTGFLYWG NCYEKA+ PSAEIRFRRGLPPGDGVLFYPGEVF+  +  PVAS
Sbjct: 526 IWRVWKEGGTGFLYWGVNCYEKASSPSAEIRFRRGLPPGDGVLFYPGEVFNIGATLPVAS 585

Query: 529 LRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFN 588
           +RLER+LSG+QD EYL LY+S YGR E L+LLE+TGTY GPERYT+EH PI+ +RGEI++
Sbjct: 586 VRLERLLSGMQDYEYLQLYSSMYGRHEALSLLERTGTYTGPERYTLEHNPIESLRGEIYH 645

Query: 589 ACR 591
           A R
Sbjct: 646 AYR 648


>gi|302755140|ref|XP_002960994.1| hypothetical protein SELMODRAFT_74245 [Selaginella moellendorffii]
 gi|300171933|gb|EFJ38533.1| hypothetical protein SELMODRAFT_74245 [Selaginella moellendorffii]
          Length = 633

 Score =  769 bits (1985), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/594 (61%), Positives = 460/594 (77%), Gaps = 18/594 (3%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTA VG QE PR L+ +NLL ARNERES QIALRPK+SW+     G VQV C D  SA
Sbjct: 52  MPSTATVGHQEPPRALDQLNLLIARNERESAQIALRPKISWACGGAVGHVQVHCRDFVSA 111

Query: 61  SGDRLVVGQ-SLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPG 119
           SGDR  +   S+ LRRVVP+LGVPDALVP+ +P CQ+SL+PGET+A+W+S+  P +Q PG
Sbjct: 112 SGDRWAIELLSVSLRRVVPILGVPDALVPVSMPTCQVSLLPGETSALWLSVHVPSSQTPG 171

Query: 120 LYEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTA 178
           +YEGE+  ++ KAD E S   + +G+K    +ELR  ++NV           +E  +   
Sbjct: 172 VYEGEMTFSAVKADAEFS---VDEGDK----LELRKMVENVAAKMDDTRQNPMELLEEVR 224

Query: 179 TTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGI 238
             L+ ++  P  +     NG +++ +E    +LS+++K+S+TVWDF+LP TP+LPAV G+
Sbjct: 225 QDLQHLLDHPALAH----NGKMEIDEE----SLSLKLKISITVWDFVLPVTPTLPAVFGV 276

Query: 239 SDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPK 298
           S+TVIEDRF V HGS  WY ALD+H++WLLQ+RISP+FCRWG++MR+L YTCPWPADH K
Sbjct: 277 SETVIEDRFNVEHGSSGWYNALDRHYQWLLQFRISPYFCRWGDNMRILAYTCPWPADHVK 336

Query: 299 SDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEH 358
           ++EY+SDPRLAAYAVPY+PVLS+++  KD V +EIE+L TK HW+K+YFYLWDEPL+ + 
Sbjct: 337 AEEYYSDPRLAAYAVPYAPVLSNSNAVKDLVTREIEILSTKEHWRKSYFYLWDEPLSSDQ 396

Query: 359 YSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEW 418
           Y  +R M+ E+ + AP++R+LTTYY GPSD    P  FE+F+KVP FLRPHTQI+CTSEW
Sbjct: 397 YDFIRTMSEEIRSIAPNSRILTTYYSGPSDVQYPPGSFEAFIKVPSFLRPHTQIFCTSEW 456

Query: 419 VLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG 478
           VLG REDLVK+IV ELQP+  EEWWTYVCMGPSDPHPNWHLGMRG+QHR V+WR WKEGG
Sbjct: 457 VLGGREDLVKEIVAELQPDQREEWWTYVCMGPSDPHPNWHLGMRGTQHRGVLWRAWKEGG 516

Query: 479 TGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVASLRLERILSG 537
           +GFLYWG NCYEK+  P+AEIRFRRGLPPGDGVLFYPGEVF+  S +PV+S+RLER+LSG
Sbjct: 517 SGFLYWGTNCYEKSLCPAAEIRFRRGLPPGDGVLFYPGEVFTPGSSEPVSSVRLERVLSG 576

Query: 538 LQDIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           LQD+EYLNLYAS +G+   +ALLEK G Y GPERYT +H  ID MR EI+ +CR
Sbjct: 577 LQDLEYLNLYASCFGKPASVALLEKCGVYLGPERYTQDHAAIDAMRTEIYCSCR 630


>gi|302767188|ref|XP_002967014.1| hypothetical protein SELMODRAFT_144564 [Selaginella moellendorffii]
 gi|300165005|gb|EFJ31613.1| hypothetical protein SELMODRAFT_144564 [Selaginella moellendorffii]
          Length = 582

 Score =  766 bits (1979), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/594 (61%), Positives = 457/594 (76%), Gaps = 18/594 (3%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTA VG QE PR L+ +NLL ARNERES QIALRPK+SW+     G VQV C D  S 
Sbjct: 1   MPSTATVGHQEPPRALDQLNLLIARNERESAQIALRPKISWACGGAVGHVQVHCRDFVSV 60

Query: 61  SGDRLVVGQ-SLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPG 119
           SGDR  +   S+ LRRVVP+LGVPDALVP+ +P CQ+SL+PGET+A+W+S+  P +Q PG
Sbjct: 61  SGDRWAIELLSVSLRRVVPILGVPDALVPVSMPTCQVSLLPGETSALWLSVHVPSSQTPG 120

Query: 120 LYEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTA 178
           +YEGE+  ++ KAD E     + +GEK    +ELR  ++ V           +E  +   
Sbjct: 121 VYEGEMTFSAAKADAEF---FVDEGEK----LELRKMVETVAAKMDDTRQNPMELLEEVR 173

Query: 179 TTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGI 238
             LR ++  P  +     NG +++ +E    +LS+++K+S+TVWDF+LP TP+LPAV G+
Sbjct: 174 QDLRHLLDHPALAH----NGKMEIDEE----SLSLKLKISITVWDFVLPVTPTLPAVFGV 225

Query: 239 SDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPK 298
           S+TVIEDRF V HGS +WY ALD+H++WLLQ+RISP+FCRWG++MR+L YTCPWPADH K
Sbjct: 226 SETVIEDRFNVEHGSSDWYNALDRHYQWLLQFRISPYFCRWGDNMRILAYTCPWPADHVK 285

Query: 299 SDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEH 358
           ++EY+SDPRLAAYAVPY+PVLS+++  KD V +EIE+L TK HW+K+YFYLWDEPL+ + 
Sbjct: 286 AEEYYSDPRLAAYAVPYAPVLSNSNAVKDLVTREIEILSTKEHWRKSYFYLWDEPLSSDQ 345

Query: 359 YSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEW 418
           Y  +R M+ E+ + AP+ R+LTTYY GPSD    P  FE+F+KVP FLRPHTQI+CTSEW
Sbjct: 346 YDFIRTMSEEIRSIAPNTRILTTYYSGPSDVQYPPGSFEAFIKVPSFLRPHTQIFCTSEW 405

Query: 419 VLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG 478
           VLG REDLVK+IV ELQP+  EEWWTYVCMGPSDPHPNWHLGMRG+Q R V+WRVWKEGG
Sbjct: 406 VLGGREDLVKEIVAELQPDQREEWWTYVCMGPSDPHPNWHLGMRGTQQRGVLWRVWKEGG 465

Query: 479 TGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVASLRLERILSG 537
           +GFLYWG NCYEK+  P+AEIRFRRGLPPGDGVLFYPGEVF+  S +PV+S+RLER+LSG
Sbjct: 466 SGFLYWGTNCYEKSLCPAAEIRFRRGLPPGDGVLFYPGEVFTPGSSEPVSSVRLERVLSG 525

Query: 538 LQDIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFNACR 591
           LQD+EYLNLYAS +G+   +ALLEK G Y GPERYT +H  ID MR EI+ +CR
Sbjct: 526 LQDLEYLNLYASCFGKPASVALLEKCGVYLGPERYTQDHAAIDAMRTEIYCSCR 579


>gi|414590657|tpg|DAA41228.1| TPA: hypothetical protein ZEAMMB73_917393 [Zea mays]
          Length = 721

 Score =  717 bits (1852), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/681 (55%), Positives = 454/681 (66%), Gaps = 115/681 (16%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG VQ+QC+DLCS+
Sbjct: 65  MPSTANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSS 124

Query: 61  SGDRLVVGQSLMLRRVVPML-----GVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYA 115
           SGDR    +       +PM+     GVPDALVP+D    QIS+ PGET AVWVS++ P  
Sbjct: 125 SGDR----EDQHSHSSIPMVINVVPGVPDALVPIDPLSPQISIQPGETAAVWVSVNVPCG 180

Query: 116 QPPGLYEGEIIITS-KADTELSS-------------------------QCLGKGEKHRLF 149
           QPPGLYEGEI IT+ K + E+ S                         + L K EK RL+
Sbjct: 181 QPPGLYEGEIFITAVKTELEILSNLVTLALISGLYFFADLISGSSSRTESLPKSEKCRLY 240

Query: 150 MELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAIS 209
            ELR+CLD   P +     E+V+R  S +T LRRV+ +P   +   +NG  DMMDED I+
Sbjct: 241 RELRSCLDLTGPRDYSSPEEMVQRLTSASTVLRRVLDNPALQDCQENNGFGDMMDEDVIN 300

Query: 210 NLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQ 269
           N+SVR+KLSLTVWDF LP TPSLPAV G+S                W     Q+   L  
Sbjct: 301 NISVRLKLSLTVWDFTLPVTPSLPAVFGVS----------------W-----QYSFLLCS 339

Query: 270 YRISPFFCRWG---ESMRVLTYTCP---------------------WPADHPKSDEYFSD 305
             IS     +G     +R++    P                        DHPK++EY+SD
Sbjct: 340 ISISVLCNCYGTGCNGLRLMGLLGPAELAATEGSVGIAAALAKPAAGTPDHPKANEYYSD 399

Query: 306 PRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEP 353
           PRLAAYAVPY+P+LS            S D AK+ +R+E+E+L++K HW KAYFYLWDEP
Sbjct: 400 PRLAAYAVPYAPILSCLLLYLIWLLVNSTDAAKNSLRREVEILKSKPHWSKAYFYLWDEP 459

Query: 354 LNMEHYSSVRNMASELHAYAPDARVLTTYYCG-----------------------PSDAP 390
           LN+E Y  + N+++EL +YAPD R+LTTYYCG                       PS + 
Sbjct: 460 LNVEQYDMICNISNELRSYAPDVRILTTYYCGATCADLEHPVGVPGCPLSSRAAGPSGSE 519

Query: 391 LGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGP 450
           L P+ FE+F KVP  LRPHTQI+CTSEWVLG REDLVKDIV EL+P+ GEEWWTYVCMGP
Sbjct: 520 LAPSTFEAFAKVPNVLRPHTQIFCTSEWVLGTREDLVKDIVAELRPDLGEEWWTYVCMGP 579

Query: 451 SDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDG 510
           SDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG+NCYEKA +PSAEI FRRGLPPGDG
Sbjct: 580 SDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGSNCYEKAMIPSAEICFRRGLPPGDG 639

Query: 511 VLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQGPE 570
           VLFYPGEVFSSS +PVAS RLERILSG+QDIEYL LY+SRYGR+EGLAL+EKTG Y GP+
Sbjct: 640 VLFYPGEVFSSSHEPVASTRLERILSGMQDIEYLKLYSSRYGREEGLALIEKTGMYLGPD 699

Query: 571 RYTIEHLPIDVMRGEIFNACR 591
           RYT++H P+DVMRGE++  CR
Sbjct: 700 RYTLDHGPVDVMRGEVYRTCR 720


>gi|388508256|gb|AFK42194.1| unknown [Medicago truncatula]
          Length = 388

 Score =  715 bits (1845), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/389 (83%), Positives = 361/389 (92%), Gaps = 1/389 (0%)

Query: 203 MDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQ 262
           M+EDAISNLS+R+KL+LTVW+F+LP TPSLPAV GISDTVIEDRFGV+HG+ EWYEALDQ
Sbjct: 1   MEEDAISNLSLRLKLNLTVWEFVLPETPSLPAVFGISDTVIEDRFGVKHGTAEWYEALDQ 60

Query: 263 HFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSN 322
           HFKWLLQYRISP+FC+W + MRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPY  V+S N
Sbjct: 61  HFKWLLQYRISPYFCKWADGMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYKQVVSGN 120

Query: 323 DGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTY 382
           D AKDY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SVRNMAS++HAYAPDAR+LTTY
Sbjct: 121 DAAKDYLQKQVEILRTKNHWRKAYFYLWDEPLNLEQYDSVRNMASDIHAYAPDARILTTY 180

Query: 383 YCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEW 442
           YCGP+DAPL PTPFE+FVKVP FLRPH QIYCTSEWVLGNREDLVKDI+ ELQPENGEEW
Sbjct: 181 YCGPNDAPLAPTPFEAFVKVPSFLRPHNQIYCTSEWVLGNREDLVKDIIAELQPENGEEW 240

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR 502
           WTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFLYWGANCYEKATV SAEI+FR
Sbjct: 241 WTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGANCYEKATVASAEIKFR 300

Query: 503 RGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
            GLPPGDGVL+YPGEVFS++ QPVASLRLER+LSGLQDIEYL LYASRYGRDE  ALL++
Sbjct: 301 HGLPPGDGVLYYPGEVFSTN-QPVASLRLERLLSGLQDIEYLRLYASRYGRDEATALLDR 359

Query: 563 TGTYQGPERYTIEHLPIDVMRGEIFNACR 591
            G Y GPERYT EH+PID MRG+IFN CR
Sbjct: 360 MGVYFGPERYTHEHMPIDAMRGQIFNLCR 388


>gi|326494652|dbj|BAJ94445.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 408

 Score =  517 bits (1332), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 240/378 (63%), Positives = 301/378 (79%), Gaps = 1/378 (0%)

Query: 64  RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 123
           RLVVGQS+ LRRVVP+LGVPDALVP+D    QI+L+PGETTAVW+S++ P  Q PGLYEG
Sbjct: 5   RLVVGQSITLRRVVPILGVPDALVPIDPSSPQINLLPGETTAVWISLNVPCGQQPGLYEG 64

Query: 124 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 182
           EI IT+ +AD++  +  L K E+++L+  L+ CLD  E  +     E++ R  ST+TTLR
Sbjct: 65  EIFITAVRADSDSRADSLLKSERYQLYKGLKTCLDITESRDHLSSEEMILRLSSTSTTLR 124

Query: 183 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 242
           R++  P F ++  +NG  DMMDED ++N++VRVKLSLTVWDF LP TPSLPAV GIS+TV
Sbjct: 125 RMLVLPAFQDYHENNGLGDMMDEDVLNNVAVRVKLSLTVWDFTLPLTPSLPAVFGISETV 184

Query: 243 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 302
           IEDRF + HG+  WY+ALD HF WLLQYRISPFFCRWG+SMR+L YTCPWP DHPK++EY
Sbjct: 185 IEDRFCLEHGTKGWYDALDHHFGWLLQYRISPFFCRWGDSMRILAYTCPWPTDHPKANEY 244

Query: 303 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 362
           +SDPRLAAYAVPY+P+LS  D AK+ +R+E+E+L+T+ HW KAYFYLWDEPLNME Y  +
Sbjct: 245 YSDPRLAAYAVPYAPILSCTDAAKNSLRREVEILKTEPHWSKAYFYLWDEPLNMEQYEVI 304

Query: 363 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 422
            N+++EL  Y PD R+LTTYY GPS + L P+ FE+F KVP  LRPHTQI+CTSEWVLG 
Sbjct: 305 CNISNELRTYTPDVRILTTYYAGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEWVLGT 364

Query: 423 REDLVKDIVTELQPENGE 440
           REDLVKDI+ EL+P+ GE
Sbjct: 365 REDLVKDIIAELRPDLGE 382


>gi|404484502|ref|ZP_11019706.1| hypothetical protein HMPREF9448_00112 [Barnesiella intestinihominis
           YIT 11860]
 gi|404339507|gb|EJZ65938.1| hypothetical protein HMPREF9448_00112 [Barnesiella intestinihominis
           YIT 11860]
          Length = 503

 Score =  248 bits (634), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 147/379 (38%), Positives = 198/379 (52%), Gaps = 32/379 (8%)

Query: 214 RVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKW---LLQY 270
           ++K+ L V+D  LP+TPSLPA  GI +  + D       S    + L    +W    L Y
Sbjct: 147 KIKIDLQVYDTALPSTPSLPAAFGIIEKNLID-------STSKEQTLQNKLEWAELCLDY 199

Query: 271 RISPFFCRW-GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYV 329
           R++P+F  W   SM+    + PW  +  ++  + SD R   +AVPY   LS N+      
Sbjct: 200 RMNPYFSTWLANSMKHEASSSPWKWNDKRTVPFLSDKRFNRFAVPYHS-LSHNELDSLLQ 258

Query: 330 R-KEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 388
           R K+ +LL       K+YFYLWDEP  M+ Y  +   + E+H   P+A+VLTT+YCGP D
Sbjct: 259 RLKQTDLL------DKSYFYLWDEPAYMKEYHLIGQYSQEIHKLMPEAKVLTTFYCGPKD 312

Query: 389 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 448
                  F  F       R  TQI+  S W L   E       + L+    EEWWTYVCM
Sbjct: 313 GKYKDRLFSVF----DLWRGDTQIFSMSAWALQANEANADTCRSLLR--GNEEWWTYVCM 366

Query: 449 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPG 508
           GP +  PN  L M G QHRAV+WR WKE  TGFLYW  N Y      S  + FR+ LP G
Sbjct: 367 GPGEEQPNLLLTMDGYQHRAVLWRSWKERTTGFLYWAVNAY----AESDTLAFRKDLPEG 422

Query: 509 DGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQG 568
           DGVL YPG+ F+S+  PV S+R+ER    ++D EYL     + GR +   L +    YQ 
Sbjct: 423 DGVLIYPGQYFNST-SPVVSIRMERWRDSMEDYEYLAKLEKKIGRAKSETLFQ--SIYQS 479

Query: 569 PERYTIEHLPIDVMRGEIF 587
           PE+Y+  +  I+  R  I 
Sbjct: 480 PEKYSKNNAEIEKFRKSIL 498


>gi|404484501|ref|ZP_11019705.1| hypothetical protein HMPREF9448_00111 [Barnesiella intestinihominis
           YIT 11860]
 gi|404339506|gb|EJZ65937.1| hypothetical protein HMPREF9448_00111 [Barnesiella intestinihominis
           YIT 11860]
          Length = 513

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 126/364 (34%), Positives = 189/364 (51%), Gaps = 34/364 (9%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 274
           V +S+ V +  LP TPS+ +V GI+    ++        ++  E        LL+YRISP
Sbjct: 160 VAISINVVNASLPETPSIASVFGINP---QNFIFTGLSEEQKIEKRKAASDLLLEYRISP 216

Query: 275 FFCRW-GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEI 333
           +F  W   +M+   ++ P+  +  ++ EY +D R +  A+P S  LS +         E+
Sbjct: 217 YFSTWLSGTMKTECFSSPYAWNDDRTWEYLADKRFSRIALP-SHGLSDD---------EL 266

Query: 334 ELLRTKAH----WKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDA 389
           E++  KA       KA+FY+WDEP     Y  ++ ++  +H YAP+A+VLTT+YCGP+D 
Sbjct: 267 EMMLNKARETGLLNKAFFYVWDEPTKTNEYEQIKTLSDRIHRYAPEAKVLTTFYCGPTDG 326

Query: 390 PLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMG 449
                 F  F      L   T IYCT  W L + E+  +    +L+  +G+EWW+YVCM 
Sbjct: 327 EHKDDLFAVF----DILNGATSIYCTGVWALQDNENRSEQCKAKLK--SGQEWWSYVCMS 380

Query: 450 PSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 509
            +   P          +RA MWR +KE  +GFLYW  N +  +  P   +R R  LP GD
Sbjct: 381 NT---PGLASNSTAIGNRATMWRNYKEQNSGFLYWVVNGF-ASVYP---LRPRPELPEGD 433

Query: 510 GVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQGP 569
           G+L YPGE F +++    S+RLER   G +D + L LY  + GR   L+LL     Y+ P
Sbjct: 434 GILIYPGESFGTNK-ICTSVRLERWRDGAEDYDMLVLYEKKLGRSAALSLL--NNVYKSP 490

Query: 570 ERYT 573
             YT
Sbjct: 491 SNYT 494


>gi|319640384|ref|ZP_07995108.1| hypothetical protein HMPREF9011_00705 [Bacteroides sp. 3_1_40A]
 gi|345517443|ref|ZP_08796919.1| hypothetical protein BSFG_04467 [Bacteroides sp. 4_3_47FAA]
 gi|254838009|gb|EET18318.1| hypothetical protein BSFG_04467 [Bacteroides sp. 4_3_47FAA]
 gi|317387987|gb|EFV68842.1| hypothetical protein HMPREF9011_00705 [Bacteroides sp. 3_1_40A]
          Length = 514

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 114/380 (30%), Positives = 179/380 (47%), Gaps = 33/380 (8%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 274
           ++L   V    +P   S+P  +G+ +  + +    +    E    +D    ++L YR++P
Sbjct: 144 IQLDYNVHHTTIPLKSSIPITVGVENRCMTECLNDKEADKERQRWVD----FVLSYRMTP 199

Query: 275 FFC------RWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDY 328
            F       RW           PW  +  +S    +D R + Y +P+   LS N+ A   
Sbjct: 200 VFGTQITPERWQYEHSF----SPWAWNDKRSIRLLNDRRYSCYMLPFF-TLSENELASLL 254

Query: 329 VRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 388
                  ++ K   K++ FY+WDEP  ME Y  ++   + +  YA DAR+LTT++CGP +
Sbjct: 255 CN-----IQKKGKLKESLFYIWDEPAYMEDYEQIKRKVNIIRKYASDARILTTFFCGPRN 309

Query: 389 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 448
            P     +  F     +L+ H  +   S       E+ V+ I  ++ PE G +WW+YVC 
Sbjct: 310 GPRKGDLYAVF----DYLKHHIHVATISLAPCKGNEEEVQHIRYKV-PE-GIDWWSYVCW 363

Query: 449 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPG 508
            P    PN+ L M+G Q RA+MWR WK G  GFLYW  N Y K         +   +P G
Sbjct: 364 QPGGNEPNFLLQMKGIQQRAIMWRTWKNGSQGFLYWNCNIYHKRN----PFTYITDMPHG 419

Query: 509 DGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQG 568
           DG+L YPG++    + P+AS RLER   G +++E L L    Y +++   LL     Y  
Sbjct: 420 DGILIYPGDIL-GCKGPIASARLERWRDGAEEMELLRLVEKNYSKEKADTLL--NIVYSS 476

Query: 569 PERYTIEHLPIDVMRGEIFN 588
           P  +  +   I   R E+ +
Sbjct: 477 PLSFIEKAHNIPFFRKELLD 496


>gi|294777879|ref|ZP_06743323.1| hypothetical protein CUU_2186 [Bacteroides vulgatus PC510]
 gi|294448333|gb|EFG16889.1| hypothetical protein CUU_2186 [Bacteroides vulgatus PC510]
          Length = 318

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 99/298 (33%), Positives = 149/298 (50%), Gaps = 19/298 (6%)

Query: 291 PWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLW 350
           PW  +  +S    +D R + Y +P+   LS N+ A          ++ K   K++ FY+W
Sbjct: 22  PWAWNDKRSIRLLNDRRYSCYMLPFF-TLSENELASLLCN-----IQKKGKLKESLFYIW 75

Query: 351 DEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHT 410
           DEP  ME Y  ++   + +  YA DAR+LTT++CGP + P     +  F     +L+ H 
Sbjct: 76  DEPAYMEDYEQIKRKVNIIRKYASDARILTTFFCGPRNGPRKGDLYAVF----DYLKHHI 131

Query: 411 QIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVM 470
            +   S       E+ V+ I  ++ PE G +WW+YVC  P    PN+ L M+G Q RA+M
Sbjct: 132 HVATISLAPCKGNEEEVQHIRYKV-PE-GIDWWSYVCWQPGGNEPNFLLQMKGIQQRAIM 189

Query: 471 WRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLR 530
           WR WK G  GFLYW  N Y K         +   +P GDG+L YPG++    + P+AS R
Sbjct: 190 WRTWKNGSQGFLYWNCNIYHKRN----PFTYITDMPHGDGILIYPGDIL-GCKGPIASAR 244

Query: 531 LERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQGPERYTIEHLPIDVMRGEIFN 588
           LER   G +++E L L    Y +++   LL     Y  P  +  +   I   R E+ +
Sbjct: 245 LERWRDGAEEMELLRLVEKNYSKEKADTLL--NIVYSSPLSFIEKAHNIPFFRKELLD 300


>gi|297822901|ref|XP_002879333.1| hypothetical protein ARALYDRAFT_902189 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325172|gb|EFH55592.1| hypothetical protein ARALYDRAFT_902189 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 113

 Score =  111 bits (278), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 71/174 (40%), Positives = 85/174 (48%), Gaps = 67/174 (38%)

Query: 208 ISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL 267
           +SNL+V +KL LTVW+FI+  T SL AVI +SDTVIEDRF V HGS+EWY+ L  HFKWL
Sbjct: 7   VSNLAVSIKLRLTVWEFIILVTLSLSAVICVSDTVIEDRFDVEHGSEEWYKKLGLHFKWL 66

Query: 268 LQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKD 327
           L +RI+                                           P  SSN+    
Sbjct: 67  LHHRIN-------------------------------------------PYFSSNNN--- 80

Query: 328 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTT 381
                              + LW +PLNMEH+ SV  MASE  AYA DARVLTT
Sbjct: 81  -------------------YNLW-QPLNMEHFDSVSKMASENFAYA-DARVLTT 113


>gi|413951106|gb|AFW83755.1| hypothetical protein ZEAMMB73_317062 [Zea mays]
          Length = 1594

 Score =  105 bits (262), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 49/64 (76%), Positives = 55/64 (85%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG V +QC+DLCS+
Sbjct: 428 MPSTANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSS 487

Query: 61  SGDR 64
           SGDR
Sbjct: 488 SGDR 491


>gi|413953324|gb|AFW85973.1| putative DUF1692 domain containing protein [Zea mays]
          Length = 1070

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/64 (76%), Positives = 55/64 (85%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG V +QC+DLCS+
Sbjct: 428 MPSTANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSS 487

Query: 61  SGDR 64
           SGDR
Sbjct: 488 SGDR 491


>gi|413949740|gb|AFW82389.1| putative DUF1692 domain containing protein [Zea mays]
          Length = 1061

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/64 (76%), Positives = 55/64 (85%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           MPSTANV  QE PRPLE +NLLAARNERES QIALRPKVSW++S  AG V +QC+DLCS+
Sbjct: 414 MPSTANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSS 473

Query: 61  SGDR 64
           SGDR
Sbjct: 474 SGDR 477


>gi|354583721|ref|ZP_09002619.1| hypothetical protein PaelaDRAFT_3720 [Paenibacillus lactis 154]
 gi|353197601|gb|EHB63082.1| hypothetical protein PaelaDRAFT_3720 [Paenibacillus lactis 154]
          Length = 786

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 91/350 (26%), Positives = 145/350 (41%), Gaps = 51/350 (14%)

Query: 213 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 272
           VR+ + LTVWDF L          G+    +++  G   G + W + +++++   +++R+
Sbjct: 175 VRIPIELTVWDFELTDESHAKTNFGVWGGPVQEAHGNVVGEEAW-KYIEKYYYASVEHRL 233

Query: 273 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKE 332
           +P +    +S  + +Y    P       +Y +DPR++AY +PY     + DG  D  R +
Sbjct: 234 TPGYLPIPDS-DINSYVERAP-------KYVNDPRISAYRLPY---YRTADGQPDIQRNK 282

Query: 333 --IELLRTKAHWKKAYFYL--WDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 388
             ++ LR      KAY+Y+   DEP   + Y+ V+ +   L   APD   L T    P D
Sbjct: 283 QLVDRLREAGLLSKAYYYVSEIDEP-TRDKYARVKQINDALEQAAPDVPHLVT--IQPVD 339

Query: 389 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 448
             +G                         WV  + E   +    E Q      WW Y  +
Sbjct: 340 ELVGDVDI---------------------WV-ADIEKFDEAFAKERQAAGDSVWW-YTYV 376

Query: 449 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR----- 503
            P  P P++HL       R + W     G  G LYW    ++K      +   R      
Sbjct: 377 KPKHPFPSYHLDDDLVGTRLLTWMQRDHGVEGALYWATTQFQKYDAAQKKYVSRDVWTDP 436

Query: 504 -GLP--PGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASR 550
              P   GDG LFYPG        P+ ++RLE +   ++D EYL LY  +
Sbjct: 437 LAFPGANGDGYLFYPGTEVGVD-GPIGTIRLEVLRESMEDYEYLWLYEQK 485



 Score = 45.4 bits (106), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 52/121 (42%), Gaps = 21/121 (17%)

Query: 19  INLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRR--- 75
           I + AARNE ES Q+ ++      ++     +QV  SDL    G   +  + + L +   
Sbjct: 57  IRIGAARNEYESGQVIVK------ANQPLRKLQVSMSDLKLTDGSAKIGREHIQLFKQHY 110

Query: 76  ---------VVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEII 126
                      P    PDAL+PL+    Q+ +  G    +W  +  P  Q PG Y GE+ 
Sbjct: 111 IEVKTSTTPAYPKGWYPDALIPLN---QQLEVAEGHNQGIWFKVYVPKGQHPGTYTGEMT 167

Query: 127 I 127
           +
Sbjct: 168 L 168


>gi|403382311|ref|ZP_10924368.1| hypothetical protein PJC66_21061 [Paenibacillus sp. JC66]
          Length = 796

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 92/354 (25%), Positives = 140/354 (39%), Gaps = 54/354 (15%)

Query: 213 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 272
           VR+ + LTVWDF L          GI    I++  G   G + W E +++++   +++R+
Sbjct: 177 VRIPVELTVWDFELTDENHSKTAFGIWGGPIQEAHGNVQGMEAW-EYIEKYYWASVEHRL 235

Query: 273 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVR-K 331
           +P +      + +      +  +H  +  + +DPR++AY +PY        G  D  R K
Sbjct: 236 TPGY------LPIPDTDIDYYVEH--APRFINDPRVSAYRLPY---YRDAQGEPDIERIK 284

Query: 332 EI-ELLRTKAHWKKAYFYL--WDEPL----NMEHYSSVRNMASELHAYAPDARVLTTYYC 384
           E+ + LR +   +K YFY+   DEP+       +Y  V+ +   L   APD   L T   
Sbjct: 285 ELADKLRDRGMLEKGYFYISEIDEPVPHPNAANNYDRVKVINDALKQAAPDVPHLVTI-- 342

Query: 385 GPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWT 444
                     P E  +       P    Y               D   E Q E    WW 
Sbjct: 343 ---------QPLEELLGDVDIWSPEIDKYDY-------------DFARERQAEGEPVWW- 379

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR- 503
           Y  + P  P P++H        R + W     G  G LYW    ++K      +   R  
Sbjct: 380 YTSVFPKHPFPSYHTDDDLVGARLLTWMQHDYGVEGTLYWATTQFQKYDSAQRKYVSRDV 439

Query: 504 -----GLP--PGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASR 550
                  P   GDG LFYPG        PV ++RLE +   ++D EYL  Y  R
Sbjct: 440 WTDPLAFPGANGDGYLFYPGTEIGID-GPVGTIRLEVLRESMEDYEYLWHYEQR 492



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 20/121 (16%)

Query: 19  INLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRR--- 75
           + L AARNE ES Q+ +R     + +     +QV  SDL   +G   +  + + L +   
Sbjct: 58  MQLAAARNEYESGQVIVR-----AGNHPLRKLQVSISDLKQENGAAKIHRRDIELFQQHY 112

Query: 76  ---------VVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEII 126
                      P    PDAL+PL     ++ +  G    +WV +  P  QP G+Y+GEI 
Sbjct: 113 IEVTTSTTPAYPQGWYPDALIPLK---GKLEVGAGHNQGIWVKVYVPKGQPAGVYKGEIT 169

Query: 127 I 127
           +
Sbjct: 170 L 170


>gi|374849806|dbj|BAL52811.1| hypothetical protein HGMM_F03C06C16 [uncultured prokaryote]
          Length = 994

 Score = 75.9 bits (185), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 102/389 (26%), Positives = 155/389 (39%), Gaps = 85/389 (21%)

Query: 213 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEAL-DQHFKWLLQYR 271
            +V L LTV+DF LP TP+L +  GI    IE    V+   D+   AL D++ +   ++R
Sbjct: 573 AQVPLMLTVYDFDLPRTPTLRSGFGIDARRIEQYHRVQSEQDK--RALWDRYMRNFREHR 630

Query: 272 ISPF-----------FCRWGESMRV-LTYTCPWPADHPKSDEY-FSDPRLAAYAVP---- 314
           ++P+           F   G + RV L +T    A     DE+ F+   L  + +P    
Sbjct: 631 LAPYNFYAYDHYEVRFEGEGANKRVVLDFTRFDRAAQRYLDEFGFNAFVLPIHGLPSGRH 690

Query: 315 --YSPVLSSN--DGA-------KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVR 363
             YSP +     +G         DY+R+    LR +   KKAY Y +DEP   + Y  V+
Sbjct: 691 PNYSPGVFGGFREGTPEYERLWSDYLRQLTTHLRERGWLKKAYVYWFDEPEEAD-YPFVK 749

Query: 364 NMASELHAYAPD-ARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHT-QIYCTSEWVLG 421
            +   L   APD  R+LT     P +  +G    + +  +  F+ P   Q  C +     
Sbjct: 750 RVNERLKQVAPDLTRMLTEQ---PEEPLIGAV--DLWCPLTAFVSPEAIQARCQA----- 799

Query: 422 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGF 481
                            GEE W YVC GP  P+    +   G++ R  +W+ W+ G  G 
Sbjct: 800 -----------------GEEIWWYVCTGPRAPYATLFIDHPGTEMRVWLWQTWQYGVQGI 842

Query: 482 LYWGANCYEKATVPSAEIR----------FRRGLPP--------GDGVLFYPGEVFSSSR 523
           L W    +         ++          +   L P        GDG L YP     +  
Sbjct: 843 LIWQTTWWHNEFAYPDRLQDPWQDPMSYVWDANLKPGTRLFWGNGDGRLLYPPRRDPNRA 902

Query: 524 ------QPVASLRLERILSGLQDIEYLNL 546
                  P+ S+R E +  G +D EY  L
Sbjct: 903 TDPLIDDPIPSIRWECLRDGAEDYEYFVL 931



 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 56/131 (42%), Gaps = 26/131 (19%)

Query: 10  QEMPRP---LEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLV 66
           +E P P   +  + L AAR E E VQI LRP+          + QV+ SDL    G   +
Sbjct: 445 RERPLPETTMHTVTLSAARGEYEPVQIVLRPQ------RNTTLRQVEISDLT--QGKHRL 496

Query: 67  VGQSLMLRRVVPM--------LG----VPDALVPLDLPVCQISLIPGETTAVWVSIDAPY 114
             + + LR V  +        LG     PD L PL  P   + L       +W+++  PY
Sbjct: 497 PAKHITLREVAYVRVAHPTDWLGEPGDYPDPLPPLKTP---LRLQAERNQPLWLTVYVPY 553

Query: 115 AQPPGLYEGEI 125
             P G Y G I
Sbjct: 554 GTPAGKYTGTI 564


>gi|392373328|ref|YP_003205161.1| hypothetical protein DAMO_0214 [Candidatus Methylomirabilis
           oxyfera]
 gi|258591021|emb|CBE67316.1| conserved exported protein of unknown function [Candidatus
           Methylomirabilis oxyfera]
          Length = 676

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 131/349 (37%), Gaps = 34/349 (9%)

Query: 212 SVRVKLSLTVWDFILPATPSLPAVIG-ISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQY 270
           S+ + +SLTVW+F LP TP+L    G          FG    +D     +D+    LL++
Sbjct: 308 SIPIPISLTVWNFSLPTTPALRTNFGHFRSQQFAAAFGTSRYTDIHNTLMDKFDHELLRH 367

Query: 271 RISPFFCRWGE-SMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPY-------SPVLSSN 322
           R+SP      E S    T T    ++      +F    L +Y +P         P  +  
Sbjct: 368 RLSPARPSGTEPSYNAATGTID-SSNVQARMAHFISLGLTSYDLPLFDDWPWADPFGADR 426

Query: 323 DGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTY 382
           D A+ Y+   ++ L        AY    DEP     Y +VR+ A+  H   P A++L T 
Sbjct: 427 DKAQRYLSGILDWLGANDWLTLAYHDGIDEPEEASGYQAVRDEATNWHGLDPRAKMLITE 486

Query: 383 YCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEW 442
              P D   G       +  P F R     +     V                   GE+ 
Sbjct: 487 QTRPWDPTWGTLYGSVDIWTPYFSRFDPVTWAERRAV-------------------GEQS 527

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVP---SAEI 499
           W Y   G  +  P   L     + R   W  ++ G TG L W    +++ T P    A  
Sbjct: 528 WMYGAWG-DNGTPGDLLDRPIYEIRVPAWIGFQYGITGLLKWNTVYWDQVTDPWTNPATY 586

Query: 500 RFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYA 548
                +  GDG  FYPG        P+ASLRL+     + D +YL   A
Sbjct: 587 TLSGDIFNGDGAFFYPGTKV-GYEGPIASLRLKTFRDAVDDYDYLTQLA 634


>gi|410456409|ref|ZP_11310270.1| hypothetical protein BABA_21191 [Bacillus bataviensis LMG 21833]
 gi|409928078|gb|EKN65201.1| hypothetical protein BABA_21191 [Bacillus bataviensis LMG 21833]
          Length = 548

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 96/237 (40%), Gaps = 43/237 (18%)

Query: 333 IELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLG 392
           I+ ++      + +F++ DEP +++   S +N +  L  Y  D  V+        DA   
Sbjct: 315 IDFIKQNGLEHRVFFHVSDEP-HLDQVESYQNASEILQTYVKDFPVI--------DALSD 365

Query: 393 PTPFES-FVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG-EEWWTYVCMGP 450
            T +E   VK P     H Q +                       +NG E  WTY C   
Sbjct: 366 YTFYEKGLVKTPIPSNDHIQPFL----------------------DNGVENLWTYHCCVQ 403

Query: 451 SDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGL 505
                N    M   ++R +  +++K    GFL+WG N +     +K   P          
Sbjct: 404 YKKVANRFFNMPSFRNRVLGMQLYKFNIAGFLHWGYNFWYSQYSKKPIDPFRNTDAHYAF 463

Query: 506 PPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
           P GD  L YPGE       P+ S+RLE +   LQD+  L L  S  GR++ L LLE+
Sbjct: 464 PSGDAFLVYPGE-----EGPIESIRLEVLHEALQDLRALQLLESFIGREQVLELLEE 515


>gi|153004484|ref|YP_001378809.1| hypothetical protein Anae109_1621 [Anaeromyxobacter sp. Fw109-5]
 gi|152028057|gb|ABS25825.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
          Length = 561

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 94/370 (25%), Positives = 145/370 (39%), Gaps = 55/370 (14%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQYRIS 273
           V + LTVWDF LP+T +L +  G++   +    G+       +  L   +  L L +R+S
Sbjct: 180 VPVELTVWDFELPSTATLRSAFGLAWGALPSGHGISSSDLAAFATLRARYGQLALDHRVS 239

Query: 274 ------------PFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSS 321
                         F R+   +   T     P     + EY           P + V + 
Sbjct: 240 LSHHDDGMWNDLEHFDRYYGPLMDGTAATRLPGARLTAVEYLG---------PLADVANL 290

Query: 322 NDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEP-LNMEHYSSVRNMASELHAYAPDARVL 379
              A+ Y        R++  W +  F Y  DEP      +  +   AS      P+ R L
Sbjct: 291 ARWAQRY--------RSRPGWFERLFQYTCDEPPYQGCGWGDIALRASAAKKADPEFRTL 342

Query: 380 TTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG 439
            T     ++A       +  V V  F+   +  Y          +    D     +P+N 
Sbjct: 343 VTTTIQEAEANGATGLLDLVVPVVNFIDDKSGGYA-------GDQRPKYDAFLAAEPQN- 394

Query: 440 EEWWTYVCM----GPSDPH----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEK 491
           E W    CM    G S  +    P++ +     ++RA+ W  +K   TG LYW       
Sbjct: 395 EVWLYQSCMSHGCGGSSAYGTGWPSYMVDASAVRNRAMQWLAFKYRATGELYWDTTYAYL 454

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLNLYA 548
           +  P A +    G   GDG LFYPG   ++  ++  PVAS+RL+ I  G++D EYL L A
Sbjct: 455 SGDPWASVWEFDG--NGDGTLFYPGTPAKIGGTTHVPVASIRLKMIREGMEDHEYLVLLA 512

Query: 549 SRYGRDEGLA 558
           +    D GLA
Sbjct: 513 AL--GDRGLA 520


>gi|315648741|ref|ZP_07901837.1| hypothetical protein PVOR_25998 [Paenibacillus vortex V453]
 gi|315275943|gb|EFU39294.1| hypothetical protein PVOR_25998 [Paenibacillus vortex V453]
          Length = 558

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 97/244 (39%), Gaps = 49/244 (20%)

Query: 328 YVRKEIELLRTKAHWKKAYFYLWDEPL--NMEHYSSVRNMASELHAYAPDARVLTTYYCG 385
           ++ K ++ +R     K+ YF+L DEP   ++E Y +   +   +    P    L+ Y   
Sbjct: 314 FLNKLVQFVRWNGLEKRVYFHLSDEPKLDDLETYRAASELVRPILKDFPIIDALSDYEFY 373

Query: 386 PSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP--ENG-EEW 442
            S     P P  +                                  ++QP  ++G E  
Sbjct: 374 KSGLIEHPIPASN----------------------------------DIQPFLDHGLEGL 399

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 497
           WTY C        N    M  S++R +  +++     GFL+WG N +     + A  P  
Sbjct: 400 WTYYCCAQYKQVSNRFFHMPSSRNRVLGIQLYTLKLRGFLHWGYNFWYAQFSKYAINPYQ 459

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                 G P GD  L YPGE       PV S+RLE +   LQD+  L L   + GR++ +
Sbjct: 460 VTDAGGGFPAGDAFLVYPGE-----EGPVESIRLEVLTEALQDLRALELLEQKVGREQVM 514

Query: 558 ALLE 561
           A+L+
Sbjct: 515 AMLQ 518


>gi|442320014|ref|YP_007360035.1| hypothetical protein MYSTI_03035 [Myxococcus stipitatus DSM 14675]
 gi|441487656|gb|AGC44351.1| hypothetical protein MYSTI_03035 [Myxococcus stipitatus DSM 14675]
          Length = 561

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 107/394 (27%), Positives = 145/394 (36%), Gaps = 67/394 (17%)

Query: 209 SNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLL 268
           S   V V  ++ V  F+LPAT SLP   GIS   I    G+   S E    L  + + LL
Sbjct: 147 SREHVSVPFTVEVQPFVLPATASLPTSFGISQLSIARGHGLNAESSEAKALLRAYARMLL 206

Query: 269 QYRIS-------PFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSS 321
           ++R+S       P   R+ +   V+     W     +   +     L + A   +  L  
Sbjct: 207 EHRVSAHGMSMSPPPVRFEDGRAVVD----WREYDAEMAPFLDGSLLPSGARFTTTDLRD 262

Query: 322 NDGAKD------YVRKEIELLRTKAHWKKAYFYLWDEP------LNMEHYSSVRN----- 364
           N  A        Y R  +E  R K    + +FY  DEP      L +     VR      
Sbjct: 263 NKKAHTEAERVAYYRAFVEHFRKKDWPTQLFFYAKDEPKPQDVPLVLTQSRRVREAGGAR 322

Query: 365 ------MASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIY----C 414
                 M  EL A A D    T     P   P       +  ++ K LR  TQ++    C
Sbjct: 323 VLITTPMEGELPAAA-DILAPTLNCFFPRPGPATCRAIHTVTELRKQLRSGTQVWWYQSC 381

Query: 415 TSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVW 474
            S    G          TE   E     W    +         H  M    +RA+    +
Sbjct: 382 NSHGCNGG-------ASTEAAQERAYSGWASYMVD--------HSAM---LNRAMGPLAF 423

Query: 475 KEGGTGFLYWGAN-CYEKATVPSAEIRFRRGLPPGDGVLFYPG--EVFSSSR-QPVASLR 530
             G  G LY+     Y     P  ++ F  G   GDG  FYPG  E    SR QPV SLR
Sbjct: 424 VNGVDGELYFDTVFAYNTKKDPWKDL-FEFG-GNGDGTFFYPGTPERLGDSRHQPVPSLR 481

Query: 531 LERILSGLQDIEYLNLYA----SRYGRDEGLALL 560
           L+ +  GL+D EYL L A     ++ RD    L+
Sbjct: 482 LKHLRDGLEDFEYLRLLARLGDEKFARDAARQLV 515


>gi|291514616|emb|CBK63826.1| hypothetical protein AL1_13720 [Alistipes shahii WAL 8301]
          Length = 573

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 60/250 (24%), Positives = 101/250 (40%), Gaps = 40/250 (16%)

Query: 303 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSS 361
           + DPR+  Y   Y P L      ++++R +     +   W   Y  ++ DEPL+ E+ +S
Sbjct: 321 YDDPRVQQYIAAYFPAL------QEHLRSKTINDGSGRSWLDIYTQHIADEPLD-ENKTS 373

Query: 362 VRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLG 421
              +A ++   APD R++  Y     D  L        + VP+      +IY T      
Sbjct: 374 WEGLAHQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFAWEIYRTM----- 422

Query: 422 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGF 481
                            G   W Y CM P     N ++ +   + R + W  +K G  G+
Sbjct: 423 ---------------PAGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYGSPGY 467

Query: 482 LYWGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQD 540
           L+WG N +     P  ++       P GD  + YPG      R+   S+RL  +  G++D
Sbjct: 468 LHWGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLTAMRDGIRD 522

Query: 541 IEYLNLYASR 550
            + L +  +R
Sbjct: 523 YDLLKMVEAR 532


>gi|444913149|ref|ZP_21233303.1| hypothetical protein D187_05240 [Cystobacter fuscus DSM 2262]
 gi|444716152|gb|ELW57007.1| hypothetical protein D187_05240 [Cystobacter fuscus DSM 2262]
          Length = 546

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 106/384 (27%), Positives = 142/384 (36%), Gaps = 60/384 (15%)

Query: 200 IDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEA 259
           +DM    A S     V  +  V  F+LPAT SLP   GIS   I    G++  S E    
Sbjct: 132 LDMEGAPAAS-----VPFTAEVQPFVLPATSSLPNSFGISLYSIAKGHGLKPESPEAQTL 186

Query: 260 LDQHFKWLLQYRIS-------PFFCRWGESMRVLTYTCPWPADHPKSDEYF--SDPRLAA 310
           L  +   LL +R+S       P   R+ E   VL +        P  D     S  R   
Sbjct: 187 LRDYVTALLAHRVSAHGMSMEPPPVRFEEGRAVLDFRAYDAEVGPFLDGSALPSGARFTT 246

Query: 311 YAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELH 370
             V  S    +++    Y R   E  + K    + +FY  DEP   E    VR  A  + 
Sbjct: 247 VDVRDSKAARTDEQKAAYYRAFAEHAKDKGWPAQLFFYAKDEP-KPEDVPLVRAQALRVR 305

Query: 371 AYAPDARVLTT-----YYCGPSD--APL--------GPTPFESFVKVPKF---LRPHTQI 412
               D  VL T        G +D  AP         GP    + V +      L P+ ++
Sbjct: 306 TAGKDVPVLVTSPLDEALRGSADILAPTLNCFFPRPGPQTCRNVVPLQTLRGKLAPNVKV 365

Query: 413 Y----CTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRA 468
           +    C S    G      KD  TE   +    W +Y+   P+              +RA
Sbjct: 366 WWYQSCNSHGCTGGP---AKDSATE---KAYSGWASYMVDHPA------------PLNRA 407

Query: 469 VMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQP 525
           +    +  G  G LY+          P  E+ F  G   GDG  FYPG       S  QP
Sbjct: 408 MGPLAFLSGVDGELYFDTVFAYNTKDPWKEV-FEFG-GNGDGTFFYPGTPAHTGLSRHQP 465

Query: 526 VASLRLERILSGLQDIEYLNLYAS 549
           V SLRL+ +  GL+D EYL L  S
Sbjct: 466 VVSLRLKHLRDGLEDYEYLQLLES 489


>gi|115380168|ref|ZP_01467198.1| hypothetical protein STIAU_4742 [Stigmatella aurantiaca DW4/3-1]
 gi|310820633|ref|YP_003952991.1| hypothetical protein STAUR_3372 [Stigmatella aurantiaca DW4/3-1]
 gi|115362825|gb|EAU62030.1| hypothetical protein STIAU_4742 [Stigmatella aurantiaca DW4/3-1]
 gi|309393705|gb|ADO71164.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 554

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 97/375 (25%), Positives = 141/375 (37%), Gaps = 42/375 (11%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 274
           V  ++ V  F+LPAT SLP   GIS   I    G+   S E +  L  + + LL++R+S 
Sbjct: 152 VPFTVEVQPFVLPATASLPTSFGISLYSISRGHGLSPESPEAHALLRDYARTLLEHRVSA 211

Query: 275 FF-------CRWGESMRVLTYTCPWPADHPKSDEYF--SDPRLAAYAVPYSPVLSSNDGA 325
                     R+     V+ +        P  D     S  R     V  +   S+++  
Sbjct: 212 HGLSMEAPPVRFENGRAVVDFRAYDEEMAPYFDGTLLPSGARFTTSDVRDNRKASTDEEK 271

Query: 326 KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCG 385
             Y R   E  ++K    + +FY  DEP   E    VR  +  + A A    VL T    
Sbjct: 272 VAYYRAFAEHFQSKGWPAQLFFYAKDEP-KPEDVPLVRAQSKRVRA-AGKIPVLVT---- 325

Query: 386 PSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWW-- 443
              +PL     ++       L P    +         R  L    +    P N + WW  
Sbjct: 326 ---SPLD----DALQGTADILTPTLNCFFPRPGPQTCRNVLPLSKLRSRLPANAKVWWYQ 378

Query: 444 ---TYVCMGPSDPHPNWHLGMRG----------SQHRAVMWRVWKEGGTGFLYWGANCYE 490
              ++ C G   P         G            +RA+    +  G  G LY+    Y 
Sbjct: 379 SCNSHGCNGGPIPEAATERAYTGWASYMVDHPAPLNRAMGPLAFLTGVDGELYFD-TVYA 437

Query: 491 KATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLNLY 547
             T    +  F  G   GDG LFYPG    + + + QPV SLRL+ +  GL+D EYL+L 
Sbjct: 438 YNTQDPWKGLFEFG-GNGDGTLFYPGTPKRLGTQAHQPVVSLRLKHLRDGLEDYEYLHLL 496

Query: 548 ASRYGRDEGLALLEK 562
           A     D   AL  +
Sbjct: 497 AKLGDEDTARALARR 511


>gi|297791959|ref|XP_002863864.1| hypothetical protein ARALYDRAFT_917687 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309699|gb|EFH40123.1| hypothetical protein ARALYDRAFT_917687 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 460

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 28/51 (54%), Positives = 33/51 (64%), Gaps = 3/51 (5%)

Query: 242 VIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPW 292
           + E RF V HG +E Y+  D HFKWLLQY ISP+FC+W E   V  Y  PW
Sbjct: 14  LTESRFDVEHGIEECYKTFDLHFKWLLQYWISPYFCKWFE---VSKYVQPW 61


>gi|153004261|ref|YP_001378586.1| hypothetical protein Anae109_1395 [Anaeromyxobacter sp. Fw109-5]
 gi|152027834|gb|ABS25602.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
          Length = 604

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 90/373 (24%), Positives = 135/373 (36%), Gaps = 58/373 (15%)

Query: 213 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 272
            ++ + LTVWDF LP+T SL    G+S  +I    GV   +D                 I
Sbjct: 166 AKIPVMLTVWDFELPSTASLKTHFGLSWGLIPSGHGVSPETDA---------------SI 210

Query: 273 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSD-----PRLAAYAVPYSPVLS----SND 323
              +   G   RV          H   D +  +        A   +P + + +     N 
Sbjct: 211 RARYAALGLDHRVSLSGVADDGYHGDFDHFERNYAPLVDGTAKTRLPGAKLTTVKYVGNQ 270

Query: 324 GAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYY 383
            + D  R+  E  R K  + + + Y  DEP     +  +      +H   P+ R L T  
Sbjct: 271 TSVDEHRRWAEHFRAKGWFDRLFDYTCDEPPLTCSWDELPQRTKAVHEADPEFRTLVTTQ 330

Query: 384 CGPSDAPLGPTPFESFVKVPKFL--RPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEE 441
              ++        +  V V  ++  RP           LG         +   + E  E 
Sbjct: 331 IWDAEEHGVADEIDIMVPVVNWMDDRPGAG-------SLGQNRAKYDGFLA--KSEKKEL 381

Query: 442 WWTYVCM-----------GPSD------PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYW 484
           W    CM            PS+        P++ +     ++RA+ W  + E  TG LYW
Sbjct: 382 WLYQSCMSHGCGGTVNIGNPSEWDRYNTGWPSYMIDSSAVRNRAMEWISFLEDATGELYW 441

Query: 485 GANCYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDI 541
                      S +  F      GDG LFYPG    +  S+  PVAS+RL+ I  G++D 
Sbjct: 442 ETAFAFTHDAWSNQWDFSGN---GDGTLFYPGTPARIGGSTDIPVASIRLKMIREGMEDY 498

Query: 542 EYLNLYASRYGRD 554
           EYL L A    RD
Sbjct: 499 EYLKLLADAGQRD 511



 Score = 45.4 bits (106), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 63/141 (44%), Gaps = 20/141 (14%)

Query: 3   STANVGPQEMPRPLEPINLLAARNERESVQIALR-PKVSWSSSST----AGVV------Q 51
           +T  + P   PR      + AARNE E+ Q+ +  P    S+ +T    AGVV      +
Sbjct: 29  ATEKIRPDAQPRQTTEARIAAARNEFEAFQVVVTGPARGVSARATSLEGAGVVDDVKLYR 88

Query: 52  VQCSDLCSASGDRLVVGQ--SLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVS 109
           V   D+ +AS      G+    ++  V  ++G      P D+P        GE+ A+WV 
Sbjct: 89  VDAIDVHTASALDGATGRWPDALVPDVDDVVGEKRNAFPFDVPA-------GESRAIWVE 141

Query: 110 IDAPYAQPPGLYEGEIIITSK 130
           +  P    PG + GE+ I S+
Sbjct: 142 VRVPPDAKPGTHFGEVTIASE 162


>gi|224024189|ref|ZP_03642555.1| hypothetical protein BACCOPRO_00912 [Bacteroides coprophilus DSM
           18228]
 gi|224017411|gb|EEF75423.1| hypothetical protein BACCOPRO_00912 [Bacteroides coprophilus DSM
           18228]
          Length = 566

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 97/219 (44%), Gaps = 29/219 (13%)

Query: 353 PLNMEHYSSV-RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRP--- 408
           P+N E  S+  R     L ++     +   Y    +D P+  + F+S+V++ +F++    
Sbjct: 316 PINSEKASNFYRQFLPSLMSHLQKRGLKDIYVQHIADEPI-ESNFKSYVEIARFVKDICP 374

Query: 409 --------HTQIYCTSEWVLGNREDLVKDIVTELQPEN--GEEWWTYVCMGPSDPHPNWH 458
                   HT     +  +   + +  KD  +  Q     G+E W Y C+ P     N  
Sbjct: 375 DLRIIEACHTHNLENTVDIWVPQLNFYKDGYSFYQERQKAGDEVWFYTCLAPQGNFANRF 434

Query: 459 LGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAE---IRFRRG--LPPGDGVLF 513
           L +   + R + W  ++ G TG+L+WG N +++ + P  E   +    G  LP GD  + 
Sbjct: 435 LELPSIKTRLIHWLNFRYGATGYLHWGFNFWKENSDPYGETTTMNLESGNTLPGGDSWIV 494

Query: 514 YP--GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASR 550
           YP  G+++S       S+RLE +  G+ D   L + A +
Sbjct: 495 YPKNGKLYS-------SIRLEAMRDGIADYTLLQMLAQK 526


>gi|218779590|ref|YP_002430908.1| hypothetical protein Dalk_1743 [Desulfatibacillum alkenivorans
           AK-01]
 gi|218760974|gb|ACL03440.1| hypothetical protein Dalk_1743 [Desulfatibacillum alkenivorans
           AK-01]
          Length = 844

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 60/231 (25%), Positives = 97/231 (41%), Gaps = 33/231 (14%)

Query: 326 KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCG 385
           KDY+    E LR   +  +AY+Y+ +EP + E Y +V   A+ L + APD +++ +    
Sbjct: 353 KDYMHATQEYLRGLGYLDRAYYYMANEPQDGEDYKAVAWYANLLKSAAPDLKLMVS---- 408

Query: 386 PSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY 445
                    P E       +      I+     VL N +    D+  + +  + EE W Y
Sbjct: 409 -------EEPKEEIYNNETYSGAKIDIWLP---VLNNYD---PDVSHDREKNHQEETWVY 455

Query: 446 VCMGPSDPHPN-WHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA--TVPSAEIRFR 502
              G   P+ N   L   G + +   W +WK    G  Y+  N + K   T P  +    
Sbjct: 456 FLHGTRPPYYNPITLDHPGIESKFTGWLLWKYRIRGIAYYSMNGWSKNPWTSPMTDGH-- 513

Query: 503 RGLPPGDGVLFYPG-------EVFSSSRQPVASLRLERILSGLQDIEYLNL 546
                GD  +FYP        +  +++ + V S+RLE +   L+D EYL L
Sbjct: 514 ----NGDTFMFYPPSEDNSAIDYAANNHRLVPSIRLELMRDSLEDYEYLYL 560


>gi|86160501|ref|YP_467286.1| hypothetical protein Adeh_4085 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85777012|gb|ABC83849.1| hypothetical protein Adeh_4085 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 539

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 143/355 (40%), Gaps = 47/355 (13%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQYRIS 273
           V ++LTVW F LP+T SL +  G+S   +    GV   S +   AL   +  L L +RI+
Sbjct: 109 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGV---SGDALSALRARYGQLALDHRIT 165

Query: 274 PFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEI 333
               R  +  R L +   +           + P   A +V Y   L  + G   +     
Sbjct: 166 --LSRIDDGNRDLAHFASFFGPLFDGASAATLPGAQATSVEY---LGGSSGYASWA---- 216

Query: 334 ELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGP 393
              +++    + + Y  DEP     +S +   A+   A +P  R L T     +DA  G 
Sbjct: 217 SFFQSRGWDDRLFQYTCDEPPLQCAWSDIPARAASARAVSPALRTLVTTTIQQADA-AGV 275

Query: 394 TP-FESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY------- 445
           TP  +  V V  FL          E   G +     D      P    E WTY       
Sbjct: 276 TPAIDVLVPVVNFLDDRA-----GERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHG 327

Query: 446 ----VCMG-PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV 494
               V MG PSD        P++ +     ++RA+ W  +    TG LY+          
Sbjct: 328 CGGTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDP 387

Query: 495 PSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLNL 546
            + +  F      GDG LFYPG   +V  +++ PVAS+RL+ I  G++D EYL L
Sbjct: 388 WANQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGMEDYEYLKL 439


>gi|390946349|ref|YP_006410109.1| hypothetical protein Alfi_1070 [Alistipes finegoldii DSM 17242]
 gi|390422918|gb|AFL77424.1| hypothetical protein Alfi_1070 [Alistipes finegoldii DSM 17242]
          Length = 573

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 60/248 (24%), Positives = 99/248 (39%), Gaps = 40/248 (16%)

Query: 305 DPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSSVR 363
           DPR+  Y   Y P L      ++++R  +    +   W   Y  ++ DEPLN E+ +S  
Sbjct: 323 DPRVQRYIAAYFPAL------QEHLRSRMIDDGSGRSWLDIYTQHIADEPLN-ENKTSWE 375

Query: 364 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 423
            +A ++   APD R++  Y     D  L        + VP+      +IY T        
Sbjct: 376 GLARQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFVWEIYRTMP------ 423

Query: 424 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 483
                          G   W Y CM P     N ++ +   + R + W  +K    G+L+
Sbjct: 424 --------------AGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYDSPGYLH 469

Query: 484 WGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIE 542
           WG N +     P  ++       P GD  + YPG      R+   S+RL  +  G++D +
Sbjct: 470 WGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLAAMRDGIRDYD 524

Query: 543 YLNLYASR 550
            L +  +R
Sbjct: 525 LLKMVEAR 532


>gi|405373491|ref|ZP_11028264.1| hypothetical protein A176_4825 [Chondromyces apiculatus DSM 436]
 gi|397087750|gb|EJJ18780.1| hypothetical protein A176_4825 [Myxococcus sp. (contaminant ex DSM
           436)]
          Length = 555

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 142/376 (37%), Gaps = 59/376 (15%)

Query: 207 AISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKW 266
           A S   V V  +  V  F LPAT SLP   GIS   I    G+   S +  E L  + + 
Sbjct: 139 AGSGKPVPVAFTAEVQPFALPATSSLPNSFGISMYSIARGHGLDPESAQARELLRDYGRA 198

Query: 267 LLQYRIS-------PFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVL 319
           LL++R+S       P   R+ +   V+     W A   +   +     L++ A   S  +
Sbjct: 199 LLEHRVSAHGMSMNPPPVRFEKGRAVVD----WSAYDAEMAPFLDGTLLSSGARFTSSDV 254

Query: 320 SSNDGAKD------YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYA 373
             N  AK       Y R   E  R K    + +FY  DEP   E    V+  ++ + A  
Sbjct: 255 RDNKQAKTDAEKTAYYRAFAEHYRQKGWPAQLFFYAKDEP-KPEDVPLVKAQSTRVRA-- 311

Query: 374 PDARVLTTYYCGPSDAPLGPTPFESFVKVPK----FLRPHTQIYCTSEWVLGNREDLVKD 429
             A  +      P D  L  +   + +  P     + RP  Q   T       R+ L   
Sbjct: 312 --AGGIPVLVTSPLDDALNSS---ADILTPTLNCFYPRPGPQTCPTVLEATALRKRLSG- 365

Query: 430 IVTELQPENGEEWWTYVCM------GPSD------PHPNWHLGM---RGSQHRAVMWRVW 474
                   N   WW   C       GP D       +  W   M    G  +RA+    +
Sbjct: 366 --------NTRVWWYQSCNSHGCTGGPPDDKKVDAAYSGWASYMVDHSGPLNRAMGVLAF 417

Query: 475 KEGGTGFLYWGAN-CYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLR 530
             G  G LY+     Y     P  ++ F  G   GDG LFYPG    V  S  QPV +LR
Sbjct: 418 SSGVDGELYFDTVFAYNTKKDPWKDV-FEFGGN-GDGTLFYPGTPNRVGPSGHQPVLTLR 475

Query: 531 LERILSGLQDIEYLNL 546
           L+ I  GL+D EYL L
Sbjct: 476 LKHIRDGLEDYEYLRL 491


>gi|261408450|ref|YP_003244691.1| hypothetical protein GYMC10_4664 [Paenibacillus sp. Y412MC10]
 gi|261284913|gb|ACX66884.1| conserved hypothetical protein [Paenibacillus sp. Y412MC10]
          Length = 554

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 58/125 (46%), Gaps = 10/125 (8%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 497
           WTY C        N    +  +++R +  +++K    GFL+WG N +     ++A  P  
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRIIGIQLYKFNIEGFLHWGYNFWNSQYSKRAIDPFK 458

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                 G P GD  + YPGE       P+ S+RL+     LQD+  L L  ++ GR+  L
Sbjct: 459 VTDADCGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQDLRALQLLETQIGREAVL 513

Query: 558 ALLEK 562
             LE+
Sbjct: 514 QALEE 518


>gi|334364721|ref|ZP_08513701.1| hypothetical protein HMPREF9720_1375 [Alistipes sp. HGB5]
 gi|313159097|gb|EFR58472.1| hypothetical protein HMPREF9720_1375 [Alistipes sp. HGB5]
          Length = 514

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 60/248 (24%), Positives = 99/248 (39%), Gaps = 40/248 (16%)

Query: 305 DPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSSVR 363
           DPR+  Y   Y P L      ++++R  +    +   W   Y  ++ DEPLN E+ +S  
Sbjct: 264 DPRVQRYIAAYFPAL------QEHLRSRMIDDGSGRSWLDIYTQHIADEPLN-ENKTSWE 316

Query: 364 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 423
            +A ++   APD R++  Y     D  L        + VP+      +IY T        
Sbjct: 317 GLARQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFVWEIYRTMP------ 364

Query: 424 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 483
                          G   W Y CM P     N ++ +   + R + W  +K    G+L+
Sbjct: 365 --------------AGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYDSPGYLH 410

Query: 484 WGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIE 542
           WG N +     P  ++       P GD  + YPG      R+   S+RL  +  G++D +
Sbjct: 411 WGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLAAMRDGIRDYD 465

Query: 543 YLNLYASR 550
            L +  +R
Sbjct: 466 LLKMVEAR 473


>gi|197124590|ref|YP_002136541.1| hypothetical protein AnaeK_4209 [Anaeromyxobacter sp. K]
 gi|196174439|gb|ACG75412.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
          Length = 618

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 96/389 (24%), Positives = 150/389 (38%), Gaps = 47/389 (12%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 274
           V ++LTVW F LP+T SL +  G+S   +    GV    D       ++ +  L +R++ 
Sbjct: 186 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGVS--GDALSTLRGRYGQLALDHRVT- 242

Query: 275 FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIE 334
              R  +  R L +   +           + P   A +V Y   L  + G   +      
Sbjct: 243 -LSRIDDGNRDLAHFASFFGPLFDGGAATALPGAQATSVEY---LGGSSGYASWA----S 294

Query: 335 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 394
             +++    + + Y  DEP     +S +   A+   A +P  R L T     +DA    +
Sbjct: 295 FFQSRGWDDRLFQYTCDEPPLQCAWSDIPARAASARAVSPALRTLVTTTVQQADAAGVTS 354

Query: 395 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCMG---- 449
             +  V V  FL          E   G +     D      P    E WTY  CM     
Sbjct: 355 SIDVLVPVVNFLDDRA-----GERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHGCG 406

Query: 450 -------PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPS 496
                  PSD        P++ +     ++RA+ W  +    TG LY+           +
Sbjct: 407 GTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDPWN 466

Query: 497 AEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGR 553
            +  F      GDG LFYPG   +V  +++ PVAS+RL+ I  G++D EYL L       
Sbjct: 467 NQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGMEDYEYLKLLTD---L 520

Query: 554 DEGLALLEKTGTYQGPERYTIEHLPIDVM 582
            +G    E +G    P  Y  E  P D++
Sbjct: 521 GDGALAREISGALF-PHPYDAEVSPADLL 548


>gi|329927960|ref|ZP_08281988.1| hypothetical protein HMPREF9412_1819 [Paenibacillus sp. HGF5]
 gi|328938179|gb|EGG34575.1| hypothetical protein HMPREF9412_1819 [Paenibacillus sp. HGF5]
          Length = 554

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 58/125 (46%), Gaps = 10/125 (8%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 497
           WTY C        N    +  +++R +  +++K    GFL+WG N +     ++A  P  
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRIIGIQLYKFNIEGFLHWGYNFWNSQYSKRAIDPFK 458

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                 G P GD  + YPGE       P+ S+RL+     LQD+  L L  ++ GR+  L
Sbjct: 459 VTDADCGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQDLRALQLLETQIGREAVL 513

Query: 558 ALLEK 562
             LE+
Sbjct: 514 QALEE 518


>gi|157370810|ref|YP_001478799.1| hypothetical protein Spro_2570 [Serratia proteamaculans 568]
 gi|157322574|gb|ABV41671.1| conserved hypothetical protein [Serratia proteamaculans 568]
          Length = 556

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 54/125 (43%), Gaps = 9/125 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV-----PSA 497
           W Y C        N       +++R +  ++++   TGFL+WG N Y          P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGIQLYRYNITGFLHWGFNFYNSGHSREQLNPYA 459

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
               R   P GD  + YPGE  +    PV SLRL  +  GLQD+  L L  S   R    
Sbjct: 460 VTDCRNAFPSGDAFVVYPGEDLT----PVESLRLRVLHQGLQDMRALQLLESLTDRATVE 515

Query: 558 ALLEK 562
            L+E+
Sbjct: 516 TLIEQ 520


>gi|383457370|ref|YP_005371359.1| hypothetical protein COCOR_05406 [Corallococcus coralloides DSM
           2259]
 gi|380730349|gb|AFE06351.1| hypothetical protein COCOR_05406 [Corallococcus coralloides DSM
           2259]
          Length = 561

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 138/368 (37%), Gaps = 52/368 (14%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRIS- 273
           V  S+ V  F LPAT SLP   GIS   I    G+   S E    L ++ K LL++R+S 
Sbjct: 153 VPFSVEVQPFELPATSSLPNSFGISLYSIAKGHGIPAESPEAKALLREYGKALLEHRVSA 212

Query: 274 ------PFFCRWGESMRVLTYTCPWPADHPKSDEYF--SDPRLAAYAVPYSPVLSSNDGA 325
                 P   ++ +   VL +T       P  D     S  R     V  +   S++   
Sbjct: 213 HGMGMDPPPVKFQDGRAVLDWTAYDAEMAPFLDGSVLPSGARFTTVDVRDNRKASTDAEK 272

Query: 326 KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCG 385
             Y R   +    K    + +FY  DEP   + +  V+  +  +     D  VL T    
Sbjct: 273 SAYYRAFQKHFADKGWKAQLFFYAKDEPKPAD-FPLVKAQSKRVRDAGADIPVLVTTSLN 331

Query: 386 PSDAPLGPTPFESFVKVPKFLRP------HTQIYCTSEWVLGNREDLVKDIVTELQPENG 439
           P  A LG     + +    + R       H     T+   LG+R  +             
Sbjct: 332 P--ALLGSVDILTPIINCFYPREGPQTCHHVANAATARTKLGSRAKV------------- 376

Query: 440 EEWWTYVCM------GPSD------PHPNWHLGMR---GSQHRAVMWRVWKEGGTGFLYW 484
             WW   CM      GP +       + +W   M       +RA+    ++ G  G LY+
Sbjct: 377 --WWYQSCMAHGCTGGPPEDKALDKAYSDWASYMVDHPAPLNRAMGPLAFESGVDGELYF 434

Query: 485 GANCYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDI 541
               +   T              GDG LFYPG   ++  +  QPV SLRL+ I  GL+D 
Sbjct: 435 D-TVFAYNTKKDVWTDLFEFGGNGDGTLFYPGTPAKLGGTEHQPVVSLRLKHIRDGLEDY 493

Query: 542 EYLNLYAS 549
           EYL L  S
Sbjct: 494 EYLRLLTS 501


>gi|220919313|ref|YP_002494617.1| hypothetical protein A2cp1_4234 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219957167|gb|ACL67551.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 618

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 96/389 (24%), Positives = 149/389 (38%), Gaps = 47/389 (12%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 274
           V ++LTVW F LP+T SL +  G+S   +    GV    D       ++ +  L +R++ 
Sbjct: 186 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGVS--GDALSTLRGRYGQLALDHRVT- 242

Query: 275 FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIE 334
              R  +  R L +   +           S P   A +V Y   L  + G   +      
Sbjct: 243 -LSRIDDGNRDLAHFASFFGPLFDGGAATSLPGAQATSVEY---LGGSSGYASWA----S 294

Query: 335 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 394
             +++    + + Y  DEP     +  +   A+   A +P  R L T     +DA    +
Sbjct: 295 FFQSRGWDDRLFQYTCDEPPLQCAWGDIPARAASARAVSPALRTLVTTTVQQADAAGVTS 354

Query: 395 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCMG---- 449
             +  V V  FL          E   G +     D      P    E WTY  CM     
Sbjct: 355 SIDVLVPVVNFLDDRA-----GERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHGCG 406

Query: 450 -------PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPS 496
                  PSD        P++ +     ++RA+ W  +    TG LY+           +
Sbjct: 407 GTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDPWN 466

Query: 497 AEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGR 553
            +  F      GDG LFYPG   +V  +++ PVAS+RL+ I  G++D EYL L       
Sbjct: 467 NQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGMEDYEYLKLLTD---L 520

Query: 554 DEGLALLEKTGTYQGPERYTIEHLPIDVM 582
            +G    E +G    P  Y  E  P D++
Sbjct: 521 GDGALAREISGALF-PHPYDAEVSPADLL 548


>gi|354580346|ref|ZP_08999251.1| hypothetical protein PaelaDRAFT_0352 [Paenibacillus lactis 154]
 gi|353202777|gb|EHB68226.1| hypothetical protein PaelaDRAFT_0352 [Paenibacillus lactis 154]
          Length = 552

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 65/245 (26%), Positives = 101/245 (41%), Gaps = 51/245 (20%)

Query: 328 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPS 387
           ++++ ++L+R      + +F++ DEP ++EH  + R  A  +                  
Sbjct: 311 FLKELVQLIRGLGIEDRIFFHVSDEP-HLEHLETYRKAAEIV------------------ 351

Query: 388 DAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIV---TELQP--ENG-EE 441
           D  +G  P               +I   S++    +E LV + +    +LQP  E+G   
Sbjct: 352 DVAVGDYP---------------RIDALSDYAF-YKEGLVPNPIPATDKLQPFLESGVAP 395

Query: 442 WWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC----YEKATV-PS 496
            WTY C        N        ++R +  +++K    GFL+WG N     Y K  V P 
Sbjct: 396 LWTYYCCSQYKQVANRFFSFPSERNRILGLQLYKYRIKGFLHWGFNFWNSQYSKRPVNPY 455

Query: 497 AEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEG 556
                  G P GD  L YPGE       PV SLR++     LQD+  L L     GRDE 
Sbjct: 456 LTTDADIGYPSGDAFLVYPGE-----DGPVCSLRMKVFREALQDLRALELLEQTIGRDEV 510

Query: 557 LALLE 561
           L ++E
Sbjct: 511 LNMIE 515


>gi|336426499|ref|ZP_08606509.1| hypothetical protein HMPREF0994_02515 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336010934|gb|EGN40914.1| hypothetical protein HMPREF0994_02515 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 562

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 55/125 (44%), Gaps = 10/125 (8%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV-----PSA 497
           WTY C G      N    M   ++R +  +++K    GFL WG N +          P  
Sbjct: 404 WTYYCCGQFREVSNRFFCMPSQRNRILGVQLYKYQIHGFLQWGFNFWNSMLSRYPINPYC 463

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                   P GD  L YPGE       P+AS+R E ++ GLQD+  L L      R+  L
Sbjct: 464 VTDAACAFPSGDASLVYPGE-----DGPIASIRAEVLMEGLQDMRALQLLEQLTDRETVL 518

Query: 558 ALLEK 562
           ALLE+
Sbjct: 519 ALLEE 523


>gi|414591642|tpg|DAA42213.1| TPA: hypothetical protein ZEAMMB73_799052 [Zea mays]
          Length = 583

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 12/58 (20%)

Query: 294 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 339
           ADHPK++EY+SDPRLAAY VPY+P+LS            S D AK  +R+E+E +  K
Sbjct: 337 ADHPKANEYYSDPRLAAYVVPYAPILSCLLLYLIWLLVNSTDAAKSSLRREVEGVSKK 394


>gi|153003376|ref|YP_001377701.1| hypothetical protein Anae109_0503 [Anaeromyxobacter sp. Fw109-5]
 gi|152026949|gb|ABS24717.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
          Length = 608

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 100/393 (25%), Positives = 150/393 (38%), Gaps = 54/393 (13%)

Query: 212 SVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQY 270
           S  V ++LTVW F LP+T SL +  G +   I    GV  G+ + + AL + +  L L +
Sbjct: 172 SATVPVTLTVWPFTLPSTASLKSAFGFTYGAIPGGHGV--GAADAFAALRERYGRLALDH 229

Query: 271 RISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVR 330
           RI+      G +      +   PA    +       R+ +Y + Y     S      Y  
Sbjct: 230 RITLSHVDDGSAAIDHAASLYGPAMDGAAPTALRGARMTSYELLYDAKSWST-----YFD 284

Query: 331 KEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAP 390
            E  L R        + Y  DEP     +S +   A+   A   + R L T     +DA 
Sbjct: 285 GEGWLDRL-------FQYTCDEPPLTCAWSDIPARAATARAA--NVRTLVTTSIQEADAQ 335

Query: 391 LGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCMG 449
                 +  V V  +L      Y       GN+     D      P    E W Y  CM 
Sbjct: 336 GVTGSIDVIVPVINYLDDREGTYA------GNQR-AKYDAFLAGSPR--RELWAYQSCMS 386

Query: 450 P-------------SDPH----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA 492
                         SD +    P++ +     ++RA+ W  ++   TG LYW        
Sbjct: 387 HGCGGTVNFGSPSWSDRYFTGWPSYMIDASAVRNRAMEWLSFRYRVTGELYWETAYAYSH 446

Query: 493 TVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLNLYAS 549
              + +  F      GDG LFYPG   ++  ++  PVAS+RL+ I  G++D EYL L + 
Sbjct: 447 DAWTNQWDFNGN---GDGTLFYPGTPAKIGGTTHVPVASIRLKMIREGMEDYEYLKLLSD 503

Query: 550 RYGRDEGLALLEKTGTYQGPERYTIEHLPIDVM 582
               DE  A       +  P  Y+ E  P  +M
Sbjct: 504 --AGDEAFAREVADALF--PNPYSTEVAPATLM 532


>gi|354580446|ref|ZP_08999351.1| hypothetical protein PaelaDRAFT_0452 [Paenibacillus lactis 154]
 gi|353202877|gb|EHB68326.1| hypothetical protein PaelaDRAFT_0452 [Paenibacillus lactis 154]
          Length = 554

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 10/120 (8%)

Query: 440 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATV 494
           E  WTY C        N    +  +++R +  +++K    GFL+WG N +      +A  
Sbjct: 396 EGLWTYYCCSQYKEVSNRFFNLPSARNRILGMQLYKYNIEGFLHWGYNFWYSQYSRRAID 455

Query: 495 PSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
           P        G P GD  L YPGE       PV S+RL+     LQD+  L L   + GR+
Sbjct: 456 PYRVTDADSGFPSGDAFLVYPGE-----DGPVESIRLKVFHEALQDLRALQLLEQQIGRE 510


>gi|375308735|ref|ZP_09774018.1| hypothetical protein WG8_2543 [Paenibacillus sp. Aloe-11]
 gi|375079362|gb|EHS57587.1| hypothetical protein WG8_2543 [Paenibacillus sp. Aloe-11]
          Length = 570

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 125/600 (20%), Positives = 208/600 (34%), Gaps = 118/600 (19%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAA-RNERESVQIALRPKVSWSSSSTAGVVQVQCS---- 55
           + S   V P   P  L  ++ L   +NE  S Q+A R    +  S     VQ+  +    
Sbjct: 15  ISSLEKVFPVNKPLELAEVHELGLLKNESISYQLAYRYTRPYHMSQMEATVQLPKNPLVK 74

Query: 56  -DLCSASGDRLVVGQSLMLRRVVPMLG-------------VPDALVPLDLPVCQISLIPG 101
            ++ S   D + V Q   +  + P                 PD L P D     +  IP 
Sbjct: 75  VEVLSPISDLVNVRQVKCVPSMYPAYAETDDHYLTTEAGLFPDVLEPFD---GMLQFIPY 131

Query: 102 ETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKG-------EKHRLFMELR- 153
           +  ++W+ +  P A   G Y   I  T+     ++ Q +          E+  L  E   
Sbjct: 132 QWRSLWIEVTVPKAAKAGKYPVIIQFTATEGIVVNRQRMTIHIMEGVLPEQQLLHTEWLH 191

Query: 154 -NCLDNVEPIE--GKPLHEVVERAKSTATTLR-RVIFSPLFSEFFSDNGPIDMMDEDAIS 209
            +CL +   +E  G+    ++     TA      V+ +PLF+       P+D      I 
Sbjct: 192 GDCLADYYGVEVFGEEHWRILGNFIQTAAKRGINVLLTPLFTP------PLDTF----IG 241

Query: 210 NLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQ 269
           N    ++L                  + +S  V   RFG     D+W +  +     +  
Sbjct: 242 NERTTIQL------------------VDVSYDVNGYRFGF-DKLDKWVQISEA--VGITH 280

Query: 270 YRISPFFCRWGESM--RVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKD 327
           + ++  F +WG     +++      P          +DP   ++   + P L+       
Sbjct: 281 FEMAHLFSQWGAKYAPKIIVEVGGVPEQRFGWHTPANDPEFRSFLAAFLPALT------- 333

Query: 328 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPS 387
                 E L      +++ F++ DEP           +A  LH Y    + +  Y  G  
Sbjct: 334 ------ERLHQLGIAERSLFHISDEP-----------VAGNLHTYLEAKQFVAPYLEG-- 374

Query: 388 DAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVC 447
                  P    +   +F R         + V G+      D +     E     W Y C
Sbjct: 375 ------FPIIDAISDVEFYRRG----IIDQPVAGS------DTIHNFIDEGASNLWVYYC 418

Query: 448 MGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFR 502
            G +    N  L M  S++R +  +++K    GFL+WG N Y      K   P  +    
Sbjct: 419 CGQNLHVSNRFLAMPSSRNRILGVQMYKYRIKGFLHWGFNFYNSQYSLKKLNPYVDTAAL 478

Query: 503 RGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
              P GD  L YP E        + SLRL  +   L D+  L    S   R+  L+L+E+
Sbjct: 479 DTFPSGDSFLVYPSE----DGTALESLRLVVLQDALTDLRALQYLESLSSREFVLSLIEE 534


>gi|414591657|tpg|DAA42228.1| TPA: hypothetical protein ZEAMMB73_522235 [Zea mays]
          Length = 446

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/58 (48%), Positives = 37/58 (63%), Gaps = 12/58 (20%)

Query: 294 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 339
           ADHPK++EY+SDPRLA Y VPY+P+LS            S D AK  +R+E+E +  K
Sbjct: 200 ADHPKANEYYSDPRLATYVVPYAPILSCLLLYLIWLLVNSTDAAKSSLRREVEGVSKK 257


>gi|295798170|emb|CAX69035.1| Putative uncharacterized protein precursor [uncultured bacterium]
          Length = 563

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/359 (23%), Positives = 128/359 (35%), Gaps = 46/359 (12%)

Query: 212 SVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFG--VRHGSD----EWYEALDQHFK 265
           S  V   L VWDF LP TPS+    G     ++  +   V+ G D    +W    +Q  +
Sbjct: 182 SKTVSARLKVWDFALPQTPSMQTSFGSPAGRMKSWYANHVKVGKDAPIKDWTAVEEQCAQ 241

Query: 266 WLLQYRISPFF-CRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSP-----VL 319
            L ++RI+        E  +    T     +   +   F D R    A+  S      ++
Sbjct: 242 LLAEHRINATPPDELLEPQKQGDGTWRISEEKLNALGQFID-RYHVNALDVSKNFIFGII 300

Query: 320 SSNDGAKDYVRKEIELLRTKAHWKKA-----YFYLWDEPLNMEHYSSVRNMASELHAYAP 374
              D A+D +R  ++     A          Y YL DEP + E Y  VR     +     
Sbjct: 301 KDPDAARDEIRTRLKAFEMAAKQLNRPNLLFYVYLTDEPNDPEAYDYVRKWGKAIKEANS 360

Query: 375 DARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTEL 434
             +V+ T    P     G       +  P F                    L +      
Sbjct: 361 VVKVMITEQSTPQKTEWGDLYGAVDIWCPLF-------------------PLFEQGNAAR 401

Query: 435 QPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV 494
           +   GE  W Y  +   +P P WH+      +R   W  W+    G LYWG   +   T 
Sbjct: 402 RQALGETVWAYTALCQRNPTPWWHIDYPLLNYRVPAWISWRYRIRGLLYWGGMSFWNETG 461

Query: 495 PSAEIRFRRG------LPPGDGVLFYPGEVFSSSRQPVA-SLRLERILSGLQDIEYLNL 546
                 +  G      +  G+G L YPG    +    +A SLRL+ +  G++D EYL +
Sbjct: 462 DPWRDAWTYGHKKSMLVYNGEGTLVYPGR--KAGYDGIAPSLRLKALRDGIEDYEYLTI 518


>gi|315648573|ref|ZP_07901671.1| hypothetical protein PVOR_25096 [Paenibacillus vortex V453]
 gi|315276052|gb|EFU39399.1| hypothetical protein PVOR_25096 [Paenibacillus vortex V453]
          Length = 554

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 10/125 (8%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSA 497
           WTY C        N    +  +++R +  +++K    GFL+WG N +      +A  P  
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRILGIQLYKYNIEGFLHWGYNFWNSQYSRRAIDPFQ 458

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                 G P GD  + YPGE       P+ S+RL+     LQD+  L L  +  GR+  +
Sbjct: 459 VTDADGGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQDLRALQLLETLIGREAVM 513

Query: 558 ALLEK 562
             LE+
Sbjct: 514 QSLEE 518


>gi|333994386|ref|YP_004526999.1| hypothetical protein TREAZ_0869 [Treponema azotonutricium ZAS-9]
 gi|333734735|gb|AEF80684.1| conserved hypothetical protein [Treponema azotonutricium ZAS-9]
          Length = 604

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 63/140 (45%), Gaps = 9/140 (6%)

Query: 428 KDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGAN 487
            D +T       +  W Y C+G S   PN  + +   + RA+   ++     GFL WG N
Sbjct: 435 NDAITPFLEAGIKNLWVYYCVGQSRRVPNRFIALPSPRTRAMGVLMYLYNIAGFLQWGYN 494

Query: 488 CYEKATVPSAEIRFRR--GL---PPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIE 542
            Y  A   S    + +  GL   P GD  L YPG    +  +PV+S+  E    GL+D+ 
Sbjct: 495 YYYSALSKSLVDPYLKTGGLKDWPGGDPFLVYPG----ADGKPVSSIHAEAHREGLEDMR 550

Query: 543 YLNLYASRYGRDEGLALLEK 562
            L L  +  GRD  L ++ +
Sbjct: 551 ILALAETIKGRDAVLKIINE 570


>gi|317048122|ref|YP_004115770.1| hypothetical protein Pat9b_1898 [Pantoea sp. At-9b]
 gi|316949739|gb|ADU69214.1| conserved hypothetical protein [Pantoea sp. At-9b]
          Length = 556

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 57/124 (45%), Gaps = 9/124 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSA 497
           WTY C        N  +     ++R +  ++W     GFL+WG N Y      +A  P A
Sbjct: 402 WTYYCCAQYLDVANRFMAQPSVRNRILGVQLWLYRIEGFLHWGFNFYNSELSREAIDPFA 461

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                +  P GD  L YPG+ F+    P+ S+RL+ +L  +QD+  L    S  GR    
Sbjct: 462 VTDGLQAFPAGDPFLVYPGKDFT----PLPSIRLKVLLEAMQDLRALQCLESLQGRAAVE 517

Query: 558 ALLE 561
           AL++
Sbjct: 518 ALVD 521


>gi|383454162|ref|YP_005368151.1| hypothetical protein COCOR_02162 [Corallococcus coralloides DSM
           2259]
 gi|380728520|gb|AFE04522.1| hypothetical protein COCOR_02162 [Corallococcus coralloides DSM
           2259]
          Length = 631

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 89/379 (23%), Positives = 137/379 (36%), Gaps = 77/379 (20%)

Query: 206 DAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDT-VIEDRFGVRHGSDEWYEALDQHF 264
           +A      +V   LTV D ++P+T SL +   +  T V     G    +    + L   +
Sbjct: 169 EAEGGFQRQVTARLTVVDAVMPSTSSLASAFPLLPTQVCRAHLGRNDCTPAELQPLLVRY 228

Query: 265 KWL-LQYRI---------------SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRL 308
           + L L++R+               S F+  WG S   L  T P            S  R+
Sbjct: 229 QQLSLEHRLTQPRLFLSGSGAQAWSDFYATWGPS---LDGTAP---------SRLSGARM 276

Query: 309 AA--YAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMA 366
            +  Y  P++       G  D+     E    +    +A+  + DEP +   +  V+   
Sbjct: 277 TSVEYTGPFT-----AGGLADFAGHMSE----RGWLARAHAKIGDEPFDATTFQQVQATG 327

Query: 367 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 426
           + +   AP  R + T     +   L     E  V +   L  H +         G   D 
Sbjct: 328 TLVRQAAPGLRTMLTV----NSMQLKLNGLEPLVDIAVPLVNHLE---------GTTPDF 374

Query: 427 VKD---IVTELQPENGEEWWTYVCMG--------------PSDPHPNWHLGMRGSQHRAV 469
           V D            G E W Y                  P    P++ +    ++ RA+
Sbjct: 375 VGDQSPTYAGFLSRPGTELWMYQSCASHGCAPGSLMPENQPGSGWPSYMVDRSSAKARAM 434

Query: 470 MWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGE---VFSSSRQPV 526
            W  ++ G  G LY+ A     A +P+A          GDG LFYPG    +   +  PV
Sbjct: 435 EWLAFRFGAKGELYYEAG----AMLPTAWTDQYHFGGNGDGTLFYPGTPAVIGGQTDVPV 490

Query: 527 ASLRLERILSGLQDIEYLN 545
           ASLRL+ I  GLQD E+L 
Sbjct: 491 ASLRLKLIRQGLQDYEWLK 509


>gi|374324118|ref|YP_005077247.1| hypothetical protein HPL003_21470 [Paenibacillus terrae HPL-003]
 gi|357203127|gb|AET61024.1| hypothetical protein HPL003_21470 [Paenibacillus terrae HPL-003]
          Length = 570

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 57/139 (41%), Gaps = 9/139 (6%)

Query: 429 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 488
           D +     E     W Y C G +    N  L M  S++R +  +++K    GFL+WG N 
Sbjct: 400 DTIHNFIDEGASNLWVYYCCGQNLHVSNRFLAMPSSRNRILGVQMYKYRIKGFLHWGFNF 459

Query: 489 YE-----KATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEY 543
           Y      K   P  +       P GD  L YP E        + SLRL  +   L D+  
Sbjct: 460 YNSQYSLKKLNPYVDTAALDTFPSGDSFLVYPSE----DGTALESLRLVVLQDALTDLRA 515

Query: 544 LNLYASRYGRDEGLALLEK 562
           L    S   R+  L+L+E+
Sbjct: 516 LQYLESLSNREFVLSLIEE 534


>gi|403380550|ref|ZP_10922607.1| hypothetical protein PJC66_12097 [Paenibacillus sp. JC66]
          Length = 555

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 59/125 (47%), Gaps = 9/125 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-YEKATV----PSA 497
           WTY C    +   N  + M   ++R + ++++K    GFL+WG N  Y + ++    P  
Sbjct: 399 WTYYCCSQYEEVSNRFIDMPSWRNRILGFQLYKFQIRGFLHWGYNFWYSQYSIRPINPYQ 458

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
           +       P GD  + YPG    S+ + V SLRL+      QD+  L    S  GRD+ L
Sbjct: 459 QTDANYAFPSGDPFVVYPG----SNGEAVLSLRLKVFYDAFQDLRALEKLESYIGRDQVL 514

Query: 558 ALLEK 562
             LE+
Sbjct: 515 LWLEE 519


>gi|302388391|ref|YP_003824213.1| hypothetical protein Closa_4081 [Clostridium saccharolyticum WM1]
 gi|302199019|gb|ADL06590.1| conserved hypothetical protein [Clostridium saccharolyticum WM1]
          Length = 552

 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 56/125 (44%), Gaps = 9/125 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE----KATVPSAE 498
           W+Y C        N  + M  +++RA   +V+K G  G L+WG N Y     +  +   E
Sbjct: 399 WSYYCTAQCVDVSNRFMAMPSARNRAYGLQVYKYGMEGILHWGFNFYNSEHSRHHINPYE 458

Query: 499 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
           +    G  P GD  L YPG    S   P  S+RL  +    QD   L L  S  G++  +
Sbjct: 459 VTDCEGSFPSGDAFLVYPG----SDGIPEESIRLMVLCEAKQDHRALKLLESHIGKEAVI 514

Query: 558 ALLEK 562
            LLE+
Sbjct: 515 RLLEE 519


>gi|430747974|ref|YP_007207103.1| hypothetical protein Sinac_7369 [Singulisphaera acidiphila DSM
           18658]
 gi|430019694|gb|AGA31408.1| hypothetical protein Sinac_7369 [Singulisphaera acidiphila DSM
           18658]
          Length = 577

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 67/286 (23%), Positives = 104/286 (36%), Gaps = 56/286 (19%)

Query: 269 QYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFS--DPRLAAYAVPYSPVLSSNDGAK 326
           Q+  S  +  WG    +  Y         + D+Y    +P ++ ++  Y   L      K
Sbjct: 295 QFEWSHLWIYWGVENPMRIY-------KKEGDQYVMLWEPTISGFSDTYVNFL------K 341

Query: 327 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGP 386
            ++ +  + L  +   + +YF+L DEP   +H  + R     L   AP  +V+       
Sbjct: 342 QFLPEFKKFLTEEKMLETSYFHLSDEPGPGQHVQNYRRARQILREIAPWMKVMDAL---- 397

Query: 387 SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYV 446
           SD   G    E    +P  L    Q Y  ++                  P      W Y 
Sbjct: 398 SDIEYGK---EGLTDIPIPLVSAAQAYIDAK-----------------IPH-----WVYY 432

Query: 447 CMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR--- 503
           C GP+ P  N  +    S+ R   W  ++    GFL+WG N ++K     A   F     
Sbjct: 433 CCGPTGPWLNRFMDTPLSKIRMSGWLFYRHEAKGFLHWGFNYWDKMEREEAGDPFHDGSN 492

Query: 504 ----GLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLN 545
               G+P GD  + YPG        P+ S+R E     LQD   L 
Sbjct: 493 ASYPGIPFGDPFVIYPG-----PDGPIDSIRWEVFAESLQDYAILQ 533


>gi|383814347|ref|ZP_09969768.1| hypothetical protein SPM24T3_08339 [Serratia sp. M24T3]
 gi|383296757|gb|EIC85070.1| hypothetical protein SPM24T3_08339 [Serratia sp. M24T3]
          Length = 517

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 55/123 (44%), Gaps = 9/123 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV-----PSA 497
           WTY C        N    +   ++R +  +++    TGFL+WG N Y          P A
Sbjct: 354 WTYYCCVQKLEVSNRFFALPSYRNRIIGVQLYLYSITGFLHWGFNFYNSGHSREHLDPFA 413

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
               +   P GD  + YPG+ +    QP+ SLRL  +   LQD+  L L  +   RD+  
Sbjct: 414 ITDGQGAFPSGDLFVVYPGQDY----QPIESLRLMVLREALQDLRALQLLEALTSRDQVE 469

Query: 558 ALL 560
           AL+
Sbjct: 470 ALI 472


>gi|108758339|ref|YP_630857.1| hypothetical protein MXAN_2638 [Myxococcus xanthus DK 1622]
 gi|108462219|gb|ABF87404.1| hypothetical protein MXAN_2638 [Myxococcus xanthus DK 1622]
          Length = 561

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 100/366 (27%), Positives = 141/366 (38%), Gaps = 51/366 (13%)

Query: 215 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRIS- 273
           V  +  V  F LPAT SLP   G+S   I    GV   S E  + L  + + LL++R+S 
Sbjct: 153 VPFTAEVQPFALPATSSLPNSFGVSMYSIARGHGVAPESAESRKLLRDYGRALLEHRVSA 212

Query: 274 ------PFFCRWGESMRVLTYTCPWPADHPKSDEYF--SDPRLAAYAVPYSPVLSSNDGA 325
                 P   R+ +   VL +        P  D     S  R  +  V  S    ++   
Sbjct: 213 HGMSMTPPPVRFEKGRAVLDWREYDAEMAPFLDGSLLPSGARFTSTDVRDSKQAKTDAEK 272

Query: 326 KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTT---- 381
             Y R   E  R K    + +FY  DEP   E    V+  ++ + A      ++T+    
Sbjct: 273 TAYYRAFAEHFREKGWPAQLFFYAKDEP-KPEDVPLVKAQSTRVRAAGKIPVLVTSPLDD 331

Query: 382 YYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEE 441
              G +D  L PT    F     + RP  Q   +       R+ L  D            
Sbjct: 332 ALNGSADI-LTPT-LNCF-----YPRPGPQTCRSVVEARALRKRLAGDTRV--------- 375

Query: 442 WW-----TYVCMG--PSD-----PHPNWHLGMR---GSQHRAVMWRVWKEGGTGFLYWGA 486
           WW     ++ C G  P+D      +  W   M       +RA+    +  G  G LY+  
Sbjct: 376 WWYQSCNSHGCNGGPPADKAVDAAYSGWASYMVDHPAPLNRAMGVLAFSSGVDGELYFDT 435

Query: 487 N-CYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIE 542
              Y     P  ++ F  G   GDG LFYPG    V  S  QP+ +LRL+ I  GL+D E
Sbjct: 436 VFAYNTKKDPWKDV-FEFG-GNGDGTLFYPGTPARVGPSGHQPILTLRLKHIRDGLEDYE 493

Query: 543 YLNLYA 548
           YL L A
Sbjct: 494 YLRLLA 499


>gi|310817408|ref|YP_003949766.1| hypothetical protein STAUR_0130 [Stigmatella aurantiaca DW4/3-1]
 gi|309390480|gb|ADO67939.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 650

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 58/219 (26%), Positives = 92/219 (42%), Gaps = 31/219 (14%)

Query: 344 KAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVP 403
           +AY ++ DEP     + ++R  A      AP+ R L T     +   L     E  + V 
Sbjct: 300 RAYDFVGDEPPYGISFEALRQNAELTRQVAPELRTLVTT----NSRELDKYALEDLMDVA 355

Query: 404 KFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCMG------------- 449
             +  H     T+    G++     D ++      G E W Y  CM              
Sbjct: 356 APVVNHMD--GTAPPFQGDQRATYHDFLSL----PGRELWLYQSCMSHGCAYGTNAPENQ 409

Query: 450 PSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 509
           P    P++ +    ++ RA+ W  + EG +G LY+       A+  + + RF      GD
Sbjct: 410 PGAGWPSYMVDRSAAKARAMEWVTFLEGASGELYY-QTVGMLASAWTDQFRFNGN---GD 465

Query: 510 GVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLN 545
           G LFYPG    +  ++  PVAS+RL+ I  G+QD E+L 
Sbjct: 466 GTLFYPGTPAAIGGATDVPVASIRLKLIRLGVQDYEWLK 504


>gi|115376571|ref|ZP_01463803.1| hypothetical protein STIAU_2875 [Stigmatella aurantiaca DW4/3-1]
 gi|115366439|gb|EAU65442.1| hypothetical protein STIAU_2875 [Stigmatella aurantiaca DW4/3-1]
          Length = 620

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 59/219 (26%), Positives = 94/219 (42%), Gaps = 31/219 (14%)

Query: 344 KAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVP 403
           +AY ++ DEP     + ++R  A      AP+ R L T     +   L     E  + V 
Sbjct: 270 RAYDFVGDEPPYGISFEALRQNAELTRQVAPELRTLVTT----NSRELDKYALEDLMDVA 325

Query: 404 KFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCM------GPSDPH-- 454
             +  H     T+    G++     D ++      G E W Y  CM      G + P   
Sbjct: 326 APVVNHMD--GTAPPFQGDQRATYHDFLSL----PGRELWLYQSCMSHGCAYGTNAPENQ 379

Query: 455 -----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 509
                P++ +    ++ RA+ W  + EG +G LY+       A+  + + RF      GD
Sbjct: 380 PGAGWPSYMVDRSAAKARAMEWVTFLEGASGELYY-QTVGMLASAWTDQFRFNGN---GD 435

Query: 510 GVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLN 545
           G LFYPG    +  ++  PVAS+RL+ I  G+QD E+L 
Sbjct: 436 GTLFYPGTPAAIGGATDVPVASIRLKLIRLGVQDYEWLK 474


>gi|197121796|ref|YP_002133747.1| hypothetical protein AnaeK_1387 [Anaeromyxobacter sp. K]
 gi|196171645|gb|ACG72618.1| Myxococcales GC_trans_RRR domain protein [Anaeromyxobacter sp. K]
          Length = 609

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 52/97 (53%), Gaps = 6/97 (6%)

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 514
           P++ +    S++RA+ W  + E  +G LYW      +A   S +  F      GDG LFY
Sbjct: 412 PSYMVDASASRNRAMEWITFLERASGELYWETAYSFRADPWSRQWDFSGN---GDGTLFY 468

Query: 515 PGE---VFSSSRQPVASLRLERILSGLQDIEYLNLYA 548
           PG+   +   +  PVAS+RL+ I +G+QD EYL   A
Sbjct: 469 PGKPARIGGKTDVPVASVRLKMIRAGMQDYEYLKALA 505


>gi|220916589|ref|YP_002491893.1| hypothetical protein A2cp1_1483 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219954443|gb|ACL64827.1| Myxococcales GC_trans_RRR domain protein [Anaeromyxobacter
           dehalogenans 2CP-1]
          Length = 609

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 52/97 (53%), Gaps = 6/97 (6%)

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 514
           P++ +    S++RA+ W  + E  +G LYW      +A   S +  F      GDG LFY
Sbjct: 412 PSYMVDASASRNRAMEWITFLERASGELYWETAYSFRADPWSRQWDFSGN---GDGTLFY 468

Query: 515 PGE---VFSSSRQPVASLRLERILSGLQDIEYLNLYA 548
           PG+   +   +  PVAS+RL+ I +G+QD EYL   A
Sbjct: 469 PGKPARIGGKTDVPVASVRLKMIRAGMQDYEYLKALA 505



 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 59/142 (41%), Gaps = 25/142 (17%)

Query: 3   STANVGPQEMPRPLEPINLLAARNERESVQIALR-------PKVSWSSSSTAGVVQVQCS 55
           +T  + P    RP    +L AARNE  + Q+ +         +V       A + +V   
Sbjct: 34  ATEKIRPDAKARPQTEAHLSAARNEFAAFQVVVTGPAKRVTARVEGLDGMDATLFRVDTL 93

Query: 56  DLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPL--DLPVCQISLIP----GETTAVWVS 109
           D+ S S      G+             PDALVP   D+   Q +  P     E+ AVWV 
Sbjct: 94  DVTSPSAVDGGTGR------------WPDALVPDVDDVVGEQRNAFPFDVGAESRAVWVD 141

Query: 110 IDAPYAQPPGLYEGEIIITSKA 131
           +  P     G+Y+G ++I+S A
Sbjct: 142 VHVPADARSGVYQGAVVISSDA 163


>gi|453063163|gb|EMF04147.1| hypothetical protein F518_19198 [Serratia marcescens VGH107]
          Length = 556

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 63/150 (42%), Gaps = 10/150 (6%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 497
           W Y C        N       +++R +  +++     GFL+WG N Y  A       P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGVQLYLYRIAGFLHWGFNFYNSAHSRERINPYA 459

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                   P GD  + YPGE      QPV SLRL  +  GLQD+  L L  S   R    
Sbjct: 460 VTDSGHAFPSGDPFVVYPGE----DLQPVESLRLRVLHQGLQDMRALQLLESLTDRATVE 515

Query: 558 ALLE-KTGTYQGPERYTIEHLPIDVMRGEI 586
           AL+E + G     +RY  +  P+  +R  +
Sbjct: 516 ALIESELGMRITFKRYPHQAGPLLRLRDAV 545


>gi|448242314|ref|YP_007406367.1| hypothetical protein SMWW4_v1c25510 [Serratia marcescens WW4]
 gi|445212678|gb|AGE18348.1| hypothetical protein SMWW4_v1c25510 [Serratia marcescens WW4]
          Length = 556

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 63/150 (42%), Gaps = 10/150 (6%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 497
           W Y C        N       +++R +  +++     GFL+WG N Y  A       P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGVQLYLYRIAGFLHWGFNFYNSAHSRERINPYA 459

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                   P GD  + YPGE      QPV SLRL  +  GLQD+  L L  S   R    
Sbjct: 460 VTDSGHAFPSGDPFVVYPGE----DLQPVESLRLRVLHQGLQDMRALQLLESLTDRATVE 515

Query: 558 ALLE-KTGTYQGPERYTIEHLPIDVMRGEI 586
           AL+E + G     +RY  +  P+  +R  +
Sbjct: 516 ALIESELGMRITFKRYPHQAGPLLRLRDAV 545


>gi|383452115|ref|YP_005366104.1| hypothetical protein COCOR_00091 [Corallococcus coralloides DSM
           2259]
 gi|380727263|gb|AFE03265.1| hypothetical protein COCOR_00091 [Corallococcus coralloides DSM
           2259]
          Length = 645

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/227 (26%), Positives = 91/227 (40%), Gaps = 31/227 (13%)

Query: 336 LRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTP 395
           ++ K    +AY  L DEP     ++ V      +   AP  R + T     +   L    
Sbjct: 296 MKAKGWLDRAYVQLGDEPPYGTPFAQVHATGELVRQAAPGLRTMLTT----NSRELKANG 351

Query: 396 FESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCM------ 448
            E  V     L  H  +  T     G++    +   T      G   W Y  CM      
Sbjct: 352 LEDAVDTAVPLVNH--LDGTDANFRGDQ----RGTYTRFLERPGTALWMYQSCMSHGCAY 405

Query: 449 GPSDPH-------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRF 501
           G + P        P++ L    ++ RA+ W  + +G TG LY+  +    +T  + + RF
Sbjct: 406 GTNAPENKPGAGWPSYMLDRSAAKARAMEWVTFLQGATGELYY-QSVGMLSTAWTDQYRF 464

Query: 502 RRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLN 545
                 GDG LFYPG    +   +  PVASLRL+ I  G+QD E+L 
Sbjct: 465 NGN---GDGTLFYPGTPEAIGGKTDVPVASLRLKLIRQGMQDYEWLK 508


>gi|374604784|ref|ZP_09677736.1| hypothetical protein PDENDC454_17493 [Paenibacillus dendritiformis
           C454]
 gi|374389614|gb|EHQ60984.1| hypothetical protein PDENDC454_17493 [Paenibacillus dendritiformis
           C454]
          Length = 536

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 65/139 (46%), Gaps = 14/139 (10%)

Query: 418 WVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEG 477
           WV  N++  +     E   ++G+  W Y C  P   + N  L     + R + W  +  G
Sbjct: 358 WVPTNKDYELNRDAYEAYRQSGDALWFYTCWNPGGEYLNRFLDFPLLKTRYLHWGNYLYG 417

Query: 478 GTGFLYWGANCYEKATVP--------SAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASL 529
             G+L+WG N Y     P        + ++  RR +P GD  + YPG+       P+ S+
Sbjct: 418 LDGYLHWGFNYYFPDQDPMELTNPLLAPDVHDRR-VPAGDTHIVYPGD-----GGPMLSM 471

Query: 530 RLERILSGLQDIEYLNLYA 548
           RLE + +G++D E L++ A
Sbjct: 472 RLEAMRAGVEDYELLSILA 490


>gi|414879440|tpg|DAA56571.1| TPA: hypothetical protein ZEAMMB73_699847 [Zea mays]
          Length = 659

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 37/58 (63%), Gaps = 12/58 (20%)

Query: 294 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 339
           ADHPK++EY+S+PRLAAY  PY+P+LS            S + AK  +R+E+E +  K
Sbjct: 173 ADHPKANEYYSNPRLAAYVAPYAPILSCLLLYLIWLLVNSTNAAKSSLRREVEGVSKK 230


>gi|331085437|ref|ZP_08334522.1| hypothetical protein HMPREF0987_00825 [Lachnospiraceae bacterium
           9_1_43BFAA]
 gi|330407675|gb|EGG87173.1| hypothetical protein HMPREF0987_00825 [Lachnospiraceae bacterium
           9_1_43BFAA]
          Length = 552

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 58/129 (44%), Gaps = 9/129 (6%)

Query: 440 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATVPSA 497
           E+ W Y C G  +   N  +   G + R +  +++K    GFL+WG N Y  EK+  P  
Sbjct: 392 EKLWGYYCTGQYEDVSNRFIVQPGYRTRILGVQMYKYQLDGFLHWGYNFYNSEKSLYPID 451

Query: 498 EIRFRR---GLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
             R        P GD  L YPG    + R+P  S+RL  +   + D+  + L      R+
Sbjct: 452 PYRCTDASGAFPSGDPFLVYPG----ADRKPEESIRLMLMDEAMNDLRAMYLLEGLTDRN 507

Query: 555 EGLALLEKT 563
             L  +E+T
Sbjct: 508 TVLKCIERT 516


>gi|335045263|ref|ZP_08538286.1| hypothetical protein HMPREF9124_2064 [Oribacterium sp. oral taxon
           108 str. F0425]
 gi|333759049|gb|EGL36606.1| hypothetical protein HMPREF9124_2064 [Oribacterium sp. oral taxon
           108 str. F0425]
          Length = 556

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 109/264 (41%), Gaps = 27/264 (10%)

Query: 343 KKAYFYLWDEP-LNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESF-- 399
           ++   + WD P +  E+   ++    EL A+  +  +L   Y   SD P      +SF  
Sbjct: 292 RECKLFSWDSPAVGGEYTEFLKVFLPELKAFLKEENILENSYFHISDEP-NEDNMDSFGA 350

Query: 400 --VKVPKFLRPHTQIYCTSEWVLGNREDLVKDIV--TELQP--ENG-EEWWTYVCMGPSD 452
               V + L     +   S + +  R  + + +V    ++P  E G +  W Y C G + 
Sbjct: 351 AVESVRELLADCKVMDALSSFEIYRRGYVQRPVVAVNHIEPFVEAGVKNLWAYYCTGQAV 410

Query: 453 PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATV---PSAEIRFRRGLPP 507
             PN  + M  +++R +    +     GFL+WG N Y  EK+     P A        P 
Sbjct: 411 DVPNRFIVMPSARNRILGVLCYIYQVEGFLHWGFNFYNSEKSIEHIDPYAVTDAGEAFPS 470

Query: 508 GDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQ 567
           GD  + YPG+   ++ + + S+ LE  LS   DI  L +     GRD  L  +E+T    
Sbjct: 471 GDPFIVYPGKD-GTAYESMRSVVLEEALS---DIRVLKMAEQSLGRDSVLKGIEETAGM- 525

Query: 568 GPERYTIEHLPIDVMRGEIFNACR 591
              + +  H P D      F A R
Sbjct: 526 ---KLSFMHYPKD---KSFFTALR 543


>gi|325662024|ref|ZP_08150643.1| hypothetical protein HMPREF0490_01381 [Lachnospiraceae bacterium
           4_1_37FAA]
 gi|325471687|gb|EGC74906.1| hypothetical protein HMPREF0490_01381 [Lachnospiraceae bacterium
           4_1_37FAA]
          Length = 562

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 45/109 (41%), Gaps = 13/109 (11%)

Query: 439 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-------YEK 491
           GE  WTYVC GP     N  L     + R + W   K   +GFL+WG N        YE 
Sbjct: 399 GETVWTYVCCGPEGHWLNRFLDFALLKGRMLFWGCAKNRISGFLHWGLNQFPGEMNPYEG 458

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQD 540
            + P+         P GD  L YPGE       P   +RLE    G +D
Sbjct: 459 TSCPN-HTGIGTNFPCGDSFLIYPGE-----EGPRMGMRLEAQRRGAED 501


>gi|410098375|ref|ZP_11293353.1| hypothetical protein HMPREF1076_02531 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222249|gb|EKN15194.1| hypothetical protein HMPREF1076_02531 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 562

 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 49/117 (41%), Gaps = 10/117 (8%)

Query: 439 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAE 498
           G+E W Y C+ P     N  L     + R + W  +K G TG+L+WG N +     P  E
Sbjct: 412 GDEVWFYTCLAPQGDFANRFLEQPLIKTRLIHWLNYKYGATGYLHWGFNQWFSDNDPYKE 471

Query: 499 IRFR-----RGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASR 550
                      LP GD  + YP      + +   S+RLE +  G+ D   L +   +
Sbjct: 472 TTTMNTESGNTLPGGDSWIVYP-----DNGKLYGSIRLEAMRDGIADYTLLKMLEKK 523


>gi|86158894|ref|YP_465679.1| hypothetical protein Adeh_2472 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85775405|gb|ABC82242.1| hypothetical protein Adeh_2472 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 609

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 52/94 (55%), Gaps = 6/94 (6%)

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 514
           P++ +    S++RA+ W  + E  +G LYW      ++   +++  F      GDG LFY
Sbjct: 412 PSYMIDASASRNRAMEWITFLERASGELYWETAYSFRSDPWTSQWDFSGN---GDGTLFY 468

Query: 515 PGE---VFSSSRQPVASLRLERILSGLQDIEYLN 545
           PG+   +   +  PVAS+R++ I +G+QD EYL 
Sbjct: 469 PGKPSRIGGKTDIPVASVRVKMIRAGMQDYEYLK 502


>gi|424868246|ref|ZP_18292005.1| hypothetical protein C75L2_00760029 [Leptospirillum sp. Group II
           'C75']
 gi|124515950|gb|EAY57459.1| protein of unknown function [Leptospirillum rubarum]
 gi|387221464|gb|EIJ76022.1| hypothetical protein C75L2_00760029 [Leptospirillum sp. Group II
           'C75']
          Length = 681

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 101/264 (38%), Gaps = 37/264 (14%)

Query: 316 SPVLSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VR 363
           SPV S   G  D   + +  L  + HWK       K + Y+ DEP++  +Y +     + 
Sbjct: 388 SPV-SDWKGVPDIATQNLAKLIVQ-HWKEKGWPIDKTFAYIADEPVHKLYYYADTYKLIA 445

Query: 364 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 423
             A  LH  +P   V+ T      D P     +++ V   K +     +   + W  G  
Sbjct: 446 KNADSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GAS 492

Query: 424 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 483
              +   + E Q E G+  W Y   GP     N  L   G   R   W  WK    G  Y
Sbjct: 493 AQFIPSRMQERQKE-GDHVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFY 550

Query: 484 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLERILSGL 538
           W A+ +     P+      +GL  GDG + YPG       +     P+ S+R+ +   G 
Sbjct: 551 W-ADTFWNDNKPNMNPYVNQGL--GDGTIMYPGTELHFIGYPDIHGPIPSIRMAQWRRGY 607

Query: 539 QDIEYLNLYASRYGRDEGLALLEK 562
           +D  YL      +   E   +++K
Sbjct: 608 EDYRYLTYLKQMHKEKEADRIVDK 631


>gi|363898123|ref|ZP_09324658.1| hypothetical protein HMPREF9624_01220 [Oribacterium sp. ACB7]
 gi|361956490|gb|EHL09805.1| hypothetical protein HMPREF9624_01220 [Oribacterium sp. ACB7]
          Length = 557

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/236 (25%), Positives = 102/236 (43%), Gaps = 18/236 (7%)

Query: 343 KKAYFYLWDEP-LNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT--PFESF 399
           ++   + WD P +  E+   ++    EL A+  +  +L   Y   SD P       F + 
Sbjct: 293 RECKLFSWDSPAVGGEYTEFLKIFLPELKAFLKEENILGNSYFHISDEPNEDNMDSFGAA 352

Query: 400 VKVPKFLRPHTQIY-CTSEWVLGNREDLVKDIV--TELQP--ENG-EEWWTYVCMGPSDP 453
           V+  + L    ++    S + +  R  + + +V    ++P  E G +  W Y C G +  
Sbjct: 353 VESVRALLADCKVMDALSSFEIYRRGYVQRPVVAVNHIEPFVEAGVKNLWAYYCTGQAVD 412

Query: 454 HPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATV---PSAEIRFRRGLPPG 508
            PN  + M  +++R +    +     GFL+WG N Y  EK+     P A        P G
Sbjct: 413 VPNRFIVMPSARNRILGVLCYIYQVEGFLHWGFNFYNSEKSIEHIDPYAVTDAGEAFPSG 472

Query: 509 DGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG 564
           D  + YPG+   ++ + + S+ LE  LS   DI  L +     GRD  L  +E+T 
Sbjct: 473 DPFIVYPGKD-GTAYESMRSVVLEEALS---DIRVLKMAEQSLGRDRVLKGIEETA 524


>gi|410478328|ref|YP_006765965.1| hypothetical protein LFML04_0771 [Leptospirillum ferriphilum ML-04]
 gi|406773580|gb|AFS53005.1| hypothetical protein LFML04_0771 [Leptospirillum ferriphilum ML-04]
          Length = 681

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 101/264 (38%), Gaps = 37/264 (14%)

Query: 316 SPVLSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VR 363
           SPV S   G  D   + +  L  + HWK       K + Y+ DEP++  +Y +     + 
Sbjct: 388 SPV-SDWKGVPDIATQNLAKLIVQ-HWKEKGWPIDKTFAYIADEPVHKLYYYADTYKLIA 445

Query: 364 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 423
             A  LH  +P   V+ T      D P     +++ V   K +     +   + W  G  
Sbjct: 446 KNADSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GAS 492

Query: 424 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 483
              +   + E Q E G+  W Y   GP     N  L   G   R   W  WK    G  Y
Sbjct: 493 AQFIPSRMQERQKE-GDHVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFY 550

Query: 484 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGE-----VFSSSRQPVASLRLERILSGL 538
           W A+ +     P+      +GL  GDG + YPG       +     P+ S+R+ +   G 
Sbjct: 551 W-ADTFWNDNKPNMNPYVNQGL--GDGTIMYPGTELHFIGYPDIHGPIPSIRMAQWRRGY 607

Query: 539 QDIEYLNLYASRYGRDEGLALLEK 562
           +D  YL      +   E   +++K
Sbjct: 608 EDYRYLTYLKQMHKEKEADRIVDK 631


>gi|365133220|ref|ZP_09342604.1| hypothetical protein HMPREF1032_00400 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363616030|gb|EHL67484.1| hypothetical protein HMPREF1032_00400 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 526

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 7/119 (5%)

Query: 428 KDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGAN 487
           ++  T LQ   GEE W Y C  P+ P  N  + +  +  R V+W       +GFL+WG N
Sbjct: 363 REEYTALQAA-GEEMWFYTCAFPAGPAMNRSMDLPLAVSRTVLWMGALYRLSGFLHWGFN 421

Query: 488 CYEKATVPSAEIRFRRG--LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYL 544
            Y    +  +     +G  LP GD  + YPG+      +P  S+R E   +G +D E L
Sbjct: 422 YYIGDDIWHSACCPHKGALLPAGDAHIVYPGK----DGRPWRSMRFEAQRAGAEDYELL 476


>gi|227495338|ref|ZP_03925654.1| conserved hypothetical protein [Actinomyces coleocanis DSM 15436]
 gi|226831208|gb|EEH63591.1| conserved hypothetical protein [Actinomyces coleocanis DSM 15436]
          Length = 532

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/231 (22%), Positives = 88/231 (38%), Gaps = 46/231 (19%)

Query: 335 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 394
            L T+   + A+F++ DEP N +H  + R         A  A+V+           L   
Sbjct: 308 FLETEIGLEHAWFHVSDEP-NADHLEAYR---------AAKAQVVDLLAGTQVIDALSEP 357

Query: 395 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPH 454
            F+  V +P               V  N+ D  + +  E  P      W Y C+      
Sbjct: 358 EFQEVVDIPV--------------VATNKVDGFRAVGVE--PT-----WVYNCVAQDRLV 396

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEK-----ATVPSAEIRFRRGLPPGD 509
            N  +  RG++HR + ++++K    G L+W  N Y +        P  +     G   GD
Sbjct: 397 ANRFIAQRGTRHREIGFQLFKFNAKGILHWAFNFYNRQFSLGVLDPYKDTAAGGGFLSGD 456

Query: 510 GVLFYP---GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
             + YP   G+V+        SLR   +   + D+    L A ++GR+  L
Sbjct: 457 SFVVYPVADGKVYE-------SLRHHLLRQAMDDLALCQLAAEKHGREAVL 500


>gi|225571626|ref|ZP_03780622.1| hypothetical protein CLOHYLEM_07724 [Clostridium hylemonae DSM
           15053]
 gi|225159703|gb|EEG72322.1| hypothetical protein CLOHYLEM_07724 [Clostridium hylemonae DSM
           15053]
          Length = 557

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 93/236 (39%), Gaps = 21/236 (8%)

Query: 343 KKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKV 402
           K+   + W  P   E+   +     EL A   +  + T  Y   SD P  P   +++ + 
Sbjct: 291 KEEKIFGWHTPAVGEYTRFLHAFLPELTARLKEWGIDTVTYFHLSDEP-RPDDLDTYRQA 349

Query: 403 PKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGE----------EWWTYVCMGPSD 452
            + +    + Y T +  L + E     +V +  P N E          + WTY C+G   
Sbjct: 350 KESVADLLKGYHTFD-ALSSYEFYRHGLVDKPIPGNNEIDEFLEHGLTDMWTYYCVGQYL 408

Query: 453 PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPP 507
              N  + M   ++R    +++K    G L+WG N Y      +   P        G P 
Sbjct: 409 EVSNRFMSMPSLRNRIYGLQLYKYDIIGILHWGYNFYNSQFSLEHINPYETTDAGGGFPA 468

Query: 508 GDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKT 563
           GD  L YPGE      +P  S+R+     GL D+    L  S   R+  + L+E++
Sbjct: 469 GDPFLVYPGE----DGRPEESIRMMVHYEGLCDLRACRLLESLTSREYVMELIEES 520


>gi|284032103|ref|YP_003382034.1| carbohydrate binding family 6 [Kribbella flavida DSM 17836]
 gi|283811396|gb|ADB33235.1| Carbohydrate binding family 6 [Kribbella flavida DSM 17836]
          Length = 1437

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 96/233 (41%), Gaps = 38/233 (16%)

Query: 325 AKDYVRKEIELLRTKAHWKKAYFYLW-----DEPLNMEHYSSVRNMASEL-HAYAPDARV 378
           A+++ R+ +  L+T     K +F  W     DEP +  H  + R    EL   +AP  + 
Sbjct: 602 AQNFARQYLSALKTHLV-AKGWFTQWYQSVGDEPGSPAHAETWRRAVDELIKVHAPGMKT 660

Query: 379 LTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPEN 438
            T Y   PS    G T FE  + V      H  +  T E          KD     Q   
Sbjct: 661 STPYIGPPS--TWGAT-FEGRLNV------HVPLLSTHE--------SAKDYFRGRQAL- 702

Query: 439 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG-ANCYEKATVPSA 497
           G+E WTYVC  P     N  +    S  R + W  +  G TG L+W  +N  E  T+ + 
Sbjct: 703 GDEVWTYVCNRPLGAFYNRLIDQPLSAPRFMNWSNFANGVTGTLHWAYSNWKEDPTINAT 762

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASR 550
                    PGD  + YP  V   +    ++LR + +  G+++ E L + ++R
Sbjct: 763 ---------PGDTAIVYPDPV---NNDVTSTLRHDAMRDGIEEYELLKILSTR 803


>gi|374373535|ref|ZP_09631195.1| hypothetical protein NiasoDRAFT_2351 [Niabella soli DSM 19437]
 gi|373234508|gb|EHP54301.1| hypothetical protein NiasoDRAFT_2351 [Niabella soli DSM 19437]
          Length = 576

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/146 (22%), Positives = 63/146 (43%), Gaps = 14/146 (9%)

Query: 417 EWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKE 476
           ++ + ++     +I+ E Q +     W   C   ++ +PN       ++H  + W    +
Sbjct: 414 DYCIASKHQFPDNILKERQQQGKLSTWYTCC---TEKYPNGFTFSPPAEHVWIGWYTAAK 470

Query: 477 GGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILS 536
              G+L W  N + +   P  + RFR   P GD    YPG        P +S+R E+++ 
Sbjct: 471 NMNGYLRWAYNSWVEH--PETDSRFR-SWPAGDTYQVYPG--------PASSIRFEKLIE 519

Query: 537 GLQDIEYLNLYASRYGRDEGLALLEK 562
           G+QD E + +    Y +   +  L +
Sbjct: 520 GIQDFEKITVLRKAYTQKGAMGKLNQ 545


>gi|223940729|ref|ZP_03632566.1| conserved hypothetical protein [bacterium Ellin514]
 gi|223890585|gb|EEF57109.1| conserved hypothetical protein [bacterium Ellin514]
          Length = 359

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 101/251 (40%), Gaps = 50/251 (19%)

Query: 309 AAYAVPYSP-VLSSNDGAKDYVRKEI----ELLRTKAHWKKAYFYLWDEP--LNMEHYSS 361
             Y + +SP + +++D   +++++ +    + L  +   + +YF+L DEP   ++E+Y  
Sbjct: 99  GKYVMLWSPDIGATSDTYVNFLKQFLPEFHDFLAKENILEDSYFHLSDEPGASHVENYKR 158

Query: 362 VRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLG 421
            R +   LH  AP  +V+       SD   G    +    +P  +    Q Y        
Sbjct: 159 ARQV---LHELAPWMKVMDAL----SDIQYGR---QGLTDMPIPMVNSAQAY-------- 200

Query: 422 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGF 481
                    + E  P      W Y C  P  P  N  +     + R   W  ++ G  GF
Sbjct: 201 ---------IDEKIPH-----WVYYCCAPQGPWLNRFMDTPLPKVRMAGWTFYRLGAKGF 246

Query: 482 LYWGANCYEK-----ATVPSAE--IRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERI 534
           L+WG N + K      T P  +  +    G+P GD  + YPG    +  QP+ S+R E  
Sbjct: 247 LHWGFNYWHKIEQEVVTDPLTDGCVSAWPGIPYGDPFVIYPG----ADGQPMDSIRWEVF 302

Query: 535 LSGLQDIEYLN 545
              LQD   L 
Sbjct: 303 AESLQDYAILQ 313


>gi|331085877|ref|ZP_08334960.1| hypothetical protein HMPREF0987_01263 [Lachnospiraceae bacterium
           9_1_43BFAA]
 gi|330406800|gb|EGG86305.1| hypothetical protein HMPREF0987_01263 [Lachnospiraceae bacterium
           9_1_43BFAA]
          Length = 562

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 45/109 (41%), Gaps = 13/109 (11%)

Query: 439 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-------YEK 491
           GE  WTYVC GP     N  L     + R + W   K   +GFL+WG N        YE 
Sbjct: 399 GETVWTYVCCGPEGHWLNRFLDFALLKGRMLFWGCAKNRISGFLHWGLNQFPGGMNPYEG 458

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQD 540
            + P+         P GD  L YPG+       P   +RLE    G +D
Sbjct: 459 TSCPN-HTGIGTNFPCGDSFLIYPGK-----EGPRMGMRLEAQRRGAED 501


>gi|386850221|ref|YP_006268234.1| hypothetical protein ACPL_5279 [Actinoplanes sp. SE50/110]
 gi|359837725|gb|AEV86166.1| hypothetical protein ACPL_5279 [Actinoplanes sp. SE50/110]
          Length = 829

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 21/131 (16%)

Query: 439 GEEWWTYV------CMGPSD-------PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 485
           G++ W Y       C G  +       P  ++ +  + +Q+RA+ W  ++   TG LY+ 
Sbjct: 660 GKQMWLYTSCDVSGCGGADEQDAELRLPWVDYTIDTQAAQNRAMGWLAYRYRATGELYY- 718

Query: 486 ANCYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIE 542
           +   +  T  + +  F      GDG LFYPG    +  S+  P+ SLR++ I +G QD E
Sbjct: 719 STTQQLGTAWTDQWGFGGN---GDGTLFYPGTTARIGGSTPIPLESLRMKMIRNGYQDYE 775

Query: 543 YLNLYASRYGR 553
           YL L A R GR
Sbjct: 776 YLRL-ADRAGR 785


>gi|414879441|tpg|DAA56572.1| TPA: hypothetical protein ZEAMMB73_699847 [Zea mays]
          Length = 57

 Score = 50.1 bits (118), Expect = 0.003,   Method: Composition-based stats.
 Identities = 19/27 (70%), Positives = 25/27 (92%)

Query: 294 ADHPKSDEYFSDPRLAAYAVPYSPVLS 320
           ADHPK++EY+S+PRLAAY  PY+P+LS
Sbjct: 28  ADHPKANEYYSNPRLAAYVAPYAPILS 54


>gi|395204636|ref|ZP_10395576.1| hypothetical protein PA08_1303 [Propionibacterium humerusii P08]
 gi|328907298|gb|EGG27064.1| hypothetical protein PA08_1303 [Propionibacterium humerusii P08]
          Length = 590

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 58/145 (40%), Gaps = 9/145 (6%)

Query: 426 LVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 485
           +  D V + + +  E  W Y C+       N  +     + RA+ W++WK G  GFL+WG
Sbjct: 390 VATDAVLDFRRDGIEPAWVYHCVAQDVGVSNRFIAQESVRTRALGWQLWKFGVKGFLHWG 449

Query: 486 ANCYEKATV-----PSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQD 540
            N Y          P A+     G   GD  + YPG        P  SLR   +     D
Sbjct: 450 FNFYYGQLSVCPIDPFADTSAGGGFISGDAFIVYPGP--DGVVWP--SLRHRLVRDAFVD 505

Query: 541 IEYLNLYASRYGRDEGLALLEKTGT 565
           +       S  GR + L++++   T
Sbjct: 506 LAAARTAESLLGRSQVLSIIDPDST 530


>gi|422439957|ref|ZP_16516771.1| conserved hypothetical protein [Propionibacterium acnes HL037PA3]
 gi|422471082|ref|ZP_16547582.1| conserved hypothetical protein [Propionibacterium acnes HL037PA2]
 gi|422573949|ref|ZP_16649509.1| conserved hypothetical protein [Propionibacterium acnes HL044PA1]
 gi|313837143|gb|EFS74857.1| conserved hypothetical protein [Propionibacterium acnes HL037PA2]
 gi|314927836|gb|EFS91667.1| conserved hypothetical protein [Propionibacterium acnes HL044PA1]
 gi|314971914|gb|EFT16012.1| conserved hypothetical protein [Propionibacterium acnes HL037PA3]
          Length = 493

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 58/145 (40%), Gaps = 9/145 (6%)

Query: 426 LVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 485
           +  D V + + +  E  W Y C+       N  +     + RA+ W++WK G  GFL+WG
Sbjct: 293 VATDAVLDFRRDGIEPAWVYHCVAQDVGVSNRFIAQESVRTRALGWQLWKFGVKGFLHWG 352

Query: 486 ANCYEKATV-----PSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQD 540
            N Y          P A+     G   GD  + YPG        P  SLR   +     D
Sbjct: 353 FNFYYGQLSVCPIDPFADTSAGGGFISGDAFIVYPGP--DGVVWP--SLRHRLVRDAFVD 408

Query: 541 IEYLNLYASRYGRDEGLALLEKTGT 565
           +       S  GR + L++++   T
Sbjct: 409 LAAARTAESLLGRSQVLSIIDPDST 433


>gi|320536341|ref|ZP_08036383.1| PHP domain protein [Treponema phagedenis F0421]
 gi|320146822|gb|EFW38396.1| PHP domain protein [Treponema phagedenis F0421]
          Length = 869

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 56/139 (40%), Gaps = 10/139 (7%)

Query: 429 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 488
           D +     +N +  W Y C   +    N    M   ++R +   ++K    GFL+WG N 
Sbjct: 709 DSIAPFIAKNVKPLWAYYCSAQAVHVSNRFFAMPSWRNRILGMLLYKFDIDGFLHWGYNF 768

Query: 489 Y-----EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEY 543
           Y      K   P          P GD    YPG+      +P+ S+RL+     LQD  +
Sbjct: 769 YYTQYSRKLIDPFTVTDAGGAFPAGDSFSVYPGK-----DEPLPSIRLKVFYEALQDRVF 823

Query: 544 LNLYASRYGRDEGLALLEK 562
           L     ++G+   +  LEK
Sbjct: 824 LKQMEKKFGKAGVIERLEK 842


>gi|304440737|ref|ZP_07400621.1| conserved hypothetical protein [Peptoniphilus duerdenii ATCC
           BAA-1640]
 gi|304370924|gb|EFM24546.1| conserved hypothetical protein [Peptoniphilus duerdenii ATCC
           BAA-1640]
          Length = 568

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 43/187 (22%), Positives = 74/187 (39%), Gaps = 13/187 (6%)

Query: 343 KKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLG---PTPFESF 399
           KK   + WD P   E+   +     +L A+  +  VL   Y   SD P      +  E++
Sbjct: 305 KKKKIFGWDTPSVGEYTKFLEKFIPDLVAHLKEWGVLDKTYFHISDVPREEHIKSYKEAY 364

Query: 400 VKVPKFLRPHTQIYCTSEWVLGNREDL-----VKDIVTELQPENGEEWWTYVCMGPSDPH 454
           + V    +        + +    +  +        ++ +   ++ +E WTY C+G     
Sbjct: 365 MSVNDLFKDLKTFEAVAHYDFFKKGLIELPVAASSVIHDFLDDDLDELWTYYCVGQFTEV 424

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPPGD 509
            N  + M  +++R    +++K   +GFL+WG N Y      K   P        G P GD
Sbjct: 425 ANRFMSMPSARNRIFGIQMYKFHISGFLHWGYNFYNSVLSYKKIDPYKVTDADDGFPAGD 484

Query: 510 GVLFYPG 516
             L YPG
Sbjct: 485 AFLVYPG 491


>gi|346306833|ref|ZP_08848983.1| hypothetical protein HMPREF9457_00692 [Dorea formicigenerans
           4_6_53AFAA]
 gi|345907730|gb|EGX77437.1| hypothetical protein HMPREF9457_00692 [Dorea formicigenerans
           4_6_53AFAA]
          Length = 558

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 9/124 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA----TVPSAE 498
           WTY C G      N  + M  +++R    +++K    G L+WG N Y        +   E
Sbjct: 401 WTYYCTGQFYEVSNRFMSMPSARNRIYGVQLYKYKIIGVLHWGYNFYNSQYSIEHINPYE 460

Query: 499 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
           +    G  P GD  L YPGE    + QP  SLR+      + D+  L L  S   R+  +
Sbjct: 461 VTDAAGAFPSGDPFLVYPGE----NGQPEESLRMMVHDEAMTDLRALKLLESLTSREHVM 516

Query: 558 ALLE 561
            L+E
Sbjct: 517 ELIE 520


>gi|166033167|ref|ZP_02235996.1| hypothetical protein DORFOR_02889 [Dorea formicigenerans ATCC
           27755]
 gi|166027524|gb|EDR46281.1| hypothetical protein DORFOR_02889 [Dorea formicigenerans ATCC
           27755]
          Length = 558

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 9/124 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA----TVPSAE 498
           WTY C G      N  + M  +++R    +++K    G L+WG N Y        +   E
Sbjct: 401 WTYYCTGQFYEVSNRFMSMPSARNRIYGVQLYKYEIIGVLHWGYNFYNSQYSIEHINPYE 460

Query: 499 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
           +    G  P GD  L YPGE    + QP  SLR+      + D+  L L  S   R+  +
Sbjct: 461 VTDAAGAFPSGDPFLVYPGE----NGQPEESLRMMVHDEAMTDLRALKLLESLTSREHVM 516

Query: 558 ALLE 561
            L+E
Sbjct: 517 ELIE 520


>gi|386814775|ref|ZP_10101993.1| hypothetical protein Thini_0548 [Thiothrix nivea DSM 5205]
 gi|386419351|gb|EIJ33186.1| hypothetical protein Thini_0548 [Thiothrix nivea DSM 5205]
          Length = 604

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 89/388 (22%), Positives = 137/388 (35%), Gaps = 63/388 (16%)

Query: 207 AISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV-RHGSDEWYEALDQHFK 265
           A     + + ++LTVWDF LP    L  V G +   + + +G  R G       L + + 
Sbjct: 177 ATGEGQLELPVTLTVWDFSLPERSPLRTVFGTNGYRVAEVYGFERTGKSAADNRLIRAYN 236

Query: 266 -WLLQYRISP----------------FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRL 308
            +LL + +SP                 F R    +  +T    + A    +  Y     +
Sbjct: 237 DFLLDHHLSPESFWDAAPEANADGLPDFGRQFAGLGTVTDNMRYYAQEKHASAY---TYV 293

Query: 309 AAYAVPYS-PVLSSNDGAKDYVRKEIELLRTKAHWKKAYF--YLWDEPLNMEHYSSVRNM 365
            A + P++ P+      A+ ++R   +     A  ++ Y      DEP   + Y   R  
Sbjct: 294 FADSYPFADPLGEDRQQAQRFMRAYADWCGKHAGAERCYTDPSFVDEPDTRDAYQYARRW 353

Query: 366 ASELHAYA-PDARVLTTYYCGP---SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLG 421
                  + P    +      P    D  LG    +  V VPKF      +    + V G
Sbjct: 354 GEFFDEISLPKGENIHFQVSEPPLNEDPGLGSLVGKVEVWVPKFYDLWRDVDFLGKNVAG 413

Query: 422 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPH----------------PNWHLGMRGSQ 465
            R               GEE W Y  +    P                 P W L      
Sbjct: 414 QRL------------AAGEEVWAYTSLVLDFPEYSKLNPKADVLKGSYPPVWQLDFPAIN 461

Query: 466 HRAVMWRVWKEGGTGFLYWGANC-YEKATVPSAEIRFRRGLPP-----GDGVLFYPG-EV 518
           +R   W   + G TG  YW     +E A V +    F    PP     GDG+L YPG + 
Sbjct: 462 YRIPTWLFHRYGVTGLGYWDTLAWFEGADVWNDAASFVSQNPPGIRFNGDGLLVYPGFKA 521

Query: 519 FSSSRQPVASLRLERILSGLQDIEYLNL 546
            +    P+ASLRL+ I   ++D  Y++L
Sbjct: 522 QTGFDGPLASLRLKWIRESVEDYIYIDL 549



 Score = 39.3 bits (90), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 57/139 (41%), Gaps = 27/139 (19%)

Query: 17  EPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCS-DLCSASGDRLVVGQSLMLRR 75
           + + L AARNE E  Q  +       ++   G V VQ S  +  +   +++ G  +   R
Sbjct: 52  DKVELFAARNEYEGFQFVV-------TAGERGAVDVQASISVLRSVEGQVIDGLKVFRER 104

Query: 76  VVPMLGV-----------PDALVPLDLPVCQI--------SLIPGETTAVWVSIDAPYAQ 116
            V +              PD L+P D    +         +L  GE   VWV I  P   
Sbjct: 105 YVKVSTPSPHSPYAPQYWPDILLPADNAGAEAAAYRAFPQNLTAGENLPVWVDIHIPADA 164

Query: 117 PPGLYEGEIIITSKADTEL 135
            PG+Y G+I +T+  + +L
Sbjct: 165 RPGVYTGKISVTATGEGQL 183


>gi|256420134|ref|YP_003120787.1| hypothetical protein Cpin_1088 [Chitinophaga pinensis DSM 2588]
 gi|256035042|gb|ACU58586.1| hypothetical protein Cpin_1088 [Chitinophaga pinensis DSM 2588]
          Length = 560

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 19/117 (16%)

Query: 435 QPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-----Y 489
           + +N  E W YVC+ P   +PN  L     + R + W  +K   TGF++WG N      +
Sbjct: 415 RAQNKGELWYYVCVSPQYNYPNRFLENPLIKTRFLHWTNYKYDLTGFMHWGYNIWTGYPF 474

Query: 490 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNL 546
           + +T  S           GD  + YP +      + ++S+RLE +  G+ D E L L
Sbjct: 475 DFSTSNSV---------GGDAWIVYPKD-----GKIISSVRLEAMRDGIVDYELLRL 517


>gi|325663334|ref|ZP_08151784.1| hypothetical protein HMPREF0490_02525 [Lachnospiraceae bacterium
           4_1_37FAA]
 gi|325470788|gb|EGC74018.1| hypothetical protein HMPREF0490_02525 [Lachnospiraceae bacterium
           4_1_37FAA]
          Length = 555

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 52/125 (41%), Gaps = 9/125 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSA 497
           WTY C G      N  + M  +++R    +++K    G L+WG N Y      +   P  
Sbjct: 399 WTYYCTGQFYEVSNRFMSMPSARNRIYGIQLYKYDIIGILHWGYNFYNSQHSYEHINPYQ 458

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                 G P GD  L YPG    +   P  S+R+      L D+  L L  S   ++  +
Sbjct: 459 VTDAANGFPAGDPFLVYPG----ADGHPEESIRMMVHYEALTDLRALELLESLTSKEYVM 514

Query: 558 ALLEK 562
            L+E+
Sbjct: 515 ELIEE 519


>gi|266620802|ref|ZP_06113737.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
 gi|288867588|gb|EFC99886.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
          Length = 552

 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/125 (24%), Positives = 55/125 (44%), Gaps = 9/125 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR 502
           W+Y C        N  + M  ++ R    ++++    G L+WG N Y      +    +R
Sbjct: 400 WSYYCTAQDFLVSNRFMAMPSARCRIYGAQIFRYAMEGILHWGYNFYNSQYSLTKLDPYR 459

Query: 503 R-----GLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                   P GD  L YPG    +  +P  S+R+  +   +QD+  +   A + GR+  +
Sbjct: 460 STDADGAFPSGDSFLVYPG----ADGKPEESVRMMVMCHTMQDVRAMEQLAEKKGREYVI 515

Query: 558 ALLEK 562
           ++LE+
Sbjct: 516 SMLEE 520


>gi|312131501|ref|YP_003998841.1| hypothetical protein Lbys_2826 [Leadbetterella byssophila DSM
           17132]
 gi|311908047|gb|ADQ18488.1| hypothetical protein Lbys_2826 [Leadbetterella byssophila DSM
           17132]
          Length = 566

 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 56/128 (43%), Gaps = 13/128 (10%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y+C   SD  PN     + ++   + W     G  G L W  N + +   P  + RFR  
Sbjct: 433 YICC--SDAFPNQFTFSQPAESTYLAWYAEATGFNGMLRWAFNSWVEH--PLQDSRFR-T 487

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG 564
            P GD  + YP        Q  +S+R ER++ G+QD E + +      RD+ + L +   
Sbjct: 488 WPAGDTYIIYP--------QGRSSVRYERLVEGIQDYEKIQILKKTLERDKLVELQKAIV 539

Query: 565 TYQGPERY 572
            ++   R+
Sbjct: 540 PFKAANRF 547


>gi|365132098|ref|ZP_09342072.1| hypothetical protein HMPREF1032_03868 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363617409|gb|EHL68801.1| hypothetical protein HMPREF1032_03868 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 547

 Score = 47.4 bits (111), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 56/227 (24%), Positives = 79/227 (34%), Gaps = 43/227 (18%)

Query: 328 YVRKEIELLRTKAHWKKAYFYLWDEP-LNMEHYSSVRN-------MASELHAYAPDARVL 379
           +V+   E LR      K  F++ DEP ++ ++ +S+          A  L  Y P+ RV+
Sbjct: 297 FVQALAEFLRKYGWQDKVVFHIHDEPDIHFKNEASLLARKRQYYLAAGILRKYLPNVRVI 356

Query: 380 TTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG 439
                               V  P+F R    I     WV G      +    +     G
Sbjct: 357 EA------------------VASPEF-RGGVDI-----WVPGTPGYEARQADFDALTALG 392

Query: 440 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSA-- 497
           E  W YVC GP     N  L     + R + W        GFL+WG N +     P A  
Sbjct: 393 ESVWAYVCCGPEGNWLNRFLDFALLKGRLLFWGCAANRLGGFLHWGFNQFPAGMDPFAGT 452

Query: 498 ----EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQD 540
                       P GD  L YPG     +  P   +R+E    G +D
Sbjct: 453 SCPNHTGIGTNFPCGDSFLVYPG-----TDGPWPGMRMEAARRGAED 494


>gi|338533818|ref|YP_004667152.1| hypothetical protein LILAB_20870 [Myxococcus fulvus HW-1]
 gi|337259914|gb|AEI66074.1| hypothetical protein LILAB_20870 [Myxococcus fulvus HW-1]
          Length = 562

 Score = 47.0 bits (110), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 24/44 (54%), Positives = 28/44 (63%), Gaps = 3/44 (6%)

Query: 508 GDGVLFYPG---EVFSSSRQPVASLRLERILSGLQDIEYLNLYA 548
           GDG LFYPG    V  S  QP+ +LRL+ I  GL+D EYL L A
Sbjct: 457 GDGTLFYPGTPARVGPSGHQPILTLRLKHIRDGLEDYEYLRLLA 500


>gi|206601987|gb|EDZ38469.1| Protein of unknown function [Leptospirillum sp. Group II '5-way
           CG']
          Length = 681

 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 94/244 (38%), Gaps = 36/244 (14%)

Query: 319 LSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VRNMA 366
           +S   G  D   + +  L  + HWK       + + Y+ DEP++  +Y +     +   A
Sbjct: 390 ISDWKGVPDIATQNLAKLIVR-HWKEKGWPIDQTFAYIADEPVHKLYYYADTYKLIAKDA 448

Query: 367 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 426
             LH  +P   V+ T      D P     +++ V   K +     +   + W  G     
Sbjct: 449 DSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GASAQF 495

Query: 427 VKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGA 486
           +   +   Q E G++ W Y   GP     N  L   G   R   W  WK    G  YW A
Sbjct: 496 IPSRMQARQKE-GDQVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFYW-A 552

Query: 487 NCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLERILSGLQDI 541
           + +   T  +      +GL  GDG + YPG       F   + P+ S+R+ +   G +D 
Sbjct: 553 DTFWNDTKENMNPYVNQGL--GDGTILYPGTELHFIGFPDIQGPIPSIRMAQWRRGYEDY 610

Query: 542 EYLN 545
            YL 
Sbjct: 611 RYLT 614


>gi|423293705|ref|ZP_17271832.1| hypothetical protein HMPREF1070_00497 [Bacteroides ovatus
           CL03T12C18]
 gi|392677663|gb|EIY71079.1| hypothetical protein HMPREF1070_00497 [Bacteroides ovatus
           CL03T12C18]
          Length = 564

 Score = 46.2 bits (108), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 59/134 (44%), Gaps = 14/134 (10%)

Query: 429 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 488
           D V E +  +G+    Y C    + +PN        +   +MW     G  G+L W  N 
Sbjct: 413 DSVREERRRSGKLSTVYTCC--IEAYPNTFTFSEPGEASWLMWHAVAGGYDGYLRWAYNS 470

Query: 489 YEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYA 548
           + K   P  + RFR     GD  L YPG          +S+R+ER++ G+QD E + +  
Sbjct: 471 WTKD--PLHDSRFR-SWAAGDCYLVYPGS---------SSIRMERLVEGIQDAEKIRILR 518

Query: 549 SRYGRDEGLALLEK 562
             +  +  +A L+K
Sbjct: 519 KEFAENGEVAKLKK 532


>gi|434406083|ref|YP_007148968.1| hypothetical protein Cylst_4187 [Cylindrospermum stagnale PCC 7417]
 gi|428260338|gb|AFZ26288.1| hypothetical protein Cylst_4187 [Cylindrospermum stagnale PCC 7417]
          Length = 535

 Score = 46.2 bits (108), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 57/130 (43%), Gaps = 3/130 (2%)

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG-TGFLYWGANCYEKATVP 495
           +N  + W Y  + P D  P W +       R     + +  G TG LYW  + +      
Sbjct: 377 KNKGKVWFYTALVPDDYSPKWQVDFEPINFRIPHGFINQSLGLTGALYWRVDFWTDDPWN 436

Query: 496 SAEIRFRRGLP-PGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
           + E  F+ G   PG+G+L YPG+        V S+RL+ +  G++D EY+ +       D
Sbjct: 437 NVETLFQDGNHYPGEGMLVYPGQQVGI-EGVVPSMRLKWLRDGVEDYEYIQILKELGRGD 495

Query: 555 EGLALLEKTG 564
             L +  K G
Sbjct: 496 WALEVSRKVG 505


>gi|254976138|ref|ZP_05272610.1| hypothetical protein CdifQC_12524 [Clostridium difficile QCD-66c26]
 gi|255093528|ref|ZP_05323006.1| hypothetical protein CdifC_12829 [Clostridium difficile CIP 107932]
 gi|255315270|ref|ZP_05356853.1| hypothetical protein CdifQCD-7_12997 [Clostridium difficile
           QCD-76w55]
 gi|255517939|ref|ZP_05385615.1| hypothetical protein CdifQCD-_12566 [Clostridium difficile
           QCD-97b34]
 gi|260684121|ref|YP_003215406.1| hypothetical protein CD196_2387 [Clostridium difficile CD196]
 gi|260687780|ref|YP_003218914.1| hypothetical protein CDR20291_2434 [Clostridium difficile R20291]
 gi|384361763|ref|YP_006199615.1| hypothetical protein CDBI1_12375 [Clostridium difficile BI1]
 gi|260210284|emb|CBA64575.1| conserved hypothetical protein [Clostridium difficile CD196]
 gi|260213797|emb|CBE05755.1| conserved hypothetical protein [Clostridium difficile R20291]
          Length = 582

 Score = 46.2 bits (108), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 7/110 (6%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           +VC  P    PN  L     ++R + W  +  G  GFL W  N + +     +  +F   
Sbjct: 430 FVCCFPE--KPNSFLSSPFVENRIIGWYTYYFGLDGFLRWDYNLWTEDPWKDSSYKFPI- 486

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
              GD    YPG+      +PV S+R+E +  G+QD E   +     GR+
Sbjct: 487 WKAGDMFFVYPGK----DLKPVRSVRMENLRFGIQDFELFTMLEKEKGRE 532


>gi|333030500|ref|ZP_08458561.1| hypothetical protein Bcop_1384 [Bacteroides coprosuis DSM 18011]
 gi|332741097|gb|EGJ71579.1| hypothetical protein Bcop_1384 [Bacteroides coprosuis DSM 18011]
          Length = 599

 Score = 46.2 bits (108), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 16/121 (13%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           YVC   SD  PN       ++     W     G  GFL W  N + +   P  + RFR  
Sbjct: 456 YVCC--SDAFPNTFTFSEPAEAVYAGWYALASGLDGFLRWAYNSWVEN--PLTDSRFRT- 510

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD---EGLALLE 561
            P GD  + YP           +S+R ER++ G+QD+E + +  ++   +   +  ALLE
Sbjct: 511 WPAGDTYIVYP--------DARSSIRFERLIEGIQDVEKIKILQTKLKSNPSKKNNALLE 562

Query: 562 K 562
           +
Sbjct: 563 E 563


>gi|255307579|ref|ZP_05351750.1| hypothetical protein CdifA_13382 [Clostridium difficile ATCC 43255]
          Length = 582

 Score = 46.2 bits (108), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 7/110 (6%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           +VC  P    PN  L     ++R + W  +  G  GFL W  N + +     +  +F   
Sbjct: 430 FVCCFPE--KPNSFLSSPFVENRIIGWYTYYFGLDGFLRWDYNLWTEDPWKDSSYKFPI- 486

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
              GD    YPG+      +PV S+R+E +  G+QD E   +     GR+
Sbjct: 487 WKAGDMFFVYPGK----DLKPVRSVRMENLRFGIQDFELFTMLEKEKGRE 532


>gi|423084302|ref|ZP_17072807.1| hypothetical protein HMPREF1122_03805 [Clostridium difficile
           002-P50-2011]
 gi|423087997|ref|ZP_17076382.1| hypothetical protein HMPREF1123_03536 [Clostridium difficile
           050-P50-2011]
 gi|357542607|gb|EHJ24649.1| hypothetical protein HMPREF1122_03805 [Clostridium difficile
           002-P50-2011]
 gi|357543525|gb|EHJ25542.1| hypothetical protein HMPREF1123_03536 [Clostridium difficile
           050-P50-2011]
          Length = 582

 Score = 46.2 bits (108), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 7/110 (6%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           +VC  P    PN  L     ++R + W  +  G  GFL W  N + +     +  +F   
Sbjct: 430 FVCCFPE--KPNSFLSSPFVENRIIGWYTYYFGLDGFLRWDYNLWTEDPWKDSSYKFPI- 486

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
              GD    YPG+      +PV S+R+E +  G+QD E   +     GR+
Sbjct: 487 WKAGDMFFVYPGK----DLKPVRSVRMENLRFGIQDFELFTMLEKEKGRE 532


>gi|255101710|ref|ZP_05330687.1| hypothetical protein CdifQCD-6_12934 [Clostridium difficile
           QCD-63q42]
          Length = 582

 Score = 46.2 bits (108), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 7/110 (6%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           +VC  P    PN  L     ++R + W  +  G  GFL W  N + +     +  +F   
Sbjct: 430 FVCCFPE--KPNSFLSSPFVENRIIGWYTYYFGLDGFLRWDYNLWTEDPWKDSSYKFPI- 486

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
              GD    YPG+      +PV S+R+E +  G+QD E   +     GR+
Sbjct: 487 WKAGDMFFVYPGK----DLKPVRSVRMENLRFGIQDFELFTMLEKEKGRE 532


>gi|126700164|ref|YP_001089061.1| hypothetical protein CD630_25470 [Clostridium difficile 630]
 gi|115251601|emb|CAJ69434.1| conserved hypothetical protein [Clostridium difficile 630]
          Length = 582

 Score = 46.2 bits (108), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 7/110 (6%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           +VC  P    PN  L     ++R + W  +  G  GFL W  N + +     +  +F   
Sbjct: 430 FVCCFPE--KPNSFLSSPFVENRIIGWYTYYFGLDGFLRWDYNLWTEDPWKDSSYKFPI- 486

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
              GD    YPG+      +PV S+R+E +  G+QD E   +     GR+
Sbjct: 487 WKAGDMFFVYPGK----DLKPVRSVRMENLRFGIQDFELFTMLEKEKGRE 532


>gi|423092577|ref|ZP_17080381.1| hypothetical protein HMPREF9945_03572 [Clostridium difficile
           70-100-2010]
 gi|357553447|gb|EHJ35194.1| hypothetical protein HMPREF9945_03572 [Clostridium difficile
           70-100-2010]
          Length = 582

 Score = 46.2 bits (108), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 7/110 (6%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           +VC  P    PN  L     ++R + W  +  G  GFL W  N + +     +  +F   
Sbjct: 430 FVCCFPE--KPNSFLSSPFVENRIIGWYTYYFGLDGFLRWDYNLWTEDPWKDSSYKFPI- 486

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
              GD    YPG+      +PV S+R+E +  G+QD E   +     GR+
Sbjct: 487 WKAGDMFFVYPGK----DLKPVRSVRMENLRFGIQDFELFTMLEKEKGRE 532


>gi|383785486|ref|YP_005470056.1| hypothetical protein LFE_2254 [Leptospirillum ferrooxidans C2-3]
 gi|383084399|dbj|BAM07926.1| hypothetical protein LFE_2254 [Leptospirillum ferrooxidans C2-3]
          Length = 673

 Score = 45.4 bits (106), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 57/143 (39%), Gaps = 19/143 (13%)

Query: 422 NREDLVKDIVTELQPENGEEWWTYVC----MGPSDPHPNWHLGMRGSQHRAVMWRVWKEG 477
           N E  + D + ++Q   GE  W Y      +G SD      L   G   R   W  WK  
Sbjct: 479 NSETFMPDRMAKVQ-TIGERTWFYQSGPPFIGASD------LYSTGVGFRMWFWAAWKYR 531

Query: 478 GTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLE 532
             G  YW  + +   T+    +        GDG++FYPG       F     PV S+R+ 
Sbjct: 532 TNGVFYWADDFWPDNTM---AVNPYTNPGTGDGIIFYPGHQLHLIGFPDIDGPVPSIRMA 588

Query: 533 RILSGLQDIEYLNLYASRYGRDE 555
           +   G  D +Y  + A +  R+E
Sbjct: 589 QWRRGYDDYKYFFMLAKKGRREE 611


>gi|444917281|ref|ZP_21237383.1| hypothetical protein D187_09997 [Cystobacter fuscus DSM 2262]
 gi|444711176|gb|ELW52126.1| hypothetical protein D187_09997 [Cystobacter fuscus DSM 2262]
          Length = 568

 Score = 45.4 bits (106), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 90/227 (39%), Gaps = 30/227 (13%)

Query: 337 RTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPF 396
           R +    +++ Y+ DEP     + +VR  AS     APD R L T      +        
Sbjct: 303 RARGWLSRSFDYVGDEPPYGTTWEAVRARASLTRQAAPDLRTLLTTNITELERHELLELI 362

Query: 397 ESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMG----PSD 452
           +  V +         I  T     G++     D +T     + E W    C      P +
Sbjct: 363 DILVVL------VNHIDGTEPPYEGDQRPKYDDFLTR---PHRELWLYQSCASHGCSPDE 413

Query: 453 PHP----------NWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR 502
           P P          ++ +    ++ R + W  + EG +G LY+       +T  + + RF 
Sbjct: 414 PPPPENQPGQGWPSYMVDRPATKARGMQWVDFIEGASGELYY-QTVGLLSTAWTTQFRFN 472

Query: 503 RGLPPGDGVLFYPGE---VFSSSRQPVASLRLERILSGLQDIEYLNL 546
                GDG LFYPG    +  S+   V S+R++ I  G+QD E+L +
Sbjct: 473 GN---GDGTLFYPGLPSIIGGSTPVLVPSIRIKLIRLGVQDYEWLKM 516


>gi|50843406|ref|YP_056633.1| hypothetical protein PPA1958 [Propionibacterium acnes KPA171202]
 gi|289425612|ref|ZP_06427384.1| conserved hypothetical protein [Propionibacterium acnes SK187]
 gi|335053127|ref|ZP_08545978.1| hypothetical protein HMPREF9948_2283 [Propionibacterium sp.
           434-HC2]
 gi|387504316|ref|YP_005945545.1| hypothetical protein TIB1ST10_09970 [Propionibacterium acnes 6609]
 gi|419419836|ref|ZP_13960069.1| hypothetical protein TICEST70_01370 [Propionibacterium acnes
           PRP-38]
 gi|50841008|gb|AAT83675.1| hypothetical protein PPA1958 [Propionibacterium acnes KPA171202]
 gi|289153913|gb|EFD02606.1| conserved hypothetical protein [Propionibacterium acnes SK187]
 gi|333767978|gb|EGL45192.1| hypothetical protein HMPREF9948_2283 [Propionibacterium sp.
           434-HC2]
 gi|335278361|gb|AEH30266.1| hypothetical protein TIB1ST10_09970 [Propionibacterium acnes 6609]
 gi|379979557|gb|EIA12877.1| hypothetical protein TICEST70_01370 [Propionibacterium acnes
           PRP-38]
          Length = 550

 Score = 45.4 bits (106), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 296 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 354

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 355 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 384

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 385 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 444

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 445 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAV 500

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 501 GRPTIVDLIEHLAPGSTADLDHRYPLE 527


>gi|336428440|ref|ZP_08608421.1| hypothetical protein HMPREF0994_04427 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336005693|gb|EGN35737.1| hypothetical protein HMPREF0994_04427 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 522

 Score = 45.4 bits (106), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 52/120 (43%), Gaps = 8/120 (6%)

Query: 428 KDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGAN 487
           KD    LQ + GEE W Y C  P+    N  + +  +  R ++W       TGFL+WG N
Sbjct: 363 KDTFRLLQ-DAGEEIWFYTCAFPAGNIMNRSMDLPLTVSRLLLWMGASCRLTGFLHWGFN 421

Query: 488 CYEKATVPSAEIRFRRG--LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLN 545
            Y    + +      +G  LP GD  + YPG        P  S+R E    G +D E  +
Sbjct: 422 YYIGDDIWNRACCPHKGALLPAGDAHIVYPG-----PDGPWISMRYEAQRGGAEDYELFS 476


>gi|386070149|ref|YP_005985045.1| hypothetical protein TIIST44_02580 [Propionibacterium acnes ATCC
           11828]
 gi|353454516|gb|AER05035.1| hypothetical protein TIIST44_02580 [Propionibacterium acnes ATCC
           11828]
          Length = 550

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 54/227 (23%), Positives = 82/227 (36%), Gaps = 41/227 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P+A V+        DA   P  F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 369

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 408

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
                     V SLRL+ +    +D+  L       GR   + L+E 
Sbjct: 469 ----RIDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTVVELIEH 511


>gi|365963599|ref|YP_004945165.1| hypothetical protein TIA2EST36_09570 [Propionibacterium acnes
           TypeIA2 P.acn31]
 gi|365974778|ref|YP_004956337.1| hypothetical protein TIA2EST2_09530 [Propionibacterium acnes
           TypeIA2 P.acn33]
 gi|365740280|gb|AEW84482.1| hypothetical protein TIA2EST36_09570 [Propionibacterium acnes
           TypeIA2 P.acn31]
 gi|365744777|gb|AEW79974.1| hypothetical protein TIA2EST2_09530 [Propionibacterium acnes
           TypeIA2 P.acn33]
          Length = 499

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 334 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAV 449

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 450 GRPTIVDLIEHLAPGSTADLDHRYPLE 476


>gi|186681830|ref|YP_001865026.1| hypothetical protein Npun_F1375 [Nostoc punctiforme PCC 73102]
 gi|186464282|gb|ACC80083.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 543

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 71/169 (42%), Gaps = 40/169 (23%)

Query: 1   MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 60
           +PS   +G  E         + AA+ E ESVQ+ ++     + SS    V +  SDL   
Sbjct: 45  VPSLKRIGQTEKITNTSLSKIYAAKGEYESVQLVIK-----APSSGLTNVNISVSDLL-G 98

Query: 61  SGDRLVVGQSLMLRR----------------VVPMLGV---PDALVPLDLPVCQISLIPG 101
           S ++++   ++ L R                + P LGV   PD L+P   PV Q   + G
Sbjct: 99  SNNQIIPKNNITLYREHYVYVSHSSPNMRDNLNPPLGVGWYPDGLIPFLDPVTQKPPLTG 158

Query: 102 ETTAV------------WVSIDAPYAQPPGLYEGEIIITS---KADTEL 135
           E  AV            WV +  P     G Y G+ I+TS   KA++++
Sbjct: 159 ELKAVPFRLQSQYNQPIWVDVFVPRNAKSGEYTGKFIVTSDQGKAESKI 207



 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 27/113 (23%), Positives = 54/113 (47%), Gaps = 4/113 (3%)

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG-TGFLYWGANCYEKATVP 495
           + G++ W+Y  +   +  P W +  +   +R     + +  G TG LYW  + + K    
Sbjct: 379 QKGDKVWSYNTLVQDEYSPKWQIDFQPINYRIQPGFISQSLGLTGILYWQVDLWTKDPWH 438

Query: 496 SAEIRFRRGLP--PGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNL 546
             +   +      PG+G+L YPG+     +  + S+RL+ +  G++D EY+ +
Sbjct: 439 DIQTYSKNNGDNFPGEGMLVYPGKQV-GLQGVIPSMRLKWLRDGVEDYEYMEI 490


>gi|365965842|ref|YP_004947407.1| hypothetical protein TIA2EST22_09585 [Propionibacterium acnes
           TypeIA2 P.acn17]
 gi|365742523|gb|AEW82217.1| hypothetical protein TIA2EST22_09585 [Propionibacterium acnes
           TypeIA2 P.acn17]
          Length = 499

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 334 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHKDVRLLRRVEDAV 449

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 450 GRPTIVDLIEHLAPGSTADLDHRYPLE 476


>gi|342211509|ref|ZP_08704234.1| hypothetical protein HMPREF9949_0083 [Propionibacterium sp.
           CC003-HC2]
 gi|340767053|gb|EGR89578.1| hypothetical protein HMPREF9949_0083 [Propionibacterium sp.
           CC003-HC2]
          Length = 499

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 334 AGLDGMWVYTSCASTFWDPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAV 449

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 450 GRPTIVDLIEHLAPGSTADLDHRYPLE 476


>gi|197121758|ref|YP_002133709.1| hypothetical protein AnaeK_1349 [Anaeromyxobacter sp. K]
 gi|196171607|gb|ACG72580.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
          Length = 552

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 6/98 (6%)

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 514
           P + +   G+  RAV W  W+ G +G LY+      +   P  ++R   G   GDG L Y
Sbjct: 412 PGYEVDRPGAAARAVAWLGWRRGLSGELYYDMLQAWRGD-PWTDVRAFAGN--GDGTLLY 468

Query: 515 PGE--VFSSSRQ-PVASLRLERILSGLQDIEYLNLYAS 549
           PG   +   +R  PV S+RL+ +   L+D+E L L ++
Sbjct: 469 PGRPAMLGGTRPFPVESIRLKIVRDALEDVELLRLASA 506


>gi|404405841|ref|ZP_10997425.1| hypothetical protein AJC13_10449 [Alistipes sp. JC136]
          Length = 578

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 47/108 (43%), Gaps = 13/108 (12%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           YVC   SD  PN       ++     W     G  GFL W  N +  A  P  + RFR  
Sbjct: 440 YVCC--SDRFPNMFTFSDPAEAVYAGWYTAACGYDGFLRWAYNSW--AGEPLTDSRFRT- 494

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYG 552
            P GD  + YPG          +S+R ER++ G+QD E +++    + 
Sbjct: 495 FPAGDTYIIYPGAR--------SSVRFERLVEGIQDAEKISILKKEFA 534


>gi|422391266|ref|ZP_16471359.1| hypothetical protein HMPREF9341_02296 [Propionibacterium acnes
           HL103PA1]
 gi|422464078|ref|ZP_16540689.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1]
 gi|422564324|ref|ZP_16639979.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2]
 gi|314967153|gb|EFT11252.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2]
 gi|315093876|gb|EFT65852.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1]
 gi|327325812|gb|EGE67604.1| hypothetical protein HMPREF9341_02296 [Propionibacterium acnes
           HL103PA1]
          Length = 530

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 54/227 (23%), Positives = 82/227 (36%), Gaps = 41/227 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P+A V+        DA   P  F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 349

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 388

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSHYLVDPNADTSADLAFPSGDSSVIYP 448

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
                     V SLRL+ +    +D+  L       GR   + L+E 
Sbjct: 449 ----RIDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTVVELIEH 491


>gi|427442082|ref|ZP_18925530.1| conserved hypothetical protein [Pediococcus lolii NGRI 0510Q]
 gi|425786839|dbj|GAC46318.1| conserved hypothetical protein [Pediococcus lolii NGRI 0510Q]
          Length = 384

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 61/252 (24%), Positives = 99/252 (39%), Gaps = 44/252 (17%)

Query: 327 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGP 386
           +YV+   + L+    W+KA   + DEP         ++  S L    PD +V   +   P
Sbjct: 137 NYVKALCDHLKDLQVWEKARL-IADEP-KQAQLKEFKDALSALKQMVPDLKVKVAFDKEP 194

Query: 387 ---SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWW 443
                APL  T   SF             YCTS++             ++LQ  +  E  
Sbjct: 195 ILNELAPLVDTLATSF-------------YCTSQFG------------SQLQASHPGEVQ 229

Query: 444 TYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR- 502
            Y+C  P  P+   H  +  ++ +  +       G   L W  NC+   +    +IR+  
Sbjct: 230 YYICNYPDHPNTFLHSPLLETRLQGTLTAFLPVNG--LLRWAFNCW--PSNAREDIRYNT 285

Query: 503 RGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEG--LALL 560
             LP GD  L YPGE    +   + SLR +++   ++D    +L  S    DE    A+L
Sbjct: 286 SSLPIGDNCLVYPGE----NGHLLLSLRYKQLERAVED---FSLIKSAKATDEKRTTAIL 338

Query: 561 EKTGTYQGPERY 572
           E       P+++
Sbjct: 339 EHFLEETAPQKW 350


>gi|422395623|ref|ZP_16475656.1| hypothetical protein HMPREF9344_01398 [Propionibacterium acnes
           HL097PA1]
 gi|422454993|ref|ZP_16531671.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1]
 gi|315107964|gb|EFT79940.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1]
 gi|327333100|gb|EGE74827.1| hypothetical protein HMPREF9344_01398 [Propionibacterium acnes
           HL097PA1]
          Length = 530

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 276 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 334

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 335 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 364

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 365 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 424

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 425 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAV 480

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 481 GRPTIVDLIEHLAPGSTADLDHRYPLE 507


>gi|422426470|ref|ZP_16503391.1| hypothetical protein HMPREF9579_00231 [Propionibacterium acnes
           HL087PA1]
 gi|422453869|ref|ZP_16530551.1| hypothetical protein HMPREF9581_01536 [Propionibacterium acnes
           HL087PA3]
 gi|327451751|gb|EGE98405.1| hypothetical protein HMPREF9581_01536 [Propionibacterium acnes
           HL087PA3]
 gi|328756982|gb|EGF70598.1| hypothetical protein HMPREF9579_00231 [Propionibacterium acnes
           HL087PA1]
          Length = 522

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAV 472

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 473 GRPTIVDLIEHLAPGSTADLDHRYPLE 499


>gi|422433658|ref|ZP_16510524.1| conserved hypothetical protein [Propionibacterium acnes HL059PA2]
 gi|422436298|ref|ZP_16513148.1| hypothetical protein HMPREF9586_02402 [Propionibacterium acnes
           HL083PA2]
 gi|422441996|ref|ZP_16518802.1| conserved hypothetical protein [Propionibacterium acnes HL002PA1]
 gi|422445323|ref|ZP_16522072.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1]
 gi|422511714|ref|ZP_16587855.1| conserved hypothetical protein [Propionibacterium acnes HL059PA1]
 gi|422540487|ref|ZP_16616353.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1]
 gi|422540934|ref|ZP_16616795.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1]
 gi|422546693|ref|ZP_16622518.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3]
 gi|422548802|ref|ZP_16624611.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1]
 gi|422558679|ref|ZP_16634417.1| hypothetical protein HMPREF9588_02503 [Propionibacterium acnes
           HL025PA2]
 gi|422561617|ref|ZP_16637301.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1]
 gi|422571378|ref|ZP_16646963.1| conserved hypothetical protein [Propionibacterium acnes HL067PA1]
 gi|422577404|ref|ZP_16652937.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4]
 gi|313763344|gb|EFS34708.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1]
 gi|313815003|gb|EFS52717.1| conserved hypothetical protein [Propionibacterium acnes HL059PA1]
 gi|314916711|gb|EFS80542.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4]
 gi|314919163|gb|EFS82994.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1]
 gi|314921243|gb|EFS85074.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3]
 gi|314930329|gb|EFS94160.1| conserved hypothetical protein [Propionibacterium acnes HL067PA1]
 gi|314956112|gb|EFT00508.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1]
 gi|314959730|gb|EFT03832.1| conserved hypothetical protein [Propionibacterium acnes HL002PA1]
 gi|314969811|gb|EFT13909.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1]
 gi|315098131|gb|EFT70107.1| conserved hypothetical protein [Propionibacterium acnes HL059PA2]
 gi|315102739|gb|EFT74715.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1]
 gi|327452257|gb|EGE98911.1| hypothetical protein HMPREF9586_02402 [Propionibacterium acnes
           HL083PA2]
 gi|328752296|gb|EGF65912.1| hypothetical protein HMPREF9588_02503 [Propionibacterium acnes
           HL025PA2]
          Length = 522

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAV 472

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 473 GRPTIVDLIEHLAPGSTADLDHRYPLE 499


>gi|282855297|ref|ZP_06264629.1| conserved hypothetical protein [Propionibacterium acnes J139]
 gi|282581885|gb|EFB87270.1| conserved hypothetical protein [Propionibacterium acnes J139]
          Length = 499

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 54/227 (23%), Positives = 82/227 (36%), Gaps = 41/227 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P+A V+        DA   P  F + V
Sbjct: 269 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 318

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 319 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 357

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 358 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 417

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
                     V SLRL+ +    +D+  L       GR   + L+E 
Sbjct: 418 ----RIDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTVVELIEH 460


>gi|422458853|ref|ZP_16535502.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2]
 gi|422466399|ref|ZP_16542973.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4]
 gi|422468180|ref|ZP_16544715.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3]
 gi|422575224|ref|ZP_16650768.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1]
 gi|314924019|gb|EFS87850.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1]
 gi|314983039|gb|EFT27131.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3]
 gi|315091619|gb|EFT63595.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4]
 gi|315104095|gb|EFT76071.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2]
          Length = 530

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 54/227 (23%), Positives = 82/227 (36%), Gaps = 41/227 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P+A V+        DA   P  F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 349

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 388

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 448

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
                     V SLRL+ +    +D+  L       GR   + L+E 
Sbjct: 449 ----RIDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTVVELIEH 491


>gi|422451477|ref|ZP_16528179.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2]
 gi|422499630|ref|ZP_16575891.1| conserved hypothetical protein [Propionibacterium acnes HL063PA2]
 gi|313829397|gb|EFS67111.1| conserved hypothetical protein [Propionibacterium acnes HL063PA2]
 gi|315108879|gb|EFT80855.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2]
          Length = 522

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHKDVRLLRRVEDAV 472

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 473 GRPTIVDLIEHLAPGSTADLDHRYPLE 499


>gi|220916552|ref|YP_002491856.1| hypothetical protein A2cp1_1446 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219954406|gb|ACL64790.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 563

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 6/98 (6%)

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 514
           P + +   G+  RAV W  W+ G +G LY+      +   P  ++R   G   GDG L Y
Sbjct: 423 PGYEVDRPGAAARAVAWLGWRRGLSGELYYDMLQAWRGD-PWTDVRAFAGN--GDGTLLY 479

Query: 515 PGE--VFSSSRQ-PVASLRLERILSGLQDIEYLNLYAS 549
           PG   +   +R  PV S+RL+ +   L+D+E L L ++
Sbjct: 480 PGRPAMLGGTRPFPVESIRLKIVRDALEDVELLRLASA 517


>gi|404485075|ref|ZP_11020279.1| por secretion system C-terminal sorting domain-containing protein
           [Barnesiella intestinihominis YIT 11860]
 gi|404340080|gb|EJZ66511.1| por secretion system C-terminal sorting domain-containing protein
           [Barnesiella intestinihominis YIT 11860]
          Length = 697

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 48/118 (40%), Gaps = 14/118 (11%)

Query: 454 HPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLF 513
           HPN       ++   V W  + +G TGFL W  N + +  +           P GD  + 
Sbjct: 467 HPNQFTFSPPAESALVGWFAYAQGYTGFLRWAYNIWVQNVLKDTRC---ASAPAGDQHMV 523

Query: 514 YPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQGPER 571
           YPG        P  S+R+ R+  G+QD E L +   R    +GL   E     Q  +R
Sbjct: 524 YPG--------PRNSVRMARLREGIQDYEKLRVIMERL---KGLGTPEANENIQKLQR 570


>gi|255656530|ref|ZP_05401939.1| hypothetical protein CdifQCD-2_12719, partial [Clostridium
           difficile QCD-23m63]
          Length = 523

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 48/110 (43%), Gaps = 7/110 (6%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           +VC  P    PN  +     ++R + W  +  G  GFL W  N + +     +  +F   
Sbjct: 371 FVCCFPE--KPNSFISSPFVENRIIGWYTYYFGLDGFLRWDYNLWTEDPWKDSSYKFPI- 427

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
              GD    YPG+      +PV S+R+E +  G+QD E   +     GR+
Sbjct: 428 WKAGDMFFVYPGK----DLKPVRSVRVENLRFGIQDFELFTMLEKEKGRE 473


>gi|296450020|ref|ZP_06891783.1| conserved hypothetical protein, partial [Clostridium difficile
           NAP08]
 gi|296878401|ref|ZP_06902408.1| conserved hypothetical protein, partial [Clostridium difficile
           NAP07]
 gi|296261125|gb|EFH07957.1| conserved hypothetical protein [Clostridium difficile NAP08]
 gi|296430582|gb|EFH16422.1| conserved hypothetical protein [Clostridium difficile NAP07]
          Length = 522

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 48/110 (43%), Gaps = 7/110 (6%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           +VC  P    PN  +     ++R + W  +  G  GFL W  N + +     +  +F   
Sbjct: 370 FVCCFPE--KPNSFISSPFVENRIIGWYTYYFGLDGFLRWDYNLWTEDPWKDSSYKFPI- 426

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRD 554
              GD    YPG+      +PV S+R+E +  G+QD E   +     GR+
Sbjct: 427 WKAGDMFFVYPGK----DLKPVRSVRVENLRFGIQDFELFTMLEKEKGRE 472


>gi|354605511|ref|ZP_09023487.1| hypothetical protein HMPREF1003_00054 [Propionibacterium sp.
           5_U_42AFAA]
 gi|386024898|ref|YP_005943203.1| hypothetical protein PAZ_c20420 [Propionibacterium acnes 266]
 gi|332676356|gb|AEE73172.1| hypothetical protein PAZ_c20420 [Propionibacterium acnes 266]
 gi|353558520|gb|EHC27883.1| hypothetical protein HMPREF1003_00054 [Propionibacterium sp.
           5_U_42AFAA]
          Length = 550

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 86/243 (35%), Gaps = 44/243 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 369

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 408

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG---TYQGPERY 572
                     V SLRL+ +    +D+  L       GR   + L+E      T     RY
Sbjct: 469 ----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTIVDLIEHLAPGSTADLDHRY 524

Query: 573 TIE 575
            +E
Sbjct: 525 PLE 527


>gi|295131485|ref|YP_003582148.1| Neuraminidase NanP [Propionibacterium acnes SK137]
 gi|291377132|gb|ADE00987.1| Neuraminidase NanP [Propionibacterium acnes SK137]
          Length = 499

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 86/243 (35%), Gaps = 44/243 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 269 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 318

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 319 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 357

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 358 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 417

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG---TYQGPERY 572
                     V SLRL+ +    +D+  L       GR   + L+E      T     RY
Sbjct: 418 ----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTIVDLIEHLAPGSTADLDHRY 473

Query: 573 TIE 575
            +E
Sbjct: 474 PLE 476


>gi|456739043|gb|EMF63610.1| hypothetical protein TIA1EST31_09784 [Propionibacterium acnes
           FZ1/2/0]
          Length = 550

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 86/243 (35%), Gaps = 44/243 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPRATVI--------DAVDDPR-FATVV 369

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 408

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG---TYQGPERY 572
                     V SLRL+ +    +D+  L       GR   + L+E      T     RY
Sbjct: 469 ----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTIVDLIEHLAPGSTADLDHRY 524

Query: 573 TIE 575
            +E
Sbjct: 525 PLE 527


>gi|340622343|ref|YP_004740795.1| hypothetical protein Ccan_15720 [Capnocytophaga canimorsus Cc5]
 gi|339902609|gb|AEK23688.1| Hypothetical protein Ccan_15720 [Capnocytophaga canimorsus Cc5]
          Length = 596

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 59/132 (44%), Gaps = 13/132 (9%)

Query: 431 VTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE 490
           V E + + G+    Y C   ++P+PN     + ++   + W   K+   G+L W  N + 
Sbjct: 446 VLEKRQQQGKVTTYYTCC--TEPYPNTFTFSKPAESEWLAWYAAKDNLDGYLRWALNSW- 502

Query: 491 KATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASR 550
             + P  + RFR     GD  L YP           +SLR ER++ G+Q  E +N+    
Sbjct: 503 -TSEPLLDSRFR-AWAAGDTYLVYP--------MGRSSLRFERMVEGIQFYEKVNILRKE 552

Query: 551 YGRDEGLALLEK 562
           + + +    L+K
Sbjct: 553 FHQKQNAEALKK 564


>gi|116749617|ref|YP_846304.1| hypothetical protein Sfum_2187 [Syntrophobacter fumaroxidans MPOB]
 gi|116698681|gb|ABK17869.1| hypothetical protein Sfum_2187 [Syntrophobacter fumaroxidans MPOB]
          Length = 569

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 86/379 (22%), Positives = 135/379 (35%), Gaps = 84/379 (22%)

Query: 217 LSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSD--EWYEAL-DQHFKWLLQYRIS 273
           ++L VW+F LP    LP  I   +   +  +  RHG+D  E ++ L   + + +  Y+ +
Sbjct: 171 IALRVWNFALPE--DLPITIE-GNPFFKPAWFARHGADSPEKFDILVRDYLRSMRSYKFN 227

Query: 274 PFFCRWGESMRVLTYTCPWPA----------DHPKSDEY----FSDPRLAAYAVPYSPVL 319
                       L +  P+PA          D P  D        D     + VP  P  
Sbjct: 228 -----------ALGWILPFPAHQLRPPRKVEDFPAYDRMVHHVLEDLNYRFFRVPVLPGA 276

Query: 320 SSNDG--------AKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHA 371
              +G        AK +    +E L+      +A   + DEP     Y +VRN  + + +
Sbjct: 277 GGREGPGRGFVEEAKTFYPLWLEYLQRHRWENRAVIKIIDEP-EPAAYDAVRNAYALVKS 335

Query: 372 YAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIV 431
            AP  R   T   G    P      + +V   K+  P      T +              
Sbjct: 336 LAPGIR---TASAGREPDPKLAGAIDIWVSYAKYYNPAQIAEATRQ-------------- 378

Query: 432 TELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEK 491
                  G+E W Y+        P  +L       R + W +++    G L+WG   + +
Sbjct: 379 -------GQEVWLYINKLHGIDQPLVNL-------RLIGWYLYEYRFPGILFWGLAKWPQ 424

Query: 492 ---ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYA 548
                 P  E   RRG        +YP     ++ +PV + RLE +  G QD +YL L  
Sbjct: 425 DPWTANPGREDSLRRG------TFYYPD---PTTGKPVPTTRLEALRRGFQDCQYLLLVE 475

Query: 549 SRYGRDEGLALLEKTGTYQ 567
              G+   LA  E  G  Q
Sbjct: 476 EAIGQKR-LAPSEFAGIRQ 493


>gi|289426957|ref|ZP_06428676.1| conserved hypothetical protein [Propionibacterium acnes J165]
 gi|417930403|ref|ZP_12573781.1| hypothetical protein HMPREF9205_1262 [Propionibacterium acnes
           SK182]
 gi|289159779|gb|EFD07964.1| conserved hypothetical protein [Propionibacterium acnes J165]
 gi|340772245|gb|EGR94754.1| hypothetical protein HMPREF9205_1262 [Propionibacterium acnes
           SK182]
          Length = 543

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 86/243 (35%), Gaps = 44/243 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 313 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 362

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 363 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 401

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 402 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 461

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG---TYQGPERY 572
                     V SLRL+ +    +D+  L       GR   + L+E      T     RY
Sbjct: 462 ----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTIVDLIEHLAPGSTADLDHRY 517

Query: 573 TIE 575
            +E
Sbjct: 518 PLE 520


>gi|335050668|ref|ZP_08543624.1| hypothetical protein HMPREF9947_1061 [Propionibacterium sp.
           409-HC1]
 gi|333769177|gb|EGL46316.1| hypothetical protein HMPREF9947_1061 [Propionibacterium sp.
           409-HC1]
          Length = 426

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 98/267 (36%), Gaps = 47/267 (17%)

Query: 321 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 376
           S +G +D++   +  L   ++ H W  A Y+++ DEP     Y+S R     +    P A
Sbjct: 172 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 230

Query: 377 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 436
            V+        DA   P  F + V VP  +  H                     + E + 
Sbjct: 231 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 260

Query: 437 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 491
              +  W Y     +   PN HLGM  ++ RA+   +W     G L+W  N +       
Sbjct: 261 AGLDGMWVYTSCASTFWDPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 320

Query: 492 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY 551
              P+A+       P GD  + YP          V SLRL+ +    +D+  L       
Sbjct: 321 LVDPNADTSADLAFPSGDSSVIYP----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAV 376

Query: 552 GRDEGLALLEKTG---TYQGPERYTIE 575
           GR   + L+E      T     RY +E
Sbjct: 377 GRPTIVDLIEHLAPGSTADLDHRYPLE 403


>gi|422425269|ref|ZP_16502212.1| hypothetical protein HMPREF9570_01721 [Propionibacterium acnes
           HL043PA1]
 gi|327446506|gb|EGE93160.1| hypothetical protein HMPREF9570_01721 [Propionibacterium acnes
           HL043PA1]
          Length = 530

 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 86/243 (35%), Gaps = 44/243 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 349

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 388

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 448

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG---TYQGPERY 572
                     V SLRL+ +    +D+  L       GR   + L+E      T     RY
Sbjct: 449 ----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTIVDLIEHLAPGSTADLDHRY 504

Query: 573 TIE 575
            +E
Sbjct: 505 PLE 507


>gi|422386169|ref|ZP_16466290.1| hypothetical protein HMPREF9337_02422 [Propionibacterium acnes
           HL096PA3]
 gi|422392231|ref|ZP_16472304.1| hypothetical protein HMPREF9343_00726 [Propionibacterium acnes
           HL099PA1]
 gi|422430092|ref|ZP_16506978.1| conserved hypothetical protein [Propionibacterium acnes HL072PA2]
 gi|422438517|ref|ZP_16515359.1| hypothetical protein HMPREF9584_02020 [Propionibacterium acnes
           HL092PA1]
 gi|422448230|ref|ZP_16524960.1| conserved hypothetical protein [Propionibacterium acnes HL036PA3]
 gi|422460209|ref|ZP_16536849.1| conserved hypothetical protein [Propionibacterium acnes HL038PA1]
 gi|422475370|ref|ZP_16551823.1| conserved hypothetical protein [Propionibacterium acnes HL056PA1]
 gi|422477484|ref|ZP_16553910.1| conserved hypothetical protein [Propionibacterium acnes HL007PA1]
 gi|422479834|ref|ZP_16556241.1| conserved hypothetical protein [Propionibacterium acnes HL063PA1]
 gi|422482229|ref|ZP_16558627.1| conserved hypothetical protein [Propionibacterium acnes HL036PA1]
 gi|422486346|ref|ZP_16562698.1| hypothetical protein HMPREF9571_02624 [Propionibacterium acnes
           HL043PA2]
 gi|422487051|ref|ZP_16563389.1| hypothetical protein HMPREF9568_00642 [Propionibacterium acnes
           HL013PA2]
 gi|422490276|ref|ZP_16566592.1| hypothetical protein HMPREF9563_01330 [Propionibacterium acnes
           HL020PA1]
 gi|422493740|ref|ZP_16570038.1| conserved hypothetical protein [Propionibacterium acnes HL086PA1]
 gi|422499096|ref|ZP_16575366.1| conserved hypothetical protein [Propionibacterium acnes HL002PA3]
 gi|422502888|ref|ZP_16579132.1| conserved hypothetical protein [Propionibacterium acnes HL027PA2]
 gi|422506829|ref|ZP_16583051.1| conserved hypothetical protein [Propionibacterium acnes HL036PA2]
 gi|422507153|ref|ZP_16583363.1| conserved hypothetical protein [Propionibacterium acnes HL046PA2]
 gi|422514522|ref|ZP_16590642.1| conserved hypothetical protein [Propionibacterium acnes HL087PA2]
 gi|422515275|ref|ZP_16591389.1| conserved hypothetical protein [Propionibacterium acnes HL110PA2]
 gi|422517455|ref|ZP_16593549.1| conserved hypothetical protein [Propionibacterium acnes HL074PA1]
 gi|422521028|ref|ZP_16597065.1| conserved hypothetical protein [Propionibacterium acnes HL045PA1]
 gi|422522534|ref|ZP_16598557.1| conserved hypothetical protein [Propionibacterium acnes HL053PA2]
 gi|422526506|ref|ZP_16602502.1| conserved hypothetical protein [Propionibacterium acnes HL083PA1]
 gi|422529133|ref|ZP_16605104.1| conserved hypothetical protein [Propionibacterium acnes HL053PA1]
 gi|422530895|ref|ZP_16606851.1| conserved hypothetical protein [Propionibacterium acnes HL110PA1]
 gi|422533480|ref|ZP_16609415.1| conserved hypothetical protein [Propionibacterium acnes HL072PA1]
 gi|422536918|ref|ZP_16612807.1| conserved hypothetical protein [Propionibacterium acnes HL078PA1]
 gi|422544974|ref|ZP_16620805.1| conserved hypothetical protein [Propionibacterium acnes HL082PA1]
 gi|422553521|ref|ZP_16629304.1| conserved hypothetical protein [Propionibacterium acnes HL005PA3]
 gi|422555761|ref|ZP_16631525.1| conserved hypothetical protein [Propionibacterium acnes HL005PA2]
 gi|422559692|ref|ZP_16635417.1| conserved hypothetical protein [Propionibacterium acnes HL005PA1]
 gi|422566643|ref|ZP_16642274.1| conserved hypothetical protein [Propionibacterium acnes HL002PA2]
 gi|313773388|gb|EFS39354.1| conserved hypothetical protein [Propionibacterium acnes HL074PA1]
 gi|313793544|gb|EFS41587.1| conserved hypothetical protein [Propionibacterium acnes HL110PA1]
 gi|313802854|gb|EFS44067.1| conserved hypothetical protein [Propionibacterium acnes HL110PA2]
 gi|313806300|gb|EFS44816.1| conserved hypothetical protein [Propionibacterium acnes HL087PA2]
 gi|313810714|gb|EFS48428.1| conserved hypothetical protein [Propionibacterium acnes HL083PA1]
 gi|313819411|gb|EFS57125.1| conserved hypothetical protein [Propionibacterium acnes HL046PA2]
 gi|313821218|gb|EFS58932.1| conserved hypothetical protein [Propionibacterium acnes HL036PA1]
 gi|313822328|gb|EFS60042.1| conserved hypothetical protein [Propionibacterium acnes HL036PA2]
 gi|313826114|gb|EFS63828.1| conserved hypothetical protein [Propionibacterium acnes HL063PA1]
 gi|313831018|gb|EFS68732.1| conserved hypothetical protein [Propionibacterium acnes HL007PA1]
 gi|313833151|gb|EFS70865.1| conserved hypothetical protein [Propionibacterium acnes HL056PA1]
 gi|313838209|gb|EFS75923.1| conserved hypothetical protein [Propionibacterium acnes HL086PA1]
 gi|314926058|gb|EFS89889.1| conserved hypothetical protein [Propionibacterium acnes HL036PA3]
 gi|314962219|gb|EFT06320.1| conserved hypothetical protein [Propionibacterium acnes HL002PA2]
 gi|314963267|gb|EFT07367.1| conserved hypothetical protein [Propionibacterium acnes HL082PA1]
 gi|314973911|gb|EFT18007.1| conserved hypothetical protein [Propionibacterium acnes HL053PA1]
 gi|314976839|gb|EFT20934.1| conserved hypothetical protein [Propionibacterium acnes HL045PA1]
 gi|314979400|gb|EFT23494.1| conserved hypothetical protein [Propionibacterium acnes HL072PA2]
 gi|314984961|gb|EFT29053.1| conserved hypothetical protein [Propionibacterium acnes HL005PA1]
 gi|314986366|gb|EFT30458.1| conserved hypothetical protein [Propionibacterium acnes HL005PA2]
 gi|314988538|gb|EFT32629.1| conserved hypothetical protein [Propionibacterium acnes HL005PA3]
 gi|315079732|gb|EFT51720.1| conserved hypothetical protein [Propionibacterium acnes HL053PA2]
 gi|315080984|gb|EFT52960.1| conserved hypothetical protein [Propionibacterium acnes HL078PA1]
 gi|315083953|gb|EFT55929.1| conserved hypothetical protein [Propionibacterium acnes HL027PA2]
 gi|315085090|gb|EFT57066.1| conserved hypothetical protein [Propionibacterium acnes HL002PA3]
 gi|315089519|gb|EFT61495.1| conserved hypothetical protein [Propionibacterium acnes HL072PA1]
 gi|315097747|gb|EFT69723.1| conserved hypothetical protein [Propionibacterium acnes HL038PA1]
 gi|327325682|gb|EGE67479.1| hypothetical protein HMPREF9337_02422 [Propionibacterium acnes
           HL096PA3]
 gi|327443366|gb|EGE90020.1| hypothetical protein HMPREF9571_02624 [Propionibacterium acnes
           HL043PA2]
 gi|327447600|gb|EGE94254.1| hypothetical protein HMPREF9568_00642 [Propionibacterium acnes
           HL013PA2]
 gi|327452473|gb|EGE99127.1| hypothetical protein HMPREF9584_02020 [Propionibacterium acnes
           HL092PA1]
 gi|328755409|gb|EGF69025.1| hypothetical protein HMPREF9563_01330 [Propionibacterium acnes
           HL020PA1]
 gi|328761596|gb|EGF75113.1| hypothetical protein HMPREF9343_00726 [Propionibacterium acnes
           HL099PA1]
          Length = 530

 Score = 44.3 bits (103), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 86/243 (35%), Gaps = 44/243 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 349

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 388

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 448

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG---TYQGPERY 572
                     V SLRL+ +    +D+  L       GR   + L+E      T     RY
Sbjct: 449 ----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTIVDLIEHLAPGSTADLDHRY 504

Query: 573 TIE 575
            +E
Sbjct: 505 PLE 507


>gi|422387503|ref|ZP_16467618.1| hypothetical protein HMPREF9338_01115 [Propionibacterium acnes
           HL096PA2]
 gi|327330870|gb|EGE72615.1| hypothetical protein HMPREF9338_01115 [Propionibacterium acnes
           HL096PA2]
          Length = 530

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 86/243 (35%), Gaps = 44/243 (18%)

Query: 342 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 400
           W  A Y+++ DEP     Y+S R     +    P A V+        DA   P  F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 349

Query: 401 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 460
            VP  +  H                     + E +    +  W Y     +   PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 388

Query: 461 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 515
           M  ++ RA+   +W     G L+W  N +          P+A+       P GD  + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 448

Query: 516 GEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTG---TYQGPERY 572
                     V SLRL+ +    +D+  L       GR   + L+E      T     RY
Sbjct: 449 ----RVDGSLVPSLRLKVLAQLHEDVRLLRRVEDAVGRPTIVDLIEHLAPGSTADLDHRY 504

Query: 573 TIE 575
            +E
Sbjct: 505 PLE 507


>gi|336422517|ref|ZP_08602661.1| hypothetical protein HMPREF0993_02038 [Lachnospiraceae bacterium
           5_1_57FAA]
 gi|336008443|gb|EGN38461.1| hypothetical protein HMPREF0993_02038 [Lachnospiraceae bacterium
           5_1_57FAA]
          Length = 559

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 50/124 (40%), Gaps = 9/124 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSA 497
           WTY C G      N  + M  +++R    +++K    G L+WG N Y      +   P  
Sbjct: 399 WTYYCTGQYYEVSNRFMSMPSARNRIYGVQLYKYDIVGILHWGYNFYNSQFSIEHINPYE 458

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                   P GD  L YPG    +   P  S+R+      + D+  L L  S   ++  +
Sbjct: 459 VTDAGNAFPAGDPFLVYPG----ADGHPEESIRMMVHDEAMADLRALELLESLTSKEHVM 514

Query: 558 ALLE 561
            L+E
Sbjct: 515 ELIE 518


>gi|167759680|ref|ZP_02431807.1| hypothetical protein CLOSCI_02039 [Clostridium scindens ATCC 35704]
 gi|167662711|gb|EDS06841.1| hypothetical protein CLOSCI_02039 [Clostridium scindens ATCC 35704]
          Length = 559

 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 50/124 (40%), Gaps = 9/124 (7%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSA 497
           WTY C G      N  + M  +++R    +++K    G L+WG N Y      +   P  
Sbjct: 399 WTYYCTGQYYEVSNRFMSMPSARNRIYGVQLYKYDIVGILHWGYNFYNSQFSIEHINPYE 458

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
                   P GD  L YPG    +   P  S+R+      + D+  L L  S   ++  +
Sbjct: 459 VTDAGNAFPAGDPFLVYPG----ADGHPEESIRMMVHDEAMADLRALELLESLTSKEHVM 514

Query: 558 ALLE 561
            L+E
Sbjct: 515 ELIE 518


>gi|266619418|ref|ZP_06112353.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
 gi|288869045|gb|EFD01344.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
          Length = 554

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 58/238 (24%), Positives = 90/238 (37%), Gaps = 38/238 (15%)

Query: 335 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 394
           +L+ +  +++ YF++ DEP + E   S R     + +   D RV+        DA     
Sbjct: 320 VLKEEDMFEQTYFHISDEP-HGEQMESYRAAKESIRSLLADCRVI--------DA----- 365

Query: 395 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPH 454
              SF      +  H  I C S+ +    E  VKD+            WTY C       
Sbjct: 366 -LSSFFIYQAGVVEHP-IVC-SDCLEPFLEAGVKDL------------WTYYCCVQGYEV 410

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPPGD 509
            N  + M  +++R +  +++     GFL+WG N Y       A  P        G P GD
Sbjct: 411 SNQFMAMPSARNRILGVQLYLYHIKGFLHWGYNFYNSQHSVSAVNPYLVTDAGGGFPSGD 470

Query: 510 GVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEKTGTYQ 567
             + YP E       P  SLR   +   + D+  L       GRD    L+ +   Y+
Sbjct: 471 PFVVYPSE----DGTPHDSLRFMVLTEAMYDLRALERLEELAGRDYTERLIHEGLEYR 524


>gi|266624158|ref|ZP_06117093.1| conserved hypothetical protein, partial [Clostridium hathewayi DSM
           13479]
 gi|288864007|gb|EFC96305.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
          Length = 527

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 55/147 (37%), Gaps = 15/147 (10%)

Query: 421 GNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTG 480
           GN E      + E      +  W Y C        N  + M  +++R    +++K    G
Sbjct: 382 GNNE------IAEFLEHGVDNMWVYYCTMQYLDVSNRFMAMPSARNRIYGLQIYKYDIIG 435

Query: 481 FLYWGANCYE-----KATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERIL 535
            L+WG N Y      +   P        G P GD  L YPGE       P  S+R   + 
Sbjct: 436 ILHWGYNFYNSQWSTRHINPYEVTDADYGFPAGDSFLVYPGE----DGYPEESIRGMVLN 491

Query: 536 SGLQDIEYLNLYASRYGRDEGLALLEK 562
             L D+    L  +   ++  L +LE+
Sbjct: 492 EALNDLRACQLLEALTSKEYVLGILEE 518


>gi|305664790|ref|YP_003861077.1| hypothetical protein FB2170_00755 [Maribacter sp. HTCC2170]
 gi|88707912|gb|EAR00151.1| hypothetical protein FB2170_00755 [Maribacter sp. HTCC2170]
          Length = 553

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 46/113 (40%), Gaps = 12/113 (10%)

Query: 439 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKAT 493
           GE+   Y C+ P     N  L M   +     W        G+L+WG N Y     E + 
Sbjct: 397 GEKVLVYTCLIPGGKWLNRTLDMEKIRQVYFGWGASYYNTGGYLHWGLNQYKEDPFENSV 456

Query: 494 V--PSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYL 544
           V  PS        LP GD  + YPG     S  P++S+R E    G +D E L
Sbjct: 457 VKHPSPAASANNYLPAGDTHIVYPG-----SDGPLSSIRFEAHRIGCEDYELL 504


>gi|403220771|dbj|BAM38904.1| conserved hypothetical protein [Theileria orientalis strain
           Shintoku]
          Length = 210

 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 34/149 (22%), Positives = 68/149 (45%), Gaps = 23/149 (15%)

Query: 280 GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTK 339
           G+++++L +    P DH   +E+F D     Y   +  +  S+D +++ V   I +++  
Sbjct: 11  GQNLQILKFLFSIPNDHV--NEHFDDK----YVREFHRLDDSSDNSEELVTARI-IIKLI 63

Query: 340 AHWKKAYFYLWDEPL--NMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFE 397
            H  + +  + D+ +  NME ++ +R+++ ELH Y+            P     G    E
Sbjct: 64  KHEFEKFNLIRDQYITPNMERFTQIRHLSQELHPYSDT----------PCSTQAGCDKLE 113

Query: 398 SFVKVPKFLRPHTQ----IYCTSEWVLGN 422
             + +  ++R  T     I+ T   VLGN
Sbjct: 114 MLMNLCSYIRGGTSFAYDIFATMVHVLGN 142


>gi|423228561|ref|ZP_17214967.1| hypothetical protein HMPREF1063_00787 [Bacteroides dorei
           CL02T00C15]
 gi|423243826|ref|ZP_17224902.1| hypothetical protein HMPREF1064_01108 [Bacteroides dorei
           CL02T12C06]
 gi|392635869|gb|EIY29761.1| hypothetical protein HMPREF1063_00787 [Bacteroides dorei
           CL02T00C15]
 gi|392643971|gb|EIY37716.1| hypothetical protein HMPREF1064_01108 [Bacteroides dorei
           CL02T12C06]
          Length = 582

 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 33/130 (25%), Positives = 54/130 (41%), Gaps = 23/130 (17%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   ++P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 446 YTCC--TEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 500

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGR----------D 554
              GD  + YPG          +S+RLER+  G+Q  E + +    + R          D
Sbjct: 501 WAAGDTYMIYPG--------ARSSIRLERLTEGVQAFEKIRILKEEFERKGNKGAIKNID 552

Query: 555 EGLALLEKTG 564
           + L + +++G
Sbjct: 553 KALKMFDESG 562


>gi|265751189|ref|ZP_06087252.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|423239680|ref|ZP_17220796.1| hypothetical protein HMPREF1065_01419 [Bacteroides dorei
           CL03T12C01]
 gi|263238085|gb|EEZ23535.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
 gi|392645720|gb|EIY39443.1| hypothetical protein HMPREF1065_01419 [Bacteroides dorei
           CL03T12C01]
          Length = 582

 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 33/130 (25%), Positives = 54/130 (41%), Gaps = 23/130 (17%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   ++P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 446 YTCC--TEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 500

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGR----------D 554
              GD  + YPG          +S+RLER+  G+Q  E + +    + R          D
Sbjct: 501 WAAGDTYMIYPG--------ARSSIRLERLTEGVQAFEKIRILKEEFERKGNKGAIKNID 552

Query: 555 EGLALLEKTG 564
           + L + +++G
Sbjct: 553 KALKMFDESG 562


>gi|212693958|ref|ZP_03302086.1| hypothetical protein BACDOR_03482 [Bacteroides dorei DSM 17855]
 gi|212663490|gb|EEB24064.1| hypothetical protein BACDOR_03482 [Bacteroides dorei DSM 17855]
          Length = 582

 Score = 43.1 bits (100), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 33/130 (25%), Positives = 54/130 (41%), Gaps = 23/130 (17%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   ++P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 446 YTCC--TEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 500

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGR----------D 554
              GD  + YPG          +S+RLER+  G+Q  E + +    + R          D
Sbjct: 501 WAAGDTYMIYPG--------ARSSIRLERLTEGVQAFEKIRILKEEFERKGNKGAIKNID 552

Query: 555 EGLALLEKTG 564
           + L + +++G
Sbjct: 553 KALKMFDESG 562


>gi|406899156|gb|EKD42509.1| hypothetical protein ACD_73C00130G0001, partial [uncultured
           bacterium]
          Length = 200

 Score = 42.7 bits (99), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 48/110 (43%), Gaps = 26/110 (23%)

Query: 463 GSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR---------GLPP------ 507
           G   R   W  +K+G   + YW +  Y        E    +         G+ P      
Sbjct: 29  GVALRVNAWAQFKKGVRRWFYWESTYYNNFQAGLGETDVFKSAKTFGGTTGIDPVYGETG 88

Query: 508 -----GDGVLFYPG--EVFSSSR----QPVASLRLERILSGLQDIEYLNL 546
                GDGVLFYPG  ++F +S      P+ASLRL+    G+QD++YL L
Sbjct: 89  GLYSNGDGVLFYPGTDKIFPASSYGVDGPIASLRLKYWRRGIQDVDYLYL 138


>gi|153004228|ref|YP_001378553.1| hypothetical protein Anae109_1362 [Anaeromyxobacter sp. Fw109-5]
 gi|152027801|gb|ABS25569.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
          Length = 562

 Score = 42.4 bits (98), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 45/95 (47%), Gaps = 5/95 (5%)

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 514
           P + +    +  RAV W  W+ G +G LY+          P  ++R   G   GDG L Y
Sbjct: 414 PGYAIDRPATAARAVPWLGWRRGLSGELYYDVLHAWIDADPWHDVRAFAGN--GDGTLLY 471

Query: 515 PG---EVFSSSRQPVASLRLERILSGLQDIEYLNL 546
           PG   E+      PV S+RL+ I   L+D+E L +
Sbjct: 472 PGRPAELGGEHGFPVESIRLKVIRDALEDVELLRM 506


>gi|162450247|ref|YP_001612614.1| hypothetical protein sce1975 [Sorangium cellulosum So ce56]
 gi|161160829|emb|CAN92134.1| hypothetical protein predicted by Glimmer/Critica [Sorangium
           cellulosum So ce56]
          Length = 687

 Score = 42.4 bits (98), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 4/60 (6%)

Query: 82  VPDALVPLDL--PVCQISLIPG--ETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSS 137
           VPDAL+P++L  P     L  G  ET AVW+ +  P    PG YEG +++ S +  EL+S
Sbjct: 176 VPDALIPVELAPPWAPYPLEVGARETRAVWIDLHVPEGALPGAYEGRVVVGSVSHGELAS 235


>gi|86158927|ref|YP_465712.1| hypothetical protein Adeh_2505 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85775438|gb|ABC82275.1| hypothetical protein Adeh_2505 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 564

 Score = 42.4 bits (98), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 46/91 (50%), Gaps = 6/91 (6%)

Query: 455 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 514
           P + +   G+  RAV W  W+ G +G LY+      +   P  ++R   G   GDG L Y
Sbjct: 424 PGYEIDRPGAAARAVAWLGWRRGLSGELYYDMLQAWRGD-PWTDVRAFAGN--GDGTLLY 480

Query: 515 PG---EVFSSSRQPVASLRLERILSGLQDIE 542
           PG   E+  +   PV S+RL+ +   L+D+E
Sbjct: 481 PGRPAELGGTRPFPVESIRLKIVRDALEDVE 511


>gi|227535774|ref|ZP_03965823.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227244262|gb|EEI94277.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 577

 Score = 42.0 bits (97), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 48/111 (43%), Gaps = 11/111 (9%)

Query: 452 DPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGV 511
           +P PN       S+   + W   + G  G+L W  +C+ K   P+ + RF   L  GD  
Sbjct: 446 EPFPNTFTSSGYSEATWLPWNSVQRGFDGYLRWAFDCWNKN--PNLDTRFGTWL-AGDAW 502

Query: 512 LFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
             YP            S+R E+++ G+QD E + +   +  ++   A L K
Sbjct: 503 FIYP--------DVKTSIRFEKLIEGVQDAEKIRIIREKLKKEGKTAELNK 545


>gi|237710381|ref|ZP_04540862.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
 gi|345516852|ref|ZP_08796337.1| hypothetical protein BSEG_03954 [Bacteroides dorei 5_1_36/D4]
 gi|229437736|gb|EEO47813.1| hypothetical protein BSEG_03954 [Bacteroides dorei 5_1_36/D4]
 gi|229455843|gb|EEO61564.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
          Length = 582

 Score = 42.0 bits (97), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 46/109 (42%), Gaps = 13/109 (11%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   ++P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 446 YTCC--TEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 500

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGR 553
              GD  + YPG          +S+RLER+  G+Q  E + +    + R
Sbjct: 501 WAAGDTYMIYPG--------ARSSIRLERLTEGVQAFEKIRILKEEFER 541


>gi|326798853|ref|YP_004316672.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326549617|gb|ADZ78002.1| hypothetical protein Sph21_1440 [Sphingobacterium sp. 21]
          Length = 580

 Score = 42.0 bits (97), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 50/113 (44%), Gaps = 14/113 (12%)

Query: 452 DPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGV 511
           +P PN       ++   + W    +   G+L W  NC+     P  + R+   L  GD  
Sbjct: 449 EPFPNTFTSSSYAESTWLAWYALNKDFDGYLRWAFNCW--GPRPLQDTRYGTWL-AGDAW 505

Query: 512 LFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYAS---RYGRDEGLALLE 561
           L YPG          +S+R ER++ G+QD E   +  +   + GR+E L  LE
Sbjct: 506 LVYPG--------VRSSIRFERLVEGIQDYEKAKIIRAQLKKEGRNEELQKLE 550


>gi|304384577|ref|ZP_07366923.1| conserved hypothetical protein [Pediococcus acidilactici DSM 20284]
 gi|304328771|gb|EFL95991.1| conserved hypothetical protein [Pediococcus acidilactici DSM 20284]
          Length = 569

 Score = 42.0 bits (97), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 60/252 (23%), Positives = 97/252 (38%), Gaps = 44/252 (17%)

Query: 327 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGP 386
           +YV+   + L     W+K    + DEP         ++  S L    PD +V   +   P
Sbjct: 322 NYVKALCDHLNELQVWEKVRL-IADEP-KQAQLKEFKDALSALKQMVPDLKVKVAFDKEP 379

Query: 387 ---SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWW 443
                APL  T   SF             YCTS++             ++LQ  +  E  
Sbjct: 380 ILNELAPLVDTLATSF-------------YCTSQFG------------SKLQASHPGEVQ 414

Query: 444 TYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR- 502
            Y+C  P  P+   H  +  ++ +  +       G   L W  NC+   +    +IR+  
Sbjct: 415 YYICNYPDHPNTFLHSPLLETRLQGTLTAFLPVNG--LLRWAFNCW--PSNAREDIRYNT 470

Query: 503 RGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEG--LALL 560
             LP GD  L YPGE    +   + SLR +++   ++D    +L  S    DE    A+L
Sbjct: 471 SSLPIGDNCLVYPGE----NGHLLLSLRYKQLERAVED---FSLIKSAKAADEKRTTAIL 523

Query: 561 EKTGTYQGPERY 572
           E       P+++
Sbjct: 524 EHFLGETAPQKW 535


>gi|365132343|ref|ZP_09342149.1| hypothetical protein HMPREF1032_03945 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363616981|gb|EHL68397.1| hypothetical protein HMPREF1032_03945 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 571

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 9/95 (9%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 497
           W Y C   S   PN    M  +++R +   ++  G  GFL+WG N Y          P  
Sbjct: 416 WVYYCCAQSSLVPNRFFAMESARNRIMGVLMYLYGIKGFLHWGYNFYNSKFSLHPVDPYR 475

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 532
                   P GD  L YPG        P++S+R E
Sbjct: 476 VTHADYAFPSGDPFLVYPG----PDGAPLSSVRAE 506


>gi|189467789|ref|ZP_03016574.1| hypothetical protein BACINT_04181 [Bacteroides intestinalis DSM
           17393]
 gi|189436053|gb|EDV05038.1| hypothetical protein BACINT_04181 [Bacteroides intestinalis DSM
           17393]
          Length = 582

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 47/109 (43%), Gaps = 13/109 (11%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   S+P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 446 YTCC--SEPRPNTFTFSPPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 500

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGR 553
              GD  L YPG          +S+R+ER++ G+Q  E + +    + +
Sbjct: 501 WAAGDTYLIYPG--------ARSSIRMERLVEGVQAFEKVRILKEEFKK 541


>gi|375254368|ref|YP_005013535.1| hypothetical protein BFO_0590 [Tannerella forsythia ATCC 43037]
 gi|363408609|gb|AEW22295.1| hypothetical protein BFO_0590 [Tannerella forsythia ATCC 43037]
          Length = 605

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 49/114 (42%), Gaps = 14/114 (12%)

Query: 452 DPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGV 511
           +P+PN       ++   + W   KEG  G+L W  N + +   P  + RF      GD  
Sbjct: 475 EPYPNTFTFSSPAESEWLGWYAAKEGLDGYLRWAYNSWVRE--PLLDSRFYT-WAAGDTY 531

Query: 512 LFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDE---GLALLEK 562
             YPG          +SLR E +++G+Q  E + +    Y R     GL  +EK
Sbjct: 532 FIYPG--------ARSSLRFEHLIAGIQAFEKIRILKEEYTRTNNRTGLRRIEK 577


>gi|116623984|ref|YP_826140.1| hypothetical protein Acid_4896 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116227146|gb|ABJ85855.1| hypothetical protein Acid_4896 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 543

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 60/161 (37%), Gaps = 34/161 (21%)

Query: 2   PSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSAS 61
           PS   VG  +    +  ++L AAR E ES QI     V+  +S   G V +  SDL    
Sbjct: 38  PSMQRVGMTDPAGSVSDVSLAAARGEYESFQI-----VANGASKGLGNVNLTVSDLEGPD 92

Query: 62  GDRLVVGQSLMLRRVV---------------PMLG--VPDALVPLDLP-------VCQIS 97
           G  +  G   + R                  PM     PDAL+P   P         +IS
Sbjct: 93  GKVIPHGNFTLYREKYMHVTSPSPNWKGSNQPMGAGWYPDALIPFTDPDTGKPLSGAKIS 152

Query: 98  LIP-----GETTAVWVSIDAPYAQPPGLYEGEIIITSKADT 133
            +P     G    VWV +  P     G Y+G   +TS   T
Sbjct: 153 AVPFDVKAGNNQPVWVDLLVPQTAQAGTYKGTYTVTSNEGT 193


>gi|255280540|ref|ZP_05345095.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469]
 gi|255269005|gb|EET62210.1| hypothetical protein BRYFOR_05874 [Marvinbryantia formatexigens DSM
           14469]
          Length = 566

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/116 (27%), Positives = 47/116 (40%), Gaps = 12/116 (10%)

Query: 443 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 497
           WTY C   +    N  +   G + R +  +++K    GFL WG N Y          P  
Sbjct: 406 WTYYCTAQAVDVSNRFIAQPGFRTRILGMQLYKFEIDGFLQWGYNFYNSEYSLYPINPYQ 465

Query: 498 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLR---LERILSGLQDIEYLNLYASR 550
                   P GD  L YPG+      QP  S+R   +E  ++ L+ ++YL     R
Sbjct: 466 CTDADGAFPSGDPFLVYPGK----DGQPEDSIRCMLMEETMNDLRAMKYLESLTDR 517


>gi|319902289|ref|YP_004162017.1| hypothetical protein Bache_2466 [Bacteroides helcogenes P 36-108]
 gi|319417320|gb|ADV44431.1| hypothetical protein Bache_2466 [Bacteroides helcogenes P 36-108]
          Length = 606

 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 45/109 (41%), Gaps = 12/109 (11%)

Query: 471 WRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLR 530
           W   K G  G+L W  N +     P  + RF      GD    YPG           SLR
Sbjct: 494 WYAAKAGLDGYLRWAYNSW--VIEPLVDSRFYT-WAAGDTYFVYPG--------ARTSLR 542

Query: 531 LERILSGLQDIEYLNLYASRYGRDEGLALLEKTG-TYQGPERYTIEHLP 578
            ER++SG+Q  E + +    + R    + L+K G   +  +   + H+P
Sbjct: 543 FERLISGVQAFEKVRILREEFTRTNNRSALKKIGKALEAFDETALNHIP 591


>gi|319641795|ref|ZP_07996475.1| hypothetical protein HMPREF9011_02073 [Bacteroides sp. 3_1_40A]
 gi|345518533|ref|ZP_08797981.1| hypothetical protein BSFG_02375 [Bacteroides sp. 4_3_47FAA]
 gi|254835919|gb|EET16228.1| hypothetical protein BSFG_02375 [Bacteroides sp. 4_3_47FAA]
 gi|317386584|gb|EFV67483.1| hypothetical protein HMPREF9011_02073 [Bacteroides sp. 3_1_40A]
          Length = 582

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 51/122 (41%), Gaps = 16/122 (13%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   ++P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 446 YTCC--TEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 500

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY---GRDEGLALLE 561
              GD  + YPG          +S+RLER+  G+Q  E + +    +   G    +  ++
Sbjct: 501 WAAGDTYMIYPG--------ARSSIRLERLTEGIQFFEKVRILKEEFEEKGNKGAIKNID 552

Query: 562 KT 563
           KT
Sbjct: 553 KT 554


>gi|300771575|ref|ZP_07081450.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300761564|gb|EFK58385.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 577

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 47/111 (42%), Gaps = 11/111 (9%)

Query: 452 DPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGV 511
           +P PN       S+   + W   + G  G+L W  +C+ K   P+ + RF   L  GD  
Sbjct: 446 EPFPNTFTSSGYSEATWLPWNSVQRGFDGYLRWAFDCWNKN--PNLDTRFGTWL-AGDAW 502

Query: 512 LFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRYGRDEGLALLEK 562
             YP            S+R E+++ G+QD E + +   +   +   A L K
Sbjct: 503 FIYP--------DVKTSIRFEKLIEGIQDAEKIRIIREKLKGEGKTAELNK 545


>gi|150004737|ref|YP_001299481.1| hypothetical protein BVU_2198 [Bacteroides vulgatus ATCC 8482]
 gi|149933161|gb|ABR39859.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
          Length = 582

 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 51/122 (41%), Gaps = 16/122 (13%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   ++P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 446 YTCC--TEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 500

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY---GRDEGLALLE 561
              GD  + YPG          +S+RLER+  G+Q  E + +    +   G    +  ++
Sbjct: 501 WAAGDTYMIYPG--------ARSSIRLERLTEGIQFFEKVRILKEEFEEKGNKGAIKNID 552

Query: 562 KT 563
           KT
Sbjct: 553 KT 554


>gi|423312345|ref|ZP_17290282.1| hypothetical protein HMPREF1058_00894 [Bacteroides vulgatus
           CL09T03C04]
 gi|392688829|gb|EIY82113.1| hypothetical protein HMPREF1058_00894 [Bacteroides vulgatus
           CL09T03C04]
          Length = 582

 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 51/122 (41%), Gaps = 16/122 (13%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   ++P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 446 YTCC--TEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 500

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY---GRDEGLALLE 561
              GD  + YPG          +S+RLER+  G+Q  E + +    +   G    +  ++
Sbjct: 501 WAAGDTYMIYPG--------ARSSIRLERLTEGIQFFEKVRILKEEFEEKGNKGAIKNID 552

Query: 562 KT 563
           KT
Sbjct: 553 KT 554


>gi|294775167|ref|ZP_06740693.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
 gi|294450979|gb|EFG19453.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
          Length = 566

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 51/122 (41%), Gaps = 16/122 (13%)

Query: 445 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 504
           Y C   ++P PN       ++   + W   KE   G+L W  N + K   P  + RF   
Sbjct: 430 YTCC--TEPRPNTFTFSEPAEAEWLAWHSAKENLDGYLRWALNSWVKN--PLQDSRF-TA 484

Query: 505 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQDIEYLNLYASRY---GRDEGLALLE 561
              GD  + YPG          +S+RLER+  G+Q  E + +    +   G    +  ++
Sbjct: 485 WAAGDTYMIYPG--------ARSSIRLERLTEGIQFFEKVRILKEEFEEKGNKGAIKNID 536

Query: 562 KT 563
           KT
Sbjct: 537 KT 538


>gi|383763514|ref|YP_005442496.1| hypothetical protein CLDAP_25590 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381383782|dbj|BAM00599.1| hypothetical protein CLDAP_25590 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 721

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 30/49 (61%), Gaps = 8/49 (16%)

Query: 508 GDGVLFYPG-------EVFSSSRQPVASLRLERILSGLQDIEYLNLYAS 549
           GDGVLFYPG       E +  S  P ASLRL+    G+QD++YL L A+
Sbjct: 575 GDGVLFYPGTDTRFPAESYGLS-GPFASLRLKMWRRGIQDVDYLTLAAA 622


>gi|403738597|ref|ZP_10951198.1| hypothetical protein AUCHE_08_02110 [Austwickia chelonae NBRC
           105200]
 gi|403191247|dbj|GAB77968.1| hypothetical protein AUCHE_08_02110 [Austwickia chelonae NBRC
           105200]
          Length = 553

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 34/143 (23%), Positives = 53/143 (37%), Gaps = 13/143 (9%)

Query: 425 DLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYW 484
           D V+  + +  P      W Y C+       N  +    ++HRA+   +      GFL+W
Sbjct: 373 DHVESFLAQGHPAR----WVYHCVSQCRAVANRFIAQPMTRHRALALHLHGHRAEGFLHW 428

Query: 485 GANCY-----EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 539
           G N Y      +   P  E         GD  + YPG      RQ + SLR   + +   
Sbjct: 429 GFNFYRTQFGRRPVDPWQETAAGGAFLSGDAFVVYPG----PGRQALPSLRHRALSAAWD 484

Query: 540 DIEYLNLYASRYGRDEGLALLEK 562
           D+      A  +G +    LL +
Sbjct: 485 DLALWEAVAEVHGDEATRRLLAE 507


>gi|357061367|ref|ZP_09122124.1| hypothetical protein HMPREF9332_01681 [Alloprevotella rava F0323]
 gi|355374440|gb|EHG21735.1| hypothetical protein HMPREF9332_01681 [Alloprevotella rava F0323]
          Length = 583

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 11/87 (12%)

Query: 476 EGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERIL 535
            G  GFL W    +  A  P  + RFR    PGD  + YPG          +S+R ER++
Sbjct: 466 NGFNGFLRWAWMNWTDA--PLQDTRFRM-FTPGDTYMVYPG--------GRSSIRFERVI 514

Query: 536 SGLQDIEYLNLYASRYGRDEGLALLEK 562
            G+Q  E + L    Y +      L++
Sbjct: 515 EGVQQAEKVRLLREEYTKTNNTIALQQ 541


>gi|375099947|ref|ZP_09746210.1| putative carbohydrate binding protein,alpha-galactosidase family
            protein,BNR/Asp-box repeat protein [Saccharomonospora
            cyanea NA-134]
 gi|374660679|gb|EHR60557.1| putative carbohydrate binding protein,alpha-galactosidase family
            protein,BNR/Asp-box repeat protein [Saccharomonospora
            cyanea NA-134]
          Length = 1534

 Score = 39.7 bits (91), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 40/91 (43%), Gaps = 9/91 (9%)

Query: 467  RAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPV 526
            RA+ W V +    G+L W  N + K        R+ +G    D  L YPG+       PV
Sbjct: 1056 RALGWVVEQRDLDGYLRWTYNSWPKDVYRDPSFRYGQG----DEYLVYPGD-----DGPV 1106

Query: 527  ASLRLERILSGLQDIEYLNLYASRYGRDEGL 557
            +S+R E    G  D E L+L  +  G D  +
Sbjct: 1107 SSVRWEVFRDGQDDAEVLDLARTELGEDSAV 1137


>gi|376295973|ref|YP_005167203.1| hypothetical protein DND132_1187 [Desulfovibrio desulfuricans
           ND132]
 gi|323458534|gb|EGB14399.1| hypothetical protein DND132_1187 [Desulfovibrio desulfuricans
           ND132]
          Length = 657

 Score = 39.7 bits (91), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 6/49 (12%)

Query: 504 GLPPGDGVLFYPG------EVFSSSRQPVASLRLERILSGLQDIEYLNL 546
           G   GDG+LFYPG      E   +   P+ASLRL+    GLQD EYL +
Sbjct: 545 GYGNGDGLLFYPGTDTVYPEESCNLLGPIASLRLKLWRRGLQDYEYLRM 593


>gi|229822494|ref|YP_002884020.1| hypothetical protein Bcav_4017 [Beutenbergia cavernae DSM 12333]
 gi|229568407|gb|ACQ82258.1| conserved hypothetical protein [Beutenbergia cavernae DSM 12333]
          Length = 587

 Score = 39.7 bits (91), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 65/154 (42%), Gaps = 15/154 (9%)

Query: 431 VTELQP--ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 488
              +QP  + G   W Y C+  +    N  + +   ++R +  +++     GFL+WG N 
Sbjct: 416 TNHVQPFLDAGMTPWVYYCVSQNRDVANRFIALPSVRNRVLGRQLFAFSAPGFLHWGFNF 475

Query: 489 Y-----EKATVPSAEIRFRRGLPPGDGVLFYP---GEVFSSSRQPVASLRLE--RILSGL 538
           +      +   P A+       P GD  + YP   GE + S R  V +  ++  R L  L
Sbjct: 476 WFAQHSTRPIDPHADTSAGGAFPSGDPFIVYPGPNGEAWPSLRHRVFAQAMDDHRALQLL 535

Query: 539 QDIEYLNLYASRYGRDEGLALLEKTGTYQGPERY 572
           + +   +   +R   DEG AL     TY  PE Y
Sbjct: 536 RTLT--DQATARSYVDEGGALAYDAFTYD-PEHY 566


>gi|365131949|ref|ZP_09342011.1| hypothetical protein HMPREF1032_03807 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363617740|gb|EHL69113.1| hypothetical protein HMPREF1032_03807 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 679

 Score = 39.3 bits (90), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 29/54 (53%)

Query: 439 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA 492
           G+E W Y C+ P   + N  +       R++MW V++ G  G+L+WG N +  A
Sbjct: 439 GDEVWFYTCLAPKGNYLNRFIDQPIWIGRSLMWLVYRYGVEGYLHWGWNAWHYA 492


>gi|325971590|ref|YP_004247781.1| beta-mannosidase [Sphaerochaeta globus str. Buddy]
 gi|324026828|gb|ADY13587.1| Beta-mannosidase [Sphaerochaeta globus str. Buddy]
          Length = 832

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 27/58 (46%), Gaps = 6/58 (10%)

Query: 423 REDLVKDIVTELQPENGEEWW-TYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGT 479
            E +V  ++ EL P     WW +    GPSD   NWH   RG  H    W VW EG +
Sbjct: 459 NEGVVGKVIKELDP--NRTWWPSSPSAGPSDFSDNWHNDKRGDMH---FWSVWHEGKS 511


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.137    0.434 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,309,310,819
Number of Sequences: 23463169
Number of extensions: 470010700
Number of successful extensions: 938653
Number of sequences better than 100.0: 201
Number of HSP's better than 100.0 without gapping: 47
Number of HSP's successfully gapped in prelim test: 154
Number of HSP's that attempted gapping in prelim test: 938228
Number of HSP's gapped (non-prelim): 367
length of query: 592
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 444
effective length of database: 8,886,646,355
effective search space: 3945670981620
effective search space used: 3945670981620
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 80 (35.4 bits)