BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007391
(605 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356518975|ref|XP_003528150.1| PREDICTED: uncharacterized protein LOC100782659 [Glycine max]
Length = 649
Score = 1029 bits (2660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/597 (80%), Positives = 537/597 (89%), Gaps = 2/597 (0%)
Query: 1 MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
MDNSGNPQD VVPPVEGVAGGGTAYGWND + + G I+PT IPT DLVHVWCMPS
Sbjct: 1 MDNSGNPQDVVVPPVEGVAGGGTAYGWNDGGTHGLN-VKGPIDPTGIPTRDLVHVWCMPS 59
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
TANVGPQ+MPR LEPINLLAARNERESVQIA+RPKVSWS SS AG VQ+QCSDLCS SGD
Sbjct: 60 TANVGPQDMPRHLEPINLLAARNERESVQIAIRPKVSWSGSSVAGTVQIQCSDLCSTSGD 119
Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
RL+VGQSL+LRRVVP+LGVPDALVP+DLPV QI+L PGETTA+W+SID P +QPPG YEG
Sbjct: 120 RLIVGQSLLLRRVVPILGVPDALVPVDLPVSQINLFPGETTALWISIDVPSSQPPGQYEG 179
Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
EI IT+ KAD E Q L K EKH+L+ +L+ CLD VEPI+GKPL EVVER KS T+LR
Sbjct: 180 EIAITAIKADAESPVQILSKVEKHQLYRDLKGCLDIVEPIDGKPLDEVVERVKSATTSLR 239
Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
R++ SP FSEFFSDNGP+D+MDEDAIS+LS+R+KL+LTVW+F+LP TPSLPAV GISDTV
Sbjct: 240 RILLSPSFSEFFSDNGPVDVMDEDAISSLSIRMKLNLTVWEFVLPETPSLPAVFGISDTV 299
Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
IEDRFGV+ G+ EWYEALDQHFKWLLQYRISP+FC+W + MRVLTYT PWPADHPKSDEY
Sbjct: 300 IEDRFGVQQGTAEWYEALDQHFKWLLQYRISPYFCKWADGMRVLTYTSPWPADHPKSDEY 359
Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
FSDPRLAAYAVPY V+S ND AKDY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SV
Sbjct: 360 FSDPRLAAYAVPYKQVVSGNDAAKDYLQKQVEILRTKTHWRKAYFYLWDEPLNLEQYDSV 419
Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
RNMASE+HAYAPDAR+LTTYYCGP+DAPL PTPFE+FVKVP FLRPH QIYCTSEWVLGN
Sbjct: 420 RNMASEIHAYAPDARILTTYYCGPNDAPLAPTPFEAFVKVPSFLRPHNQIYCTSEWVLGN 479
Query: 480 REDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFL 539
REDLVKDI+TELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFL
Sbjct: 480 REDLVKDIITELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFL 539
Query: 540 YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
YWGANCYEKATV SAEI+FR GLPPGDGVL+YPGEVFS+S QPVASLRLERIL+GLQ
Sbjct: 540 YWGANCYEKATVASAEIKFRHGLPPGDGVLYYPGEVFSTSHQPVASLRLERILNGLQ 596
>gi|449460114|ref|XP_004147791.1| PREDICTED: uncharacterized protein LOC101205217 [Cucumis sativus]
gi|449476778|ref|XP_004154831.1| PREDICTED: uncharacterized LOC101205217 [Cucumis sativus]
Length = 649
Score = 1014 bits (2621), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/597 (80%), Positives = 530/597 (88%), Gaps = 2/597 (0%)
Query: 1 MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
MDN+GNPQ +VPPVEGVAGGGTAYGWND +S SI+PTE+PTADLV VWCMPS
Sbjct: 1 MDNTGNPQGIIVPPVEGVAGGGTAYGWNDGTLHTSTLPKRSIDPTEVPTADLVDVWCMPS 60
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
TANVGPQEMPR LE INLLAARNERESVQIA+RPK+SW +SS AG+VQV DLCS SGD
Sbjct: 61 TANVGPQEMPRRLETINLLAARNERESVQIAMRPKISWGASSVAGIVQVFSGDLCSTSGD 120
Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
RLVVGQSL LRRVVP+LGVPDALVPLDLPV QI+L+PGETTAVWVSID P QPPG YEG
Sbjct: 121 RLVVGQSLRLRRVVPILGVPDALVPLDLPVSQINLLPGETTAVWVSIDVPNMQPPGQYEG 180
Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
EIIIT+ K D E S+Q LGK EKH ++ ELR+CLD +E ++ KPL EVV+R KS +L+
Sbjct: 181 EIIITAIKTDAESSTQYLGKAEKHEIYKELRSCLDIMEIVDEKPLEEVVKRVKSATASLK 240
Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
RVI SP FSEF+S+NG ID+MDEDA SNLSVRVK+ LTVWDF +PATPSLPAVIG+SDTV
Sbjct: 241 RVILSPSFSEFYSENGSIDVMDEDAFSNLSVRVKIMLTVWDFTIPATPSLPAVIGVSDTV 300
Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
IEDRFGV HG+DEW+EALD HFKWLLQYRISP+FCRWG+ MRVLTYTCPWPADHPKSDEY
Sbjct: 301 IEDRFGVEHGTDEWFEALDDHFKWLLQYRISPYFCRWGDGMRVLTYTCPWPADHPKSDEY 360
Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
FSDPRL+AYAVPY V + G KDY+++E+E+LRTK HWKKAYFYLWDEPLNMEH+ SV
Sbjct: 361 FSDPRLSAYAVPYRAVFGGDTG-KDYLQREVEILRTKTHWKKAYFYLWDEPLNMEHFDSV 419
Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
R+M+SE+ AYAPDARVLTTYYCGPSDAPL PT FE+FVKVP FLRPHTQIYCTSEWVLGN
Sbjct: 420 RSMSSEIRAYAPDARVLTTYYCGPSDAPLAPTTFEAFVKVPSFLRPHTQIYCTSEWVLGN 479
Query: 480 REDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFL 539
REDLVKDI+ ELQPENGEEWWTYVCMGP DPHPNWHLGMRG+QHRAVMWRVWKEGGTGFL
Sbjct: 480 REDLVKDIIAELQPENGEEWWTYVCMGPGDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFL 539
Query: 540 YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSS +PVAS+RLER+LSGLQ
Sbjct: 540 YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSHEPVASVRLERLLSGLQ 596
>gi|356509690|ref|XP_003523579.1| PREDICTED: uncharacterized protein LOC100799554 [Glycine max]
Length = 644
Score = 1008 bits (2606), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/597 (79%), Positives = 536/597 (89%), Gaps = 7/597 (1%)
Query: 1 MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
M +GNPQD VVPPVEGVAGGGTAYGWND + + G I+PTEIPT DLVHVWCMP+
Sbjct: 1 MQLAGNPQDVVVPPVEGVAGGGTAYGWNDGGTHGLN-VKGPIDPTEIPTKDLVHVWCMPN 59
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
TANVGPQ+MPR LEPINLLAARNERESVQIA+RPKVSW SS AG VQ+QCSDLCS SGD
Sbjct: 60 TANVGPQDMPRHLEPINLLAARNERESVQIAIRPKVSWGGSSVAGTVQIQCSDLCSTSGD 119
Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
RL+VGQSL+LRRVVP+LGVPDALVP+DLPV QI+L PGETTA+W+SID P +QPPG YEG
Sbjct: 120 RLIVGQSLLLRRVVPILGVPDALVPVDLPVSQINLFPGETTALWISIDVPSSQPPGQYEG 179
Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
EI+IT+ K+D ++S K EKH+L+ +L+ CLD VEPI+GKPL EVVER KST T+LR
Sbjct: 180 EIVITAIKSDADIS-----KVEKHQLYRDLKGCLDIVEPIDGKPLDEVVERVKSTTTSLR 234
Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
R++ SP FSEFFSDNGP+D+MDEDAIS+LS+R+KL+LTVW+F+LP TPSLPAV GISDTV
Sbjct: 235 RILLSPSFSEFFSDNGPVDVMDEDAISSLSLRMKLNLTVWEFVLPETPSLPAVFGISDTV 294
Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
IEDRFGV+ G+ EWYEALDQHFKWLLQYRISP+FC+W + MRVLTYT PWPADHPKSDEY
Sbjct: 295 IEDRFGVQQGTAEWYEALDQHFKWLLQYRISPYFCKWADGMRVLTYTSPWPADHPKSDEY 354
Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
FSDPRLAAYAVPY V+S N+ A+DY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SV
Sbjct: 355 FSDPRLAAYAVPYKQVVSGNNSAEDYLQKQVEILRTKNHWRKAYFYLWDEPLNLEQYDSV 414
Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
RNMASE+HAYAPDAR+LTTYYCGP+DAPL PTPF++FVKVP FLRPH QIYCTSEWVLGN
Sbjct: 415 RNMASEIHAYAPDARILTTYYCGPNDAPLAPTPFDAFVKVPSFLRPHNQIYCTSEWVLGN 474
Query: 480 REDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFL 539
+EDLVKDI+ ELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFL
Sbjct: 475 QEDLVKDIIAELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFL 534
Query: 540 YWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
YWGANCYEKATV SAEI+FR GLPPGDGVL+YPGEVFS+S QPVASLRLERIL+GLQ
Sbjct: 535 YWGANCYEKATVASAEIKFRHGLPPGDGVLYYPGEVFSTSHQPVASLRLERILNGLQ 591
>gi|224132110|ref|XP_002321258.1| predicted protein [Populus trichocarpa]
gi|222862031|gb|EEE99573.1| predicted protein [Populus trichocarpa]
Length = 652
Score = 1005 bits (2598), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/601 (80%), Positives = 536/601 (89%), Gaps = 7/601 (1%)
Query: 1 MDNSG-NPQDSVVPPVEGVAGGGTAYGWNDN--CSQSSGPLNGSINPTEIPTADLVHVWC 57
MDN+G NPQ VVPPVEGVAGGGTAYGWND S+ GSI+P+E+ T+DLVHVWC
Sbjct: 1 MDNTGANPQGIVVPPVEGVAGGGTAYGWNDGGGVHFSNSSPRGSIDPSEVLTSDLVHVWC 60
Query: 58 MPSTANVGPQEMP-RPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCS 116
+PSTANVGPQE+P R LEPINLLAARNERESVQIALRPK +W S +AGVVQVQCSDL S
Sbjct: 61 LPSTANVGPQEIPSRHLEPINLLAARNERESVQIALRPKATWGGSGSAGVVQVQCSDLTS 120
Query: 117 ASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPG 176
SGDRLVVGQS+ LRRVV +LGVPDALVPLDLPV QI+L PGETTA+WVSID P AQP G
Sbjct: 121 TSGDRLVVGQSITLRRVVSILGVPDALVPLDLPVSQINLAPGETTALWVSIDVPSAQPQG 180
Query: 177 LYEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTA 235
YEGE IT+ KA+ E SQ LGK ++H+L+ ELRNCLD +EP+EGKPL EVVERAKS
Sbjct: 181 QYEGEFFITAIKAEAESPSQRLGKADRHQLYSELRNCLDIMEPVEGKPLDEVVERAKSVT 240
Query: 236 TTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGI 295
T+LRRV+ SP+FSEF +DNGP+DMMDEDAISNL+VRVKL+LTVWDF+LPATPSLPAV GI
Sbjct: 241 TSLRRVLLSPVFSEFSTDNGPVDMMDEDAISNLTVRVKLNLTVWDFVLPATPSLPAVFGI 300
Query: 296 SDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPK 355
SDTVIEDRFGV HGSDEWYEALDQHFKWLL YRISP+FCRWG +MRVLTYTCPWPADHPK
Sbjct: 301 SDTVIEDRFGVEHGSDEWYEALDQHFKWLLHYRISPYFCRWGGNMRVLTYTCPWPADHPK 360
Query: 356 SDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEH 415
SDEYFSDPRLAAYAVPYS + A+DY++KEI++LRTK+HWKKAYFYLWDEPLN+E
Sbjct: 361 SDEYFSDPRLAAYAVPYSQAVPG--AAQDYLQKEIDILRTKSHWKKAYFYLWDEPLNLEQ 418
Query: 416 YSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEW 475
Y VR+MAS++H YAPDARVLTTYYCGPSDAPLGPTPFE+FVKVPKFLRPHTQIYCTSEW
Sbjct: 419 YDMVRSMASKIHTYAPDARVLTTYYCGPSDAPLGPTPFEAFVKVPKFLRPHTQIYCTSEW 478
Query: 476 VLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG 535
VLG+REDL K+IV+ELQPENGEEWWTYVC+GPSDPHPNWH+GMRG+QHRAV WRVWKEG
Sbjct: 479 VLGDREDLAKEIVSELQPENGEEWWTYVCLGPSDPHPNWHIGMRGTQHRAVFWRVWKEGA 538
Query: 536 TGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGL 595
TGFLYWGANCYEKATVPSAEI FRRGLPPGDGVL+YPGEVFSSS QPVAS+RLERILSGL
Sbjct: 539 TGFLYWGANCYEKATVPSAEISFRRGLPPGDGVLYYPGEVFSSSHQPVASVRLERILSGL 598
Query: 596 Q 596
Q
Sbjct: 599 Q 599
>gi|255538584|ref|XP_002510357.1| conserved hypothetical protein [Ricinus communis]
gi|223551058|gb|EEF52544.1| conserved hypothetical protein [Ricinus communis]
Length = 651
Score = 1002 bits (2590), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/595 (80%), Positives = 530/595 (89%), Gaps = 2/595 (0%)
Query: 3 NSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTA 62
++GNP+D + PPVEGVAGGGT+YGW D GSI+P+E+ TA+LVHVWCMPSTA
Sbjct: 5 SAGNPRDGI-PPVEGVAGGGTSYGWTDGGLHGLNLPKGSIDPSEVSTANLVHVWCMPSTA 63
Query: 63 NVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRL 122
NVGPQE+PR LEP+NLLAARNERESVQIA+RPKVSWSSS +AG V VQC+DL S SGDRL
Sbjct: 64 NVGPQEIPRHLEPVNLLAARNERESVQIAIRPKVSWSSSGSAGAVHVQCTDLSSTSGDRL 123
Query: 123 VVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEI 182
V GQS+ LR+VV +LGVPDALVPLD PV +ISL+PGETTA+WVSID P AQPPG YEG+
Sbjct: 124 VAGQSITLRKVVTILGVPDALVPLDHPVSRISLVPGETTAIWVSIDIPSAQPPGQYEGDF 183
Query: 183 IIT-SKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRV 241
IIT +K + E S C K EKHRL+MELRNCLD VEPIEGKPL+EVVER KS +T+LRRV
Sbjct: 184 IITATKTEAEYQSHCFNKAEKHRLYMELRNCLDIVEPIEGKPLNEVVERVKSASTSLRRV 243
Query: 242 IFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIE 301
+ SP FSEFFSDNG +DMMDEDAISNLSVRVKLSLTVWDFILP TPS PAV GISDTVIE
Sbjct: 244 LLSPSFSEFFSDNGSVDMMDEDAISNLSVRVKLSLTVWDFILPVTPSFPAVFGISDTVIE 303
Query: 302 DRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFS 361
DRFGV HG+DEWYEAL+QHFKWLLQYRISP+FCRWG SMRV YTCPWPADHPKSDEY S
Sbjct: 304 DRFGVEHGTDEWYEALEQHFKWLLQYRISPYFCRWGTSMRVFGYTCPWPADHPKSDEYLS 363
Query: 362 DPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRN 421
DPRLAAYAVPY+ +S ND KDY++KEIE+LRTK HWKKAYFYLWDEPLN+EHY S+RN
Sbjct: 364 DPRLAAYAVPYNRAVSGNDAGKDYLQKEIEMLRTKPHWKKAYFYLWDEPLNLEHYDSLRN 423
Query: 422 MASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNRE 481
MA E+H YAPDAR+LTTYYCGP+DAPL PTPFE+FVKVPKF+RPH QIYC SEWVLGNR+
Sbjct: 424 MAGEIHGYAPDARILTTYYCGPNDAPLAPTPFEAFVKVPKFMRPHIQIYCASEWVLGNRD 483
Query: 482 DLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYW 541
DLVKDI++ELQPENGEEWWTYVC+GPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFLYW
Sbjct: 484 DLVKDIISELQPENGEEWWTYVCLGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFLYW 543
Query: 542 GANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
GANCYEKATVPSAEIRFRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 544 GANCYEKATVPSAEIRFRRGLPPGDGVLYYPGEVFSSSHKPVASLRLERLLSGLQ 598
>gi|225458333|ref|XP_002283035.1| PREDICTED: uncharacterized protein LOC100243809 [Vitis vinifera]
gi|302142468|emb|CBI19671.3| unnamed protein product [Vitis vinifera]
Length = 638
Score = 974 bits (2519), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/587 (79%), Positives = 513/587 (87%), Gaps = 4/587 (0%)
Query: 11 VVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMP 70
VPPVEGVAGGGTAYGW+D S L GS +PTE+P+ADL+HVWCMPSTANVGPQEMP
Sbjct: 2 TVPPVEGVAGGGTAYGWSDGVVHPSNSLKGSTDPTEVPSADLLHVWCMPSTANVGPQEMP 61
Query: 71 RPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLML 130
RPLE + LLAARNERESVQIA+RPKVSW S G VQVQCSDLCS SGDRLVVG+SL L
Sbjct: 62 RPLEHVTLLAARNERESVQIAMRPKVSWGGS--GGAVQVQCSDLCSPSGDRLVVGESLKL 119
Query: 131 RRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIIT-SKAD 189
RRVV +LGVPDALVPLDLPV QISL+PGETTA+WVSID P QPPG YEGE+IIT +KAD
Sbjct: 120 RRVVSILGVPDALVPLDLPVSQISLLPGETTAIWVSIDVPSTQPPGQYEGELIITATKAD 179
Query: 190 TELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSE 249
E ++CLGK E+ +L+ EL+N L+ VEPI+GKPL EVVER KS TTLR + SP F E
Sbjct: 180 AESRAKCLGKAERRQLYSELKNFLEIVEPIDGKPLDEVVERVKSATTTLRSIFQSPSFCE 239
Query: 250 FFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHG 309
FFSD P+DMMDEDAIS+LSVR+KLSLTVW+F+LP TPSLPAV GISDTVIEDRFGV HG
Sbjct: 240 FFSDGHPVDMMDEDAISDLSVRMKLSLTVWNFVLPLTPSLPAVFGISDTVIEDRFGVEHG 299
Query: 310 SDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYA 369
+DEWYEALD HFKWLLQYRISP+FCRWG+ MRVLTYTCPWPA HPKSDEYFSDPRLAAYA
Sbjct: 300 TDEWYEALDHHFKWLLQYRISPYFCRWGDGMRVLTYTCPWPAHHPKSDEYFSDPRLAAYA 359
Query: 370 VPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAY 429
VPYS V+ KDY+++EIE L+TK HWKKAYFYLWDEPLN+EH+ ++RNMA E+ AY
Sbjct: 360 VPYSQVVPGG-AEKDYLQREIETLKTKTHWKKAYFYLWDEPLNLEHFDNIRNMACEVQAY 418
Query: 430 APDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVT 489
A DAR+LTTYY GPSDAPL FE+FVKVPKFLRPHTQIYCTSEWV GNREDLVKDI+
Sbjct: 419 ARDARILTTYYSGPSDAPLASNNFEAFVKVPKFLRPHTQIYCTSEWVFGNREDLVKDIIA 478
Query: 490 ELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA 549
ELQPENGEEWWTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFLYWGANCYEKA
Sbjct: 479 ELQPENGEEWWTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGANCYEKA 538
Query: 550 TVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
TVPSAE+ FRRGLPPGDGVLFYPGEV+S+S +PVAS+RLERILSGLQ
Sbjct: 539 TVPSAEVCFRRGLPPGDGVLFYPGEVYSTSHEPVASVRLERILSGLQ 585
>gi|297852256|ref|XP_002894009.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp.
lyrata]
gi|297339851|gb|EFH70268.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp.
lyrata]
Length = 643
Score = 961 bits (2485), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/596 (76%), Positives = 518/596 (86%), Gaps = 7/596 (1%)
Query: 1 MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
MDN+G + +V PVEGVAGGGTAYG+ND + PL S +P+E+PTADLV+VWCMP+
Sbjct: 1 MDNNGLQEMTV--PVEGVAGGGTAYGFND-----AEPLKQSTDPSEVPTADLVNVWCMPN 53
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
T NVG QE PRPLEPINLLAARNERES QIA+RPKVSW++SS +G VQVQCSDLCS++GD
Sbjct: 54 TVNVGSQETPRPLEPINLLAARNERESFQIAMRPKVSWAASSPSGSVQVQCSDLCSSAGD 113
Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
RLVVGQSL LRRVVP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P QPPG YEG
Sbjct: 114 RLVVGQSLNLRRVVPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPNRQPPGQYEG 173
Query: 181 EIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRR 240
EII+++ S LGK EK +L +EL NCLD +EPIEGKP+ EVVER K +++LRR
Sbjct: 174 EIIVSAMKTDGGGSAHLGKHEKDQLCVELNNCLDIMEPIEGKPMDEVVERIKCASSSLRR 233
Query: 241 VIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVI 300
++FSP FSEF S NG DMM+ED +SNLS+R+KL LTVW+FI+P TPSLP+VIG+SDTVI
Sbjct: 234 ILFSPSFSEFISTNGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPSVIGVSDTVI 293
Query: 301 EDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYF 360
EDRFGV GS+EWYE LD HFKWLLQYRISP+FC+WGE MRVLTYT PWPADHPKSDEY
Sbjct: 294 EDRFGVERGSEEWYEKLDLHFKWLLQYRISPYFCKWGEGMRVLTYTSPWPADHPKSDEYL 353
Query: 361 SDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVR 420
SDPRLAAYAVPY V++ +D + Y+RKE+E+LR+K HWKKAYFYLWDEPLNMEH+ SVR
Sbjct: 354 SDPRLAAYAVPYRQVIAGDDIRESYLRKEVEILRSKPHWKKAYFYLWDEPLNMEHFDSVR 413
Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
MASE++AYAPDARVLTTYYCGP DAPL PTPFESFVKVP LRPHTQIYCTSEWVLGNR
Sbjct: 414 KMASEIYAYAPDARVLTTYYCGPGDAPLAPTPFESFVKVPNLLRPHTQIYCTSEWVLGNR 473
Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
EDLVKDIV ELQ ENGEEWWTY+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTGFLY
Sbjct: 474 EDLVKDIVEELQTENGEEWWTYICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTGFLY 533
Query: 541 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
WGANCYEKATVPSAE++FRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 534 WGANCYEKATVPSAEVKFRRGLPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQ 589
>gi|42562571|ref|NP_175129.3| uncharacterized protein [Arabidopsis thaliana]
gi|30725314|gb|AAP37679.1| At1g45150 [Arabidopsis thaliana]
gi|110742869|dbj|BAE99332.1| hypothetical protein [Arabidopsis thaliana]
gi|332193963|gb|AEE32084.1| uncharacterized protein [Arabidopsis thaliana]
Length = 643
Score = 949 bits (2452), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/596 (75%), Positives = 515/596 (86%), Gaps = 7/596 (1%)
Query: 1 MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
MDN+ + + +V PVEGVAGGGTAYG+ND + PL S +P+E+PTADLV+VWCMP+
Sbjct: 1 MDNNVSQEMTV--PVEGVAGGGTAYGFND-----AEPLKQSTDPSEVPTADLVNVWCMPN 53
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
T NVG QE PR LEPINLLAARNERES QIA+RPKVSW++SS +G+VQVQCSDLCS++GD
Sbjct: 54 TVNVGSQETPRALEPINLLAARNERESFQIAMRPKVSWAASSPSGIVQVQCSDLCSSAGD 113
Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
RLVVGQSL LRRVVP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P QPPG YEG
Sbjct: 114 RLVVGQSLKLRRVVPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPTGQPPGQYEG 173
Query: 181 EIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRR 240
EIII++ S L K EK +L +EL CLD +EPIEGKP+ EVVER K +++LRR
Sbjct: 174 EIIISAMKTDGGGSSHLAKHEKDQLCVELNTCLDIMEPIEGKPMDEVVERIKCASSSLRR 233
Query: 241 VIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVI 300
++FSP FSEF S NG DMM+ED +SNLS+R+KL LTVW+FI+P TPSLPAVIG+SDTVI
Sbjct: 234 ILFSPSFSEFISTNGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPAVIGVSDTVI 293
Query: 301 EDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYF 360
EDRF V HGS++WY+ LD HFKWLLQYRISP+FC+WGESMRVLTYT PWPADHPKSDEY
Sbjct: 294 EDRFAVEHGSEDWYKKLDLHFKWLLQYRISPYFCKWGESMRVLTYTSPWPADHPKSDEYL 353
Query: 361 SDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVR 420
SD RLAAYAVPY V++ +D + Y+RKE+E+LR+K HW KAYFYLWDEPLNMEH+ +VR
Sbjct: 354 SDSRLAAYAVPYRQVIAGDDSRESYLRKEVEILRSKPHWNKAYFYLWDEPLNMEHFDNVR 413
Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
MASE++AYAPD+RVLTTYYCGP DAPL PTPFESFVKVP LRP+TQIYCTSEWVLGNR
Sbjct: 414 KMASEIYAYAPDSRVLTTYYCGPGDAPLAPTPFESFVKVPNLLRPYTQIYCTSEWVLGNR 473
Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
EDLVKDI+ ELQ ENGEEWWTY+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTGFLY
Sbjct: 474 EDLVKDILDELQTENGEEWWTYICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTGFLY 533
Query: 541 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
WGANCYEKATVPSAE++FRRGLPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 534 WGANCYEKATVPSAEVKFRRGLPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQ 589
>gi|7767671|gb|AAF69168.1|AC007915_20 F27F5.22 [Arabidopsis thaliana]
Length = 687
Score = 918 bits (2372), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/635 (69%), Positives = 507/635 (79%), Gaps = 57/635 (8%)
Query: 14 PVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPL 73
PVEGVAGGGTAYG+ND + PL S +P+E+PTADLV+VWCMP+T NVG QE PR L
Sbjct: 4 PVEGVAGGGTAYGFND-----AEPLKQSTDPSEVPTADLVNVWCMPNTVNVGSQETPRAL 58
Query: 74 EPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRV 133
EPINLLAARNERES QIA+RPKVSW++SS +G+VQVQCSDLCS++GDRLVVGQSL LRRV
Sbjct: 59 EPINLLAARNERESFQIAMRPKVSWAASSPSGIVQVQCSDLCSSAGDRLVVGQSLKLRRV 118
Query: 134 VPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELS 193
VP+LGVPDALVPLDLPV Q+SL PGET+ +WVSID P QPPG YEGEIII++
Sbjct: 119 VPVLGVPDALVPLDLPVSQLSLFPGETSVIWVSIDVPTGQPPGQYEGEIIISAMKTDGGG 178
Query: 194 SQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSD 253
S L K EK +L +EL CLD +EPIEGKP+ EVVER K +++LRR++FSP FSEF S
Sbjct: 179 SSHLAKHEKDQLCVELNTCLDIMEPIEGKPMDEVVERIKCASSSLRRILFSPSFSEFIST 238
Query: 254 NGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEW 313
NG DMM+ED +SNLS+R+KL LTVW+FI+P TPSLPAVIG+SDTVIEDRF V HGS++W
Sbjct: 239 NGSTDMMEEDVVSNLSLRIKLRLTVWEFIIPVTPSLPAVIGVSDTVIEDRFAVEHGSEDW 298
Query: 314 YEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWP--------------------ADH 353
Y+ LD HFKWLLQYRISP+FC+WGESMRVLTYT PWP ADH
Sbjct: 299 YKKLDLHFKWLLQYRISPYFCKWGESMRVLTYTSPWPANRFASRSELSICVPLFGFTADH 358
Query: 354 PKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNM 413
PKSDEY SD RLAAYAVPY V++ +D + Y+RKE+E+LR+K HW KAYFYLWDEPLNM
Sbjct: 359 PKSDEYLSDSRLAAYAVPYRQVIAGDDSRESYLRKEVEILRSKPHWNKAYFYLWDEPLNM 418
Query: 414 EHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCT- 472
EH+ +VR MASE++AYAPD+RVLTTYYCGP DAPL PTPFESFVKVP LRP+TQIYCT
Sbjct: 419 EHFDNVRKMASEIYAYAPDSRVLTTYYCGPGDAPLAPTPFESFVKVPNLLRPYTQIYCTS 478
Query: 473 -------------------------------SEWVLGNREDLVKDIVTELQPENGEEWWT 501
SEWVLGNREDLVKDI+ ELQ ENGEEWWT
Sbjct: 479 KYVFGLKFSLFRHSPTWIDMEAVLLNHGLIFSEWVLGNREDLVKDILDELQTENGEEWWT 538
Query: 502 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRG 561
Y+C+GPSDPHPNWHLGMRG+Q RAVMWRVWKEGGTGFLYWGANCYEKATVPSAE++FRRG
Sbjct: 539 YICLGPSDPHPNWHLGMRGTQQRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEVKFRRG 598
Query: 562 LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
LPPGDGVL+YPGEVFSSS +PVASLRLER+LSGLQ
Sbjct: 599 LPPGDGVLYYPGEVFSSSSEPVASLRLERLLSGLQ 633
>gi|222637345|gb|EEE67477.1| hypothetical protein OsJ_24889 [Oryza sativa Japonica Group]
Length = 709
Score = 889 bits (2296), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/590 (70%), Positives = 496/590 (84%), Gaps = 1/590 (0%)
Query: 8 QDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQ 67
Q+S VPPVEGVAGGGT+YGW D Q+S NG+I+PT+I +ADL+HVW MPSTANV Q
Sbjct: 16 QNSSVPPVEGVAGGGTSYGWVDGGLQASSLGNGAIDPTKIHSADLLHVWSMPSTANVSQQ 75
Query: 68 EMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQS 127
E PRPLE +NLLAARNERES QIALRPKVSW++S AG VQVQC+DLCS++GDRLVVGQS
Sbjct: 76 EAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSSAGDRLVVGQS 135
Query: 128 LMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS- 186
+ LRRVVPMLGVPDALVP+D QI+L+PGET+A+WVS++ P Q PGLYEGEI I++
Sbjct: 136 VTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGLYEGEIFISAV 195
Query: 187 KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPL 246
+A+ E + L K E+++L+ ELRNC+D EP + E+V+R S +TTLRR++ P
Sbjct: 196 RAEAESRGESLTKSERYQLYKELRNCIDITEPRDYSSSEEMVQRLTSASTTLRRMLALPS 255
Query: 247 FSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV 306
F + +NG DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS+TVIEDRF +
Sbjct: 256 FQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGISETVIEDRFCL 315
Query: 307 RHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLA 366
HG+ WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+ EY+SDPRLA
Sbjct: 316 EHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKAKEYYSDPRLA 375
Query: 367 AYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASEL 426
AYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y + ++++EL
Sbjct: 376 AYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQYDVICSISNEL 435
Query: 427 HAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKD 486
+YA D R+LTTYYCGPS + L P+ FE+FVKVP LRPHTQI+CTSEWVLG REDLVKD
Sbjct: 436 RSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWVLGTREDLVKD 495
Query: 487 IVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY 546
IV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG NCY
Sbjct: 496 IVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGTNCY 555
Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
EKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 556 EKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605
>gi|115473013|ref|NP_001060105.1| Os07g0581300 [Oryza sativa Japonica Group]
gi|33146840|dbj|BAC79829.1| unknown protein [Oryza sativa Japonica Group]
gi|50509223|dbj|BAD30493.1| unknown protein [Oryza sativa Japonica Group]
gi|113611641|dbj|BAF22019.1| Os07g0581300 [Oryza sativa Japonica Group]
gi|215737152|dbj|BAG96081.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 658
Score = 887 bits (2292), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/590 (70%), Positives = 496/590 (84%), Gaps = 1/590 (0%)
Query: 8 QDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQ 67
Q+S VPPVEGVAGGGT+YGW D Q+S NG+I+PT+I +ADL+HVW MPSTANV Q
Sbjct: 16 QNSSVPPVEGVAGGGTSYGWVDGGLQASSLGNGAIDPTKIHSADLLHVWSMPSTANVSQQ 75
Query: 68 EMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQS 127
E PRPLE +NLLAARNERES QIALRPKVSW++S AG VQVQC+DLCS++GDRLVVGQS
Sbjct: 76 EAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSSAGDRLVVGQS 135
Query: 128 LMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS- 186
+ LRRVVPMLGVPDALVP+D QI+L+PGET+A+WVS++ P Q PGLYEGEI I++
Sbjct: 136 VTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGLYEGEIFISAV 195
Query: 187 KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPL 246
+A+ E + L K E+++L+ ELRNC+D EP + E+V+R S +TTLRR++ P
Sbjct: 196 RAEAESRGESLTKSERYQLYKELRNCIDITEPRDYSSSEEMVQRLTSASTTLRRMLALPS 255
Query: 247 FSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV 306
F + +NG DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS+TVIEDRF +
Sbjct: 256 FQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGISETVIEDRFCL 315
Query: 307 RHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLA 366
HG+ WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+ EY+SDPRLA
Sbjct: 316 EHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKAKEYYSDPRLA 375
Query: 367 AYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASEL 426
AYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y + ++++EL
Sbjct: 376 AYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQYDVICSISNEL 435
Query: 427 HAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKD 486
+YA D R+LTTYYCGPS + L P+ FE+FVKVP LRPHTQI+CTSEWVLG REDLVKD
Sbjct: 436 RSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWVLGTREDLVKD 495
Query: 487 IVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY 546
IV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG NCY
Sbjct: 496 IVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGTNCY 555
Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
EKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 556 EKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605
>gi|218199904|gb|EEC82331.1| hypothetical protein OsI_26624 [Oryza sativa Indica Group]
Length = 709
Score = 886 bits (2290), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/590 (69%), Positives = 496/590 (84%), Gaps = 1/590 (0%)
Query: 8 QDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQ 67
Q+S VPPVEGVAGGGT+YGW D Q+S NG+I+PT+I +ADL+HVW MPSTANV Q
Sbjct: 16 QNSSVPPVEGVAGGGTSYGWVDGGLQASSLGNGAIDPTKIHSADLLHVWSMPSTANVSQQ 75
Query: 68 EMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQS 127
E PRPLE +NLLAARNERES QIALRPKVSW++S AG VQVQC+DLCS++GDRLVVGQS
Sbjct: 76 EAPRPLEHVNLLAARNERESFQIALRPKVSWATSGIAGSVQVQCTDLCSSAGDRLVVGQS 135
Query: 128 LMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS- 186
+ LRRVVPMLGVPDALVP+D QI+L+PGET+A+WVS++ P Q PGLYEGEI +++
Sbjct: 136 VTLRRVVPMLGVPDALVPIDPLNSQINLLPGETSAIWVSLNVPCGQQPGLYEGEIFLSAV 195
Query: 187 KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPL 246
+A++E + L K E+++L+ ELRNC+D E + E+V+R S +TTLRR++ P
Sbjct: 196 RAESESRGESLTKSERYQLYKELRNCIDITETRDYSSSEEMVQRLTSASTTLRRMLALPS 255
Query: 247 FSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV 306
F + +NG DMMDED ++N++VR+KLSLTVWDF LP TPSLPAV GIS+TVIEDRF +
Sbjct: 256 FQDCQENNGLGDMMDEDIMNNVAVRLKLSLTVWDFTLPLTPSLPAVFGISETVIEDRFCL 315
Query: 307 RHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLA 366
HG+ WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+ EY+SDPRLA
Sbjct: 316 EHGTKGWYDALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKAKEYYSDPRLA 375
Query: 367 AYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASEL 426
AYAVPY+P+LSS D AK+ +R+E+E+L+++AHW K+YFYLWDEPLNME Y + ++++EL
Sbjct: 376 AYAVPYAPILSSTDAAKNSLRREVEILKSEAHWSKSYFYLWDEPLNMEQYDVICSISNEL 435
Query: 427 HAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKD 486
+YA D R+LTTYYCGPS + L P+ FE+FVKVP LRPHTQI+CTSEWVLG REDLVKD
Sbjct: 436 RSYASDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWVLGTREDLVKD 495
Query: 487 IVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY 546
IV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG NCY
Sbjct: 496 IVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGTNCY 555
Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
EKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 556 EKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 605
>gi|357122237|ref|XP_003562822.1| PREDICTED: uncharacterized protein LOC100840095 [Brachypodium
distachyon]
Length = 657
Score = 875 bits (2261), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/594 (69%), Positives = 491/594 (82%), Gaps = 3/594 (0%)
Query: 5 GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
G Q+ VPPVEGVAGGGT+YGW D Q S I+P ++ + DL+HVW MPSTANV
Sbjct: 12 GKTQEISVPPVEGVAGGGTSYGWVDGGLQGSSLGTSVIDPAKVHSTDLLHVWSMPSTANV 71
Query: 65 GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVV 124
QE PRPLE +NLLAARNERES QIALRPKVSW SS AG VQ+QC+DLCS+SGDRLVV
Sbjct: 72 SQQEAPRPLEHVNLLAARNERESFQIALRPKVSWISSGIAGPVQIQCTDLCSSSGDRLVV 131
Query: 125 GQSLMLRRVVPMLGVPDALVPLDLPVC-QISLIPGETTAVWVSIDAPYAQPPGLYEGEII 183
GQS+ LRRVVPMLGVPDALVP+D P+C QI+L+PGET+A+WVS++ P Q PGLYEGEI
Sbjct: 132 GQSVTLRRVVPMLGVPDALVPID-PLCPQINLLPGETSAIWVSLNVPCGQQPGLYEGEIF 190
Query: 184 IT-SKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVI 242
IT ++A+T+ ++ L K E+++L+ ELR CLD E + E+V+R ST+TTL+R++
Sbjct: 191 ITATRAETDSRAESLPKSERYQLYRELRTCLDITESRDCSTPEEMVQRLTSTSTTLKRML 250
Query: 243 FSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIED 302
P F + +NG DMMDED ++N++VRVKLSLTVWDF LP TPSLPAV GIS+TVIED
Sbjct: 251 VLPAFQDCQENNGLGDMMDEDVMNNVAVRVKLSLTVWDFTLPLTPSLPAVFGISETVIED 310
Query: 303 RFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSD 362
RF + HG+ WY+ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK+ EY+SD
Sbjct: 311 RFCLEHGTKGWYDALDDHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKAKEYYSD 370
Query: 363 PRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNM 422
PRLAAYAVPY+P+LS D A++ +R+E+++L+T+AHW KAYFYLWDEPLNME Y +RN+
Sbjct: 371 PRLAAYAVPYAPILSCTDAARNSLRREVDILKTEAHWSKAYFYLWDEPLNMEQYEVIRNI 430
Query: 423 ASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNRED 482
++EL Y PD R+LTTYY GPS + L P+ FE+F KVP LRPHTQI+CTSEWVLG RED
Sbjct: 431 SNELRTYTPDVRILTTYYAGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEWVLGTRED 490
Query: 483 LVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 542
LVKDI+ EL+PE GEEWWTYVC+GP+DP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG
Sbjct: 491 LVKDIIAELRPELGEEWWTYVCLGPTDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWG 550
Query: 543 ANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVASLRLERILSG+Q
Sbjct: 551 TNCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASLRLERILSGMQ 604
>gi|293331693|ref|NP_001169555.1| uncharacterized protein LOC100383434 [Zea mays]
gi|224030081|gb|ACN34116.1| unknown [Zea mays]
Length = 657
Score = 872 bits (2253), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/593 (69%), Positives = 488/593 (82%), Gaps = 1/593 (0%)
Query: 5 GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
G Q+ VPPVEGVAGGGT+YGW D + + G I+PT++ + DL+HVW MPSTANV
Sbjct: 12 GKTQNVSVPPVEGVAGGGTSYGWVDGGLRGTNIGAGVIDPTKVHSDDLLHVWSMPSTANV 71
Query: 65 GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVV 124
QE PRPLE +NLLAARNERES QIALRPKVSW++S AG VQ+QC+DLCS+SGDRLVV
Sbjct: 72 SQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSSSGDRLVV 131
Query: 125 GQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIII 184
GQS+ LRRVVP+LGVPDALVP+D QIS+ PGET AVWVS++ P QPPGLYEGEI I
Sbjct: 132 GQSITLRRVVPILGVPDALVPIDPLSPQISIQPGETAAVWVSVNVPCGQPPGLYEGEIFI 191
Query: 185 TS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIF 243
T+ K + + ++ L K EK RL+ ELR+CLD P + E+V+R S +T LRRV+
Sbjct: 192 TAVKTELDSRTESLPKSEKCRLYRELRSCLDLTGPRDYSSPEEMVQRLTSASTVLRRVLD 251
Query: 244 SPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDR 303
+P + +NG DMMDED I+N+SVR+KLSLTVWDF LP TPSLPAV GIS+TVIEDR
Sbjct: 252 NPALQDCQENNGFGDMMDEDVINNISVRLKLSLTVWDFTLPVTPSLPAVFGISETVIEDR 311
Query: 304 FGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDP 363
F + HG++ WY ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK++EY+SDP
Sbjct: 312 FCLEHGTEGWYSALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKANEYYSDP 371
Query: 364 RLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMA 423
RLAAYAVPY+P+LS D AK+ +R+E+E+L++K HW KAYFYLWDEPLN+E Y + N++
Sbjct: 372 RLAAYAVPYAPILSCTDAAKNSLRREVEILKSKPHWSKAYFYLWDEPLNVEQYDMICNIS 431
Query: 424 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 483
+EL +YAPD R+LTTYYCGPS + L P+ FE+F KVP LRPHTQI+CTSEWVLG REDL
Sbjct: 432 NELRSYAPDVRILTTYYCGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEWVLGTREDL 491
Query: 484 VKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGA 543
VKDIV EL+P+ GEEWWTYVCMGPSDP PNWHLGMRG+QHRAVMWRVWKEGGTGFLYWG+
Sbjct: 492 VKDIVAELRPDLGEEWWTYVCMGPSDPQPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGS 551
Query: 544 NCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 552 NCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 604
>gi|242046098|ref|XP_002460920.1| hypothetical protein SORBIDRAFT_02g037540 [Sorghum bicolor]
gi|241924297|gb|EER97441.1| hypothetical protein SORBIDRAFT_02g037540 [Sorghum bicolor]
Length = 657
Score = 870 bits (2249), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/593 (68%), Positives = 490/593 (82%), Gaps = 1/593 (0%)
Query: 5 GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
G Q+ VPPVEGVAGGGT+YGW D + + G I+PT++ + DL+HVW MPSTANV
Sbjct: 12 GKTQNVSVPPVEGVAGGGTSYGWVDGGLRGTNLGAGVIDPTKVHSEDLLHVWSMPSTANV 71
Query: 65 GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVV 124
QE+PRPLE +NLLAARNERES QIALRPKVSW++S AG VQ+QC+DLCS+SGDRLVV
Sbjct: 72 SQQEVPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSSSGDRLVV 131
Query: 125 GQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIII 184
GQS+ LRRVVP+LGVPDALVP+D Q++L PGET AVWVS++ P QPPGLYEGEI I
Sbjct: 132 GQSITLRRVVPILGVPDALVPIDPLSPQVTLQPGETAAVWVSLNVPCGQPPGLYEGEIFI 191
Query: 185 TS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIF 243
T+ K + + ++ L K EK+RL+ ELR+CLD P + E+V+R S ++ LRRV+
Sbjct: 192 TAVKTELDSRTESLPKSEKYRLYRELRSCLDLTGPRDYSSPEEMVQRLTSASSALRRVLD 251
Query: 244 SPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDR 303
+P + +NG DMMDED ++N+SVR+KLSLTVWDF LP TPSLPAV GIS+TVIEDR
Sbjct: 252 NPALQDCQENNGFGDMMDEDVMNNVSVRLKLSLTVWDFTLPVTPSLPAVFGISETVIEDR 311
Query: 304 FGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDP 363
F + HG++ WY ALD HF+WLLQYRISPFFCRWG+SMR+L YTCPWPADHPK++EY+SDP
Sbjct: 312 FCLEHGTEGWYSALDHHFRWLLQYRISPFFCRWGDSMRILAYTCPWPADHPKANEYYSDP 371
Query: 364 RLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMA 423
RLAAYAVPY+P+LS D AK+ +R+E+E+L++K HW KAYFYLWDEPLN+E Y + N++
Sbjct: 372 RLAAYAVPYAPILSCTDAAKNSLRREVEILKSKPHWSKAYFYLWDEPLNVEQYDMICNIS 431
Query: 424 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 483
+EL +YAPD R+LTTYYCGPS + L P+ FE+FVKVP LRPHTQI+CTSEWVLG REDL
Sbjct: 432 NELRSYAPDVRILTTYYCGPSGSELAPSTFEAFVKVPNVLRPHTQIFCTSEWVLGTREDL 491
Query: 484 VKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGA 543
VKDI+ EL+P+ GEEWWTYVCMGPSDP PNWH+GMRG+QHRAVMWRVWKEGGTGFLYWG
Sbjct: 492 VKDIIAELRPDLGEEWWTYVCMGPSDPQPNWHIGMRGTQHRAVMWRVWKEGGTGFLYWGT 551
Query: 544 NCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
NCYEKA +PSAEI FRRGLPPGDGVLFYPGEVFSSS +PVAS RLERILSG+Q
Sbjct: 552 NCYEKAMIPSAEICFRRGLPPGDGVLFYPGEVFSSSHEPVASTRLERILSGMQ 604
>gi|168042677|ref|XP_001773814.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674929|gb|EDQ61431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 651
Score = 745 bits (1924), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/596 (61%), Positives = 458/596 (76%), Gaps = 15/596 (2%)
Query: 13 PPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRP 72
PP+EGV GGGT YGWND + L I+ + PT+DLVHVWCMPSTA +G QE PRP
Sbjct: 4 PPIEGVGGGGTGYGWNDGSHTGTTILASEIDVSRQPTSDLVHVWCMPSTAIIGHQEPPRP 63
Query: 73 LEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRR 132
LE ++LLAARNERES QIALRPK+SW+S G +Q+ CSD CS SGDRL G+ + +RR
Sbjct: 64 LERVSLLAARNERESAQIALRPKMSWTSGDMVGYLQIHCSDFCSPSGDRLNAGKEVTIRR 123
Query: 133 VVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS-KADTE 191
VVP+LGVPDALVP+DLP +I L+PGET A+WVS D P QPPG+Y GEI IT+ + +TE
Sbjct: 124 VVPILGVPDALVPIDLP-SRIGLLPGETCALWVSFDVPVTQPPGVYIGEIWITAVRGETE 182
Query: 192 LSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFF 251
+++ + + EK ++ +L+ L E + + E +S L +V+ SPL S
Sbjct: 183 FAAEKV-ESEKLQMKKDLQGFLAQAEAASNESAEVLTEALRSICEGLHQVLQSPLLSAGC 241
Query: 252 SDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSD 311
D G +++ DE+ ++ SV+V+ S+TVWDF+LP TPSLPAV GIS+TVIEDR+ ++HGS
Sbjct: 242 EDFGKMEI-DEEFQASPSVQVQFSITVWDFVLPITPSLPAVFGISETVIEDRYNLKHGSK 300
Query: 312 EWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVP 371
EW+++L+ HF WLLQYR+SP+FCRWG++MRVLTYTCP+PA HPKS++Y+SDPRLAAYAVP
Sbjct: 301 EWFKSLNMHFDWLLQYRLSPYFCRWGDNMRVLTYTCPYPATHPKSEDYYSDPRLAAYAVP 360
Query: 372 YSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDE----------PLNMEHYSSVRN 421
Y PVLSS+D AKD V+ E+E+L+TK HWKKAYFYLWDE P+ E Y +R+
Sbjct: 361 YIPVLSSSDTAKDVVKSELEILKTKPHWKKAYFYLWDEARISTRSQHGPVGFEQYEVIRS 420
Query: 422 MASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNRE 481
+A E+ APDAR+LTTYYCGPSD + FESF+KVP FLRPHTQI+CTSEWVLG RE
Sbjct: 421 IAEEIRNTAPDARILTTYYCGPSDPSMKLDGFESFLKVPTFLRPHTQIFCTSEWVLGGRE 480
Query: 482 DLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYW 541
DLVK I E+Q + EEWWTYVCMGP + HPNWHLGMRG+QHRAV+WRVWKEGGTGFLYW
Sbjct: 481 DLVKQITDEIQFDRSEEWWTYVCMGPGELHPNWHLGMRGTQHRAVIWRVWKEGGTGFLYW 540
Query: 542 GANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVASLRLERILSGLQ 596
G NCYEKA+ PSAEIRFRRGLPPGDGVLFYPGEVF+ + PVAS+RLER+LSG+Q
Sbjct: 541 GVNCYEKASSPSAEIRFRRGLPPGDGVLFYPGEVFNIGATLPVASVRLERLLSGMQ 596
>gi|302755140|ref|XP_002960994.1| hypothetical protein SELMODRAFT_74245 [Selaginella moellendorffii]
gi|300171933|gb|EFJ38533.1| hypothetical protein SELMODRAFT_74245 [Selaginella moellendorffii]
Length = 633
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/591 (60%), Positives = 451/591 (76%), Gaps = 20/591 (3%)
Query: 9 DSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQE 68
D PPVEG++GGGT YGW D S P GS++ + P +DL VWCMPSTA VG QE
Sbjct: 5 DLGAPPVEGLSGGGTGYGWGDCGIAVSRP--GSVDIAKNPASDLFSVWCMPSTATVGHQE 62
Query: 69 MPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQ-S 127
PR L+ +NLL ARNERES QIALRPK+SW+ G VQV C D SASGDR + S
Sbjct: 63 PPRALDQLNLLIARNERESAQIALRPKISWACGGAVGHVQVHCRDFVSASGDRWAIELLS 122
Query: 128 LMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS- 186
+ LRRVVP+LGVPDALVP+ +P CQ+SL+PGET+A+W+S+ P +Q PG+YEGE+ ++
Sbjct: 123 VSLRRVVPILGVPDALVPVSMPTCQVSLLPGETSALWLSVHVPSSQTPGVYEGEMTFSAV 182
Query: 187 KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPL 246
KAD E S + +G+K +ELR ++NV +E + L+ ++ P
Sbjct: 183 KADAEFS---VDEGDK----LELRKMVENVAAKMDDTRQNPMELLEEVRQDLQHLLDHPA 235
Query: 247 FSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV 306
+ NG +++ +E +LS+++K+S+TVWDF+LP TP+LPAV G+S+TVIEDRF V
Sbjct: 236 LAH----NGKMEIDEE----SLSLKLKISITVWDFVLPVTPTLPAVFGVSETVIEDRFNV 287
Query: 307 RHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLA 366
HGS WY ALD+H++WLLQ+RISP+FCRWG++MR+L YTCPWPADH K++EY+SDPRLA
Sbjct: 288 EHGSSGWYNALDRHYQWLLQFRISPYFCRWGDNMRILAYTCPWPADHVKAEEYYSDPRLA 347
Query: 367 AYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASEL 426
AYAVPY+PVLS+++ KD V +EIE+L TK HW+K+YFYLWDEPL+ + Y +R M+ E+
Sbjct: 348 AYAVPYAPVLSNSNAVKDLVTREIEILSTKEHWRKSYFYLWDEPLSSDQYDFIRTMSEEI 407
Query: 427 HAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKD 486
+ AP++R+LTTYY GPSD P FE+F+KVP FLRPHTQI+CTSEWVLG REDLVK+
Sbjct: 408 RSIAPNSRILTTYYSGPSDVQYPPGSFEAFIKVPSFLRPHTQIFCTSEWVLGGREDLVKE 467
Query: 487 IVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY 546
IV ELQP+ EEWWTYVCMGPSDPHPNWHLGMRG+QHR V+WR WKEGG+GFLYWG NCY
Sbjct: 468 IVAELQPDQREEWWTYVCMGPSDPHPNWHLGMRGTQHRGVLWRAWKEGGSGFLYWGTNCY 527
Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVASLRLERILSGLQ 596
EK+ P+AEIRFRRGLPPGDGVLFYPGEVF+ S +PV+S+RLER+LSGLQ
Sbjct: 528 EKSLCPAAEIRFRRGLPPGDGVLFYPGEVFTPGSSEPVSSVRLERVLSGLQ 578
>gi|302767188|ref|XP_002967014.1| hypothetical protein SELMODRAFT_144564 [Selaginella moellendorffii]
gi|300165005|gb|EFJ31613.1| hypothetical protein SELMODRAFT_144564 [Selaginella moellendorffii]
Length = 582
Score = 697 bits (1800), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/542 (61%), Positives = 418/542 (77%), Gaps = 18/542 (3%)
Query: 58 MPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSA 117
MPSTA VG QE PR L+ +NLL ARNERES QIALRPK+SW+ G VQV C D S
Sbjct: 1 MPSTATVGHQEPPRALDQLNLLIARNERESAQIALRPKISWACGGAVGHVQVHCRDFVSV 60
Query: 118 SGDRLVVGQ-SLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPG 176
SGDR + S+ LRRVVP+LGVPDALVP+ +P CQ+SL+PGET+A+W+S+ P +Q PG
Sbjct: 61 SGDRWAIELLSVSLRRVVPILGVPDALVPVSMPTCQVSLLPGETSALWLSVHVPSSQTPG 120
Query: 177 LYEGEIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTA 235
+YEGE+ ++ KAD E + +GEK +ELR ++ V +E +
Sbjct: 121 VYEGEMTFSAAKADAEF---FVDEGEK----LELRKMVETVAAKMDDTRQNPMELLEEVR 173
Query: 236 TTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGI 295
LR ++ P + NG +++ +E +LS+++K+S+TVWDF+LP TP+LPAV G+
Sbjct: 174 QDLRHLLDHPALAH----NGKMEIDEE----SLSLKLKISITVWDFVLPVTPTLPAVFGV 225
Query: 296 SDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPK 355
S+TVIEDRF V HGS +WY ALD+H++WLLQ+RISP+FCRWG++MR+L YTCPWPADH K
Sbjct: 226 SETVIEDRFNVEHGSSDWYNALDRHYQWLLQFRISPYFCRWGDNMRILAYTCPWPADHVK 285
Query: 356 SDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEH 415
++EY+SDPRLAAYAVPY+PVLS+++ KD V +EIE+L TK HW+K+YFYLWDEPL+ +
Sbjct: 286 AEEYYSDPRLAAYAVPYAPVLSNSNAVKDLVTREIEILSTKEHWRKSYFYLWDEPLSSDQ 345
Query: 416 YSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEW 475
Y +R M+ E+ + AP+ R+LTTYY GPSD P FE+F+KVP FLRPHTQI+CTSEW
Sbjct: 346 YDFIRTMSEEIRSIAPNTRILTTYYSGPSDVQYPPGSFEAFIKVPSFLRPHTQIFCTSEW 405
Query: 476 VLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGG 535
VLG REDLVK+IV ELQP+ EEWWTYVCMGPSDPHPNWHLGMRG+Q R V+WRVWKEGG
Sbjct: 406 VLGGREDLVKEIVAELQPDQREEWWTYVCMGPSDPHPNWHLGMRGTQQRGVLWRVWKEGG 465
Query: 536 TGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFS-SSRQPVASLRLERILSG 594
+GFLYWG NCYEK+ P+AEIRFRRGLPPGDGVLFYPGEVF+ S +PV+S+RLER+LSG
Sbjct: 466 SGFLYWGTNCYEKSLCPAAEIRFRRGLPPGDGVLFYPGEVFTPGSSEPVSSVRLERVLSG 525
Query: 595 LQ 596
LQ
Sbjct: 526 LQ 527
>gi|414590657|tpg|DAA41228.1| TPA: hypothetical protein ZEAMMB73_917393 [Zea mays]
Length = 721
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/682 (54%), Positives = 444/682 (65%), Gaps = 115/682 (16%)
Query: 5 GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
G Q+ VPPVEGVAGGGT+YGW D + + G I+PT++ + DL+HVW MPSTANV
Sbjct: 12 GKTQNVSVPPVEGVAGGGTSYGWVDGGLRGTNIGAGVIDPTKVHSDDLLHVWSMPSTANV 71
Query: 65 GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVV 124
QE PRPLE +NLLAARNERES QIALRPKVSW++S AG VQ+QC+DLCS+SGDR
Sbjct: 72 SQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVQIQCTDLCSSSGDR--- 128
Query: 125 GQSLMLRRVVPML-----GVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYE 179
+ +PM+ GVPDALVP+D QIS+ PGET AVWVS++ P QPPGLYE
Sbjct: 129 -EDQHSHSSIPMVINVVPGVPDALVPIDPLSPQISIQPGETAAVWVSVNVPCGQPPGLYE 187
Query: 180 GEIIITS-KADTELSS-------------------------QCLGKGEKHRLFMELRNCL 213
GEI IT+ K + E+ S + L K EK RL+ ELR+CL
Sbjct: 188 GEIFITAVKTELEILSNLVTLALISGLYFFADLISGSSSRTESLPKSEKCRLYRELRSCL 247
Query: 214 DNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVK 273
D P + E+V+R S +T LRRV+ +P + +NG DMMDED I+N+SVR+K
Sbjct: 248 DLTGPRDYSSPEEMVQRLTSASTVLRRVLDNPALQDCQENNGFGDMMDEDVINNISVRLK 307
Query: 274 LSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFF 333
LSLTVWDF LP TPSLPAV G+S W Q+ L IS
Sbjct: 308 LSLTVWDFTLPVTPSLPAVFGVS----------------W-----QYSFLLCSISISVLC 346
Query: 334 CRWG---ESMRVLTYTCP---------------------WPADHPKSDEYFSDPRLAAYA 369
+G +R++ P DHPK++EY+SDPRLAAYA
Sbjct: 347 NCYGTGCNGLRLMGLLGPAELAATEGSVGIAAALAKPAAGTPDHPKANEYYSDPRLAAYA 406
Query: 370 VPYSPVLS------------SNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYS 417
VPY+P+LS S D AK+ +R+E+E+L++K HW KAYFYLWDEPLN+E Y
Sbjct: 407 VPYAPILSCLLLYLIWLLVNSTDAAKNSLRREVEILKSKPHWSKAYFYLWDEPLNVEQYD 466
Query: 418 SVRNMASELHAYAPDARVLTTYYCG-----------------------PSDAPLGPTPFE 454
+ N+++EL +YAPD R+LTTYYCG PS + L P+ FE
Sbjct: 467 MICNISNELRSYAPDVRILTTYYCGATCADLEHPVGVPGCPLSSRAAGPSGSELAPSTFE 526
Query: 455 SFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNW 514
+F KVP LRPHTQI+CTSEWVLG REDLVKDIV EL+P+ GEEWWTYVCMGPSDP PNW
Sbjct: 527 AFAKVPNVLRPHTQIFCTSEWVLGTREDLVKDIVAELRPDLGEEWWTYVCMGPSDPQPNW 586
Query: 515 HLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGE 574
HLGMRG+QHRAVMWRVWKEGGTGFLYWG+NCYEKA +PSAEI FRRGLPPGDGVLFYPGE
Sbjct: 587 HLGMRGTQHRAVMWRVWKEGGTGFLYWGSNCYEKAMIPSAEICFRRGLPPGDGVLFYPGE 646
Query: 575 VFSSSRQPVASLRLERILSGLQ 596
VFSSS +PVAS RLERILSG+Q
Sbjct: 647 VFSSSHEPVASTRLERILSGMQ 668
>gi|388508256|gb|AFK42194.1| unknown [Medicago truncatula]
Length = 388
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 287/337 (85%), Positives = 318/337 (94%), Gaps = 1/337 (0%)
Query: 260 MDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQ 319
M+EDAISNLS+R+KL+LTVW+F+LP TPSLPAV GISDTVIEDRFGV+HG+ EWYEALDQ
Sbjct: 1 MEEDAISNLSLRLKLNLTVWEFVLPETPSLPAVFGISDTVIEDRFGVKHGTAEWYEALDQ 60
Query: 320 HFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSN 379
HFKWLLQYRISP+FC+W + MRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPY V+S N
Sbjct: 61 HFKWLLQYRISPYFCKWADGMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYKQVVSGN 120
Query: 380 DGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTY 439
D AKDY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SVRNMAS++HAYAPDAR+LTTY
Sbjct: 121 DAAKDYLQKQVEILRTKNHWRKAYFYLWDEPLNLEQYDSVRNMASDIHAYAPDARILTTY 180
Query: 440 YCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEW 499
YCGP+DAPL PTPFE+FVKVP FLRPH QIYCTSEWVLGNREDLVKDI+ ELQPENGEEW
Sbjct: 181 YCGPNDAPLAPTPFEAFVKVPSFLRPHNQIYCTSEWVLGNREDLVKDIIAELQPENGEEW 240
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR 559
WTYVCMGPSDPHPNWHLGMRG+QHRAVMWRVWKEGGTGFLYWGANCYEKATV SAEI+FR
Sbjct: 241 WTYVCMGPSDPHPNWHLGMRGTQHRAVMWRVWKEGGTGFLYWGANCYEKATVASAEIKFR 300
Query: 560 RGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
GLPPGDGVL+YPGEVFS++ QPVASLRLER+LSGLQ
Sbjct: 301 HGLPPGDGVLYYPGEVFSTN-QPVASLRLERLLSGLQ 336
>gi|326494652|dbj|BAJ94445.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 408
Score = 516 bits (1330), Expect = e-143, Method: Compositional matrix adjust.
Identities = 240/378 (63%), Positives = 301/378 (79%), Gaps = 1/378 (0%)
Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
RLVVGQS+ LRRVVP+LGVPDALVP+D QI+L+PGETTAVW+S++ P Q PGLYEG
Sbjct: 5 RLVVGQSITLRRVVPILGVPDALVPIDPSSPQINLLPGETTAVWISLNVPCGQQPGLYEG 64
Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
EI IT+ +AD++ + L K E+++L+ L+ CLD E + E++ R ST+TTLR
Sbjct: 65 EIFITAVRADSDSRADSLLKSERYQLYKGLKTCLDITESRDHLSSEEMILRLSSTSTTLR 124
Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
R++ P F ++ +NG DMMDED ++N++VRVKLSLTVWDF LP TPSLPAV GIS+TV
Sbjct: 125 RMLVLPAFQDYHENNGLGDMMDEDVLNNVAVRVKLSLTVWDFTLPLTPSLPAVFGISETV 184
Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
IEDRF + HG+ WY+ALD HF WLLQYRISPFFCRWG+SMR+L YTCPWP DHPK++EY
Sbjct: 185 IEDRFCLEHGTKGWYDALDHHFGWLLQYRISPFFCRWGDSMRILAYTCPWPTDHPKANEY 244
Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
+SDPRLAAYAVPY+P+LS D AK+ +R+E+E+L+T+ HW KAYFYLWDEPLNME Y +
Sbjct: 245 YSDPRLAAYAVPYAPILSCTDAAKNSLRREVEILKTEPHWSKAYFYLWDEPLNMEQYEVI 304
Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
N+++EL Y PD R+LTTYY GPS + L P+ FE+F KVP LRPHTQI+CTSEWVLG
Sbjct: 305 CNISNELRTYTPDVRILTTYYAGPSGSELAPSTFEAFAKVPNVLRPHTQIFCTSEWVLGT 364
Query: 480 REDLVKDIVTELQPENGE 497
REDLVKDI+ EL+P+ GE
Sbjct: 365 REDLVKDIIAELRPDLGE 382
>gi|404484502|ref|ZP_11019706.1| hypothetical protein HMPREF9448_00112 [Barnesiella intestinihominis
YIT 11860]
gi|404339507|gb|EJZ65938.1| hypothetical protein HMPREF9448_00112 [Barnesiella intestinihominis
YIT 11860]
Length = 503
Score = 225 bits (573), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 132/325 (40%), Positives = 174/325 (53%), Gaps = 30/325 (9%)
Query: 271 RVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKW---LLQY 327
++K+ L V+D LP+TPSLPA GI + + D S + L +W L Y
Sbjct: 147 KIKIDLQVYDTALPSTPSLPAAFGIIEKNLID-------STSKEQTLQNKLEWAELCLDY 199
Query: 328 RISPFFCRW-GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYV 386
R++P+F W SM+ + PW + ++ + SD R +AVPY LS N+
Sbjct: 200 RMNPYFSTWLANSMKHEASSSPWKWNDKRTVPFLSDKRFNRFAVPYHS-LSHNELDSLLQ 258
Query: 387 R-KEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 445
R K+ +LL K+YFYLWDEP M+ Y + + E+H P+A+VLTT+YCGP D
Sbjct: 259 RLKQTDLL------DKSYFYLWDEPAYMKEYHLIGQYSQEIHKLMPEAKVLTTFYCGPKD 312
Query: 446 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 505
F F R TQI+ S W L E + L+ EEWWTYVCM
Sbjct: 313 GKYKDRLFSVF----DLWRGDTQIFSMSAWALQANEANADTCRSLLR--GNEEWWTYVCM 366
Query: 506 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPG 565
GP + PN L M G QHRAV+WR WKE TGFLYW N Y S + FR+ LP G
Sbjct: 367 GPGEEQPNLLLTMDGYQHRAVLWRSWKERTTGFLYWAVNAY----AESDTLAFRKDLPEG 422
Query: 566 DGVLFYPGEVFSSSRQPVASLRLER 590
DGVL YPG+ F+S+ PV S+R+ER
Sbjct: 423 DGVLIYPGQYFNST-SPVVSIRMER 446
>gi|404484501|ref|ZP_11019705.1| hypothetical protein HMPREF9448_00111 [Barnesiella intestinihominis
YIT 11860]
gi|404339506|gb|EJZ65937.1| hypothetical protein HMPREF9448_00111 [Barnesiella intestinihominis
YIT 11860]
Length = 513
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 113/330 (34%), Positives = 172/330 (52%), Gaps = 32/330 (9%)
Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 331
V +S+ V + LP TPS+ +V GI+ ++ ++ E LL+YRISP
Sbjct: 160 VAISINVVNASLPETPSIASVFGINP---QNFIFTGLSEEQKIEKRKAASDLLLEYRISP 216
Query: 332 FFCRW-GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEI 390
+F W +M+ ++ P+ + ++ EY +D R + A+P S LS + E+
Sbjct: 217 YFSTWLSGTMKTECFSSPYAWNDDRTWEYLADKRFSRIALP-SHGLSDD---------EL 266
Query: 391 ELLRTKAH----WKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDA 446
E++ KA KA+FY+WDEP Y ++ ++ +H YAP+A+VLTT+YCGP+D
Sbjct: 267 EMMLNKARETGLLNKAFFYVWDEPTKTNEYEQIKTLSDRIHRYAPEAKVLTTFYCGPTDG 326
Query: 447 PLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMG 506
F F L T IYCT W L + E+ + +L+ +G+EWW+YVCM
Sbjct: 327 EHKDDLFAVF----DILNGATSIYCTGVWALQDNENRSEQCKAKLK--SGQEWWSYVCMS 380
Query: 507 PSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 566
+ P +RA MWR +KE +GFLYW N + + P +R R LP GD
Sbjct: 381 NT---PGLASNSTAIGNRATMWRNYKEQNSGFLYWVVNGF-ASVYP---LRPRPELPEGD 433
Query: 567 GVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
G+L YPGE F +++ S+RLER G +
Sbjct: 434 GILIYPGESFGTNK-ICTSVRLERWRDGAE 462
>gi|319640384|ref|ZP_07995108.1| hypothetical protein HMPREF9011_00705 [Bacteroides sp. 3_1_40A]
gi|345517443|ref|ZP_08796919.1| hypothetical protein BSFG_04467 [Bacteroides sp. 4_3_47FAA]
gi|254838009|gb|EET18318.1| hypothetical protein BSFG_04467 [Bacteroides sp. 4_3_47FAA]
gi|317387987|gb|EFV68842.1| hypothetical protein HMPREF9011_00705 [Bacteroides sp. 3_1_40A]
Length = 514
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 105/331 (31%), Positives = 160/331 (48%), Gaps = 31/331 (9%)
Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 331
++L V +P S+P +G+ + + + + E +D ++L YR++P
Sbjct: 144 IQLDYNVHHTTIPLKSSIPITVGVENRCMTECLNDKEADKERQRWVD----FVLSYRMTP 199
Query: 332 FFC------RWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDY 385
F RW PW + +S +D R + Y +P+ LS N+ A
Sbjct: 200 VFGTQITPERWQYEHSF----SPWAWNDKRSIRLLNDRRYSCYMLPFF-TLSENELASLL 254
Query: 386 VRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 445
++ K K++ FY+WDEP ME Y ++ + + YA DAR+LTT++CGP +
Sbjct: 255 CN-----IQKKGKLKESLFYIWDEPAYMEDYEQIKRKVNIIRKYASDARILTTFFCGPRN 309
Query: 446 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 505
P + F +L+ H + S E+ V+ I ++ PE G +WW+YVC
Sbjct: 310 GPRKGDLYAVF----DYLKHHIHVATISLAPCKGNEEEVQHIRYKV-PE-GIDWWSYVCW 363
Query: 506 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPG 565
P PN+ L M+G Q RA+MWR WK G GFLYW N Y K P I +P G
Sbjct: 364 QPGGNEPNFLLQMKGIQQRAIMWRTWKNGSQGFLYWNCNIYHKRN-PFTYI---TDMPHG 419
Query: 566 DGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
DG+L YPG++ + P+AS RLER G +
Sbjct: 420 DGILIYPGDIL-GCKGPIASARLERWRDGAE 449
>gi|294777879|ref|ZP_06743323.1| hypothetical protein CUU_2186 [Bacteroides vulgatus PC510]
gi|294448333|gb|EFG16889.1| hypothetical protein CUU_2186 [Bacteroides vulgatus PC510]
Length = 318
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 90/249 (36%), Positives = 130/249 (52%), Gaps = 17/249 (6%)
Query: 348 PWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLW 407
PW + +S +D R + Y +P+ LS N+ A ++ K K++ FY+W
Sbjct: 22 PWAWNDKRSIRLLNDRRYSCYMLPFF-TLSENELASLLCN-----IQKKGKLKESLFYIW 75
Query: 408 DEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHT 467
DEP ME Y ++ + + YA DAR+LTT++CGP + P + F +L+ H
Sbjct: 76 DEPAYMEDYEQIKRKVNIIRKYASDARILTTFFCGPRNGPRKGDLYAVF----DYLKHHI 131
Query: 468 QIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVM 527
+ S E+ V+ I ++ PE G +WW+YVC P PN+ L M+G Q RA+M
Sbjct: 132 HVATISLAPCKGNEEEVQHIRYKV-PE-GIDWWSYVCWQPGGNEPNFLLQMKGIQQRAIM 189
Query: 528 WRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLR 587
WR WK G GFLYW N Y K P I +P GDG+L YPG++ + P+AS R
Sbjct: 190 WRTWKNGSQGFLYWNCNIYHKRN-PFTYI---TDMPHGDGILIYPGDIL-GCKGPIASAR 244
Query: 588 LERILSGLQ 596
LER G +
Sbjct: 245 LERWRDGAE 253
>gi|413951106|gb|AFW83755.1| hypothetical protein ZEAMMB73_317062 [Zea mays]
Length = 1594
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 73/121 (60%), Positives = 89/121 (73%), Gaps = 1/121 (0%)
Query: 1 MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
+ N G Q+ VP VEGVA G T+YGW D + + G I+PT + + +L+HVW MPS
Sbjct: 372 LGNGGKTQNVSVPTVEGVARG-TSYGWVDGGLRGTNLGAGVIDPTNVHSDNLLHVWSMPS 430
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
TANV QE PRPLE +NLLAARNERES QIALRPKVSW++S AG V +QC+DLCS+SGD
Sbjct: 431 TANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSSSGD 490
Query: 121 R 121
R
Sbjct: 491 R 491
>gi|413953324|gb|AFW85973.1| putative DUF1692 domain containing protein [Zea mays]
Length = 1070
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/121 (60%), Positives = 89/121 (73%), Gaps = 1/121 (0%)
Query: 1 MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
+ N G Q+ VP VEGVA G T+YGW D + + G I+PT + + +L+HVW MPS
Sbjct: 372 LGNGGKTQNVSVPTVEGVARG-TSYGWVDGGLRGTNLGAGVIDPTNVHSDNLLHVWSMPS 430
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
TANV QE PRPLE +NLLAARNERES QIALRPKVSW++S AG V +QC+DLCS+SGD
Sbjct: 431 TANVSQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSSSGD 490
Query: 121 R 121
R
Sbjct: 491 R 491
>gi|413949740|gb|AFW82389.1| putative DUF1692 domain containing protein [Zea mays]
Length = 1061
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/117 (61%), Positives = 87/117 (74%), Gaps = 1/117 (0%)
Query: 5 GNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANV 64
G Q+ VP VEGVA G T+YGW D + + G I+PT + + +L+HVW MPSTANV
Sbjct: 362 GKTQNVSVPTVEGVARG-TSYGWVDGGLRGTNLGAGVIDPTNVHSDNLLHVWSMPSTANV 420
Query: 65 GPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDR 121
QE PRPLE +NLLAARNERES QIALRPKVSW++S AG V +QC+DLCS+SGDR
Sbjct: 421 SQQEAPRPLEKVNLLAARNERESFQIALRPKVSWATSGIAGSVLIQCTDLCSSSGDR 477
>gi|297822901|ref|XP_002879333.1| hypothetical protein ARALYDRAFT_902189 [Arabidopsis lyrata subsp.
lyrata]
gi|297325172|gb|EFH55592.1| hypothetical protein ARALYDRAFT_902189 [Arabidopsis lyrata subsp.
lyrata]
Length = 113
Score = 111 bits (278), Expect = 1e-21, Method: Composition-based stats.
Identities = 71/174 (40%), Positives = 85/174 (48%), Gaps = 67/174 (38%)
Query: 265 ISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL 324
+SNL+V +KL LTVW+FI+ T SL AVI +SDTVIEDRF V HGS+EWY+ L HFKWL
Sbjct: 7 VSNLAVSIKLRLTVWEFIILVTLSLSAVICVSDTVIEDRFDVEHGSEEWYKKLGLHFKWL 66
Query: 325 LQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKD 384
L +RI+ P SSN+
Sbjct: 67 LHHRIN-------------------------------------------PYFSSNNN--- 80
Query: 385 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTT 438
+ LW +PLNMEH+ SV MASE AYA DARVLTT
Sbjct: 81 -------------------YNLW-QPLNMEHFDSVSKMASENFAYA-DARVLTT 113
>gi|354583721|ref|ZP_09002619.1| hypothetical protein PaelaDRAFT_3720 [Paenibacillus lactis 154]
gi|353197601|gb|EHB63082.1| hypothetical protein PaelaDRAFT_3720 [Paenibacillus lactis 154]
Length = 786
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/339 (25%), Positives = 138/339 (40%), Gaps = 51/339 (15%)
Query: 270 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 329
VR+ + LTVWDF L G+ +++ G G + W + +++++ +++R+
Sbjct: 175 VRIPIELTVWDFELTDESHAKTNFGVWGGPVQEAHGNVVGEEAW-KYIEKYYYASVEHRL 233
Query: 330 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKE 389
+P + +S + +Y P +Y +DPR++AY +PY + DG D R +
Sbjct: 234 TPGYLPIPDS-DINSYVERAP-------KYVNDPRISAYRLPY---YRTADGQPDIQRNK 282
Query: 390 --IELLRTKAHWKKAYFYL--WDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSD 445
++ LR KAY+Y+ DEP + Y+ V+ + L APD L T P D
Sbjct: 283 QLVDRLREAGLLSKAYYYVSEIDEP-TRDKYARVKQINDALEQAAPDVPHLVT--IQPVD 339
Query: 446 APLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCM 505
+G WV + E + E Q WW Y +
Sbjct: 340 ELVGDVDI---------------------WV-ADIEKFDEAFAKERQAAGDSVWW-YTYV 376
Query: 506 GPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR----- 560
P P P++HL R + W G G LYW ++K + R
Sbjct: 377 KPKHPFPSYHLDDDLVGTRLLTWMQRDHGVEGALYWATTQFQKYDAAQKKYVSRDVWTDP 436
Query: 561 -GLP--PGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
P GDG LFYPG P+ ++RLE + ++
Sbjct: 437 LAFPGANGDGYLFYPGTEVGVD-GPIGTIRLEVLRESME 474
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 61/149 (40%), Gaps = 23/149 (15%)
Query: 50 ADLVHVWCMPSTANV-GPQEMPRPLEP-INLLAARNERESVQIALRPKVSWSSSSTAGVV 107
DL VW +T V Q P I + AARNE ES Q+ ++ ++ +
Sbjct: 29 GDLFDVWVPTNTEKVMRDQAFPGETNSSIRIGAARNEYESGQVIVK------ANQPLRKL 82
Query: 108 QVQCSDLCSASGDRLVVGQSLMLRR------------VVPMLGVPDALVPLDLPVCQISL 155
QV SDL G + + + L + P PDAL+PL+ Q+ +
Sbjct: 83 QVSMSDLKLTDGSAKIGREHIQLFKQHYIEVKTSTTPAYPKGWYPDALIPLN---QQLEV 139
Query: 156 IPGETTAVWVSIDAPYAQPPGLYEGEIII 184
G +W + P Q PG Y GE+ +
Sbjct: 140 AEGHNQGIWFKVYVPKGQHPGTYTGEMTL 168
>gi|403382311|ref|ZP_10924368.1| hypothetical protein PJC66_21061 [Paenibacillus sp. JC66]
Length = 796
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 134/343 (39%), Gaps = 54/343 (15%)
Query: 270 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 329
VR+ + LTVWDF L GI I++ G G + W E +++++ +++R+
Sbjct: 177 VRIPVELTVWDFELTDENHSKTAFGIWGGPIQEAHGNVQGMEAW-EYIEKYYWASVEHRL 235
Query: 330 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVR-K 388
+P + + + + +H + + +DPR++AY +PY G D R K
Sbjct: 236 TPGY------LPIPDTDIDYYVEH--APRFINDPRVSAYRLPY---YRDAQGEPDIERIK 284
Query: 389 EI-ELLRTKAHWKKAYFYL--WDEPL----NMEHYSSVRNMASELHAYAPDARVLTTYYC 441
E+ + LR + +K YFY+ DEP+ +Y V+ + L APD L T
Sbjct: 285 ELADKLRDRGMLEKGYFYISEIDEPVPHPNAANNYDRVKVINDALKQAAPDVPHLVTI-- 342
Query: 442 GPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWT 501
P E + P Y D E Q E WW
Sbjct: 343 ---------QPLEELLGDVDIWSPEIDKYDY-------------DFARERQAEGEPVWW- 379
Query: 502 YVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR- 560
Y + P P P++H R + W G G LYW ++K + R
Sbjct: 380 YTSVFPKHPFPSYHTDDDLVGARLLTWMQHDYGVEGTLYWATTQFQKYDSAQRKYVSRDV 439
Query: 561 -----GLP--PGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
P GDG LFYPG PV ++RLE + ++
Sbjct: 440 WTDPLAFPGANGDGYLFYPGTEIGID-GPVGTIRLEVLRESME 481
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 64/148 (43%), Gaps = 24/148 (16%)
Query: 52 LVHVWCMPSTANVGPQEMPRPLEP---INLLAARNERESVQIALRPKVSWSSSSTAGVVQ 108
L W ++ V E P P + + L AARNE ES Q+ +R + + +Q
Sbjct: 32 LFTAWVASNSQKVMRDE-PMPADSARTMQLAAARNEYESGQVIVR-----AGNHPLRKLQ 85
Query: 109 VQCSDLCSASGDRLVVGQSLMLRR------------VVPMLGVPDALVPLDLPVCQISLI 156
V SDL +G + + + L + P PDAL+PL ++ +
Sbjct: 86 VSISDLKQENGAAKIHRRDIELFQQHYIEVTTSTTPAYPQGWYPDALIPLK---GKLEVG 142
Query: 157 PGETTAVWVSIDAPYAQPPGLYEGEIII 184
G +WV + P QP G+Y+GEI +
Sbjct: 143 AGHNQGIWVKVYVPKGQPAGVYKGEITL 170
>gi|374849806|dbj|BAL52811.1| hypothetical protein HGMM_F03C06C16 [uncultured prokaryote]
Length = 994
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 129/303 (42%), Gaps = 61/303 (20%)
Query: 270 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEAL-DQHFKWLLQYR 328
+V L LTV+DF LP TP+L + GI IE V+ D+ AL D++ + ++R
Sbjct: 573 AQVPLMLTVYDFDLPRTPTLRSGFGIDARRIEQYHRVQSEQDK--RALWDRYMRNFREHR 630
Query: 329 ISPF-----------FCRWGESMR-VLTYTCPWPADHPKSDEY-FSDPRLAAYAVP---- 371
++P+ F G + R VL +T A DE+ F+ L + +P
Sbjct: 631 LAPYNFYAYDHYEVRFEGEGANKRVVLDFTRFDRAAQRYLDEFGFNAFVLPIHGLPSGRH 690
Query: 372 --YSPVL--SSNDGA-------KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVR 420
YSP + +G DY+R+ LR + KKAY Y +DEP + Y V+
Sbjct: 691 PNYSPGVFGGFREGTPEYERLWSDYLRQLTTHLRERGWLKKAYVYWFDEPEEAD-YPFVK 749
Query: 421 NMASELHAYAPD-ARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHT-QIYCTSEWVLG 478
+ L APD R+LT P + +G + + + F+ P Q C +
Sbjct: 750 RVNERLKQVAPDLTRMLTEQ---PEEPLIGAV--DLWCPLTAFVSPEAIQARCQA----- 799
Query: 479 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGF 538
GEE W YVC GP P+ + G++ R +W+ W+ G G
Sbjct: 800 -----------------GEEIWWYVCTGPRAPYATLFIDHPGTEMRVWLWQTWQYGVQGI 842
Query: 539 LYW 541
L W
Sbjct: 843 LIW 845
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 56/131 (42%), Gaps = 26/131 (19%)
Query: 67 QEMPRP---LEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLV 123
+E P P + + L AAR E E VQI LRP+ + QV+ SDL G +
Sbjct: 445 RERPLPETTMHTVTLSAARGEYEPVQIVLRPQ------RNTTLRQVEISDLT--QGKHRL 496
Query: 124 VGQSLMLRRVVPM--------LG----VPDALVPLDLPVCQISLIPGETTAVWVSIDAPY 171
+ + LR V + LG PD L PL P + L +W+++ PY
Sbjct: 497 PAKHITLREVAYVRVAHPTDWLGEPGDYPDPLPPLKTP---LRLQAERNQPLWLTVYVPY 553
Query: 172 AQPPGLYEGEI 182
P G Y G I
Sbjct: 554 GTPAGKYTGTI 564
>gi|392373328|ref|YP_003205161.1| hypothetical protein DAMO_0214 [Candidatus Methylomirabilis
oxyfera]
gi|258591021|emb|CBE67316.1| conserved exported protein of unknown function [Candidatus
Methylomirabilis oxyfera]
Length = 676
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 125/333 (37%), Gaps = 34/333 (10%)
Query: 269 SVRVKLSLTVWDFILPATPSLPAVIG-ISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQY 327
S+ + +SLTVW+F LP TP+L G FG +D +D+ LL++
Sbjct: 308 SIPIPISLTVWNFSLPTTPALRTNFGHFRSQQFAAAFGTSRYTDIHNTLMDKFDHELLRH 367
Query: 328 RISPFFCRWGE-SMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPY-------SPVLSSN 379
R+SP E S T T ++ +F L +Y +P P +
Sbjct: 368 RLSPARPSGTEPSYNAATGTID-SSNVQARMAHFISLGLTSYDLPLFDDWPWADPFGADR 426
Query: 380 DGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTY 439
D A+ Y+ ++ L AY DEP Y +VR+ A+ H P A++L T
Sbjct: 427 DKAQRYLSGILDWLGANDWLTLAYHDGIDEPEEASGYQAVRDEATNWHGLDPRAKMLITE 486
Query: 440 YCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEW 499
P D G + P F R + V GE+
Sbjct: 487 QTRPWDPTWGTLYGSVDIWTPYFSRFDPVTWAERRAV-------------------GEQS 527
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVP---SAEI 556
W Y G + P L + R W ++ G TG L W +++ T P A
Sbjct: 528 WMYGAWG-DNGTPGDLLDRPIYEIRVPAWIGFQYGITGLLKWNTVYWDQVTDPWTNPATY 586
Query: 557 RFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 589
+ GDG FYPG P+ASLRL+
Sbjct: 587 TLSGDIFNGDGAFFYPGTKV-GYEGPIASLRLK 618
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 59/161 (36%), Gaps = 25/161 (15%)
Query: 44 PTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSST 103
P PTA + W S A + P + + AARNE E Q+ ++ S +
Sbjct: 148 PISTPTAAQITAWVTDSLARIQPTDPAGISTEATIKAARNEYEGFQVIVKAP----SDTA 203
Query: 104 AGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGV-------------PDALVPLDLPV 150
V SDL +G ++ ++ L R +L PD L+P P
Sbjct: 204 LSNVTATASDLTGPTG--VIASSNITLYREAYILVTTSSPASPYPTGWWPDPLIPFKHPE 261
Query: 151 CQISL------IPGETTAVWVSIDAPYAQPPGLYEGEIIIT 185
+L G ++V + P P G Y G I ++
Sbjct: 262 TGANLGQPFTVDAGRNVPIYVEVYVPAGTPAGTYTGGIQVS 302
>gi|297791959|ref|XP_002863864.1| hypothetical protein ARALYDRAFT_917687 [Arabidopsis lyrata subsp.
lyrata]
gi|297309699|gb|EFH40123.1| hypothetical protein ARALYDRAFT_917687 [Arabidopsis lyrata subsp.
lyrata]
Length = 460
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 299 VIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPW 349
+ E RF V HG +E Y+ D HFKWLLQY ISP+FC+W E V Y PW
Sbjct: 14 LTESRFDVEHGIEECYKTFDLHFKWLLQYWISPYFCKWFE---VSKYVQPW 61
>gi|414591642|tpg|DAA42213.1| TPA: hypothetical protein ZEAMMB73_799052 [Zea mays]
Length = 583
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 12/58 (20%)
Query: 351 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 396
ADHPK++EY+SDPRLAAY VPY+P+LS S D AK +R+E+E + K
Sbjct: 337 ADHPKANEYYSDPRLAAYVVPYAPILSCLLLYLIWLLVNSTDAAKSSLRREVEGVSKK 394
>gi|153004484|ref|YP_001378809.1| hypothetical protein Anae109_1621 [Anaeromyxobacter sp. Fw109-5]
gi|152028057|gb|ABS25825.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
Length = 561
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 134/351 (38%), Gaps = 53/351 (15%)
Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQYRIS 330
V + LTVWDF LP+T +L + G++ + G+ + L + L L +R+S
Sbjct: 180 VPVELTVWDFELPSTATLRSAFGLAWGALPSGHGISSSDLAAFATLRARYGQLALDHRVS 239
Query: 331 ------------PFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSS 378
F R+ + T P + EY P + V +
Sbjct: 240 LSHHDDGMWNDLEHFDRYYGPLMDGTAATRLPGARLTAVEYLG---------PLADVANL 290
Query: 379 NDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEP-LNMEHYSSVRNMASELHAYAPDARVL 436
A+ Y R++ W + F Y DEP + + AS P+ R L
Sbjct: 291 ARWAQRY--------RSRPGWFERLFQYTCDEPPYQGCGWGDIALRASAAKKADPEFRTL 342
Query: 437 TTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG 496
T ++A + V V F+ + Y + D +P+N
Sbjct: 343 VTTTIQEAEANGATGLLDLVVPVVNFIDDKSGGYA-------GDQRPKYDAFLAAEPQN- 394
Query: 497 EEWWTYVCM----GPSDPH----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEK 548
E W CM G S + P++ + ++RA+ W +K TG LYW
Sbjct: 395 EVWLYQSCMSHGCGGSSAYGTGWPSYMVDASAVRNRAMQWLAFKYRATGELYWDTTYAYL 454
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
+ P A + G GDG LFYPG ++ ++ PVAS+RL+ I G++
Sbjct: 455 SGDPWASVWEFDG--NGDGTLFYPGTPAKIGGTTHVPVASIRLKMIREGME 503
>gi|291514616|emb|CBK63826.1| hypothetical protein AL1_13720 [Alistipes shahii WAL 8301]
Length = 573
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 95/239 (39%), Gaps = 40/239 (16%)
Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSS 418
+ DPR+ Y Y P L ++++R + + W Y ++ DEPL+ E+ +S
Sbjct: 321 YDDPRVQQYIAAYFPAL------QEHLRSKTINDGSGRSWLDIYTQHIADEPLD-ENKTS 373
Query: 419 VRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLG 478
+A ++ APD R++ Y D L + VP+ +IY T
Sbjct: 374 WEGLAHQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFAWEIYRTM----- 422
Query: 479 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGF 538
G W Y CM P N ++ + + R + W +K G G+
Sbjct: 423 ---------------PAGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYGSPGY 467
Query: 539 LYWGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
L+WG N + P ++ P GD + YPG R+ S+RL + G++
Sbjct: 468 LHWGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLTAMRDGIR 521
>gi|414591657|tpg|DAA42228.1| TPA: hypothetical protein ZEAMMB73_522235 [Zea mays]
Length = 446
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 37/58 (63%), Gaps = 12/58 (20%)
Query: 351 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 396
ADHPK++EY+SDPRLA Y VPY+P+LS S D AK +R+E+E + K
Sbjct: 200 ADHPKANEYYSDPRLATYVVPYAPILSCLLLYLIWLLVNSTDAAKSSLRREVEGVSKK 257
>gi|224024189|ref|ZP_03642555.1| hypothetical protein BACCOPRO_00912 [Bacteroides coprophilus DSM
18228]
gi|224017411|gb|EEF75423.1| hypothetical protein BACCOPRO_00912 [Bacteroides coprophilus DSM
18228]
Length = 566
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/207 (24%), Positives = 92/207 (44%), Gaps = 29/207 (14%)
Query: 410 PLNMEHYSSV-RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRP--- 465
P+N E S+ R L ++ + Y +D P+ + F+S+V++ +F++
Sbjct: 316 PINSEKASNFYRQFLPSLMSHLQKRGLKDIYVQHIADEPI-ESNFKSYVEIARFVKDICP 374
Query: 466 --------HTQIYCTSEWVLGNREDLVKDIVTELQPEN--GEEWWTYVCMGPSDPHPNWH 515
HT + + + + KD + Q G+E W Y C+ P N
Sbjct: 375 DLRIIEACHTHNLENTVDIWVPQLNFYKDGYSFYQERQKAGDEVWFYTCLAPQGNFANRF 434
Query: 516 LGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAE---IRFRRG--LPPGDGVLF 570
L + + R + W ++ G TG+L+WG N +++ + P E + G LP GD +
Sbjct: 435 LELPSIKTRLIHWLNFRYGATGYLHWGFNFWKENSDPYGETTTMNLESGNTLPGGDSWIV 494
Query: 571 YP--GEVFSSSRQPVASLRLERILSGL 595
YP G+++S S+RLE + G+
Sbjct: 495 YPKNGKLYS-------SIRLEAMRDGI 514
>gi|390946349|ref|YP_006410109.1| hypothetical protein Alfi_1070 [Alistipes finegoldii DSM 17242]
gi|390422918|gb|AFL77424.1| hypothetical protein Alfi_1070 [Alistipes finegoldii DSM 17242]
Length = 573
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 93/237 (39%), Gaps = 40/237 (16%)
Query: 362 DPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSSVR 420
DPR+ Y Y P L ++++R + + W Y ++ DEPLN E+ +S
Sbjct: 323 DPRVQRYIAAYFPAL------QEHLRSRMIDDGSGRSWLDIYTQHIADEPLN-ENKTSWE 375
Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
+A ++ APD R++ Y D L + VP+ +IY T
Sbjct: 376 GLARQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFVWEIYRTMP------ 423
Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
G W Y CM P N ++ + + R + W +K G+L+
Sbjct: 424 --------------AGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYDSPGYLH 469
Query: 541 WGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
WG N + P ++ P GD + YPG R+ S+RL + G++
Sbjct: 470 WGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLAAMRDGIR 521
>gi|218779590|ref|YP_002430908.1| hypothetical protein Dalk_1743 [Desulfatibacillum alkenivorans
AK-01]
gi|218760974|gb|ACL03440.1| hypothetical protein Dalk_1743 [Desulfatibacillum alkenivorans
AK-01]
Length = 844
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 94/232 (40%), Gaps = 33/232 (14%)
Query: 383 KDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCG 442
KDY+ E LR + +AY+Y+ +EP + E Y +V A+ L + APD +++ +
Sbjct: 353 KDYMHATQEYLRGLGYLDRAYYYMANEPQDGEDYKAVAWYANLLKSAAPDLKLMVS---- 408
Query: 443 PSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY 502
P E + I+ VL N + D+ + + + EE W Y
Sbjct: 409 -------EEPKEEIYNNETYSGAKIDIWLP---VLNNYD---PDVSHDREKNHQEETWVY 455
Query: 503 VCMGPSDPHPN-WHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA--TVPSAEIRFR 559
G P+ N L G + + W +WK G Y+ N + K T P +
Sbjct: 456 FLHGTRPPYYNPITLDHPGIESKFTGWLLWKYRIRGIAYYSMNGWSKNPWTSPMTDGH-- 513
Query: 560 RGLPPGDGVLFYPG-------EVFSSSRQPVASLRLERILSGLQVRWICYYL 604
GD +FYP + +++ + V S+RLE + L+ Y L
Sbjct: 514 ----NGDTFMFYPPSEDNSAIDYAANNHRLVPSIRLELMRDSLEDYEYLYLL 561
>gi|410456409|ref|ZP_11310270.1| hypothetical protein BABA_21191 [Bacillus bataviensis LMG 21833]
gi|409928078|gb|EKN65201.1| hypothetical protein BABA_21191 [Bacillus bataviensis LMG 21833]
Length = 548
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 82/214 (38%), Gaps = 43/214 (20%)
Query: 390 IELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLG 449
I+ ++ + +F++ DEP +++ S +N + L Y D V+ DA
Sbjct: 315 IDFIKQNGLEHRVFFHVSDEP-HLDQVESYQNASEILQTYVKDFPVI--------DALSD 365
Query: 450 PTPFES-FVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG-EEWWTYVCMGP 507
T +E VK P H Q + +NG E WTY C
Sbjct: 366 YTFYEKGLVKTPIPSNDHIQPFL----------------------DNGVENLWTYHCCVQ 403
Query: 508 SDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGL 562
N M ++R + +++K GFL+WG N + +K P
Sbjct: 404 YKKVANRFFNMPSFRNRVLGMQLYKFNIAGFLHWGYNFWYSQYSKKPIDPFRNTDAHYAF 463
Query: 563 PPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
P GD L YPGE P+ S+RLE + LQ
Sbjct: 464 PSGDAFLVYPGE-----EGPIESIRLEVLHEALQ 492
>gi|334364721|ref|ZP_08513701.1| hypothetical protein HMPREF9720_1375 [Alistipes sp. HGB5]
gi|313159097|gb|EFR58472.1| hypothetical protein HMPREF9720_1375 [Alistipes sp. HGB5]
Length = 514
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 93/237 (39%), Gaps = 40/237 (16%)
Query: 362 DPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSSVR 420
DPR+ Y Y P L ++++R + + W Y ++ DEPLN E+ +S
Sbjct: 264 DPRVQRYIAAYFPAL------QEHLRSRMIDDGSGRSWLDIYTQHIADEPLN-ENKTSWE 316
Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
+A ++ APD R++ Y D L + VP+ +IY T
Sbjct: 317 GLARQVKQAAPDIRIIEAYRSSSYDPALID------ILVPQLDEFVWEIYRTMP------ 364
Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
G W Y CM P N ++ + + R + W +K G+L+
Sbjct: 365 --------------AGHSCWFYTCMYPRGNFANRYVTLPLIKTRLLHWINYKYDSPGYLH 410
Query: 541 WGANCYEKATVPSAEIRF-RRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
WG N + P ++ P GD + YPG R+ S+RL + G++
Sbjct: 411 WGFNAWGANGDPFGDVSAPANDWPGGDSHIVYPG-----YRKLYPSIRLAAMRDGIR 462
>gi|414879440|tpg|DAA56571.1| TPA: hypothetical protein ZEAMMB73_699847 [Zea mays]
Length = 659
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 37/58 (63%), Gaps = 12/58 (20%)
Query: 351 ADHPKSDEYFSDPRLAAYAVPYSPVLS------------SNDGAKDYVRKEIELLRTK 396
ADHPK++EY+S+PRLAAY PY+P+LS S + AK +R+E+E + K
Sbjct: 173 ADHPKANEYYSNPRLAAYVAPYAPILSCLLLYLIWLLVNSTNAAKSSLRREVEGVSKK 230
>gi|444913149|ref|ZP_21233303.1| hypothetical protein D187_05240 [Cystobacter fuscus DSM 2262]
gi|444716152|gb|ELW57007.1| hypothetical protein D187_05240 [Cystobacter fuscus DSM 2262]
Length = 546
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 136/374 (36%), Gaps = 60/374 (16%)
Query: 257 IDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEA 316
+DM A S V + V F+LPAT SLP GIS I G++ S E
Sbjct: 132 LDMEGAPAAS-----VPFTAEVQPFVLPATSSLPNSFGISLYSIAKGHGLKPESPEAQTL 186
Query: 317 LDQHFKWLLQYRIS-------PFFCRWGESMRVLTYTCPWPADHPKSDEYF--SDPRLAA 367
L + LL +R+S P R+ E VL + P D S R
Sbjct: 187 LRDYVTALLAHRVSAHGMSMEPPPVRFEEGRAVLDFRAYDAEVGPFLDGSALPSGARFTT 246
Query: 368 YAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELH 427
V S +++ Y R E + K + +FY DEP E VR A +
Sbjct: 247 VDVRDSKAARTDEQKAAYYRAFAEHAKDKGWPAQLFFYAKDEP-KPEDVPLVRAQALRVR 305
Query: 428 AYAPDARVLTT-----YYCGPSD--APL--------GPTPFESFVKVPKF---LRPHTQI 469
D VL T G +D AP GP + V + L P+ ++
Sbjct: 306 TAGKDVPVLVTSPLDEALRGSADILAPTLNCFFPRPGPQTCRNVVPLQTLRGKLAPNVKV 365
Query: 470 Y----CTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRA 525
+ C S G KD TE + W +Y+ P+ +RA
Sbjct: 366 WWYQSCNSHGCTGGP---AKDSATE---KAYSGWASYMVDHPA------------PLNRA 407
Query: 526 VMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQP 582
+ + G G LY+ P E+ F G GDG FYPG S QP
Sbjct: 408 MGPLAFLSGVDGELYFDTVFAYNTKDPWKEV-FEFG-GNGDGTFFYPGTPAHTGLSRHQP 465
Query: 583 VASLRLERILSGLQ 596
V SLRL+ + GL+
Sbjct: 466 VVSLRLKHLRDGLE 479
>gi|430747974|ref|YP_007207103.1| hypothetical protein Sinac_7369 [Singulisphaera acidiphila DSM
18658]
gi|430019694|gb|AGA31408.1| hypothetical protein Sinac_7369 [Singulisphaera acidiphila DSM
18658]
Length = 577
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 104/285 (36%), Gaps = 61/285 (21%)
Query: 326 QYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFS--DPRLAAYAVPYSPVLSSNDGAK 383
Q+ S + WG + Y + D+Y +P ++ ++ Y L K
Sbjct: 295 QFEWSHLWIYWGVENPMRIYK-------KEGDQYVMLWEPTISGFSDTYVNFL------K 341
Query: 384 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGP 443
++ + + L + + +YF+L DEP +H + R L AP +V+
Sbjct: 342 QFLPEFKKFLTEEKMLETSYFHLSDEPGPGQHVQNYRRARQILREIAPWMKVMDAL---- 397
Query: 444 SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYV 503
SD G E +P L Q Y ++ P W Y
Sbjct: 398 SDIEYGK---EGLTDIPIPLVSAAQAYIDAK-----------------IPH-----WVYY 432
Query: 504 CMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRR--- 560
C GP+ P N + S+ R W ++ GFL+WG N ++K A F
Sbjct: 433 CCGPTGPWLNRFMDTPLSKIRMSGWLFYRHEAKGFLHWGFNYWDKMEREEAGDPFHDGSN 492
Query: 561 ----GLPPGDGVLFYPG----------EVFSSSRQPVASLRLERI 591
G+P GD + YPG EVF+ S Q A L+ I
Sbjct: 493 ASYPGIPFGDPFVIYPGPDGPIDSIRWEVFAESLQDYAILQTAGI 537
>gi|157370810|ref|YP_001478799.1| hypothetical protein Spro_2570 [Serratia proteamaculans 568]
gi|157322574|gb|ABV41671.1| conserved hypothetical protein [Serratia proteamaculans 568]
Length = 556
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 45/102 (44%), Gaps = 9/102 (8%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
W Y C N +++R + ++++ TGFL+WG N Y + P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGIQLYRYNITGFLHWGFNFYNSGHSREQLNPYA 459
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
R P GD + YPGE + PV SLRL + GLQ
Sbjct: 460 VTDCRNAFPSGDAFVVYPGEDLT----PVESLRLRVLHQGLQ 497
>gi|442320014|ref|YP_007360035.1| hypothetical protein MYSTI_03035 [Myxococcus stipitatus DSM 14675]
gi|441487656|gb|AGC44351.1| hypothetical protein MYSTI_03035 [Myxococcus stipitatus DSM 14675]
Length = 561
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 133/369 (36%), Gaps = 63/369 (17%)
Query: 266 SNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLL 325
S V V ++ V F+LPAT SLP GIS I G+ S E L + + LL
Sbjct: 147 SREHVSVPFTVEVQPFVLPATASLPTSFGISQLSIARGHGLNAESSEAKALLRAYARMLL 206
Query: 326 QYRI-------SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSS 378
++R+ SP R+ + V+ W + + L + A + L
Sbjct: 207 EHRVSAHGMSMSPPPVRFEDGRAVVD----WREYDAEMAPFLDGSLLPSGARFTTTDLRD 262
Query: 379 NDGAKD------YVRKEIELLRTKAHWKKAYFYLWDE------PLNMEHYSSVRN----- 421
N A Y R +E R K + +FY DE PL + VR
Sbjct: 263 NKKAHTEAERVAYYRAFVEHFRKKDWPTQLFFYAKDEPKPQDVPLVLTQSRRVREAGGAR 322
Query: 422 ------MASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIY----C 471
M EL A A D T P P + ++ K LR TQ++ C
Sbjct: 323 VLITTPMEGELPA-AADILAPTLNCFFPRPGPATCRAIHTVTELRKQLRSGTQVWWYQSC 381
Query: 472 TSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVW 531
S G TE E W ++ + +RA+ +
Sbjct: 382 NSHGCNGG-------ASTEAAQERAYSGWA-----------SYMVDHSAMLNRAMGPLAF 423
Query: 532 KEGGTGFLYWGA-NCYEKATVPSAEIRFRRGLPPGDGVLFYPG--EVFSSSR-QPVASLR 587
G G LY+ Y P ++ F G GDG FYPG E SR QPV SLR
Sbjct: 424 VNGVDGELYFDTVFAYNTKKDPWKDL-FEFG-GNGDGTFFYPGTPERLGDSRHQPVPSLR 481
Query: 588 LERILSGLQ 596
L+ + GL+
Sbjct: 482 LKHLRDGLE 490
>gi|86160501|ref|YP_467286.1| hypothetical protein Adeh_4085 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85777012|gb|ABC83849.1| hypothetical protein Adeh_4085 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 539
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 138/348 (39%), Gaps = 47/348 (13%)
Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQYRIS 330
V ++LTVW F LP+T SL + G+S + GV S + AL + L L +RI+
Sbjct: 109 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGV---SGDALSALRARYGQLALDHRIT 165
Query: 331 PFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEI 390
R + R L + + + P A +V Y L + G +
Sbjct: 166 --LSRIDDGNRDLAHFASFFGPLFDGASAATLPGAQATSVEY---LGGSSGYASWA---- 216
Query: 391 ELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGP 450
+++ + + Y DEP +S + A+ A +P R L T +DA G
Sbjct: 217 SFFQSRGWDDRLFQYTCDEPPLQCAWSDIPARAASARAVSPALRTLVTTTIQQADAA-GV 275
Query: 451 TP-FESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY------- 502
TP + V V FL E G + D P E WTY
Sbjct: 276 TPAIDVLVPVVNFLDDR-----AGERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHG 327
Query: 503 ----VCMG-PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV 551
V MG PSD P++ + ++RA+ W + TG LY+
Sbjct: 328 CGGTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDP 387
Query: 552 PSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
+ + F GDG LFYPG +V +++ PVAS+RL+ I G++
Sbjct: 388 WANQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGME 432
>gi|383454162|ref|YP_005368151.1| hypothetical protein COCOR_02162 [Corallococcus coralloides DSM
2259]
gi|380728520|gb|AFE04522.1| hypothetical protein COCOR_02162 [Corallococcus coralloides DSM
2259]
Length = 631
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 86/373 (23%), Positives = 133/373 (35%), Gaps = 77/373 (20%)
Query: 263 DAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDT-VIEDRFGVRHGSDEWYEALDQHF 321
+A +V LTV D ++P+T SL + + T V G + + L +
Sbjct: 169 EAEGGFQRQVTARLTVVDAVMPSTSSLASAFPLLPTQVCRAHLGRNDCTPAELQPLLVRY 228
Query: 322 KWL-LQYRI---------------SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRL 365
+ L L++R+ S F+ WG S L T P S R+
Sbjct: 229 QQLSLEHRLTQPRLFLSGSGAQAWSDFYATWGPS---LDGTAP---------SRLSGARM 276
Query: 366 AA--YAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMA 423
+ Y P++ G D+ E + +A+ + DEP + + V+
Sbjct: 277 TSVEYTGPFT-----AGGLADFAGHMSE----RGWLARAHAKIGDEPFDATTFQQVQATG 327
Query: 424 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 483
+ + AP R + T + L E V + L H + G D
Sbjct: 328 TLVRQAAPGLRTMLTV----NSMQLKLNGLEPLVDIAVPLVNHLE---------GTTPDF 374
Query: 484 VKD---IVTELQPENGEEWWTYVCMG--------------PSDPHPNWHLGMRGSQHRAV 526
V D G E W Y P P++ + ++ RA+
Sbjct: 375 VGDQSPTYAGFLSRPGTELWMYQSCASHGCAPGSLMPENQPGSGWPSYMVDRSSAKARAM 434
Query: 527 MWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGE---VFSSSRQPV 583
W ++ G G LY+ A A +P+A GDG LFYPG + + PV
Sbjct: 435 EWLAFRFGAKGELYYEAG----AMLPTAWTDQYHFGGNGDGTLFYPGTPAVIGGQTDVPV 490
Query: 584 ASLRLERILSGLQ 596
ASLRL+ I GLQ
Sbjct: 491 ASLRLKLIRQGLQ 503
>gi|197124590|ref|YP_002136541.1| hypothetical protein AnaeK_4209 [Anaeromyxobacter sp. K]
gi|196174439|gb|ACG75412.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
Length = 618
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 134/346 (38%), Gaps = 43/346 (12%)
Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 331
V ++LTVW F LP+T SL + G+S + GV D ++ + L +R++
Sbjct: 186 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGVS--GDALSTLRGRYGQLALDHRVT- 242
Query: 332 FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIE 391
R + R L + + + P A +V Y L + G +
Sbjct: 243 -LSRIDDGNRDLAHFASFFGPLFDGGAATALPGAQATSVEY---LGGSSGYASWA----S 294
Query: 392 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 451
+++ + + Y DEP +S + A+ A +P R L T +DA +
Sbjct: 295 FFQSRGWDDRLFQYTCDEPPLQCAWSDIPARAASARAVSPALRTLVTTTVQQADAAGVTS 354
Query: 452 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY--------- 502
+ V V FL E G + D P E WTY
Sbjct: 355 SIDVLVPVVNFLDDRA-----GERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHGCG 406
Query: 503 --VCMG-PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPS 553
V MG PSD P++ + ++RA+ W + TG LY+ +
Sbjct: 407 GTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDPWN 466
Query: 554 AEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
+ F GDG LFYPG +V +++ PVAS+RL+ I G++
Sbjct: 467 NQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGME 509
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 59/142 (41%), Gaps = 17/142 (11%)
Query: 55 VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDL 114
VW ST + P R L AARNE E+ Q+ + + ++TAG+ +
Sbjct: 43 VWTATSTEKIRPAATARAPGGAALTAARNEFEAFQVVITGAATGVRATTAGLTGPASLPV 102
Query: 115 CSASGDRLVVGQSLMLRRVVPMLGV----PDALVP--LDLPVCQISLIP-----GETTAV 163
RL + L + G PDALVP +L + + P GE+ AV
Sbjct: 103 ------RLYREAIINLSNPSALDGGTGPWPDALVPDVDELAGERRNAFPFTVPAGESRAV 156
Query: 164 WVSIDAPYAQPPGLYEGEIIIT 185
WV + P P G Y G + +T
Sbjct: 157 WVEVHVPPDAPAGEYAGSVQVT 178
>gi|153004261|ref|YP_001378586.1| hypothetical protein Anae109_1395 [Anaeromyxobacter sp. Fw109-5]
gi|152027834|gb|ABS25602.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
Length = 604
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 66/151 (43%), Gaps = 20/151 (13%)
Query: 50 ADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALR-PKVSWSSSST----A 104
A VW +T + P PR + AARNE E+ Q+ + P S+ +T A
Sbjct: 19 AAAADVWVAGATEKIRPDAQPRQTTEARIAAARNEFEAFQVVVTGPARGVSARATSLEGA 78
Query: 105 GVV------QVQCSDLCSASGDRLVVGQ--SLMLRRVVPMLGVPDALVPLDLPVCQISLI 156
GVV +V D+ +AS G+ ++ V ++G P D+P
Sbjct: 79 GVVDDVKLYRVDAIDVHTASALDGATGRWPDALVPDVDDVVGEKRNAFPFDVPA------ 132
Query: 157 PGETTAVWVSIDAPYAQPPGLYEGEIIITSK 187
GE+ A+WV + P PG + GE+ I S+
Sbjct: 133 -GESRAIWVEVRVPPDAKPGTHFGEVTIASE 162
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 127/358 (35%), Gaps = 58/358 (16%)
Query: 270 VRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRI 329
++ + LTVWDF LP+T SL G+S +I GV +D I
Sbjct: 166 AKIPVMLTVWDFELPSTASLKTHFGLSWGLIPSGHGVSPETDA---------------SI 210
Query: 330 SPFFCRWGESMRVLTYTCPWPADHPKSDEYFSD-----PRLAAYAVPYSPVLS----SND 380
+ G RV H D + + A +P + + + N
Sbjct: 211 RARYAALGLDHRVSLSGVADDGYHGDFDHFERNYAPLVDGTAKTRLPGAKLTTVKYVGNQ 270
Query: 381 GAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYY 440
+ D R+ E R K + + + Y DEP + + +H P+ R L T
Sbjct: 271 TSVDEHRRWAEHFRAKGWFDRLFDYTCDEPPLTCSWDELPQRTKAVHEADPEFRTLVTTQ 330
Query: 441 CGPSDAPLGPTPFESFVKVPKFL--RPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEE 498
++ + V V ++ RP LG + + E E
Sbjct: 331 IWDAEEHGVADEIDIMVPVVNWMDDRPGAG-------SLGQNRAKYDGFLA--KSEKKEL 381
Query: 499 WWTYVCM-----------GPSD------PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYW 541
W CM PS+ P++ + ++RA+ W + E TG LYW
Sbjct: 382 WLYQSCMSHGCGGTVNIGNPSEWDRYNTGWPSYMIDSSAVRNRAMEWISFLEDATGELYW 441
Query: 542 GANCYEKATVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
S + F GDG LFYPG + S+ PVAS+RL+ I G++
Sbjct: 442 ETAFAFTHDAWSNQWDFSGN---GDGTLFYPGTPARIGGSTDIPVASIRLKMIREGME 496
>gi|414879441|tpg|DAA56572.1| TPA: hypothetical protein ZEAMMB73_699847 [Zea mays]
Length = 57
Score = 50.1 bits (118), Expect = 0.003, Method: Composition-based stats.
Identities = 19/27 (70%), Positives = 25/27 (92%)
Query: 351 ADHPKSDEYFSDPRLAAYAVPYSPVLS 377
ADHPK++EY+S+PRLAAY PY+P+LS
Sbjct: 28 ADHPKANEYYSNPRLAAYVAPYAPILS 54
>gi|315648741|ref|ZP_07901837.1| hypothetical protein PVOR_25998 [Paenibacillus vortex V453]
gi|315275943|gb|EFU39294.1| hypothetical protein PVOR_25998 [Paenibacillus vortex V453]
Length = 558
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 83/222 (37%), Gaps = 49/222 (22%)
Query: 385 YVRKEIELLRTKAHWKKAYFYLWDEPL--NMEHYSSVRNMASELHAYAPDARVLTTYYCG 442
++ K ++ +R K+ YF+L DEP ++E Y + + + P L+ Y
Sbjct: 314 FLNKLVQFVRWNGLEKRVYFHLSDEPKLDDLETYRAASELVRPILKDFPIIDALSDYEFY 373
Query: 443 PSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP--ENG-EEW 499
S P P + ++QP ++G E
Sbjct: 374 KSGLIEHPIPASN----------------------------------DIQPFLDHGLEGL 399
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
WTY C N M S++R + +++ GFL+WG N + + A P
Sbjct: 400 WTYYCCAQYKQVSNRFFHMPSSRNRVLGIQLYTLKLRGFLHWGYNFWYAQFSKYAINPYQ 459
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
G P GD L YPGE PV S+RLE + LQ
Sbjct: 460 VTDAGGGFPAGDAFLVYPGE-----EGPVESIRLEVLTEALQ 496
>gi|220919313|ref|YP_002494617.1| hypothetical protein A2cp1_4234 [Anaeromyxobacter dehalogenans
2CP-1]
gi|219957167|gb|ACL67551.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
2CP-1]
Length = 618
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 133/346 (38%), Gaps = 43/346 (12%)
Query: 272 VKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISP 331
V ++LTVW F LP+T SL + G+S + GV D ++ + L +R++
Sbjct: 186 VPVTLTVWPFTLPSTASLKSAFGLSWGTLNTAHGV--SGDALSTLRGRYGQLALDHRVT- 242
Query: 332 FFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIE 391
R + R L + + S P A +V Y L + G +
Sbjct: 243 -LSRIDDGNRDLAHFASFFGPLFDGGAATSLPGAQATSVEY---LGGSSGYASWA----S 294
Query: 392 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 451
+++ + + Y DEP + + A+ A +P R L T +DA +
Sbjct: 295 FFQSRGWDDRLFQYTCDEPPLQCAWGDIPARAASARAVSPALRTLVTTTVQQADAAGVTS 354
Query: 452 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY--------- 502
+ V V FL E G + D P E WTY
Sbjct: 355 SIDVLVPVVNFLDDR-----AGERFAGPQR-AAYDAFLAGSPR--REVWTYQSCMSHGCG 406
Query: 503 --VCMG-PSDPH------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPS 553
V MG PSD P++ + ++RA+ W + TG LY+ +
Sbjct: 407 GTVDMGSPSDSDRYFTGWPSYMIDASAVRNRAMEWISFNHRVTGELYYETTMAYSHDPWN 466
Query: 554 AEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
+ F GDG LFYPG +V +++ PVAS+RL+ I G++
Sbjct: 467 NQWDFSGN---GDGTLFYPGTPAKVGGTTQIPVASIRLKMIREGME 509
>gi|115376571|ref|ZP_01463803.1| hypothetical protein STIAU_2875 [Stigmatella aurantiaca DW4/3-1]
gi|115366439|gb|EAU65442.1| hypothetical protein STIAU_2875 [Stigmatella aurantiaca DW4/3-1]
Length = 620
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 90/213 (42%), Gaps = 31/213 (14%)
Query: 401 KAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVP 460
+AY ++ DEP + ++R A AP+ R L T + L E + V
Sbjct: 270 RAYDFVGDEPPYGISFEALRQNAELTRQVAPELRTLVTT----NSRELDKYALEDLMDVA 325
Query: 461 KFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCM------GPSDPH-- 511
+ H T+ G++ D ++ G E W Y CM G + P
Sbjct: 326 APVVNHMD--GTAPPFQGDQRATYHDFLSL----PGRELWLYQSCMSHGCAYGTNAPENQ 379
Query: 512 -----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 566
P++ + ++ RA+ W + EG +G LY+ A+ + + RF GD
Sbjct: 380 PGAGWPSYMVDRSAAKARAMEWVTFLEGASGELYY-QTVGMLASAWTDQFRFNGN---GD 435
Query: 567 GVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
G LFYPG + ++ PVAS+RL+ I G+Q
Sbjct: 436 GTLFYPGTPAAIGGATDVPVASIRLKLIRLGVQ 468
>gi|304440737|ref|ZP_07400621.1| conserved hypothetical protein [Peptoniphilus duerdenii ATCC
BAA-1640]
gi|304370924|gb|EFM24546.1| conserved hypothetical protein [Peptoniphilus duerdenii ATCC
BAA-1640]
Length = 568
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/187 (22%), Positives = 74/187 (39%), Gaps = 13/187 (6%)
Query: 400 KKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLG---PTPFESF 456
KK + WD P E+ + +L A+ + VL Y SD P + E++
Sbjct: 305 KKKKIFGWDTPSVGEYTKFLEKFIPDLVAHLKEWGVLDKTYFHISDVPREEHIKSYKEAY 364
Query: 457 VKVPKFLRPHTQIYCTSEWVLGNREDL-----VKDIVTELQPENGEEWWTYVCMGPSDPH 511
+ V + + + + + ++ + ++ +E WTY C+G
Sbjct: 365 MSVNDLFKDLKTFEAVAHYDFFKKGLIELPVAASSVIHDFLDDDLDELWTYYCVGQFTEV 424
Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPPGD 566
N + M +++R +++K +GFL+WG N Y K P G P GD
Sbjct: 425 ANRFMSMPSARNRIFGIQMYKFHISGFLHWGYNFYNSVLSYKKIDPYKVTDADDGFPAGD 484
Query: 567 GVLFYPG 573
L YPG
Sbjct: 485 AFLVYPG 491
>gi|310817408|ref|YP_003949766.1| hypothetical protein STAUR_0130 [Stigmatella aurantiaca DW4/3-1]
gi|309390480|gb|ADO67939.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length = 650
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 90/213 (42%), Gaps = 31/213 (14%)
Query: 401 KAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVP 460
+AY ++ DEP + ++R A AP+ R L T + L E + V
Sbjct: 300 RAYDFVGDEPPYGISFEALRQNAELTRQVAPELRTLVTT----NSRELDKYALEDLMDVA 355
Query: 461 KFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCM------GPSDPH-- 511
+ H T+ G++ D ++ G E W Y CM G + P
Sbjct: 356 APVVNHMD--GTAPPFQGDQRATYHDFLSL----PGRELWLYQSCMSHGCAYGTNAPENQ 409
Query: 512 -----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGD 566
P++ + ++ RA+ W + EG +G LY+ A+ + + RF GD
Sbjct: 410 PGAGWPSYMVDRSAAKARAMEWVTFLEGASGELYY-QTVGMLASAWTDQFRFNGN---GD 465
Query: 567 GVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
G LFYPG + ++ PVAS+RL+ I G+Q
Sbjct: 466 GTLFYPGTPAAIGGATDVPVASIRLKLIRLGVQ 498
Score = 38.9 bits (89), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 57/159 (35%), Gaps = 41/159 (25%)
Query: 55 VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDL 114
VW + V P PR + L AARNE S Q+AL + G+ V+ + L
Sbjct: 25 VWGESAMVKVRPNLAPRARPELQLTAARNEFVSFQVALH-------GGSTGLSGVR-AKL 76
Query: 115 CSASGDRLVVGQSLMLRRVVPMLGV------------PDALV--------------PLDL 148
G + G + L RV + V PD LV P D+
Sbjct: 77 NGFVGPTSISGPDVTLYRVAYLTTVRPSVPGTPVGRWPDGLVPDVDEIAGEGRRAFPFDV 136
Query: 149 PVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSK 187
P E A+WV + P P G Y G + + S
Sbjct: 137 PA-------NEARAIWVDVHVPMDAPAGQYRGTVEVLSS 168
>gi|325662024|ref|ZP_08150643.1| hypothetical protein HMPREF0490_01381 [Lachnospiraceae bacterium
4_1_37FAA]
gi|325471687|gb|EGC74906.1| hypothetical protein HMPREF0490_01381 [Lachnospiraceae bacterium
4_1_37FAA]
Length = 562
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 42/101 (41%), Gaps = 13/101 (12%)
Query: 496 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-------YEK 548
GE WTYVC GP N L + R + W K +GFL+WG N YE
Sbjct: 399 GETVWTYVCCGPEGHWLNRFLDFALLKGRMLFWGCAKNRISGFLHWGLNQFPGEMNPYEG 458
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 589
+ P+ P GD L YPGE P +RLE
Sbjct: 459 TSCPN-HTGIGTNFPCGDSFLIYPGE-----EGPRMGMRLE 493
>gi|295798170|emb|CAX69035.1| Putative uncharacterized protein precursor [uncultured bacterium]
Length = 563
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 80/352 (22%), Positives = 123/352 (34%), Gaps = 46/352 (13%)
Query: 269 SVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFG--VRHGSD----EWYEALDQHFK 322
S V L VWDF LP TPS+ G ++ + V+ G D +W +Q +
Sbjct: 182 SKTVSARLKVWDFALPQTPSMQTSFGSPAGRMKSWYANHVKVGKDAPIKDWTAVEEQCAQ 241
Query: 323 WLLQYRISPFF-CRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSP-----VL 376
L ++RI+ E + T + + F D R A+ S ++
Sbjct: 242 LLAEHRINATPPDELLEPQKQGDGTWRISEEKLNALGQFID-RYHVNALDVSKNFIFGII 300
Query: 377 SSNDGAKDYVRKEIELLRTKAHWKKA-----YFYLWDEPLNMEHYSSVRNMASELHAYAP 431
D A+D +R ++ A Y YL DEP + E Y VR +
Sbjct: 301 KDPDAARDEIRTRLKAFEMAAKQLNRPNLLFYVYLTDEPNDPEAYDYVRKWGKAIKEANS 360
Query: 432 DARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTEL 491
+V+ T P G + P F L +
Sbjct: 361 VVKVMITEQSTPQKTEWGDLYGAVDIWCPLF-------------------PLFEQGNAAR 401
Query: 492 QPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV 551
+ GE W Y + +P P WH+ +R W W+ G LYWG + T
Sbjct: 402 RQALGETVWAYTALCQRNPTPWWHIDYPLLNYRVPAWISWRYRIRGLLYWGGMSFWNETG 461
Query: 552 PSAEIRFRRG------LPPGDGVLFYPGEVFSSSRQPVA-SLRLERILSGLQ 596
+ G + G+G L YPG + +A SLRL+ + G++
Sbjct: 462 DPWRDAWTYGHKKSMLVYNGEGTLVYPGR--KAGYDGIAPSLRLKALRDGIE 511
>gi|410098375|ref|ZP_11293353.1| hypothetical protein HMPREF1076_02531 [Parabacteroides goldsteinii
CL02T12C30]
gi|409222249|gb|EKN15194.1| hypothetical protein HMPREF1076_02531 [Parabacteroides goldsteinii
CL02T12C30]
Length = 562
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 48/108 (44%), Gaps = 14/108 (12%)
Query: 495 NGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-------YE 547
G+E W Y C+ P N L + R + W +K G TG+L+WG N Y+
Sbjct: 411 KGDEVWFYTCLAPQGDFANRFLEQPLIKTRLIHWLNYKYGATGYLHWGFNQWFSDNDPYK 470
Query: 548 KATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGL 595
+ T + E LP GD + YP + + S+RLE + G+
Sbjct: 471 ETTTMNTES--GNTLPGGDSWIVYP-----DNGKLYGSIRLEAMRDGI 511
>gi|375308735|ref|ZP_09774018.1| hypothetical protein WG8_2543 [Paenibacillus sp. Aloe-11]
gi|375079362|gb|EHS57587.1| hypothetical protein WG8_2543 [Paenibacillus sp. Aloe-11]
Length = 570
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 69/310 (22%), Positives = 106/310 (34%), Gaps = 70/310 (22%)
Query: 286 TPSLPAVIGISDTVIE------DRFGVRHGSDEWYEALDQHFKW--------LLQYRISP 331
TP L IG T I+ D G R G D+ LD KW + + ++
Sbjct: 233 TPPLDTFIGNERTTIQLVDVSYDVNGYRFGFDK----LD---KWVQISEAVGITHFEMAH 285
Query: 332 FFCRWGESM--RVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKE 389
F +WG +++ P +DP ++ + P L+
Sbjct: 286 LFSQWGAKYAPKIIVEVGGVPEQRFGWHTPANDPEFRSFLAAFLPALT------------ 333
Query: 390 IELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLG 449
E L +++ F++ DEP +A LH Y + + Y G
Sbjct: 334 -ERLHQLGIAERSLFHISDEP-----------VAGNLHTYLEAKQFVAPYLEG------- 374
Query: 450 PTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSD 509
P + +F R + V G+ D + E W Y C G +
Sbjct: 375 -FPIIDAISDVEFYRRG----IIDQPVAGS------DTIHNFIDEGASNLWVYYCCGQNL 423
Query: 510 PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPP 564
N L M S++R + +++K GFL+WG N Y K P + P
Sbjct: 424 HVSNRFLAMPSSRNRILGVQMYKYRIKGFLHWGFNFYNSQYSLKKLNPYVDTAALDTFPS 483
Query: 565 GDGVLFYPGE 574
GD L YP E
Sbjct: 484 GDSFLVYPSE 493
>gi|383452115|ref|YP_005366104.1| hypothetical protein COCOR_00091 [Corallococcus coralloides DSM
2259]
gi|380727263|gb|AFE03265.1| hypothetical protein COCOR_00091 [Corallococcus coralloides DSM
2259]
Length = 645
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 87/221 (39%), Gaps = 31/221 (14%)
Query: 393 LRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTP 452
++ K +AY L DEP ++ V + AP R + T + L
Sbjct: 296 MKAKGWLDRAYVQLGDEPPYGTPFAQVHATGELVRQAAPGLRTMLTT----NSRELKANG 351
Query: 453 FESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCM------ 505
E V L H + T G++ + T G W Y CM
Sbjct: 352 LEDAVDTAVPLVNH--LDGTDANFRGDQ----RGTYTRFLERPGTALWMYQSCMSHGCAY 405
Query: 506 GPSDPH-------PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRF 558
G + P P++ L ++ RA+ W + +G TG LY+ + +T + + RF
Sbjct: 406 GTNAPENKPGAGWPSYMLDRSAAKARAMEWVTFLQGATGELYY-QSVGMLSTAWTDQYRF 464
Query: 559 RRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
GDG LFYPG + + PVASLRL+ I G+Q
Sbjct: 465 NGN---GDGTLFYPGTPEAIGGKTDVPVASLRLKLIRQGMQ 502
>gi|354580446|ref|ZP_08999351.1| hypothetical protein PaelaDRAFT_0452 [Paenibacillus lactis 154]
gi|353202877|gb|EHB68326.1| hypothetical protein PaelaDRAFT_0452 [Paenibacillus lactis 154]
Length = 554
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 45/105 (42%), Gaps = 10/105 (9%)
Query: 497 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATV 551
E WTY C N + +++R + +++K GFL+WG N + +A
Sbjct: 396 EGLWTYYCCSQYKEVSNRFFNLPSARNRILGMQLYKYNIEGFLHWGYNFWYSQYSRRAID 455
Query: 552 PSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
P G P GD L YPGE PV S+RL+ LQ
Sbjct: 456 PYRVTDADSGFPSGDAFLVYPGE-----DGPVESIRLKVFHEALQ 495
>gi|395204636|ref|ZP_10395576.1| hypothetical protein PA08_1303 [Propionibacterium humerusii P08]
gi|328907298|gb|EGG27064.1| hypothetical protein PA08_1303 [Propionibacterium humerusii P08]
Length = 590
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 41/96 (42%), Gaps = 5/96 (5%)
Query: 483 LVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 542
+ D V + + + E W Y C+ N + + RA+ W++WK G GFL+WG
Sbjct: 390 VATDAVLDFRRDGIEPAWVYHCVAQDVGVSNRFIAQESVRTRALGWQLWKFGVKGFLHWG 449
Query: 543 ANCYEKATV-----PSAEIRFRRGLPPGDGVLFYPG 573
N Y P A+ G GD + YPG
Sbjct: 450 FNFYYGQLSVCPIDPFADTSAGGGFISGDAFIVYPG 485
>gi|422439957|ref|ZP_16516771.1| conserved hypothetical protein [Propionibacterium acnes HL037PA3]
gi|422471082|ref|ZP_16547582.1| conserved hypothetical protein [Propionibacterium acnes HL037PA2]
gi|422573949|ref|ZP_16649509.1| conserved hypothetical protein [Propionibacterium acnes HL044PA1]
gi|313837143|gb|EFS74857.1| conserved hypothetical protein [Propionibacterium acnes HL037PA2]
gi|314927836|gb|EFS91667.1| conserved hypothetical protein [Propionibacterium acnes HL044PA1]
gi|314971914|gb|EFT16012.1| conserved hypothetical protein [Propionibacterium acnes HL037PA3]
Length = 493
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 41/96 (42%), Gaps = 5/96 (5%)
Query: 483 LVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG 542
+ D V + + + E W Y C+ N + + RA+ W++WK G GFL+WG
Sbjct: 293 VATDAVLDFRRDGIEPAWVYHCVAQDVGVSNRFIAQESVRTRALGWQLWKFGVKGFLHWG 352
Query: 543 ANCYEKATV-----PSAEIRFRRGLPPGDGVLFYPG 573
N Y P A+ G GD + YPG
Sbjct: 353 FNFYYGQLSVCPIDPFADTSAGGGFISGDAFIVYPG 388
>gi|223940729|ref|ZP_03632566.1| conserved hypothetical protein [bacterium Ellin514]
gi|223890585|gb|EEF57109.1| conserved hypothetical protein [bacterium Ellin514]
Length = 359
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 77/193 (39%), Gaps = 20/193 (10%)
Query: 419 VRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLR---PHTQIYCTSEW 475
++ E H + +L Y SD P G + E++ + + L P ++
Sbjct: 120 LKQFLPEFHDFLAKENILEDSYFHLSDEP-GASHVENYKRARQVLHELAPWMKVMDALSD 178
Query: 476 VLGNRE---DLVKDIVTELQPENGEE--WWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRV 530
+ R+ D+ +V Q E+ W Y C P P N + + R W
Sbjct: 179 IQYGRQGLTDMPIPMVNSAQAYIDEKIPHWVYYCCAPQGPWLNRFMDTPLPKVRMAGWTF 238
Query: 531 WKEGGTGFLYWGANCYEK-----ATVPSAE--IRFRRGLPPGDGVLFYPGEVFSSSRQPV 583
++ G GFL+WG N + K T P + + G+P GD + YPG + QP+
Sbjct: 239 YRLGAKGFLHWGFNYWHKIEQEVVTDPLTDGCVSAWPGIPYGDPFVIYPG----ADGQPM 294
Query: 584 ASLRLERILSGLQ 596
S+R E LQ
Sbjct: 295 DSIRWEVFAESLQ 307
>gi|317048122|ref|YP_004115770.1| hypothetical protein Pat9b_1898 [Pantoea sp. At-9b]
gi|316949739|gb|ADU69214.1| conserved hypothetical protein [Pantoea sp. At-9b]
Length = 556
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 9/102 (8%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
WTY C N + ++R + ++W GFL+WG N Y +A P A
Sbjct: 402 WTYYCCAQYLDVANRFMAQPSVRNRILGVQLWLYRIEGFLHWGFNFYNSELSREAIDPFA 461
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
+ P GD L YPG+ F+ P+ S+RL+ +L +Q
Sbjct: 462 VTDGLQAFPAGDPFLVYPGKDFT----PLPSIRLKVLLEAMQ 499
>gi|153003376|ref|YP_001377701.1| hypothetical protein Anae109_0503 [Anaeromyxobacter sp. Fw109-5]
gi|152026949|gb|ABS24717.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
Length = 608
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 133/350 (38%), Gaps = 50/350 (14%)
Query: 269 SVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWL-LQY 327
S V ++LTVW F LP+T SL + G + I GV G+ + + AL + + L L +
Sbjct: 172 SATVPVTLTVWPFTLPSTASLKSAFGFTYGAIPGGHGV--GAADAFAALRERYGRLALDH 229
Query: 328 RISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVR 387
RI+ G + + PA + R+ +Y + Y S Y
Sbjct: 230 RITLSHVDDGSAAIDHAASLYGPAMDGAAPTALRGARMTSYELLYDAKSWST-----YFD 284
Query: 388 KEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAP 447
E L R + Y DEP +S + A+ A + R L T +DA
Sbjct: 285 GEGWLDRL-------FQYTCDEPPLTCAWSDIPARAATARAA--NVRTLVTTSIQEADAQ 335
Query: 448 LGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTY-VCMG 506
+ V V +L Y GN+ D P E W Y CM
Sbjct: 336 GVTGSIDVIVPVINYLDDREGTYA------GNQR-AKYDAFLAGSPR--RELWAYQSCMS 386
Query: 507 P-------------SDPH----PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA 549
SD + P++ + ++RA+ W ++ TG LYW
Sbjct: 387 HGCGGTVNFGSPSWSDRYFTGWPSYMIDASAVRNRAMEWLSFRYRVTGELYWETAYAYSH 446
Query: 550 TVPSAEIRFRRGLPPGDGVLFYPG---EVFSSSRQPVASLRLERILSGLQ 596
+ + F GDG LFYPG ++ ++ PVAS+RL+ I G++
Sbjct: 447 DAWTNQWDFNGN---GDGTLFYPGTPAKIGGTTHVPVASIRLKMIREGME 493
>gi|374604784|ref|ZP_09677736.1| hypothetical protein PDENDC454_17493 [Paenibacillus dendritiformis
C454]
gi|374389614|gb|EHQ60984.1| hypothetical protein PDENDC454_17493 [Paenibacillus dendritiformis
C454]
Length = 536
Score = 47.8 bits (112), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 14/130 (10%)
Query: 475 WVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEG 534
WV N++ + E ++G+ W Y C P + N L + R + W + G
Sbjct: 358 WVPTNKDYELNRDAYEAYRQSGDALWFYTCWNPGGEYLNRFLDFPLLKTRYLHWGNYLYG 417
Query: 535 GTGFLYWGANCYEKATVP--------SAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASL 586
G+L+WG N Y P + ++ RR +P GD + YPG+ P+ S+
Sbjct: 418 LDGYLHWGFNYYFPDQDPMELTNPLLAPDVHDRR-VPAGDTHIVYPGD-----GGPMLSM 471
Query: 587 RLERILSGLQ 596
RLE + +G++
Sbjct: 472 RLEAMRAGVE 481
>gi|331085877|ref|ZP_08334960.1| hypothetical protein HMPREF0987_01263 [Lachnospiraceae bacterium
9_1_43BFAA]
gi|330406800|gb|EGG86305.1| hypothetical protein HMPREF0987_01263 [Lachnospiraceae bacterium
9_1_43BFAA]
Length = 562
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 42/101 (41%), Gaps = 13/101 (12%)
Query: 496 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-------YEK 548
GE WTYVC GP N L + R + W K +GFL+WG N YE
Sbjct: 399 GETVWTYVCCGPEGHWLNRFLDFALLKGRMLFWGCAKNRISGFLHWGLNQFPGGMNPYEG 458
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 589
+ P+ P GD L YPG+ P +RLE
Sbjct: 459 TSCPN-HTGIGTNFPCGDSFLIYPGK-----EGPRMGMRLE 493
>gi|374324118|ref|YP_005077247.1| hypothetical protein HPL003_21470 [Paenibacillus terrae HPL-003]
gi|357203127|gb|AET61024.1| hypothetical protein HPL003_21470 [Paenibacillus terrae HPL-003]
Length = 570
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 38/94 (40%), Gaps = 5/94 (5%)
Query: 486 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 545
D + E W Y C G + N L M S++R + +++K GFL+WG N
Sbjct: 400 DTIHNFIDEGASNLWVYYCCGQNLHVSNRFLAMPSSRNRILGVQMYKYRIKGFLHWGFNF 459
Query: 546 YE-----KATVPSAEIRFRRGLPPGDGVLFYPGE 574
Y K P + P GD L YP E
Sbjct: 460 YNSQYSLKKLNPYVDTAALDTFPSGDSFLVYPSE 493
>gi|336426499|ref|ZP_08606509.1| hypothetical protein HMPREF0994_02515 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336010934|gb|EGN40914.1| hypothetical protein HMPREF0994_02515 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 562
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 43/102 (42%), Gaps = 10/102 (9%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV-----PSA 554
WTY C G N M ++R + +++K GFL WG N + P
Sbjct: 404 WTYYCCGQFREVSNRFFCMPSQRNRILGVQLYKYQIHGFLQWGFNFWNSMLSRYPINPYC 463
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
P GD L YPGE P+AS+R E ++ GLQ
Sbjct: 464 VTDAACAFPSGDASLVYPGE-----DGPIASIRAEVLMEGLQ 500
>gi|329927960|ref|ZP_08281988.1| hypothetical protein HMPREF9412_1819 [Paenibacillus sp. HGF5]
gi|328938179|gb|EGG34575.1| hypothetical protein HMPREF9412_1819 [Paenibacillus sp. HGF5]
Length = 554
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 10/102 (9%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
WTY C N + +++R + +++K GFL+WG N + ++A P
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRIIGIQLYKFNIEGFLHWGYNFWNSQYSKRAIDPFK 458
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
G P GD + YPGE P+ S+RL+ LQ
Sbjct: 459 VTDADCGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQ 495
>gi|261408450|ref|YP_003244691.1| hypothetical protein GYMC10_4664 [Paenibacillus sp. Y412MC10]
gi|261284913|gb|ACX66884.1| conserved hypothetical protein [Paenibacillus sp. Y412MC10]
Length = 554
Score = 47.0 bits (110), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 10/102 (9%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSA 554
WTY C N + +++R + +++K GFL+WG N + ++A P
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRIIGIQLYKFNIEGFLHWGYNFWNSQYSKRAIDPFK 458
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
G P GD + YPGE P+ S+RL+ LQ
Sbjct: 459 VTDADCGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQ 495
>gi|365133220|ref|ZP_09342604.1| hypothetical protein HMPREF1032_00400 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363616030|gb|EHL67484.1| hypothetical protein HMPREF1032_00400 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 526
Score = 46.2 bits (108), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 7/114 (6%)
Query: 485 KDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGAN 544
++ T LQ GEE W Y C P+ P N + + + R V+W +GFL+WG N
Sbjct: 363 REEYTALQAA-GEEMWFYTCAFPAGPAMNRSMDLPLAVSRTVLWMGALYRLSGFLHWGFN 421
Query: 545 CYEKATVPSAEIRFRRG--LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
Y + + +G LP GD + YPG+ +P S+R E +G +
Sbjct: 422 YYIGDDIWHSACCPHKGALLPAGDAHIVYPGK----DGRPWRSMRFEAQRAGAE 471
>gi|116623984|ref|YP_826140.1| hypothetical protein Acid_4896 [Candidatus Solibacter usitatus
Ellin6076]
gi|116227146|gb|ABJ85855.1| hypothetical protein Acid_4896 [Candidatus Solibacter usitatus
Ellin6076]
Length = 543
Score = 46.2 bits (108), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 62/165 (37%), Gaps = 34/165 (20%)
Query: 55 VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDL 114
VW PS VG + + ++L AAR E ES QI V+ +S G V + SDL
Sbjct: 34 VWTAPSMQRVGMTDPAGSVSDVSLAAARGEYESFQI-----VANGASKGLGNVNLTVSDL 88
Query: 115 CSASGDRLVVGQSLMLRRVV---------------PMLG--VPDALVPLD-------LPV 150
G + G + R PM PDAL+P L
Sbjct: 89 EGPDGKVIPHGNFTLYREKYMHVTSPSPNWKGSNQPMGAGWYPDALIPFTDPDTGKPLSG 148
Query: 151 CQISLIP-----GETTAVWVSIDAPYAQPPGLYEGEIIITSKADT 190
+IS +P G VWV + P G Y+G +TS T
Sbjct: 149 AKISAVPFDVKAGNNQPVWVDLLVPQTAQAGTYKGTYTVTSNEGT 193
>gi|315648573|ref|ZP_07901671.1| hypothetical protein PVOR_25096 [Paenibacillus vortex V453]
gi|315276052|gb|EFU39399.1| hypothetical protein PVOR_25096 [Paenibacillus vortex V453]
Length = 554
Score = 46.2 bits (108), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 44/102 (43%), Gaps = 10/102 (9%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSA 554
WTY C N + +++R + +++K GFL+WG N + +A P
Sbjct: 399 WTYYCCSQYKEVSNRFFNLPSARNRILGIQLYKYNIEGFLHWGYNFWNSQYSRRAIDPFQ 458
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
G P GD + YPGE P+ S+RL+ LQ
Sbjct: 459 VTDADGGFPSGDAFVVYPGE-----EGPIESIRLKVFQEALQ 495
>gi|227495338|ref|ZP_03925654.1| conserved hypothetical protein [Actinomyces coleocanis DSM 15436]
gi|226831208|gb|EEH63591.1| conserved hypothetical protein [Actinomyces coleocanis DSM 15436]
Length = 532
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 46/198 (23%), Positives = 75/198 (37%), Gaps = 39/198 (19%)
Query: 392 LLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT 451
L T+ + A+F++ DEP N +H + R A A+V+ L
Sbjct: 308 FLETEIGLEHAWFHVSDEP-NADHLEAYR---------AAKAQVVDLLAGTQVIDALSEP 357
Query: 452 PFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPH 511
F+ V +P V N+ D + + E P W Y C+
Sbjct: 358 EFQEVVDIPV--------------VATNKVDGFRAVGVE--PT-----WVYNCVAQDRLV 396
Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEK-----ATVPSAEIRFRRGLPPGD 566
N + RG++HR + ++++K G L+W N Y + P + G GD
Sbjct: 397 ANRFIAQRGTRHREIGFQLFKFNAKGILHWAFNFYNRQFSLGVLDPYKDTAAGGGFLSGD 456
Query: 567 GVLFYP---GEVFSSSRQ 581
+ YP G+V+ S R
Sbjct: 457 SFVVYPVADGKVYESLRH 474
>gi|331085437|ref|ZP_08334522.1| hypothetical protein HMPREF0987_00825 [Lachnospiraceae bacterium
9_1_43BFAA]
gi|330407675|gb|EGG87173.1| hypothetical protein HMPREF0987_00825 [Lachnospiraceae bacterium
9_1_43BFAA]
Length = 552
Score = 45.8 bits (107), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 9/97 (9%)
Query: 497 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATVPSA 554
E+ W Y C G + N + G + R + +++K GFL+WG N Y EK+ P
Sbjct: 392 EKLWGYYCTGQYEDVSNRFIVQPGYRTRILGVQMYKYQLDGFLHWGYNFYNSEKSLYPID 451
Query: 555 EIRFRR---GLPPGDGVLFYPGEVFSSSRQPVASLRL 588
R P GD L YPG + R+P S+RL
Sbjct: 452 PYRCTDASGAFPSGDPFLVYPG----ADRKPEESIRL 484
>gi|186681830|ref|YP_001865026.1| hypothetical protein Npun_F1375 [Nostoc punctiforme PCC 73102]
gi|186464282|gb|ACC80083.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 543
Score = 45.4 bits (106), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 73/172 (42%), Gaps = 40/172 (23%)
Query: 55 VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDL 114
++ +PS +G E + AA+ E ESVQ+ ++ + SS V + SDL
Sbjct: 42 LYMVPSLKRIGQTEKITNTSLSKIYAAKGEYESVQLVIK-----APSSGLTNVNISVSDL 96
Query: 115 CSASGDRLVVGQSLMLRR----------------VVPMLGV---PDALVPLDLPVCQISL 155
S ++++ ++ L R + P LGV PD L+P PV Q
Sbjct: 97 L-GSNNQIIPKNNITLYREHYVYVSHSSPNMRDNLNPPLGVGWYPDGLIPFLDPVTQKPP 155
Query: 156 IPGETTAV------------WVSIDAPYAQPPGLYEGEIIITS---KADTEL 192
+ GE AV WV + P G Y G+ I+TS KA++++
Sbjct: 156 LTGELKAVPFRLQSQYNQPIWVDVFVPRNAKSGEYTGKFIVTSDQGKAESKI 207
>gi|197121796|ref|YP_002133747.1| hypothetical protein AnaeK_1387 [Anaeromyxobacter sp. K]
gi|196171645|gb|ACG72618.1| Myxococcales GC_trans_RRR domain protein [Anaeromyxobacter sp. K]
Length = 609
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 571
P++ + S++RA+ W + E +G LYW +A S + F GDG LFY
Sbjct: 412 PSYMVDASASRNRAMEWITFLERASGELYWETAYSFRADPWSRQWDFSGN---GDGTLFY 468
Query: 572 PGE---VFSSSRQPVASLRLERILSGLQ 596
PG+ + + PVAS+RL+ I +G+Q
Sbjct: 469 PGKPARIGGKTDVPVASVRLKMIRAGMQ 496
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 58/141 (41%), Gaps = 25/141 (17%)
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALR-------PKVSWSSSSTAGVVQVQCSD 113
T + P RP +L AARNE + Q+ + +V A + +V D
Sbjct: 35 TEKIRPDAKARPQTEAHLSAARNEFAAFQVVVTGPAKRVTARVEGLDGMDATLFRVDTLD 94
Query: 114 LCSASGDRLVVGQSLMLRRVVPMLGVPDALVPL--DLPVCQISLIP----GETTAVWVSI 167
+ S S G+ PDALVP D+ Q + P E+ AVWV +
Sbjct: 95 VTSPSAVDGGTGR------------WPDALVPDVDDVVGEQRNAFPFDVGTESRAVWVDV 142
Query: 168 DAPYAQPPGLYEGEIIITSKA 188
P G+Y+G ++I+S A
Sbjct: 143 HVPADARSGVYQGAVVISSDA 163
>gi|220916589|ref|YP_002491893.1| hypothetical protein A2cp1_1483 [Anaeromyxobacter dehalogenans
2CP-1]
gi|219954443|gb|ACL64827.1| Myxococcales GC_trans_RRR domain protein [Anaeromyxobacter
dehalogenans 2CP-1]
Length = 609
Score = 45.4 bits (106), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 571
P++ + S++RA+ W + E +G LYW +A S + F GDG LFY
Sbjct: 412 PSYMVDASASRNRAMEWITFLERASGELYWETAYSFRADPWSRQWDFSGN---GDGTLFY 468
Query: 572 PGE---VFSSSRQPVASLRLERILSGLQ 596
PG+ + + PVAS+RL+ I +G+Q
Sbjct: 469 PGKPARIGGKTDVPVASVRLKMIRAGMQ 496
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 60/147 (40%), Gaps = 25/147 (17%)
Query: 55 VWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALR-------PKVSWSSSSTAGVV 107
W +T + P RP +L AARNE + Q+ + +V A +
Sbjct: 29 AWVASATEKIRPDAKARPQTEAHLSAARNEFAAFQVVVTGPAKRVTARVEGLDGMDATLF 88
Query: 108 QVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPL--DLPVCQISLIP----GETT 161
+V D+ S S G+ PDALVP D+ Q + P E+
Sbjct: 89 RVDTLDVTSPSAVDGGTGR------------WPDALVPDVDDVVGEQRNAFPFDVGAESR 136
Query: 162 AVWVSIDAPYAQPPGLYEGEIIITSKA 188
AVWV + P G+Y+G ++I+S A
Sbjct: 137 AVWVDVHVPADARSGVYQGAVVISSDA 163
>gi|335045263|ref|ZP_08538286.1| hypothetical protein HMPREF9124_2064 [Oribacterium sp. oral taxon
108 str. F0425]
gi|333759049|gb|EGL36606.1| hypothetical protein HMPREF9124_2064 [Oribacterium sp. oral taxon
108 str. F0425]
Length = 556
Score = 45.4 bits (106), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 92/213 (43%), Gaps = 17/213 (7%)
Query: 400 KKAYFYLWDEP-LNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESF-- 456
++ + WD P + E+ ++ EL A+ + +L Y SD P +SF
Sbjct: 292 RECKLFSWDSPAVGGEYTEFLKVFLPELKAFLKEENILENSYFHISDEP-NEDNMDSFGA 350
Query: 457 --VKVPKFLRPHTQIYCTSEWVLGNREDLVKDIV--TELQP--ENG-EEWWTYVCMGPSD 509
V + L + S + + R + + +V ++P E G + W Y C G +
Sbjct: 351 AVESVRELLADCKVMDALSSFEIYRRGYVQRPVVAVNHIEPFVEAGVKNLWAYYCTGQAV 410
Query: 510 PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATV---PSAEIRFRRGLPP 564
PN + M +++R + + GFL+WG N Y EK+ P A P
Sbjct: 411 DVPNRFIVMPSARNRILGVLCYIYQVEGFLHWGFNFYNSEKSIEHIDPYAVTDAGEAFPS 470
Query: 565 GDGVLFYPGEVFSSSRQPVASLRLERILSGLQV 597
GD + YPG+ ++ + + S+ LE LS ++V
Sbjct: 471 GDPFIVYPGKD-GTAYESMRSVVLEEALSDIRV 502
>gi|453063163|gb|EMF04147.1| hypothetical protein F518_19198 [Serratia marcescens VGH107]
Length = 556
Score = 45.4 bits (106), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 42/102 (41%), Gaps = 9/102 (8%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 554
W Y C N +++R + +++ GFL+WG N Y A P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGVQLYLYRIAGFLHWGFNFYNSAHSRERINPYA 459
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
P GD + YPGE QPV SLRL + GLQ
Sbjct: 460 VTDSGHAFPSGDPFVVYPGE----DLQPVESLRLRVLHQGLQ 497
>gi|448242314|ref|YP_007406367.1| hypothetical protein SMWW4_v1c25510 [Serratia marcescens WW4]
gi|445212678|gb|AGE18348.1| hypothetical protein SMWW4_v1c25510 [Serratia marcescens WW4]
Length = 556
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 42/102 (41%), Gaps = 9/102 (8%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 554
W Y C N +++R + +++ GFL+WG N Y A P A
Sbjct: 400 WAYYCCVQKTEVANRFFAQPSARNRILGVQLYLYRIAGFLHWGFNFYNSAHSRERINPYA 459
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
P GD + YPGE QPV SLRL + GLQ
Sbjct: 460 VTDSGHAFPSGDPFVVYPGE----DLQPVESLRLRVLHQGLQ 497
>gi|363898123|ref|ZP_09324658.1| hypothetical protein HMPREF9624_01220 [Oribacterium sp. ACB7]
gi|361956490|gb|EHL09805.1| hypothetical protein HMPREF9624_01220 [Oribacterium sp. ACB7]
Length = 557
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 93/212 (43%), Gaps = 15/212 (7%)
Query: 400 KKAYFYLWDEP-LNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPT--PFESF 456
++ + WD P + E+ ++ EL A+ + +L Y SD P F +
Sbjct: 293 RECKLFSWDSPAVGGEYTEFLKIFLPELKAFLKEENILGNSYFHISDEPNEDNMDSFGAA 352
Query: 457 VKVPKFLRPHTQIY-CTSEWVLGNREDLVKDIV--TELQP--ENG-EEWWTYVCMGPSDP 510
V+ + L ++ S + + R + + +V ++P E G + W Y C G +
Sbjct: 353 VESVRALLADCKVMDALSSFEIYRRGYVQRPVVAVNHIEPFVEAGVKNLWAYYCTGQAVD 412
Query: 511 HPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY--EKATV---PSAEIRFRRGLPPG 565
PN + M +++R + + GFL+WG N Y EK+ P A P G
Sbjct: 413 VPNRFIVMPSARNRILGVLCYIYQVEGFLHWGFNFYNSEKSIEHIDPYAVTDAGEAFPSG 472
Query: 566 DGVLFYPGEVFSSSRQPVASLRLERILSGLQV 597
D + YPG+ ++ + + S+ LE LS ++V
Sbjct: 473 DPFIVYPGKD-GTAYESMRSVVLEEALSDIRV 503
>gi|333994386|ref|YP_004526999.1| hypothetical protein TREAZ_0869 [Treponema azotonutricium ZAS-9]
gi|333734735|gb|AEF80684.1| conserved hypothetical protein [Treponema azotonutricium ZAS-9]
Length = 604
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 51/116 (43%), Gaps = 9/116 (7%)
Query: 486 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 545
D +T + W Y C+G S PN + + + RA+ ++ GFL WG N
Sbjct: 436 DAITPFLEAGIKNLWVYYCVGQSRRVPNRFIALPSPRTRAMGVLMYLYNIAGFLQWGYNY 495
Query: 546 YEKATVPSAEIRFRR--GL---PPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
Y A S + + GL P GD L YPG + +PV+S+ E GL+
Sbjct: 496 YYSALSKSLVDPYLKTGGLKDWPGGDPFLVYPG----ADGKPVSSIHAEAHREGLE 547
>gi|383814347|ref|ZP_09969768.1| hypothetical protein SPM24T3_08339 [Serratia sp. M24T3]
gi|383296757|gb|EIC85070.1| hypothetical protein SPM24T3_08339 [Serratia sp. M24T3]
Length = 517
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 44/102 (43%), Gaps = 9/102 (8%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATV-----PSA 554
WTY C N + ++R + +++ TGFL+WG N Y P A
Sbjct: 354 WTYYCCVQKLEVSNRFFALPSYRNRIIGVQLYLYSITGFLHWGFNFYNSGHSREHLDPFA 413
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
+ P GD + YPG+ + QP+ SLRL + LQ
Sbjct: 414 ITDGQGAFPSGDLFVVYPGQDY----QPIESLRLMVLREALQ 451
>gi|354580346|ref|ZP_08999251.1| hypothetical protein PaelaDRAFT_0352 [Paenibacillus lactis 154]
gi|353202777|gb|EHB68226.1| hypothetical protein PaelaDRAFT_0352 [Paenibacillus lactis 154]
Length = 552
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 56/223 (25%), Positives = 89/223 (39%), Gaps = 51/223 (22%)
Query: 385 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPS 444
++++ ++L+R + +F++ DEP ++EH + R A +
Sbjct: 311 FLKELVQLIRGLGIEDRIFFHVSDEP-HLEHLETYRKAAEIV------------------ 351
Query: 445 DAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIV---TELQP--ENG-EE 498
D +G P +I S++ +E LV + + +LQP E+G
Sbjct: 352 DVAVGDYP---------------RIDALSDYAF-YKEGLVPNPIPATDKLQPFLESGVAP 395
Query: 499 WWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC----YEKATV-PS 553
WTY C N ++R + +++K GFL+WG N Y K V P
Sbjct: 396 LWTYYCCSQYKQVANRFFSFPSERNRILGLQLYKYRIKGFLHWGFNFWNSQYSKRPVNPY 455
Query: 554 AEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
G P GD L YPGE PV SLR++ LQ
Sbjct: 456 LTTDADIGYPSGDAFLVYPGE-----DGPVCSLRMKVFREALQ 493
>gi|365132098|ref|ZP_09342072.1| hypothetical protein HMPREF1032_03868 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363617409|gb|EHL68801.1| hypothetical protein HMPREF1032_03868 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 547
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 67/195 (34%), Gaps = 22/195 (11%)
Query: 385 YVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPS 444
+V+ E LR K F++ DEP H+ + ++ + Y A +L Y
Sbjct: 297 FVQALAEFLRKYGWQDKVVFHIHDEP--DIHFKNEASLLARKRQYYLAAGILRKY----- 349
Query: 445 DAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVC 504
L V P+F R I WV G + + GE W YVC
Sbjct: 350 ---LPNVRVIEAVASPEF-RGGVDI-----WVPGTPGYEARQADFDALTALGESVWAYVC 400
Query: 505 MGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSA------EIRF 558
GP N L + R + W GFL+WG N + P A
Sbjct: 401 CGPEGNWLNRFLDFALLKGRLLFWGCAANRLGGFLHWGFNQFPAGMDPFAGTSCPNHTGI 460
Query: 559 RRGLPPGDGVLFYPG 573
P GD L YPG
Sbjct: 461 GTNFPCGDSFLVYPG 475
>gi|424868246|ref|ZP_18292005.1| hypothetical protein C75L2_00760029 [Leptospirillum sp. Group II
'C75']
gi|124515950|gb|EAY57459.1| protein of unknown function [Leptospirillum rubarum]
gi|387221464|gb|EIJ76022.1| hypothetical protein C75L2_00760029 [Leptospirillum sp. Group II
'C75']
Length = 681
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 96/248 (38%), Gaps = 38/248 (15%)
Query: 373 SPVLSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VR 420
SPV S G D + + L + HWK K + Y+ DEP++ +Y + +
Sbjct: 388 SPV-SDWKGVPDIATQNLAKLIVQ-HWKEKGWPIDKTFAYIADEPVHKLYYYADTYKLIA 445
Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
A LH +P V+ T D P +++ V K + + + W G
Sbjct: 446 KNADSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GAS 492
Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
+ + E Q E G+ W Y GP N L G R W WK G Y
Sbjct: 493 AQFIPSRMQERQKE-GDHVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFY 550
Query: 541 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLERILSGL 595
W A+ + P+ +GL GDG + YPG + P+ S+R+ + G
Sbjct: 551 W-ADTFWNDNKPNMNPYVNQGL--GDGTIMYPGTELHFIGYPDIHGPIPSIRMAQWRRGY 607
Query: 596 Q-VRWICY 602
+ R++ Y
Sbjct: 608 EDYRYLTY 615
>gi|410478328|ref|YP_006765965.1| hypothetical protein LFML04_0771 [Leptospirillum ferriphilum ML-04]
gi|406773580|gb|AFS53005.1| hypothetical protein LFML04_0771 [Leptospirillum ferriphilum ML-04]
Length = 681
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 96/248 (38%), Gaps = 38/248 (15%)
Query: 373 SPVLSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VR 420
SPV S G D + + L + HWK K + Y+ DEP++ +Y + +
Sbjct: 388 SPV-SDWKGVPDIATQNLAKLIVQ-HWKEKGWPIDKTFAYIADEPVHKLYYYADTYKLIA 445
Query: 421 NMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNR 480
A LH +P V+ T D P +++ V K + + + W G
Sbjct: 446 KNADSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GAS 492
Query: 481 EDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLY 540
+ + E Q E G+ W Y GP N L G R W WK G Y
Sbjct: 493 AQFIPSRMQERQKE-GDHVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFY 550
Query: 541 WGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLERILSGL 595
W A+ + P+ +GL GDG + YPG + P+ S+R+ + G
Sbjct: 551 W-ADTFWNDNKPNMNPYVNQGL--GDGTIMYPGTELHFIGYPDIHGPIPSIRMAQWRRGY 607
Query: 596 Q-VRWICY 602
+ R++ Y
Sbjct: 608 EDYRYLTY 615
>gi|403220771|dbj|BAM38904.1| conserved hypothetical protein [Theileria orientalis strain
Shintoku]
Length = 210
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/149 (22%), Positives = 68/149 (45%), Gaps = 23/149 (15%)
Query: 337 GESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTK 396
G+++++L + P DH +E+F D Y + + S+D +++ V I +++
Sbjct: 11 GQNLQILKFLFSIPNDHV--NEHFDDK----YVREFHRLDDSSDNSEELVTARI-IIKLI 63
Query: 397 AHWKKAYFYLWDEPL--NMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFE 454
H + + + D+ + NME ++ +R+++ ELH Y+ P G E
Sbjct: 64 KHEFEKFNLIRDQYITPNMERFTQIRHLSQELHPYSDT----------PCSTQAGCDKLE 113
Query: 455 SFVKVPKFLRPHTQ----IYCTSEWVLGN 479
+ + ++R T I+ T VLGN
Sbjct: 114 MLMNLCSYIRGGTSFAYDIFATMVHVLGN 142
>gi|225571626|ref|ZP_03780622.1| hypothetical protein CLOHYLEM_07724 [Clostridium hylemonae DSM
15053]
gi|225159703|gb|EEG72322.1| hypothetical protein CLOHYLEM_07724 [Clostridium hylemonae DSM
15053]
Length = 557
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 51/211 (24%), Positives = 81/211 (38%), Gaps = 21/211 (9%)
Query: 400 KKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKV 459
K+ + W P E+ + EL A + + T Y SD P P +++ +
Sbjct: 291 KEEKIFGWHTPAVGEYTRFLHAFLPELTARLKEWGIDTVTYFHLSDEP-RPDDLDTYRQA 349
Query: 460 PKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGE----------EWWTYVCMGPSD 509
+ + + Y T + L + E +V + P N E + WTY C+G
Sbjct: 350 KESVADLLKGYHTFD-ALSSYEFYRHGLVDKPIPGNNEIDEFLEHGLTDMWTYYCVGQYL 408
Query: 510 PHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE-----KATVPSAEIRFRRGLPP 564
N + M ++R +++K G L+WG N Y + P G P
Sbjct: 409 EVSNRFMSMPSLRNRIYGLQLYKYDIIGILHWGYNFYNSQFSLEHINPYETTDAGGGFPA 468
Query: 565 GDGVLFYPGEVFSSSRQPVASLRLERILSGL 595
GD L YPGE +P S+R+ GL
Sbjct: 469 GDPFLVYPGE----DGRPEESIRMMVHYEGL 495
>gi|284032103|ref|YP_003382034.1| carbohydrate binding family 6 [Kribbella flavida DSM 17836]
gi|283811396|gb|ADB33235.1| Carbohydrate binding family 6 [Kribbella flavida DSM 17836]
Length = 1437
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 90/221 (40%), Gaps = 36/221 (16%)
Query: 382 AKDYVRKEIELLRT----KAHWKKAYFYLWDEPLNMEHYSSVRNMASEL-HAYAPDARVL 436
A+++ R+ + L+T K + + Y + DEP + H + R EL +AP +
Sbjct: 602 AQNFARQYLSALKTHLVAKGWFTQWYQSVGDEPGSPAHAETWRRAVDELIKVHAPGMKTS 661
Query: 437 TTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENG 496
T Y PS G T FE + V H + T E KD Q G
Sbjct: 662 TPYIGPPS--TWGAT-FEGRLNV------HVPLLSTHE--------SAKDYFRGRQAL-G 703
Query: 497 EEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWG-ANCYEKATVPSAE 555
+E WTYVC P N + S R + W + G TG L+W +N E T+ +
Sbjct: 704 DEVWTYVCNRPLGAFYNRLIDQPLSAPRFMNWSNFANGVTGTLHWAYSNWKEDPTINAT- 762
Query: 556 IRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
PGD + YP V + ++LR + + G++
Sbjct: 763 --------PGDTAIVYPDPV---NNDVTSTLRHDAMRDGIE 792
>gi|302388391|ref|YP_003824213.1| hypothetical protein Closa_4081 [Clostridium saccharolyticum WM1]
gi|302199019|gb|ADL06590.1| conserved hypothetical protein [Clostridium saccharolyticum WM1]
Length = 552
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 44/102 (43%), Gaps = 9/102 (8%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE----KATVPSAE 555
W+Y C N + M +++RA +V+K G G L+WG N Y + + E
Sbjct: 399 WSYYCTAQCVDVSNRFMAMPSARNRAYGLQVYKYGMEGILHWGFNFYNSEHSRHHINPYE 458
Query: 556 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
+ G P GD L YPG S P S+RL + Q
Sbjct: 459 VTDCEGSFPSGDAFLVYPG----SDGIPEESIRLMVLCEAKQ 496
>gi|86158894|ref|YP_465679.1| hypothetical protein Adeh_2472 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85775405|gb|ABC82242.1| hypothetical protein Adeh_2472 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 609
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 48/88 (54%), Gaps = 6/88 (6%)
Query: 512 PNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFY 571
P++ + S++RA+ W + E +G LYW ++ +++ F GDG LFY
Sbjct: 412 PSYMIDASASRNRAMEWITFLERASGELYWETAYSFRSDPWTSQWDFSGN---GDGTLFY 468
Query: 572 PGE---VFSSSRQPVASLRLERILSGLQ 596
PG+ + + PVAS+R++ I +G+Q
Sbjct: 469 PGKPSRIGGKTDIPVASVRVKMIRAGMQ 496
Score = 38.9 bits (89), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 58/135 (42%), Gaps = 13/135 (9%)
Query: 61 TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
T + P RP +L AARNE + Q+ + + + +V+ D S S
Sbjct: 35 TEKIRPDAKARPQTEAHLAAARNEFAAFQVVV------TGPAKGVTARVEGLDGLSVSLF 88
Query: 121 RLVVGQSLMLRRVVPMLGV-PDALVPL--DLPVCQISLIP----GETTAVWVSIDAPYAQ 173
R+ V G PDALVP D+ + + P E+ AVWV + P
Sbjct: 89 RVETLNVTSPSAVDGGTGRWPDALVPDVDDVVGEKRNAFPFDVGSESRAVWVDVHVPAGA 148
Query: 174 PPGLYEGEIIITSKA 188
G+Y+G ++I+S A
Sbjct: 149 RSGIYQGAVVISSDA 163
>gi|403380550|ref|ZP_10922607.1| hypothetical protein PJC66_12097 [Paenibacillus sp. JC66]
Length = 555
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 47/102 (46%), Gaps = 9/102 (8%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-YEKATV----PSA 554
WTY C + N + M ++R + ++++K GFL+WG N Y + ++ P
Sbjct: 399 WTYYCCSQYEEVSNRFIDMPSWRNRILGFQLYKFQIRGFLHWGYNFWYSQYSIRPINPYQ 458
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQ 596
+ P GD + YPG S+ + V SLRL+ Q
Sbjct: 459 QTDANYAFPSGDPFVVYPG----SNGEAVLSLRLKVFYDAFQ 496
>gi|374373535|ref|ZP_09631195.1| hypothetical protein NiasoDRAFT_2351 [Niabella soli DSM 19437]
gi|373234508|gb|EHP54301.1| hypothetical protein NiasoDRAFT_2351 [Niabella soli DSM 19437]
Length = 576
Score = 43.1 bits (100), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 29/123 (23%), Positives = 54/123 (43%), Gaps = 14/123 (11%)
Query: 474 EWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKE 533
++ + ++ +I+ E Q + W C ++ +PN ++H + W +
Sbjct: 414 DYCIASKHQFPDNILKERQQQGKLSTWYTCC---TEKYPNGFTFSPPAEHVWIGWYTAAK 470
Query: 534 GGTGFLYWGANCYEKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILS 593
G+L W N + + P + RFR P GD YPG P +S+R E+++
Sbjct: 471 NMNGYLRWAYNSWVEH--PETDSRFR-SWPAGDTYQVYPG--------PASSIRFEKLIE 519
Query: 594 GLQ 596
G+Q
Sbjct: 520 GIQ 522
>gi|256420134|ref|YP_003120787.1| hypothetical protein Cpin_1088 [Chitinophaga pinensis DSM 2588]
gi|256035042|gb|ACU58586.1| hypothetical protein Cpin_1088 [Chitinophaga pinensis DSM 2588]
Length = 560
Score = 42.7 bits (99), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 19/109 (17%)
Query: 492 QPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC-----Y 546
+ +N E W YVC+ P +PN L + R + W +K TGF++WG N +
Sbjct: 415 RAQNKGELWYYVCVSPQYNYPNRFLENPLIKTRFLHWTNYKYDLTGFMHWGYNIWTGYPF 474
Query: 547 EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGL 595
+ +T S GD + YP + + ++S+RLE + G+
Sbjct: 475 DFSTSNSV---------GGDAWIVYPKD-----GKIISSVRLEAMRDGI 509
>gi|386814775|ref|ZP_10101993.1| hypothetical protein Thini_0548 [Thiothrix nivea DSM 5205]
gi|386419351|gb|EIJ33186.1| hypothetical protein Thini_0548 [Thiothrix nivea DSM 5205]
Length = 604
Score = 42.7 bits (99), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 86/381 (22%), Positives = 132/381 (34%), Gaps = 63/381 (16%)
Query: 264 AISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGV-RHGSDEWYEALDQHFK 322
A + + ++LTVWDF LP L V G + + + +G R G L + +
Sbjct: 177 ATGEGQLELPVTLTVWDFSLPERSPLRTVFGTNGYRVAEVYGFERTGKSAADNRLIRAYN 236
Query: 323 -WLLQYRISPF----------------FCRWGESMRVLTYTCPWPADHPKSDEYFSDPRL 365
+LL + +SP F R + +T + A + Y +
Sbjct: 237 DFLLDHHLSPESFWDAAPEANADGLPDFGRQFAGLGTVTDNMRYYAQEKHASAY---TYV 293
Query: 366 AAYAVPYS-PVLSSNDGAKDYVRKEIELLRTKAHWKKAYF--YLWDEPLNMEHYSSVRNM 422
A + P++ P+ A+ ++R + A ++ Y DEP + Y R
Sbjct: 294 FADSYPFADPLGEDRQQAQRFMRAYADWCGKHAGAERCYTDPSFVDEPDTRDAYQYARRW 353
Query: 423 ASELHAYA-PDARVLTTYYCGP---SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLG 478
+ P + P D LG + V VPKF + + V G
Sbjct: 354 GEFFDEISLPKGENIHFQVSEPPLNEDPGLGSLVGKVEVWVPKFYDLWRDVDFLGKNVAG 413
Query: 479 NREDLVKDIVTELQPENGEEWWTYVCMGPSDPH----------------PNWHLGMRGSQ 522
R GEE W Y + P P W L
Sbjct: 414 QRL------------AAGEEVWAYTSLVLDFPEYSKLNPKADVLKGSYPPVWQLDFPAIN 461
Query: 523 HRAVMWRVWKEGGTGFLYWGANC-YEKATVPSAEIRFRRGLPP-----GDGVLFYPG-EV 575
+R W + G TG YW +E A V + F PP GDG+L YPG +
Sbjct: 462 YRIPTWLFHRYGVTGLGYWDTLAWFEGADVWNDAASFVSQNPPGIRFNGDGLLVYPGFKA 521
Query: 576 FSSSRQPVASLRLERILSGLQ 596
+ P+ASLRL+ I ++
Sbjct: 522 QTGFDGPLASLRLKWIRESVE 542
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 64/160 (40%), Gaps = 27/160 (16%)
Query: 53 VHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCS 112
+ V + + G E+ + + L AARNE E Q + ++ G V VQ S
Sbjct: 31 LQVKSIGALDRFGRFELVTGSDKVELFAARNEYEGFQFVV-------TAGERGAVDVQAS 83
Query: 113 -DLCSASGDRLVVGQSLMLRRVVPMLGV-----------PDALVPLDLPVCQI------- 153
+ + +++ G + R V + PD L+P D +
Sbjct: 84 ISVLRSVEGQVIDGLKVFRERYVKVSTPSPHSPYAPQYWPDILLPADNAGAEAAAYRAFP 143
Query: 154 -SLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTEL 192
+L GE VWV I P PG+Y G+I +T+ + +L
Sbjct: 144 QNLTAGENLPVWVDIHIPADARPGVYTGKISVTATGEGQL 183
>gi|162450247|ref|YP_001612614.1| hypothetical protein sce1975 [Sorangium cellulosum So ce56]
gi|161160829|emb|CAN92134.1| hypothetical protein predicted by Glimmer/Critica [Sorangium
cellulosum So ce56]
Length = 687
Score = 42.4 bits (98), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 36/63 (57%), Gaps = 4/63 (6%)
Query: 139 VPDALVPLDL--PVCQISLIPG--ETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSS 194
VPDAL+P++L P L G ET AVW+ + P PG YEG +++ S + EL+S
Sbjct: 176 VPDALIPVELAPPWAPYPLEVGARETRAVWIDLHVPEGALPGAYEGRVVVGSVSHGELAS 235
Query: 195 QCL 197
L
Sbjct: 236 LEL 238
>gi|336428440|ref|ZP_08608421.1| hypothetical protein HMPREF0994_04427 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336005693|gb|EGN35737.1| hypothetical protein HMPREF0994_04427 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 522
Score = 42.0 bits (97), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 42/91 (46%), Gaps = 3/91 (3%)
Query: 485 KDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGAN 544
KD LQ + GEE W Y C P+ N + + + R ++W TGFL+WG N
Sbjct: 363 KDTFRLLQ-DAGEEIWFYTCAFPAGNIMNRSMDLPLTVSRLLLWMGASCRLTGFLHWGFN 421
Query: 545 CYEKATVPSAEIRFRRG--LPPGDGVLFYPG 573
Y + + +G LP GD + YPG
Sbjct: 422 YYIGDDIWNRACCPHKGALLPAGDAHIVYPG 452
>gi|346306833|ref|ZP_08848983.1| hypothetical protein HMPREF9457_00692 [Dorea formicigenerans
4_6_53AFAA]
gi|345907730|gb|EGX77437.1| hypothetical protein HMPREF9457_00692 [Dorea formicigenerans
4_6_53AFAA]
Length = 558
Score = 42.0 bits (97), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 42/94 (44%), Gaps = 9/94 (9%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA----TVPSAE 555
WTY C G N + M +++R +++K G L+WG N Y + E
Sbjct: 401 WTYYCTGQFYEVSNRFMSMPSARNRIYGVQLYKYKIIGVLHWGYNFYNSQYSIEHINPYE 460
Query: 556 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRL 588
+ G P GD L YPGE + QP SLR+
Sbjct: 461 VTDAAGAFPSGDPFLVYPGE----NGQPEESLRM 490
>gi|166033167|ref|ZP_02235996.1| hypothetical protein DORFOR_02889 [Dorea formicigenerans ATCC
27755]
gi|166027524|gb|EDR46281.1| hypothetical protein DORFOR_02889 [Dorea formicigenerans ATCC
27755]
Length = 558
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 42/94 (44%), Gaps = 9/94 (9%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA----TVPSAE 555
WTY C G N + M +++R +++K G L+WG N Y + E
Sbjct: 401 WTYYCTGQFYEVSNRFMSMPSARNRIYGVQLYKYEIIGVLHWGYNFYNSQYSIEHINPYE 460
Query: 556 IRFRRG-LPPGDGVLFYPGEVFSSSRQPVASLRL 588
+ G P GD L YPGE + QP SLR+
Sbjct: 461 VTDAAGAFPSGDPFLVYPGE----NGQPEESLRM 490
>gi|427442082|ref|ZP_18925530.1| conserved hypothetical protein [Pediococcus lolii NGRI 0510Q]
gi|425786839|dbj|GAC46318.1| conserved hypothetical protein [Pediococcus lolii NGRI 0510Q]
Length = 384
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 75/195 (38%), Gaps = 35/195 (17%)
Query: 384 DYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGP 443
+YV+ + L+ W+KA + DEP ++ S L PD +V + P
Sbjct: 137 NYVKALCDHLKDLQVWEKARL-IADEP-KQAQLKEFKDALSALKQMVPDLKVKVAFDKEP 194
Query: 444 ---SDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWW 500
APL T SF YCTS++ ++LQ + E
Sbjct: 195 ILNELAPLVDTLATSF-------------YCTSQFG------------SQLQASHPGEVQ 229
Query: 501 TYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKATVPSAEIRFR- 559
Y+C P P+ H + ++ + + G L W NC+ + +IR+
Sbjct: 230 YYICNYPDHPNTFLHSPLLETRLQGTLTAFLPVNG--LLRWAFNCW--PSNAREDIRYNT 285
Query: 560 RGLPPGDGVLFYPGE 574
LP GD L YPGE
Sbjct: 286 SSLPIGDNCLVYPGE 300
>gi|206601987|gb|EDZ38469.1| Protein of unknown function [Leptospirillum sp. Group II '5-way
CG']
Length = 681
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 95/245 (38%), Gaps = 37/245 (15%)
Query: 376 LSSNDGAKDYVRKEIELLRTKAHWK-------KAYFYLWDEPLNMEHYSS-----VRNMA 423
+S G D + + L + HWK + + Y+ DEP++ +Y + + A
Sbjct: 390 ISDWKGVPDIATQNLAKLIVR-HWKEKGWPIDQTFAYIADEPVHKLYYYADTYKLIAKDA 448
Query: 424 SELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDL 483
LH +P V+ T D P +++ V K + + + W G
Sbjct: 449 DSLHKGSPHIHVMVT------DVPY--ITYKNQVGHNKLI----MVGKVNIWA-GASAQF 495
Query: 484 VKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGA 543
+ + Q E G++ W Y GP N L G R W WK G YW A
Sbjct: 496 IPSRMQARQKE-GDQVWFYQAGGPPFIGQN-DLYSLGPGFRMWFWTAWKYHVNGVFYW-A 552
Query: 544 NCYEKATVPSAEIRFRRGLPPGDGVLFYPGEV-----FSSSRQPVASLRLERILSGLQ-V 597
+ + T + +GL GDG + YPG F + P+ S+R+ + G +
Sbjct: 553 DTFWNDTKENMNPYVNQGL--GDGTILYPGTELHFIGFPDIQGPIPSIRMAQWRRGYEDY 610
Query: 598 RWICY 602
R++ Y
Sbjct: 611 RYLTY 615
>gi|365132343|ref|ZP_09342149.1| hypothetical protein HMPREF1032_03945 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363616981|gb|EHL68397.1| hypothetical protein HMPREF1032_03945 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 571
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 9/95 (9%)
Query: 500 WTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA-----TVPSA 554
W Y C S PN M +++R + ++ G GFL+WG N Y P
Sbjct: 416 WVYYCCAQSSLVPNRFFAMESARNRIMGVLMYLYGIKGFLHWGYNFYNSKFSLHPVDPYR 475
Query: 555 EIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLE 589
P GD L YPG P++S+R E
Sbjct: 476 VTHADYAFPSGDPFLVYPG----PDGAPLSSVRAE 506
>gi|320536341|ref|ZP_08036383.1| PHP domain protein [Treponema phagedenis F0421]
gi|320146822|gb|EFW38396.1| PHP domain protein [Treponema phagedenis F0421]
Length = 869
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 46/118 (38%), Gaps = 10/118 (8%)
Query: 486 DIVTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC 545
D + +N + W Y C + N M ++R + ++K GFL+WG N
Sbjct: 709 DSIAPFIAKNVKPLWAYYCSAQAVHVSNRFFAMPSWRNRILGMLLYKFDIDGFLHWGYNF 768
Query: 546 Y-----EKATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRLERILSGLQVR 598
Y K P P GD YPG+ +P+ S+RL+ LQ R
Sbjct: 769 YYTQYSRKLIDPFTVTDAGGAFPAGDSFSVYPGK-----DEPLPSIRLKVFYEALQDR 821
>gi|325663334|ref|ZP_08151784.1| hypothetical protein HMPREF0490_02525 [Lachnospiraceae bacterium
4_1_37FAA]
gi|325470788|gb|EGC74018.1| hypothetical protein HMPREF0490_02525 [Lachnospiraceae bacterium
4_1_37FAA]
Length = 555
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 42/106 (39%), Gaps = 9/106 (8%)
Query: 488 VTELQPENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYE 547
+ E + WTY C G N + M +++R +++K G L+WG N Y
Sbjct: 387 IEEFLEHGLTDMWTYYCTGQFYEVSNRFMSMPSARNRIYGIQLYKYDIIGILHWGYNFYN 446
Query: 548 -----KATVPSAEIRFRRGLPPGDGVLFYPGEVFSSSRQPVASLRL 588
+ P G P GD L YPG + P S+R+
Sbjct: 447 SQHSYEHINPYQVTDAANGFPAGDPFLVYPG----ADGHPEESIRM 488
>gi|386070149|ref|YP_005985045.1| hypothetical protein TIIST44_02580 [Propionibacterium acnes ATCC
11828]
gi|353454516|gb|AER05035.1| hypothetical protein TIIST44_02580 [Propionibacterium acnes ATCC
11828]
Length = 550
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 65/180 (36%), Gaps = 37/180 (20%)
Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
W A Y+++ DEP Y+S R + P+A V+ DA P F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 369
Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
VP + H + E + + W Y + PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 408
Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
M ++ RA+ +W G L+W N + P+A+ P GD + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468
>gi|50843406|ref|YP_056633.1| hypothetical protein PPA1958 [Propionibacterium acnes KPA171202]
gi|289425612|ref|ZP_06427384.1| conserved hypothetical protein [Propionibacterium acnes SK187]
gi|335053127|ref|ZP_08545978.1| hypothetical protein HMPREF9948_2283 [Propionibacterium sp.
434-HC2]
gi|387504316|ref|YP_005945545.1| hypothetical protein TIB1ST10_09970 [Propionibacterium acnes 6609]
gi|419419836|ref|ZP_13960069.1| hypothetical protein TICEST70_01370 [Propionibacterium acnes
PRP-38]
gi|50841008|gb|AAT83675.1| hypothetical protein PPA1958 [Propionibacterium acnes KPA171202]
gi|289153913|gb|EFD02606.1| conserved hypothetical protein [Propionibacterium acnes SK187]
gi|333767978|gb|EGL45192.1| hypothetical protein HMPREF9948_2283 [Propionibacterium sp.
434-HC2]
gi|335278361|gb|AEH30266.1| hypothetical protein TIB1ST10_09970 [Propionibacterium acnes 6609]
gi|379979557|gb|EIA12877.1| hypothetical protein TICEST70_01370 [Propionibacterium acnes
PRP-38]
Length = 550
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 78/204 (38%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 296 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 354
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 355 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 384
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANC----YEKA 549
+ W Y + PN HLGM ++ RA+ +W G L+W N + +
Sbjct: 385 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 444
Query: 550 TV-PSAEIRFRRGLPPGDGVLFYP 572
V P+A+ P GD + YP
Sbjct: 445 LVDPNADTSADLAFPSGDSSVIYP 468
>gi|365131949|ref|ZP_09342011.1| hypothetical protein HMPREF1032_03807 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363617740|gb|EHL69113.1| hypothetical protein HMPREF1032_03807 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 679
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 29/54 (53%)
Query: 496 GEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCYEKA 549
G+E W Y C+ P + N + R++MW V++ G G+L+WG N + A
Sbjct: 439 GDEVWFYTCLAPKGNYLNRFIDQPIWIGRSLMWLVYRYGVEGYLHWGWNAWHYA 492
>gi|282855297|ref|ZP_06264629.1| conserved hypothetical protein [Propionibacterium acnes J139]
gi|282581885|gb|EFB87270.1| conserved hypothetical protein [Propionibacterium acnes J139]
Length = 499
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 65/180 (36%), Gaps = 37/180 (20%)
Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
W A Y+++ DEP Y+S R + P+A V+ DA P F + V
Sbjct: 269 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 318
Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
VP + H + E + + W Y + PN HLG
Sbjct: 319 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 357
Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
M ++ RA+ +W G L+W N + P+A+ P GD + YP
Sbjct: 358 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 417
>gi|422458853|ref|ZP_16535502.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2]
gi|422466399|ref|ZP_16542973.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4]
gi|422468180|ref|ZP_16544715.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3]
gi|422575224|ref|ZP_16650768.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1]
gi|314924019|gb|EFS87850.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1]
gi|314983039|gb|EFT27131.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3]
gi|315091619|gb|EFT63595.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4]
gi|315104095|gb|EFT76071.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2]
Length = 530
Score = 39.3 bits (90), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 65/180 (36%), Gaps = 37/180 (20%)
Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
W A Y+++ DEP Y+S R + P+A V+ DA P F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 349
Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
VP + H + E + + W Y + PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 388
Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
M ++ RA+ +W G L+W N + P+A+ P GD + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 448
>gi|422391266|ref|ZP_16471359.1| hypothetical protein HMPREF9341_02296 [Propionibacterium acnes
HL103PA1]
gi|422464078|ref|ZP_16540689.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1]
gi|422564324|ref|ZP_16639979.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2]
gi|314967153|gb|EFT11252.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2]
gi|315093876|gb|EFT65852.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1]
gi|327325812|gb|EGE67604.1| hypothetical protein HMPREF9341_02296 [Propionibacterium acnes
HL103PA1]
Length = 530
Score = 39.3 bits (90), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 65/180 (36%), Gaps = 37/180 (20%)
Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
W A Y+++ DEP Y+S R + P+A V+ DA P F + V
Sbjct: 300 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPEATVI--------DAVDDPR-FATVV 349
Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
VP + H + E + + W Y + PN HLG
Sbjct: 350 DVPVTIYGH---------------------LLECEAAGLDGMWAYTSCASTFWEPNRHLG 388
Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
M ++ RA+ +W G L+W N + P+A+ P GD + YP
Sbjct: 389 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSHYLVDPNADTSADLAFPSGDSSVIYP 448
>gi|365963599|ref|YP_004945165.1| hypothetical protein TIA2EST36_09570 [Propionibacterium acnes
TypeIA2 P.acn31]
gi|365974778|ref|YP_004956337.1| hypothetical protein TIA2EST2_09530 [Propionibacterium acnes
TypeIA2 P.acn33]
gi|365740280|gb|AEW84482.1| hypothetical protein TIA2EST36_09570 [Propionibacterium acnes
TypeIA2 P.acn31]
gi|365744777|gb|AEW79974.1| hypothetical protein TIA2EST2_09530 [Propionibacterium acnes
TypeIA2 P.acn33]
Length = 499
Score = 39.3 bits (90), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
+ W Y + PN HLGM ++ RA+ +W G L+W N +
Sbjct: 334 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
P+A+ P GD + YP
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP 417
>gi|422395623|ref|ZP_16475656.1| hypothetical protein HMPREF9344_01398 [Propionibacterium acnes
HL097PA1]
gi|422454993|ref|ZP_16531671.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1]
gi|315107964|gb|EFT79940.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1]
gi|327333100|gb|EGE74827.1| hypothetical protein HMPREF9344_01398 [Propionibacterium acnes
HL097PA1]
Length = 530
Score = 39.3 bits (90), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 276 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 334
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 335 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 364
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
+ W Y + PN HLGM ++ RA+ +W G L+W N +
Sbjct: 365 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 424
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
P+A+ P GD + YP
Sbjct: 425 LVDPNADTSADLAFPSGDSSVIYP 448
>gi|422433658|ref|ZP_16510524.1| conserved hypothetical protein [Propionibacterium acnes HL059PA2]
gi|422436298|ref|ZP_16513148.1| hypothetical protein HMPREF9586_02402 [Propionibacterium acnes
HL083PA2]
gi|422441996|ref|ZP_16518802.1| conserved hypothetical protein [Propionibacterium acnes HL002PA1]
gi|422445323|ref|ZP_16522072.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1]
gi|422511714|ref|ZP_16587855.1| conserved hypothetical protein [Propionibacterium acnes HL059PA1]
gi|422540487|ref|ZP_16616353.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1]
gi|422540934|ref|ZP_16616795.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1]
gi|422546693|ref|ZP_16622518.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3]
gi|422548802|ref|ZP_16624611.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1]
gi|422558679|ref|ZP_16634417.1| hypothetical protein HMPREF9588_02503 [Propionibacterium acnes
HL025PA2]
gi|422561617|ref|ZP_16637301.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1]
gi|422571378|ref|ZP_16646963.1| conserved hypothetical protein [Propionibacterium acnes HL067PA1]
gi|422577404|ref|ZP_16652937.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4]
gi|313763344|gb|EFS34708.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1]
gi|313815003|gb|EFS52717.1| conserved hypothetical protein [Propionibacterium acnes HL059PA1]
gi|314916711|gb|EFS80542.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4]
gi|314919163|gb|EFS82994.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1]
gi|314921243|gb|EFS85074.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3]
gi|314930329|gb|EFS94160.1| conserved hypothetical protein [Propionibacterium acnes HL067PA1]
gi|314956112|gb|EFT00508.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1]
gi|314959730|gb|EFT03832.1| conserved hypothetical protein [Propionibacterium acnes HL002PA1]
gi|314969811|gb|EFT13909.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1]
gi|315098131|gb|EFT70107.1| conserved hypothetical protein [Propionibacterium acnes HL059PA2]
gi|315102739|gb|EFT74715.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1]
gi|327452257|gb|EGE98911.1| hypothetical protein HMPREF9586_02402 [Propionibacterium acnes
HL083PA2]
gi|328752296|gb|EGF65912.1| hypothetical protein HMPREF9588_02503 [Propionibacterium acnes
HL025PA2]
Length = 522
Score = 39.3 bits (90), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
+ W Y + PN HLGM ++ RA+ +W G L+W N +
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
P+A+ P GD + YP
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP 440
>gi|422426470|ref|ZP_16503391.1| hypothetical protein HMPREF9579_00231 [Propionibacterium acnes
HL087PA1]
gi|422453869|ref|ZP_16530551.1| hypothetical protein HMPREF9581_01536 [Propionibacterium acnes
HL087PA3]
gi|327451751|gb|EGE98405.1| hypothetical protein HMPREF9581_01536 [Propionibacterium acnes
HL087PA3]
gi|328756982|gb|EGF70598.1| hypothetical protein HMPREF9579_00231 [Propionibacterium acnes
HL087PA1]
Length = 522
Score = 39.3 bits (90), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
+ W Y + PN HLGM ++ RA+ +W G L+W N +
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
P+A+ P GD + YP
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP 440
>gi|422451477|ref|ZP_16528179.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2]
gi|422499630|ref|ZP_16575891.1| conserved hypothetical protein [Propionibacterium acnes HL063PA2]
gi|313829397|gb|EFS67111.1| conserved hypothetical protein [Propionibacterium acnes HL063PA2]
gi|315108879|gb|EFT80855.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2]
Length = 522
Score = 39.3 bits (90), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 268 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 326
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 327 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 356
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
+ W Y + PN HLGM ++ RA+ +W G L+W N +
Sbjct: 357 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 416
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
P+A+ P GD + YP
Sbjct: 417 LVDPNADTSADLAFPSGDSSVIYP 440
>gi|365965842|ref|YP_004947407.1| hypothetical protein TIA2EST22_09585 [Propionibacterium acnes
TypeIA2 P.acn17]
gi|365742523|gb|AEW82217.1| hypothetical protein TIA2EST22_09585 [Propionibacterium acnes
TypeIA2 P.acn17]
Length = 499
Score = 39.3 bits (90), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
+ W Y + PN HLGM ++ RA+ +W G L+W N +
Sbjct: 334 AGLDGMWVYTSCASTFWEPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
P+A+ P GD + YP
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP 417
>gi|342211509|ref|ZP_08704234.1| hypothetical protein HMPREF9949_0083 [Propionibacterium sp.
CC003-HC2]
gi|340767053|gb|EGR89578.1| hypothetical protein HMPREF9949_0083 [Propionibacterium sp.
CC003-HC2]
Length = 499
Score = 38.9 bits (89), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 245 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 303
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 304 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 333
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
+ W Y + PN HLGM ++ RA+ +W G L+W N +
Sbjct: 334 AGLDGMWVYTSCASTFWDPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 393
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
P+A+ P GD + YP
Sbjct: 394 LVDPNADTSADLAFPSGDSSVIYP 417
>gi|335050668|ref|ZP_08543624.1| hypothetical protein HMPREF9947_1061 [Propionibacterium sp.
409-HC1]
gi|333769177|gb|EGL46316.1| hypothetical protein HMPREF9947_1061 [Propionibacterium sp.
409-HC1]
Length = 426
Score = 38.9 bits (89), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 76/204 (37%), Gaps = 40/204 (19%)
Query: 378 SNDGAKDYVRKEIELLR--TKAH-WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDA 433
S +G +D++ + L ++ H W A Y+++ DEP Y+S R + P A
Sbjct: 172 STEGYRDFLAVLLPALDQWSRRHGWSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGA 230
Query: 434 RVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQP 493
V+ DA P F + V VP + H + E +
Sbjct: 231 TVI--------DAVDDPR-FATVVDVPVTIYGH---------------------LLECEA 260
Query: 494 ENGEEWWTYVCMGPSDPHPNWHLGMRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EK 548
+ W Y + PN HLGM ++ RA+ +W G L+W N +
Sbjct: 261 AGLDGMWVYTSCASTFWDPNRHLGMPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRY 320
Query: 549 ATVPSAEIRFRRGLPPGDGVLFYP 572
P+A+ P GD + YP
Sbjct: 321 LVDPNADTSADLAFPSGDSSVIYP 344
>gi|354605511|ref|ZP_09023487.1| hypothetical protein HMPREF1003_00054 [Propionibacterium sp.
5_U_42AFAA]
gi|386024898|ref|YP_005943203.1| hypothetical protein PAZ_c20420 [Propionibacterium acnes 266]
gi|332676356|gb|AEE73172.1| hypothetical protein PAZ_c20420 [Propionibacterium acnes 266]
gi|353558520|gb|EHC27883.1| hypothetical protein HMPREF1003_00054 [Propionibacterium sp.
5_U_42AFAA]
Length = 550
Score = 38.9 bits (89), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 64/180 (35%), Gaps = 37/180 (20%)
Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
W A Y+++ DEP Y+S R + P A V+ DA P F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 369
Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
VP + H + E + + W Y + PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 408
Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
M ++ RA+ +W G L+W N + P+A+ P GD + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468
>gi|456739043|gb|EMF63610.1| hypothetical protein TIA1EST31_09784 [Propionibacterium acnes
FZ1/2/0]
Length = 550
Score = 38.5 bits (88), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 64/180 (35%), Gaps = 37/180 (20%)
Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
W A Y+++ DEP Y+S R + P A V+ DA P F + V
Sbjct: 320 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPRATVI--------DAVDDPR-FATVV 369
Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
VP + H + E + + W Y + PN HLG
Sbjct: 370 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 408
Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
M ++ RA+ +W G L+W N + P+A+ P GD + YP
Sbjct: 409 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 468
>gi|289426957|ref|ZP_06428676.1| conserved hypothetical protein [Propionibacterium acnes J165]
gi|417930403|ref|ZP_12573781.1| hypothetical protein HMPREF9205_1262 [Propionibacterium acnes
SK182]
gi|289159779|gb|EFD07964.1| conserved hypothetical protein [Propionibacterium acnes J165]
gi|340772245|gb|EGR94754.1| hypothetical protein HMPREF9205_1262 [Propionibacterium acnes
SK182]
Length = 543
Score = 38.5 bits (88), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 64/180 (35%), Gaps = 37/180 (20%)
Query: 399 WKKA-YFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFV 457
W A Y+++ DEP Y+S R + P A V+ DA P F + V
Sbjct: 313 WSDALYWHVSDEP-RANQYTSYRKAVDMVRRTVPGATVI--------DAVDDPR-FATVV 362
Query: 458 KVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEEWWTYVCMGPSDPHPNWHLG 517
VP + H + E + + W Y + PN HLG
Sbjct: 363 DVPVTIYGH---------------------LLECEAAGLDGMWVYTSCASTFWEPNRHLG 401
Query: 518 MRGSQHRAVMWRVWKEGGTGFLYWGANCY-----EKATVPSAEIRFRRGLPPGDGVLFYP 572
M ++ RA+ +W G L+W N + P+A+ P GD + YP
Sbjct: 402 MPLTRLRALGLLLWWHHTPGLLHWALNFWFDQFSRYLVDPNADTSADLAFPSGDSSVIYP 461
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.439
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,760,158,765
Number of Sequences: 23463169
Number of extensions: 494229847
Number of successful extensions: 1011220
Number of sequences better than 100.0: 147
Number of HSP's better than 100.0 without gapping: 47
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 1010923
Number of HSP's gapped (non-prelim): 267
length of query: 605
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 456
effective length of database: 8,863,183,186
effective search space: 4041611532816
effective search space used: 4041611532816
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)
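
Note on the statistics above: the bit scores and the search-space figure can be cross-checked from the values in this footer. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul conversion, bits = (Lambda * raw - ln K) / ln 2, and assumes that the first Lambda/K pair is the ungapped one and the second the gapped one, as is usual in legacy blastall reports. The function names are made up for illustration. The E-value it computes, E = search_space * 2^(-bits), is only a rough estimate and will not match the per-hit Expect values exactly.

```python
import math

# Values copied from the footer of this report.
UNGAPPED = (0.319, 0.136)    # first "Lambda K" pair (assumed ungapped)
GAPPED = (0.267, 0.0410)     # second "Lambda K" pair (assumed gapped)
QUERY_LENGTH = 605
EFFECTIVE_HSP_LENGTH = 149
EFFECTIVE_DB_LENGTH = 8863183186
SEARCH_SPACE = 4041611532816


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score (hypothetical helper)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def rough_expect(bits, search_space):
    """Rough E-value from a bit score and the effective search space."""
    return search_space * 2.0 ** (-bits)


# Effective query length and search space, as reported in the footer.
effective_query_length = QUERY_LENGTH - EFFECTIVE_HSP_LENGTH
search_space = effective_query_length * EFFECTIVE_DB_LENGTH

# Raw scores 88 and 89 appear in the hits above; 80 is the S2 cutoff.
for raw in (80, 88, 89):
    bits = bit_score(raw, *GAPPED)
    print(f"raw {raw}: {bits:.1f} bits, rough E = {rough_expect(bits, SEARCH_SPACE):.1f}")

# S1 uses the ungapped parameters.
print(f"S1 raw 41: {bit_score(41, *UNGAPPED):.1f} bits")
print(f"effective query length: {effective_query_length}")
print(f"effective search space: {search_space}")
```

Under these assumptions the raw scores 88, 89 and 80 come out at 38.5, 38.9 and 35.4 bits, which agrees with the values printed in the report.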