BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 040739
(594 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356556958|ref|XP_003546786.1| PREDICTED: uncharacterized protein LOC100783035 [Glycine max]
Length = 602
Score = 1032 bits (2668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/602 (86%), Positives = 550/602 (91%), Gaps = 12/602 (1%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M+RTRLN+R RCSGSTPSEESALD ERNCCSH NLPSLSPPTLQPFASAGQHCES+AAYF
Sbjct: 1 MERTRLNMRGRCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWP SRL+DAAEERANYF NLQK VLPETLG+LPKG QATTLLELMTIRAFHSKILRCY
Sbjct: 61 SWP--SRLNDAAEERANYFLNLQKEVLPETLGRLPKGHQATTLLELMTIRAFHSKILRCY 118
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 178
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
YFGAPEP KEQLYT+IVDDLRGGDP IGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT
Sbjct: 179 YFGAPEPVSKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRR----------PLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF P TFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 298
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFADDFDMSTVTTSV+G+G+IGDVKI+DLQ+PISSLIGKQVVKVGRSSGLTTG VL
Sbjct: 299 GAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVVL 358
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTD LVVGENQQTFDLEGDSGSLI++KG+NGEKPRPIGIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDNGEKPRPIGIIWGGTAN 418
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLLNLLELDLITTDEGL+VAVQEQRA SAT IGSTVGD
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAVSATVIGSTVGD 478
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSPPDG+ KDKAEDK+EPLGLQIQ IP+ V S + PS+METEF LEDG+K GPS+E
Sbjct: 479 SSPPDGVLPKDKAEDKYEPLGLQIQSIPLGVVPSSQDMKPSIMETEFKLEDGIKVGPSIE 538
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSK 590
QFIPSF G SPLH+N+ D+ ++ENL+SL N CDED+C SLQLGDNEAKRRRS+ASTS
Sbjct: 539 HQFIPSFIGRSPLHKNSIQDRTATENLSSLRNNCDEDLCVSLQLGDNEAKRRRSEASTST 598
Query: 591 EE 592
EE
Sbjct: 599 EE 600
>gi|356525782|ref|XP_003531502.1| PREDICTED: uncharacterized protein LOC100806376 [Glycine max]
Length = 602
Score = 1026 bits (2653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 516/602 (85%), Positives = 548/602 (91%), Gaps = 12/602 (1%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M+R RLN+R CSGSTPSEESALD ERNCCSH NLPSLSPPTLQPFASAGQHCES+AAYF
Sbjct: 1 MERARLNMRGHCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWP SRL+DAAEERANYF NLQKGVLPETLG+LPKG QATTLLELMTIRAFHSKILRCY
Sbjct: 61 SWP--SRLNDAAEERANYFLNLQKGVLPETLGRLPKGHQATTLLELMTIRAFHSKILRCY 118
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 178
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
YFGAPEP PKEQLYT+IVDDLRGGDP IGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT
Sbjct: 179 YFGAPEPVPKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRR----------PLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF P TFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 298
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFADDFDMSTVTTSV+G+G+IGDVKI+DLQ+PISSLIGKQVVKVGRSSGLTTG VL
Sbjct: 299 GAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVVL 358
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTD LVVGENQQTFDLEGDSGSLI++KG+ GEKPRPIGIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDIGEKPRPIGIIWGGTAN 418
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLLNLLELDLITTDEGL+VAVQEQRA SAT IGSTVGD
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAVSATVIGSTVGD 478
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSPPDG+ KDKAEDK+EPLGLQIQ IP+ V S + PS+METEF LEDG+ GPS+E
Sbjct: 479 SSPPDGVLPKDKAEDKYEPLGLQIQSIPLGVVPSSQDMKPSIMETEFKLEDGINVGPSIE 538
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSK 590
QFIPSF G SPLH+N+ D+ ++ENL+SL N CDED+C SLQLGDNEAKRRRS+ASTS
Sbjct: 539 HQFIPSFIGRSPLHKNSIQDRTATENLSSLRNNCDEDLCVSLQLGDNEAKRRRSEASTST 598
Query: 591 EE 592
EE
Sbjct: 599 EE 600
>gi|357451853|ref|XP_003596203.1| hypothetical protein MTR_2g069500 [Medicago truncatula]
gi|355485251|gb|AES66454.1| hypothetical protein MTR_2g069500 [Medicago truncatula]
Length = 603
Score = 1004 bits (2597), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/604 (85%), Positives = 542/604 (89%), Gaps = 14/604 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M+R RLN R RCSGSTPSEESALD ERNC H NLPSLSPPTLQPFASAGQH ESNAAYF
Sbjct: 1 MERPRLNSRVRCSGSTPSEESALDLERNCYGHSNLPSLSPPTLQPFASAGQHGESNAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWP SRL DAAEERANYF NLQKGVLPETLG+LPKGQQATTLLELMTIRAFHSKILRCY
Sbjct: 61 SWP--SRLPDAAEERANYFLNLQKGVLPETLGRLPKGQQATTLLELMTIRAFHSKILRCY 118
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 178
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
YFGAPEP PKEQ YT+IVDDLRGGDP IGSGSQVASQETYGTLGAIV+SQTGSRQVGFLT
Sbjct: 179 YFGAPEPVPKEQHYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRR----------PLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF P TFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 298
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFADDFDM TVTTSV+G+G+IGDVKI+DLQSPIS+LIGKQVVKVGRSSGLTTG VL
Sbjct: 299 GAFIPFADDFDMCTVTTSVRGVGDIGDVKIIDLQSPISTLIGKQVVKVGRSSGLTTGIVL 358
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI+ KG+NGEKPRPIGIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIMFKGDNGEKPRPIGIIWGGTAN 418
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLKIG PPENWTSGVDLGRLLNLLELDLIT+DEGL+VAVQEQR ASAT +GS VGD
Sbjct: 419 RGRLKLKIGLPPENWTSGVDLGRLLNLLELDLITSDEGLRVAVQEQRTASATFMGSIVGD 478
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVK-AGPSV 529
SS PDGMH KD+ EDKFEPLGLQIQ IP+ VE +S E PS ME EF LEDG+K GPS+
Sbjct: 479 SSTPDGMHQKDRVEDKFEPLGLQIQSIPLGVEPNSQEMKPSTMEAEFKLEDGIKVGGPSI 538
Query: 530 ELQFIPSFTGHSPLHQNNPSDK-ASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAST 588
E QFIPSF G SPLH++ DK A++ENL+SL N C+ED+C SLQLGDNEAKRRRS+AST
Sbjct: 539 EHQFIPSFIGRSPLHKHTVHDKAAAAENLSSLRNDCNEDLCVSLQLGDNEAKRRRSEAST 598
Query: 589 SKEE 592
S EE
Sbjct: 599 STEE 602
>gi|255544706|ref|XP_002513414.1| conserved hypothetical protein [Ricinus communis]
gi|223547322|gb|EEF48817.1| conserved hypothetical protein [Ricinus communis]
Length = 600
Score = 1002 bits (2591), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 508/604 (84%), Positives = 540/604 (89%), Gaps = 14/604 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M+ +RLN+RARCSGSTPSEESALD ERNCCSHPNLPSLSP TLQPF SAGQHCES+AAYF
Sbjct: 1 MECSRLNMRARCSGSTPSEESALDAERNCCSHPNLPSLSPRTLQPFVSAGQHCESSAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWP S RL+DA EERANYF+NLQKGVLPETL +LP+GQ+ATTLLELMTIRAFHSKILRCY
Sbjct: 61 SWP-SWRLNDAVEERANYFSNLQKGVLPETLNRLPRGQRATTLLELMTIRAFHSKILRCY 119
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLTDIPAILVFVSRKVHKQWLSPIQCLP ALEGPGGVWCDVDVVEFS
Sbjct: 120 SLGTAIGFRIQRGVLTDIPAILVFVSRKVHKQWLSPIQCLPNALEGPGGVWCDVDVVEFS 179
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
YFGAPEPTPKEQLYT+IVDDLRGGD IGSG QVASQETYGTLGAIVKSQTG+RQVGFLT
Sbjct: 180 YFGAPEPTPKEQLYTEIVDDLRGGDLCIGSGFQVASQETYGTLGAIVKSQTGTRQVGFLT 239
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS F P TFVRAD
Sbjct: 240 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDDLWYGIFAGMNPETFVRAD 299
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFADDFDMSTVTTSVKG+G+IGDVKI+DLQ PI SLIGKQV+KVGRSSGLTTGT+L
Sbjct: 300 GAFIPFADDFDMSTVTTSVKGVGQIGDVKIIDLQCPIGSLIGKQVMKVGRSSGLTTGTIL 359
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AY LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI+MKGENGEKPRPIGIIWGGTAN
Sbjct: 360 AYGLEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIMKGENGEKPRPIGIIWGGTAN 419
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLLNLLEL LITTDEGLKVA+QEQR ASAT IGST+GD
Sbjct: 420 RGRLKLKVGQPPENWTSGVDLGRLLNLLELGLITTDEGLKVAIQEQRIASATTIGSTIGD 479
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSP DGM DK E E LGLQI+HIP+EVE + E NP L+ET FHLEDG+ PSVE
Sbjct: 480 SSPLDGMLPSDKVE---ESLGLQIEHIPLEVELGNSEINPRLVETNFHLEDGIMVAPSVE 536
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSK 590
QFIPSFT SPLH++N SDK ENLASL NGC+ED+C SL LGDNEAK+R S+ASTS
Sbjct: 537 HQFIPSFTRQSPLHKSNLSDKVVLENLASLRNGCNEDVCVSLHLGDNEAKKRSSNASTSI 596
Query: 591 EESK 594
EE K
Sbjct: 597 EEPK 600
>gi|224117600|ref|XP_002317619.1| predicted protein [Populus trichocarpa]
gi|222860684|gb|EEE98231.1| predicted protein [Populus trichocarpa]
Length = 597
Score = 998 bits (2580), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 497/601 (82%), Positives = 529/601 (88%), Gaps = 14/601 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M+R+R N+RA C+ STPS+ESAL ERN CSHP L S+ TLQPFASAGQHCESNAAYF
Sbjct: 1 MERSRNNMRAHCNVSTPSDESAL--ERNYCSHPRLTSVGSATLQPFASAGQHCESNAAYF 58
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPTSSRLSDAAEERANYFANLQKG+LPETLGQ PKGQ+ATTLL+LMTIRAFHSKILRCY
Sbjct: 59 SWPTSSRLSDAAEERANYFANLQKGILPETLGQFPKGQRATTLLDLMTIRAFHSKILRCY 118
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLTDIPAILVFVSRKVHKQWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSTVQCLPNALEGPGGVWCDVDVVEFS 178
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
YFGAP+PTPKEQLYT+IV+DLRG IGSGSQVASQETYGTLGAIV+SQ+GSRQVGFLT
Sbjct: 179 YFGAPQPTPKEQLYTEIVNDLRGDGLYIGSGSQVASQETYGTLGAIVRSQSGSRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHR----------RPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPPTLGPGV LGAVERATSF P TFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPTLGPGVNLGAVERATSFITDDLWYGIFAGINPETFVRAD 298
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPF DDFDMSTV TSVKG+GEIGDVKI+DLQ PIS LIGKQV+KVGRSSGLTTGTV
Sbjct: 299 GAFIPFTDDFDMSTVNTSVKGVGEIGDVKIIDLQCPISDLIGKQVMKVGRSSGLTTGTVF 358
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AY LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI+MKGENGEKPRPIGIIWGGTAN
Sbjct: 359 AYGLEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIMKGENGEKPRPIGIIWGGTAN 418
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLL LELDLITT+EGL+ AVQEQRAASATAI ST+GD
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLYHLELDLITTNEGLQAAVQEQRAASATAICSTIGD 478
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSPPDGM D+ +DK E LGLQI+HIP EVE+ P++ SLMET FHLEDG+K PSVE
Sbjct: 479 SSPPDGMLPNDRMDDKLESLGLQIEHIPSEVENGIPKS--SLMETNFHLEDGIKLTPSVE 536
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSK 590
QFIPSF SPLHQNN SDK SENLASL NGCDEDI SL LGDNEAKRRRS + TS
Sbjct: 537 HQFIPSFIRQSPLHQNNVSDKKVSENLASLRNGCDEDIFVSLHLGDNEAKRRRSFSPTSM 596
Query: 591 E 591
E
Sbjct: 597 E 597
>gi|449453788|ref|XP_004144638.1| PREDICTED: uncharacterized protein LOC101217211 [Cucumis sativus]
gi|449504216|ref|XP_004162286.1| PREDICTED: uncharacterized protein LOC101225003 [Cucumis sativus]
Length = 601
Score = 934 bits (2413), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/604 (80%), Positives = 518/604 (85%), Gaps = 13/604 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M++TR N R CSGSTPSEESALD ERNCCSH +LPS S PTLQPFASAGQH N AYF
Sbjct: 1 MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT RLS EERANYFANLQKGVLP+ L LPKGQ+A TLLELMTIRAFHSKILRCY
Sbjct: 61 SWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI++GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
YFGAP P PKEQLYT+IVDDLRG DP IGSGSQVASQETYGTLGAIV+SQTG RQVGFLT
Sbjct: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRR----------PLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF P TFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFADDFDMSTVTTSVKG+G++GDVK +DLQSPIS+LIGKQVVKVGRSSGLTTGTVL
Sbjct: 301 GAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI++KGEN + +PIGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRDTLQPIGIIWGGTAN 420
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLLNLLELDLIT+DEGLK AVQEQ SAT IGS VGD
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGD 480
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSPPD K+K+E+K E LG QIQH+P EVE S + P L+ETEFHLE G+ PSVE
Sbjct: 481 SSPPDTTLPKEKSEEKSEQLGFQIQHMPTEVE-PSAKDRP-LLETEFHLEPGMNRAPSVE 538
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSK 590
QFIPS SP HQN+ D+A S+NL+ L + C ED+C SLQLGD+EAKRRRSDAS S
Sbjct: 539 HQFIPSLFSCSPSHQNSTLDRAVSQNLSLLRSDC-EDLCVSLQLGDHEAKRRRSDASVSM 597
Query: 591 EESK 594
EE K
Sbjct: 598 EELK 601
>gi|225462187|ref|XP_002267587.1| PREDICTED: uncharacterized protein LOC100261226 [Vitis vinifera]
Length = 603
Score = 879 bits (2271), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/601 (74%), Positives = 504/601 (83%), Gaps = 12/601 (1%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
MD+T+LN+R RCSGST SEESA + ERNCC H +LPS S PTLQPFASAGQH ESNAAYF
Sbjct: 1 MDQTKLNLRLRCSGSTLSEESAPNQERNCCCHSHLPSSSLPTLQPFASAGQHSESNAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPTSSRL+DAAEERANYF+NLQK VL ET G LPKGQQAT+LLE+MTIRAFHSKILRCY
Sbjct: 61 SWPTSSRLNDAAEERANYFSNLQKAVLSETPGPLPKGQQATSLLEVMTIRAFHSKILRCY 120
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RG+LTDIPAILVFVSRKVHKQWL+PIQC P LEGPGG+WCDVDVVEF+
Sbjct: 121 SLGTAIGFRIRRGMLTDIPAILVFVSRKVHKQWLNPIQCFPNVLEGPGGLWCDVDVVEFA 180
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
YFGAPE PKEQ YT+I+DDLRGGDP IGSGSQVASQ+ +GTLGAIV+SQTG+RQVGFLT
Sbjct: 181 YFGAPELAPKEQYYTEIMDDLRGGDPCIGSGSQVASQDGFGTLGAIVRSQTGNRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHR----------RPLTFVRAD 290
NRHVAV+LDYP+QKMFHPLPPTLGPGVYLGAVERATSF P TFVRAD
Sbjct: 241 NRHVAVNLDYPSQKMFHPLPPTLGPGVYLGAVERATSFITDDLWFGIFAGINPETFVRAD 300
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFADDFDMST+TT VKG+GEIGDVK +DLQSP++S+IGKQVVKVGRSSGLTTGT+
Sbjct: 301 GAFIPFADDFDMSTITTLVKGVGEIGDVKKIDLQSPMNSIIGKQVVKVGRSSGLTTGTIF 360
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEY DE+G+C LTD +VVGENQQTFDLEGDSGSLI++ G++GEK RPIGIIWGG N
Sbjct: 361 AYALEYIDERGMCLLTDLIVVGENQQTFDLEGDSGSLIVLTGQDGEKARPIGIIWGGNGN 420
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGR+KLK G P ENWTS VD+GRLLNLLELDLITT EGL+VA+QEQ AASATAIGSTVGD
Sbjct: 421 RGRVKLKAGLPLENWTSAVDIGRLLNLLELDLITTSEGLRVALQEQMAASATAIGSTVGD 480
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSP D M KD+AE+KFE G QIQH P + SP+ N L+E EF LEDGV+ P E
Sbjct: 481 SSPQDKMLPKDRAEEKFESEGFQIQHDPWDDGLGSPDLNRPLVEAEFLLEDGVRVCPCFE 540
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDED--ICFSLQLGDNEAKRRRSDAST 588
QFIPSF PLH+N + + ENL+SL + DED SLQLGD+E KR R D S+
Sbjct: 541 HQFIPSFPEAPPLHENIEQARVTPENLSSLKHDTDEDDGAAISLQLGDHEPKRTRLDPSS 600
Query: 589 S 589
+
Sbjct: 601 N 601
>gi|225423710|ref|XP_002277727.1| PREDICTED: uncharacterized protein LOC100250825 [Vitis vinifera]
Length = 596
Score = 866 bits (2237), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/594 (75%), Positives = 508/594 (85%), Gaps = 12/594 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
MDRTRL++R SGS SEESALD ERN C+HPNLPS SPP LQ FAS GQ ESNAAYF
Sbjct: 1 MDRTRLDLRFHHSGSIQSEESALDLERNYCNHPNLPSPSPPPLQAFASGGQLSESNAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPTSSRL+DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 61 SWPTSSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLT+IPAILVFV+RKVH+QWL+ IQCLP ALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTEIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 180
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP PTPKEQLYT++VD LRG DP IGSGSQVASQETYGTLGAIVKS+TG++QVGFLT
Sbjct: 181 YYGAPAPTPKEQLYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNQQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 241 NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFADDF++S VTT+VKG+GEIGDV I+DLQSPI+SLIG+QVVKVGRSSGLTTGT++
Sbjct: 301 GAFIPFADDFNVSNVTTTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 360
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLIL+ G+NGEKPRP+GIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLL+LLELDLITT EGL+ AV EQ ASA I STVG+
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSEGLQAAVHEQINASAAGIDSTVGE 480
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSV- 529
SSPP+ + LK+K E+ FEPLG+ +Q +P+E E PS + TEFH+E+GV+A P+V
Sbjct: 481 SSPPEPVLLKNKTEENFEPLGINLQQVPIEGESQQ-AVLPSFIHTEFHIEEGVEAAPNVE 539
Query: 530 ELQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR 583
E QFIPS G SP+HQNN + +NL +L N +E++ SLQLG E KRR+
Sbjct: 540 EHQFIPSCPGKSPVHQNNKQENPELKNLWALRNTSEEEMAVSLQLGKPEPKRRK 593
>gi|297737962|emb|CBI27163.3| unnamed protein product [Vitis vinifera]
Length = 684
Score = 865 bits (2235), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/594 (75%), Positives = 508/594 (85%), Gaps = 12/594 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
MDRTRL++R SGS SEESALD ERN C+HPNLPS SPP LQ FAS GQ ESNAAYF
Sbjct: 89 MDRTRLDLRFHHSGSIQSEESALDLERNYCNHPNLPSPSPPPLQAFASGGQLSESNAAYF 148
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPTSSRL+DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 149 SWPTSSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 208
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLT+IPAILVFV+RKVH+QWL+ IQCLP ALEGPGGVWCDVDVVEFS
Sbjct: 209 SLGTAIGFRIRRGVLTEIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 268
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP PTPKEQLYT++VD LRG DP IGSGSQVASQETYGTLGAIVKS+TG++QVGFLT
Sbjct: 269 YYGAPAPTPKEQLYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNQQVGFLT 328
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 329 NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 388
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFADDF++S VTT+VKG+GEIGDV I+DLQSPI+SLIG+QVVKVGRSSGLTTGT++
Sbjct: 389 GAFIPFADDFNVSNVTTTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 448
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLIL+ G+NGEKPRP+GIIWGGTAN
Sbjct: 449 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 508
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLL+LLELDLITT EGL+ AV EQ ASA I STVG+
Sbjct: 509 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSEGLQAAVHEQINASAAGIDSTVGE 568
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSV- 529
SSPP+ + LK+K E+ FEPLG+ +Q +P+E E PS + TEFH+E+GV+A P+V
Sbjct: 569 SSPPEPVLLKNKTEENFEPLGINLQQVPIEGESQQA-VLPSFIHTEFHIEEGVEAAPNVE 627
Query: 530 ELQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR 583
E QFIPS G SP+HQNN + +NL +L N +E++ SLQLG E KRR+
Sbjct: 628 EHQFIPSCPGKSPVHQNNKQENPELKNLWALRNTSEEEMAVSLQLGKPEPKRRK 681
>gi|255566289|ref|XP_002524131.1| conserved hypothetical protein [Ricinus communis]
gi|223536598|gb|EEF38242.1| conserved hypothetical protein [Ricinus communis]
Length = 593
Score = 863 bits (2229), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/593 (74%), Positives = 499/593 (84%), Gaps = 13/593 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
MDR +L++R SGST SEESALD ERNCC+HPN SP +LQPFAS+GQH ESNAAYF
Sbjct: 1 MDRNKLDLRLHHSGSTQSEESALDLERNCCNHPNPHWSSPTSLQPFASSGQHYESNAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT SRL+D AE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 61 SWPTLSRLNDTAEDRANYFGNLQKGVLPETLGRLPSGQQATTLLELMTIRAFHSKILRRF 120
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLTDIPAILVFV+RKVH+QWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP TPKEQLYT++VD LRG P IGSGSQVA+QETYGTLGAIVKS+TG+RQVGFLT
Sbjct: 181 YYGAPASTPKEQLYTELVDGLRGSYPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGTNPETFVRAD 300
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA+DF+M+ VTTSVKG+GEIGDV +DLQSPI+SLIG+QVVKVGRSSGLTTGT++
Sbjct: 301 GAFIPFAEDFNMNNVTTSVKGVGEIGDVHSIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 360
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICF TDFLVVGENQQ FDLEGDSGSLIL+ G+NG+KPRP+GIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFFTDFLVVGENQQPFDLEGDSGSLILLTGQNGDKPRPVGIIWGGTAN 420
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLL+LLELDL+T++EGL+ VQ+Q+ SA + STVG+
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLVTSNEGLQ--VQDQKNVSAAGLDSTVGE 478
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSPPD + KD+ ED EPL L IQ + +E E T P TEFH+EDGV+ P+VE
Sbjct: 479 SSPPDRVLSKDRIEDNIEPLNLNIQQVLLEEESQHGLTAP-FTRTEFHIEDGVETAPNVE 537
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR 583
QFIPSFTG +H N + ENL++L +G DE+I SL+LG+ E KRRR
Sbjct: 538 HQFIPSFTGGPMVHDKNKQENVELENLSALRHGSDEEIHVSLRLGEPEPKRRR 590
>gi|224136616|ref|XP_002322374.1| predicted protein [Populus trichocarpa]
gi|222869370|gb|EEF06501.1| predicted protein [Populus trichocarpa]
Length = 594
Score = 858 bits (2218), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/593 (73%), Positives = 496/593 (83%), Gaps = 12/593 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
MDR RL +R SGS+ SEESALD ERN CSHPNL SP LQPFAS GQH ESNAAYF
Sbjct: 1 MDRNRLGLRIHHSGSSQSEESALDLERNYCSHPNLLWSSPSPLQPFASGGQHSESNAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT SRL+DAAE RANYF NLQKGVLPETLG+LP GQ+ATTLLELMTIRAFHSKILR +
Sbjct: 61 SWPTLSRLNDAAEVRANYFGNLQKGVLPETLGRLPSGQRATTLLELMTIRAFHSKILRRF 120
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RG LTDIPAILVFV+RKVH+QWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGDLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+G P TPKEQLYT++VD LRG DP IGSGSQVA+QETYGTLGAIVKS+TG+RQVGFLT
Sbjct: 181 YYGVPAATPKEQLYTELVDGLRGSDPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGTNPETFVRAD 300
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA+DF+M+ V +VKG+GE+GDV ++DLQ+PI+SLIG+QVVKVGRSSGLTTGT++
Sbjct: 301 GAFIPFAEDFNMNNVNITVKGVGEVGDVHVIDLQAPINSLIGRQVVKVGRSSGLTTGTIM 360
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLIL+ G + EKPRP+GIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGRDCEKPRPVGIIWGGTAN 420
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLL+LLELD+ITT+EGL+ A+Q+QR A A I STVG+
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDIITTNEGLQAAIQDQRNALAQGIDSTVGE 480
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSP D + K+K E+ FEPL L IQ + E E +T P + EFH+ED V+A P+VE
Sbjct: 481 SSPLDRVPSKEKIEENFEPLNLNIQQVTGEGESQHGQT-PLFIGPEFHIEDAVEASPNVE 539
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR 583
QFIPSF+G SP+H N P + +NL++L + DE +CFSL LG+ E KRR+
Sbjct: 540 HQFIPSFSGRSPMHDNTPQENPELKNLSALRSDSDE-MCFSLHLGEPEPKRRK 591
>gi|224114770|ref|XP_002332278.1| predicted protein [Populus trichocarpa]
gi|222832440|gb|EEE70917.1| predicted protein [Populus trichocarpa]
Length = 593
Score = 854 bits (2207), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/593 (73%), Positives = 500/593 (84%), Gaps = 13/593 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M+R RL +R SGS+ SEESALD ERN C+H SLSP LQPF S GQH ESNAAYF
Sbjct: 1 MERNRLGLRIHHSGSSQSEESALDLERNYCNHLPWSSLSP--LQPFTSGGQHSESNAAYF 58
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT SRL+DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 59 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 118
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RG+LTDIPAILVFV+RKVH+QWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGILTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 178
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP TPKEQLYT +VD LRG DP IGSGSQVA+QETYGTLGAIVKS+TG+RQVGFLT
Sbjct: 179 YYGAPAATPKEQLYTDLVDGLRGSDPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 298
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA DF+M+ VTT+VKG+GE+GDV ++DLQ+PI+SLIG+QVVKVGRSSGLTTGT++
Sbjct: 299 GAFIPFAGDFNMNNVTTTVKGVGEVGDVHVIDLQAPINSLIGRQVVKVGRSSGLTTGTIM 358
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLIL+KG++ EKP+P+GIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLKGQDCEKPQPVGIIWGGTAN 418
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+G PPENWTSGVDLGRLL+LLELDLITT++GL+ AVQ+QR ASA AI STVG+
Sbjct: 419 RGRLKLKVGLPPENWTSGVDLGRLLDLLELDLITTNDGLQAAVQDQRNASAPAIDSTVGE 478
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSP D + K+K E+ FEP+ L +Q V+ E ++ P + EFH+EDG +A P+VE
Sbjct: 479 SSPLDRVPSKEKIEENFEPINLNMQQGVVKGESQQGQS-PLFIGPEFHIEDGAEAAPNVE 537
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR 583
QFIPSF+G S +H N P + +NL++L + DE++CFSLQLG E KRR+
Sbjct: 538 HQFIPSFSGQSLMHDNKPQETPELKNLSALRSDSDEEMCFSLQLGKPEPKRRK 590
>gi|147798987|emb|CAN61635.1| hypothetical protein VITISV_008456 [Vitis vinifera]
Length = 1092
Score = 831 bits (2146), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/656 (68%), Positives = 509/656 (77%), Gaps = 74/656 (11%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
MDRTRL++R SGS SEESALD ERN C+HPNLPS SPP LQ FAS GQ ESNAAYF
Sbjct: 435 MDRTRLDLRFHHSGSIQSEESALDLERNYCNHPNLPSPSPPPLQAFASGGQLSESNAAYF 494
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPTSSRL+DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 495 SWPTSSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 554
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLT+IPAILVFV+RKVH+QWL+ IQCLP ALEGPGGVWCDVDVVEFS
Sbjct: 555 SLGTAIGFRIRRGVLTEIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 614
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQ--------------------------- 213
Y+GAP PTPKEQLYT++VD LRG DP IGSGSQ
Sbjct: 615 YYGAPAPTPKEQLYTELVDGLRGSDPCIGSGSQSIXEDYSCMGKTSGCNLFVQMLLELID 674
Query: 214 --------VASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGP 265
VASQETYGTLGAIVKS+TG++QVGFLTNRHVAVDLDYP+QKMFHPLPP+LGP
Sbjct: 675 KTNPGVVHVASQETYGTLGAIVKSRTGNQQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGP 734
Query: 266 GVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEI 315
GVYLGAVERATSF P TFVRADGAFIPFADDF++S VTT+VKG+GEI
Sbjct: 735 GVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFADDFNVSNVTTTVKGVGEI 794
Query: 316 GDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQ 375
G+V I+DLQSPI+SLIG+QVVKVGRSSGLTTGT++AYALEYNDEKGICF TDFLVVGENQ
Sbjct: 795 GEVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIMAYALEYNDEKGICFFTDFLVVGENQ 854
Query: 376 QTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLL 435
QTFDLEGDSGSLIL+ G+NGEKPRP+GIIWGGTANRGRLKLK+GQPPENWTSGVDLGRLL
Sbjct: 855 QTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLL 914
Query: 436 NLLELDLITTDEGLKV---------------------------AVQEQRAASATAIGSTV 468
+LLELDLITT EGL+V AV EQ ASA I STV
Sbjct: 915 DLLELDLITTSEGLQVLEAKIDLQKGFLTIQMMFFSWFIVNIAAVHEQINASAAGIDSTV 974
Query: 469 GDSSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPS 528
G+SSPP+ + LK+K E+ FEPLG+ +Q +P+E E PS + TEFH+E+GV+A P+
Sbjct: 975 GESSPPEPVLLKNKTEENFEPLGINLQQVPIEGESQQ-AVLPSFIHTEFHIEEGVEAAPN 1033
Query: 529 V-ELQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR 583
V E QFIPS G SP+HQNN + +NL +L N +E++ SLQLG E KRR+
Sbjct: 1034 VEEHQFIPSCPGKSPVHQNNKQENPELKNLWALRNTSEEEMXVSLQLGKPEPKRRK 1089
>gi|356576393|ref|XP_003556316.1| PREDICTED: uncharacterized protein LOC100816119 isoform 1 [Glycine max]
Length = 598
Score = 818 bits (2113), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/598 (71%), Positives = 489/598 (81%), Gaps = 15/598 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M++ +L++RA SGST SEESALD ER+ HPN PS SP LQPFA QH ESNAAYF
Sbjct: 1 MNQNQLDLRAHHSGSTQSEESALDLERSYYGHPN-PS-SPSPLQPFAGGAQHSESNAAYF 58
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT SR +DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 59 SWPTLSRWNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 118
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+ GVLTDIPAILVFV+RKVH+QWL+ IQCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRGGVLTDIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 178
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP TPKEQLYT++ D LRG D +GSGSQVASQETYGTLGAIV+S++G+R+VGFLT
Sbjct: 179 YYGAPAQTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRSGNREVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 298
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA+DF+M+ V T+VKG+GEIGDV I+DLQSPI+SLIG+QVVKVGRSSGLTTGT++
Sbjct: 299 GAFIPFAEDFNMNNVITTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 358
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL+ G+NGEKP P+GIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPCPVGIIWGGTAN 418
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLL+LLELDLITT+E L+ AV EQR SA I STVG+
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNEALQAAVLEQRNGSAAGIDSTVGE 478
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSP + +K+K E+ FEP L I VE E S NPS+ EFH++ ++ P+VE
Sbjct: 479 SSPT--VPIKEKLEESFEPFCLNIPLAQVEDE-PSQRVNPSIRPCEFHIKSEIEIAPNVE 535
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAST 588
QFIPS+ G SP Q+ + ++LA L NG DED SL LG+ E KRR+ S+
Sbjct: 536 HQFIPSYAGKSPARQSYLKEDMELKSLAELRNGPDEDNFVSLHLGEPEMKRRKLSNSS 593
>gi|356521576|ref|XP_003529430.1| PREDICTED: uncharacterized protein LOC100796081 [Glycine max]
Length = 600
Score = 815 bits (2105), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/605 (71%), Positives = 492/605 (81%), Gaps = 16/605 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M++ RL++RA SGST SEESALD ER+ HPN PS P LQPFA QH ESNAAYF
Sbjct: 1 MNQNRLDLRAHHSGSTQSEESALDLERSYYGHPN-PSCPSP-LQPFAGGAQHSESNAAYF 58
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT SR +DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 59 SWPTLSRWNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 118
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+ GVLTDIPAILVFV+RKV +QWL+ +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRGGVLTDIPAILVFVARKVRRQWLNHVQCLPAALEGPGGVWCDVDVVEFS 178
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP TPKEQLYT++ D LRG D +GSGSQVASQETYGTLGAIV+S+TG+R+VGFLT
Sbjct: 179 YYGAPAQTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRTGNREVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 298
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA+DF+M+ V T+VKG+GEI DV I+DLQSPI+SLIG+QVVKVGRSSGLTTGT++
Sbjct: 299 GAFIPFAEDFNMNNVITTVKGVGEISDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 358
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL+ G+NGEKPRP+GIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 418
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLL+LLELDLITT+E L+ AV EQR SA I STVG+
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNEALQAAVLEQRNGSAAGIDSTVGE 478
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSP + +K+K E+ FEP L I VE E S NPS+ +FH++ ++ P+VE
Sbjct: 479 SSPT--VPIKEKLEESFEPFCLNIPLAQVEDE-PSQRVNPSIRPCDFHIKSEIETAPNVE 535
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR-SDASTS 589
QFIPS+ G SP Q+ + ++LA L NG DED SL LG+ E KRR+ S++S
Sbjct: 536 HQFIPSYAGKSPACQSYLKEDMELKSLAELRNGPDEDNFVSLHLGEPEMKRRKISNSSFC 595
Query: 590 KEESK 594
+E K
Sbjct: 596 IKELK 600
>gi|356576395|ref|XP_003556317.1| PREDICTED: uncharacterized protein LOC100816119 isoform 2 [Glycine max]
Length = 600
Score = 813 bits (2101), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/600 (71%), Positives = 489/600 (81%), Gaps = 17/600 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M++ +L++RA SGST SEESALD ER+ HPN PS SP LQPFA QH ESNAAYF
Sbjct: 1 MNQNQLDLRAHHSGSTQSEESALDLERSYYGHPN-PS-SPSPLQPFAGGAQHSESNAAYF 58
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT SR +DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 59 SWPTLSRWNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 118
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+ GVLTDIPAILVFV+RKVH+QWL+ IQCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRGGVLTDIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 178
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP TPKEQLYT++ D LRG D +GSGSQVASQETYGTLGAIV+S++G+R+VGFLT
Sbjct: 179 YYGAPAQTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRSGNREVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 298
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA+DF+M+ V T+VKG+GEIGDV I+DLQSPI+SLIG+QVVKVGRSSGLTTGT++
Sbjct: 299 GAFIPFAEDFNMNNVITTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 358
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL+ G+NGEKP P+GIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPCPVGIIWGGTAN 418
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLK--VAVQEQRAASATAIGSTV 468
RGRLKLK+GQPPENWTSGVDLGRLL+LLELDLITT+E L+ AV EQR SA I STV
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNEALQAAAAVLEQRNGSAAGIDSTV 478
Query: 469 GDSSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPS 528
G+SSP + +K+K E+ FEP L I VE E S NPS+ EFH++ ++ P+
Sbjct: 479 GESSPT--VPIKEKLEESFEPFCLNIPLAQVEDE-PSQRVNPSIRPCEFHIKSEIEIAPN 535
Query: 529 VELQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAST 588
VE QFIPS+ G SP Q+ + ++LA L NG DED SL LG+ E KRR+ S+
Sbjct: 536 VEHQFIPSYAGKSPARQSYLKEDMELKSLAELRNGPDEDNFVSLHLGEPEMKRRKLSNSS 595
>gi|357475191|ref|XP_003607881.1| hypothetical protein MTR_4g084020 [Medicago truncatula]
gi|124359654|gb|ABN06026.1| Peptidase, trypsin-like serine and cysteine proteases [Medicago truncatula]
gi|355508936|gb|AES90078.1| hypothetical protein MTR_4g084020 [Medicago truncatula]
Length = 597
Score = 801 bits (2068), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/599 (69%), Positives = 484/599 (80%), Gaps = 18/599 (3%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M+R RL + A SGST SEESALD ERN HP S SP +Q FA QH E NAAYF
Sbjct: 1 MNRNRLGLSAHHSGSTQSEESALDLERNYYGHP---SSSPLHMQTFAVGVQHSEGNAAYF 57
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT +R +DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 58 SWPTLNRWNDAAEDRANYFGNLQKGVLPETLGRLPSGQQATTLLELMTIRAFHSKILRRF 117
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+ GVLTDIPAILVFV+ KVH+QWL+ +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 118 SLGTAIGFRIRGGVLTDIPAILVFVAHKVHRQWLNHVQCLPAALEGPGGVWCDVDVVEFS 177
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP PTPKEQLYT++ D LRG D +GSGSQVASQETYGTLGAIV+S+TG+R+VGFLT
Sbjct: 178 YYGAPAPTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRTGNREVGFLT 237
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 238 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 297
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA+DF+M+ V TS++G+G+IG+V +DLQSPI+SLIG+QV+KVGRSSGLTTGT++
Sbjct: 298 GAFIPFAEDFNMNNVITSIRGVGDIGEVHRIDLQSPINSLIGRQVIKVGRSSGLTTGTIM 357
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL+ G+N EKPRP+GIIWGGTAN
Sbjct: 358 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNREKPRPVGIIWGGTAN 417
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKL++GQPPENWTSGVDLGRLL+LLELDL+TT+E L+ + QEQ S IGSTVG+
Sbjct: 418 RGRLKLRVGQPPENWTSGVDLGRLLDLLELDLVTTNETLQDSGQEQMNGSTAGIGSTVGE 477
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSP + +K+K E+ FEP L ++H+P VE S PSL EFH+ + ++ P+VE
Sbjct: 478 SSPT--VPIKEKLEESFEPFCLNMEHVP--VEEPSTIVKPSLRPCEFHIRNEIETVPNVE 533
Query: 531 LQFI-PSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAST 588
QFI SF G SP+HQ+ + ++L+ L N DED SL LG+ EAKRR+ S+
Sbjct: 534 HQFIRTSFAGKSPVHQSFLKEDMQFKSLSELRNEPDEDNFVSLHLGEPEAKRRKHSNSS 592
>gi|449433481|ref|XP_004134526.1| PREDICTED: uncharacterized protein LOC101202735 [Cucumis sativus]
gi|449519914|ref|XP_004166979.1| PREDICTED: uncharacterized LOC101202735 [Cucumis sativus]
Length = 604
Score = 779 bits (2012), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/606 (68%), Positives = 490/606 (80%), Gaps = 17/606 (2%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
MDRTRL++ S ST SEESALD ERN CSH +LPS SP Q FA Q E+NAAYF
Sbjct: 1 MDRTRLDLTFHHSVSTQSEESALDLERNYCSHLHLPSSSPSPSQCFAPGSQLSETNAAYF 60
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPTSSRL+DAAE+RANYF NLQKGVLPE LG+LP GQ+ATTLLELMTIRAFHSKILR +
Sbjct: 61 SWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRAFHSKILRRF 120
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI++G+LTDIPAI+VFV+RKVH+QWLS +QCLP ALEGPGG+WCDVDVVEFS
Sbjct: 121 SLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFS 180
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP TPKE++YT++VD LRG DP+IGSGSQVASQETYGTLGAIVKS+TG+RQVGFLT
Sbjct: 181 YYGAPAATPKEEVYTELVDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 241 NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD 300
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA+DF+M+ V T VKG+GE+GDV +DLQSPI+SLIG++V+KVGRSSGLT GT++
Sbjct: 301 GAFIPFAEDFNMNNVVTFVKGVGEVGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIM 360
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYND KGICF TDFLVVG++QQTFDLEGDSGSLIL+ G++ EKPRP+GIIWGGTAN
Sbjct: 361 AYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTAN 420
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKLK+GQPPENWTSGVDLGRLL+LLELDLITT++GL+ AV EQR S I STV +
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQAAVHEQRNNSVGGIDSTVAE 480
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
S D + LK + ++ E LGL +Q I E E +P + F +E+G + PS+E
Sbjct: 481 SC-LDRIPLKYRLKENSELLGLSVQQISPEGESSQGMISP--FKHAFQIENGFEVTPSIE 537
Query: 531 LQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDN--EAKRRRS-DAS 587
LQFIP T +SPL Q N + +NL++L NG D ++ SLQLG++ EAKRR+ D
Sbjct: 538 LQFIPRLTSNSPLDQKNEQIQ-ELKNLSALRNGYDSEVSVSLQLGEHEPEAKRRKHLDCL 596
Query: 588 TSKEES 593
+S +ES
Sbjct: 597 SSIKES 602
>gi|124301256|gb|ABN04842.1| Peptidase, trypsin-like serine and cysteine proteases [Medicago truncatula]
Length = 546
Score = 770 bits (1988), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/553 (71%), Positives = 457/553 (82%), Gaps = 18/553 (3%)
Query: 1 MDRTRLNIRARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYF 60
M+R RL + A SGST SEESALD ERN HP S SP +Q FA QH E NAAYF
Sbjct: 1 MNRNRLGLSAHHSGSTQSEESALDLERNYYGHP---SSSPLHMQTFAVGVQHSEGNAAYF 57
Query: 61 SWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCY 120
SWPT +R +DAAE+RANYF NLQKGVLPETLG+LP GQQATTLLELMTIRAFHSKILR +
Sbjct: 58 SWPTLNRWNDAAEDRANYFGNLQKGVLPETLGRLPSGQQATTLLELMTIRAFHSKILRRF 117
Query: 121 SLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+ GVLTDIPAILVFV+ KVH+QWL+ +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 118 SLGTAIGFRIRGGVLTDIPAILVFVAHKVHRQWLNHVQCLPAALEGPGGVWCDVDVVEFS 177
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
Y+GAP PTPKEQLYT++ D LRG D +GSGSQVASQETYGTLGAIV+S+TG+R+VGFLT
Sbjct: 178 YYGAPAPTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRTGNREVGFLT 237
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRAD 290
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATS F P TFVRAD
Sbjct: 238 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 297
Query: 291 GAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
GAFIPFA+DF+M+ V TS++G+G+IG+V +DLQSPI+SLIG+QV+KVGRSSGLTTGT++
Sbjct: 298 GAFIPFAEDFNMNNVITSIRGVGDIGEVHRIDLQSPINSLIGRQVIKVGRSSGLTTGTIM 357
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL+ G+N EKPRP+GIIWGGTAN
Sbjct: 358 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNREKPRPVGIIWGGTAN 417
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGD 470
RGRLKL++GQPPENWTSGVDLGRLL+LLELDL+TT+E L+ + QEQ S IGSTVG+
Sbjct: 418 RGRLKLRVGQPPENWTSGVDLGRLLDLLELDLVTTNETLQDSGQEQMNGSTAGIGSTVGE 477
Query: 471 SSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVE 530
SSP + +K+K E+ FEP L ++H+P VE S PSL EFH+ + ++ P+VE
Sbjct: 478 SSPT--VPIKEKLEESFEPFCLNMEHVP--VEEPSTIVKPSLRPCEFHIRNEIETVPNVE 533
Query: 531 LQFI-PSFTGHSP 542
QFI SF G SP
Sbjct: 534 HQFIRTSFAGKSP 546
>gi|357152457|ref|XP_003576125.1| PREDICTED: uncharacterized protein LOC100833303 [Brachypodium distachyon]
Length = 598
Score = 761 bits (1965), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/588 (69%), Positives = 463/588 (78%), Gaps = 20/588 (3%)
Query: 13 SGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAA 72
+GS+ SE ALD ERN C+H + PP LQP ASAGQH ES+ AYFSWPTS+ + +A
Sbjct: 11 AGSSQSEGPALDMERNGCNH----NCCPPPLQPIASAGQHSESSVAYFSWPTSTLMHGSA 66
Query: 73 EERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKR 132
E RANYF NLQKGVLP LG+LPKGQQATTLL+LM IRAFHSKILR +SLGTAIGFRI++
Sbjct: 67 EGRANYFGNLQKGVLPGHLGRLPKGQQATTLLDLMIIRAFHSKILRRFSLGTAIGFRIRK 126
Query: 133 GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQ 192
G LTD PAILVFV+RKV+K+WL P QCLP ALEGPGGVWCDVDVVEFSY+GAP PTPKEQ
Sbjct: 127 GTLTDTPAILVFVARKVNKKWLRPTQCLPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEQ 186
Query: 193 LYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN 252
LY ++VD LRG DPSIGSGSQVAS ETYGTLGAIVKS+TGS+QVGFLTNRHVAVDLDYPN
Sbjct: 187 LYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGSKQVGFLTNRHVAVDLDYPN 246
Query: 253 QKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDM 302
QKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADDFD+
Sbjct: 247 QKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFDI 306
Query: 303 STVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI 362
+ V+TSVKG+G IGD+K +DLQSPISSLIGKQVVKVGRSSGLTTGTV+AYALEYNDEKGI
Sbjct: 307 TNVSTSVKGVGIIGDIKAIDLQSPISSLIGKQVVKVGRSSGLTTGTVMAYALEYNDEKGI 366
Query: 363 CFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPP 422
CF TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ P
Sbjct: 367 CFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKLKSGQGP 426
Query: 423 ENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAA---SATAIGSTVGDSSPPDGMHL 479
ENWTSGVDLGRLL+LLELDLITT EGL+ A++EQR + +A A ST +SSP
Sbjct: 427 ENWTSGVDLGRLLDLLELDLITTSEGLQEALEEQRISLAAAAAAANSTATESSPVATPQE 486
Query: 480 KDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFTG 539
+K + +EPLG+ IQ +P + + T+ EFH++ E QFIP+ G
Sbjct: 487 NEKVDKIYEPLGINIQQLPRDGSANL--TDQPFGSDEFHVDTVEGMNNVEERQFIPNLIG 544
Query: 540 HSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAS 587
SP+ N +NL+ L N EDICFSL LG+ E KR RSD++
Sbjct: 545 MSPMRDNAREGNGGLDNLSELEN-SPEDICFSLHLGEREPKRLRSDST 591
>gi|226858186|gb|ACO87664.1| unknown [Brachypodium sylvaticum]
Length = 598
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/582 (69%), Positives = 460/582 (79%), Gaps = 20/582 (3%)
Query: 13 SGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAA 72
+GS+ SE ALD ERN C+H + PP+LQP ASAGQH ES+ AYFSWPTS+ + +A
Sbjct: 11 AGSSQSEGPALDMERNGCNH----NCCPPSLQPIASAGQHSESSVAYFSWPTSTLMHGSA 66
Query: 73 EERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKR 132
E RANYF NLQKGVLP LG+LPKGQQATTLL+LM IRAFHSKILR +SLGTAIGFRI++
Sbjct: 67 EGRANYFGNLQKGVLPGHLGRLPKGQQATTLLDLMIIRAFHSKILRRFSLGTAIGFRIRK 126
Query: 133 GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQ 192
G LTD PAILVFV+RKV+K+WL P QCLP ALEGPGGVWCDVDVVEFSY+GAP PTPKEQ
Sbjct: 127 GTLTDTPAILVFVARKVNKKWLGPTQCLPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEQ 186
Query: 193 LYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN 252
LY ++VD LRG DPSIGSGSQVAS ETYGTLGAIVKS+TGS+QVGFLTNRHVAVDLDYPN
Sbjct: 187 LYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGSKQVGFLTNRHVAVDLDYPN 246
Query: 253 QKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDM 302
QKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADDFD+
Sbjct: 247 QKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFDI 306
Query: 303 STVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI 362
+ V TSVKG+G IGD+K +DLQSPISSLIGKQVVKVGRSSGLTTGTV+AYALEYNDEKGI
Sbjct: 307 TNVGTSVKGVGIIGDIKAIDLQSPISSLIGKQVVKVGRSSGLTTGTVMAYALEYNDEKGI 366
Query: 363 CFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPP 422
CF TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ P
Sbjct: 367 CFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKLKSGQGP 426
Query: 423 ENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAA---SATAIGSTVGDSSPPDGMHL 479
ENWTSGVDLGRLL+LLELDLITT EGL+ A++EQR + +ATA ST +SSP
Sbjct: 427 ENWTSGVDLGRLLDLLELDLITTSEGLQEALEEQRISLAAAATAANSTATESSPVATPQE 486
Query: 480 KDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFTG 539
+K + +EPLG+ IQ +P + + T+ S EFH++ E QFIP+ G
Sbjct: 487 NEKVDKIYEPLGINIQQLPRDGSANP--TDQSFGSDEFHVDTLEGMNNVEERQFIPNLIG 544
Query: 540 HSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKR 581
SP+ N +NLA + N EDICFSL LG+ E KR
Sbjct: 545 MSPMRDNAREGNGGLDNLAEMDN-SPEDICFSLHLGEREPKR 585
>gi|15241646|ref|NP_199316.1| trypsin-like protein [Arabidopsis thaliana]
gi|79329912|ref|NP_001032013.1| trypsin-like protein [Arabidopsis thaliana]
gi|10177495|dbj|BAB10886.1| unnamed protein product [Arabidopsis thaliana]
gi|222423925|dbj|BAH19926.1| AT5G45030 [Arabidopsis thaliana]
gi|332007808|gb|AED95191.1| trypsin-like protein [Arabidopsis thaliana]
gi|332007809|gb|AED95192.1| trypsin-like protein [Arabidopsis thaliana]
Length = 607
Score = 751 bits (1940), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/615 (65%), Positives = 481/615 (78%), Gaps = 30/615 (4%)
Query: 1 MDRTRLNIRARCSGSTPSEESA-LDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAA- 58
M+ RL++R S S+ S ESA LD ++N +H L S SP LQPF S QH E++AA
Sbjct: 1 MEGKRLDLRFHHSTSSQSVESAALDLDKNVYNHIKLASSSP--LQPFPSGAQHPETSAAA 58
Query: 59 -YFSWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKIL 117
YFSWPTSSRL+D+AE+RANYFANLQKGVLPE+ LP G++ATTLLELM IRAFHSK L
Sbjct: 59 AYFSWPTSSRLNDSAEDRANYFANLQKGVLPESFDGLPTGKKATTLLELMMIRAFHSKNL 118
Query: 118 RCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVV 177
R +SLGTAIGFRI+RGVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVV
Sbjct: 119 RRFSLGTAIGFRIRRGVLTNIAAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVDVV 178
Query: 178 EFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVG 237
EF Y+GAP TPKEQ+YT++VDDLRG SIGSGSQVASQETYGTLGAIVKS+TG RQVG
Sbjct: 179 EFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQVG 238
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFV 287
FLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATS F P TFV
Sbjct: 239 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 298
Query: 288 RADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTG 347
RADGAFIPFA+DF+ + VTT+VKG+GEIGD+ DLQSP++SLIG++VVKVGRSSGLTTG
Sbjct: 299 RADGAFIPFAEDFNTNNVTTTVKGIGEIGDIHATDLQSPVNSLIGRKVVKVGRSSGLTTG 358
Query: 348 TVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKG--ENGEKPRPIGIIW 405
T++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL+ E EKPRP+GIIW
Sbjct: 359 TIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGIIW 418
Query: 406 GGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR-AASATAI 464
GGTANRGRLKLK+G+ PENWTSGVDLGR+LNLLELDLIT++EGL+ AV EQR A+
Sbjct: 419 GGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCAAV 478
Query: 465 GSTVGDSSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVK 524
STV +SSP + K + FEP+ L +Q + +E ++ S + EF +ED ++
Sbjct: 479 DSTVVESSPGVCNISRCKTGENFEPINLNVQQVLIEDDN-------SNIHPEFQIEDVLE 531
Query: 525 AGPSV-ELQFIPSFTGH-SPLHQN-NPSDKASSENLASLWNGCDED-ICFSLQLGDNEA- 579
+ + E QFIPS + + S LHQ N + S+NL+SL D I FSLQLG+++
Sbjct: 532 SVAVIEEHQFIPSSSNNGSALHQKPNGPENLESKNLSSLKTSSSGDEIGFSLQLGESDTK 591
Query: 580 KRRRSDASTSKEESK 594
KR+R+D+ +E +
Sbjct: 592 KRKRTDSPDGSQEDE 606
>gi|20466342|gb|AAM20488.1| putative protein [Arabidopsis thaliana]
gi|25084087|gb|AAN72171.1| putative protein [Arabidopsis thaliana]
Length = 607
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/615 (65%), Positives = 480/615 (78%), Gaps = 30/615 (4%)
Query: 1 MDRTRLNIRARCSGSTPSEESA-LDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAA- 58
M+ RL++R S S+ S ESA LD ++N +H L S SP LQPF S QH E++AA
Sbjct: 1 MEGKRLDLRFHHSTSSQSVESAALDLDKNVYNHIKLASSSP--LQPFPSGAQHPETSAAA 58
Query: 59 -YFSWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKIL 117
YFSWPTSSRL+D+AE+RANYFANLQKGVLPE+ LP G++ATTLLELM IRAFHSK L
Sbjct: 59 AYFSWPTSSRLNDSAEDRANYFANLQKGVLPESFDGLPTGKKATTLLELMMIRAFHSKNL 118
Query: 118 RCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVV 177
R +SLGTAIGFRI+RGVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVV
Sbjct: 119 RRFSLGTAIGFRIRRGVLTNIAAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVDVV 178
Query: 178 EFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVG 237
EF Y+GAP TPKEQ+YT++VDDLRG SIGSGSQVASQE YGTLGAIVKS+TG RQVG
Sbjct: 179 EFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQERYGTLGAIVKSKTGIRQVG 238
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFV 287
FLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATS F P TFV
Sbjct: 239 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 298
Query: 288 RADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTG 347
RADGAFIPFA+DF+ + VTT+VKG+GEIGD+ DLQSP++SLIG++VVKVGRSSGLTTG
Sbjct: 299 RADGAFIPFAEDFNTNNVTTTVKGIGEIGDIHATDLQSPVNSLIGRKVVKVGRSSGLTTG 358
Query: 348 TVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKG--ENGEKPRPIGIIW 405
T++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL+ E EKPRP+GIIW
Sbjct: 359 TIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGIIW 418
Query: 406 GGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR-AASATAI 464
GGTANRGRLKLK+G+ PENWTSGVDLGR+LNLLELDLIT++EGL+ AV EQR A+
Sbjct: 419 GGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCAAV 478
Query: 465 GSTVGDSSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVK 524
STV +SSP + K + FEP+ L +Q + +E ++ S + EF +ED ++
Sbjct: 479 DSTVVESSPGVCNISRCKTGENFEPINLNVQQVLIEDDN-------SNIHPEFQIEDVLE 531
Query: 525 AGPSV-ELQFIPSFTGH-SPLHQN-NPSDKASSENLASLWNGCDED-ICFSLQLGDNEA- 579
+ + E QFIPS + + S LHQ N + S+NL+SL D I FSLQLG+++
Sbjct: 532 SVAVIEEHQFIPSSSNNGSALHQKPNGPENLESKNLSSLKTSSSGDEIGFSLQLGESDTK 591
Query: 580 KRRRSDASTSKEESK 594
KR+R+D+ +E +
Sbjct: 592 KRKRTDSPDGSQEDE 606
>gi|115476358|ref|NP_001061775.1| Os08g0407200 [Oryza sativa Japonica Group]
gi|37572952|dbj|BAC98602.1| unknown protein [Oryza sativa Japonica Group]
gi|113623744|dbj|BAF23689.1| Os08g0407200 [Oryza sativa Japonica Group]
gi|125603365|gb|EAZ42690.1| hypothetical protein OsJ_27258 [Oryza sativa Japonica Group]
gi|215695285|dbj|BAG90476.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704499|dbj|BAG93933.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767959|dbj|BAH00188.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 590
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/589 (69%), Positives = 468/589 (79%), Gaps = 30/589 (5%)
Query: 13 SGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAA 72
+GS+ SE SALD ERN C+H PS LQP AS GQH ES+AAYFSWPTS+ + +A
Sbjct: 11 AGSSQSEGSALDMERNGCNHNCCPS----PLQPIASGGQHSESSAAYFSWPTSTLMHGSA 66
Query: 73 EERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKR 132
E RANYF NLQKGVLP LG+LP GQ+ATTLL+LM IRAFHSKILR +SLGTAIGFRIK+
Sbjct: 67 EGRANYFGNLQKGVLPGHLGRLPTGQRATTLLDLMIIRAFHSKILRRFSLGTAIGFRIKK 126
Query: 133 GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQ 192
G LTD PAILVFV+RKVH++WLSP QCLP LEGPGGVWCDVDVVEFSY+GAP PTPKEQ
Sbjct: 127 GTLTDTPAILVFVARKVHRKWLSPTQCLPAHLEGPGGVWCDVDVVEFSYYGAPAPTPKEQ 186
Query: 193 LYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN 252
LY ++VD LRG DPSIGSGSQVAS ETYGTLGAIVKS+TG++QVGFLTNRHVAVDLDYPN
Sbjct: 187 LYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAVDLDYPN 246
Query: 253 QKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDM 302
QKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADD+D+
Sbjct: 247 QKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDYDI 306
Query: 303 STVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI 362
++V TSVKG+G IGDVK +DLQSPISSLIG+QVVKVGRSSGLTTGTV+AYALEYNDEKGI
Sbjct: 307 TSVNTSVKGVGVIGDVKAIDLQSPISSLIGRQVVKVGRSSGLTTGTVVAYALEYNDEKGI 366
Query: 363 CFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPP 422
CF TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ P
Sbjct: 367 CFFTDFLVVGENQQTFDLEGDSGSLIILTGKDGEKPQPIGIIWGGTANRGRLKLKSGQGP 426
Query: 423 ENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR---AASATAIGSTVGDSSPPDGMHL 479
ENWTSGVDLGRLL+LLELDLITT EGL+ A++EQR AA+A A ST G+SSP G
Sbjct: 427 ENWTSGVDLGRLLDLLELDLITTSEGLQEALEEQRIILAAAAAAANSTAGESSPVAGPQE 486
Query: 480 KDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSV-ELQFIPSFT 538
+K + +EPLG+ IQ +P ++ + T P EFH+ D V+ +V E QF+
Sbjct: 487 NEKVDKIYEPLGINIQQLP--RDNSATSTGPD----EFHV-DTVEGVTNVEERQFL---I 536
Query: 539 GHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAS 587
G SP + ++ NLA L N EDICFSL LG+ E KR RSD+S
Sbjct: 537 GMSPAREGQEAN-GDLNNLAELEN-SPEDICFSLHLGEREPKRLRSDSS 583
>gi|125561508|gb|EAZ06956.1| hypothetical protein OsI_29197 [Oryza sativa Indica Group]
Length = 590
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/589 (69%), Positives = 467/589 (79%), Gaps = 30/589 (5%)
Query: 13 SGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAA 72
+GS+ SE SALD ERN C+H PS LQP AS GQH ES+AAYFSWPTS+ + +A
Sbjct: 11 AGSSQSEGSALDMERNGCNHNCCPS----PLQPIASGGQHSESSAAYFSWPTSTLMHGSA 66
Query: 73 EERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKR 132
E RANYF NLQKGVLP LG+LP GQ+ATTLL+LM IRAFHSKILR +SLGTAIGFRIK+
Sbjct: 67 EGRANYFGNLQKGVLPGHLGRLPTGQRATTLLDLMIIRAFHSKILRRFSLGTAIGFRIKK 126
Query: 133 GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQ 192
G LTD PAILVFV+RKVH++WLS QCLP LEGPGGVWCDVDVVEFSY+GAP PTPKEQ
Sbjct: 127 GTLTDTPAILVFVARKVHRKWLSTTQCLPAHLEGPGGVWCDVDVVEFSYYGAPAPTPKEQ 186
Query: 193 LYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN 252
LY ++VD LRG DPSIGSGSQVAS ETYGTLGAIVKS+TG++QVGFLTNRHVAVDLDYPN
Sbjct: 187 LYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAVDLDYPN 246
Query: 253 QKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDM 302
QKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADD+D+
Sbjct: 247 QKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDYDI 306
Query: 303 STVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI 362
++V TSVKG+G IGDVK +DLQSPISSLIG+QVVKVGRSSGLTTGTV+AYALEYNDEKGI
Sbjct: 307 TSVNTSVKGVGVIGDVKAIDLQSPISSLIGRQVVKVGRSSGLTTGTVVAYALEYNDEKGI 366
Query: 363 CFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPP 422
CF TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ P
Sbjct: 367 CFFTDFLVVGENQQTFDLEGDSGSLIILTGKDGEKPQPIGIIWGGTANRGRLKLKSGQGP 426
Query: 423 ENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR---AASATAIGSTVGDSSPPDGMHL 479
ENWTSGVDLGRLL+LLELDLITT EGL+ A++EQR AA+A A ST G+SSP G
Sbjct: 427 ENWTSGVDLGRLLDLLELDLITTSEGLQEALEEQRIILAAAAAAANSTAGESSPVAGPQE 486
Query: 480 KDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSV-ELQFIPSFT 538
+K + +EPLG+ IQ +P ++ + T P EFH+ D V+ +V E QF+
Sbjct: 487 NEKVDKIYEPLGINIQQLP--RDNSATSTGPD----EFHV-DTVEGVTNVEERQFL---I 536
Query: 539 GHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAS 587
G SP + ++ NLA L N EDICFSL LG+ E KR RSD+S
Sbjct: 537 GMSPAREGQEAN-GDLNNLAELEN-SPEDICFSLHLGEREPKRLRSDSS 583
>gi|297794835|ref|XP_002865302.1| hypothetical protein ARALYDRAFT_917056 [Arabidopsis lyrata subsp. lyrata]
gi|297311137|gb|EFH41561.1| hypothetical protein ARALYDRAFT_917056 [Arabidopsis lyrata subsp. lyrata]
Length = 614
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/597 (66%), Positives = 468/597 (78%), Gaps = 32/597 (5%)
Query: 21 SALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAA--YFSWPTSSRLSDAAEERANY 78
+ALD ++N +H L S SP QPF S GQH E++AA YFSWPTS RL+D+AE+RANY
Sbjct: 24 AALDLDKNGYNHIKLASSSP--FQPFPSGGQHPETSAAAAYFSWPTSCRLNDSAEDRANY 81
Query: 79 FANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDI 138
FANLQKGVLPET LP G++ATTLLELM IRAFHSK LR +SLGTAIGFRI+RGVLT+I
Sbjct: 82 FANLQKGVLPETFDGLPTGKKATTLLELMMIRAFHSKNLRRFSLGTAIGFRIRRGVLTNI 141
Query: 139 PAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIV 198
AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVVEF Y+GAP TPKEQ+YT++V
Sbjct: 142 AAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVDVVEFQYYGAPAQTPKEQVYTELV 201
Query: 199 DDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHP 258
DDLRG SIGSGSQVASQETYGTLGAIVKS+TG RQVGFLTNRHVAVDLDYP+QKMFHP
Sbjct: 202 DDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQVGFLTNRHVAVDLDYPSQKMFHP 261
Query: 259 LPPTLGPGVYLGAVERATS----------FHHRRPLTFVRADGAFIPFADDFDMSTVTTS 308
LPP+LGPGVYLGAVERATS F P TFVRADGAFIPFA+DF+M+ VTT+
Sbjct: 262 LPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVTTT 321
Query: 309 VKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDF 368
VKG+GEIG++ DLQSPI+SLIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDF
Sbjct: 322 VKGIGEIGNIHATDLQSPINSLIGRKVVKVGRSSGLTTGTIMAYALEYNDEKGICFLTDF 381
Query: 369 LVVGENQQTFDLEGDSGSLILMKG--ENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWT 426
LVVGENQQTFDLEGDSGSLIL+ E EKPRP+GIIWGGTANRGRLKLK+G+ PENWT
Sbjct: 382 LVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGIIWGGTANRGRLKLKVGEQPENWT 441
Query: 427 SGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATA-IGSTVGDSSPPDGMHLKDKAED 485
SGVDLGR+LNLLELDLIT++EGL+ AV EQR A I STV +SSP + K +
Sbjct: 442 SGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCAGIDSTVVESSPGVCNISRCKTGE 501
Query: 486 KFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSV-ELQFIPSFTGHS-PL 543
FEP+ L +Q + E + S + EF +ED +++ + E QFIPS + + L
Sbjct: 502 NFEPINLNVQQV-------LREEDSSNIHPEFQIEDVLESAAMIEEHQFIPSSSNNGYSL 554
Query: 544 HQN-NPSDKASSENLASL-WNGCDEDICFSLQLGDNEAKRRRS----DASTSKEESK 594
HQ N + S+NL+SL N ++I FSLQLG+++ K+R+ D S EES+
Sbjct: 555 HQKINGPENLESKNLSSLKTNSSGDEIGFSLQLGESDTKKRKRTDSPDGSQEHEESR 611
>gi|297834104|ref|XP_002884934.1| hypothetical protein ARALYDRAFT_478657 [Arabidopsis lyrata subsp. lyrata]
gi|297330774|gb|EFH61193.1| hypothetical protein ARALYDRAFT_478657 [Arabidopsis lyrata subsp. lyrata]
Length = 558
Score = 731 bits (1888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/575 (68%), Positives = 447/575 (77%), Gaps = 49/575 (8%)
Query: 43 LQPFASAGQHCESNAA-YFSWPTSSRLSDAAEERANYFANLQKG------VLPETLGQLP 95
+ + S GQHCE AA YFSWPTSSRLS+AAEERANYF+NLQK V PE P
Sbjct: 1 MHQYGSTGQHCEFTAASYFSWPTSSRLSNAAEERANYFSNLQKEEEEDEEVSPEPASTDP 60
Query: 96 KGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLS 155
KGQ+ATTLLELMTIRAFHSKILRCYSLGTAIGFRI+RGVLTDIPAI+VFVSRKVHKQWLS
Sbjct: 61 KGQRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRRGVLTDIPAIIVFVSRKVHKQWLS 120
Query: 156 PIQCLPTALEGPGGVWCDVDVVEFSYFGAP--EPTPKEQLYTQIVDDLRGGDPSIGSGSQ 213
P+QCLPTALEG GG+WCDVDVVEFSYFG P +PTPK+ T IVD L+G DP IGSGSQ
Sbjct: 121 PLQCLPTALEGAGGIWCDVDVVEFSYFGEPDHQPTPKQTFTTDIVDHLQGSDPFIGSGSQ 180
Query: 214 VASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVE 273
VASQET GTLGAIV+SQTGSRQVGF+TNRHVAV+LDYP+QKMFHPLPP LGPGVYLGAVE
Sbjct: 181 VASQETCGTLGAIVRSQTGSRQVGFVTNRHVAVNLDYPSQKMFHPLPPALGPGVYLGAVE 240
Query: 274 RATS----------FHHRRPLTFVRADGAFIPFADDFDMSTVTTSVK-GLGEIGDVKIVD 322
RATS F P TFVRADGAFIPFADD+D+S VTTSVK G+GEIG+VK ++
Sbjct: 241 RATSFITDDLWFGIFAGTNPETFVRADGAFIPFADDYDLSRVTTSVKGGVGEIGEVKAIE 300
Query: 323 LQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLE 381
LQSP+ SL+GKQVVKVGRSSGLTTGTVLAYALEYNDEKG+CFLTDFLVVGEN ++ FDLE
Sbjct: 301 LQSPVGSLVGKQVVKVGRSSGLTTGTVLAYALEYNDEKGVCFLTDFLVVGENHRSPFDLE 360
Query: 382 GDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELD 441
GDSGSLI+MKGE EK RPIGIIWGGT +RGRLKLK+G+ PE+WT+GVDLGRLL L+LD
Sbjct: 361 GDSGSLIVMKGE--EKARPIGIIWGGTGSRGRLKLKVGECPESWTTGVDLGRLLTHLQLD 418
Query: 442 LITTDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMHLKDKA--EDKFEP-LG-LQIQHI 497
LITTDEGLK AVQEQRAAS T + S V DSSPP K K E+K E LG LQ+QHI
Sbjct: 419 LITTDEGLKAAVQEQRAASTTGMSSMVADSSPPYVNLKKGKRNPEEKVEASLGPLQVQHI 478
Query: 498 PVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFTGHSPLHQNNPSDKASSENL 557
+E +ET+ PSVE QF+P+F+G + + + E+L
Sbjct: 479 DLE----------ERIETK-------GGAPSVEHQFMPTFSGQC---SASAWPETAREDL 518
Query: 558 A-SLWNG-CDEDICFSLQLGDNEAKRRRSDASTSK 590
A L NG CD D+C L+LGD+ AKRRR+ + +
Sbjct: 519 AVGLTNGSCDGDLCVGLRLGDDGAKRRRTQVTKER 553
>gi|15230650|ref|NP_187901.1| trypsin-like protein [Arabidopsis thaliana]
gi|15795124|dbj|BAB02502.1| unnamed protein product [Arabidopsis thaliana]
gi|45773814|gb|AAS76711.1| At3g12950 [Arabidopsis thaliana]
gi|52627109|gb|AAU84681.1| At3g12950 [Arabidopsis thaliana]
gi|332641744|gb|AEE75265.1| trypsin-like protein [Arabidopsis thaliana]
Length = 558
Score = 729 bits (1882), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/571 (68%), Positives = 445/571 (77%), Gaps = 47/571 (8%)
Query: 46 FASAGQHCESNAA-YFSWPTSSRLSDAAEERANYFANLQKG------VLPETLGQLPKGQ 98
+ S GQHCE AA YFSWPTSSRLS+AAEERANYF+NLQK V PE + PKGQ
Sbjct: 4 YGSTGQHCEFTAASYFSWPTSSRLSNAAEERANYFSNLQKEEDDDDEVSPEPVSTEPKGQ 63
Query: 99 QATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQ 158
+ATTLLELMTIRAFHSK+LRCYSLGTAIGFRI+RGVLTDIPAI+VFVSRKVHKQWLSP+Q
Sbjct: 64 RATTLLELMTIRAFHSKMLRCYSLGTAIGFRIRRGVLTDIPAIIVFVSRKVHKQWLSPLQ 123
Query: 159 CLPTALEGPGGVWCDVDVVEFSYFGAP--EPTPKEQLYTQIVDDLRGGDPSIGSGSQVAS 216
CLPTALEG GG+WCDVDVVEFSYFG P +PTPK+ T IVD L+G DP IGSGSQVAS
Sbjct: 124 CLPTALEGAGGIWCDVDVVEFSYFGEPDHQPTPKQTFTTDIVDHLQGSDPFIGSGSQVAS 183
Query: 217 QETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERAT 276
QET GTLGAIV+SQTG RQVGF+TNRHVAV+LDYP+QKMFHPLPP LGPGVYLGAVERAT
Sbjct: 184 QETCGTLGAIVRSQTGGRQVGFVTNRHVAVNLDYPSQKMFHPLPPALGPGVYLGAVERAT 243
Query: 277 S----------FHHRRPLTFVRADGAFIPFADDFDMSTVTTSVK-GLGEIGDVKIVDLQS 325
S F P TFVRADGAFIPFADD+D+S VTTSVK G+GEIG+VK ++LQS
Sbjct: 244 SFITDDLWFGIFAGTNPETFVRADGAFIPFADDYDLSRVTTSVKGGVGEIGEVKAIELQS 303
Query: 326 PISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDS 384
P+ SL+GKQVVKVGRSSGLTTGTVLAYALEYNDE+G+CFLTDFLVVGEN ++ FDLEGDS
Sbjct: 304 PVGSLVGKQVVKVGRSSGLTTGTVLAYALEYNDERGVCFLTDFLVVGENHRSPFDLEGDS 363
Query: 385 GSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLIT 444
GSLI+MKGE EK RPIGIIWGGT +RGRLKLK+G+ PE+WT+GVDLGRLL L+LDLIT
Sbjct: 364 GSLIVMKGE--EKARPIGIIWGGTGSRGRLKLKVGECPESWTTGVDLGRLLTHLQLDLIT 421
Query: 445 TDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMHLKDKA--EDKFEP-LG-LQIQHIPVE 500
TDEGLK AVQEQRAAS T + S V DSSPP K+K E+K E LG LQ+QHI +E
Sbjct: 422 TDEGLKAAVQEQRAASTTGMSSMVADSSPPYVNLKKEKRSPEEKLEASLGPLQVQHIDLE 481
Query: 501 VEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFTGHSPLHQNNPSDKASSENLASL 560
+ET+ PSVE QF+P+F+G + + A + +A
Sbjct: 482 ----------ERIETK-------GGAPSVEHQFMPTFSGQ--CSASAWPETAREDLVAGF 522
Query: 561 WNG-CDEDICFSLQLGDNEAKRRRSDASTSK 590
NG CD D+C L+LGD+ AKRRR+ + +
Sbjct: 523 TNGSCDGDLCVGLRLGDDGAKRRRTQVTNER 553
>gi|159137849|gb|ABW89000.1| narrow leaf 1 [Oryza sativa Japonica Group]
gi|222629546|gb|EEE61678.1| hypothetical protein OsJ_16147 [Oryza sativa Japonica Group]
Length = 582
Score = 723 bits (1867), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/596 (62%), Positives = 447/596 (75%), Gaps = 30/596 (5%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
+A+ SG SEES+LD + H + P P++QP AS H E++AAYF WPTS+
Sbjct: 7 KAQLSGLAQSEESSLDVD-----HQSFPC--SPSIQPVASGCTHTENSAAYFLWPTSNLQ 59
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYF NLQKG+LP G+LPKGQQA +LL+LMTIRAFHSKILR +SLGTA+GF
Sbjct: 60 HCAAEGRANYFGNLQKGLLPRHPGRLPKGQQANSLLDLMTIRAFHSKILRRFSLGTAVGF 119
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI++G LTDIPAILVFV+RKVHK+WL+P QCLP LEGPGGVWCDVDVVEFSY+GAP T
Sbjct: 120 RIRKGDLTDIPAILVFVARKVHKKWLNPAQCLPAILEGPGGVWCDVDVVEFSYYGAPAQT 179
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PKEQ+++++VD L G D IGSGSQVAS ET+GTLGAIVK +TG++QVGFLTN HVAVDL
Sbjct: 180 PKEQMFSELVDKLCGSDECIGSGSQVASHETFGTLGAIVKRRTGNKQVGFLTNHHVAVDL 239
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFAD 298
DYPNQKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFAD
Sbjct: 240 DYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAD 299
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DFD+STVTT V+G+G+IGDVK++DLQ P++SLIG+QV KVGRSSG TTGTV+AYALEYND
Sbjct: 300 DFDISTVTTVVRGVGDIGDVKVIDLQCPLNSLIGRQVCKVGRSSGHTTGTVMAYALEYND 359
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGICF TD LVVGEN+QTFDLEGDSGSLI++ ++GEKPRPIGIIWGGTANRGRLKL
Sbjct: 360 EKGICFFTDILVVGENRQTFDLEGDSGSLIILTSQDGEKPRPIGIIWGGTANRGRLKLTS 419
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMH 478
PENWTSGVDLGRLL+ LELD+I T+E L+ AVQ+QR A A+ S VG+SS
Sbjct: 420 DHGPENWTSGVDLGRLLDRLELDIIITNESLQDAVQQQRFALVAAVTSAVGESSGVPVAI 479
Query: 479 LKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFT 538
++K E+ FEPLG+QIQ +P S T ++E E QFI +F
Sbjct: 480 PEEKIEEIFEPLGIQIQQLPRHDVAASGTEGEEASNTVVNVE---------EHQFISNFV 530
Query: 539 GHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSKEESK 594
G SP+ D+ + ++ +L N +E++ SL LGD E KR RSD+ +S + K
Sbjct: 531 GMSPVR----DDQDAPRSITNLNNPSEEELAMSLHLGDREPKRLRSDSGSSLDLEK 582
>gi|297826993|ref|XP_002881379.1| hypothetical protein ARALYDRAFT_902611 [Arabidopsis lyrata subsp.
lyrata]
gi|297327218|gb|EFH57638.1| hypothetical protein ARALYDRAFT_902611 [Arabidopsis lyrata subsp.
lyrata]
Length = 577
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/584 (65%), Positives = 448/584 (76%), Gaps = 35/584 (5%)
Query: 11 RCSGSTPSEESALDFERN--CCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
+ + S+ SE+SALD ERN C S +P LQPF QH ESNA YFSWPT SRL
Sbjct: 12 QAAASSESEDSALDLERNHHCNHLSLPSSSTPSPLQPFTFNIQHAESNAPYFSWPTLSRL 71
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
+DA E+RANYF NLQKGVLPET+G+LP GQQATTLLELMTIRAFHSKILR +SLGTA+GF
Sbjct: 72 NDAVEDRANYFGNLQKGVLPETVGRLPSGQQATTLLELMTIRAFHSKILRRFSLGTAVGF 131
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI RGVLT++PAILVFV+RKVH+QWL+P+QCLP+ALEGPGGVWCDVDVVEF Y+GAP T
Sbjct: 132 RISRGVLTNVPAILVFVARKVHRQWLNPMQCLPSALEGPGGVWCDVDVVEFQYYGAPAAT 191
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
P EQ+Y ++VD LRG DP IGSGSQVASQETYGTLGAIVKS+TG+ QVGFLTNRHVAVDL
Sbjct: 192 PNEQVYNELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNHQVGFLTNRHVAVDL 251
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRADGAFIPFAD 298
DYP+QKMFHPLPP+LGPGVYLGAVERATS F P TFVRADGAFIPFA+
Sbjct: 252 DYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDQWYGIFAGTNPETFVRADGAFIPFAE 311
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DF+ S VTT +KG+GEIG+V ++DLQSPI SLIGKQVVKVGRSSG TTGT++AYALEYND
Sbjct: 312 DFNTSNVTTMIKGIGEIGNVHVIDLQSPIDSLIGKQVVKVGRSSGYTTGTIMAYALEYND 371
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGICFLTDFLV+GENQQTFDLEGDSGSLIL+ G NG+KPRP+GIIWGGTANRG+LKL
Sbjct: 372 EKGICFLTDFLVIGENQQTFDLEGDSGSLILLTGPNGQKPRPVGIIWGGTANRGKLKLIA 431
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMH 478
GQ PENWTSGVDLGRLL+LLELDLIT++ L+ A +E+R S TA+ STV SSPPD +
Sbjct: 432 GQEPENWTSGVDLGRLLDLLELDLITSNHELEAAAREERNTSVTALDSTVSQSSPPDPVP 491
Query: 479 LKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQ-FIPSF 537
+K ++ FEP I H EF +E+ +K P VE FI
Sbjct: 492 SGEKQDESFEPF---IPH-------------------EFRIEEAIKPTPEVEEHIFIAPI 529
Query: 538 TGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKR 581
+ + +K +NL +L N +E++ SL LG+ + K+
Sbjct: 530 SVNESTSAIKGQEKPKLDNLMALKNSSEEEVNVSLHLGEPKLKK 573
>gi|148906346|gb|ABR16328.1| unknown [Picea sitchensis]
Length = 683
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/618 (65%), Positives = 459/618 (74%), Gaps = 42/618 (6%)
Query: 1 MDRTR-LNIRARCSGSTPSEESALDFER----NCCSHPNLPSLSPPTLQPFASAGQHCES 55
MD TR L + R SGS SEESALD E+ N HP S SPP LQ FAS GQH ES
Sbjct: 74 MDVTRALRLGRRYSGSMQSEESALDREQTVTGNSGRHPR--SDSPP-LQAFASGGQHSES 130
Query: 56 NAAYFSWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSK 115
+AA F WP S+RL+ AEERA YF +QK V ETL LP G QATTLL+LMTIRAFHSK
Sbjct: 131 SAACFRWPPSNRLNGTAEERAAYFGGVQKEVDSETLEHLPSGHQATTLLDLMTIRAFHSK 190
Query: 116 ILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVD 175
ILR YSLGTAIGFRI+ GVLT+IPAILVFV+RKVHKQWL +Q LP+ LEGPGGVWCDVD
Sbjct: 191 ILRRYSLGTAIGFRIREGVLTNIPAILVFVARKVHKQWLLDVQRLPSVLEGPGGVWCDVD 250
Query: 176 VVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQ 235
VVEFSY+GAP TPKEQLYT++V+ LRG D +IGSGSQVASQETYGTLGAIVKS+TGSRQ
Sbjct: 251 VVEFSYYGAPAATPKEQLYTELVEGLRGSDQTIGSGSQVASQETYGTLGAIVKSRTGSRQ 310
Query: 236 VGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLT 285
VGFLTNRHVAVDLDYPNQKMFHPLPP LGPGVYLGAVERATS F P T
Sbjct: 311 VGFLTNRHVAVDLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDLWYGIFAGMNPET 370
Query: 286 FVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLT 345
FVRADGAFIPFAD FD+S VTT+VKG+G++G+V +VDLQ+P+ SLIGKQVVKVGRSSGLT
Sbjct: 371 FVRADGAFIPFADSFDVSNVTTTVKGVGDMGEVMLVDLQAPVGSLIGKQVVKVGRSSGLT 430
Query: 346 TGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIW 405
GT++AYALEYNDEKGICF TDFLVVGEN+Q FDLEGDSGSLIL+ E+GEKPRP+GIIW
Sbjct: 431 RGTIMAYALEYNDEKGICFFTDFLVVGENKQAFDLEGDSGSLILVTEESGEKPRPVGIIW 490
Query: 406 GGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQ-RAASATAI 464
GGTANRGRLKLK G PENWTSGVDLGRLL+LL+L++IT GL+ AV+EQ R +SA AI
Sbjct: 491 GGTANRGRLKLKNGSGPENWTSGVDLGRLLDLLQLEMITGAGGLREAVEEQKRWSSAVAI 550
Query: 465 GSTVGDSSP------PDGMHLKDKAE--------DKFEPLGLQIQHIPVEVEHHSPETNP 510
STVG+SSP P + K+K E D + QH+ ++ E NP
Sbjct: 551 DSTVGESSPRGYRIGPLTLAEKEKTEEVCPLMQFDNDDMSSFHTQHLGIQ---SGAEVNP 607
Query: 511 SLMETEFHLEDGVKAGPSVELQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCD---ED 567
++EF + + SVE QF+ F H L + ENL++L +G D ED
Sbjct: 608 IFRQSEF-MTKLAEPSTSVEHQFMKDF--HRSLGHPEQAKSPKCENLSALRDGKDGSSED 664
Query: 568 ICFSLQLGDNEAKRRRSD 585
I L LGD EAKRRRS+
Sbjct: 665 ISIGLHLGDREAKRRRSN 682
>gi|116309879|emb|CAH66916.1| OSIGBa0126B18.9 [Oryza sativa Indica Group]
gi|125549723|gb|EAY95545.1| hypothetical protein OsI_17391 [Oryza sativa Indica Group]
Length = 588
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/602 (62%), Positives = 448/602 (74%), Gaps = 36/602 (5%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
+A+ SG SEES+LD + H + P P++QP AS H E++AAYF WPTS+
Sbjct: 7 KAQLSGLAQSEESSLDVD-----HQSFPC--SPSIQPVASGCTHTENSAAYFLWPTSNLQ 59
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYF NLQKG+LP G+LPKGQQA +LL+LMTIRAFHSKILR +SLGTA+GF
Sbjct: 60 HCAAEGRANYFGNLQKGLLPRHPGRLPKGQQANSLLDLMTIRAFHSKILRRFSLGTAVGF 119
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI++G LTDIPAILVFV+RKVHK+WL+P QCLP LEGPGGVWCDVDVVEFSY+GAP T
Sbjct: 120 RIRKGDLTDIPAILVFVARKVHKKWLNPAQCLPAILEGPGGVWCDVDVVEFSYYGAPAQT 179
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PKEQ+++++VD L G D IGSGSQVAS ET+GTLGAIVK +TG++QVGFLTNRHVAVDL
Sbjct: 180 PKEQMFSELVDKLCGSDECIGSGSQVASHETFGTLGAIVKRRTGNKQVGFLTNRHVAVDL 239
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFAD 298
DYPNQKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFAD
Sbjct: 240 DYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAD 299
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DFD+STVTT V+G+G+IGDVK++DLQ P++SLIG+QV KVGRSSG TTGTV+AYALEYND
Sbjct: 300 DFDISTVTTVVRGVGDIGDVKVIDLQCPLNSLIGRQVCKVGRSSGHTTGTVMAYALEYND 359
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGICF TD LVVGEN+QTFDLEGDSGSLI++ ++GEKPRPIGIIWGGTANRGRLKL
Sbjct: 360 EKGICFFTDILVVGENRQTFDLEGDSGSLIILTSQDGEKPRPIGIIWGGTANRGRLKLTS 419
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGL------KVAVQEQRAASATAIGSTVGDSS 472
PENWTSGVDLGRLL+ LELD+I T+E L K AVQ+QR A A+ S VG+SS
Sbjct: 420 DHGPENWTSGVDLGRLLDRLELDIIITNESLQEFAYYKDAVQQQRFALVAAVTSAVGESS 479
Query: 473 PPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQ 532
++K E+ FEPLG+QIQ +P S T ++E E Q
Sbjct: 480 GAPVAIPEEKVEEIFEPLGIQIQQLPRHDVAASGTEGEEASNTVVNVE---------EHQ 530
Query: 533 FIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSKEE 592
FI +F G SP+ D+ + ++ +L N +E++ SL LGD E KR RSD+ +S +
Sbjct: 531 FISNFVGMSPVR----DDQDAPRSITNLNNPSEEELAMSLHLGDREPKRLRSDSGSSLDL 586
Query: 593 SK 594
K
Sbjct: 587 EK 588
>gi|38344253|emb|CAD41791.2| OSJNBa0008M17.6 [Oryza sativa Japonica Group]
Length = 588
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/602 (62%), Positives = 447/602 (74%), Gaps = 36/602 (5%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
+A+ SG SEES+LD + H + P P++QP AS H E++AAYF WPTS+
Sbjct: 7 KAQLSGLAQSEESSLDVD-----HQSFPC--SPSIQPVASGCTHTENSAAYFLWPTSNLQ 59
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYF NLQKG+LP G+LPKGQQA +LL+LMTIRAFHSKILR +SLGTA+GF
Sbjct: 60 HCAAEGRANYFGNLQKGLLPRHPGRLPKGQQANSLLDLMTIRAFHSKILRRFSLGTAVGF 119
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI++G LTDIPAILVFV+RKVHK+WL+P QCLP LEGPGGVWCDVDVVEFSY+GAP T
Sbjct: 120 RIRKGDLTDIPAILVFVARKVHKKWLNPAQCLPAILEGPGGVWCDVDVVEFSYYGAPAQT 179
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PKEQ+++++VD L G D IGSGSQVAS ET+GTLGAIVK +TG++QVGFLTN HVAVDL
Sbjct: 180 PKEQMFSELVDKLCGSDECIGSGSQVASHETFGTLGAIVKRRTGNKQVGFLTNHHVAVDL 239
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFAD 298
DYPNQKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFAD
Sbjct: 240 DYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAD 299
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DFD+STVTT V+G+G+IGDVK++DLQ P++SLIG+QV KVGRSSG TTGTV+AYALEYND
Sbjct: 300 DFDISTVTTVVRGVGDIGDVKVIDLQCPLNSLIGRQVCKVGRSSGHTTGTVMAYALEYND 359
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGICF TD LVVGEN+QTFDLEGDSGSLI++ ++GEKPRPIGIIWGGTANRGRLKL
Sbjct: 360 EKGICFFTDILVVGENRQTFDLEGDSGSLIILTSQDGEKPRPIGIIWGGTANRGRLKLTS 419
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGL------KVAVQEQRAASATAIGSTVGDSS 472
PENWTSGVDLGRLL+ LELD+I T+E L K AVQ+QR A A+ S VG+SS
Sbjct: 420 DHGPENWTSGVDLGRLLDRLELDIIITNESLQEFAYYKDAVQQQRFALVAAVTSAVGESS 479
Query: 473 PPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQ 532
++K E+ FEPLG+QIQ +P S T ++E E Q
Sbjct: 480 GVPVAIPEEKIEEIFEPLGIQIQQLPRHDVAASGTEGEEASNTVVNVE---------EHQ 530
Query: 533 FIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSKEE 592
FI +F G SP+ D+ + ++ +L N +E++ SL LGD E KR RSD+ +S +
Sbjct: 531 FISNFVGMSPVR----DDQDAPRSITNLNNPSEEELAMSLHLGDREPKRLRSDSGSSLDL 586
Query: 593 SK 594
K
Sbjct: 587 EK 588
>gi|414584860|tpg|DAA35431.1| TPA: hypothetical protein ZEAMMB73_495650 [Zea mays]
Length = 581
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/588 (67%), Positives = 449/588 (76%), Gaps = 43/588 (7%)
Query: 13 SGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAA 72
+GS+ SE S LD ERN C+H PS LQP ASAGQH ES+AAYFSWPTS+ + +A
Sbjct: 11 AGSSQSEGSGLDMERNGCNHNYCPS----PLQPIASAGQHSESSAAYFSWPTSTLMHGSA 66
Query: 73 EERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKR 132
E RANYF NLQKGVLP LG+LPKGQQATTLL+LM IRAFHSKILR +SLGTAIGFRI++
Sbjct: 67 EGRANYFGNLQKGVLPGHLGRLPKGQQATTLLDLMIIRAFHSKILRRFSLGTAIGFRIRK 126
Query: 133 GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQ 192
G LTD PAILVFV+RKVH++WLS QCLPTALEGPGGVWCDVDVVEFSY+GAP PTPKEQ
Sbjct: 127 GTLTDTPAILVFVARKVHRKWLSATQCLPTALEGPGGVWCDVDVVEFSYYGAPAPTPKEQ 186
Query: 193 LYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN 252
LY ++VD LRG DP +GSGSQVAS ETYGTLGAIVKSQTG++QVGFLTNRHVAVDLDYPN
Sbjct: 187 LYDELVDGLRGSDPIVGSGSQVASLETYGTLGAIVKSQTGNKQVGFLTNRHVAVDLDYPN 246
Query: 253 QKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDM 302
QKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADDFD+
Sbjct: 247 QKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFDI 306
Query: 303 STVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI 362
++V+TSVKG+G IGDVK +DLQS I SLIG+QVVKVGRSSGLTTGTV+AYALEYNDEKGI
Sbjct: 307 TSVSTSVKGVGVIGDVKAIDLQSSIGSLIGRQVVKVGRSSGLTTGTVVAYALEYNDEKGI 366
Query: 363 CFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPP 422
CF TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ P
Sbjct: 367 CFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKLKSGQGP 426
Query: 423 ENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR----AASATAIGSTVGDSSPPDGMH 478
ENWTSGVDLGRLL+LLELDLITT EGL+ A++EQR AA+A A ST +SSP G
Sbjct: 427 ENWTSGVDLGRLLDLLELDLITTSEGLQAALEEQRITLAAAAAAATNSTATESSPVAGPQ 486
Query: 479 LKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFT 538
DK + +EPLG+ I IP + S + P+ E +L
Sbjct: 487 ENDKIDKIYEPLGINI--IPRDSSSISTD-QPNENVEELNL------------------- 524
Query: 539 GHSPLHQNNPSDKASSENLASL-WNGCDEDICFSLQLGDNEAKRRRSD 585
SP+ +N NL L + IC +L LG+ E KR RSD
Sbjct: 525 -MSPM-RNGQEGNGDLNNLMDLELENSPDGICIALNLGEREPKRLRSD 570
>gi|242077610|ref|XP_002448741.1| hypothetical protein SORBIDRAFT_06g032440 [Sorghum bicolor]
gi|241939924|gb|EES13069.1| hypothetical protein SORBIDRAFT_06g032440 [Sorghum bicolor]
Length = 579
Score = 714 bits (1843), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/589 (67%), Positives = 450/589 (76%), Gaps = 43/589 (7%)
Query: 13 SGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAA 72
+GS+ SE S LD ERN CSH PS LQP ASAGQH ES+AAYFSWPTS+ + +A
Sbjct: 11 AGSSQSEGSGLDMERNGCSHNCCPS----PLQPIASAGQHSESSAAYFSWPTSTLMHGSA 66
Query: 73 EERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKR 132
E RANYF NLQKGVLP LG+LPKGQQATTLL+LM IRAFHSKILR +SLGTAIGFRI++
Sbjct: 67 EGRANYFGNLQKGVLPGHLGRLPKGQQATTLLDLMIIRAFHSKILRRFSLGTAIGFRIRK 126
Query: 133 GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQ 192
G LTD PAILVFV+RKVH++WLSP QCLP ALEGPGGVWCDVDVVEFSY+GAP PTPKEQ
Sbjct: 127 GTLTDTPAILVFVARKVHRKWLSPTQCLPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEQ 186
Query: 193 LYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN 252
LY ++VD LRG DP +GSGSQVAS ETYGTLGAIVKS+TG++QVGFLTNRHVAVDLDYPN
Sbjct: 187 LYDELVDGLRGSDPIVGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAVDLDYPN 246
Query: 253 QKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDM 302
QKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADDFD+
Sbjct: 247 QKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFDI 306
Query: 303 STVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI 362
++V+TSVKG+G IGDVK +DLQSPI SLIG+QVVKVGRSSGLTTGTV+AYALEYNDEKGI
Sbjct: 307 TSVSTSVKGVGVIGDVKAIDLQSPIGSLIGRQVVKVGRSSGLTTGTVVAYALEYNDEKGI 366
Query: 363 CFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPP 422
CF TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ P
Sbjct: 367 CFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKLKSGQGP 426
Query: 423 ENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR----AASATAIGSTVGDSSPPDGMH 478
ENWTSGVDLGRLL+LLELDLITT EGL+ A+ EQ+ AA+A A ST +SSP G
Sbjct: 427 ENWTSGVDLGRLLDLLELDLITTSEGLQAAIDEQKKTLAAAAAVATNSTATESSPVGGPQ 486
Query: 479 LKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFT 538
DK + +EPLG+ I IP + S + ME EL +
Sbjct: 487 ENDKIDKIYEPLGINI--IPRDGSAISTDQPNENME---------------ELNLM---- 525
Query: 539 GHSPLHQNNPSDKASSENLASLWNGCDED-ICFSLQLGDNEAKRRRSDA 586
SP+ +N NL L + D I +L LG+ E KR R+D+
Sbjct: 526 --SPM-RNGEESNGELNNLLDLESENSPDGISIALNLGEREPKRLRTDS 571
>gi|18403763|ref|NP_565798.1| trypsin-like protein [Arabidopsis thaliana]
gi|20197214|gb|AAM14975.1| expressed protein [Arabidopsis thaliana]
gi|23297468|gb|AAN12976.1| unknown protein [Arabidopsis thaliana]
gi|330253980|gb|AEC09074.1| trypsin-like protein [Arabidopsis thaliana]
Length = 579
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/590 (65%), Positives = 449/590 (76%), Gaps = 41/590 (6%)
Query: 11 RCSGSTPSEESALDFERN--CCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
+ + S+ SE+SALD ERN C S SP LQPF QH ESNA YFSWPT SRL
Sbjct: 12 QAAASSESEDSALDLERNHHCNHLSLPSSSSPSPLQPFTLNIQHAESNAPYFSWPTLSRL 71
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
+D E+RANYF NLQKGVLPET+G+LP GQQATTLLELMTIRAFHSKILR +SLGTA+GF
Sbjct: 72 NDTVEDRANYFGNLQKGVLPETVGRLPSGQQATTLLELMTIRAFHSKILRRFSLGTAVGF 131
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI RGVLT++PAILVFV+RKVH+QWL+P+QCLP+ALEGPGGVWCDVDVVEF Y+GAP T
Sbjct: 132 RISRGVLTNVPAILVFVARKVHRQWLNPMQCLPSALEGPGGVWCDVDVVEFQYYGAPAAT 191
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PKEQ+Y ++VD LRG DP IGSGSQVASQETYGTLGAIVKS+TG+ QVGFLTNRHVAVDL
Sbjct: 192 PKEQVYNELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNHQVGFLTNRHVAVDL 251
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRADGAFIPFAD 298
DYP+QKMFHPLPP+LGPGVYLGAVERATS F P TFVRADGAFIPFA+
Sbjct: 252 DYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDQWYGIFAGTNPETFVRADGAFIPFAE 311
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DF+ S VTT +KG+GEIGDV ++DLQSPI SLIGKQVVKVGRSSG TTGT++AYALEYND
Sbjct: 312 DFNTSNVTTLIKGIGEIGDVHVIDLQSPIDSLIGKQVVKVGRSSGYTTGTIMAYALEYND 371
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGICFLTDFLV+GENQQTFDLEGDSGSLIL+ G NG+KPRP+GIIWGGTANRGRLKL
Sbjct: 372 EKGICFLTDFLVIGENQQTFDLEGDSGSLILLTGPNGQKPRPVGIIWGGTANRGRLKLIA 431
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRA--ASATAIGSTVGDSSPPDG 476
GQ PENWTSGVDLGRLL+LLELDLIT++ L+ A + S TA+ STV SSPPD
Sbjct: 432 GQEPENWTSGVDLGRLLDLLELDLITSNHELEAAAAAREERNTSVTALDSTVSQSSPPDP 491
Query: 477 MHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQ---F 533
+ DK ++ FEP IP EFH+E+ +K P++E++ F
Sbjct: 492 VPSGDKQDESFEPF------IP----------------PEFHIEEAIK--PTLEVEEHIF 527
Query: 534 IPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR 583
I + + + +NL +L N +E++ SL LG+ + K+ +
Sbjct: 528 IAPISVNESTSAIKGQEIPKLDNLMALKNSSEEEVNISLHLGEPKLKKPK 577
>gi|296082780|emb|CBI21785.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 709 bits (1831), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/495 (73%), Positives = 411/495 (83%), Gaps = 12/495 (2%)
Query: 107 MTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEG 166
MTIRAFHSKILRCYSLGTAIGFRI+RG+LTDIPAILVFVSRKVHKQWL+PIQC P LEG
Sbjct: 1 MTIRAFHSKILRCYSLGTAIGFRIRRGMLTDIPAILVFVSRKVHKQWLNPIQCFPNVLEG 60
Query: 167 PGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAI 226
PGG+WCDVDVVEF+YFGAPE PKEQ YT+I+DDLRGGDP IGSGSQVASQ+ +GTLGAI
Sbjct: 61 PGGLWCDVDVVEFAYFGAPELAPKEQYYTEIMDDLRGGDPCIGSGSQVASQDGFGTLGAI 120
Query: 227 VKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHR----- 281
V+SQTG+RQVGFLTNRHVAV+LDYP+QKMFHPLPPTLGPGVYLGAVERATSF
Sbjct: 121 VRSQTGNRQVGFLTNRHVAVNLDYPSQKMFHPLPPTLGPGVYLGAVERATSFITDDLWFG 180
Query: 282 -----RPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVV 336
P TFVRADGAFIPFADDFDMST+TT VKG+GEIGDVK +DLQSP++S+IGKQVV
Sbjct: 181 IFAGINPETFVRADGAFIPFADDFDMSTITTLVKGVGEIGDVKKIDLQSPMNSIIGKQVV 240
Query: 337 KVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGE 396
KVGRSSGLTTGT+ AYALEY DE+G+C LTD +VVGENQQTFDLEGDSGSLI++ G++GE
Sbjct: 241 KVGRSSGLTTGTIFAYALEYIDERGMCLLTDLIVVGENQQTFDLEGDSGSLIVLTGQDGE 300
Query: 397 KPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQ 456
K RPIGIIWGG NRGR+KLK G P ENWTS VD+GRLLNLLELDLITT EGL+VA+QEQ
Sbjct: 301 KARPIGIIWGGNGNRGRVKLKAGLPLENWTSAVDIGRLLNLLELDLITTSEGLRVALQEQ 360
Query: 457 RAASATAIGSTVGDSSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETE 516
AASATAIGSTVGDSSP D M KD+AE+KFE G QIQH P + SP+ N L+E E
Sbjct: 361 MAASATAIGSTVGDSSPQDKMLPKDRAEEKFESEGFQIQHDPWDDGLGSPDLNRPLVEAE 420
Query: 517 FHLEDGVKAGPSVELQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDED--ICFSLQL 574
F LEDGV+ P E QFIPSF PLH+N + + ENL+SL + DED SLQL
Sbjct: 421 FLLEDGVRVCPCFEHQFIPSFPEAPPLHENIEQARVTPENLSSLKHDTDEDDGAAISLQL 480
Query: 575 GDNEAKRRRSDASTS 589
GD+E KR R D S++
Sbjct: 481 GDHEPKRTRLDPSSN 495
>gi|16604659|gb|AAL24122.1| unknown protein [Arabidopsis thaliana]
Length = 579
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/590 (64%), Positives = 448/590 (75%), Gaps = 41/590 (6%)
Query: 11 RCSGSTPSEESALDFERN--CCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
+ + S+ SE+SALD ERN C S SP LQPF QH ESNA YFSWPT SRL
Sbjct: 12 QAAASSESEDSALDLERNHHCNHLSLPSSSSPSPLQPFTLNIQHAESNAPYFSWPTLSRL 71
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
+D E+RANYF NLQKGVLPET+G+LP GQQATTLLELMTIRAFHSKILR +SLGTA+GF
Sbjct: 72 NDTVEDRANYFGNLQKGVLPETVGRLPSGQQATTLLELMTIRAFHSKILRRFSLGTAVGF 131
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI RGVLT++PAILVFV+RKVH+QWL+P+QCLP+ALEGPGGVWCDVDVVEF Y+GAP T
Sbjct: 132 RISRGVLTNVPAILVFVARKVHRQWLNPMQCLPSALEGPGGVWCDVDVVEFQYYGAPAAT 191
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PKEQ+Y ++VD LRG DP IGSGSQVASQETYGTLGAIVKS+TG+ QVGFLTNRHVAVDL
Sbjct: 192 PKEQVYNELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNHQVGFLTNRHVAVDL 251
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTFVRADGAFIPFAD 298
DYP+QKMFHPLPP+LGPGVYLGAVERATS F P TFVRADGAFIPFA+
Sbjct: 252 DYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDQWYGIFAGTNPETFVRADGAFIPFAE 311
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
D + S VTT +KG+GEIGDV ++DLQSPI SLIGKQVVKVGRSSG TTGT++AYALEYND
Sbjct: 312 DVNTSNVTTLIKGIGEIGDVHVIDLQSPIDSLIGKQVVKVGRSSGYTTGTIMAYALEYND 371
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGICFLTDFLV+GENQQTFDLEGDSGSLIL+ G NG+KPRP+GIIWGGTANRGRLKL
Sbjct: 372 EKGICFLTDFLVIGENQQTFDLEGDSGSLILLTGPNGQKPRPVGIIWGGTANRGRLKLIA 431
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRA--ASATAIGSTVGDSSPPDG 476
GQ PENWTSGVDLGRLL+LLELDLIT++ L+ A + S TA+ STV SSPPD
Sbjct: 432 GQEPENWTSGVDLGRLLDLLELDLITSNHELEAAAAAREERNTSVTALDSTVSQSSPPDP 491
Query: 477 MHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQ---F 533
+ DK ++ FEP IP EFH+E+ +K P++E++ F
Sbjct: 492 VPSGDKQDESFEPF------IP----------------PEFHIEEAIK--PTLEVEEHIF 527
Query: 534 IPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRR 583
I + + + +NL +L N +E++ SL LG+ + K+ +
Sbjct: 528 IAPISVNESTSAIKGQEIPKLDNLMALKNSSEEEVNISLHLGEPKLKKPK 577
>gi|293335623|ref|NP_001168357.1| uncharacterized protein LOC100382125 [Zea mays]
gi|223942135|gb|ACN25151.1| unknown [Zea mays]
gi|223947737|gb|ACN27952.1| unknown [Zea mays]
gi|413919905|gb|AFW59837.1| hypothetical protein ZEAMMB73_955518 [Zea mays]
gi|413919906|gb|AFW59838.1| hypothetical protein ZEAMMB73_955518 [Zea mays]
Length = 581
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/496 (75%), Positives = 418/496 (84%), Gaps = 18/496 (3%)
Query: 13 SGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAA 72
+GS+ SE S LD ERN C+H PS LQP ASAGQH ES+AAYFSWPTS+ + +A
Sbjct: 11 AGSSQSEASGLDMERNGCNHNCCPS----PLQPIASAGQHSESSAAYFSWPTSTLMHGSA 66
Query: 73 EERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKR 132
E RANYF NLQKGVLP LG+LP GQQATTLL+LM IRAFHSKILR +SLGTAIGFRI++
Sbjct: 67 EGRANYFGNLQKGVLPGHLGRLPNGQQATTLLDLMIIRAFHSKILRRFSLGTAIGFRIRK 126
Query: 133 GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQ 192
G LTD PAILVFV+RKVH++WLSP QCLP ALEGPGGVWCDVDVVEFSY+GAP PTPKEQ
Sbjct: 127 GTLTDTPAILVFVARKVHRKWLSPTQCLPGALEGPGGVWCDVDVVEFSYYGAPAPTPKEQ 186
Query: 193 LYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN 252
LY ++VD LRG DPSIGSGSQVAS ETYGTLGAIVKS+TG++QVGFLTNRHVAVDLDYPN
Sbjct: 187 LYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAVDLDYPN 246
Query: 253 QKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDM 302
QKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADDF++
Sbjct: 247 QKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFEI 306
Query: 303 STVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI 362
++V+TSVKG+G IG+VK +DLQSPI SLIG+QVVKVGRSSG+TTGTV+AYALEYNDEKGI
Sbjct: 307 ASVSTSVKGVGVIGNVKAIDLQSPIGSLIGRQVVKVGRSSGMTTGTVVAYALEYNDEKGI 366
Query: 363 CFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPP 422
CF TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ P
Sbjct: 367 CFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKLKSGQGP 426
Query: 423 ENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR----AASATAIGSTVGDSSPPDGMH 478
ENWTSGVDLGRLL+LLELDLITT EGL+ A++EQR AA+A A ST +SSP G
Sbjct: 427 ENWTSGVDLGRLLDLLELDLITTSEGLQAALEEQRITLAAAAAAATNSTATESSPVAGPQ 486
Query: 479 LKDKAEDKFEPLGLQI 494
DK + +EPLG+ I
Sbjct: 487 EDDKIDKIYEPLGINI 502
>gi|413919907|gb|AFW59839.1| hypothetical protein ZEAMMB73_955518 [Zea mays]
Length = 555
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/496 (75%), Positives = 418/496 (84%), Gaps = 18/496 (3%)
Query: 13 SGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAA 72
+GS+ SE S LD ERN C+H PS LQP ASAGQH ES+AAYFSWPTS+ + +A
Sbjct: 11 AGSSQSEASGLDMERNGCNHNCCPS----PLQPIASAGQHSESSAAYFSWPTSTLMHGSA 66
Query: 73 EERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKR 132
E RANYF NLQKGVLP LG+LP GQQATTLL+LM IRAFHSKILR +SLGTAIGFRI++
Sbjct: 67 EGRANYFGNLQKGVLPGHLGRLPNGQQATTLLDLMIIRAFHSKILRRFSLGTAIGFRIRK 126
Query: 133 GVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQ 192
G LTD PAILVFV+RKVH++WLSP QCLP ALEGPGGVWCDVDVVEFSY+GAP PTPKEQ
Sbjct: 127 GTLTDTPAILVFVARKVHRKWLSPTQCLPGALEGPGGVWCDVDVVEFSYYGAPAPTPKEQ 186
Query: 193 LYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN 252
LY ++VD LRG DPSIGSGSQVAS ETYGTLGAIVKS+TG++QVGFLTNRHVAVDLDYPN
Sbjct: 187 LYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAVDLDYPN 246
Query: 253 QKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDM 302
QKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADDF++
Sbjct: 247 QKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFEI 306
Query: 303 STVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI 362
++V+TSVKG+G IG+VK +DLQSPI SLIG+QVVKVGRSSG+TTGTV+AYALEYNDEKGI
Sbjct: 307 ASVSTSVKGVGVIGNVKAIDLQSPIGSLIGRQVVKVGRSSGMTTGTVVAYALEYNDEKGI 366
Query: 363 CFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPP 422
CF TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ P
Sbjct: 367 CFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKLKSGQGP 426
Query: 423 ENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR----AASATAIGSTVGDSSPPDGMH 478
ENWTSGVDLGRLL+LLELDLITT EGL+ A++EQR AA+A A ST +SSP G
Sbjct: 427 ENWTSGVDLGRLLDLLELDLITTSEGLQAALEEQRITLAAAAAAATNSTATESSPVAGPQ 486
Query: 479 LKDKAEDKFEPLGLQI 494
DK + +EPLG+ I
Sbjct: 487 EDDKIDKIYEPLGINI 502
>gi|357165942|ref|XP_003580546.1| PREDICTED: uncharacterized protein LOC100839778 [Brachypodium
distachyon]
Length = 639
Score = 703 bits (1815), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/597 (62%), Positives = 447/597 (74%), Gaps = 27/597 (4%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
R + G T SEES+LD E C + P P++QP AS H E++AAYF WPTS+
Sbjct: 7 RMQLLGLTQSEESSLDVEGYCYHNETFPC--SPSMQPIASGCVHTENSAAYFLWPTSNLQ 64
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYF NLQKG+LP G+LPKGQQA +LL+LMT+RAFHSKILR +SLGTA+GF
Sbjct: 65 HCAAEGRANYFGNLQKGLLPVLPGKLPKGQQANSLLDLMTVRAFHSKILRRFSLGTAVGF 124
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RIK+GVLTDIPAI+VFV+RKVHK+WL+P QCLP L GPGGVWCDVDVVEFSY+GAP T
Sbjct: 125 RIKKGVLTDIPAIIVFVARKVHKKWLNPNQCLPAILAGPGGVWCDVDVVEFSYYGAPAQT 184
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PKEQ+++++V+ L G D IGSGSQVASQ+T+GTLGAIVK +T +RQVGFLTNRHVAVDL
Sbjct: 185 PKEQMFSELVNKLCGSDEYIGSGSQVASQDTFGTLGAIVKRRTNNRQVGFLTNRHVAVDL 244
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFAD 298
DYPNQKMFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFAD
Sbjct: 245 DYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAD 304
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DFD+STVTT V+ +GEIGDVK++DLQ PI+SLIG+QV KVGRSSG TTGTV+AYALEYND
Sbjct: 305 DFDISTVTTIVREVGEIGDVKVIDLQCPINSLIGRQVCKVGRSSGHTTGTVMAYALEYND 364
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGICF TD LVVGEN+QTFDLEGDSGSLIL+ ++GEKP PIGIIWGGTANRGR+KL
Sbjct: 365 EKGICFFTDLLVVGENRQTFDLEGDSGSLILLTSQDGEKPLPIGIIWGGTANRGRIKLTS 424
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMH 478
PENWT+GVDLGRLL+ LELDLI T+E LK AVQ+ R A A+ S VG+SS
Sbjct: 425 DHGPENWTTGVDLGRLLDRLELDLIITNESLKDAVQQHRNALVAAVISAVGESSTVAATA 484
Query: 479 LKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSV-ELQFIPSF 537
++KAE+ FEPLG++IQ + P + ++ TE ED V E QFI +F
Sbjct: 485 PEEKAEEVFEPLGIKIQQL--------PRHDVTISATEG--EDTANTSADVEEHQFISNF 534
Query: 538 TGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSKEESK 594
SP + D+ + N+ +L N +E++ SL +GD E KR RSDA ++ + K
Sbjct: 535 GSMSPARR----DQDTPRNIGNLNNPSEEELTMSLHVGDREPKRLRSDAESNLDLEK 587
>gi|293336302|ref|NP_001169250.1| uncharacterized protein LOC100383111 [Zea mays]
gi|223975799|gb|ACN32087.1| unknown [Zea mays]
gi|414585456|tpg|DAA36027.1| TPA: hypothetical protein ZEAMMB73_252293 [Zea mays]
Length = 582
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/596 (61%), Positives = 446/596 (74%), Gaps = 30/596 (5%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
R + SG S+ES LD E +C + PS P++QP AS H E++AAYF WPTS+
Sbjct: 7 RTQLSGFAQSDESTLDVEGHCYHQQSFPS--SPSMQPIASGCTHTENSAAYFLWPTSNLQ 64
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYFANL KG+LP++ G+LPKGQQA +LL+LMTIRAFHSK+LRC+SLGTA+GF
Sbjct: 65 HCAAEGRANYFANLSKGLLPKS-GRLPKGQQANSLLDLMTIRAFHSKVLRCFSLGTAVGF 123
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI++G LTDIPAIL FV+RKVHK+WL+P QCLP +EGPGG+WCDVDVVEFSY+GAP
Sbjct: 124 RIRKGALTDIPAILCFVARKVHKKWLNPDQCLPAIVEGPGGIWCDVDVVEFSYYGAPAQN 183
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PK Q++T++VD L G D IGSGSQVASQ+T+GTLGAIVK +TG++Q+GFLTNRHVAVDL
Sbjct: 184 PKVQMFTELVDKLCGSDECIGSGSQVASQDTFGTLGAIVKRRTGNKQIGFLTNRHVAVDL 243
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFAD 298
DYPNQKM+HPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFA
Sbjct: 244 DYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAH 303
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DFD+STVTT+V+G+G+IGDVK++DLQSP++SLIG+QV K+GRSSG TTGTV+AYALEYND
Sbjct: 304 DFDISTVTTTVRGVGDIGDVKVIDLQSPLNSLIGRQVCKIGRSSGHTTGTVVAYALEYND 363
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGI F TD LVVGEN+QTFDLEGDSGSLI++ G++ EKP PIGIIWGGTANRGRLKL+
Sbjct: 364 EKGISFFTDLLVVGENRQTFDLEGDSGSLIILTGQDNEKPCPIGIIWGGTANRGRLKLRC 423
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMH 478
PENWTSGVDLGRLL+ LELDLI T+E LK AVQ+QR A A S VG+SS
Sbjct: 424 DHGPENWTSGVDLGRLLDRLELDLIITNESLKDAVQQQRLALVAAANSAVGESSTAAVPA 483
Query: 479 LKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFT 538
++K E FEPLG++I+ +P H T ++E E QFI +F
Sbjct: 484 PEEKVE-IFEPLGIKIEQLP---RHDVSATTEGDEAAVINVE---------ERQFISNFV 530
Query: 539 GHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDASTSKEESK 594
G SP+ D+ + +A+L N +E++ SL LGD EAKR R+D + + K
Sbjct: 531 GMSPVR----DDQDAPRQIANLNNPSEEELAMSLHLGDREAKRLRTDTESELDLEK 582
>gi|242074316|ref|XP_002447094.1| hypothetical protein SORBIDRAFT_06g028460 [Sorghum bicolor]
gi|241938277|gb|EES11422.1| hypothetical protein SORBIDRAFT_06g028460 [Sorghum bicolor]
Length = 607
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/619 (59%), Positives = 449/619 (72%), Gaps = 51/619 (8%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
RA+ SG S+ES LD E +C + P P++QP AS H E++AAYF WPTS+
Sbjct: 7 RAQLSGFAQSDESTLDVEGHCYHQQSFPC--SPSMQPIASGCTHTENSAAYFLWPTSNLQ 64
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYFANL KG+LP++ G+LPKGQQA +LL+LMTIRAFHSKILRC+SLGTA+GF
Sbjct: 65 HCAAEGRANYFANLSKGLLPKS-GKLPKGQQANSLLDLMTIRAFHSKILRCFSLGTAVGF 123
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI++GVLTDIPAIL FV+RKVHK+WL+P QCLP +EGPGG+WCDVDVVEFSY+GAP T
Sbjct: 124 RIRKGVLTDIPAILCFVARKVHKKWLNPTQCLPAIVEGPGGIWCDVDVVEFSYYGAPAQT 183
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQV-----------------------ASQETYGTLGA 225
PKEQ++T++VD L G D IGSGSQV ASQ+T+GTLGA
Sbjct: 184 PKEQMFTELVDKLCGSDECIGSGSQVLAKIDLNYLKVADKDSWNDAMAVASQDTFGTLGA 243
Query: 226 IVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF------- 278
IVK +TG++Q+GFLTNRHVAVDLDYPNQKM+HPLPP LGPGVYLGAVERATSF
Sbjct: 244 IVKRRTGNKQIGFLTNRHVAVDLDYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDVWY 303
Query: 279 ---HHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQV 335
P TFVRADGAFIPFA DFD+STV+T+V+G+G+IGDVK +DLQ P++SLIG+QV
Sbjct: 304 GIYAGTNPETFVRADGAFIPFAHDFDISTVSTTVRGVGDIGDVKFIDLQCPLNSLIGRQV 363
Query: 336 VKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENG 395
K+GRSSG TTGTV+AYALEYNDEKGI F TD LVVGEN+QTFDLEGDSGSLI++ G++
Sbjct: 364 CKIGRSSGHTTGTVMAYALEYNDEKGISFFTDLLVVGENRQTFDLEGDSGSLIILTGQDS 423
Query: 396 EKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQE 455
EKPRPIGIIWGGTANRGRLKL+ PENWTSGVDLGRLL+ LELDLI T E LK AVQ+
Sbjct: 424 EKPRPIGIIWGGTANRGRLKLRCDHGPENWTSGVDLGRLLDRLELDLIITSESLKDAVQQ 483
Query: 456 QRAASATAIGSTVGDSSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMET 515
QR A A S VG+SS ++K E+ +EPLG++I+ +P + + S E
Sbjct: 484 QRLAMVAAANSAVGESSTAAVPVPEEKVEELYEPLGIKIEQLPRH------DVSASGTEG 537
Query: 516 EFHLEDGVKAGPSVELQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLG 575
E V+ E QFI +F G SP+ D+ + +A+L N +E++ SL LG
Sbjct: 538 EEAAVVNVE-----ERQFISNFVGMSPVR----GDQDAPRQIANLNNPSEEELAMSLHLG 588
Query: 576 DNEAKRRRSDASTSKEESK 594
D E KR R+D + + K
Sbjct: 589 DREPKRLRTDTESDLDLEK 607
>gi|413919513|gb|AFW59445.1| hypothetical protein ZEAMMB73_623071 [Zea mays]
Length = 566
Score = 679 bits (1753), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/578 (61%), Positives = 432/578 (74%), Gaps = 29/578 (5%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
RA+ SG S+ES LD E +CC P+ P P++QP S H E++AAYF WPTS+
Sbjct: 7 RAQLSGFAQSDESTLDVEGHCCHQPSFPC--SPSMQPIVSGCTHTENSAAYFLWPTSNLQ 64
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYFANL KG+LP+ +LPKGQQA +LL+LMTIRAFHSK+LRC+ LGTA+GF
Sbjct: 65 HCAAEGRANYFANLSKGLLPKIGRRLPKGQQANSLLDLMTIRAFHSKVLRCFGLGTAVGF 124
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI++GVLTDIPAIL FV+RKVHK+WL P CLP L GPGG+WCDVDVVEFSY+GAP T
Sbjct: 125 RIRKGVLTDIPAILCFVARKVHKKWLDPAHCLPAILAGPGGIWCDVDVVEFSYYGAPAQT 184
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PK Q++T++VD L G D IGSGSQVASQ+T+GTLGAIVK +TG++ VGF+TNRHVAVDL
Sbjct: 185 PKVQIFTELVDKLCGSDECIGSGSQVASQDTFGTLGAIVKRRTGNKLVGFVTNRHVAVDL 244
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFAD 298
DYPNQKM+HPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFA
Sbjct: 245 DYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAH 304
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DFD+STVTT+V+G+G+IGDVK++DLQ P++ LIG++V K+GRSSG TTGTV+AYALEYND
Sbjct: 305 DFDISTVTTTVRGVGDIGDVKVIDLQCPLNRLIGRRVCKIGRSSGHTTGTVMAYALEYND 364
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGI F TD LVVGEN+QTFDLEGDSGSLI++ G++ EKPRPIGIIWGGTANRGRLKL+
Sbjct: 365 EKGISFFTDLLVVGENRQTFDLEGDSGSLIILTGQDSEKPRPIGIIWGGTANRGRLKLRC 424
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMH 478
P+NWTSGVDLGRLL+ LELDLI T E LK AVQ+QR A A A S G+SS
Sbjct: 425 DHGPQNWTSGVDLGRLLDRLELDLIITSESLKDAVQQQRRALAAAANSAAGESSTAAAPV 484
Query: 479 LKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFT 538
L++K E+ FEPLG++I+ ++ H + + ++E E QFI +F
Sbjct: 485 LEEKVEEIFEPLGIKIE----QLRRHDVSASEAEEAAGINVE---------ERQFISNFV 531
Query: 539 GHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGD 576
G SP+ D+ + +A+L N +E++ L LGD
Sbjct: 532 GRSPVR----DDQGAPRQIANLNNPSEEELAMLLHLGD 565
>gi|297791289|ref|XP_002863529.1| hypothetical protein ARALYDRAFT_917030 [Arabidopsis lyrata subsp.
lyrata]
gi|297309364|gb|EFH39788.1| hypothetical protein ARALYDRAFT_917030 [Arabidopsis lyrata subsp.
lyrata]
Length = 578
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/619 (61%), Positives = 449/619 (72%), Gaps = 69/619 (11%)
Query: 1 MDRTRLNIRAR--CSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAA 58
M+ RL++R S S E +ALD ++N +H L S SP LQPF S GQH E++AA
Sbjct: 1 MEGKRLDLRFHHSVSSSQSVESAALDLDKNGYNHIKLASSSP--LQPFPSGGQHPETSAA 58
Query: 59 --YFSWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKI 116
YFSWPTSSRL+D+AE+RANYFANLQKGVLPET LP T+L
Sbjct: 59 AAYFSWPTSSRLNDSAEDRANYFANLQKGVLPETFDGLP------TIL------------ 100
Query: 117 LRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDV 176
VLT+I AILVFV+RKVHKQWL+P QCLPTALEGPGGVWCDVDV
Sbjct: 101 -----------------VLTNIAAILVFVARKVHKQWLNPPQCLPTALEGPGGVWCDVDV 143
Query: 177 VEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQV 236
VEF Y+GAP TPKEQ+YT++VDDLRG SIGSGSQVASQETYGTLGAIVKS+TG RQV
Sbjct: 144 VEFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQV 203
Query: 237 GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATS----------FHHRRPLTF 286
GFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATS F P TF
Sbjct: 204 GFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETF 263
Query: 287 VRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTT 346
VRADGAFIPFA+DF+M+ VTT+VKG+GEIG++ DLQSPI+SLIG++VVKVGRSSGLTT
Sbjct: 264 VRADGAFIPFAEDFNMNNVTTTVKGIGEIGNIHATDLQSPINSLIGRKVVKVGRSSGLTT 323
Query: 347 GTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKG--ENGEKPRPIGII 404
GT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL+ E EKPRP+GII
Sbjct: 324 GTIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGII 383
Query: 405 WGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATA- 463
WGGTANRGRLKLK+G+ PENWTSGVDLGR+LNLLELDLIT++EGL+ AV EQR + A
Sbjct: 384 WGGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNSIMCAG 443
Query: 464 IGSTVGDSSPPDGMHLKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGV 523
I STV +SSP + K + FEP+ L +Q + E + S + EF +ED +
Sbjct: 444 IDSTVVESSPGVCNISRCKTGENFEPINLNVQQV-------LREEDSSNIHPEFQIEDVL 496
Query: 524 KAGPSV-ELQFIPSFTGHS-PLHQN-NPSDKASSENLASL-WNGCDEDICFSLQLGDNEA 579
++ + E QFIPS + + LHQ N + S+NL+SL N ++I FSLQLG+++
Sbjct: 497 ESAAMIEEHQFIPSSSNNGYSLHQKINGPENLESKNLSSLKTNSSGDEIGFSLQLGESDT 556
Query: 580 KRRRS----DASTSKEESK 594
K+R+ D S EES+
Sbjct: 557 KKRKRTDSPDGSQEHEESR 575
>gi|302781773|ref|XP_002972660.1| hypothetical protein SELMODRAFT_98342 [Selaginella moellendorffii]
gi|302812925|ref|XP_002988149.1| hypothetical protein SELMODRAFT_127331 [Selaginella moellendorffii]
gi|300144255|gb|EFJ10941.1| hypothetical protein SELMODRAFT_127331 [Selaginella moellendorffii]
gi|300159261|gb|EFJ25881.1| hypothetical protein SELMODRAFT_98342 [Selaginella moellendorffii]
Length = 454
Score = 590 bits (1522), Expect = e-166, Method: Compositional matrix adjust.
Identities = 307/429 (71%), Positives = 351/429 (81%), Gaps = 14/429 (3%)
Query: 32 HPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRLSDAAEERANYFANLQKGVLPETL 91
HP S SPP LQ AS GQH ES+AAY WP +R++ AEERA YF+ LQK +T
Sbjct: 30 HPR--SESPP-LQAVASGGQHSESSAAYVLWP-PARINGTAEERAAYFSGLQKDAEMDTQ 85
Query: 92 GQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHK 151
++P GQQA+TLL+LMTIRAFHSK+LR YSLGTA+GFR + GVLT+IPAI+VFV+RKVHK
Sbjct: 86 QRVPSGQQASTLLDLMTIRAFHSKVLRRYSLGTALGFRTRAGVLTNIPAIIVFVARKVHK 145
Query: 152 QWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSG 211
QWL +Q LPTALEGPGGVWCDVDVVEFSY+GA TPKEQ+Y+++V+ LRG DP IGSG
Sbjct: 146 QWLLDVQRLPTALEGPGGVWCDVDVVEFSYYGASTVTPKEQIYSELVEGLRGNDPCIGSG 205
Query: 212 SQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGA 271
SQVASQETYGTLGAIV+SQTG+RQVGFLTNRHVAVDLDYPNQKMFHPLPP LGPGVYLGA
Sbjct: 206 SQVASQETYGTLGAIVRSQTGARQVGFLTNRHVAVDLDYPNQKMFHPLPPNLGPGVYLGA 265
Query: 272 VERATS----------FHHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIV 321
VERATS F P TFVRADGAFIPFA+ FD S V+ V LGE+G+V V
Sbjct: 266 VERATSFITDDLWYGIFAGMNPETFVRADGAFIPFAESFDTSKVSVRVHSLGELGEVFRV 325
Query: 322 DLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLE 381
DLQ+PI S++G+ VVKVGRSSGLT G ++AYA+EYNDEKGICF TDFL+VGEN+Q FDLE
Sbjct: 326 DLQAPIESIVGQHVVKVGRSSGLTKGIIMAYAVEYNDEKGICFFTDFLIVGENKQAFDLE 385
Query: 382 GDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELD 441
GDSGSLI M E E PRP+GIIWGGTANRGRLKL+ G PENWTSGVDLGRLL+LL+LD
Sbjct: 386 GDSGSLISMTWERCENPRPVGIIWGGTANRGRLKLRSGHGPENWTSGVDLGRLLDLLQLD 445
Query: 442 LITTDEGLK 450
LITT+ L+
Sbjct: 446 LITTETSLQ 454
>gi|413919512|gb|AFW59444.1| hypothetical protein ZEAMMB73_623071 [Zea mays]
Length = 516
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 319/578 (55%), Positives = 390/578 (67%), Gaps = 79/578 (13%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
RA+ SG S+ES LD E +CC P+ P P++QP S H E++AAYF WPTS+
Sbjct: 7 RAQLSGFAQSDESTLDVEGHCCHQPSFPC--SPSMQPIVSGCTHTENSAAYFLWPTSNLQ 64
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYFANL KG+LP+ +LPKGQQA +LL+LMTIRAFHSK
Sbjct: 65 HCAAEGRANYFANLSKGLLPKIGRRLPKGQQANSLLDLMTIRAFHSK------------- 111
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
GPGG+WCDVDVVEFSY+GAP T
Sbjct: 112 -------------------------------------GPGGIWCDVDVVEFSYYGAPAQT 134
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PK Q++T++VD L G D IGSGSQVASQ+T+GTLGAIVK +TG++ VGF+TNRHVAVDL
Sbjct: 135 PKVQIFTELVDKLCGSDECIGSGSQVASQDTFGTLGAIVKRRTGNKLVGFVTNRHVAVDL 194
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFAD 298
DYPNQKM+HPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFA
Sbjct: 195 DYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAH 254
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
DFD+STVTT+V+G+G+IGDVK++DLQ P++ LIG++V K+GRSSG TTGTV+AYALEYND
Sbjct: 255 DFDISTVTTTVRGVGDIGDVKVIDLQCPLNRLIGRRVCKIGRSSGHTTGTVMAYALEYND 314
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKI 418
EKGI F TD LVVGEN+QTFDLEGDSGSLI++ G++ EKPRPIGIIWGGTANRGRLKL+
Sbjct: 315 EKGISFFTDLLVVGENRQTFDLEGDSGSLIILTGQDSEKPRPIGIIWGGTANRGRLKLRC 374
Query: 419 GQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMH 478
P+NWTSGVDLGRLL+ LELDLI T E LK AVQ+QR A A A S G+SS
Sbjct: 375 DHGPQNWTSGVDLGRLLDRLELDLIITSESLKDAVQQQRRALAAAANSAAGESSTAAAPV 434
Query: 479 LKDKAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFT 538
L++K E+ FEPLG++I+ ++ H + + ++E E QFI +F
Sbjct: 435 LEEKVEEIFEPLGIKIE----QLRRHDVSASEAEEAAGINVE---------ERQFISNFV 481
Query: 539 GHSPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGD 576
G SP+ D+ + +A+L N +E++ L LGD
Sbjct: 482 GRSPVR----DDQGAPRQIANLNNPSEEELAMLLHLGD 515
>gi|168064147|ref|XP_001784026.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664412|gb|EDQ51132.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 272/406 (66%), Positives = 329/406 (81%), Gaps = 14/406 (3%)
Query: 58 AYFSWPTSSRLSDAAEERANYFANLQK--GVLPETLGQLPKGQQATTLLELMTIRAFHSK 115
AY WP S +L +++ERA F L+K GV+ G P+GQQA+TLLELMTIRA+HSK
Sbjct: 1 AYLLWPGSDQLLGSSDERAACFIGLEKSGGVMYND-GVTPRGQQASTLLELMTIRAYHSK 59
Query: 116 ILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVD 175
LR LGTA+GFR +RG LT IPAI+VFV+RKVH QWL +Q LP+++EGPGG+WCDVD
Sbjct: 60 SLRQCGLGTALGFRTRRGELTSIPAIIVFVARKVHTQWLHELQVLPSSVEGPGGLWCDVD 119
Query: 176 VVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQ 235
VVEFSYFG P PK+QL ++I+D LRG D +IGSG+QVASQETYGTLGA+V+SQTG RQ
Sbjct: 120 VVEFSYFGVPTMVPKKQLSSEILDGLRGMDATIGSGTQVASQETYGTLGALVQSQTGLRQ 179
Query: 236 VGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHH----------RRPLT 285
+GF+TNRHVAVDLDYP QKMFHPLPP LGPGVYLGAV+RATSF P T
Sbjct: 180 LGFITNRHVAVDLDYPCQKMFHPLPPNLGPGVYLGAVKRATSFVKDDLWYGIFAGMNPET 239
Query: 286 FVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLT 345
FVRADGAFIPF++ FD+S VTTS+KG+G +GDV VDLQS ISS++G++VVKVGRSSG+T
Sbjct: 240 FVRADGAFIPFSETFDISKVTTSIKGIGSMGDVYRVDLQSQISSIVGRKVVKVGRSSGVT 299
Query: 346 TGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGEN-GEKPRPIGII 404
G ++ YA+EYNDE GICFLTDFL+VGE ++ FDLEGDSGSLIL+ EN EK +P+G+I
Sbjct: 300 KGVIMGYAVEYNDENGICFLTDFLIVGEKKKNFDLEGDSGSLILLSSENETEKAQPVGLI 359
Query: 405 WGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLK 450
WGGTANRGRLKL+ PENWTSGVDLGRLL++L+LD+ITTD+ L+
Sbjct: 360 WGGTANRGRLKLRNEHGPENWTSGVDLGRLLDILQLDIITTDQNLR 405
>gi|168009441|ref|XP_001757414.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691537|gb|EDQ77899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 409
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 248/413 (60%), Positives = 308/413 (74%), Gaps = 15/413 (3%)
Query: 62 WPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYS 121
WPT + AE+RA +F++LQK + P+G QA TLL+LMTIRA HSK LRC+S
Sbjct: 1 WPTPRLQNGRAEQRATHFSSLQKKT--SCPSKRPRGHQAATLLDLMTIRALHSKTLRCFS 58
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
LGTA+GFRI+ GV TDIPAI+VFV+RKVH+ WL Q LP LEGPGGVWCDVDVVEFS
Sbjct: 59 LGTALGFRIRGGVQTDIPAIIVFVARKVHRHWLQEAQELPLILEGPGGVWCDVDVVEFSL 118
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
G+ P ++ +YT +V+ LRGGD +IGSGSQVA E YGTL AIV+S+TG QVGFLTN
Sbjct: 119 LGSQRP--QDPVYTDLVEGLRGGDATIGSGSQVACFELYGTLSAIVRSRTGLCQVGFLTN 176
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADG 291
RHVAV LD+P QK+FHPLPP LGPGVYLGAVER T+F P +FVRADG
Sbjct: 177 RHVAVSLDHPVQKLFHPLPPHLGPGVYLGAVERTTTFIRDDLWYGVFASTNPESFVRADG 236
Query: 292 AFIPFADDFDMST-VTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
AFIPF + D+ ++ VK +GEIG+V VDLQ+P++SLIGK V+KVGRSSG T G +L
Sbjct: 237 AFIPFDSNLDVRNFISPFVKSVGEIGEVISVDLQAPLNSLIGKHVIKVGRSSGFTEGCIL 296
Query: 351 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
AYALEYN++KG CF DFL+V ++ F+LEGD+GSLIL++GE GEKPRP+G++WGGT
Sbjct: 297 AYALEYNNDKGHCFFNDFLIVSDDNNAFELEGDTGSLILVRGEAGEKPRPVGVVWGGTTQ 356
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATA 463
+GRLKL + PENWTSGVDL RLL L+L ++T++E L A++ QR A +
Sbjct: 357 QGRLKLHKWKEPENWTSGVDLSRLLESLDLSIVTSNEALCEALEVQRQCRAAS 409
>gi|167999079|ref|XP_001752245.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696640|gb|EDQ82978.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 245/421 (58%), Positives = 310/421 (73%), Gaps = 15/421 (3%)
Query: 54 ESNAAYFSWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFH 113
E +A + WPTS + E RA +F LQK + + + P G QA TLL+LMTIRAFH
Sbjct: 2 EGSAHFVEWPTSQLQNGPVELRAIHFCTLQKQMSCSS--KWPHGYQAATLLDLMTIRAFH 59
Query: 114 SKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCD 173
SK LRCYSLG+A+GFRI+ GV TDIPAI+VFV+RKVH+ WL Q LP LEGPGG+WCD
Sbjct: 60 SKSLRCYSLGSALGFRIRGGVQTDIPAIIVFVARKVHRHWLYEAQELPLILEGPGGIWCD 119
Query: 174 VDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGS 233
VDVVEFS G P+P P E ++T++V+ L+G D +IGSGSQVA E YGTLGAIV+S+TG
Sbjct: 120 VDVVEFSLLG-PQP-PLEPVHTELVEGLQGRDATIGSGSQVACYELYGTLGAIVRSRTGL 177
Query: 234 RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF----------HHRRP 283
QVGFLTNRHVAV LD+P QK+F+PLPP LGPGVYLGAVER T+F P
Sbjct: 178 CQVGFLTNRHVAVSLDHPVQKLFYPLPPHLGPGVYLGAVERTTTFIRDDLWYGVFASMNP 237
Query: 284 LTFVRADGAFIPFADDFDMST-VTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSS 342
+F RADGAFIPF ++ D+ V+ SV+G+GEIG+V VDL +P++SLIGK V+KVGRSS
Sbjct: 238 ESFARADGAFIPFDNNLDVRNFVSPSVRGVGEIGEVMSVDLHAPLNSLIGKHVIKVGRSS 297
Query: 343 GLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIG 402
G+T G + AYA+EYN + G CF DFL+V ++ Q F+ EGDSGSLIL+ GE KPRPIG
Sbjct: 298 GVTKGCIFAYAVEYNSDIGHCFFNDFLIVSDDGQAFESEGDSGSLILVTGEAEGKPRPIG 357
Query: 403 IIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASAT 462
++WGGT ++GRLK + + PE WTSGVDL RLL+ LEL +++++E L A++ QR A
Sbjct: 358 MVWGGTTHQGRLKFQSWKEPEKWTSGVDLSRLLDSLELSIVSSNEALCEALEMQRQCLAA 417
Query: 463 A 463
+
Sbjct: 418 S 418
>gi|302760907|ref|XP_002963876.1| hypothetical protein SELMODRAFT_80513 [Selaginella moellendorffii]
gi|300169144|gb|EFJ35747.1| hypothetical protein SELMODRAFT_80513 [Selaginella moellendorffii]
Length = 372
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 226/367 (61%), Positives = 283/367 (77%), Gaps = 13/367 (3%)
Query: 94 LPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQW 153
+ G+QA TL ELM IRA H K+ R LGTA+GFR + +TD PAI+VFV+RK+H QW
Sbjct: 1 MGTGRQARTLRELMAIRAIHGKMFRRLGLGTALGFRTRDRQVTDRPAIIVFVARKLHAQW 60
Query: 154 LSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQ 213
+ Q LP+ ++GPG +WCDVDVVEFSY G PKEQ+Y+++V+ LRG D SIG GSQ
Sbjct: 61 VLDGQMLPSTVQGPGDLWCDVDVVEFSYHGTSSAAPKEQVYSELVECLRGDDQSIGPGSQ 120
Query: 214 VASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVE 273
VAS E YGT+GA+V+S+TG Q+GFLTNRHVAVDLD+P QKMFHPLPP LGPGVYLG VE
Sbjct: 121 VASLEVYGTMGAVVRSRTGEHQIGFLTNRHVAVDLDFPYQKMFHPLPPNLGPGVYLGTVE 180
Query: 274 RATSFHHRRPL----------TFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDL 323
RATSF T VRADGAF+PFA FD S+VT ++KG+GE+G++ ++L
Sbjct: 181 RATSFVTDDLWYGMFATCCSETVVRADGAFVPFAASFDSSSVTATIKGVGEVGELFTINL 240
Query: 324 QSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGD 383
PI++L+GK +KVGRSSGLT GTV+AY +EY+D+KG+CF TD LVVG+ Q FD EGD
Sbjct: 241 DDPIANLVGKAAIKVGRSSGLTRGTVVAYGVEYHDDKGVCFFTDLLVVGDGGQ-FDSEGD 299
Query: 384 SGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLI 443
SGS+IL+ +G+KPRP+G+IWGGT+NRGRLKL+ G PENWTSGVDLGRLL+LL+LD+I
Sbjct: 300 SGSMILLC--DGDKPRPVGMIWGGTSNRGRLKLRQGHEPENWTSGVDLGRLLDLLQLDII 357
Query: 444 TTDEGLK 450
+ D LK
Sbjct: 358 SNDLALK 364
>gi|302813186|ref|XP_002988279.1| hypothetical protein SELMODRAFT_42830 [Selaginella moellendorffii]
gi|300144011|gb|EFJ10698.1| hypothetical protein SELMODRAFT_42830 [Selaginella moellendorffii]
Length = 358
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 211/343 (61%), Positives = 264/343 (76%), Gaps = 13/343 (3%)
Query: 97 GQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSP 156
G+QA TL ELM IRA H K+ R LGTA+GFR + +TD PAI+VFV+RK+H QW+
Sbjct: 2 GRQAGTLRELMAIRAIHGKMFRRLGLGTALGFRTRDRQVTDRPAIIVFVARKLHAQWVLD 61
Query: 157 IQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVAS 216
Q LP+ ++GPG +WCDVDVVEFSY GA PKEQ+Y+++V+ LRG D +G GSQVAS
Sbjct: 62 GQMLPSTVQGPGDLWCDVDVVEFSYHGASSAAPKEQVYSELVECLRGDDQCVGPGSQVAS 121
Query: 217 QETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERAT 276
E YGT+GA+V+S+TG Q+GFLTNRHVAVDLD+P QKMFHPLPP LGPGVYLG VERAT
Sbjct: 122 LEVYGTMGAVVRSRTGEHQIGFLTNRHVAVDLDFPYQKMFHPLPPNLGPGVYLGTVERAT 181
Query: 277 SFHHRRPL----------TFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSP 326
SF T VRADGAF+PFA FD S+VT S+KG+GE+G++ ++L P
Sbjct: 182 SFVTDDLWYGMFATCCSETVVRADGAFVPFAASFDSSSVTASIKGVGEVGELFTINLDDP 241
Query: 327 ISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGS 386
I++L+GK +KVGRSSGLT GTV+AY +EY+D+KG+CF TD LVVG+ Q FD EGDSGS
Sbjct: 242 IANLVGKAAIKVGRSSGLTRGTVVAYGVEYHDDKGVCFFTDLLVVGDGGQ-FDSEGDSGS 300
Query: 387 LILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGV 429
+IL+ +G+KPRP+G+IWGGT+NRGRLKL+ G P+NWTSGV
Sbjct: 301 MILLC--DGDKPRPVGMIWGGTSNRGRLKLRQGHEPQNWTSGV 341
>gi|413919514|gb|AFW59446.1| hypothetical protein ZEAMMB73_623071 [Zea mays]
Length = 302
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 186/270 (68%), Positives = 220/270 (81%), Gaps = 2/270 (0%)
Query: 9 RARCSGSTPSEESALDFERNCCSHPNLPSLSPPTLQPFASAGQHCESNAAYFSWPTSSRL 68
RA+ SG S+ES LD E +CC P+ P P++QP S H E++AAYF WPTS+
Sbjct: 7 RAQLSGFAQSDESTLDVEGHCCHQPSFPC--SPSMQPIVSGCTHTENSAAYFLWPTSNLQ 64
Query: 69 SDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGF 128
AAE RANYFANL KG+LP+ +LPKGQQA +LL+LMTIRAFHSK+LRC+ LGTA+GF
Sbjct: 65 HCAAEGRANYFANLSKGLLPKIGRRLPKGQQANSLLDLMTIRAFHSKVLRCFGLGTAVGF 124
Query: 129 RIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT 188
RI++GVLTDIPAIL FV+RKVHK+WL P CLP L GPGG+WCDVDVVEFSY+GAP T
Sbjct: 125 RIRKGVLTDIPAILCFVARKVHKKWLDPAHCLPAILAGPGGIWCDVDVVEFSYYGAPAQT 184
Query: 189 PKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
PK Q++T++VD L G D IGSGSQVASQ+T+GTLGAIVK +TG++ VGF+TNRHVAVDL
Sbjct: 185 PKVQIFTELVDKLCGSDECIGSGSQVASQDTFGTLGAIVKRRTGNKLVGFVTNRHVAVDL 244
Query: 249 DYPNQKMFHPLPPTLGPGVYLGAVERATSF 278
DYPNQKM+HPLPP LGPGVYLGAVERATSF
Sbjct: 245 DYPNQKMYHPLPPNLGPGVYLGAVERATSF 274
>gi|215695330|dbj|BAG90521.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 342
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 225/347 (64%), Positives = 261/347 (75%), Gaps = 26/347 (7%)
Query: 255 MFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDMST 304
MFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADD+D+++
Sbjct: 1 MFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDYDITS 60
Query: 305 VTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICF 364
V TSVKG+G IGDVK +DLQSPISSLIG+QVVKVGRSSGLTTGTV+AYALEYNDEKGICF
Sbjct: 61 VNTSVKGVGVIGDVKAIDLQSPISSLIGRQVVKVGRSSGLTTGTVVAYALEYNDEKGICF 120
Query: 365 LTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPEN 424
TDFLVVGENQQTFDLEGDSGSLI++ G++GEKP+PIGIIWGGTANRGRLKLK GQ PEN
Sbjct: 121 FTDFLVVGENQQTFDLEGDSGSLIILTGKDGEKPQPIGIIWGGTANRGRLKLKSGQGPEN 180
Query: 425 WTSGVDLGRLLNLLELDLITTDEGLKVAVQEQR---AASATAIGSTVGDSSPPDGMHLKD 481
WTSGVDLGRLL+LLELDLITT EGL+ A++EQR AA+A A ST G+SSP G +
Sbjct: 181 WTSGVDLGRLLDLLELDLITTSEGLQEALEEQRIILAAAAAAANSTAGESSPVAGPQENE 240
Query: 482 KAEDKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSV-ELQFIPSFTGH 540
K + +EPLG+ IQ +P ++ + T P EFH+ D V+ +V E QF+ G
Sbjct: 241 KVDKIYEPLGINIQQLP--RDNSATSTGPD----EFHV-DTVEGVTNVEERQFL---IGM 290
Query: 541 SPLHQNNPSDKASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAS 587
SP + ++ NLA L N EDICFSL LG+ E KR RSD+S
Sbjct: 291 SPAREGQEAN-GDLNNLAELEN-SPEDICFSLHLGEREPKRLRSDSS 335
>gi|413919515|gb|AFW59447.1| hypothetical protein ZEAMMB73_623071 [Zea mays]
Length = 316
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 190/332 (57%), Positives = 235/332 (70%), Gaps = 27/332 (8%)
Query: 255 MFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDMST 304
M+HPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFA DFD+ST
Sbjct: 1 MYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAHDFDIST 60
Query: 305 VTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICF 364
VTT+V+G+G+IGDVK++DLQ P++ LIG++V K+GRSSG TTGTV+AYALEYNDEKGI F
Sbjct: 61 VTTTVRGVGDIGDVKVIDLQCPLNRLIGRRVCKIGRSSGHTTGTVMAYALEYNDEKGISF 120
Query: 365 LTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPEN 424
TD LVVGEN+QTFDLEGDSGSLI++ G++ EKPRPIGIIWGGTANRGRLKL+ P+N
Sbjct: 121 FTDLLVVGENRQTFDLEGDSGSLIILTGQDSEKPRPIGIIWGGTANRGRLKLRCDHGPQN 180
Query: 425 WTSGVDLGRLLNLLELDLITTDEGLKVAVQEQRAASATAIGSTVGDSSPPDGMHLKDKAE 484
WTSGVDLGRLL+ LELDLI T E LK AVQ+QR A A A S G+SS L++K E
Sbjct: 181 WTSGVDLGRLLDRLELDLIITSESLKDAVQQQRRALAAAANSAAGESSTAAAPVLEEKVE 240
Query: 485 DKFEPLGLQIQHIPVEVEHHSPETNPSLMETEFHLEDGVKAGPSVELQFIPSFTGHSPLH 544
+ FEPLG++I+ ++ H + + ++E E QFI +F G SP+
Sbjct: 241 EIFEPLGIKIE----QLRRHDVSASEAEEAAGINVE---------ERQFISNFVGRSPVR 287
Query: 545 QNNPSDKASSENLASLWNGCDEDICFSLQLGD 576
D+ + +A+L N +E++ L LGD
Sbjct: 288 ----DDQGAPRQIANLNNPSEEELAMLLHLGD 315
>gi|115460532|ref|NP_001053866.1| Os04g0615000 [Oryza sativa Japonica Group]
gi|113565437|dbj|BAF15780.1| Os04g0615000 [Oryza sativa Japonica Group]
Length = 207
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/206 (75%), Positives = 174/206 (84%), Gaps = 10/206 (4%)
Query: 255 MFHPLPPTLGPGVYLGAVERATSF----------HHRRPLTFVRADGAFIPFADDFDMST 304
MFHPLPP LGPGVYLGAVERATSF P TFVRADGAFIPFADDFD+ST
Sbjct: 1 MFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFDIST 60
Query: 305 VTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICF 364
VTT V+G+G+IGDVK++DLQ P++SLIG+QV KVGRSSG TTGTV+AYALEYNDEKGICF
Sbjct: 61 VTTVVRGVGDIGDVKVIDLQCPLNSLIGRQVCKVGRSSGHTTGTVMAYALEYNDEKGICF 120
Query: 365 LTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPEN 424
TD LVVGEN+QTFDLEGDSGSLI++ ++GEKPRPIGIIWGGTANRGRLKL PEN
Sbjct: 121 FTDILVVGENRQTFDLEGDSGSLIILTSQDGEKPRPIGIIWGGTANRGRLKLTSDHGPEN 180
Query: 425 WTSGVDLGRLLNLLELDLITTDEGLK 450
WTSGVDLGRLL+ LELD+I T+E L+
Sbjct: 181 WTSGVDLGRLLDRLELDIIITNESLQ 206
>gi|218195570|gb|EEC77997.1| hypothetical protein OsI_17387 [Oryza sativa Indica Group]
Length = 999
Score = 304 bits (779), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/172 (80%), Positives = 156/172 (90%)
Query: 107 MTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEG 166
MTIRAFHSKILR +SLGTA+GFRI++G LTDIPAILVFV+RKVHK+WL+P QCLP LEG
Sbjct: 1 MTIRAFHSKILRRFSLGTAVGFRIRKGDLTDIPAILVFVARKVHKKWLNPAQCLPAILEG 60
Query: 167 PGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAI 226
PGGVWCDVDVVEFSY+GAP TPKEQ+++++VD L G D IGSGSQVAS ET+GTLGAI
Sbjct: 61 PGGVWCDVDVVEFSYYGAPAQTPKEQMFSELVDKLCGSDECIGSGSQVASHETFGTLGAI 120
Query: 227 VKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF 278
VK +TG++QVGFLTN HVAVDLDYPNQKMFHPLPP LGPGVYLGAVERATSF
Sbjct: 121 VKRRTGNKQVGFLTNHHVAVDLDYPNQKMFHPLPPNLGPGVYLGAVERATSF 172
>gi|224286426|gb|ACN40920.1| unknown [Picea sitchensis]
Length = 170
Score = 192 bits (489), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 110/170 (64%), Positives = 122/170 (71%), Gaps = 8/170 (4%)
Query: 1 MDRTR-LNIRARCSGSTPSEESALDFER----NCCSHPNLPSLSPPTLQPFASAGQHCES 55
MD TR L + R SGS SEESALD E+ N HP S SPP LQ FAS GQ ES
Sbjct: 1 MDVTRALRLGRRYSGSMQSEESALDREQTVTGNSGRHPR--SDSPP-LQAFASGGQRSES 57
Query: 56 NAAYFSWPTSSRLSDAAEERANYFANLQKGVLPETLGQLPKGQQATTLLELMTIRAFHSK 115
+AA F WP S+RL+ AEERA YF +QK V ETL LP G QAT LL+LMTIRAFHSK
Sbjct: 58 SAACFRWPPSNRLNGTAEERAAYFGGIQKEVDSETLEHLPSGHQATALLDLMTIRAFHSK 117
Query: 116 ILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALE 165
ILR YSLGTAIGFRI+ GVLT+I AILVFV+RKVHKQWL +Q LP+ LE
Sbjct: 118 ILRRYSLGTAIGFRIREGVLTNILAILVFVARKVHKQWLLDVQRLPSVLE 167
>gi|357449481|ref|XP_003595017.1| Elongation factor 1-alpha [Medicago truncatula]
gi|355484065|gb|AES65268.1| Elongation factor 1-alpha [Medicago truncatula]
Length = 591
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/130 (58%), Positives = 83/130 (63%), Gaps = 14/130 (10%)
Query: 141 ILVFVSRKVHKQWLSP-IQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVD 199
IL K+H + L P Q L+GPGGVWCDVD+VE YF A +P PKEQ YT+IVD
Sbjct: 457 ILSTSRSKIHVEILHPGFQTSGNFLQGPGGVWCDVDMVEILYFSALDPVPKEQNYTEIVD 516
Query: 200 DLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFL-TNRHVAVDLDYPNQKMFHP 258
D RGGDP IGSGSQVASQ+TY TL VGFL T H VDLDY NQKMFHP
Sbjct: 517 DSRGGDPCIGSGSQVASQKTYRTL------------VGFLRTYCHAVVDLDYSNQKMFHP 564
Query: 259 LPPTLGPGVY 268
LP L VY
Sbjct: 565 LPHILSLEVY 574
>gi|357452683|ref|XP_003596618.1| Elongation factor 1-alpha [Medicago truncatula]
gi|355485666|gb|AES66869.1| Elongation factor 1-alpha [Medicago truncatula]
Length = 608
Score = 72.0 bits (175), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 36/62 (58%), Positives = 45/62 (72%), Gaps = 5/62 (8%)
Query: 194 YTQIVDDLRGGDPSIGSGSQVASQ-----ETYGTLGAIVKSQTGSRQVGFLTNRHVAVDL 248
YT+IVDDLRGG+P IGS SQ++ + +T G +SQTGSRQVGF T +HVA+DL
Sbjct: 547 YTEIVDDLRGGNPCIGSRSQMSEKSLVRSQTERNFGCTGRSQTGSRQVGFRTYQHVAIDL 606
Query: 249 DY 250
DY
Sbjct: 607 DY 608
>gi|419714426|ref|ZP_14241842.1| hypothetical protein S7W_08218 [Mycobacterium abscessus M94]
gi|382945545|gb|EIC69839.1| hypothetical protein S7W_08218 [Mycobacterium abscessus M94]
Length = 728
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 137/335 (40%), Gaps = 39/335 (11%)
Query: 99 QATTLLELMTIRAFHSKIL--RCYSLGTAIGFRIKR----GVLTDI----------PAIL 142
QA ++ +L+ R + L + +GTAIG + R G T + P ++
Sbjct: 16 QALSVTDLLAARDLYHHHLTNKPNVVGTAIGRYLIREQPGGARTLVNSRVEQGFSWPCVM 75
Query: 143 VFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLR 202
VF+S + L+P +P L P G V V+ TP+ + L
Sbjct: 76 VFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVDPAPVSTTTPRHPAPARWPTTLL 135
Query: 203 GGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV----AVDLDYPNQKMFHP 258
GG + V +Q T G +V + LTNRHV ++D M
Sbjct: 136 GG--GLPVVVDVQNQSHTATAGCLVSD---GHSLYALTNRHVCGPAGQEID-----MVRG 185
Query: 259 LPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGAFIPFADDFDMSTVTTSVKGLGE 314
L + GV G F P T++ D I D D T++ G+G+
Sbjct: 186 LARSR-IGVSSGQQLTRLPFGEVYPFSMTNTYLTLD---IGLVDVDDAGDWTSTAYGIGD 241
Query: 315 IGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGEN 374
IG + + LIG+ VV G SSGL G V+A Y G +++DFL+ +
Sbjct: 242 IGPMVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSVGGSEYVSDFLIAPDP 301
Query: 375 QQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
Q + + GDSG ++ EN +P P+ + WGG A
Sbjct: 302 QGSQTVPGDSG-MVWHLTENRARPAPLAVEWGGQA 335
>gi|420864658|ref|ZP_15328047.1| hypothetical protein MA4S0303_3019 [Mycobacterium abscessus
4S-0303]
gi|420869447|ref|ZP_15332829.1| hypothetical protein MA4S0726RA_2952 [Mycobacterium abscessus
4S-0726-RA]
gi|420873892|ref|ZP_15337268.1| hypothetical protein MA4S0726RB_2542 [Mycobacterium abscessus
4S-0726-RB]
gi|420990095|ref|ZP_15453251.1| hypothetical protein MA4S0206_3037 [Mycobacterium abscessus
4S-0206]
gi|421042016|ref|ZP_15505024.1| hypothetical protein MA4S0116R_2995 [Mycobacterium abscessus
4S-0116-R]
gi|421044246|ref|ZP_15507246.1| hypothetical protein MA4S0116S_2090 [Mycobacterium abscessus
4S-0116-S]
gi|392063374|gb|EIT89223.1| hypothetical protein MA4S0303_3019 [Mycobacterium abscessus
4S-0303]
gi|392065367|gb|EIT91215.1| hypothetical protein MA4S0726RB_2542 [Mycobacterium abscessus
4S-0726-RB]
gi|392068917|gb|EIT94764.1| hypothetical protein MA4S0726RA_2952 [Mycobacterium abscessus
4S-0726-RA]
gi|392184374|gb|EIV10025.1| hypothetical protein MA4S0206_3037 [Mycobacterium abscessus
4S-0206]
gi|392222944|gb|EIV48467.1| hypothetical protein MA4S0116R_2995 [Mycobacterium abscessus
4S-0116-R]
gi|392233699|gb|EIV59197.1| hypothetical protein MA4S0116S_2090 [Mycobacterium abscessus
4S-0116-S]
Length = 728
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 137/335 (40%), Gaps = 39/335 (11%)
Query: 99 QATTLLELMTIRAFHSKIL--RCYSLGTAIGFRIKR----GVLTDI----------PAIL 142
QA ++ +L+ R + L + +GTAIG + R G T + P ++
Sbjct: 16 QALSVTDLLAARDLYHHHLTNKPNVVGTAIGRYLIREQPGGARTLVNSRVEQGFSWPCVM 75
Query: 143 VFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLR 202
VF+S + L+P +P L P G V V+ TP+ + L
Sbjct: 76 VFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVDPAPVSTTTPRHPAPARWPTTLL 135
Query: 203 GGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV----AVDLDYPNQKMFHP 258
GG + V +Q T G +V + + LTNRHV ++D M
Sbjct: 136 GG--GLPVVVDVQNQSHTATAGCLV---SDGHSLYALTNRHVCGPAGQEID-----MVRG 185
Query: 259 LPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGAFIPFADDFDMSTVTTSVKGLGE 314
L + GV G F P T++ D I D D T++ G+G+
Sbjct: 186 LARSR-IGVSSGQQLTRLPFGEVYPFSMTNTYLTLD---IGLVDVDDAGDWTSTAYGIGD 241
Query: 315 IGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGEN 374
IG + + LIG+ VV G SSGL G V+A Y G +++DFL+ +
Sbjct: 242 IGPMVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSVGGSEYVSDFLIAPDP 301
Query: 375 QQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
Q + GDSG ++ EN +P P+ + WGG A
Sbjct: 302 QGPQTVPGDSG-MVWHLTENRARPAPLAVEWGGQA 335
>gi|419709529|ref|ZP_14236997.1| hypothetical protein OUW_08328 [Mycobacterium abscessus M93]
gi|382943410|gb|EIC67724.1| hypothetical protein OUW_08328 [Mycobacterium abscessus M93]
Length = 728
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 137/335 (40%), Gaps = 39/335 (11%)
Query: 99 QATTLLELMTIRAFHSKIL--RCYSLGTAIGFRIKR----GVLTDI----------PAIL 142
QA ++ +L+ R + L + +GTAIG + R G T + P ++
Sbjct: 16 QALSVTDLLAARDLYHHHLTNKPNVVGTAIGRYLIREQPGGARTLVNSRVEQGFSWPCVM 75
Query: 143 VFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLR 202
VF+S + L+P +P L P G V V+ TP+ + L
Sbjct: 76 VFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVDPAPVSTTTPRHPAPARWPTTLL 135
Query: 203 GGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV----AVDLDYPNQKMFHP 258
GG + V +Q T G +V + + LTNRHV ++D M
Sbjct: 136 GG--GLPVVVDVQNQSHTATAGCLV---SDGHSLYALTNRHVCGPAGQEID-----MVRG 185
Query: 259 LPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGAFIPFADDFDMSTVTTSVKGLGE 314
L + GV G F P T++ D I D D T++ G+G+
Sbjct: 186 LARSR-IGVSSGQQLTRLPFGEVYPFSMTNTYLTLD---IGLVDVDDAGDWTSTAYGIGD 241
Query: 315 IGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGEN 374
IG + + LIG+ VV G SSGL G V+A Y G +++DFL+ +
Sbjct: 242 IGPMVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSVGGSEYVSDFLIAPDP 301
Query: 375 QQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
Q + GDSG ++ EN +P P+ + WGG A
Sbjct: 302 QGPQTVPGDSG-MVWHLTENRARPAPLAVEWGGQA 335
>gi|388511095|gb|AFK43612.1| unknown [Medicago truncatula]
Length = 99
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 58/98 (59%), Gaps = 7/98 (7%)
Query: 494 IQHIPVEVEHHSPET--NPSLMETEFHLEDGVKAGPSVELQFI-PSFTGHSPLHQNNPSD 550
++H+PVE P T PSL EFH+ + ++ P+VE QFI SF G SP+HQ+ +
Sbjct: 1 MEHVPVE----EPSTIVKPSLRPCEFHIRNEIETVPNVEHQFIRTSFAGKSPVHQSFLKE 56
Query: 551 KASSENLASLWNGCDEDICFSLQLGDNEAKRRRSDAST 588
++L+ L N DED SL LG+ EAKRR+ S+
Sbjct: 57 DMQFKSLSELRNEPDEDNFVSLHLGEPEAKRRKHSNSS 94
>gi|418247622|ref|ZP_12874008.1| hypothetical protein MAB47J26_03320 [Mycobacterium abscessus 47J26]
gi|420932347|ref|ZP_15395622.1| hypothetical protein MM1S1510930_3180 [Mycobacterium massiliense
1S-151-0930]
gi|420939252|ref|ZP_15402521.1| hypothetical protein MM1S1520914_3384 [Mycobacterium massiliense
1S-152-0914]
gi|420952865|ref|ZP_15416108.1| hypothetical protein MM2B0626_3102 [Mycobacterium massiliense
2B-0626]
gi|420957036|ref|ZP_15420272.1| hypothetical protein MM2B0107_2440 [Mycobacterium massiliense
2B-0107]
gi|420962692|ref|ZP_15425916.1| hypothetical protein MM2B1231_3167 [Mycobacterium massiliense
2B-1231]
gi|420992988|ref|ZP_15456134.1| hypothetical protein MM2B0307_2407 [Mycobacterium massiliense
2B-0307]
gi|420998760|ref|ZP_15461896.1| hypothetical protein MM2B0912R_3420 [Mycobacterium massiliense
2B-0912-R]
gi|421003282|ref|ZP_15466405.1| hypothetical protein MM2B0912S_3107 [Mycobacterium massiliense
2B-0912-S]
gi|353452115|gb|EHC00509.1| hypothetical protein MAB47J26_03320 [Mycobacterium abscessus 47J26]
gi|392137106|gb|EIU62843.1| hypothetical protein MM1S1510930_3180 [Mycobacterium massiliense
1S-151-0930]
gi|392144767|gb|EIU70492.1| hypothetical protein MM1S1520914_3384 [Mycobacterium massiliense
1S-152-0914]
gi|392156377|gb|EIU82080.1| hypothetical protein MM2B0626_3102 [Mycobacterium massiliense
2B-0626]
gi|392179090|gb|EIV04742.1| hypothetical protein MM2B0307_2407 [Mycobacterium massiliense
2B-0307]
gi|392184901|gb|EIV10551.1| hypothetical protein MM2B0912R_3420 [Mycobacterium massiliense
2B-0912-R]
gi|392193854|gb|EIV19475.1| hypothetical protein MM2B0912S_3107 [Mycobacterium massiliense
2B-0912-S]
gi|392245605|gb|EIV71082.1| hypothetical protein MM2B1231_3167 [Mycobacterium massiliense
2B-1231]
gi|392251846|gb|EIV77317.1| hypothetical protein MM2B0107_2440 [Mycobacterium massiliense
2B-0107]
Length = 726
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 143/339 (42%), Gaps = 48/339 (14%)
Query: 99 QATTLLELMTIRAFHSKIL--RCYSLGTAIGFRIKR----GVLTDI----------PAIL 142
QA ++ +L+ R + L + +GTAIG + R G T + P ++
Sbjct: 15 QALSVTDLLAARDLYHHHLTNKPNVVGTAIGRYLIREQPGGARTLVNSRVEQGFSWPCVM 74
Query: 143 VFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEP---TPKEQLYTQIVD 199
VF+S + L+P +P L P G V V+ P P TP+ +
Sbjct: 75 VFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVD----PAPVSTTPRHPAPARWPT 130
Query: 200 DLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV----AVDLDYPNQKM 255
L GG I V +Q T G +V + S + LTNRHV ++D M
Sbjct: 131 TLLGGGLPIVV--DVQNQSHTATAGCLV---SDSHSLYALTNRHVCGPAGQEID-----M 180
Query: 256 FHPLPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGAFIPFADDFDMSTVTTSVKG 311
L + GV G F P T++ D I D D T++ G
Sbjct: 181 VRGLARSR-VGVSSGQQLTRLPFGEVYPFSMTNTYLTLD---IGLVDVDDAGDWTSTAYG 236
Query: 312 LGEIGD-VKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLV 370
+G+IG V D+ + + LIG+ VV G SSGL G V+A Y G +++DFL+
Sbjct: 237 IGDIGPMVDTGDMTNGLD-LIGQPVVAHGASSGLVAGKVMALFYRYKSMGGSEYVSDFLI 295
Query: 371 VGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
+ Q + GDSG ++ E+ +P P+ + WGG A
Sbjct: 296 APDPQGPQTVPGDSG-MVWHLTEDRARPAPLAVEWGGQA 333
>gi|169630314|ref|YP_001703963.1| hypothetical protein MAB_3233 [Mycobacterium abscessus ATCC 19977]
gi|420910850|ref|ZP_15374162.1| hypothetical protein MA6G0125R_2366 [Mycobacterium abscessus
6G-0125-R]
gi|420917303|ref|ZP_15380606.1| hypothetical protein MA6G0125S_3405 [Mycobacterium abscessus
6G-0125-S]
gi|420922468|ref|ZP_15385764.1| hypothetical protein MA6G0728S_3090 [Mycobacterium abscessus
6G-0728-S]
gi|420928131|ref|ZP_15391411.1| hypothetical protein MA6G1108_3333 [Mycobacterium abscessus
6G-1108]
gi|420967738|ref|ZP_15430942.1| hypothetical protein MM3A0810R_3493 [Mycobacterium abscessus
3A-0810-R]
gi|420978471|ref|ZP_15441648.1| hypothetical protein MA6G0212_3393 [Mycobacterium abscessus
6G-0212]
gi|420983854|ref|ZP_15447021.1| hypothetical protein MA6G0728R_3335 [Mycobacterium abscessus
6G-0728-R]
gi|421008973|ref|ZP_15472083.1| hypothetical protein MA3A0119R_3393 [Mycobacterium abscessus
3A-0119-R]
gi|421013827|ref|ZP_15476905.1| hypothetical protein MA3A0122R_3404 [Mycobacterium abscessus
3A-0122-R]
gi|421018771|ref|ZP_15481828.1| hypothetical protein MA3A0122S_2998 [Mycobacterium abscessus
3A-0122-S]
gi|421024437|ref|ZP_15487481.1| hypothetical protein MA3A0731_3523 [Mycobacterium abscessus
3A-0731]
gi|421030220|ref|ZP_15493251.1| hypothetical protein MA3A0930R_3458 [Mycobacterium abscessus
3A-0930-R]
gi|421035683|ref|ZP_15498701.1| hypothetical protein MA3A0930S_3391 [Mycobacterium abscessus
3A-0930-S]
gi|169242281|emb|CAM63309.1| Conserved hypothetical protein [Mycobacterium abscessus]
gi|392110194|gb|EIU35964.1| hypothetical protein MA6G0125S_3405 [Mycobacterium abscessus
6G-0125-S]
gi|392112844|gb|EIU38613.1| hypothetical protein MA6G0125R_2366 [Mycobacterium abscessus
6G-0125-R]
gi|392127121|gb|EIU52871.1| hypothetical protein MA6G0728S_3090 [Mycobacterium abscessus
6G-0728-S]
gi|392129249|gb|EIU54996.1| hypothetical protein MA6G1108_3333 [Mycobacterium abscessus
6G-1108]
gi|392162749|gb|EIU88438.1| hypothetical protein MA6G0212_3393 [Mycobacterium abscessus
6G-0212]
gi|392168850|gb|EIU94528.1| hypothetical protein MA6G0728R_3335 [Mycobacterium abscessus
6G-0728-R]
gi|392197121|gb|EIV22737.1| hypothetical protein MA3A0119R_3393 [Mycobacterium abscessus
3A-0119-R]
gi|392200682|gb|EIV26287.1| hypothetical protein MA3A0122R_3404 [Mycobacterium abscessus
3A-0122-R]
gi|392207401|gb|EIV32978.1| hypothetical protein MA3A0122S_2998 [Mycobacterium abscessus
3A-0122-S]
gi|392211234|gb|EIV36800.1| hypothetical protein MA3A0731_3523 [Mycobacterium abscessus
3A-0731]
gi|392223440|gb|EIV48962.1| hypothetical protein MA3A0930R_3458 [Mycobacterium abscessus
3A-0930-R]
gi|392224178|gb|EIV49699.1| hypothetical protein MA3A0930S_3391 [Mycobacterium abscessus
3A-0930-S]
gi|392250245|gb|EIV75719.1| hypothetical protein MM3A0810R_3493 [Mycobacterium abscessus
3A-0810-R]
Length = 728
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 92/336 (27%), Positives = 140/336 (41%), Gaps = 41/336 (12%)
Query: 99 QATTLLELMTIRAFHSKIL--RCYSLGTAIGFRIKR----GVLTDI----------PAIL 142
QA ++ +L+ R + L + +GTAIG + R G T + P ++
Sbjct: 16 QALSVTDLLAARDLYHHHLTNKPNVVGTAIGRYLIREQPGGARTLVNSRVEQGFSWPCVM 75
Query: 143 VFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLR 202
VF+S + L+P +P L P G V V+ TP+ + L
Sbjct: 76 VFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVDPAPVSTTTPRHPAPARWPTTLL 135
Query: 203 GGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV----AVDLDYPNQKMFHP 258
GG + V +Q T G +V + + LTNRHV ++D M
Sbjct: 136 GG--GLPVVVDVQNQSHTATAGCLV---SDGHSLYALTNRHVCGPAGQEID-----MVRG 185
Query: 259 LPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGAFIPFADDFDMSTVTTSVKGLGE 314
L + GV G F P T++ D I D D T++ G+G+
Sbjct: 186 LARSR-IGVSSGQQLTRLPFGEVYPFSMTNTYLTLD---IGLVDVDDAGDWTSTAYGIGD 241
Query: 315 IGD-VKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGE 373
IG V D+ + + LIG+ VV G SSGL G V+A Y G +++DFL+ +
Sbjct: 242 IGPMVDTGDMTNGL-DLIGQPVVAHGASSGLVGGKVMALFYRYKSVGGSEYVSDFLIAPD 300
Query: 374 NQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
Q + GDSG ++ EN +P P+ + WGG A
Sbjct: 301 PQGPQTVPGDSG-MVWHLTENRARPAPLAVEWGGQA 335
>gi|420942606|ref|ZP_15405862.1| hypothetical protein MM1S1530915_2728 [Mycobacterium massiliense
1S-153-0915]
gi|420948873|ref|ZP_15412123.1| hypothetical protein MM1S1540310_2737 [Mycobacterium massiliense
1S-154-0310]
gi|392147703|gb|EIU73421.1| hypothetical protein MM1S1530915_2728 [Mycobacterium massiliense
1S-153-0915]
gi|392155903|gb|EIU81609.1| hypothetical protein MM1S1540310_2737 [Mycobacterium massiliense
1S-154-0310]
Length = 716
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 93/338 (27%), Positives = 140/338 (41%), Gaps = 46/338 (13%)
Query: 99 QATTLLELMTIRAFHSKIL--RCYSLGTAIGFRIKR----GVLTDI----------PAIL 142
QA ++ +L+ R + L + +GTAIG + R G T + P ++
Sbjct: 5 QALSVTDLLAARDLYHHHLTNKPNVVGTAIGRYLIREQPGGARTLVNSRVEQGFSWPCVM 64
Query: 143 VFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEP---TPKEQLYTQIVD 199
VF+S + L+P +P L P G V V+ P P TP+ +
Sbjct: 65 VFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVD----PAPVSTTPRHPAPARWPT 120
Query: 200 DLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV----AVDLDYPNQKM 255
L GG I V +Q T G +V + S + LTNRHV ++D M
Sbjct: 121 TLLGGGLPIVV--DVQNQSHTATAGCLV---SDSHSLYALTNRHVCGPAGQEID-----M 170
Query: 256 FHPLPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGAFIPFADDFDMSTVTTSVKG 311
L + GV G F P T++ D I D D T++ G
Sbjct: 171 VRGLARSR-VGVSSGQQLTRLPFGEVYPFSMTNTYLTLD---IGLVDVDDAGDWTSTAYG 226
Query: 312 LGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVV 371
+G+IG + + LIG+ VV G SSGL G V+A Y G +++DFL+
Sbjct: 227 IGDIGPMVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSMGGSEYVSDFLIA 286
Query: 372 GENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
+ Q + GDSG ++ E+ +P P+ + WGG A
Sbjct: 287 PDPQGPQTVPGDSG-MVWHLTEDRARPAPLAVEWGGQA 323
>gi|418421347|ref|ZP_12994521.1| hypothetical protein MBOL_30670 [Mycobacterium abscessus subsp.
bolletii BD]
gi|363996427|gb|EHM17642.1| hypothetical protein MBOL_30670 [Mycobacterium abscessus subsp.
bolletii BD]
Length = 728
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 136/335 (40%), Gaps = 39/335 (11%)
Query: 99 QATTLLELMTIRAFHSKIL--RCYSLGTAIGFRIKR----GVLTDI----------PAIL 142
QA ++ +L+ R + L + +GTAIG + R G T + P ++
Sbjct: 16 QALSVTDLLAARDLYHHHLTNKPNVVGTAIGRYLIREQPGGARTLVNSRVEQGFSWPCVM 75
Query: 143 VFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLR 202
VF+S + L+P +P L P G V V+ TP+ + L
Sbjct: 76 VFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVDPAPVSTTTPRHPAPARWPTTLL 135
Query: 203 GGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV----AVDLDYPNQKMFHP 258
GG I V +Q T G +V + LTNRHV ++D M
Sbjct: 136 GGGLPI--VVDVQNQSHTATAGCLVSD---GHSLYALTNRHVCGPAGQEID-----MVRG 185
Query: 259 LPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGAFIPFADDFDMSTVTTSVKGLGE 314
L + GV G F P T++ D I D D T++ G+G+
Sbjct: 186 LARSR-IGVSSGQQLTRLPFGEVYPFSMTNTYLTLD---IGLVDVDDAGDWTSTAYGIGD 241
Query: 315 IGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGEN 374
IG + + LIG+ VV G SSGL G V+A Y G +++DFL+ +
Sbjct: 242 IGPMVDTGDMTNGLDLIGRPVVAHGASSGLVAGKVMALFYRYKSVGGSEYVSDFLIAPDP 301
Query: 375 QQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
Q + GDSG ++ E+ +P P+ + WGG A
Sbjct: 302 QGPQTVPGDSG-MVWHLTEDRARPGPLAVEWGGQA 335
>gi|365871159|ref|ZP_09410700.1| hypothetical protein MMAS_31020 [Mycobacterium massiliense CCUG
48898 = JCM 15300]
gi|421050237|ref|ZP_15513231.1| hypothetical protein MMCCUG48898_3242 [Mycobacterium massiliense
CCUG 48898 = JCM 15300]
gi|363994962|gb|EHM16180.1| hypothetical protein MMAS_31020 [Mycobacterium massiliense CCUG
48898 = JCM 15300]
gi|392238840|gb|EIV64333.1| hypothetical protein MMCCUG48898_3242 [Mycobacterium massiliense
CCUG 48898]
Length = 727
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 126/298 (42%), Gaps = 34/298 (11%)
Query: 124 TAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFG 183
T + R+K+G P ++VF+S + L+P +P L P G V V+
Sbjct: 59 TLVNSRVKQGF--SWPCVMVFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVD--- 113
Query: 184 APEP---TPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
P P TP+ + L GG I V +Q T G +V + LT
Sbjct: 114 -PAPVSTTPRHPAPARWPTTLLGGGLPIVV--DVQNQSHTATAGCLVSD---GHSLYALT 167
Query: 241 NRHV----AVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGA 292
NRHV ++D M L + GV G F P T++ D
Sbjct: 168 NRHVCGPAGQEID-----MVRGLARSR-VGVSSGQQLTRLPFGEVYPFSMTNTYLTLD-- 219
Query: 293 FIPFADDFDMSTVTTSVKGLGEIGD-VKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA 351
I D D T++ G+G+IG V D+ + + LIG+ VV G SSGL G V+A
Sbjct: 220 -IGLVDVDDAGDWTSTAYGIGDIGPMVDTGDMTNGLD-LIGQPVVAHGASSGLVAGKVMA 277
Query: 352 YALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
Y G +++DFL+ + Q + GDSG ++ E+ +P P+ + WGG A
Sbjct: 278 LFYRYKSMGGSEYVSDFLIAPDPQGPQTVPGDSG-MVWHLTEDRARPAPLAVEWGGQA 334
>gi|331269877|ref|YP_004396369.1| hypothetical protein CbC4_1696 [Clostridium botulinum BKT015925]
gi|329126427|gb|AEB76372.1| hypothetical protein CbC4_1696 [Clostridium botulinum BKT015925]
Length = 313
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 133/295 (45%), Gaps = 39/295 (13%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS- 180
+G +G ++K G+ T I VFV+RK+ + L +PT + G+ DV+ ++ +
Sbjct: 29 VGVGLGIKLKNGIDTGQNCIKVFVTRKLPQNSLCKNALVPTLYQ---GIITDVEEIQNNN 85
Query: 181 -YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
Y+ + +T+ V GG +IG S V +G+LG IVK G + F
Sbjct: 86 LYYPKNNFSSMNNPFTKRVRPTPGGY-AIGPASNV----LFGSLGCIVKDDMGKHYL-FS 139
Query: 240 TNRHVAVDLDYP-NQKMFHPLPPTLG--PGVYLGAVERATSFHHRRPLTFVRADGAFIPF 296
+ + D P ++ P P G P +G + + PL F A+ A
Sbjct: 140 SAHVLTADYTVPLGTEIIQPSYPFHGHAPNDTIGTLYKYI------PLNFTGANFADAGI 193
Query: 297 ADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALE- 355
A D+S V+ V IGD+K V L P+ L V K G +GLT GT+ + +
Sbjct: 194 ALVSDLSKVSNKV---ALIGDIKGVSL--PVLRL---SVKKTGYKTGLTKGTIKSIGVTR 245
Query: 356 -YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
Y+ E G + L++ N GDSGS++ +N K IGI++GG A
Sbjct: 246 LYSYEHGAVLFKN-LILTSNMSN---PGDSGSILF---DNSNK--AIGILFGGDA 291
>gi|331271091|ref|YP_004385800.1| hypothetical protein CbC4_6003 [Clostridium botulinum BKT015925]
gi|329127586|gb|AEB77528.1| hypothetical protein CbC4_6003 [Clostridium botulinum BKT015925]
Length = 313
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/289 (28%), Positives = 124/289 (42%), Gaps = 61/289 (21%)
Query: 120 YSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEF 179
Y +G A+G++IK G +T+ I VFVS+KV L + +P +G + DVVE
Sbjct: 34 YIVGIALGYKIKNGFITNKKCIKVFVSKKVPLSNLYEHEVIPKFFKG-----IETDVVES 88
Query: 180 SYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQ-VASQETYGTLGAIVKSQTGSRQVGF 238
F A E T K + P IG S V++ G++G +V T R
Sbjct: 89 GKFSAAEFTGKVR-------------PVIGGYSIGVSNILRVGSMGCLV---TDGRYKYI 132
Query: 239 LTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLT-FVRADGAFIPFA 297
LTN H+ DL+ K+ P+ + PG Y G P T V +IP
Sbjct: 133 LTNNHIIADLN--KVKIGTPI---IQPGRYDGG----------NPNTDIVAILSKYIPLK 177
Query: 298 DDFDMSTVTTSVKGLGEIGDVKIVDL-------------QSPISSLIGKQVVKVGRSSGL 344
+ + TS + K++D Q P+ +IGK+V KVGRS+ +
Sbjct: 178 TE----GIITSPTNYMDCAIAKLIDESLVSPKIAIVGAPQEPMIPIIGKEVKKVGRSTEM 233
Query: 345 TTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDL--EGDSGSLILMK 391
TTG + + + I F + + E T + GDSGS++L K
Sbjct: 234 TTGRI----TDIDGTFHIKFGSKIFLFEEQIVTTCMCESGDSGSILLYK 278
>gi|414582515|ref|ZP_11439655.1| hypothetical protein MA5S1215_2581 [Mycobacterium abscessus
5S-1215]
gi|420880944|ref|ZP_15344311.1| hypothetical protein MA5S0304_2543 [Mycobacterium abscessus
5S-0304]
gi|420884687|ref|ZP_15348047.1| hypothetical protein MA5S0421_2798 [Mycobacterium abscessus
5S-0421]
gi|420890907|ref|ZP_15354254.1| hypothetical protein MA5S0422_3719 [Mycobacterium abscessus
5S-0422]
gi|420896690|ref|ZP_15360029.1| hypothetical protein MA5S0708_2471 [Mycobacterium abscessus
5S-0708]
gi|420901021|ref|ZP_15364352.1| hypothetical protein MA5S0817_2089 [Mycobacterium abscessus
5S-0817]
gi|420904996|ref|ZP_15368314.1| hypothetical protein MA5S1212_2226 [Mycobacterium abscessus
5S-1212]
gi|420973119|ref|ZP_15436311.1| hypothetical protein MA5S0921_3501 [Mycobacterium abscessus
5S-0921]
gi|392078167|gb|EIU03994.1| hypothetical protein MA5S0422_3719 [Mycobacterium abscessus
5S-0422]
gi|392080450|gb|EIU06276.1| hypothetical protein MA5S0421_2798 [Mycobacterium abscessus
5S-0421]
gi|392085853|gb|EIU11678.1| hypothetical protein MA5S0304_2543 [Mycobacterium abscessus
5S-0304]
gi|392096002|gb|EIU21797.1| hypothetical protein MA5S0708_2471 [Mycobacterium abscessus
5S-0708]
gi|392098382|gb|EIU24176.1| hypothetical protein MA5S0817_2089 [Mycobacterium abscessus
5S-0817]
gi|392102900|gb|EIU28686.1| hypothetical protein MA5S1212_2226 [Mycobacterium abscessus
5S-1212]
gi|392117667|gb|EIU43435.1| hypothetical protein MA5S1215_2581 [Mycobacterium abscessus
5S-1215]
gi|392164670|gb|EIU90358.1| hypothetical protein MA5S0921_3501 [Mycobacterium abscessus
5S-0921]
Length = 716
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 138/338 (40%), Gaps = 46/338 (13%)
Query: 99 QATTLLELMTIRAFHSKIL--RCYSLGTAIGFRIKR----GVLTDI----------PAIL 142
QA ++ +L+ R + L + +GTAIG + R G T + P ++
Sbjct: 5 QALSVTDLLAARDLYHHHLTNKPNVVGTAIGRYLIREQPGGARTLVNSRVEQGFSWPCVM 64
Query: 143 VFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEP---TPKEQLYTQIVD 199
VF+S + L+P +P L P G V V+ P P TP+ +
Sbjct: 65 VFISDWAAPKSLTPYDYVPKQLFMPDGRVVPVCKVQVD----PAPVSTTPRHPAPARWPT 120
Query: 200 DLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV----AVDLDYPNQKM 255
L GG I V +Q T G +V + LTNRHV ++D M
Sbjct: 121 TLLGGGLPIVV--DVQNQSHTATAGCLVSD---GHSLYALTNRHVCGPAGQEID-----M 170
Query: 256 FHPLPPTLGPGVYLGAVERATSFHHRRPL----TFVRADGAFIPFADDFDMSTVTTSVKG 311
L + GV G F P T++ D I D D T++ G
Sbjct: 171 VRGLARSR-VGVSSGQQLTRLPFGEVYPFSMTNTYLTLD---IGLVDVDDAGDWTSTAYG 226
Query: 312 LGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVV 371
+G+IG + + LIG+ VV G SSGL G V+A Y G +++DFL+
Sbjct: 227 IGDIGPMVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSMGGSEYVSDFLIA 286
Query: 372 GENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
+ Q + GDSG ++ E+ +P P+ + WGG A
Sbjct: 287 PDPQGPQTVPGDSG-MVWHLTEDRARPAPLAVEWGGQA 323
>gi|83595940|gb|ABC25300.1| hypothetical protein [uncultured marine bacterium Ant24C4]
Length = 396
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 116/260 (44%), Gaps = 36/260 (13%)
Query: 177 VEFSYFGAPE-PTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQ 235
+ +S+ G P+ +P Q + Q V + +GG + GS GTLGAIVK ++G+
Sbjct: 131 INYSHGGVPQVKSPSTQPHVQPVTE-KGGIIACGSSINPVDIVGAGTLGAIVKDKSGAFY 189
Query: 236 VGFLTNRHVAVDLDYPNQKMFHPLPPTLGPG---VYLGAVERATSFHHRRPLTFVRADGA 292
LTN HV+ +Y P P L PG A++ T H+ L FV
Sbjct: 190 --GLTNNHVSGGCNYS-----APEIPILCPGPLDAKNCAIDPFTIGRHKNLLQFVDGLPE 242
Query: 293 FIPFADDFDMSTV-------TTSVKGLGEIGDVKIVDLQSPISSLIGK-QVVKVGRSSGL 344
+ + + D + +S +GL + D I +G +V K GR++GL
Sbjct: 243 NVDISKNSDAAIFALSKPDRVSSYQGLSQ-------DTPKHIGVPMGMMKVTKHGRTTGL 295
Query: 345 TTGTVLA-------YALEYNDEKGICFLTD-FLVVGENQQTFDLEGDSGSLILMKGENGE 396
T G ++ A Y + K + + D +L+ EN + F GDSGSL++ G+
Sbjct: 296 TRGKIIGISASPIDVAYSYGNMKKVVYFDDVWLIKKENDKPFSEPGDSGSLVIGTDSTGQ 355
Query: 397 KPRPIGIIWGGTANRGRLKL 416
K +G+++ G + G +
Sbjct: 356 K-IALGLVFAGNPHFGHTYM 374
>gi|323701635|ref|ZP_08113307.1| hypothetical protein DesniDRAFT_0519 [Desulfotomaculum nigrificans
DSM 574]
gi|333922305|ref|YP_004495885.1| hypothetical protein Desca_0068 [Desulfotomaculum carboxydivorans
CO-1-SRB]
gi|323533408|gb|EGB23275.1| hypothetical protein DesniDRAFT_0519 [Desulfotomaculum nigrificans
DSM 574]
gi|333747866|gb|AEF92973.1| hypothetical protein Desca_0068 [Desulfotomaculum carboxydivorans
CO-1-SRB]
Length = 334
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 132/329 (40%), Gaps = 78/329 (23%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++ T+ PAI+VFVS+K + LS Q +P + G + DV+E
Sbjct: 22 VGVGVGYKHVGMSRTERPAIIVFVSKKEAPENLSREQTVPIKING-----LETDVIEIGE 76
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQ-TGSRQVGFLT 240
E TQ+V R P I G + T GT GA+V+ + TG + + L+
Sbjct: 77 VRFLEE------RTQLV---RPAQPGISIGHY---RITAGTFGAVVRDRHTGEKLI--LS 122
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVY------------------------------LG 270
N H+ + N P L PG Y G
Sbjct: 123 NNHILANATSGNDGRAAIGDPILQPGEYDGGSKDDRIATLLRYIPIQKGEVPATCPVANG 182
Query: 271 AVERATSF-HHRRP---LTFVRADGAF----IPFADDFDMSTVTTSVKGLGEIGDVKIVD 322
A A F H RP L F + GA A +T + GLG +
Sbjct: 183 AARLANMFVHAVRPNYQLKFFKRGGAANIVDCAVARPLRPDLITEEILGLGLV------- 235
Query: 323 LQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYN---DEKGICFLTDFLVVGENQQTFD 379
Q + +G +VVK GR+SG+T GTV A + + D+ +D +V Q
Sbjct: 236 -QGVAEAKLGMKVVKSGRTSGITRGTVTAVGVTLDVKLDDNTSAHFSDQVVTDMKSQG-- 292
Query: 380 LEGDSGSLILMKGENGEKPRPIGIIWGGT 408
GDSGSL+L +G + +G+++ G+
Sbjct: 293 --GDSGSLVLTEGN-----KAVGLLFAGS 314
>gi|331269221|ref|YP_004395713.1| hypothetical protein CbC4_1036 [Clostridium botulinum BKT015925]
gi|329125771|gb|AEB75716.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 302
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 128/299 (42%), Gaps = 58/299 (19%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++I G IP I V VS K+ + P + +P +G DVV+
Sbjct: 24 VGVGLGYKITNGFCKFIPCIKVLVSTKIPPNEIPPNESIPEHFKG-----LITDVVQSGN 78
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
A T K + + GG SIG S + S G++ +V T + L+N
Sbjct: 79 ISASSLTTKAR-------PVLGGY-SIGPSSGIRS----GSMACLV---TDGKHYYILSN 123
Query: 242 RHVAVDLDYPNQKMFHPLP---PTLGPGVYLGA------VERATSFHHRRPLTFVRADGA 292
HV V Y N LP P L PG+ G V + + + +T
Sbjct: 124 NHVLV---YGNV-----LPIGTPVLQPGIEDGGQPLDDKVATLSKYAQLKFITHKETPTN 175
Query: 293 FI--PFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
+I A D S V++ L IG +K + SP+ +G+ V KVGRS+GLTTG +L
Sbjct: 176 YIDCALAQVNDKSLVSSK---LAIIGSIK--GITSPV---LGESVKKVGRSTGLTTGKIL 227
Query: 351 AYA--LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGG 407
+ + N + G C + + + + GDSGSL++ + +G+++ G
Sbjct: 228 SIGSTVSVNFKAGKCLFKNQITTTKMAE----AGDSGSLLVNSSHHA-----VGLLFSG 277
>gi|398353752|ref|YP_006399216.1| hypothetical protein USDA257_c39150 [Sinorhizobium fredii USDA 257]
gi|390129078|gb|AFL52459.1| hypothetical protein USDA257_c39150 [Sinorhizobium fredii USDA 257]
Length = 766
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 82/307 (26%), Positives = 128/307 (41%), Gaps = 65/307 (21%)
Query: 139 PAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAP--EPTPKEQLYTQ 196
P+ILVFV + V K+ L P + +P L P G V V+E AP E K L T
Sbjct: 79 PSILVFVEQWVSKKDLEPGEIVPKTLYLPDGRRVPVCVIE-----APKEEKNEKRPLTTV 133
Query: 197 I-VDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKM 255
V+++ GG P I S Q T+ +V + V LTNRHVA + + +
Sbjct: 134 FPVNNIGGGWPVI---SHNQGQSYAATIACLV---SDGHTVYALTNRHVAGE---AGEII 184
Query: 256 FHPLPPTLGPGVYLGAVER---ATSFHHRRPL------------TFVRADGAFIPFADDF 300
+ L G ER ++ H R L +V D I D
Sbjct: 185 YSRLG---------GKQERIGVSSEKHLTRALFTTHYPGWPGRDVYVNLDVGLI---DID 232
Query: 301 DMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEK 360
++ T ++ +G++G + + + + +LIG+ V G +SGL G + A Y
Sbjct: 233 NLDRWTAEIRDIGQMGKMVDLSVHTISLALIGRDVRGTGAASGLMQGEIAALFYRYKTNG 292
Query: 361 GICFLTDFLVVGE-----NQQTFDLE---GDSGSLILMKGE----------NGEKP---R 399
G ++ D L+ ++ T E GDSG+L L++ + G+KP
Sbjct: 293 GFEYVADLLIGPRPADDGDRNTVPFETHPGDSGTLWLLEPDKNDRSGKSPSKGKKPPDYL 352
Query: 400 PIGIIWG 406
P+ + WG
Sbjct: 353 PLAMQWG 359
>gi|258650626|ref|YP_003199782.1| hypothetical protein Namu_0364 [Nakamurella multipartita DSM 44233]
gi|258553851|gb|ACV76793.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233]
Length = 765
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 128/318 (40%), Gaps = 60/318 (18%)
Query: 154 LSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPT-PKEQL---YTQIVDDLRGGDPSIG 209
L P +PT L P G V V++ EPT P L +T + GG P I
Sbjct: 120 LPPEDMIPTTLYLPDGRTVPVCVIQV------EPTVPDRDLLPAWTWPKSVIGGGFPLI- 172
Query: 210 SGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVA---------------VDLDYPNQK 254
S ++GA+V T V LT+RHVA VD+ +++
Sbjct: 173 --SHTQGTTNVASVGALV---TDGHTVYALTSRHVAGPAGQPIGTILRGQAVDVGRSSER 227
Query: 255 MFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGE 314
LP T VY F R T++ D A + D D ++ T + +G
Sbjct: 228 QLTRLPFT---QVY-------PDFPAHR--TYLTLDAALVEVNDLADWTSQTYGLPPVGA 275
Query: 315 IGDV--KIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVG 372
+ D+ + + +Q LI QV G +SG TG + A + G +TDFL+
Sbjct: 276 LADLSERNIGMQ-----LINAQVTAYGAASGRLTGRIAALFYRHRSMGGYDEITDFLIAP 330
Query: 373 ENQQTFDLEGDSGSL--ILMKGENGEKP----RPIGIIWGGTANRGRLKLKIGQPPENWT 426
+ Q GDSG++ ++ E + P RPI + WGG R P N+
Sbjct: 331 DPGQPSSQPGDSGTVWHLIEPSEQPDDPARRLRPIALQWGGQGVRPADP----GPGYNFA 386
Query: 427 SGVDLGRLLNLLELDLIT 444
L +L LL+++L+
Sbjct: 387 LAAGLTAILRLLDVELVV 404
>gi|170699116|ref|ZP_02890171.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
gi|170135991|gb|EDT04264.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
Length = 313
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 93/217 (42%), Gaps = 31/217 (14%)
Query: 209 GSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVY 268
GS ++ + GTLGAIVK GS LTN HV ++ + P L PGV+
Sbjct: 75 GSSISPGNEASAGTLGAIVKKSDGSLY--GLTNNHVTGGCNHSAIDL-----PILAPGVF 127
Query: 269 LGAVERATSF---HHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQ- 324
A + F H L FV + D+ D + + E DV Q
Sbjct: 128 DVAAKTIIPFTIGFHSEVLPFVTGTAGNVSINDNTDAALFR-----IAEPADVSSRQGQQ 182
Query: 325 --SPISSL---IGKQVVKVGRSSGLTTGTVLAYAL---------EYNDEKGICFLTDFLV 370
+P +S+ +G +V KVGR++G TTG ++ L + N + I + + +
Sbjct: 183 YDTPANSVAPTVGMKVQKVGRTTGHTTGVIVGQQLRPIRVHAQSQRNKFQAIITMPNVYL 242
Query: 371 VGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGG 407
V + + F GDSGSL++ G +GII G
Sbjct: 243 VHGDYRPFSDSGDSGSLVVTNDGTGTN-YAVGIIMSG 278
>gi|414154359|ref|ZP_11410678.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
= DSM 18033]
gi|411454150|emb|CCO08582.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
= DSM 18033]
Length = 335
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 127/331 (38%), Gaps = 81/331 (24%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G + T+ PAI++FV +K Q LS +P + G DV+E
Sbjct: 22 VGVGVGHKYVDMQRTEQPAIIIFVKKKEEPQNLSREHLVPYQING-----LTTDVIEVGE 76
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKS-QTGSRQVGFLT 240
L + +R P + G + T GT GA+V+ QTG R + L+
Sbjct: 77 V--------RLLDEERTKHVRPAQPGLSIGH---YRVTAGTFGAVVRDRQTGERLI--LS 123
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVER-------------------------- 274
N H+ + P L PG Y G
Sbjct: 124 NNHILANATNGKDGRAAIGDPILQPGEYDGGTREDRIATLLRYIPLQKGEAPATCPVANG 183
Query: 275 ATSF-----HHRRP---LTFVRADGAFIPFADDFDMST------VTTSVKGLGEIGDVKI 320
A F H RP L F++ G P D ++ +T + G IG V+
Sbjct: 184 AARFLNIFVHTVRPNYDLRFIKRGGT--PNIVDCAVARPVRPELITDDILG---IGKVQG 238
Query: 321 VDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYN---DEKGICFLTDFLVVGENQQT 377
V+ P G QVVK GR++G+T GTV A D++ + D +V Q
Sbjct: 239 VERAKP-----GMQVVKSGRTTGITRGTVTAVGATMEVKLDDENTAYFADQVVTDMKSQG 293
Query: 378 FDLEGDSGSLILMKGENGEKPRPIGIIWGGT 408
GDSGSL+L ++ R +G+++ G+
Sbjct: 294 ----GDSGSLVL-----NQENRAVGLLFAGS 315
>gi|331271090|ref|YP_004385799.1| hypothetical protein CbC4_6002 [Clostridium botulinum BKT015925]
gi|329127585|gb|AEB77527.1| hypothetical protein CbC4_6002 [Clostridium botulinum BKT015925]
Length = 313
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 126/293 (43%), Gaps = 73/293 (24%)
Query: 120 YSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEF 179
Y +G A+G++IK G +T+ I VFVS+KV L + +P + + DVVE
Sbjct: 34 YVVGIALGYKIKNGFITNKKCIKVFVSKKVPLSNLYEHEVIPKFFK-----CIETDVVES 88
Query: 180 SYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGS-QVASQETYGTLGAIVKSQTGSRQVGF 238
F A E T K + P IG S V++ G+LG +V T R
Sbjct: 89 GEFSAAEFTGKVR-------------PVIGGYSIGVSNVRGVGSLGCLV---TDGRYKYI 132
Query: 239 LTNRHVAVDLDYPNQKMFHPLP---PTLGPGVYLGAVERATSFHHRRPLT-FVRADGAFI 294
L+N HV DL+ +P P + PG+ G +P T V +I
Sbjct: 133 LSNNHVIADLN--------KIPIGTPIIQPGLDDGG----------KPSTDIVALLSKYI 174
Query: 295 PFADDFDMSTVTTSVKGLGEIGDVKIVD--LQSPISSLIG-----------KQVVKVGRS 341
P + + TS + K+++ + SP +++G K V KVGRS
Sbjct: 175 PLKTE----GIITSPTNYTDCAIAKLINESIASPKIAIVGAPEGTMIPIIDKGVRKVGRS 230
Query: 342 SGLTTGTVL----AYALEYNDEKGICFLTDFLVVGENQQTFDLE-GDSGSLIL 389
+ +TTG + + + ++ ++ F + +V T+ E GDSGS++L
Sbjct: 231 TEMTTGRITDIDGTFHIRFDSKR--VFFEEQIV-----TTYMCEDGDSGSILL 276
>gi|253771263|ref|YP_003034130.1| hypothetical protein CLG_A0037 [Clostridium botulinum D str. 1873]
gi|253721415|gb|ACT33707.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 319
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/298 (26%), Positives = 121/298 (40%), Gaps = 55/298 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++ G T I VFV++KV++ L +P +G D V+ Y
Sbjct: 43 VGVGLGYKVTSGFCTFQKCIKVFVTKKVYENELPEADLVPAIYKG-----IITDTVDSGY 97
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F T K + P I S GTLG +V T FL+N
Sbjct: 98 FQPQSLTEKIR-------------PVICGYSLGPVNALGGTLGCLV---TDGFSRFFLSN 141
Query: 242 RHVAVDLDY--PNQKMFHPLPPTLG--PGVYLGAV------ERATSFHHRRPLTFVRADG 291
HV D + N + P G P +G + ER T+F +RP +V
Sbjct: 142 NHVLADFNSLSINTPILQPSANDGGKSPADVVGNLSNFIPLERVTAF--KRPTNYVDC-- 197
Query: 292 AFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA 351
A D S + ++ +G K L S + KVG++S LTTGT+ A
Sbjct: 198 ---AIARLIDKSIASPAIALVGPPKGTKQPQLNSSVK--------KVGKTSELTTGTITA 246
Query: 352 YALEYNDEKGICFLTDFLVVGENQQTFDLE-GDSGSLILMKGENGEKPRPIGIIWGGT 408
+ Y + GI + L + TF + GDSGS +L+ +N +G+I GG+
Sbjct: 247 INVTYTADYGI---KEVLFKNQIVTTFLSQPGDSGS-VLLDNDN----YVLGLIIGGS 296
>gi|410669147|ref|YP_006921518.1| hypothetical protein Tph_c28540 [Thermacetogenium phaeum DSM 12270]
gi|409106894|gb|AFV13019.1| hypothetical protein Tph_c28540 [Thermacetogenium phaeum DSM 12270]
Length = 334
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 150/364 (41%), Gaps = 93/364 (25%)
Query: 122 LGTAIGFRIKRGVL-TDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDV-DVVEF 179
+G IG++ KRG TD AI+ FV +KV + L +C+P + G V DV ++ E
Sbjct: 22 VGMGIGYK-KRGRQDTDELAIIFFVEKKVPAEALGVDECVPKRI---GRVCTDVIEIGEV 77
Query: 180 SYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVAS-QETYGTLGAIVKSQTGSRQVGF 238
+ G E +R P GS + + T GT GA+V+ + + ++
Sbjct: 78 QFLGRTEK-------------MRPAAP----GSSIGHVKVTAGTFGAVVRDRK-TGELMI 119
Query: 239 LTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFAD 298
L+N HV + L PGVY G E H R + R F+
Sbjct: 120 LSNNHVLANATDGLDGRARRGDLILQPGVYDGGSEEDVIGHLERFVPIYR-------FSR 172
Query: 299 DFDMSTVTTSVKGLGEI---------------GDVKIVD--LQSPI--SSLI-------- 331
+ D + SVK + + G +VD L P+ +I
Sbjct: 173 EADCNLAAMSVKAVNAVIHAFRPNYYVRLEKRGASNLVDCALARPVDPKEIIPEIIDIGK 232
Query: 332 ---------GKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLE- 381
G V K GR++G+T G + A + N G TD +V + Q +L+
Sbjct: 233 VNGVAQAEPGMAVKKSGRTTGVTEGKITAVHVTLNVTMGRN--TD-VVRFQEQVMAELKS 289
Query: 382 --GDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLE 439
GDSGSL+L + EN R +G+++ G++ + + P EN +LN LE
Sbjct: 290 QAGDSGSLVLDR-EN----RAVGLLFAGSS-----EYTVFNPIEN---------VLNKLE 330
Query: 440 LDLI 443
+DL+
Sbjct: 331 VDLV 334
>gi|228994928|ref|ZP_04154706.1| hypothetical protein bpmyx0001_55800 [Bacillus pseudomycoides DSM
12442]
gi|228764830|gb|EEM13606.1| hypothetical protein bpmyx0001_55800 [Bacillus pseudomycoides DSM
12442]
Length = 329
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 139/320 (43%), Gaps = 45/320 (14%)
Query: 105 ELMTIRAFHSKIL--RCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPT 162
+L+ I+ + +L + +G +GF+ G TD AI FV++K + + P +P
Sbjct: 7 KLLDIKEANENVLLNKPNVIGVDVGFKYVEGKRTDEIAIRTFVTKK---ENVGPEHEIPR 63
Query: 163 ALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGT 222
+EG + VE P P + T D L GG S+G + GT
Sbjct: 64 TIEGVKTDVIEEKKVELQVLKIPVGAPVLENETGKFDPLVGG-ISVGPCRAINGFIFVGT 122
Query: 223 LGAIVKSQTGSRQVGFLTNRHV-AVDLDYPN-QKMFHPLPPTLG--PGVYLGAVERA--- 275
LGAIV+ + + L+N HV VD ++ + +M P G G +GA++
Sbjct: 123 LGAIVQKE--DNKFYALSNFHVMGVDNNWKSGDEMTQPGRVDGGQCSGDIIGALDSVCLG 180
Query: 276 -TSFHHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQ 334
+P+ D A ++ + + EI + I ++ +S IG
Sbjct: 181 DKINSQNKPV-----DAAI----------SIIKNRRTSPEI--LNIGKVKGKVSPTIGAS 223
Query: 335 VVKVGRSSGLTTGTVLAY----ALEYNDEKGICFLTDFLVVGENQQ---TFDLEGDSGSL 387
V K GR++GLT GT+ +++Y G+ L + + + + F GDSGS+
Sbjct: 224 VRKQGRTTGLTHGTITGLGRTSSIDYGSGIGVVTLKNQITIEPDTTKNPKFSDHGDSGSV 283
Query: 388 ILMKGENGEKPRPIGIIWGG 407
I+ E+ R IG+++GG
Sbjct: 284 IV-----DEQNRVIGLLFGG 298
>gi|333977577|ref|YP_004515522.1| hypothetical protein Desku_0073 [Desulfotomaculum kuznetsovii DSM
6115]
gi|333821058|gb|AEG13721.1| hypothetical protein Desku_0073 [Desulfotomaculum kuznetsovii DSM
6115]
Length = 334
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/338 (23%), Positives = 138/338 (40%), Gaps = 67/338 (19%)
Query: 108 TIRAFHSKILRCYSL-GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEG 166
++ K+LR ++ G +G + G T+ PA+++FV +KV L +Q +P ++G
Sbjct: 7 VLKKSREKLLRLPNVTGVGVGLKQVSGETTNRPALIIFVKKKVPSDGLVRVQQVPAYIDG 66
Query: 167 PGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAI 226
D++E + +L + R P + G S GT GA+
Sbjct: 67 -----LPTDIIEIG---------EVRLLSLRTGKERPAQPGMSIGHYKISA---GTFGAV 109
Query: 227 VKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLG--AVERATSFHHRRPL 284
VK + +++ L+N H+ + P L PG + G A +R + PL
Sbjct: 110 VKDRV-TKEPLILSNNHILANATDGKDGRAAVGDPILQPGPHDGGQAGDRIGTLLRFSPL 168
Query: 285 -------------TFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSP--ISS 329
VRA + + +G G I D + SP I+
Sbjct: 169 LRSIQEAECPVAEALVRAGNLLVRLVRPHYQLKMFQYYRG-GNIIDAAVARPDSPGLIND 227
Query: 330 LI--------------GKQVVKVGRSSGLTTGTVLAYALEY-----NDEKGICFLTDFLV 370
I G+ V+K GR++G++ GTV A + NDEKG + TD +V
Sbjct: 228 EILEIGKVEGVARVDPGQGVMKSGRTTGISEGTVTAVGVTLEVEIGNDEKG--WFTDQVV 285
Query: 371 VGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGT 408
+ + GDSGSL+L + + R +G+++ G+
Sbjct: 286 TDMSSRP----GDSGSLVLDR-----EKRAVGLLFAGS 314
>gi|331270863|ref|YP_004397300.1| hypothetical protein CbC4_5104 [Clostridium botulinum BKT015925]
gi|329127581|gb|AEB77524.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 316
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/305 (22%), Positives = 125/305 (40%), Gaps = 63/305 (20%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++IK G T+ + VFVSRK+ + L+ +P +G DV E
Sbjct: 39 VGVGLGYKIKNGFYTNQLCVQVFVSRKLPQNQLNSNDMIPVIYKG-----IPTDVKETGC 93
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVAS-QETYGTLGAIVKSQTGSRQVGFLT 240
F A K + P +G S A+ + GT+ +V + G + T
Sbjct: 94 FTACSFNKKIR-------------PVLGGYSISANMNKINGTVACLVTN--GVSKFALST 138
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGA---VERATSFHHRRPLTFVR--------A 289
N HV +++ K P + P G + S H P+ F++
Sbjct: 139 N-HVLANINILPMK-----SPIVQPAYLYGGHAPTDTIASLHKYIPIRFIKGHEEPTNST 192
Query: 290 DGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTV 349
D A + + ++ + +G++ VK S + +QV K+G S+ LTTGT+
Sbjct: 193 DCALGLLSKS---NILSDKIALIGKVTCVK--------SPKLNEQVRKIGASTELTTGTI 241
Query: 350 LA----YALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIW 405
+ + + Y+D+K + F L +GDSGS+++ K +G+++
Sbjct: 242 TSINTTFRVNYSDDKRVLFKDQILTTH-----MGADGDSGSILVNKNNCA-----VGLLF 291
Query: 406 GGTAN 410
+ N
Sbjct: 292 SASPN 296
>gi|443289395|ref|ZP_21028489.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
08]
gi|385887548|emb|CCH16563.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
08]
Length = 528
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 59/123 (47%), Gaps = 17/123 (13%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALE-GPGGVWCDVDVVEFSY 181
G A G R G TD PA++V+V RKV +Q+L + LP + GP + +VDVVE
Sbjct: 35 GLAYGRREVSGRRTDEPALVVYVVRKVPRQFLPTTRLLPRRVYFGPD--FVEVDVVETGP 92
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F A E T +E+ P+ S T GTLGA+V T + L+N
Sbjct: 93 FFAQEFTARER-------------PAPNGVSIAHIDVTAGTLGALVTDNTDG-SLCILSN 138
Query: 242 RHV 244
HV
Sbjct: 139 NHV 141
>gi|416350198|ref|ZP_11680813.1| hypothetical protein CBCST_04791 [Clostridium botulinum C str.
Stockholm]
gi|338196357|gb|EGO88555.1| hypothetical protein CBCST_04791 [Clostridium botulinum C str.
Stockholm]
Length = 314
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 74/309 (23%), Positives = 130/309 (42%), Gaps = 58/309 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K G T+ + VFV +K L+ +P+ +G D+ E Y
Sbjct: 37 VGVGLGYKVKNGFYTNQLCVQVFVGKKRTLNELNTNDIIPSIYKG-----IPTDIKETGY 91
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIG--SGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
F A +++ P +G S S S YGT G +V + +G
Sbjct: 92 FKACSFNQRKR-------------PVLGGYSVSANGSDHIYGTAGCLVTNGVNKFVLG-- 136
Query: 240 TNRHVAVDLDY--PNQKMFHPLPPTLGPGVYLG--AVERATSFHHRRPLTFVRADGAFIP 295
TN HV V ++ N K+ P +Y G + + + H PL F++ I
Sbjct: 137 TN-HVLVKINELPINFKILQP------AYIYGGRSSFDTIATLHKYIPLRFIKGQEQPIN 189
Query: 296 FADD----FDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA 351
D S + S + IG V V ++P +G +V KVG ++ LT GT+++
Sbjct: 190 LTDCALGLLTKSNIMDS--NIALIGKVTCV--KNP---KLGTRVKKVGATTELTEGTIIS 242
Query: 352 ----YALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGG 407
+ + Y++ K + F D ++ +EGDSGS+++ K +G+++
Sbjct: 243 INANHTVFYSNGK-VAFFKDQILTSN----MAMEGDSGSILVDKNN-----CALGVLFAA 292
Query: 408 TANRGRLKL 416
N +L
Sbjct: 293 ANNTAYNRL 301
>gi|427382731|ref|ZP_18879451.1| hypothetical protein HMPREF9447_00484 [Bacteroides oleiciplenus YIT
12058]
gi|425729976|gb|EKU92827.1| hypothetical protein HMPREF9447_00484 [Bacteroides oleiciplenus YIT
12058]
Length = 435
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 97/233 (41%), Gaps = 51/233 (21%)
Query: 201 LRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHP-- 258
L+GG I G + GTLG VK + +V LTNRHV V + ++HP
Sbjct: 96 LKGGIQLINYGKGAGT----GTLGCFVKD--ANDRVYGLTNRHVGVSV---GSVLYHPKK 146
Query: 259 LPPTLGPGVY-------------LGAVERATSFHHRRPLTFVRADGAFIPFADDFDMSTV 305
P Y +G+V++ + D A I A D
Sbjct: 147 TPVHCCSEKYCNHDCCIIDVKGNIGSVKKISQL--------TTTDSAIIELATD------ 192
Query: 306 TTSVKGLGEIGDVKIVDLQSPIS--SLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGIC 363
VK EI D+ +V +S I+ L+G+ V K GR++ LTTG + + Y +
Sbjct: 193 ---VKWKNEIVDIGVVKGESTIAPEELLGQTVRKRGRTTCLTTGKI---DICYYESVSSY 246
Query: 364 FLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLKL 416
+ +V+ F GDSGS+++ K + + + ++WGG N G L
Sbjct: 247 QYREQIVIKNEGGIFAQGGDSGSVVVDKDD-----KVLALLWGGMGNDGVCNL 294
>gi|302388636|ref|YP_003824457.1| hypothetical protein Toce_0037 [Thermosediminibacter oceani DSM
16646]
gi|302199264|gb|ADL06834.1| conserved hypothetical protein [Thermosediminibacter oceani DSM
16646]
Length = 334
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 135/343 (39%), Gaps = 79/343 (23%)
Query: 109 IRAFHSKILRCYSL-GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGP 167
+R + K+LR ++ GT +G++I G +T+ PA++V V +K ++ L Q +P L+
Sbjct: 8 LRRYERKLLRLENVVGTGLGYKIIEGRITNEPAVIVLVRKKKPERELPASQVVPKKLD-- 65
Query: 168 GGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIV 227
D++E +L T R P + G + T GT GA+V
Sbjct: 66 ---EVYTDIIEVG---------DVRLLTARTQKTRPAMPGMSIGHY---KITAGTFGAVV 110
Query: 228 KSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGA--------------VE 273
+ Q + L+N HV + P + PG Y G VE
Sbjct: 111 RDQITGEPL-ILSNNHVLANASNGRDGRAAVGDPIMQPGPYDGGGPEDVIAHLYRFIPVE 169
Query: 274 RATSFHHRRPLT---------FVRA-----DGAFIPFADDFDM-----------STVTTS 308
+ + H R P+ FVR AF+ +++ ++
Sbjct: 170 KDVT-HSRCPIARRGENLLNFFVRMIRPDYRVAFMKHRAAYNLVDAAVAKPINPDYISPE 228
Query: 309 VKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGI---CFL 365
+ LGEI + IG +VK GR+SG++ V A ++ G
Sbjct: 229 ILDLGEIRGIA--------EPRIGMTLVKSGRTSGVSKSEVKALNVKIRVMMGAGEEATF 280
Query: 366 TDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGT 408
D ++ G Q GDSGSL+L EN E +G+++ G+
Sbjct: 281 YDQILTGPMAQP----GDSGSLVL--NENMEA---VGLLFAGS 314
>gi|331270818|ref|YP_004397255.1| hypothetical protein CbC4_5058 [Clostridium botulinum BKT015925]
gi|329127536|gb|AEB77479.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 315
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 110/276 (39%), Gaps = 41/276 (14%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G+++K+G T+ + VFVSRK+ L+ +P +G DV E Y
Sbjct: 37 VGIGCGYKVKKGFYTNQLCVQVFVSRKISSNELNSNDIIPLIYKG-----IPTDVKETGY 91
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F TQ V + GG S S + YGT G +V T L+N
Sbjct: 92 FTTCS-------LTQRVRPVLGG----YSISTSMDERIYGTAGCLV---TNGVSKFVLSN 137
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVYLG---AVERATSFHHRRPLTFVRADGAFIPFAD 298
HV N M P P + G + + + H PL F+ +
Sbjct: 138 NHVI-----ANANMLPINSPITQPALKHGGHTSNDTIATLHKYMPLRFINGQQEPTNYT- 191
Query: 299 DFDMSTVTTSVKGLGEIGDV-KIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA----YA 353
D + +T S EI + K + +++P + V KVG SGLT G +++ +
Sbjct: 192 DCALGLLTKSNIMSSEIALIGKPICVKNP---KLNTHVRKVGAISGLTEGDIISVDATFR 248
Query: 354 LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
Y + K C D ++ Q GDSG++++
Sbjct: 249 SNYPNNKR-CLFKDQIITTPMAQ----NGDSGAILV 279
>gi|399021530|ref|ZP_10723627.1| hypothetical protein PMI16_04605 [Herbaspirillum sp. CF444]
gi|398091303|gb|EJL81750.1| hypothetical protein PMI16_04605 [Herbaspirillum sp. CF444]
Length = 351
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/182 (24%), Positives = 78/182 (42%), Gaps = 30/182 (16%)
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFA 297
L+N HV D + ++ PG + + + H P + A F+P A
Sbjct: 157 MLSNNHVLADCN------------SVAPGTVI--TQPSIEDHGNDPADVIGALSYFVPLA 202
Query: 298 DDFDMSTVTTSVKGLG--------EIGDVKIVDLQSPISS-LIGKQVVKVGRSSGLTTGT 348
S V ++ E G+ K+ + +P+++ +G +V K GR++G+T G
Sbjct: 203 APGGTSPVDAAIAAFDDTKNDPRMERGENKVEKMVAPVTAPYVGMEVQKSGRTTGVTKGK 262
Query: 349 VLAYALEYNDE---KGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIW 405
V A AL + G+ + + V F L GDSGS+I +N P+G+++
Sbjct: 263 VTAIALTIATDYAGYGVVTIQNTFSVKHVSGYFSLPGDSGSVITTASQN----NPVGLLF 318
Query: 406 GG 407
G
Sbjct: 319 AG 320
>gi|168041453|ref|XP_001773206.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675565|gb|EDQ62059.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 188
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 29/38 (76%)
Query: 376 QTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGR 413
+ F+L DS SLIL++ E GE+PR +G++WGG A+ GR
Sbjct: 49 RAFELGSDSQSLILVREEAGERPRLVGVVWGGCASNGR 86
>gi|433609843|ref|YP_007042212.1| hypothetical protein BN6_81220 [Saccharothrix espanaensis DSM
44229]
gi|407887696|emb|CCH35339.1| hypothetical protein BN6_81220 [Saccharothrix espanaensis DSM
44229]
Length = 318
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 89/343 (25%), Positives = 141/343 (41%), Gaps = 70/343 (20%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVH-KQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
+G IG + G T +P+I+V+V RK + Q+ P P L P DVVE +
Sbjct: 27 VGVDIGHKAVGGRCTGVPSIVVYVRRKGNAAQFTIP----PDVLGIP------TDVVEDT 76
Query: 181 YF-----GAPEPTPKEQLYTQIVDDLRGGDPSIGSG---SQVASQETY---GTLGAIVKS 229
+F +PE + + ++ + G PS V + Y GTLGA+V
Sbjct: 77 FFPHHTLASPEGVSGAERHELLIGGI-GVGPSRAVRFVPPDVPEADDYLVAGTLGALVTP 135
Query: 230 QTGSRQVGFLTNRHVAV--DLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFV 287
+ R + LT H+A D M HP R H R V
Sbjct: 136 RAKRRTMA-LTAFHIACVDDAWAVGDPMVHP--------------SRVDGGHPYRDQIGV 180
Query: 288 RADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTG 347
A A D + + T+ + E+ + +V Q +L+G+ V K GR++ LT G
Sbjct: 181 LARAALSGTVD--AAAILLTTPRSRAEVAGIGLVAGQG--EALVGQHVRKRGRTTALTAG 236
Query: 348 TV----LAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGI 403
V A L++ G+ L D + V + F GDSG+++L + R +G+
Sbjct: 237 VVASTDAAITLDFGTGLGVRTLRDQIRV---EGPFADHGDSGAVLL-----DDANRVVGL 288
Query: 404 IWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTD 446
GG+ +RG P N +L+ L++DL+T +
Sbjct: 289 YCGGSRDRG-----FANPIAN---------VLDQLDVDLLTVE 317
>gi|327401310|ref|YP_004342149.1| hypothetical protein Arcve_1431 [Archaeoglobus veneficus SNP6]
gi|327316818|gb|AEA47434.1| hypothetical protein Arcve_1431 [Archaeoglobus veneficus SNP6]
Length = 345
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 120/294 (40%), Gaps = 49/294 (16%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G IG+R++ +T I VFV++K+ K L+ + +P L+G DV+E
Sbjct: 69 VGVGIGYRVREYKVTPELCIQVFVTKKLRKDMLTERELVPQDLDG-----IRTDVIE--- 120
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
G E + +Y P+ S + T GT G IV+ + L+N
Sbjct: 121 TGVIEALTYKSMYR----------PAFPGCSIGHYRITAGTFGCIVQDKK-DHDFLILSN 169
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRR--PLT--FVRADGAFIPFA 297
HV + + N P L PG Y G +R ++ PL + D A A
Sbjct: 170 NHVLANSNNANIG-----DPILQPGPYDGGTQRNIIAKLKKFVPLLSGYNLVDAA---VA 221
Query: 298 DDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAY--ALE 355
DM V S+ +G V+ L G +V K GR++ G +++ ++
Sbjct: 222 KPLDMRYVKASIAKIGMPTGVR--------EPLHGLRVQKTGRTTQYNRGRIISTDATVK 273
Query: 356 YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
G+ +L ++ GDSGSL+L G R +G+++ G++
Sbjct: 274 VGYGPGVTYLFKNQILTTRMAA---GGDSGSLLL-----GMCKRAVGLLFAGSS 319
>gi|425465752|ref|ZP_18845059.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389831923|emb|CCI24872.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 321
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 22/198 (11%)
Query: 219 TYGTLGAIVKSQTGS-RQVGFLTNRHVAVDLDYP--NQKMFHPLPPTLGPGVYLGAVERA 275
T GTLG +VK G ++ L+N HV D + + + P G +
Sbjct: 123 TAGTLGCLVKKTAGDDNEIFILSNNHVLADSNQAQIDDNIIEPGKLDQGTEPIAKLTDFE 182
Query: 276 TSFHHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQV 335
T F +P F+ A A + +D S +T IG+V+ Q P++S + + V
Sbjct: 183 TIFLDDKP-NFIDAAIAKVINNNDVRPSILT--------IGNVQ----QPPMTSALYQSV 229
Query: 336 VKVGRSSGLTTGTVLAYALEYNDEKG--ICFLTDFLVVGENQQTFDLEGDSGSLILMKGE 393
K GR++G T G ++ A + G I D L + F GDSGSLI+
Sbjct: 230 RKHGRTTGHTIGVIMDIAADVRVRFGQKIANFEDQLAIQGVNGLFSQGGDSGSLIV---- 285
Query: 394 NGEKPRPIGIIWGGTANR 411
+ RP+G+++ G N+
Sbjct: 286 DAMTRRPVGLLFAGGGNQ 303
>gi|166366703|ref|YP_001658976.1| hypothetical protein MAE_39620 [Microcystis aeruginosa NIES-843]
gi|440756156|ref|ZP_20935357.1| hypothetical protein O53_4564 [Microcystis aeruginosa TAIHU98]
gi|166089076|dbj|BAG03784.1| hypothetical protein MAE_39620 [Microcystis aeruginosa NIES-843]
gi|440173378|gb|ELP52836.1| hypothetical protein O53_4564 [Microcystis aeruginosa TAIHU98]
Length = 321
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 22/198 (11%)
Query: 219 TYGTLGAIVKSQTGS-RQVGFLTNRHVAVDLDYP--NQKMFHPLPPTLGPGVYLGAVERA 275
T GTLG +VK G ++ L+N HV D + + + P G +
Sbjct: 123 TAGTLGCLVKKTAGDDNEIFILSNNHVLADSNQAQIDDNIIEPGKLDQGTEPIAKLTDFE 182
Query: 276 TSFHHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQV 335
T F +P F+ A A + +D S +T IG+V+ Q P++S + + V
Sbjct: 183 TIFLDDKP-NFIDAAIAKVINNNDVRPSILT--------IGNVQ----QPPMTSALYQSV 229
Query: 336 VKVGRSSGLTTGTVLAYALEYNDEKG--ICFLTDFLVVGENQQTFDLEGDSGSLILMKGE 393
K GR++G T G ++ A + G I D L + F GDSGSLI+
Sbjct: 230 RKHGRTTGHTIGVIMDIAADVRVRFGQKIANFEDQLAIQGVNGLFSQGGDSGSLIV---- 285
Query: 394 NGEKPRPIGIIWGGTANR 411
+ RP+G+++ G N+
Sbjct: 286 DAMTRRPVGLLFAGGGNQ 303
>gi|398802706|ref|ZP_10561909.1| S1/P1 Nuclease [Polaromonas sp. CF318]
gi|398098944|gb|EJL89217.1| S1/P1 Nuclease [Polaromonas sp. CF318]
Length = 757
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 53/231 (22%), Positives = 92/231 (39%), Gaps = 45/231 (19%)
Query: 239 LTNRHVA---------------VDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRP 283
LTNRHV V++ + +++ LP T E SF ++
Sbjct: 179 LTNRHVCGEPGEPVHARLRGEEVEVGHASERQLTRLPFT----------EVYPSFAGKQ- 227
Query: 284 LTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSG 343
T++ D + D D T+SV G+GEIG + ++ Q+ LI V G +SG
Sbjct: 228 -TYLNLDVGLVEVDDARDW---TSSVYGIGEIGALADLNEQNLGLQLIDHPVSAFGAASG 283
Query: 344 LTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKP----- 398
G + A Y G ++ D L+ ++ GDSG++ +K E +
Sbjct: 284 HLEGRIKALFYRYKSVGGYDYVADLLIAPQDPAHQTQPGDSGTVWHLKAEEEKDSKGVPG 343
Query: 399 ----RPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITT 445
RP+ + WG + N+ +L + LL+++L++
Sbjct: 344 KVSYRPLAVEWGAQT------FSVDGGAYNFALATNLSNVCKLLDVELVSA 388
>gi|253771282|ref|YP_003034117.1| hypothetical protein CLG_A0023 [Clostridium botulinum D str. 1873]
gi|253721434|gb|ACT33726.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 318
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 113/277 (40%), Gaps = 43/277 (15%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K G T+ + VFVSRK K L+ +P +G DV E Y
Sbjct: 37 VGLGLGYKVKNGFYTNQLCVQVFVSRKFPKNQLNSNDIIPLIYKG-----IQTDVKETGY 91
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F A L +I L G S Q++ GT G +V T L+
Sbjct: 92 FKACF------LNKRIRPVLGGYSISTNMNDQIS-----GTAGCVV---TNGVSKFVLST 137
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPG-VYLGA--VERATSFHHRRPLTFVRADGAFIPFAD 298
HV +L+ M P + P +Y G + + H PL F++ + D
Sbjct: 138 NHVLANLN-----MLPMKTPIIQPAYIYRGHTPTDTIATLHKFIPLRFIKREEQPTNLTD 192
Query: 299 DFDMSTVTTSVK--GLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA----Y 352
V T + + IG KI ++SP +G V KVG +S LT GT+ + +
Sbjct: 193 CALGLLVKTDIMSDNIAFIG--KITCVKSP---KLGSHVRKVGETSELTQGTITSINATF 247
Query: 353 ALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
+ Y K + D ++ Q GDSGS+++
Sbjct: 248 TVGYITGK-VALFKDQIITTHMAQ----NGDSGSILV 279
>gi|253771303|ref|YP_003034113.1| hypothetical protein CLG_A0019 [Clostridium botulinum D str. 1873]
gi|253721455|gb|ACT33747.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 314
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 74/309 (23%), Positives = 129/309 (41%), Gaps = 58/309 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K G T+ + VFV +K L+ +P+ +G D+ E Y
Sbjct: 37 VGLGLGYKVKNGFYTNQLCVQVFVGKKRTLNELNTNDIIPSIYKG-----IPTDIKETGY 91
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIG--SGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
F A +++ P +G S S S YGT G +V + +G
Sbjct: 92 FKACSFNQRKR-------------PVLGGYSVSANGSDHIYGTAGCLVTNGVNKFVLG-- 136
Query: 240 TNRHVAVDLDY--PNQKMFHPLPPTLGPGVYLG--AVERATSFHHRRPLTFVRADGAFIP 295
TN HV V ++ N K+ P +Y G + + + H PL F++ I
Sbjct: 137 TN-HVLVKINELPINFKILQP------AYIYGGRSSFDTIATLHKYIPLRFIKGQEQPIN 189
Query: 296 FADD----FDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA 351
D S + S + IG V V ++P +G +V KVG ++ LT GT+ +
Sbjct: 190 LTDCALGLLTKSNIMDS--NIALIGKVTCV--KNP---KLGTRVKKVGATTELTEGTITS 242
Query: 352 ----YALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGG 407
+ + Y++ K + F D ++ +EGDSGS+++ K +G+++
Sbjct: 243 INANHTVFYSNGK-VAFFKDQILTSN----MAMEGDSGSILVDKNN-----CALGVLFAA 292
Query: 408 TANRGRLKL 416
N +L
Sbjct: 293 ANNTAYNRL 301
>gi|334338755|ref|YP_004543735.1| hypothetical protein [Desulfotomaculum ruminis DSM 2154]
gi|334090109|gb|AEG58449.1| hypothetical protein Desru_0150 [Desulfotomaculum ruminis DSM 2154]
Length = 334
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 132/332 (39%), Gaps = 84/332 (25%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++ T+ PAI+VFV +K + LS +P + G + DV+E
Sbjct: 22 VGVGVGYKHVGLERTERPAIIVFVKKKETSENLSRENLVPYKING-----LETDVIEIGE 76
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQ-TGSRQVGFLT 240
+L ++ +R P + G + T GT GA+V+ + TG + + L+
Sbjct: 77 V---------RLLSERTQVIRPAQPGVSIGHY---RITAGTFGAVVRDRDTGEKLI--LS 122
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFADDF 300
N H+ + N P L PG Y G + R T +R +IP
Sbjct: 123 NNHILANASNGNDGRAAVGDPILQPGEYDGGTK------DNRIATLLR----YIPLQKGE 172
Query: 301 DMST--VTTSVKGLGEI--------GDVK---------IVD--LQSPISSLI-------- 331
++T V L I D++ +VD + P+ +
Sbjct: 173 SLATCPVANVAARLANILVHTLRPNYDLRFFKRGRAENLVDCAVARPVRENVIFEEVLGI 232
Query: 332 -----------GKQVVKVGRSSGLTTGTVLAYA----LEYNDEKGICFLTDFLVVGENQQ 376
G VVK GR++G+T GTV A ++ +DE F + ++Q
Sbjct: 233 GRIEGLAEARPGMPVVKSGRTTGITKGTVTAVGATLEVKLDDESTAHFSGQVVTNMKSQG 292
Query: 377 TFDLEGDSGSLILMKGENGEKPRPIGIIWGGT 408
GDSGSL+L +G R +G+++ G+
Sbjct: 293 -----GDSGSLVLTEGN-----RAVGLLFAGS 314
>gi|416350197|ref|ZP_11680812.1| hypothetical protein CBCST_04786 [Clostridium botulinum C str.
Stockholm]
gi|338196356|gb|EGO88554.1| hypothetical protein CBCST_04786 [Clostridium botulinum C str.
Stockholm]
Length = 310
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 121/282 (42%), Gaps = 53/282 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K G T+ + VFVSRK ++ ++ +P+ +G DV E Y
Sbjct: 32 VGIGLGYKVKNGFYTNQLCVQVFVSRKYYENDININDKIPSMYKG-----IPTDVKETGY 86
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIG--SGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
F A K++ P +G S S + + GT G +VKS GS Q
Sbjct: 87 FRACSFRGKKR-------------PVLGGYSISGNMNSKNSGTAGCLVKS--GSAQFLLG 131
Query: 240 TNRHVAVDLDYPNQKMFHPL-PPTLGPGVYLGA---VERATSFHHRRPLTFVRADGAFIP 295
TN HV V+L+ P+ P + P + G + + H PL F++ I
Sbjct: 132 TN-HVIVNLN------MEPIAAPIVQPSLEYGGYTPTDTVATVHKFIPLRFIQGRDRPIN 184
Query: 296 FADDF--DMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL--- 350
D ++ + IG +K V ++P +G V KVG ++ LT GT+
Sbjct: 185 LTDCALGLLTKPNIMSNKIALIGKLKCV--KNP---KLGAHVKKVGETTELTEGTITSVN 239
Query: 351 -AYALEYNDEKGICFLTDFL--VVGENQQTFDLEGDSGSLIL 389
++ Y +++ F L +GE GDSGS+++
Sbjct: 240 ASFIAAYENDELALFKDQVLTSAMGE-------AGDSGSILV 274
>gi|331271149|ref|YP_004385858.1| hypothetical protein CbC4_6065 [Clostridium botulinum BKT015925]
gi|329127644|gb|AEB77586.1| hypothetical protein CbC4_6065 [Clostridium botulinum BKT015925]
Length = 320
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 121/303 (39%), Gaps = 63/303 (20%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K G T I VFV++KV L+P +P +G D+V Y
Sbjct: 44 VGVGLGYKVKNGFCTCQKCIKVFVTKKVSSNELTPSDLVPPIYKG-----LMTDIVNCGY 98
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F + TQ + + G SIG + + GTLG +V T L+N
Sbjct: 99 F-------QPHSLTQRIRPVICGY-SIGPINFLG-----GTLGCLV---TDGFSRFMLSN 142
Query: 242 RHVAVDLDYPNQKMFHPLP---PTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFAD 298
HV + F+ P P L P G + P V F+P
Sbjct: 143 NHVLAN--------FNSFPINTPILQPSSNDGG---------KAPADVVANLTKFVPLNR 185
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIG-----------KQVVKVGRSSGLTTG 347
T V I + + SP +L+G V KVG++S LTTG
Sbjct: 186 VTAFRKPTNYVD--AAIARLTNKSIASPAIALVGPPKGTSPPQLNHHVKKVGKTSELTTG 243
Query: 348 TVLAYALEYNDEKGICFLTDFLVVGENQQTFDLE-GDSGSLILMKGENGEKPRPIGIIWG 406
T+ A + Y + GI + L + TF + GDSG+ +L+ +N +G+I G
Sbjct: 244 TITAINVTYTADYGI---KEVLFKNQIVTTFLSQPGDSGA-VLLDNDN----YVLGLIIG 295
Query: 407 GTA 409
G++
Sbjct: 296 GSS 298
>gi|331270132|ref|YP_004396624.1| hypothetical protein CbC4_1955 [Clostridium botulinum BKT015925]
gi|329126682|gb|AEB76627.1| hypothetical protein CbC4_1955 [Clostridium botulinum BKT015925]
Length = 322
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 138/342 (40%), Gaps = 75/342 (21%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
G +G++ G T I VFVS+K+ ++ +P + DVVE F
Sbjct: 30 GIGLGYKKINGKCTFRKCIRVFVSKKLPSNDIAKEDLIPAYFN-----YIPTDVVESGVF 84
Query: 183 ------GAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQV 236
G PT Q I G IG YGTLG +VK++ + V
Sbjct: 85 TTCALNGRIRPT---QCGYSI------GPVGIG---------IYGTLGCLVKNKR-EKAV 125
Query: 237 GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAF--- 293
L+ HV P +KM P + PGV G R + T ++ G F
Sbjct: 126 YLLSASHVL----NPLEKMSFG-TPIVQPGVLDGGNIRNDVIANLVRSTNIKYIGTFSKP 180
Query: 294 -----IPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGT 348
A D+S V+T++ +G+ D++ S IG++V KVGR++G T G
Sbjct: 181 ENTVDAAVAKVSDISLVSTTMAIVGK-------DVKQIASPKIGEKVFKVGRTTGYTEGE 233
Query: 349 VLAYALEYNDEKGICFLTDFLVVGENQQTFDL---EGDSGSLILMKGENGEKPRPIGIIW 405
+ D I + + + Q D+ +GDSGS++L E PIG++
Sbjct: 234 ITE-----TDVTQIINSSGKKALFKGQIAADVKSDKGDSGSVLL-----NENMNPIGLLM 283
Query: 406 GGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTDE 447
G + + ++ D+ ++ + L +++ITT E
Sbjct: 284 GASQS------------TVYSVFNDMKKVTSALNVEIITTSE 313
>gi|190891805|ref|YP_001978347.1| hypothetical protein RHECIAT_CH0002212 [Rhizobium etli CIAT 652]
gi|190697084|gb|ACE91169.1| hypothetical protein RHECIAT_CH0002212 [Rhizobium etli CIAT 652]
Length = 783
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 70/155 (45%), Gaps = 18/155 (11%)
Query: 301 DMSTVTTSVKGLGEIGDVKIVDLQS-PISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDE 359
DM T+++ GL +I + V Q+ + L+ + VV VG +SGL G + A Y
Sbjct: 244 DMRDWTSNIYGLPKIKPLFDVYEQNLSLRRLMDQPVVAVGGASGLLQGKIKAMFYRYRSV 303
Query: 360 KGICFLTDFLVVGENQQTFDLEGDSGSL--ILMKGENG---EKP------RPIGIIWGGT 408
G +++DFL+ GDSG+L + M G +G E+P RP+ I WG
Sbjct: 304 GGFDYVSDFLIAPIPGGKVPRHGDSGALWHVQMPGPDGKQDERPLAQRDLRPLAIEWGAQ 363
Query: 409 ANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLI 443
G ++ L + LL+++L+
Sbjct: 364 V------FADGGERSTYSVASSLSNICKLLDVELV 392
>gi|420256689|ref|ZP_14759520.1| hypothetical protein PMI06_09988 [Burkholderia sp. BT03]
gi|398042752|gb|EJL35726.1| hypothetical protein PMI06_09988 [Burkholderia sp. BT03]
Length = 749
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 86/372 (23%), Positives = 135/372 (36%), Gaps = 73/372 (19%)
Query: 100 ATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKV------HKQW 153
A T ++ +R F + +R YS PA++V V V H +
Sbjct: 62 AETRVKAKGVRTFDNSEVRPYSW----------------PAVIVLVRDWVDTTEFGHGK- 104
Query: 154 LSPIQCLPTALEGPGGVWCDVDVVEFS----YFGAPEPTPKEQLYTQIVDDLRGGDPSIG 209
+ P +P L P G V VV GAP Y + GG P I
Sbjct: 105 VDPDHMVPRTLYMPDGRAVPVCVVAVEPTVPAAGAPADARWPSTY------IGGGCPLIA 158
Query: 210 SGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYL 269
+ E ++G +V T LTNRHV + P + + +G
Sbjct: 159 DAQGI---ERTASVGCLV---TDGHTTYALTNRHVCGEPGSPVKALLRGAVAEVGI---- 208
Query: 270 GAVERATSFHHRRPLT-----------FVRADGAFIPFADDFDMSTVTTSVKG-LGEIGD 317
A +R + R P T F+ D I D D S+ ++G +G + D
Sbjct: 209 -ASDRQLT---REPFTVVFPEFAGSRSFLTLDIGLIEVHDANDWSSQPFGIEGGIGNVAD 264
Query: 318 VKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT 377
+ + L LI + V G +SG GT+ A + G +++ FL+ N
Sbjct: 265 INELSLSL---QLIDQPVTAFGSASGALDGTIKALFYRHKSLAGYDYVSQFLIAPANGSP 321
Query: 378 FDLEGDSGSLILM------KGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDL 431
GDSG+L + G+ + P+ I WGG + ++ N+ L
Sbjct: 322 QTQPGDSGTLWYLTSAASTAGDGERRLTPLAIEWGGQSLASDDGARL-----NYALATGL 376
Query: 432 GRLLNLLELDLI 443
LL++DL+
Sbjct: 377 STACQLLDVDLV 388
>gi|395448531|ref|YP_006388784.1| hypothetical protein YSA_09065 [Pseudomonas putida ND6]
gi|388562528|gb|AFK71669.1| hypothetical protein YSA_09065 [Pseudomonas putida ND6]
Length = 409
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 111/262 (42%), Gaps = 49/262 (18%)
Query: 177 VEFSYFGAP--EPTPKEQLYTQIVDDLRGGDP-------SIGSGSQVASQETY--GTLGA 225
V+FSY G E P ++ V G P I GS V + + + GTLG
Sbjct: 131 VDFSYIGKTTIETNPPPAPFSAAV-----GAPIWFTHSDRISCGSSVTTSQVFDAGTLGF 185
Query: 226 IVKSQTGSRQVGFLTNRHVAVDLDYPNQKM--FHPLP----PTLGPGVYLGAVERATSFH 279
+ + G R VGF +N HV + ++ M P P P P V +G +
Sbjct: 186 LARLADG-RLVGF-SNNHVTGECNHTPHGMHILSPSPMDASPASPPPVAIGTHFALAPLN 243
Query: 280 HRRP--LTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSL-IGKQVV 336
P +T D A + +S S++G G D S +L G +V
Sbjct: 244 SGDPNQITLQETDAAIFLVTEPDKVS----SMQGNG------FYDTPSETVALRAGLRVK 293
Query: 337 KVGRSSGLTTGTVLA-----YALEY--NDEKGICFLTDFLVV-GENQQTFDLEGDSGSLI 388
KVGR++GL GTVL + L Y N + I + + V G+ TF GDSGSL+
Sbjct: 294 KVGRTTGLRAGTVLGQMVAPFYLPYKSNRFQSIVYFSGVWAVQGDGGNTFSEGGDSGSLV 353
Query: 389 LMKGENGEKPRPIGIIWGGTAN 410
+ E+G R +G+++ G N
Sbjct: 354 VT--EDGT--RSVGVVFAGGNN 371
>gi|390573926|ref|ZP_10254079.1| hypothetical protein WQE_35945 [Burkholderia terrae BS001]
gi|389934138|gb|EIM96113.1| hypothetical protein WQE_35945 [Burkholderia terrae BS001]
Length = 833
Score = 47.4 bits (111), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 134/367 (36%), Gaps = 63/367 (17%)
Query: 100 ATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKV------HKQW 153
A T ++ +R F + +R YS PA++V V V H +
Sbjct: 146 AETRVKAKGVRTFDNSEVRPYSW----------------PAVIVLVRDWVDTTEFGHGK- 188
Query: 154 LSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQ 213
+ P +P L P G V VV P + + + GG P I
Sbjct: 189 VDPDHMVPRTLYMPDGRAVPVCVVAVEPTVPAASAPADARWPSTY--IGGGCPLIADAQG 246
Query: 214 VASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVE 273
+ E ++G +V T LTNRHV + P + + +G A +
Sbjct: 247 I---ERTASVGCLV---TDGHTTYALTNRHVCGEPGSPVKALLRGAVAEVGI-----ASD 295
Query: 274 RATSFHHRRPLT-----------FVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVD 322
R + R P T F+ D I D D S+ ++G IG+V ++
Sbjct: 296 RQLT---REPFTVVFPEFAGSRSFLTLDIGLIEVHDANDWSSQPFGIEG--SIGNVADIN 350
Query: 323 LQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEG 382
S LI + + G +SG GT+ A + G +++ FL+ N G
Sbjct: 351 ELSLSLQLIDQPLTAFGSASGALDGTIKALFYRHKSLAGYDYVSQFLIAPANGSPQTQPG 410
Query: 383 DSGSLILM------KGENGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLN 436
DSG+L + G+ + P+ I WGG + ++ N+ L
Sbjct: 411 DSGTLWYLTSPANTTGDGERRLTPLAIEWGGQSLASDDGERL-----NYALATGLSTACQ 465
Query: 437 LLELDLI 443
LL++DL+
Sbjct: 466 LLDVDLV 472
>gi|331269488|ref|YP_004395980.1| hypothetical protein CbC4_1303 [Clostridium botulinum BKT015925]
gi|329126038|gb|AEB75983.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 312
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 72/283 (25%), Positives = 113/283 (39%), Gaps = 46/283 (16%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++ +G T I VFV+RK+ L+P Q +PT +G D+ +
Sbjct: 34 VGVGLGYKVTKGFYTKDKCIKVFVTRKLPNNQLAPQQLIPTIYKG-----IKTDIFQSGK 88
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDP--SIGSGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
T K V + GG ++G+GS GTLG +V T + L
Sbjct: 89 LETRSLTNK-------VRPIIGGYSIGAVGAGST-------GTLGCLV---TKNNDYFIL 131
Query: 240 TNRHVAVDLDYPNQKMFHPL-PPTLGPGVYLGA---VERATSFHHRRPLTFVRADGAFIP 295
+N HV PL P L PG+ ++ P+ F + I
Sbjct: 132 SNNHVIARWGT------VPLNTPILQPGIQDKGNPKTDKVAVLSEYVPIKFQSVFSSPIN 185
Query: 296 FADDFDMSTVTTSV--KGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYA 353
+ D + S+ + IG +S I + +V KVGR++ LT GTV+A
Sbjct: 186 YVDCAIAKVINKSIASSAIAFIGKP-----ESTIVPRLNAKVQKVGRTTELTIGTVIAIN 240
Query: 354 LEYNDEKGICFLTDFLVVGENQQTFDLE--GDSGSLILMKGEN 394
+ IC T +E GDSGS++L + +N
Sbjct: 241 CTV---EVICPNNKIAKYKNQISTTAMEKIGDSGSVLLDENKN 280
>gi|422630026|ref|ZP_16695226.1| hypothetical protein PSYPI_09900 [Pseudomonas syringae pv. pisi
str. 1704B]
gi|330939286|gb|EGH42683.1| hypothetical protein PSYPI_09900 [Pseudomonas syringae pv. pisi
str. 1704B]
Length = 339
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 123/293 (41%), Gaps = 52/293 (17%)
Query: 141 ILVFVSRKVHKQWLSPIQCLPT-----ALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYT 195
I ++ RKV K+ L Q LP+ + P G+ V + K Q T
Sbjct: 39 ISIYTKRKVIKKDL---QVLPSNIWRQGIAYPQGLMDSVG----------KEATKPQGAT 85
Query: 196 QIVDDLRGGDPSIGSGSQVA--SQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDY--P 251
+ + GG + GS ++ + + GT+GA+V+ G + LTN HV+ + P
Sbjct: 86 FALHQIAGGHATYACGSSISPGNDASAGTMGALVRLPDG--LLYGLTNNHVSALCSHVAP 143
Query: 252 NQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFADDFDMSTV------ 305
N + P +GP A+ T H R L + F+++ D +
Sbjct: 144 NTPILAPGVLDVGPN----AIAPFTLGFHSRALEMRVGSLGNVDFSNNLDAAVFRIADEA 199
Query: 306 -TTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA----------YAL 354
+S++G + ++D P+ G +V KVGR++ T G +++ +A
Sbjct: 200 NVSSMQGGAYDTPLVVLD---PVE---GMRVQKVGRTTRHTQGQIVSRELRPLNVSYHAQ 253
Query: 355 EYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGG 407
Y I F F + G+N + F GDSGSLI+ + G +G+I+ G
Sbjct: 254 SYGFNGMIWFGNVFAIHGDNAE-FSKGGDSGSLIVAVDDAGLVLGAVGLIFAG 305
>gi|253771298|ref|YP_003034114.1| hypothetical protein CLG_A0020 [Clostridium botulinum D str. 1873]
gi|253721450|gb|ACT33742.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 310
Score = 46.6 bits (109), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 120/282 (42%), Gaps = 53/282 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K G T+ + VFVS+K + ++ +P+ +G DV E Y
Sbjct: 32 VGIGLGYKVKNGFYTNQLCVQVFVSKKYSENDININDKIPSMYKG-----IPTDVKETGY 86
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIG--SGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
F A K++ P +G S S + + GT G +VKS GS Q
Sbjct: 87 FRACSFRGKKR-------------PVLGGYSISGNMNSKNSGTAGCLVKS--GSAQFLLG 131
Query: 240 TNRHVAVDLDYPNQKMFHPL-PPTLGPGVYLGA---VERATSFHHRRPLTFVRADGAFIP 295
TN HV V+L+ P+ P + P + G + + H PL F++ I
Sbjct: 132 TN-HVIVNLN------MEPIAAPIVQPSLEYGGYTPTDTVATVHKFIPLRFIQGRDRPIN 184
Query: 296 FADDF--DMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL--- 350
D ++ + IG +K V +SP +G V KVG ++ LT GT+
Sbjct: 185 LTDCALGLLTKPNIMSNKIALIGKLKCV--KSP---KLGAHVKKVGETTELTEGTITSVN 239
Query: 351 -AYALEYNDEKGICFLTDFLV--VGENQQTFDLEGDSGSLIL 389
++ Y +++ F L +GE GDSGS+++
Sbjct: 240 ASFIAAYENDELALFKDQVLTSAMGE-------AGDSGSILV 274
>gi|253680830|ref|ZP_04861633.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562679|gb|EES92125.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 325
Score = 46.6 bits (109), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 80/298 (26%), Positives = 129/298 (43%), Gaps = 55/298 (18%)
Query: 126 IGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAP 185
+G++ +G+LT+ I VFVS+K+ L +P G DVV+ F +
Sbjct: 50 LGYKEIQGILTNEKCIKVFVSQKISSNNLPSADLIPPIYNG-----IKTDVVKSGIFTSC 104
Query: 186 EPTPKEQLYTQIVDDLRGGDPSIG-SGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHV 244
T K + + G SIG +G ++A GTLG IV++ + R L HV
Sbjct: 105 GLTEK-------IRPVPNGY-SIGPAGYKMA-----GTLGCIVQNPS-ERAYYILGTNHV 150
Query: 245 AVDLDYPNQKMFHPLPPTLGPGVYLGA------VERATSFHHRRPLTFVRADGAFI--PF 296
L K+ P+ L PGV G + T + + TF + +I
Sbjct: 151 LAQLG--KAKISTPI---LQPGVLDGGSVNTDIIANLTKYIPIKFKTFFKTPENYIDAAI 205
Query: 297 ADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAY---- 352
A+ ++S V+ V I + K D+ P IG++V KVGR++G TTG + +
Sbjct: 206 AEISNISLVSPKV----AIINNKFKDIGIP---EIGQEVFKVGRTTGYTTGRITSIDATA 258
Query: 353 ALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
++Y D G D ++ + GDSGS++ K N P+G++ + N
Sbjct: 259 IIKYPD--GTALFKDQILASTEVKV----GDSGSILATKNLN-----PLGMLSSASEN 305
>gi|253682715|ref|ZP_04863512.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562427|gb|EES91879.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 318
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 121/288 (42%), Gaps = 67/288 (23%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G IG+++++ VLT I VF S K+ L +P+ +G DV+E
Sbjct: 41 VGVGIGYKVQKEVLTSEKCIAVFASEKIPNNELKREDLVPSVYKG-----IKTDVIETGI 95
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGS-GSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
F ++ + +R P +G G + + YGT+G +V T + L+
Sbjct: 96 FST----------MKLSNRIR---PVLGGYGIAPVTTKYYGTMGCLV---TDGIENFILS 139
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGA------VERATSFHHRRPLTFVRADGAF- 293
+ H+ DL+ N K+ P+ L P + G V + F R + + +
Sbjct: 140 SNHILADLN--NIKLGTPI---LQPAIINGGNPEKDQVAVLSKFIPLRCINGTKRPENYM 194
Query: 294 -IPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAY 352
+ A + + V++ +K +G+ V+ +G+ V KVG S+ LTTG
Sbjct: 195 DVAIAKVINNNFVSSDIKFIGKPKGVR--------GHRLGQLVKKVGASTELTTGI---- 242
Query: 353 ALEYNDEKGICFLTDFLVVGENQQTFDLE-----------GDSGSLIL 389
I ++ ++V EN++ F ++ GDSGS++L
Sbjct: 243 ---------IQYINVTIIVDENKKQFLMKKQLVTNAMAKPGDSGSILL 281
>gi|332798101|ref|YP_004459600.1| hypothetical protein TepRe1_0081 [Tepidanaerobacter acetatoxydans
Re1]
gi|332695836|gb|AEE90293.1| hypothetical protein TepRe1_0081 [Tepidanaerobacter acetatoxydans
Re1]
Length = 334
Score = 45.8 bits (107), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 78/341 (22%), Positives = 138/341 (40%), Gaps = 75/341 (21%)
Query: 109 IRAFHSKILRCYSL-GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTAL-EG 166
+R KIL ++ G +G++ RG ++ PAI+V V K+ + LS +P L +
Sbjct: 8 LRQHEKKILSLENVVGLGLGYKTIRGRTSNKPAIIVLVKEKIPCEKLSKNNIIPKTLGDT 67
Query: 167 PGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAI 226
P DV+E +L V+ R P + G + T GT GA+
Sbjct: 68 P------TDVIEVGEI---------RLLAARVEKARPAKPGMSIGHY---KITAGTFGAL 109
Query: 227 VKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTF 286
V+ Q + + L+N HV + L PG Y G + R +
Sbjct: 110 VEDQKTGKPL-ILSNNHVLANATDGTDGKSAIGDAVLQPGAYDGGTSSDVIAYLERFVPI 168
Query: 287 VRADGA-FIPFADDFD--MSTVTTSVKGLGEIGDVK------IVD--LQSPISS------ 329
+++ GA A+ F+ ++++ VK +I +K +VD + SPI +
Sbjct: 169 LKSTGASHCAIANGFEKLINSILKIVKPDYQINFIKRTSSKNMVDAAVASPIKAEYVASE 228
Query: 330 -------------LIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQ 376
IG V K GR++G+TTG + K I + ++ + +
Sbjct: 229 IVGLGEIAGIEEPKIGAAVQKSGRTTGVTTGQI----------KAINVVIKVILSPKEEA 278
Query: 377 TFDLE---------GDSGSLILMKGENGEKPRPIGIIWGGT 408
F + GDSGS+++ ++ + IG+++ G+
Sbjct: 279 VFYEQILASSMAKPGDSGSIVV-----NDEMKAIGLLFAGS 314
>gi|326330454|ref|ZP_08196762.1| hypothetical protein NBCG_01888 [Nocardioidaceae bacterium Broad-1]
gi|325951729|gb|EGD43761.1| hypothetical protein NBCG_01888 [Nocardioidaceae bacterium Broad-1]
Length = 332
Score = 45.8 bits (107), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 70/305 (22%), Positives = 122/305 (40%), Gaps = 49/305 (16%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G +I G TD P+++V VS+K+ + +S +P ++G DV+E +
Sbjct: 39 VGVGVGLKITDGEQTDTPSVMVLVSQKMPTELVSDADTVPDTVDG-----TPTDVLEVGH 93
Query: 182 FGAPEPTPKEQLYTQIVDD------LRGGDPSIGSGSQVASQETYGTLGAIVKSQTG-SR 234
A ++ + TQ VD +R P G + T G +++ G
Sbjct: 94 LFAGGS--QQLMETQEVDAQTLALRIRPARPGFSVGHYKITAGTIGAGAYDLRTFPGIPP 151
Query: 235 QVGFLTNRHVAVDLDYPN--QKMFHPLPPTLG--PGVYLGAVERATSFHHRRPLTFVRAD 290
+ L+N HV + + + + P P G P +G + R +V A
Sbjct: 152 RYYVLSNNHVLANSNDASIGDPILQPGPFDGGTAPADVIGRLARFVPIRFDGSCNYVDAA 211
Query: 291 GAFIPF----ADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTT 346
A +PF D + T+ K ++ +G + K GR++ TT
Sbjct: 212 VAEVPFHVIDRDVYWNGYPATAAK-----------------AATVGMLLKKTGRTTNFTT 254
Query: 347 GTVLAYALEYNDEKGICFLTDFL--VVGENQQTFDLEGDSGSLILMKGENGEKPRPIGII 404
G V A A N G + F ++ N GDSGS++L N P+G++
Sbjct: 255 GRVTAVAATVNVNYGAGKVAKFCNQIITTNMSA---GGDSGSMVLDLQNN-----PVGLL 306
Query: 405 WGGTA 409
+ G++
Sbjct: 307 FAGSS 311
>gi|378551300|ref|ZP_09826516.1| hypothetical protein CCH26_14474 [Citricoccus sp. CH26A]
Length = 374
Score = 45.4 bits (106), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 122/311 (39%), Gaps = 51/311 (16%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G IG ++ G T P+ILVFV HK+ + + GV DV +
Sbjct: 31 VGVDIGEKVSHGKKTGEPSILVFVE---HKKPVKALPPEEVVPPEVDGVKTDVQEMVIEL 87
Query: 182 FGAPE-PTPKEQLYTQIVDDLRGGDPSIGSGS-------QVASQETY---GTLGAIVKSQ 230
A + P +Q+ L GG S+G +VA Y GTLGA+V+ +
Sbjct: 88 QAARQLLVPAQQVDPAAYPRLAGG-ISMGPARSIRMEPPEVAEAGEYVFVGTLGAMVRDR 146
Query: 231 TGSRQVGFLTNRHVAVDLD--YPNQKMFHPLPPTLGPGV--YLGAVERATSFHHRRPLTF 286
+ +TN HVA D +M P P G G++ RA +
Sbjct: 147 ASGATLA-MTNFHVACVDDGWAAGDRMIQPGRPDGGDATTQQFGSLARAVLSEN------ 199
Query: 287 VRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTT 346
DGA + + + V +IGDV + IG V K GR++ T
Sbjct: 200 --TDGAVVTVDEGKEWDNVVM------DIGDVA-----GSAEASIGLAVQKRGRTTQHTF 246
Query: 347 GTVLA----YALEYNDEKGICFL---TDFLVVGENQQTFDLEGDSGSLILMKGENGEKPR 399
GTV + +L+Y D G L L Q F GDSGS++L N
Sbjct: 247 GTVASAEATLSLDYGDGMGTRTLRHQVRILTDTARSQRFSEGGDSGSVVLDMDRN----- 301
Query: 400 PIGIIWGGTAN 410
+G+++ G+ +
Sbjct: 302 VVGLLFAGSTD 312
>gi|253682482|ref|ZP_04863279.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562194|gb|EES91646.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 305
Score = 45.4 bits (106), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 116/282 (41%), Gaps = 57/282 (20%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
G +G+++K G T I VFV KV K + +P+ + G+ DV+ + S
Sbjct: 30 GIGLGYKVKNGFDTHKKCIKVFVDVKVSKNNIPLHDLIPSYYD---GIETDVEQIGISTM 86
Query: 183 GAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNR 242
+ + + VD P IGS S GT G +V T R + L+N
Sbjct: 87 CSLKDKVRP------VDGGYNISPLIGSPS--------GTFGCLV---TDGRFMYLLSNC 129
Query: 243 HV-----AVDLDYPNQKMFHPLPPTLGPGVYLGA------VERATSFHHRRPLTFVRADG 291
HV A LD P L PG G + + + + +T +
Sbjct: 130 HVLATNGATPLD----------CPILQPGRKYGGKDPEDKIAILSKYIEPKYITPTSSPE 179
Query: 292 AFI--PFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTV 349
F+ A D+S V+ +K LG I + +++G+ V KVG ++ LT G +
Sbjct: 180 NFVDCAIAKITDLSKVSNKIKFLGNI--------KGTAPAILGESVQKVGCTTELTKGKI 231
Query: 350 LAYALEYNDE--KGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
+A + + KG C + ++ + + +GDSGS++L
Sbjct: 232 IALGVTITIQRPKGNCIFKNQILTNKMGE----KGDSGSILL 269
>gi|416366325|ref|ZP_11682805.1| hypothetical protein CBCST_17464 [Clostridium botulinum C str.
Stockholm]
gi|338193969|gb|EGO86547.1| hypothetical protein CBCST_17464 [Clostridium botulinum C str.
Stockholm]
Length = 295
Score = 45.4 bits (106), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 118/304 (38%), Gaps = 55/304 (18%)
Query: 107 MTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEG 166
M++ F SK +G +G++ G+ T I VFV+ K+ K L + +P EG
Sbjct: 1 MSVSIFLSK---SNVVGVGLGYKDIDGICTYEECIKVFVTEKISKNELPAKEIVPAVYEG 57
Query: 167 PGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAI 226
DVV F + L +++ L G I G+ T GTLGA+
Sbjct: 58 -----IKTDVVTGGVF------TECNLVSRVRPVLCGYAMGISDGA--TKSVTTGTLGAL 104
Query: 227 VKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRR---P 283
VK + + L + HV N+ + P + P ++ G V + + P
Sbjct: 105 VKDK---ENIYILGSGHV-----LTNENLVPLGTPIIQPSIHFGGVISKDTIAYLSKYIP 156
Query: 284 LTFVRADGAFIPFAD-----DFDMSTVTTSVKGLG----EIGDVKIVDLQSPISSLIGKQ 334
L ++ + + D +S VT + L E+ K+ D
Sbjct: 157 LRYISSTAIPENYVDCAIGKVLSISLVTPKIAILNSLPLEVSSAKLKD-----------T 205
Query: 335 VVKVGRSSGLTTGTVLAY---ALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMK 391
VVKVG SG TTGTV A + I F L +Q+ GDSGSL+L +
Sbjct: 206 VVKVGAISGYTTGTVEAVNATIWAHYSSGQILFKNQILTTLMSQK-----GDSGSLLLDR 260
Query: 392 GENG 395
N
Sbjct: 261 KGNA 264
>gi|331271154|ref|YP_004385863.1| hypothetical protein CbC4_6070 [Clostridium botulinum BKT015925]
gi|329127649|gb|AEB77591.1| hypothetical protein CbC4_6070 [Clostridium botulinum BKT015925]
Length = 302
Score = 45.4 bits (106), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 110/275 (40%), Gaps = 40/275 (14%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++I GV T I VFV K+ K L+ + +P +G D+VE +
Sbjct: 27 IGVGLGYKISNGVNTLTKCIKVFVKNKISKDKLNENEMIPKCYKGI-----PTDIVECGF 81
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
+ +T+ + + GG SIG G+ + + GT+G +VK R L
Sbjct: 82 ATSCG-------FTKRIRPVYGG-YSIGPGNALLN----GTMGCVVKDH---RYYYILGC 126
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGV-YLGAVERATSFHHRRPLTFVRADGAFI--PFAD 298
HV D + P L G + T F P+ F + ++ A
Sbjct: 127 NHVLADENIEKIGAAIIQPSKLDSGTPSHDTIAHLTKF---IPIKFGSGEENYVDCAMAR 183
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAY--ALEY 356
D S VT + +G I V L G+ V K GR++ T G + A L
Sbjct: 184 IDDKSLVTPEIVIIGSIKGTSDVKL--------GESVRKCGRTTEFTIGRISAINTTLNI 235
Query: 357 NDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMK 391
N +KG C + + +GDSG++++ K
Sbjct: 236 NFKKGKCLFKNQIA----TSIMSSKGDSGAILVDK 266
>gi|425472558|ref|ZP_18851399.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389881340|emb|CCI38094.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 378
Score = 45.4 bits (106), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 80/326 (24%), Positives = 137/326 (42%), Gaps = 54/326 (16%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQW-LSPIQCLPTALEGPGGVWCDVDVVEFS 180
LGT IGFR G+LT + V+VS KV S +PT++ GG+ +++ V
Sbjct: 94 LGTGIGFRSVGGLLTPDVTLKVYVSEKVAGTIAASAFAAVPTSI---GGMPVEIEEV--- 147
Query: 181 YFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
++ TQ+ + R P S Q T GTLG +V + + ++ L+
Sbjct: 148 ----------GEIVTQLYNR-RYARPVRCGVSIGHPQVTAGTLGCLVVLR--NNKLCLLS 194
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAV---ERATSFHHRRPLTFVRADGAFIPFA 297
N HV + + N ++ P+ + PG G V +R + FVR + P
Sbjct: 195 NNHVIANSN--NARIGDPI---IQPGRVDGGVVPGDRIALLEN-----FVRVN---CPGP 241
Query: 298 DDFDMSTVTTSVKGLGEIGDVKIVDLQ---SPISSLIGKQVVKVGRSSGLTTGTVLAYAL 354
+ D + T+ + D + V+ +PI++ +G V K GR++ T GT+ +
Sbjct: 242 NLVDAAVAWTAFSFV----DPRHVNYTLNPTPIAARLGMTVKKNGRTTQATIGTITDINV 297
Query: 355 EYNDEKGIC----FLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
+ C F + G F GDSGSLI+ N +P+ +++ G +
Sbjct: 298 NISVGGYSCGAAQFRNQIGIRGIGGNPFSRGGDSGSLIVTANSN----QPVALLFAGRTD 353
Query: 411 RGRLKLKIGQPPENWTSGVDLGRLLN 436
+ P + S + + R +N
Sbjct: 354 N---SITFANPIGSVISQLSIQRFVN 376
>gi|229822411|ref|YP_002883937.1| hypothetical protein Bcav_3934 [Beutenbergia cavernae DSM 12333]
gi|229568324|gb|ACQ82175.1| conserved hypothetical protein [Beutenbergia cavernae DSM 12333]
Length = 350
Score = 45.1 bits (105), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 81/187 (43%), Gaps = 37/187 (19%)
Query: 219 TYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGA---VERA 275
T GTLGA V + G+R V L+N HV V P L PG + G +R
Sbjct: 140 TAGTLGAFV-TYDGARHV--LSNHHVLVG------SSGQPGDAVLQPGPFDGGSDPADRI 190
Query: 276 TSFHHRRPLTF-----VRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSL 330
+ H PL V A A + DD D + ++ G E+
Sbjct: 191 GALAHLVPLVAGEEAEVDAALASLDAPDDVDPAYPGGTLTGTSEVEG------------- 237
Query: 331 IGKQVVKVGRSSGLTTGTVLAYALE-----YNDEKG-ICFLTDFLVVGENQQTFDLEGDS 384
G+ V K+GR++G+T G V A ++ Y + G + F V GE +++F GDS
Sbjct: 238 -GEGVEKIGRTTGVTRGRVTAIEVDDLLVDYGEGLGTLSFSGQIEVEGEGEESFSDGGDS 296
Query: 385 GSLILMK 391
GSL+ ++
Sbjct: 297 GSLVYLR 303
>gi|379059056|ref|ZP_09849582.1| Equine arteritis virus peptidase S32 [Serinicoccus profundi MCCC
1A05965]
Length = 440
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 123/291 (42%), Gaps = 53/291 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G IG +I G T +I+V+V +KV ++ Q +P L+ G+ DV +
Sbjct: 29 VGVDIGEKISDGKPTGEMSIVVYVEKKVAPSKVARSQKVPAELD---GIPTDVQELVIEL 85
Query: 182 FGAP-----EPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQV 236
G P +P +T I RGG SIG + + GT GA+V+ T + V
Sbjct: 86 QGGPGLYAGDPLSDTSKHTTI----RGGI-SIGP----SRHQNAGTAGALVRDTT-TGAV 135
Query: 237 GFLTNRHVA-VDLDYPNQKMFHPLPPTLGPGVYLG---AVERATSFHHRRPLTFVRADGA 292
LTN HVA VD + + L PG + AV++ + R + + DGA
Sbjct: 136 SLLTNFHVACVDTSWTAGETV------LQPGRFDSGNPAVDQVGTLT--RGVISEQVDGA 187
Query: 293 FIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISS---LIGKQVVKVGRSSGLTTGTV 349
+ D E+ ++VD+ + S + G V K GR++ T G V
Sbjct: 188 VVRLDGD--------------EVWADEVVDIGGVVGSTPAVAGMAVQKRGRTTEHTHGEV 233
Query: 350 LA----YALEYNDEKGICFLTDFLVVGENQQT--FDLEGDSGSLILMKGEN 394
++ L+Y D G+ L + + T F GDSGS+++ G
Sbjct: 234 VSVDATVTLDYGDGVGMRTLRRQVSIRPAAGTARFSDRGDSGSVVMNAGRQ 284
>gi|331269225|ref|YP_004395717.1| hypothetical protein CbC4_1040 [Clostridium botulinum BKT015925]
gi|329125775|gb|AEB75720.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 314
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 115/274 (41%), Gaps = 38/274 (13%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++I T I VFVS KV + L +P +G + DVV+ Y
Sbjct: 36 VGVGVGYKIINNFYTSKKCITVFVSEKVDQNNLPLKDLIPAVYKG-----IETDVVQSGY 90
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F T K + ++GG G + AS T G+ G +V G+R+ N
Sbjct: 91 FVGASLTQK-------IRPVQGG---YSVGPESASNIT-GSQGCVVTD--GTRRYMLSCN 137
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPG--VYLGAVERATSFHHRRPLTFVRA--DGAFIPFA 297
+A + P L P+LG G AV T + + T + + + A
Sbjct: 138 HIIAHENMLPRNTQI--LQPSLGDGGKTTKDAVAYLTKYIPLKKKTTLNSPENDVDCAIA 195
Query: 298 DDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTG--TVLAYALE 355
+++ +++ + +G DL+ + +G++VVK GR++ T G T + ++
Sbjct: 196 REYEPGILSSKIYIIG--------DLKGVSAPNLGRKVVKSGRTTAYTEGSITTIGATVQ 247
Query: 356 YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
E GI ++ Q EGDSG++++
Sbjct: 248 VKLELGIYIFKHQIITTSMGQ----EGDSGAVLV 277
>gi|253771278|ref|YP_003034119.1| hypothetical protein CLG_A0025 [Clostridium botulinum D str. 1873]
gi|253721430|gb|ACT33722.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 315
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 113/280 (40%), Gaps = 49/280 (17%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G++IK G T+ I VFVS+K+ K L+ +P +G DV E +
Sbjct: 37 VGICCGYKIKEGFYTNQLCIQVFVSKKIPKNQLNSYDMIPLIYKG-----IPTDVKETGH 91
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIG--SGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
F A +++ P +G S S + + GT G +V + +G
Sbjct: 92 FKACYLIERKR-------------PVLGGYSISTSMNDQISGTAGCVVTNGVNKFILG-- 136
Query: 240 TNRHVAVDLDYPNQKMFHPLPPTLGPG-VYLGAVERAT--SFHHRRPLTFVRADGAFIPF 296
TN +A N + P + P +Y G R T S + PL F++ + +
Sbjct: 137 TNHVLA------NSNVLPIKTPIIQPAYIYDGYTPRDTIASLYKYIPLRFIKGEEHPLNL 190
Query: 297 ADDFDMSTVTTSVKGLGEIGDVKIV---DLQSPISSLIGKQVVKVGRSSGLTTGTVLAYA 353
D + +T S +I KI L+S S +G V KVG S LT GT+ +
Sbjct: 191 T-DCALGLLTKS-----DIMSNKIAFIGKLRSVKSPKLGGHVKKVGAISELTEGTITGIS 244
Query: 354 ----LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
+ Y D + F+ L GDSGS+++
Sbjct: 245 GSILVSYLDGRRALFMDQIL-----TTRMSGNGDSGSILV 279
>gi|343500347|ref|ZP_08738242.1| hypothetical protein VITU9109_14061 [Vibrio tubiashii ATCC 19109]
gi|418477654|ref|ZP_13046779.1| hypothetical protein VT1337_04732 [Vibrio tubiashii NCIMB 1337 =
ATCC 19106]
gi|342820593|gb|EGU55413.1| hypothetical protein VITU9109_14061 [Vibrio tubiashii ATCC 19109]
gi|384574609|gb|EIF05071.1| hypothetical protein VT1337_04732 [Vibrio tubiashii NCIMB 1337 =
ATCC 19106]
Length = 445
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 91/209 (43%), Gaps = 41/209 (19%)
Query: 219 TYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN--QKMFHPLPPTLGPGVYLGAVERAT 276
T GT+GA V + T V L+N HV + + N + M P P + G E+
Sbjct: 153 TAGTIGARVTNGT---NVFALSNNHVFANSNDTNVPENMLQPGP-------FDGGTEQND 202
Query: 277 SFHHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVD-LQSPISSL----I 331
+F + DG+ A+ D + TS GE+ D +P S++ I
Sbjct: 203 TFASLTDYEPILFDGS----ANIMDAAVALTST---GELTTSTPADGYGTPDSTVNEAVI 255
Query: 332 GKQVVKVGRSSGLTTGTVLAYALEYNDEKGICF-----LTDF-LVVGE---NQQTFDLEG 382
G V K GR++G T GTV A N +C+ T L VG+ TF G
Sbjct: 256 GMSVKKYGRTTGFTQGTVDAINASVN----VCYEGSSTCTKLALFVGQIVVTPGTFSAGG 311
Query: 383 DSGSLILMKGENGEKPRPIGIIWGGTANR 411
DSGSLI+ N P+G+++ G+++
Sbjct: 312 DSGSLIVSSNGN----NPVGLLFAGSSSH 336
>gi|393726247|ref|ZP_10346174.1| hypothetical protein SPAM2_21549 [Sphingomonas sp. PAMC 26605]
Length = 736
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 102/245 (41%), Gaps = 39/245 (15%)
Query: 221 GTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHH 280
GT+G +V T + LTNRHVA + P+ + L + P
Sbjct: 152 GTVGCLV---TDGHKTFALTNRHVAGE---PDTVLSASLRGDVTP----------VGVAS 195
Query: 281 RRPLTFVRADGAFIPFADDFDMSTV-------------TTSVKGL-GEIGDVKIVDLQSP 326
+R LT + D F F+ T+ T+ V GL GE+G V ++ +
Sbjct: 196 KRSLTRLPLDDVFPTFSAQRTFLTLDVGLVDVDVVGDWTSRVFGLEGELGAVVDLNEDNL 255
Query: 327 ISSLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG- 385
+ LI +++ G SG G + A + G ++++FL+ E+ Q GDSG
Sbjct: 256 GTQLIDQRMEAFGAVSGHLVGRIKALFYRHKALAGYEYVSEFLIAPEDGQAQTCPGDSGM 315
Query: 386 --SLILMKGENGEKP-RPIGIIWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDL 442
L+ +G++ +P+ + WGG G + N++ L LL++DL
Sbjct: 316 VWHLVQTDAASGDRTLQPLAVEWGGQGLIGSDDRTL-----NFSLATGLATACQLLDVDL 370
Query: 443 ITTDE 447
+ T +
Sbjct: 371 VRTGD 375
>gi|416350192|ref|ZP_11680807.1| hypothetical protein CBCST_04751 [Clostridium botulinum C str.
Stockholm]
gi|338196351|gb|EGO88549.1| hypothetical protein CBCST_04751 [Clostridium botulinum C str.
Stockholm]
Length = 315
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 114/280 (40%), Gaps = 49/280 (17%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G++IK G T+ I VFVS+K+ K L+ +P +G DV E +
Sbjct: 37 VGICCGYKIKEGFYTNQLCIQVFVSKKIPKNQLNSYDMIPLIYKG-----IPTDVKETGH 91
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIG--SGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
F A +++ P +G S S + + GT G +V + +G
Sbjct: 92 FKACYLIERKR-------------PVLGGYSISTSMNDQISGTAGCVVTNGVNKFILG-- 136
Query: 240 TNRHVAVDLDYPNQKMFHPLPPTLGPG-VYLGAVERAT--SFHHRRPLTFVRADGAFIPF 296
TN +A N + P + P +Y G R T S + PL F++ + +
Sbjct: 137 TNHVLA------NSNVLPIKTPIIQPAYIYDGYTPRDTIASLYKYIPLRFIKGEEHPLNL 190
Query: 297 ADDFDMSTVTTSVKGLGEIGDVKIV---DLQSPISSLIGKQVVKVGRSSGLTTGTVLAYA 353
D + +T S +I KI L+S S +G V KVG S LT GT+ +
Sbjct: 191 T-DCALGLLTKS-----DIMSDKIAFIGKLRSVKSPKLGGHVKKVGAISELTEGTITGIS 244
Query: 354 ----LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
+ Y D + F+ L + GDSGS+++
Sbjct: 245 GSILVSYLDGRRALFMDQILTTRMSGN-----GDSGSILV 279
>gi|331270371|ref|YP_004396863.1| hypothetical protein CbC4_2201 [Clostridium botulinum BKT015925]
gi|329126921|gb|AEB76866.1| hypothetical protein CbC4_2201 [Clostridium botulinum BKT015925]
Length = 478
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 109/280 (38%), Gaps = 50/280 (17%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
+G++ +G++T P I VFVS K L P +P G D+V F
Sbjct: 199 AVGLGYKEIQGIVTTEPCIKVFVSEKTPPGNLPPSDLIPPIYNG-----IKTDIVASGVF 253
Query: 183 GAPEPTPKEQLYTQIVDDLRGGDP--SIG-SGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
E T K +R P SIG +G +VA GTLG IV++ +
Sbjct: 254 TPCELTKK----------VRPAHPGYSIGPAGYKVA-----GTLGCIVQNPSEKAYYILS 298
Query: 240 TNRHVA----VDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIP 295
TN +A V +D P L PGV G + H ++ F
Sbjct: 299 TNHLLAQLGKVQID----------TPILQPGVLDGGKIDTDTIAHLTRYIPIKMKTLFKT 348
Query: 296 FADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSL----IGKQVVKVGRSSGLTTGTVLA 351
+ D + S L KI + + I L IG +V K+GR++G T G + A
Sbjct: 349 PENHVDAAIAKVSNTSLIS---SKIAIVNANIKRLGAPGIGDRVFKIGRTTGRTHGVITA 405
Query: 352 YALE--YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
+ N +G + ++ + T GDSGS++L
Sbjct: 406 IDVTQVINYPEGKALFKEQILTSASGNT----GDSGSVLL 441
>gi|253771267|ref|YP_003034112.1| hypothetical protein CLG_A0018 [Clostridium botulinum D str. 1873]
gi|253721419|gb|ACT33711.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 308
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 119/295 (40%), Gaps = 49/295 (16%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K G T+ + VFVSRK + ++ +P+ +G DV E Y
Sbjct: 37 VGLGLGYKVKNGFYTNQLCVQVFVSRKYSENEINIKDKIPSMYKG-----ILTDVKETGY 91
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F A K I L G S+ G+ E YGT G +V T L+
Sbjct: 92 FKACSLNKK------IRPVLGGYSISVYKGN-----EIYGTAGCVV---TNGVNKFVLST 137
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRR-PLTFVRADGAFIPFADDF 300
HV ++ K++ P VY G + HR PL +G P
Sbjct: 138 NHVLTKIN----KLYMHFPIIQPACVYGGTYSDTIATLHRYIPLHLF--NGGEPPILGLL 191
Query: 301 DMSTVTT-SVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA----YALE 355
+ + + +G++ VK S +G V KVG S LT G + + + +
Sbjct: 192 TNANIMNPEIAFIGKVTCVK--------SPKLGIPVRKVGAMSELTEGIITSINANHTVT 243
Query: 356 YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
Y + + + F D ++ ++GDSGS+++ K IG+++ T N
Sbjct: 244 YTNGE-VAFFKDQILTSN----MAVKGDSGSILIDKNN-----CAIGLLFATTNN 288
>gi|86139781|ref|ZP_01058347.1| hypothetical protein MED193_12148 [Roseobacter sp. MED193]
gi|85823410|gb|EAQ43619.1| hypothetical protein MED193_12148 [Roseobacter sp. MED193]
Length = 516
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 15/123 (12%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
G IGFR +RG TD + + V RK+ L P Q LP+ + G +DV+E +Y
Sbjct: 38 GIDIGFRWRRGQRTDEICLRMHVQRKLPIDALLPSQVLPSHVAG-----IALDVIEAAYQ 92
Query: 183 GAPEPTPKEQLYTQIVDDLRGGDP-SIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
+ EP Q T P ++G S E GT+G +V +T + G L+N
Sbjct: 93 PSLEPGASRQAATP--------QPYTMGGLCCGRSGEGAGTIGLVVIDRTTGKP-GILSN 143
Query: 242 RHV 244
HV
Sbjct: 144 WHV 146
>gi|416354626|ref|ZP_11681687.1| hypothetical protein CBCST_10406 [Clostridium botulinum C str.
Stockholm]
gi|338195372|gb|EGO87663.1| hypothetical protein CBCST_10406 [Clostridium botulinum C str.
Stockholm]
Length = 259
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 60/266 (22%), Positives = 112/266 (42%), Gaps = 56/266 (21%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G IG+++++ VLT I VF S+K+ L +P+ +G DV+E
Sbjct: 41 VGVGIGYKVQKEVLTSEKCIAVFASKKIPNNELKREDLVPSVYKG-----IKTDVIETGI 95
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGS-GSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
F ++ + +R P +G G + + YGT+G +V T + L+
Sbjct: 96 FST----------MKLSNRIR---PVLGGYGIAPVTTKYYGTMGCLV---TDGIENFILS 139
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGA------VERATSFHHRRPLTFVRADGAF- 293
+ H+ DL+ N K+ P+ L P + G V + F R + + +
Sbjct: 140 SNHILADLN--NIKLGTPI---LQPAIVNGGNPEKDQVAVLSKFIPLRSINGTKRPENYM 194
Query: 294 -IPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAY 352
+ A + + V++ +K +G+ V+ +G+ V KVG S+ LTTG +
Sbjct: 195 DVAIAKVINNNFVSSDIKFIGKPKGVR--------GHRLGQLVKKVGASTELTTGIIQ-- 244
Query: 353 ALEYNDEKGICFLTDFLVVGENQQTF 378
++ ++V EN++ F
Sbjct: 245 -----------YMNVTIIVDENKKQF 259
>gi|357409381|ref|YP_004921117.1| hypothetical protein Sfla_0132 [Streptomyces flavogriseus ATCC
33331]
gi|320006750|gb|ADW01600.1| hypothetical protein Sfla_0132 [Streptomyces flavogriseus ATCC
33331]
Length = 325
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 125/298 (41%), Gaps = 47/298 (15%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
G +G R + G TD A++V + K + + P + LP L DV V
Sbjct: 28 GVGVGRRRRAGDKTDEYAVVVHLREKQPESKIPPARLLPAELRFTERSGRDVSV-RVDVQ 86
Query: 183 GAPEPTPKEQLYTQIVDDLRGGDPSIGS-GSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
P+PTP+ T V + GG S+G+ G+ V S GTLG V T +RQV L+N
Sbjct: 87 QHPKPTPQ----TDRVRPVPGG-VSVGTVGAHVGS----GTLGGWVW-DTVTRQVVALSN 136
Query: 242 RHVAVDLDYPNQKMFHPLPPTLG--PGVYLGAVERATSFHHRRPLTFVRADGAFIPFAD- 298
HV P + P G P + +V R S D A AD
Sbjct: 137 AHVF--GSRPGVSIIQPSSDDGGVTPDDRIASVMRTGSL-----------DAAIAEPADP 183
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYND 358
F +++ + EI + + + +V K GR++GLT GTV + +D
Sbjct: 184 SFVSASIVQGGPAVFEIAE-----------ATLDMRVQKTGRATGLTFGTVDLIDFD-SD 231
Query: 359 EKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGE----KPRPIGIIWGGTANRG 412
+G +D + E F L GDSG+L L+ + + + +G+ WGG+ G
Sbjct: 232 YRGSH--SDLWIDAEGAD-FSLGGDSGALYLLAPGSAAFATGRRQAVGLHWGGSGQDG 286
>gi|147676419|ref|YP_001210634.1| hypothetical protein PTH_0084 [Pelotomaculum thermopropionicum SI]
gi|146272516|dbj|BAF58265.1| hypothetical protein PTH_0084 [Pelotomaculum thermopropionicum SI]
Length = 335
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 80/342 (23%), Positives = 142/342 (41%), Gaps = 76/342 (22%)
Query: 110 RAFHSKILRCYSL----GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALE 165
RAF + SL G +G++ G T PA +++V +K+ L+ +P ++
Sbjct: 6 RAFKKTRAKLLSLENVVGIGVGYKQTGGENTGEPAFIIYVEKKMPAAGLARGSVIPKRID 65
Query: 166 GPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGA 225
G DV+E + ++ R P + G Q T GTLGA
Sbjct: 66 G-----LITDVIEIG---------RVKMLGVRTSRERPCQPGVSVGHY---QSTAGTLGA 108
Query: 226 IVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAV--ERATSFHHRRP 283
+V+ + ++++ L+N HV + ++ P L PG Y G +R P
Sbjct: 109 VVRDRE-TKKLMILSNNHVLANGSSESEAKAKQGDPILQPGPYDGGTLKDRIGVLDRYVP 167
Query: 284 L--TFVRAD---GAFIP---------FADDFDM---------STVTTSVKGLG------- 313
L + V+AD A + F ++++ +TV ++ L
Sbjct: 168 LVKSAVKADCPVAAAVARGGTRLLNIFKQNYEVRFYKRLYGENTVDCALARLDSEDLVKA 227
Query: 314 ---EIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAY----ALEYNDEKGICFLT 366
+IGD+ V P G V K GR++GLT+G V + +E D++ + F +
Sbjct: 228 TILDIGDITGVSEAGP-----GDLVQKSGRTTGLTSGVVKSVNTTLQVEMKDDEKLWF-S 281
Query: 367 DFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGT 408
D +V Q GDSGSL++ ++ + +G+++ G+
Sbjct: 282 DQVVADMVSQ----PGDSGSLVV-----DQERKVVGLLFAGS 314
>gi|416365266|ref|ZP_11682761.1| hypothetical protein CBCST_17192 [Clostridium botulinum C str.
Stockholm]
gi|338194035|gb|EGO86591.1| hypothetical protein CBCST_17192 [Clostridium botulinum C str.
Stockholm]
Length = 305
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 116/282 (41%), Gaps = 57/282 (20%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
G +G+++K G T I +FV KV + + +P+ + G+ DV+ + S
Sbjct: 30 GIGLGYKVKNGFDTHKKCIKIFVDVKVSENNIPLHDLIPSYYD---GIETDVEQIGISTM 86
Query: 183 GAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNR 242
+ + + VD P IGS S GT G +V T R + L+N
Sbjct: 87 CSLKDKVRP------VDGGYNISPLIGSPS--------GTFGCLV---TDGRFMYLLSNC 129
Query: 243 HV-----AVDLDYPNQKMFHPLPPTLGPGVYLGA------VERATSFHHRRPLTFVRADG 291
HV A LD P L PG G + + + + +T +
Sbjct: 130 HVLATNGATPLD----------CPILQPGRKYGGKDPEDKIAILSKYIEPKYITPTSSPE 179
Query: 292 AFI--PFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTV 349
F+ A D+S V+ +K LG I + +++G+ V KVG ++ LT G +
Sbjct: 180 NFVDCAIAKVTDLSKVSNKIKFLGNI--------KGTAPAILGESVQKVGCTTELTKGKI 231
Query: 350 LAYALEYNDE--KGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
+A + + KG C + ++ + + +GDSGS++L
Sbjct: 232 IALGVTITIQRPKGNCIFKNQILTNKMGE----KGDSGSILL 269
>gi|253573702|ref|ZP_04851045.1| predicted protein [Paenibacillus sp. oral taxon 786 str. D14]
gi|251847230|gb|EES75235.1| predicted protein [Paenibacillus sp. oral taxon 786 str. D14]
Length = 367
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 84/194 (43%), Gaps = 23/194 (11%)
Query: 210 SGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYL 269
+G V + ++ GT+G IV Q L+N HV VD N + F TL PG
Sbjct: 108 AGYSVGTSDSSGTVGLIVSGDASGCQRLILSNNHVLVD---NNTRRFS---ATLQPGGAD 161
Query: 270 G---AVERATSFHHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEI-GDVKIVDLQS 325
G A +R L+ RA+ A S + + G + G V+
Sbjct: 162 GGTIAKDRIGQLDRFVKLSRKRANYIDAATAKPLRRSLLKPAYAVFGIVPGHVR------ 215
Query: 326 PISSLIGKQVVKVGRSSGLTTGTVLA----YALEYNDEKGICFLT-DFLVVGENQQTFDL 380
S IG ++ KVGR++G+ TGTV + ++Y D + +T V ++ L
Sbjct: 216 --SYKIGDRLKKVGRTTGVVTGTVESIHTDVQVDYGDYGNLGMITFKNQSVIRGKRPVSL 273
Query: 381 EGDSGSLILMKGEN 394
EGDSGS+ L + N
Sbjct: 274 EGDSGSVWLTRKGN 287
>gi|253681646|ref|ZP_04862443.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253561358|gb|EES90810.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 314
Score = 42.7 bits (99), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 76/300 (25%), Positives = 118/300 (39%), Gaps = 57/300 (19%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++ G T+ I V V++KV LSP + +P +G D+ E Y
Sbjct: 36 VGIGLGYKTSGGFRTNEKCINVLVTKKVPSYDLSPNEVIPKWYKG-----IKTDIYESGY 90
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQ-ETYGTLGAIVKSQTGSRQVGFLT 240
F K L V P++G S S + YGT+ IVK + + L+
Sbjct: 91 F-------KSHLLNSRV------RPALGGYSISPSTLKQYGTMACIVKDNLSNYFL--LS 135
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFADDF 300
HV +L+ P L G + S + PL F + + + D
Sbjct: 136 CNHVIANLNEVQLGTSIVQPSVLDNGK--SPTDSIGSLYKFIPLKFNTSTHLSVNYVDAA 193
Query: 301 -----DMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYA-- 353
D S V+ + LG+ + PI+ + V K GR++ +T G V
Sbjct: 194 LAIISDKSLVSNKIYILGKPNN--------PITPSLDLSVRKAGRTTNVTYGYVKLLGST 245
Query: 354 --LEYNDEKGICFLTDFLVVGENQQTFDL---EGDSGSLILMKGENGEKPRPIGIIWGGT 408
L + + G+ +NQ L GDSG+L LM EN PIG++ GG+
Sbjct: 246 VNLSFGSKSGLF---------KNQILTTLMSDTGDSGAL-LMDLEN----NPIGLVIGGS 291
>gi|294461761|gb|ADE76439.1| unknown [Picea sitchensis]
Length = 95
Score = 42.7 bits (99), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 41/82 (50%), Gaps = 6/82 (7%)
Query: 507 ETNPSLMETEFHLEDGVKAGPSVELQFIPSFTGHSPLHQNNPSDKASSENLASLWNGCD- 565
E NP ++EF + + SVE F+ F H L + ENL++L + D
Sbjct: 16 EVNPIFRQSEF-MTRLAEPSTSVEHPFMKDF--HRSLSHPEQAKSPKCENLSALRDVRDV 72
Query: 566 --EDICFSLQLGDNEAKRRRSD 585
EDI L LGD EAKRRRS+
Sbjct: 73 SSEDISIGLHLGDREAKRRRSN 94
>gi|253681834|ref|ZP_04862631.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253561546|gb|EES90998.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 317
Score = 42.7 bits (99), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 78/308 (25%), Positives = 121/308 (39%), Gaps = 67/308 (21%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G++IK G T+ I VFV +K+ L+ +P+ +G D+ E
Sbjct: 35 VGIGCGYKIKNGFYTNQLCIQVFVRKKLPLNELNTNDLIPSTYKG-----IPTDIKETGG 89
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F A TQ + GG S + E GTLG +V T ++ + L+N
Sbjct: 90 FTACS-------LTQKIRPTPGGY----CISNEYNDEYLGTLGCLV---TDNKDLFLLSN 135
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFI--PFADD 299
HV +F+ P LG + +E ++ F + +I F +
Sbjct: 136 SHVLA--------IFNQAP--LGTKI----IEPSSEFRGNPKTDTIATLSKYIELKFIEG 181
Query: 300 FDMSTVTTSVKGLGEIGDVKIVD--LQSPISSLIG-----------KQVVKVGRSSGLTT 346
M T + G KI+D L SP +L+G + V KVG S LTT
Sbjct: 182 TSMPVNYT------DCGIAKIIDKSLVSPKIALVGIPKGLSNPKLNQPVKKVGAISELTT 235
Query: 347 GTVLA----YALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIG 402
GTV + + YND K + + + Q GDSG+++L IG
Sbjct: 236 GTVTSIHATVTVNYNDIKKLAIFKEQIFTNLLAQ----PGDSGAILLDTNNTA-----IG 286
Query: 403 IIWGGTAN 410
++ G+ N
Sbjct: 287 LLMSGSEN 294
>gi|253682243|ref|ZP_04863040.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253561955|gb|EES91407.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 314
Score = 42.7 bits (99), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 116/278 (41%), Gaps = 46/278 (16%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++I G+ T I+VFVS KV K L +P + G + DV+E Y
Sbjct: 36 VGIGLGYKIINGMYTSKKCIVVFVSHKVEKANLILKDLIPKSYMG-----IETDVLESGY 90
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F TQ + ++GG SIG S G+ G +V G+++ N
Sbjct: 91 FRGAS-------LTQRIRPVQGG-YSIGPES---VPNVTGSQGCVVTD--GTKKYMLSCN 137
Query: 242 RHVAVDLDYPNQKMF----HPLPPTL--GPGVYLGAVERATSFHHRRPLTFVRADGAFI- 294
+A N+ M L P+L G + A+ T + + T + + ++
Sbjct: 138 HVIA------NENMLPINTQILQPSLKDGSKITKDAIAYLTKYIPLKNKTAINSPENYVD 191
Query: 295 -PFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTG--TVLA 351
A +++ + + +G + V L GK+V+K GR++ T G T +
Sbjct: 192 CAIAREYEPGIFSPQIYMIGSLKGVSTPQL--------GKKVMKSGRTTSYTEGLITTIG 243
Query: 352 YALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
++ E GI + +V Q EGDSG++++
Sbjct: 244 VTVKVKLELGIYIFKNQIVTTAMGQ----EGDSGAVLV 277
>gi|253681939|ref|ZP_04862736.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253561651|gb|EES91103.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 326
Score = 42.4 bits (98), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 96/239 (40%), Gaps = 42/239 (17%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
+GF + G+ T I VF+S+K+ K L +P +G D +E F
Sbjct: 44 AVGLGFNVINGICTHEKCIKVFLSKKLSKNSLPSSALIPPIYKG-----ITTDTIESGIF 98
Query: 183 GAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNR 242
QL ++I L G SIG A+Q T GT G +VK + L+
Sbjct: 99 ST------SQLTSRIRPVLEGY--SIGP----AAQNTAGTFGCLVK-DLKDNSINILSCN 145
Query: 243 HVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFADDFDM 302
HV L P L PG+ G + H T + +IP +
Sbjct: 146 HVLARLG-----TVPICAPILQPGLLDGG-----NIHTDVIATLSK----YIPIKYKGLV 191
Query: 303 STVTTSV-KGLGEIGDVKIVD-----LQSPISSL----IGKQVVKVGRSSGLTTGTVLA 351
S+ T V + ++ + +V L +P+ + +G+ V K+GR++G T G ++A
Sbjct: 192 SSPTNLVDAAIAKVSNPSLVSNKLAILNTPLRGVSEPNVGEHVFKIGRTTGSTEGYIVA 250
>gi|331271119|ref|YP_004385828.1| hypothetical protein CbC4_6031 [Clostridium botulinum BKT015925]
gi|329127614|gb|AEB77556.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 316
Score = 42.4 bits (98), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 132/320 (41%), Gaps = 54/320 (16%)
Query: 103 LLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPT 162
+++L+ ++ + + +G +G++IK G T I VFVS K+HK L +P
Sbjct: 15 IIKLICNNEYNFFLNKANVIGIGLGYKIKGGFCTCKKCIKVFVSTKIHKAQLQTKDLIPI 74
Query: 163 ALEGPGGVWCDVDVVEFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETY-- 220
+G DV E YF K QL V P+IG S + Y
Sbjct: 75 MYKGI-----ITDVNEVGYF-------KFQLLNTKV------RPTIGGYSIGPNVPEYCS 116
Query: 221 --GTLGAIVKSQTGSRQVGFLTNRHVAVDLD--YPNQKMFHP-LPPTLGPGVYLGAVERA 275
G++G +VK S L++ HV L+ P + P L + P +G + R
Sbjct: 117 NIGSIGCLVKDSHSSY---LLSSCHVLSALNKLTPGTGVVQPSLYDSGTPADEVGKLARY 173
Query: 276 TSFH----HRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLI 331
S +P V D A + F + + + IG K +D ++ +
Sbjct: 174 ISLKPEGTFSKPTNLV--DAAIVRFDAHVE------GLPNIAFIGSPKGID-----NAAL 220
Query: 332 GKQVVKVGRSSGLTTGTVLAYAL--EYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLI 388
V K GR+S T+G V A + E + KG +T +L + T EGDSG+++
Sbjct: 221 NDGVFKAGRTSDETSGHVTAINVTCEISFSKGT-NVTKYLFKNQIMTTKMSSEGDSGAVL 279
Query: 389 LMKGENGEKPRPIGIIWGGT 408
+ + + +G++ G T
Sbjct: 280 VKANK-----KIVGLLVGCT 294
>gi|416347988|ref|ZP_11680103.1| hypothetical protein CBCST_00395 [Clostridium botulinum C str.
Stockholm]
gi|338197133|gb|EGO89307.1| hypothetical protein CBCST_00395 [Clostridium botulinum C str.
Stockholm]
Length = 306
Score = 42.0 bits (97), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 136/330 (41%), Gaps = 58/330 (17%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++IK G T + VFV+RK+ +S +P+ G D+V+
Sbjct: 29 IGVGLGYKIKNGFNTFKKCLSVFVTRKLPCYNISSSNLVPSYYWG-----IPTDIVDTGV 83
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F + K + + GG IG S GTLG IV T S+ LT
Sbjct: 84 FHLQKLNNK-------IRPVPGG-YDIGPAFIWDS----GTLGCIV---TDSKYYYILTC 128
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVER---ATSFHHRRPLTFVRADGAFIPFAD 298
H ++ ++ HP+ L P G R + P+ + + I +
Sbjct: 129 NHTITSKEF--LRLNHPI---LQPSSVYGGRYREDTIATLSKFIPIKYSTSSEEGINYV- 182
Query: 299 DFDMSTVTTSVKGLGEIGDV-KIVDLQSPISSLIGKQVVKVGRSSGLTTGTV--LAYALE 355
D M+ +TT + +I + +I + P +G V KVG ++ LT G + + +
Sbjct: 183 DCAMAKITTRSQISTKINFLGRIKGMAKP---SLGMSVQKVGATTELTKGNITSIGATIV 239
Query: 356 YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLK 415
+N+++G C D ++ + GDSGS++L E IG++ G+ +
Sbjct: 240 FNEKQGKCIFFDQIITNKMSDF----GDSGSILL-----DENINAIGMLMSGSPTKSTF- 289
Query: 416 LKIGQPPENWTSGVDLGRLLNLLELDLITT 445
P E+ +LN L++ L+T+
Sbjct: 290 ----NPIES---------VLNALDVKLVTS 306
>gi|448413152|ref|ZP_21576998.1| hypothetical protein C475_21804 [Halosimplex carlsbadense 2-9-1]
gi|445667333|gb|ELZ19977.1| hypothetical protein C475_21804 [Halosimplex carlsbadense 2-9-1]
Length = 317
Score = 42.0 bits (97), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 107/285 (37%), Gaps = 33/285 (11%)
Query: 118 RCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVV 177
R +GTAIG + TD A++V V+RK+ + LS +PT +E C DV
Sbjct: 14 RANVVGTAIGPKRVGDRPTDEEALIVLVARKLPETQLSEADRIPTEIEF-DDAKCKTDVQ 72
Query: 178 EFSYFGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVG 237
E E+ D R P+ S + T GTLG+ +T +
Sbjct: 73 EVGDVRTQATAEAEERP----DRERRWRPAPAGVSFGHVETTAGTLGS-PPLETADGETV 127
Query: 238 FLTNRHVAVDLDY--PNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIP 295
LTN HVA ++ P + P P G AV RP D A +
Sbjct: 128 VLTNAHVAAPIEAAEPGDDVLQPGP--ADGGTEDDAVGSLVEGSEIRPDEPNTTDSAIVA 185
Query: 296 FADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALE 355
D + V G+GE P + K GR++G+TTG +
Sbjct: 186 ----VDPADFEDRVLGIGE-----FAGFAEPSTD---ATFTKSGRTTGVTTGDLRGRDAR 233
Query: 356 -----YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENG 395
Y+DE T V G GDSGSLI ++ ++G
Sbjct: 234 IRVRGYHDEP--TLFTGIDVFG----PMSAAGDSGSLIGIEADDG 272
>gi|225166827|ref|YP_002650812.1| conserved hypothetical protein [Clostridium botulinum]
gi|253771383|ref|YP_003034185.1| hypothetical protein CLG_0044 [Clostridium botulinum D str. 1873]
gi|225007491|dbj|BAH29587.1| conserved hypothetical protein [Clostridium botulinum]
gi|253721360|gb|ACT33653.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 306
Score = 42.0 bits (97), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 135/330 (40%), Gaps = 58/330 (17%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++IK G T + VFV+RK+ +S +P+ G D+V
Sbjct: 29 IGVGLGYKIKNGFNTFKKCLSVFVTRKLPCYNISSSNLVPSYYWG-----IPTDIVNTGV 83
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F + K V + GG IG S GTLG IV T S+ LT
Sbjct: 84 FHLQKLNNK-------VRPVPGG-YDIGPAFIWDS----GTLGCIV---TDSKYYYILTC 128
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVER---ATSFHHRRPLTFVRADGAFIPFAD 298
H ++ ++ HP+ L P G R + P+ + + I +
Sbjct: 129 NHTITSKEF--LRLNHPI---LQPSSVYGGRYREDTIATLSKFIPIKYSTSSEEGINYV- 182
Query: 299 DFDMSTVTTSVKGLGEIGDV-KIVDLQSPISSLIGKQVVKVGRSSGLTTGTV--LAYALE 355
D M+ +TT + +I + +I + P +G V KVG ++ LT G + + +
Sbjct: 183 DCAMAKITTRSQISTKINFLGRIKGMAKP---SLGMSVQKVGATTELTKGNITSIGATIV 239
Query: 356 YNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRGRLK 415
+N+++G C D ++ + GDSGS++L E IG++ G+ +
Sbjct: 240 FNEKQGKCIFFDQIITNKMSDF----GDSGSILL-----DENINAIGMLMSGSPTKSTF- 289
Query: 416 LKIGQPPENWTSGVDLGRLLNLLELDLITT 445
P E+ +LN L++ L+T+
Sbjct: 290 ----NPIES---------VLNALDVKLVTS 306
>gi|331269223|ref|YP_004395715.1| hypothetical protein CbC4_1038 [Clostridium botulinum BKT015925]
gi|329125773|gb|AEB75718.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 312
Score = 42.0 bits (97), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 56/124 (45%), Gaps = 22/124 (17%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+RIK+G++T I VF S+KV LSP +P GG+ DVVE
Sbjct: 39 VGVGLGYRIKKGIVTTETCIKVFASKKVPDNELSPDDLIPPVY---GGI--KTDVVESGS 93
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETY-GTLGAIVKSQTGSRQVGFLT 240
F T D +R P++ S S + Y GTLG +V T L+
Sbjct: 94 FKGLSLT----------DRIR---PTLCGYSIGPSAQNYIGTLGCLV---TDGHDKFILS 137
Query: 241 NRHV 244
N HV
Sbjct: 138 NNHV 141
>gi|448319038|ref|ZP_21508546.1| hypothetical protein C492_21210 [Natronococcus jeotgali DSM 18795]
gi|445597027|gb|ELY51106.1| hypothetical protein C492_21210 [Natronococcus jeotgali DSM 18795]
Length = 443
Score = 42.0 bits (97), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 49/86 (56%), Gaps = 13/86 (15%)
Query: 330 LIGKQVVKVGRSSGLTTGTVLA----YALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG 385
L G+ V K GR++G+T+ TV A A+E+ E+G L D L+ G + GDSG
Sbjct: 224 LRGETVTKTGRTTGVTSATVEATSASVAVEFGAERGTVTLRDQLIAGYLSEG----GDSG 279
Query: 386 SLILMKGENGEKPRPIGIIWGGTANR 411
S + + E+GE +G+++ G+A +
Sbjct: 280 SPVFL--EDGEL---VGLLFAGSAQQ 300
>gi|253682406|ref|ZP_04863203.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562118|gb|EES91570.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 317
Score = 42.0 bits (97), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 127/309 (41%), Gaps = 69/309 (22%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G++IK G T+ I VFVS+K+ L+ +P+ +G D+ E
Sbjct: 35 VGIGCGYKIKNGFYTNQLCIQVFVSKKLPLNELNINDLIPSTYKG-----IPTDIKETGG 89
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIG--SGSQVASQETYGTLGAIVKSQTGSRQVGFL 239
F A T K + P+ G S S + E GTLG +VK ++ + L
Sbjct: 90 FTACSLTQKIR-------------PTPGGYSISNEYNNEYSGTLGCLVKD---NKDLFLL 133
Query: 240 TNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFADD 299
+N HV +F+ P LG + + E + T VR I F ++
Sbjct: 134 SNSHVLA--------IFNQAP--LGTKIIEPSNEFGGNPKTDTIATLVRYIK--IRFIEN 181
Query: 300 FDMSTVTTSVKGLGEIGDVKIVD--LQSPISSLIG-----------KQVVKVGRSSGLTT 346
++M T + G KI+D L SP +L G + + KVG S LTT
Sbjct: 182 YNMPFNYT------DCGIAKIIDKSLVSPEIALTGIPKGVSNPKLNQPIKKVGAISELTT 235
Query: 347 GTVLA----YALEYNDEKGICFLTDFLVVGENQQTFDLE-GDSGSLILMKGENGEKPRPI 401
G + + + Y+D K + + +F E GDSG+++L + N I
Sbjct: 236 GVITSIHNTLTVNYHDIKKSAIFKEQIFT-----SFMAEHGDSGAILLDQSNN-----VI 285
Query: 402 GIIWGGTAN 410
G++ G+ N
Sbjct: 286 GLLMSGSKN 294
>gi|332669503|ref|YP_004452511.1| Equine arteritis virus peptidase S32 [Cellulomonas fimi ATCC 484]
gi|332338541|gb|AEE45124.1| Equine arteritis virus peptidase S32 [Cellulomonas fimi ATCC 484]
Length = 618
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 123/311 (39%), Gaps = 49/311 (15%)
Query: 116 ILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVD 175
I R +G IG + G T AI+V V+RK+ L Q +P +++G D
Sbjct: 28 IARPGVVGVDIGEKWSDGRPTGRQAIVVHVARKLDAADLPDDQRIPASIDG-----VPTD 82
Query: 176 VVEFSYFGAPEPTPKEQLYTQI---VDDLRGG------DPSIGSGSQVASQETY-GTLGA 225
VVE E T + T + V L GG DP G+ A T GTLG
Sbjct: 83 VVEHRVVLHQEATVEGTPTTLMRGRVRPLAGGVSIGPVDPVTIQGASSAELRTVNGTLGV 142
Query: 226 IVKSQTGSRQVGFLTNRHVAVD--LDYPNQKMFHPL-PPTLGPGVYLGAVERATSFHHRR 282
+V + R + LTN HVA L+ + P GP +G + R
Sbjct: 143 VVTERHTGRALA-LTNWHVAAGDGLEDVGSRWVQPARADGGGPRDQVGVLVRGA------ 195
Query: 283 PLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSS 342
D D + V G V I + ++ G V K GR++
Sbjct: 196 -------------LTDRIDAALVALVPGARWVPGIVGIGAVTGSADAVDGTLVRKHGRTT 242
Query: 343 GLTTGTVLA----YALEYNDEKGICFLTDFLVV--GENQQTFDLEGDSGSLILMKGENGE 396
GL TG V++ ++++ G L D + + Q +F GDSGS ++ E+G+
Sbjct: 243 GLRTGRVVSTDFTTSVDFGPGIGWRTLRDQIRIEPEPGQTSFSAGGDSGSAVV--DEDGK 300
Query: 397 KPRPIGIIWGG 407
+G++W G
Sbjct: 301 V---VGLLWAG 308
>gi|331269976|ref|YP_004396468.1| hypothetical protein CbC4_1797 [Clostridium botulinum BKT015925]
gi|329126526|gb|AEB76471.1| hypothetical protein CbC4_1797 [Clostridium botulinum BKT015925]
Length = 329
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 79/337 (23%), Positives = 136/337 (40%), Gaps = 67/337 (19%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
+G + GV T I VF+S+K+ + L P +P +G D +E F
Sbjct: 47 AVGLGLNVVNGVCTFQKCIKVFLSKKLPENSLPPSALVPPIYKG-----IITDTIESGTF 101
Query: 183 GAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNR 242
+ QL +++ L G SIG A+Q T GT G +VK + L+
Sbjct: 102 SS------SQLTSRVRPVLEGY--SIGP----AAQNTAGTFGCLVK-DLNDHSINLLSCN 148
Query: 243 HVAVDLDYPNQKMFHPL-PPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFADDFD 301
HV L P+ P L PG+ G + H T R FIP
Sbjct: 149 HVLARLG------LVPIGAPILQPGLLDGG-----NIHTDVIATLSR----FIPIKFKGL 193
Query: 302 MSTVTTSV-KGLGEIGDVKIVD-----LQSPISSL----IGKQVVKVGRSSGLTTGTVLA 351
+S+ T + ++ + +V L++P+ + +G+ V K+GR++G T G ++A
Sbjct: 194 ISSPTNLADAAIAKVSNPSLVSNKLAILKTPLRGVAEPSLGEHVFKIGRTTGSTEGFIVA 253
Query: 352 YALEYNDE--KGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
+ + KG ++ GDSG+++ + N +G+++ T
Sbjct: 254 TDVSQLETYPKGKALFKHQIITSNPSD----PGDSGAILFDEHFNA-----LGLLFMTTD 304
Query: 410 NRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLITTD 446
+ N+TS + +L LL + LIT++
Sbjct: 305 KK------------NFTSFNLISDVLKLLNVSLITSN 329
>gi|297623499|ref|YP_003704933.1| hypothetical protein [Truepera radiovictrix DSM 17093]
gi|297164679|gb|ADI14390.1| conserved hypothetical protein [Truepera radiovictrix DSM 17093]
Length = 323
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 40/82 (48%), Gaps = 11/82 (13%)
Query: 331 IGKQVVKVGRSSGLTTGTVLAYALEYNDEK----GICFLTDFLVVGENQQTFDLEGDSGS 386
+G++V KVGR+SGLT GTV A F ++ G N TF GDSGS
Sbjct: 227 VGQRVFKVGRTSGLTFGTVSAVGARVPRVAYGFGSAAFEGSVIIEGLNGSTFSAPGDSGS 286
Query: 387 LIL-MKGENGEKPRPIGIIWGG 407
I +KG R +G ++ G
Sbjct: 287 GIYDLKG------RLVGFLYAG 302
>gi|331268643|ref|YP_004395135.1| hypothetical protein CbC4_0458 [Clostridium botulinum BKT015925]
gi|329125193|gb|AEB75138.1| hypothetical protein CbC4_0458 [Clostridium botulinum BKT015925]
Length = 273
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 120/295 (40%), Gaps = 58/295 (19%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++ G T I FV+ K+ ++ +PT +G DVVE S
Sbjct: 2 IGIGMGYKETNGFCTCQKCITTFVTNKIKSNRINSKDLIPTFYKG-----ILTDVVEMSI 56
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
P+ T+ + + GG SIG A GT G +V ++ LT
Sbjct: 57 -------PRTCSLTKRIRPVLGG-YSIGVDGLKA-----GTTGCLVAD---NKHDYILTC 100
Query: 242 RHVAV--DLDYPNQKMFHPLPPTLG--PGVYLGAVERATSFHHRRPLTFVRADGAFIPFA 297
HV ++ N+ + P P G P +G V + + R +V D A +
Sbjct: 101 NHVVAGNTIEKVNKVVVQPAPKFGGKVPKDAVGLVRKFVPVNVRGEFNYV--DAAIVQ-- 156
Query: 298 DDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYALEYN 357
D S++ + +G +K + IG++V KVG ++ LTTG V
Sbjct: 157 TDRSKSSI-----AIAYVGPIKGTNFTK-----IGQKVKKVGATTELTTGIV-------- 198
Query: 358 DEKGICFLTDFL---VVGENQQT---FDLEGDSGSLILMKGENGEKPRPIGIIWG 406
K + DFL V +NQ T +GDSGS++L +K +G++ G
Sbjct: 199 KTKFTVIIIDFLGRQVTFKNQTTTTKMSDDGDSGSILL-----NDKNEALGMLMG 248
>gi|225166828|ref|YP_002650813.1| conserved hypothetical protein [Clostridium botulinum]
gi|253771431|ref|YP_003034186.1| hypothetical protein CLG_0045 [Clostridium botulinum D str. 1873]
gi|225007492|dbj|BAH29588.1| conserved hypothetical protein [Clostridium botulinum]
gi|253721408|gb|ACT33701.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 306
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 81/341 (23%), Positives = 133/341 (39%), Gaps = 82/341 (24%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDV---DVVE 178
+G +G++IK G T + VFV+ K LP +CD+ D+V
Sbjct: 29 VGVGLGYKIKNGFNTFQKCLSVFVTNK-----------LP---------FCDIPSNDMVP 68
Query: 179 FSYFGAPEPTPKE-----QLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGS 233
Y+G P Q TQ + + GG IG V GTLG IV T
Sbjct: 69 SYYYGIPTDVINTGAFHLQKLTQKIRPVPGG-YDIGPALIVEG----GTLGCIV---TDG 120
Query: 234 RQVGFLTNRHV-----AVDLDYPNQKMFHPLPPTLGPGVY-LGAVERATSFHHRRPLTFV 287
+ LT H V + YP + P + G Y + R + + T
Sbjct: 121 KYYHILTCNHSLTAKEVVTVTYPITQ-----PSCVYGGNYPEDIIARISKYIPINNSTTT 175
Query: 288 RADGAFI--PFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLT 345
+ ++ A S ++T + LG I + + +G V KVG ++ LT
Sbjct: 176 NENINYVDCAIAKINKRSQISTKINFLGRI--------KGMTKASLGLNVQKVGANTELT 227
Query: 346 TGTV--LAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGI 403
GTV + LE+N+ +G D ++ + + EGDSGS+++ K + +G+
Sbjct: 228 EGTVTSVGATLEFNEPQGKFIFVDQIITNKMSE----EGDSGSILVDK-----NIQAVGM 278
Query: 404 IWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLIT 444
+ GG + + ++ +LN L + L+T
Sbjct: 279 LMGGGSTKSVFN--------------NIENVLNALSVKLVT 305
>gi|134096198|ref|YP_001101273.1| hypothetical protein HEAR3043 [Herminiimonas arsenicoxydans]
gi|133740101|emb|CAL63152.1| Conserved hypothetical protein [Herminiimonas arsenicoxydans]
Length = 359
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 98/235 (41%), Gaps = 42/235 (17%)
Query: 201 LRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN--QKMFHP 258
+ G + GS A GTLG +V+ +G + LTN HV+ +Y + +K+ P
Sbjct: 120 IHNGRYACGSSIHPAKVLGAGTLGCLVRDPSG--DIFALTNNHVSGMCNYASNGEKIIAP 177
Query: 259 LPPTLGPGVYLGAVERATSFHHRRPLTFVRA-----------DGAFIPFADDFDMSTVTT 307
P + ++ T +H R L V D A + +D S +
Sbjct: 178 G----HPDIIANGIDPFTIGYHSRSLPMVHGLPDNVDIATNNDAALLKLSD----SNLVC 229
Query: 308 SVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA-----YALEYNDE--- 359
S++G ++Q+ G V KVGR++GLT G ++ + + Y+
Sbjct: 230 SMQGQSYDTPSLTFEMQA------GFSVQKVGRTTGLTHGQIIGEIIAPHPVSYSVPGFG 283
Query: 360 KGICFLTDFLVVGEN--QQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTANRG 412
+ F + N F GDSGSL+ + NG++ IGI++ G N+G
Sbjct: 284 NHVSFFERVFAIHSNDPDTPFSQPGDSGSLVTTE-MNGDR-YAIGIVFAGN-NQG 335
>gi|402772295|ref|YP_006591832.1| protease [Methylocystis sp. SC2]
gi|401774315|emb|CCJ07181.1| Putative protease [Methylocystis sp. SC2]
Length = 495
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 80/186 (43%), Gaps = 20/186 (10%)
Query: 237 GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFH------HRRPLTFVRAD 290
GF+TN H + N FH L G +G + + R F +D
Sbjct: 236 GFITNSHCTKNRGVSNDDDFHQPNDPLLSGNKIGDEDADPPYFTGGQCPSGRKCRF--SD 293
Query: 291 GAFIPFADD---FDMSTVTTSVKGL--GEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLT 345
A+ + D F+++ T +V L V + ++P S++G ++ KVGR++G
Sbjct: 294 SAYADYRIDRGRFEIARTTNNVGSLTINSFPGVFRIMSETP-DSMVGMRLNKVGRTTGWA 352
Query: 346 TGTVLAYALEYN----DEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPI 401
G V A ++ N D + +C + V G N+ T + GDSGS + +
Sbjct: 353 FGDVRATCIDVNVADTDVRLLCQSSVARVSGTNKLTDN--GDSGSPVFSILPTASQASLH 410
Query: 402 GIIWGG 407
GI+WGG
Sbjct: 411 GILWGG 416
>gi|253771306|ref|YP_003034118.1| hypothetical protein CLG_A0024 [Clostridium botulinum D str. 1873]
gi|253721458|gb|ACT33750.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 311
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 69/283 (24%), Positives = 116/283 (40%), Gaps = 55/283 (19%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G++IK G T+ +I VFVSRK L+ +P +G DV E Y
Sbjct: 33 VGIGCGYKIKGGFYTNQLSIQVFVSRKFSMNELNSNDIIPLTYKG-----MLTDVKETGY 87
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F A K++ G +IG+ + E GT G +V + SR V L+
Sbjct: 88 FRACSLNKKKRPVI--------GGYNIGTN---MNNEISGTAGCLVTNGV-SRFV--LST 133
Query: 242 RHVAVDLDYPNQKMFHPLP---PTLGPGVYLGA---VERATSFHHRRPLTFVRADGAFIP 295
HV +++ LP P + P G + + H PL ++ + I
Sbjct: 134 NHVLANIN--------KLPIKTPIIQPSYIHGGYTPTDTIATLHKFIPLRLIKEEEQPIN 185
Query: 296 FAD-DFDMST----VTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
D + T ++ ++ +G++ VK S +G V KVG ++ LT G ++
Sbjct: 186 LTDCALGLLTKPNIMSDNIAFIGKVNCVK--------SPKLGSHVRKVGSTTELTEGVIV 237
Query: 351 A----YALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
+ ++ Y D K F L ++ GDSG++++
Sbjct: 238 SINSVMSVTYWDGKRAFFEDQILTTHMARK-----GDSGAILV 275
>gi|331269605|ref|YP_004396097.1| hypothetical protein CbC4_1421 [Clostridium botulinum BKT015925]
gi|329126155|gb|AEB76100.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 311
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 62/249 (24%), Positives = 95/249 (38%), Gaps = 45/249 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +GF+ +G T I VF S KV L P Q +P +G +EF+
Sbjct: 38 IGIGLGFKSIKGSNTSQKCIKVFTSEKVDNGELPPAQLVPAIYKGIRTDVVQSGNIEFTG 97
Query: 182 FGAPE-PTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
+ P P G SIG + + GT+G +V T V L
Sbjct: 98 LTQKKRPAP--------------GGYSIGPPLKTQT----GTMGCLV---TDGSDVFILG 136
Query: 241 NRHVAVDLDYPNQKMFHPL-PPTLGPGVYLGA---VERATSFHHRRPLTFVRADGAF-IP 295
N HV DL+ F P+ P + PG G + P+ F + +
Sbjct: 137 NNHVLADLN------FLPIGTPIMQPGPDDGGKANTDVIAKLTKYIPIKFHKKENYVDAA 190
Query: 296 FADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA---- 351
A D V+ S+ +G I + +L+ + KVGR++ T G + A
Sbjct: 191 IAKVIDKKLVSASIAFIGNIKGIGKPNLEEGVK--------KVGRTTEFTVGKISAIYAT 242
Query: 352 YALEYNDEK 360
Y L+YN ++
Sbjct: 243 YVLKYNSKE 251
>gi|253681776|ref|ZP_04862573.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253561488|gb|EES90940.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 321
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 111/283 (39%), Gaps = 52/283 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++ G+ T I VFV+ K+ K L + +P EG DVV
Sbjct: 39 VGVGLGYKDIDGICTYEECIKVFVTEKISKNELPAKEIVPAVYEGI-----KTDVV---- 89
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
+ + L +++ L G I G+ T GTLGA+VK + + L +
Sbjct: 90 --TGGVSTECNLVSRVRPVLCGYAMGISDGA--TKSVTTGTLGALVKDK---ENIYILGS 142
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRR---PLTFVRADGAFIPFAD 298
HV N+ + P + P ++ G V + + PL ++ + + D
Sbjct: 143 GHVL-----TNENLVPLGTPIIQPSIHFGGVISKDTIAYLSKYIPLRYISSTAIPENYVD 197
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPI---------SSLIGKQVVKVGRSSGLTTGTV 349
+G++ + +V + I S+ + VVKVG SG TTGTV
Sbjct: 198 C-----------AIGKVLSISLVTPKIAILNSLPLGVSSAKLKDTVVKVGAISGYTTGTV 246
Query: 350 LAY---ALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
A + + F L +Q+ GDSGSL+L
Sbjct: 247 EAVNATIWAHYSSGQVLFKNQILTTLMSQK-----GDSGSLLL 284
>gi|448637439|ref|ZP_21675677.1| hypothetical protein C436_02871 [Haloarcula sinaiiensis ATCC 33800]
gi|445764286|gb|EMA15441.1| hypothetical protein C436_02871 [Haloarcula sinaiiensis ATCC 33800]
Length = 429
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 116/300 (38%), Gaps = 47/300 (15%)
Query: 123 GTAIGFRIKRGVL-TDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVW-------CDV 174
GT IG + + G + + +++VFV RKV + L + +P +E G + ++
Sbjct: 24 GTGIGPKQRAGEMDEEAESVIVFVERKVAEADLDDNEVIPEEIEIDGKTYKTDVQESGEI 83
Query: 175 DVVEFSYFGAPEPTPKEQLYTQIVDDL-------RGGDPSIGSGSQVASQETYGTLGAIV 227
+E P E + ++ R P+ S T GTLG
Sbjct: 84 KALELELTAPEAPMELEGRDRAEIKEIPASLSRTRRWRPAPAGVSVGHPDITAGTLGT-Q 142
Query: 228 KSQTGSRQVGFLTNRHVAVDLDYPNQ--KMFHPLP------PTLGPGVYLG--AVERATS 277
+T ++ FLTN HVA D N+ + P P P G LG ++ TS
Sbjct: 143 PLRTQDEKLVFLTNSHVAADSGRANRGDMVLQPGPYDGGTAPDDEIGSLLGFNVIDADTS 202
Query: 278 FHHRRPLTFVRADGAFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVK 337
P R D A + D + T + L E DL+ + +G K
Sbjct: 203 ----SPFPKNRTDSAIVEVTPDH----LQTDIWELHE-------DLRGFTDAEVGAIHTK 247
Query: 338 VGRSSGLTTGTVLAYALEYNDE--KGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENG 395
GR++G+T A +N G+ + D V + GDSGSLI M+ E+G
Sbjct: 248 SGRTTGVTQAKCTARHANFNVRYSHGVAKMVDCDVFNAMAKG----GDSGSLIGMEREDG 303
>gi|331270967|ref|YP_004385678.1| hypothetical protein CbC4_4103 [Clostridium botulinum BKT015925]
gi|329127359|gb|AEB77303.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 318
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 58/238 (24%), Positives = 97/238 (40%), Gaps = 44/238 (18%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G+++K G T+ I VFVSRK + LS +P +G DV E +
Sbjct: 34 VGVGCGYKVKNGFYTNQLCIQVFVSRKFAQNQLSSNDMVPLMYKGI-----QTDVKETGH 88
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGS---GSQVASQETYGTLGAIVKSQTGSRQVGF 238
F A T K + P++G G++ + + GTLG +V T + +
Sbjct: 89 FTACSLTEKIR-------------PTLGGYIIGNEYDTVHS-GTLGCLV---TDGKNLFI 131
Query: 239 LTNRHVAVDLDYP--NQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRA--DGAFI 294
L+N HV ++ K+ P G V + F + ++A + A
Sbjct: 132 LSNNHVLASTNFAPLGNKIIQP-SYAFGGDFKTDVVAILSKFIPIKFEGIIKAPSNYADC 190
Query: 295 PFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLI---GKQVVKVGRSSGLTTGTV 349
A + S VTT + +G +P +++ ++V KVG + LTTG +
Sbjct: 191 AIAKVINKSLVTTQIAFIG-----------TPNGTIVPRLNQEVKKVGFKTELTTGKI 237
>gi|134297959|ref|YP_001111455.1| hypothetical protein Dred_0080 [Desulfotomaculum reducens MI-1]
gi|134050659|gb|ABO48630.1| conserved hypothetical protein [Desulfotomaculum reducens MI-1]
Length = 336
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 71/324 (21%), Positives = 125/324 (38%), Gaps = 67/324 (20%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++ T AI++FV++K LS + +P + G + DV+E
Sbjct: 22 VGVGVGYKHVGMERTQQKAIIIFVTKKEDLGNLSREELVPFKING-----LETDVIEVGD 76
Query: 182 FGAPEPTPKEQL------------------YTQIVDDLRGGDPSIGSGSQVASQETYGTL 223
E K+ + + +V D G+P I S + + + T G
Sbjct: 77 IRFLEEDRKKHVRPAQPGMSVGHYRVTAGTFGAMVRDRSTGEPLILSNNHILANGTDGKD 136
Query: 224 GAIVKS----QTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFH 279
G Q G G +R + P QK P + GV A +
Sbjct: 137 GRSAPGDLIFQPGEYDGGTKADRIATLIRFIPIQKGEAPASCPIANGVARIANMLVHTIR 196
Query: 280 HRRPLTFVRADGAFIPFADDFDMST--------VTTSVKGLGEIGDVKIVDLQSPISSLI 331
L F + +G A+ D + ++ + G+G++ Q I +
Sbjct: 197 PNYDLKFFKREGV----ANHVDCAVARPLSPDLISDEILGIGKV--------QGIIDAKP 244
Query: 332 GKQVVKVGRSSGLTTGTVLAYA----LEYNDEKGICFLTDFLVVGENQQTFDLE---GDS 384
G +V K GR++G+T+G V A ++ +D F NQ D++ GDS
Sbjct: 245 GMKVKKSGRTTGITSGVVTAIGTTMQVKMDDNNNAYF--------SNQVICDMKSQGGDS 296
Query: 385 GSLILMKGENGEKPRPIGIIWGGT 408
GSL+L +G + +G+++ G+
Sbjct: 297 GSLVLTEGN-----KAVGLLFAGS 315
>gi|416347989|ref|ZP_11680104.1| hypothetical protein CBCST_00400 [Clostridium botulinum C str.
Stockholm]
gi|338197134|gb|EGO89308.1| hypothetical protein CBCST_00400 [Clostridium botulinum C str.
Stockholm]
Length = 306
Score = 40.0 bits (92), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 80/341 (23%), Positives = 132/341 (38%), Gaps = 82/341 (24%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDV---DVVE 178
+G +G++IK G T + VFV+ K LP +CD+ D+V
Sbjct: 29 VGVGLGYKIKNGFNTFQKCLSVFVTNK-----------LP---------FCDIPSNDMVP 68
Query: 179 FSYFGAPEPTPKE-----QLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGS 233
Y+G P Q TQ + + GG IG V GTLG IV T
Sbjct: 69 SYYYGIPTDVINTGAFHLQKLTQKIRPVPGG-YDIGPALIVEG----GTLGCIV---TDG 120
Query: 234 RQVGFLTNRHV-----AVDLDYPNQKMFHPLPPTLGPGVY-LGAVERATSFHHRRPLTFV 287
+ LT H V + YP + P + G Y + R + + T
Sbjct: 121 KYYHILTCNHSLTAKEVVTVTYPITQ-----PSCVYGGNYPEDIIARISKYIPINNSTTT 175
Query: 288 RADGAFI--PFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLT 345
+ ++ A S ++T + LG I + L G V KVG ++ LT
Sbjct: 176 NENINYVDCAIAKINKRSQISTKINFLGRIKGITKASL--------GLNVQKVGANTELT 227
Query: 346 TGTV--LAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGI 403
GTV + LE+N+ +G D ++ + + +GDSG++++ K + +G+
Sbjct: 228 EGTVTSVGATLEFNEPRGKSIFVDQIITNKMSE----KGDSGAILVDK-----NIQAVGL 278
Query: 404 IWGGTANRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLIT 444
+ GG + + ++ +LN L + L+T
Sbjct: 279 LMGGGSTKSVFN--------------NIENVLNALSVKLVT 305
>gi|253681904|ref|ZP_04862701.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253561616|gb|EES91068.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 317
Score = 40.0 bits (92), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 69/305 (22%), Positives = 118/305 (38%), Gaps = 59/305 (19%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G++IK G T+ I VFV +K+ L+ +P+ +G D+ E
Sbjct: 35 VGIGCGYKIKNGFYTNQLCIQVFVRKKIPLNELNINDLIPSTYKG-----IPTDIKETGG 89
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSG--SQVASQETYGTLGAIVKSQTGSRQVGFL 239
F A T K + P+ G S + + GTLG +VK ++ + L
Sbjct: 90 FTACSLTQKIR-------------PTPGGYIISNKYNTDYSGTLGCLVKD---NKHLFLL 133
Query: 240 TNRHVAVDLDYPN--QKMFHPLPPTLGPGVYLG--AVERATSFHHRRPLTFVRADGAFIP 295
+N HV ++ + K+ P G + G + + L F+ G
Sbjct: 134 SNNHVLAMMNKLSLGTKIIQP------SGDFGGDSKTDTIATLSKYIELKFIEGRGIHFN 187
Query: 296 FAD-----DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL 350
+ D D S V+ + +G + + L P+ KVG S LTTG +
Sbjct: 188 YTDCAIAKIIDKSLVSPEIALVGILKGISNPKLNQPVK--------KVGAISELTTGVIT 239
Query: 351 AYA----LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWG 406
+ + ++Y+ + +V + D GDSGS++L E IG++
Sbjct: 240 SISSTLTVDYDTINKSAIFKEQVVTTK----MDESGDSGSILL-----DENNHAIGLLMS 290
Query: 407 GTANR 411
G+ N
Sbjct: 291 GSKNN 295
>gi|331269490|ref|YP_004395982.1| hypothetical protein CbC4_1305 [Clostridium botulinum BKT015925]
gi|329126040|gb|AEB75985.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 307
Score = 40.0 bits (92), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 86/335 (25%), Positives = 133/335 (39%), Gaps = 70/335 (20%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++IK G T I+VFVS+KV L+ +P +G DV+E
Sbjct: 30 VGVGLGYKIKCGFETSQKCIMVFVSQKVPSNSLNSNDIIPDVYKGI-----VTDVLESGC 84
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F K Q T+ V GG SIGS + V T G L T + LT+
Sbjct: 85 F-------KTQSLTKKVRPTMGG-YSIGS-TTVGEASTLGCL------VTDGKYKYILTS 129
Query: 242 RHVAVDLDYP-NQKMFHPLPPTLG--PGVYLGAVE-----RATSFHHRRPLTFVRADGAF 293
H V ++ K+ P P G P +G + + T+F H P V
Sbjct: 130 NHGIVKDEFAIGTKVLQPAIPDGGKVPQDVVGTISKFIPVKNTTFFH-EPKNVVDCAAVI 188
Query: 294 IPFADDFDMSTVTTSVKGLGE--IGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA 351
+ S V+ + G+ +G V +L+S + KVGR++ T G VL+
Sbjct: 189 V-----LQESLVSPLIYGINTPPLG-VANGELKSTVH--------KVGRTTEKTLGKVLS 234
Query: 352 Y--ALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTA 409
+E D+ +V E +GDSGS++L +G IG++ GG+
Sbjct: 235 INAVMELEDQGKKNIYKKQIVTTEMCS----DGDSGSILLNQGN-----YAIGLVVGGS- 284
Query: 410 NRGRLKLKIGQPPENWTSGVDLGRLLNLLELDLIT 444
+ +T + +L L L L+T
Sbjct: 285 -------------DTYTICNTMSNVLTALNLKLVT 306
>gi|401662288|emb|CCG27838.1| putative serine protease [Aeropyrum spring-shaped virus]
Length = 326
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 76/182 (41%), Gaps = 22/182 (12%)
Query: 94 LPKGQQATTLLELMTIRAFHSKILRCYSLGTAIGFRIKRGVLTDIPAILVFVSRKVHKQW 153
+P+ + +LE + YS I RI+RG + D P I V+V +K+ +
Sbjct: 1 MPRKEVVAHILEKRRSELLSKPNVVGYS--NVIQKRIRRGRVVDEPVIRVYVKKKLPRNL 58
Query: 154 LSPIQCLPTALEGPGGVWCD-VDVVEFSYFGAPEPTPKEQ-LYTQIVDDLRGGDPSIGSG 211
L P +P +E G+ D V++ E + +P LYT P I
Sbjct: 59 LRPQDLVPEEVE---GIRTDVVEIGEVEAWALLQPRAAASPLYTGRY------RPVIAGV 109
Query: 212 SQVASQETYGTLGAIVKSQTGSRQVGFLTNRHVAVDLDYPN---QKMFHPLPPTLGPGVY 268
S Q T GTLG VK+ ++ F +N HV PN Q+ + P L PG Y
Sbjct: 110 SIGHYQITAGTLGWYVKAPNA--EILFASNAHVFT----PNASGQEGQYEGDPILQPGPY 163
Query: 269 LG 270
G
Sbjct: 164 DG 165
>gi|119195329|ref|XP_001248268.1| predicted protein [Coccidioides immitis RS]
Length = 640
Score = 39.7 bits (91), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 7/77 (9%)
Query: 332 GKQVVKVGRSSGLTTGTV--LAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
G +VVK+GRSS T G + + L + + + L ++ VV + F GDSGS +L
Sbjct: 524 GSRVVKIGRSSDYTVGYLNGVESYLTFRNTQLEVTLAEWAVVAASTHPFCARGDSGSFVL 583
Query: 390 MKGENGEKPRPIGIIWG 406
++ IG++WG
Sbjct: 584 NDADD-----LIGLLWG 595
>gi|392862500|gb|EAS36850.2| hypothetical protein CIMG_02039 [Coccidioides immitis RS]
Length = 513
Score = 39.7 bits (91), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 7/77 (9%)
Query: 332 GKQVVKVGRSSGLTTGTV--LAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
G +VVK+GRSS T G + + L + + + L ++ VV + F GDSGS +L
Sbjct: 393 GSRVVKIGRSSDYTVGYLNGVESYLTFRNTQLEVTLAEWAVVAASTHPFCARGDSGSFVL 452
Query: 390 MKGENGEKPRPIGIIWG 406
++ IG++WG
Sbjct: 453 NDADD-----LIGLLWG 464
>gi|416359011|ref|ZP_11682291.1| hypothetical protein CBCST_14284 [Clostridium botulinum C str.
Stockholm]
gi|338194656|gb|EGO87063.1| hypothetical protein CBCST_14284 [Clostridium botulinum C str.
Stockholm]
Length = 314
Score = 39.3 bits (90), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 117/300 (39%), Gaps = 57/300 (19%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G++ G T+ I V V++KV LSP + +P +G D+ E
Sbjct: 36 VGIGLGYKTSGGFRTNEKCINVLVTKKVPSYDLSPNEVIPKWYKG-----IKTDIYESGS 90
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQ-ETYGTLGAIVKSQTGSRQVGFLT 240
F K L V P++G S S + YGT+ IVK + + L+
Sbjct: 91 F-------KSHLLNSRV------RPALGGYSISPSTLKQYGTMACIVKDNLSNYFL--LS 135
Query: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFADDF 300
HV +L+ P L G + S + PL F + + + D
Sbjct: 136 CNHVIANLNKVQLGTSIVQPSVLDNGK--SPTDSIGSLYKFIPLKFNTSTHLSVNYVDAA 193
Query: 301 -----DMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLAYA-- 353
D S V+ + LG+ + PI+ + V K GR++ +T G V
Sbjct: 194 LAIISDKSLVSNKIYILGKPNN--------PITPSLDLSVRKAGRTTNVTYGYVKLLGST 245
Query: 354 --LEYNDEKGICFLTDFLVVGENQQTFDL---EGDSGSLILMKGENGEKPRPIGIIWGGT 408
L + + G+ +NQ L GDSG+L LM EN PIG++ GG+
Sbjct: 246 VNLSFGSKSGLF---------KNQILTTLMSDTGDSGAL-LMDLEN----NPIGLVIGGS 291
>gi|253682421|ref|ZP_04863218.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562133|gb|EES91585.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 309
Score = 39.3 bits (90), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 115/300 (38%), Gaps = 62/300 (20%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++ +G T I VF +KV + + +P +G +EFS
Sbjct: 36 VGIGLGYKLTKGFNTSQKCIKVFARKKVGNGEIPEAELVPPIYKGIKTDVVQSGNIEFSK 95
Query: 182 FGAPE-PTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 240
+ P P G SIG + + GT+G +V T + L
Sbjct: 96 LSEKKRPVP--------------GGYSIG----IPLETQTGTMGCLV---TDGSDIFVLG 134
Query: 241 NRHVAVDLDYPNQKMFHPL-PPTLGPGVYLGA---VERATSFHHRRPLTFVRADGAFIPF 296
N HV D++ F PL P + PG G + P+ F + +
Sbjct: 135 NNHVLSDMN------FVPLGTPVMQPGPEDGGKVNTDTIAKLAKYVPIKFNKKE------ 182
Query: 297 ADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA----Y 352
+ D + S K L G I L+ + + V KVGR++ LT G + A Y
Sbjct: 183 -NYVDAAIAKVSDKKLVSAGIAFIGYLKGIGKPNLEEGVKKVGRTTDLTVGKISAVYATY 241
Query: 353 ALEYNDEKGICFLTDFLVVGENQQTFDLE----GDSGSLILMKGENGEKPRPIGIIWGGT 408
L+YND K + F Q F + GDSG++++ K IG++ G+
Sbjct: 242 VLKYND-KDVLF---------KDQIFTTDMADYGDSGAILV-----DYKNYAIGLLMAGS 286
>gi|416354542|ref|ZP_11681680.1| hypothetical protein CBCST_10351 [Clostridium botulinum C str.
Stockholm]
gi|338195387|gb|EGO87676.1| hypothetical protein CBCST_10351 [Clostridium botulinum C str.
Stockholm]
Length = 331
Score = 39.3 bits (90), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 118/281 (41%), Gaps = 50/281 (17%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K+G T I V+V+RK+ + ++ +P +G DV+E
Sbjct: 35 VGIGLGYKMKKGFYTSQLCIQVYVTRKLTRNIINSQNLVPDMYKG-----ILTDVIETGI 89
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQ---ETYGTLGAIVKSQTGSRQVGF 238
F + T K + P++G G + ++ ++ GTLG +V T + +
Sbjct: 90 FKSNSLTGKVR-------------PTLG-GYIIGNEYKLDSGGTLGCLV---TDGKDLFI 132
Query: 239 LTNRHVAVDLDYP--NQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVR-----ADG 291
L+N HV + K+ P G + V + F ++P+ R AD
Sbjct: 133 LSNNHVLASNNAAPIGTKIIQPSYDD-GGSLKTDVVAILSKFVPKKPMETFRNPTNYADC 191
Query: 292 AFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA 351
A + +++ ++ GL Q PI + + + V KVG + LTTG ++
Sbjct: 192 AIAKIINK-SLASPKIALVGLP----------QEPIIAKLNQSVKKVGAVTELTTGIIIG 240
Query: 352 YALEYNDEKGICFLTDFLVVGENQ---QTFDLEGDSGSLIL 389
+ K F T + +NQ + GDSG+L+L
Sbjct: 241 INVT---AKMNSFSTGKTFLFKNQIATSSMSDGGDSGALLL 278
>gi|253682179|ref|ZP_04862976.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253561891|gb|EES91343.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 311
Score = 38.9 bits (89), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 74/280 (26%), Positives = 115/280 (41%), Gaps = 56/280 (20%)
Query: 123 GTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYF 182
G A+G++ G+ T++ I VFV K+ L P +P +G C DV E F
Sbjct: 38 GIALGYKEVNGINTNMKCITVFVEEKLPLNELKPFDQIPKYYKG----IC-TDVFESGAF 92
Query: 183 GAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQE--TYGTLGAIVKSQTGSRQVGFLT 240
Y Q ++ + P++G G ++++E GTLG +V T + L
Sbjct: 93 -----------YVQSLN--KKIRPTLG-GYSISNEEFSRTGTLGCLV---TDGKYKYILG 135
Query: 241 NRHVAVDLDYPNQKMF--HPLPPTLGPGVYLGAVERATSFHHRRPLTFVRADGAFIPFAD 298
N H+ L N+ L P+ G G LG V + PL F + F+
Sbjct: 136 NNHI---LASSNKAKIGSSILQPSKGDGGVLG-VSTVATLSKFIPLDF-QGKNNFV---- 186
Query: 299 DFDMSTVTT------SVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVL-- 350
D ++ VT+ ++ +G + VK L P V+KVGR+S LT G +
Sbjct: 187 DSAIAKVTSPNIALPNIALVGPLKGVKDASLSQP--------VMKVGRTSELTKGRISQM 238
Query: 351 -AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIL 389
A L + D ++ EGDSGS++L
Sbjct: 239 HAVMLLKASSTMKYIMIDQIITDRMSD----EGDSGSILL 274
>gi|225166799|ref|YP_002650784.1| conserved hypothetical protein [Clostridium botulinum]
gi|253771329|ref|YP_003034155.1| hypothetical protein CLG_0014 [Clostridium botulinum D str. 1873]
gi|225007463|dbj|BAH29559.1| conserved hypothetical protein [Clostridium botulinum]
gi|253721306|gb|ACT33599.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 314
Score = 38.9 bits (89), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 116/297 (39%), Gaps = 47/297 (15%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G G++IK+G T+ I VFVS+K + L+ +P +G DV E Y
Sbjct: 37 VGVGCGYKIKKGFYTNQLCIQVFVSKKCPENQLNSNDMIPLMYKG-----IPTDVKETGY 91
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F KE+ + G G G E GT G +V S SR V L
Sbjct: 92 FSPCSFNIKER-------PVPG-----GYGISANMSEIIGTAGCVV-SNGVSRFV--LGT 136
Query: 242 RHVAVDLDYPNQKMFHPLPPTLGPGVY---LGAVERATSFHHRRPLTFVRADGAFIPFAD 298
HV +++ M P + P + + + PL F++ + I
Sbjct: 137 NHVLANIN-----MLPMKTPIVQPDYAHDGYAPTDTIATLYKYIPLRFIKGEDQPINLT- 190
Query: 299 DFDMSTVTTSVKGLGEIGDV-KIVDLQSPISSLIGKQVVKVGR----SSGLTTGTVLAYA 353
D + +T S +I + K+ ++SP + V KVG + G T T
Sbjct: 191 DCAIGLLTNSNIMSNKIAFIGKVSHIKSP---KLNASVKKVGTITEFTRGFITSTSSVVV 247
Query: 354 LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILMKGENGEKPRPIGIIWGGTAN 410
+ YN+ K F D + Q +GDSG++++ + +GI+ G + N
Sbjct: 248 INYNNGKR-AFFKDQIFTTYMAQ----KGDSGAILV-----DDNNFALGILCGYSPN 294
>gi|253681630|ref|ZP_04862427.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253561342|gb|EES90794.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 322
Score = 38.9 bits (89), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 117/281 (41%), Gaps = 50/281 (17%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
+G +G+++K+G T I V+V+RK+ + + +P +G DV+E
Sbjct: 35 VGIGLGYKMKKGFYTSQLCIQVYVTRKLTRNIIDSQNLVPNMYKG-----ILTDVIETGI 89
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQ---ETYGTLGAIVKSQTGSRQVGF 238
F + T K + P++G G + ++ ++ GTLG +V T + +
Sbjct: 90 FKSNSLTGKVR-------------PTLG-GYIIGNEYKLDSGGTLGCLV---TDGKDLFI 132
Query: 239 LTNRHVAVDLDYP--NQKMFHPLPPTLGPGVYLGAVERATSFHHRRPLTFVR-----ADG 291
L+N HV + K+ P G + V + F ++P+ R AD
Sbjct: 133 LSNNHVLASNNAAPIGTKIIQPSYDD-GGSLKTDVVAILSKFVPKKPMETFRNPTNYADC 191
Query: 292 AFIPFADDFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVVKVGRSSGLTTGTVLA 351
A + +++ ++ GL Q PI + + + V KVG + LTTG ++
Sbjct: 192 AIAKIINK-SLASPKIALVGLP----------QEPIIAKLNQSVKKVGAVTELTTGIIIG 240
Query: 352 YALEYNDEKGICFLTDFLVVGENQ---QTFDLEGDSGSLIL 389
+ K F T + +NQ + GDSG+L+L
Sbjct: 241 INVT---AKMNSFSTGKTFLFKNQIATSSMSDGGDSGALLL 278
>gi|253681159|ref|ZP_04861962.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253563008|gb|EES92454.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 312
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 126/296 (42%), Gaps = 43/296 (14%)
Query: 122 LGTAIGFRIKRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSY 181
LG +G+++K G T I VFV+ K+ + LS +P+ +G DV E Y
Sbjct: 29 LGVGLGYKVKNGFSTCQKCIKVFVTTKLSQNQLSCQDLIPSQYKG-----ILTDVTEVGY 83
Query: 182 FGAPEPTPKEQLYTQIVDDLRGGDPSIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLTN 241
F K QL + V + G SIG + + G++G +VK + Q L++
Sbjct: 84 F-------KFQLLNRKVRPIICG-YSIGP-NVTEYYKNVGSIGCLVKDK--ENQEYLLSS 132
Query: 242 RHVAVDLDYPNQKMFHPL-PPTLGPGVYLGAVE--RATSFHHRRPLTFVRADGAFIPFAD 298
HV L+ PL + P +Y +E PL + + F+ ++
Sbjct: 133 AHVITALNKI------PLGTDVVQPSLYDMGMEGGEIGKLSKYIPL---KQEEIFLKTSN 183
Query: 299 DFDMSTVTTSVKGLGEIGDVKIVDLQSPISSLIGKQVV-KVGRSSGLTTGTVLAYALE-- 355
D + + + G + D+ + + + + K VV KVGR+S T+G V A +
Sbjct: 184 FVDAAIIKLN-SGEAALSDIAFLGKPTGVDTAALKDVVFKVGRTSEETSGIVTAINVTCK 242
Query: 356 --YNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLILMKGENGEKPRPIGIIWGGT 408
+ND K L ++ + T +GDSG+ +L + + +G++ G T
Sbjct: 243 IPFNDGKK---LNKYIFKNQIMTTKMSSDGDSGASLLKSNK-----KVVGLLIGST 290
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.134 0.399
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,870,222,191
Number of Sequences: 23463169
Number of extensions: 450421163
Number of successful extensions: 973408
Number of sequences better than 100.0: 198
Number of HSP's better than 100.0 without gapping: 73
Number of HSP's successfully gapped in prelim test: 125
Number of HSP's that attempted gapping in prelim test: 973011
Number of HSP's gapped (non-prelim): 229
length of query: 594
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 446
effective length of database: 8,886,646,355
effective search space: 3963444274330
effective search space used: 3963444274330
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)