BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007435
(604 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224136616|ref|XP_002322374.1| predicted protein [Populus trichocarpa]
gi|222869370|gb|EEF06501.1| predicted protein [Populus trichocarpa]
Length = 594
Score = 1016 bits (2626), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/595 (85%), Positives = 542/595 (91%), Gaps = 1/595 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M++NR LR +SGSSQSEESALDLERNYC HPNL SSPSPLQPFASGGQHSESNAAYF
Sbjct: 1 MDRNRLGLRIHHSGSSQSEESALDLERNYCSHPNLLWSSPSPLQPFASGGQHSESNAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPTLSRLNDAAE RANYFGNLQKGVLPETLGRLP+GQ+ATTLLELMTIRAFHSKILRRF
Sbjct: 61 SWPTLSRLNDAAEVRANYFGNLQKGVLPETLGRLPSGQRATTLLELMTIRAFHSKILRRF 120
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRG LTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGDLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYG PA TPKE+LYTELVDGLRGSDPCIGSGSQVA+QETYGTLGAIV+SRTGN+QVGFLT
Sbjct: 181 YYGVPAATPKEQLYTELVDGLRGSDPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITD+LWYGIFAGTNPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGTNPETFVRAD 300
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFAEDFN+NNV +VKGVGE+GDVH+IDLQ+PINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 301 GAFIPFAEDFNMNNVNITVKGVGEVGDVHVIDLQAPINSLIGRQVVKVGRSSGLTTGTIM 360
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG++ EKPRPVGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGRDCEKPRPVGIIWGGTAN 420
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLLDLLELD+I TNEG QAA+QDQRNA A I+STVGE
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDIITTNEGLQAAIQDQRNALAQGIDSTVGE 480
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S P +R SKEK E EP NLNIQQ +GES+ G TP FI EFH+ED +E+S NV H
Sbjct: 481 SSPLDRVPSKEKIEENFEPLNLNIQQVTGEGESQHGQTPLFIGPEFHIEDAVEASPNVEH 540
Query: 541 QFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSD 595
QFIPSF+GRSPMH N QEN K+LSALR+ DE + SL LGEPEPKRRK SD
Sbjct: 541 QFIPSFSGRSPMHDNTPQENPELKNLSALRSDSDEMCF-SLHLGEPEPKRRKQSD 594
>gi|224114770|ref|XP_002332278.1| predicted protein [Populus trichocarpa]
gi|222832440|gb|EEE70917.1| predicted protein [Populus trichocarpa]
Length = 593
Score = 1013 bits (2620), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/595 (85%), Positives = 544/595 (91%), Gaps = 2/595 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
ME+NR LR +SGSSQSEESALDLERNYC+H LP SS SPLQPF SGGQHSESNAAYF
Sbjct: 1 MERNRLGLRIHHSGSSQSEESALDLERNYCNH--LPWSSLSPLQPFTSGGQHSESNAAYF 58
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF
Sbjct: 59 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 118
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRG+LTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGILTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 178
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPA TPKE+LYT+LVDGLRGSDPCIGSGSQVA+QETYGTLGAIV+SRTGN+QVGFLT
Sbjct: 179 YYGAPAATPKEQLYTDLVDGLRGSDPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 298
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA DFN+NNVTT+VKGVGE+GDVH+IDLQ+PINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 299 GAFIPFAGDFNMNNVTTTVKGVGEVGDVHVIDLQAPINSLIGRQVVKVGRSSGLTTGTIM 358
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILL GQ+ EKP+PVGIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLKGQDCEKPQPVGIIWGGTAN 418
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVG PP NWTSGVDLGRLLDLLELDLI TN+G QAAVQDQRNASA AI+STVGE
Sbjct: 419 RGRLKLKVGLPPENWTSGVDLGRLLDLLELDLITTNDGLQAAVQDQRNASAPAIDSTVGE 478
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S P +R SKEK E EP NLN+QQ +V GES+QG +P FI EFH+EDG E++ NV H
Sbjct: 479 SSPLDRVPSKEKIEENFEPINLNMQQGVVKGESQQGQSPLFIGPEFHIEDGAEAAPNVEH 538
Query: 541 QFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSD 595
QFIPSF+G+S MH N QE K+LSALR+ DE+ SLQLG+PEPKRRK D
Sbjct: 539 QFIPSFSGQSLMHDNKPQETPELKNLSALRSDSDEEMCFSLQLGKPEPKRRKQLD 593
>gi|255566289|ref|XP_002524131.1| conserved hypothetical protein [Ricinus communis]
gi|223536598|gb|EEF38242.1| conserved hypothetical protein [Ricinus communis]
Length = 593
Score = 1003 bits (2592), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 502/595 (84%), Positives = 544/595 (91%), Gaps = 2/595 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M++N+ DLR +SGS+QSEESALDLERN C+HPN SSP+ LQPFAS GQH ESNAAYF
Sbjct: 1 MDRNKLDLRLHHSGSTQSEESALDLERNCCNHPNPHWSSPTSLQPFASSGQHYESNAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPTLSRLND AEDRANYFGNLQKGVLPETLGRLP+GQQATTLLELMTIRAFHSKILRRF
Sbjct: 61 SWPTLSRLNDTAEDRANYFGNLQKGVLPETLGRLPSGQQATTLLELMTIRAFHSKILRRF 120
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPA TPKE+LYTELVDGLRGS PCIGSGSQVA+QETYGTLGAIV+SRTGN+QVGFLT
Sbjct: 181 YYGAPASTPKEQLYTELVDGLRGSYPCIGSGSQVANQETYGTLGAIVKSRTGNRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITD+LWYGIFAGTNPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDELWYGIFAGTNPETFVRAD 300
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFAEDFN+NNVTTSVKGVGEIGDVH IDLQSPINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 301 GAFIPFAEDFNMNNVTTSVKGVGEIGDVHSIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 360
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICFFTDFLVVGENQQ FDLEGDSGSLILLTGQNG+KPRPVGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFFTDFLVVGENQQPFDLEGDSGSLILLTGQNGDKPRPVGIIWGGTAN 420
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLLDLLELDL+ +NEG Q VQDQ+N SAA ++STVGE
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLVTSNEGLQ--VQDQKNVSAAGLDSTVGE 478
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S P +R SK++ + +EP NLNIQQ L++ ES+ G T PF TEFH+EDG+E++ NV H
Sbjct: 479 SSPPDRVLSKDRIEDNIEPLNLNIQQVLLEEESQHGLTAPFTRTEFHIEDGVETAPNVEH 538
Query: 541 QFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSD 595
QFIPSFTG +H N QEN ++LSALR+G DE+ +VSL+LGEPEPKRR+ SD
Sbjct: 539 QFIPSFTGGPMVHDKNKQENVELENLSALRHGSDEEIHVSLRLGEPEPKRRRQSD 593
>gi|297737962|emb|CBI27163.3| unnamed protein product [Vitis vinifera]
Length = 684
Score = 997 bits (2577), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/596 (85%), Positives = 548/596 (91%), Gaps = 1/596 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M++ R DLRF +SGS QSEESALDLERNYC+HPNLPS SP PLQ FASGGQ SESNAAYF
Sbjct: 89 MDRTRLDLRFHHSGSIQSEESALDLERNYCNHPNLPSPSPPPLQAFASGGQLSESNAAYF 148
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPT SRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF
Sbjct: 149 SWPTSSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 208
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRGVLT+IPAILVFVARKVHRQWL+H+QCLPAALEGPGGVWCDVDVVEFS
Sbjct: 209 SLGTAIGFRIRRGVLTEIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 268
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPAPTPKE+LYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIV+SRTGNQQVGFLT
Sbjct: 269 YYGAPAPTPKEQLYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNQQVGFLT 328
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD
Sbjct: 329 NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 388
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA+DFN++NVTT+VKGVGEIGDV+IIDLQSPINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 389 GAFIPFADDFNVSNVTTTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 448
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN
Sbjct: 449 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 508
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLLDLLELDLI T+EG QAAV +Q NASAA I+STVGE
Sbjct: 509 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSEGLQAAVHEQINASAAGIDSTVGE 568
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV-G 539
S P E K KT E EP +N+QQ ++GES+Q P FIHTEFH+E+G+E++ NV
Sbjct: 569 SSPPEPVLLKNKTEENFEPLGINLQQVPIEGESQQAVLPSFIHTEFHIEEGVEAAPNVEE 628
Query: 540 HQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSD 595
HQFIPS G+SP+HQNN QEN K+L ALRN +E+ VSLQLG+PEPKRRK +D
Sbjct: 629 HQFIPSCPGKSPVHQNNKQENPELKNLWALRNTSEEEMAVSLQLGKPEPKRRKQAD 684
>gi|225423710|ref|XP_002277727.1| PREDICTED: uncharacterized protein LOC100250825 [Vitis vinifera]
Length = 596
Score = 996 bits (2575), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/596 (85%), Positives = 548/596 (91%), Gaps = 1/596 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M++ R DLRF +SGS QSEESALDLERNYC+HPNLPS SP PLQ FASGGQ SESNAAYF
Sbjct: 1 MDRTRLDLRFHHSGSIQSEESALDLERNYCNHPNLPSPSPPPLQAFASGGQLSESNAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPT SRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF
Sbjct: 61 SWPTSSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRGVLT+IPAILVFVARKVHRQWL+H+QCLPAALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRRGVLTEIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 180
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPAPTPKE+LYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIV+SRTGNQQVGFLT
Sbjct: 181 YYGAPAPTPKEQLYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNQQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD
Sbjct: 241 NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA+DFN++NVTT+VKGVGEIGDV+IIDLQSPINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 301 GAFIPFADDFNVSNVTTTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 360
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLLDLLELDLI T+EG QAAV +Q NASAA I+STVGE
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSEGLQAAVHEQINASAAGIDSTVGE 480
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV-G 539
S P E K KT E EP +N+QQ ++GES+Q P FIHTEFH+E+G+E++ NV
Sbjct: 481 SSPPEPVLLKNKTEENFEPLGINLQQVPIEGESQQAVLPSFIHTEFHIEEGVEAAPNVEE 540
Query: 540 HQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSD 595
HQFIPS G+SP+HQNN QEN K+L ALRN +E+ VSLQLG+PEPKRRK +D
Sbjct: 541 HQFIPSCPGKSPVHQNNKQENPELKNLWALRNTSEEEMAVSLQLGKPEPKRRKQAD 596
>gi|356521576|ref|XP_003529430.1| PREDICTED: uncharacterized protein LOC100796081 [Glycine max]
Length = 600
Score = 971 bits (2509), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 497/604 (82%), Positives = 530/604 (87%), Gaps = 4/604 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M +NR DLR +SGS+QSEESALDLER+Y HPN S PSPLQPFA G QHSESNAAYF
Sbjct: 1 MNQNRLDLRAHHSGSTQSEESALDLERSYYGHPN--PSCPSPLQPFAGGAQHSESNAAYF 58
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPTLSR NDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF
Sbjct: 59 SWPTLSRWNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 118
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIR GVLTDIPAILVFVARKV RQWL+HVQCLPAALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRGGVLTDIPAILVFVARKVRRQWLNHVQCLPAALEGPGGVWCDVDVVEFS 178
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPA TPKE+LYTEL DGLRGSD C+GSGSQVASQETYGTLGAIVRSRTGN++VGFLT
Sbjct: 179 YYGAPAQTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRTGNREVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 298
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFAEDFN+NNV T+VKGVGEI DV+IIDLQSPINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 299 GAFIPFAEDFNMNNVITTVKGVGEISDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 358
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 418
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLLDLLELDLI TNE QAAV +QRN SAA I+STVGE
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNEALQAAVLEQRNGSAAGIDSTVGE 478
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S P KEK E EPF LNI V+ E Q P +FH++ IE++ NV H
Sbjct: 479 SSPT--VPIKEKLEESFEPFCLNIPLAQVEDEPSQRVNPSIRPCDFHIKSEIETAPNVEH 536
Query: 541 QFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNV 600
QFIPS+ G+SP Q+ +E+ KSL+ LRNGPDEDN+VSL LGEPE KRRK S++S +
Sbjct: 537 QFIPSYAGKSPACQSYLKEDMELKSLAELRNGPDEDNFVSLHLGEPEMKRRKISNSSFCI 596
Query: 601 QESK 604
+E K
Sbjct: 597 KELK 600
>gi|356576393|ref|XP_003556316.1| PREDICTED: uncharacterized protein LOC100816119 isoform 1 [Glycine
max]
Length = 598
Score = 969 bits (2506), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/602 (82%), Positives = 530/602 (88%), Gaps = 4/602 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M +N+ DLR +SGS+QSEESALDLER+Y HPN SSPSPLQPFA G QHSESNAAYF
Sbjct: 1 MNQNQLDLRAHHSGSTQSEESALDLERSYYGHPN--PSSPSPLQPFAGGAQHSESNAAYF 58
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPTLSR NDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF
Sbjct: 59 SWPTLSRWNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 118
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIR GVLTDIPAILVFVARKVHRQWL+H+QCLPAALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRGGVLTDIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 178
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPA TPKE+LYTEL DGLRGSD C+GSGSQVASQETYGTLGAIVRSR+GN++VGFLT
Sbjct: 179 YYGAPAQTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRSGNREVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 298
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFAEDFN+NNV T+VKGVGEIGDV+IIDLQSPINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 299 GAFIPFAEDFNMNNVITTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 358
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILLTGQNGEKP PVGIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPCPVGIIWGGTAN 418
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLLDLLELDLI TNE QAAV +QRN SAA I+STVGE
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNEALQAAVLEQRNGSAAGIDSTVGE 478
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S P KEK E EPF LNI V+ E Q P EFH++ IE + NV H
Sbjct: 479 SSPT--VPIKEKLEESFEPFCLNIPLAQVEDEPSQRVNPSIRPCEFHIKSEIEIAPNVEH 536
Query: 541 QFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNV 600
QFIPS+ G+SP Q+ +E+ KSL+ LRNGPDEDN+VSL LGEPE KRRK S++S +
Sbjct: 537 QFIPSYAGKSPARQSYLKEDMELKSLAELRNGPDEDNFVSLHLGEPEMKRRKLSNSSFCI 596
Query: 601 QE 602
+E
Sbjct: 597 KE 598
>gi|356576395|ref|XP_003556317.1| PREDICTED: uncharacterized protein LOC100816119 isoform 2 [Glycine
max]
Length = 600
Score = 965 bits (2494), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/604 (82%), Positives = 530/604 (87%), Gaps = 6/604 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M +N+ DLR +SGS+QSEESALDLER+Y HPN SSPSPLQPFA G QHSESNAAYF
Sbjct: 1 MNQNQLDLRAHHSGSTQSEESALDLERSYYGHPN--PSSPSPLQPFAGGAQHSESNAAYF 58
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPTLSR NDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF
Sbjct: 59 SWPTLSRWNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 118
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIR GVLTDIPAILVFVARKVHRQWL+H+QCLPAALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRGGVLTDIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 178
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPA TPKE+LYTEL DGLRGSD C+GSGSQVASQETYGTLGAIVRSR+GN++VGFLT
Sbjct: 179 YYGAPAQTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRSGNREVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 298
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFAEDFN+NNV T+VKGVGEIGDV+IIDLQSPINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 299 GAFIPFAEDFNMNNVITTVKGVGEIGDVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIM 358
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILLTGQNGEKP PVGIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPCPVGIIWGGTAN 418
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQ--AAVQDQRNASAAAIESTV 478
RGRLKLKVGQPP NWTSGVDLGRLLDLLELDLI TNE Q AAV +QRN SAA I+STV
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNEALQAAAAVLEQRNGSAAGIDSTV 478
Query: 479 GESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV 538
GES P KEK E EPF LNI V+ E Q P EFH++ IE + NV
Sbjct: 479 GESSPT--VPIKEKLEESFEPFCLNIPLAQVEDEPSQRVNPSIRPCEFHIKSEIEIAPNV 536
Query: 539 GHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSL 598
HQFIPS+ G+SP Q+ +E+ KSL+ LRNGPDEDN+VSL LGEPE KRRK S++S
Sbjct: 537 EHQFIPSYAGKSPARQSYLKEDMELKSLAELRNGPDEDNFVSLHLGEPEMKRRKLSNSSF 596
Query: 599 NVQE 602
++E
Sbjct: 597 CIKE 600
>gi|147798987|emb|CAN61635.1| hypothetical protein VITISV_008456 [Vitis vinifera]
Length = 1092
Score = 963 bits (2489), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/658 (77%), Positives = 548/658 (83%), Gaps = 63/658 (9%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M++ R DLRF +SGS QSEESALDLERNYC+HPNLPS SP PLQ FASGGQ SESNAAYF
Sbjct: 435 MDRTRLDLRFHHSGSIQSEESALDLERNYCNHPNLPSPSPPPLQAFASGGQLSESNAAYF 494
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPT SRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF
Sbjct: 495 SWPTSSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 554
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRGVLT+IPAILVFVARKVHRQWL+H+QCLPAALEGPGGVWCDVDVVEFS
Sbjct: 555 SLGTAIGFRIRRGVLTEIPAILVFVARKVHRQWLNHIQCLPAALEGPGGVWCDVDVVEFS 614
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQ--------------------------- 213
YYGAPAPTPKE+LYTELVDGLRGSDPCIGSGSQ
Sbjct: 615 YYGAPAPTPKEQLYTELVDGLRGSDPCIGSGSQSIXEDYSCMGKTSGCNLFVQMLLELID 674
Query: 214 --------VASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGP 265
VASQETYGTLGAIV+SRTGNQQVGFLTNRHVAVDLDYP+QKMFHPLPPSLGP
Sbjct: 675 KTNPGVVHVASQETYGTLGAIVKSRTGNQQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGP 734
Query: 266 GVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEI 325
GVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFA+DFN++NVTT+VKGVGEI
Sbjct: 735 GVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFADDFNVSNVTTTVKGVGEI 794
Query: 326 GDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQ 385
G+V+IIDLQSPINSLIGRQV+KVGRSSGLTTGT+MAYALEYNDEKGICFFTDFLVVGENQ
Sbjct: 795 GEVNIIDLQSPINSLIGRQVVKVGRSSGLTTGTIMAYALEYNDEKGICFFTDFLVVGENQ 854
Query: 386 QTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLL 445
QTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPP NWTSGVDLGRLL
Sbjct: 855 QTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLL 914
Query: 446 DLLELDLIATNEGFQ---------------------------AAVQDQRNASAAAIESTV 478
DLLELDLI T+EG Q AAV +Q NASAA I+STV
Sbjct: 915 DLLELDLITTSEGLQVLEAKIDLQKGFLTIQMMFFSWFIVNIAAVHEQINASAAGIDSTV 974
Query: 479 GESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV 538
GES P E K KT E EP +N+QQ ++GES+Q P FIHTEFH+E+G+E++ NV
Sbjct: 975 GESSPPEPVLLKNKTEENFEPLGINLQQVPIEGESQQAVLPSFIHTEFHIEEGVEAAPNV 1034
Query: 539 -GHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSD 595
HQFIPS G+SP+HQNN QEN K+L ALRN +E+ VSLQLG+PEPKRRK +D
Sbjct: 1035 EEHQFIPSCPGKSPVHQNNKQENPELKNLWALRNTSEEEMXVSLQLGKPEPKRRKQAD 1092
>gi|357475191|ref|XP_003607881.1| hypothetical protein MTR_4g084020 [Medicago truncatula]
gi|124359654|gb|ABN06026.1| Peptidase, trypsin-like serine and cysteine proteases [Medicago
truncatula]
gi|355508936|gb|AES90078.1| hypothetical protein MTR_4g084020 [Medicago truncatula]
Length = 597
Score = 934 bits (2414), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/602 (80%), Positives = 521/602 (86%), Gaps = 7/602 (1%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M +NR L +SGS+QSEESALDLERNY HP SSSP +Q FA G QHSE NAAYF
Sbjct: 1 MNRNRLGLSAHHSGSTQSEESALDLERNYYGHP---SSSPLHMQTFAVGVQHSEGNAAYF 57
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPTL+R NDAAEDRANYFGNLQKGVLPETLGRLP+GQQATTLLELMTIRAFHSKILRRF
Sbjct: 58 SWPTLNRWNDAAEDRANYFGNLQKGVLPETLGRLPSGQQATTLLELMTIRAFHSKILRRF 117
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIR GVLTDIPAILVFVA KVHRQWL+HVQCLPAALEGPGGVWCDVDVVEFS
Sbjct: 118 SLGTAIGFRIRGGVLTDIPAILVFVAHKVHRQWLNHVQCLPAALEGPGGVWCDVDVVEFS 177
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPAPTPKE+LYTEL DGLRGSD C+GSGSQVASQETYGTLGAIVRSRTGN++VGFLT
Sbjct: 178 YYGAPAPTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRTGNREVGFLT 237
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD
Sbjct: 238 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 297
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFAEDFN+NNV TS++GVG+IG+VH IDLQSPINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 298 GAFIPFAEDFNMNNVITSIRGVGDIGEVHRIDLQSPINSLIGRQVIKVGRSSGLTTGTIM 357
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILLTGQN EKPRPVGIIWGGTAN
Sbjct: 358 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNREKPRPVGIIWGGTAN 417
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKL+VGQPP NWTSGVDLGRLLDLLELDL+ TNE Q + Q+Q N S A I STVGE
Sbjct: 418 RGRLKLRVGQPPENWTSGVDLGRLLDLLELDLVTTNETLQDSGQEQMNGSTAGIGSTVGE 477
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S P KEK E EPF LN++ V+ E P EFH+ + IE+ NV H
Sbjct: 478 SSPT--VPIKEKLEESFEPFCLNMEHVPVE-EPSTIVKPSLRPCEFHIRNEIETVPNVEH 534
Query: 541 QFI-PSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLN 599
QFI SF G+SP+HQ+ +E+ KSLS LRN PDEDN+VSL LGEPE KRRKHS++SL+
Sbjct: 535 QFIRTSFAGKSPVHQSFLKEDMQFKSLSELRNEPDEDNFVSLHLGEPEAKRRKHSNSSLS 594
Query: 600 VQ 601
++
Sbjct: 595 LK 596
>gi|224117600|ref|XP_002317619.1| predicted protein [Populus trichocarpa]
gi|222860684|gb|EEE98231.1| predicted protein [Populus trichocarpa]
Length = 597
Score = 900 bits (2326), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/593 (74%), Positives = 502/593 (84%), Gaps = 5/593 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
ME++R ++R + S+ S+ESAL ERNYC HP L S + LQPFAS GQH ESNAAYF
Sbjct: 1 MERSRNNMRAHCNVSTPSDESAL--ERNYCSHPRLTSVGSATLQPFASAGQHCESNAAYF 58
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPT SRL+DAAE+RANYF NLQKG+LPETLG+ P GQ+ATTLL+LMTIRAFHSKILR +
Sbjct: 59 SWPTSSRLSDAAEERANYFANLQKGILPETLGQFPKGQRATTLLDLMTIRAFHSKILRCY 118
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRGVLTDIPAILVFV+RKVH+QWLS VQCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSTVQCLPNALEGPGGVWCDVDVVEFS 178
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
Y+GAP PTPKE+LYTE+V+ LRG IGSGSQVASQETYGTLGAIVRS++G++QVGFLT
Sbjct: 179 YFGAPQPTPKEQLYTEIVNDLRGDGLYIGSGSQVASQETYGTLGAIVRSQSGSRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPP+LGPGV LGAVERATSFITDDLWYGIFAG NPETFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPTLGPGVNLGAVERATSFITDDLWYGIFAGINPETFVRAD 298
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPF +DF+++ V TSVKGVGEIGDV IIDLQ PI+ LIG+QVMKVGRSSGLTTGTV
Sbjct: 299 GAFIPFTDDFDMSTVNTSVKGVGEIGDVKIIDLQCPISDLIGKQVMKVGRSSGLTTGTVF 358
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AY LEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLI++ G+NGEKPRP+GIIWGGTAN
Sbjct: 359 AYGLEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIMKGENGEKPRPIGIIWGGTAN 418
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLL LELDLI TNEG QAAVQ+QR ASA AI ST+G+
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLYHLELDLITTNEGLQAAVQEQRAASATAICSTIGD 478
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQG-PTPPFIHTEFHVEDGIESSSNVG 539
S P + ++ ++LE L I+ + E E G P + T FH+EDGI+ + +V
Sbjct: 479 SSPPDGMLPNDRMDDKLESLGLQIEH--IPSEVENGIPKSSLMETNFHLEDGIKLTPSVE 536
Query: 540 HQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRK 592
HQFIPSF +SP+HQNN + K S++L++LRNG DED +VSL LG+ E KRR+
Sbjct: 537 HQFIPSFIRQSPLHQNNVSDKKVSENLASLRNGCDEDIFVSLHLGDNEAKRRR 589
>gi|356525782|ref|XP_003531502.1| PREDICTED: uncharacterized protein LOC100806376 [Glycine max]
Length = 602
Score = 900 bits (2325), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/603 (75%), Positives = 512/603 (84%), Gaps = 4/603 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
ME+ R ++R SGS+ SEESALDLERN C H NLPS SP LQPFAS GQH ES+AAYF
Sbjct: 1 MERARLNMRGHCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWP SRLNDAAE+RANYF NLQKGVLPETLGRLP G QATTLLELMTIRAFHSKILR +
Sbjct: 61 SWP--SRLNDAAEERANYFLNLQKGVLPETLGRLPKGHQATTLLELMTIRAFHSKILRCY 118
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRGVLTDIPAILVFV+RKVH+QWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 178
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
Y+GAP P PKE+LYTE+VD LRG DPCIGSGSQVASQETYGTLGAIV+S+TG++QVGFLT
Sbjct: 179 YFGAPEPVPKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 298
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA+DF+++ VTTSV+GVG+IGDV IIDLQ+PI+SLIG+QV+KVGRSSGLTTG V+
Sbjct: 299 GAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVVL 358
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TD LVVGENQQTFDLEGDSGSLI+L G GEKPRP+GIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDIGEKPRPIGIIWGGTAN 418
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLL+LLELDLI T+EG Q AVQ+QR SA I STVG+
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAVSATVIGSTVGD 478
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQ-DLVDGESEQGPTPPFIHTEFHVEDGIESSSNVG 539
S P + K+K ++ EP L IQ L S Q P + TEF +EDGI ++
Sbjct: 479 SSPPDGVLPKDKAEDKYEPLGLQIQSIPLGVVPSSQDMKPSIMETEFKLEDGINVGPSIE 538
Query: 540 HQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLN 599
HQFIPSF GRSP+H+N+ Q+ +++LS+LRN DED VSLQLG+ E KRR+ S+ S +
Sbjct: 539 HQFIPSFIGRSPLHKNSIQDRTATENLSSLRNNCDEDLCVSLQLGDNEAKRRR-SEASTS 597
Query: 600 VQE 602
+E
Sbjct: 598 TEE 600
>gi|356556958|ref|XP_003546786.1| PREDICTED: uncharacterized protein LOC100783035 [Glycine max]
Length = 602
Score = 899 bits (2322), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/603 (75%), Positives = 513/603 (85%), Gaps = 4/603 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
ME+ R ++R + SGS+ SEESALDLERN C H NLPS SP LQPFAS GQH ES+AAYF
Sbjct: 1 MERTRLNMRGRCSGSTPSEESALDLERNCCSHSNLPSLSPPTLQPFASAGQHCESSAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWP SRLNDAAE+RANYF NLQK VLPETLGRLP G QATTLLELMTIRAFHSKILR +
Sbjct: 61 SWP--SRLNDAAEERANYFLNLQKEVLPETLGRLPKGHQATTLLELMTIRAFHSKILRCY 118
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRGVLTDIPAILVFV+RKVH+QWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 178
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
Y+GAP P KE+LYTE+VD LRG DPCIGSGSQVASQETYGTLGAIV+S+TG++QVGFLT
Sbjct: 179 YFGAPEPVSKEQLYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVKSQTGSRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 298
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA+DF+++ VTTSV+GVG+IGDV IIDLQ+PI+SLIG+QV+KVGRSSGLTTG V+
Sbjct: 299 GAFIPFADDFDMSTVTTSVRGVGDIGDVKIIDLQAPISSLIGKQVVKVGRSSGLTTGVVL 358
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TD LVVGENQQTFDLEGDSGSLI+L G NGEKPRP+GIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDLLVVGENQQTFDLEGDSGSLIMLKGDNGEKPRPIGIIWGGTAN 418
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLL+LLELDLI T+EG Q AVQ+QR SA I STVG+
Sbjct: 419 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITTDEGLQVAVQEQRAVSATVIGSTVGD 478
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQ-DLVDGESEQGPTPPFIHTEFHVEDGIESSSNVG 539
S P + K+K ++ EP L IQ L S Q P + TEF +EDGI+ ++
Sbjct: 479 SSPPDGVLPKDKAEDKYEPLGLQIQSIPLGVVPSSQDMKPSIMETEFKLEDGIKVGPSIE 538
Query: 540 HQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLN 599
HQFIPSF GRSP+H+N+ Q+ +++LS+LRN DED VSLQLG+ E KRR+ S+ S +
Sbjct: 539 HQFIPSFIGRSPLHKNSIQDRTATENLSSLRNNCDEDLCVSLQLGDNEAKRRR-SEASTS 597
Query: 600 VQE 602
+E
Sbjct: 598 TEE 600
>gi|449433481|ref|XP_004134526.1| PREDICTED: uncharacterized protein LOC101202735 [Cucumis sativus]
gi|449519914|ref|XP_004166979.1| PREDICTED: uncharacterized LOC101202735 [Cucumis sativus]
Length = 604
Score = 897 bits (2317), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/605 (78%), Positives = 520/605 (85%), Gaps = 5/605 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M++ R DL F +S S+QSEESALDLERNYC H +LPSSSPSP Q FA G Q SE+NAAYF
Sbjct: 1 MDRTRLDLTFHHSVSTQSEESALDLERNYCSHLHLPSSSPSPSQCFAPGSQLSETNAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPT SRLNDAAEDRANYFGNLQKGVLPE LGRLPTGQ+ATTLLELMTIRAFHSKILRRF
Sbjct: 61 SWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRAFHSKILRRF 120
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI++G+LTDIPAI+VFVARKVHRQWLS VQCLPAALEGPGG+WCDVDVVEFS
Sbjct: 121 SLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFS 180
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPA TPKEE+YTELVDGLRGSDP IGSGSQVASQETYGTLGAIV+SRTG +QVGFLT
Sbjct: 181 YYGAPAATPKEEVYTELVDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDD+WYGIFAGTNPETFVRAD
Sbjct: 241 NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD 300
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFAEDFN+NNV T VKGVGE+GDV+ IDLQSPINSLIGR+V+KVGRSSGLT GT+M
Sbjct: 301 GAFIPFAEDFNMNNVVTFVKGVGEVGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIM 360
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYND KGICFFTDFLVVG++QQTFDLEGDSGSLILLTGQ+ EKPRPVGIIWGGTAN
Sbjct: 361 AYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTAN 420
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLLDLLELDLI TN+G QAAV +QRN S I+STV E
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQAAVHEQRNNSVGGIDSTVAE 480
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S +R K + E E L++QQ +GES QG PF H F +E+G E + ++
Sbjct: 481 S-CLDRIPLKYRLKENSELLGLSVQQISPEGESSQGMISPFKHA-FQIENGFEVTPSIEL 538
Query: 541 QFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLG--EPEPKRRKHSDTSL 598
QFIP T SP+ Q N Q + K+LSALRNG D + VSLQLG EPE KRRKH D
Sbjct: 539 QFIPRLTSNSPLDQKNEQIQE-LKNLSALRNGYDSEVSVSLQLGEHEPEAKRRKHLDCLS 597
Query: 599 NVQES 603
+++ES
Sbjct: 598 SIKES 602
>gi|255544706|ref|XP_002513414.1| conserved hypothetical protein [Ricinus communis]
gi|223547322|gb|EEF48817.1| conserved hypothetical protein [Ricinus communis]
Length = 600
Score = 877 bits (2267), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/607 (72%), Positives = 510/607 (84%), Gaps = 10/607 (1%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
ME +R ++R + SGS+ SEESALD ERN C HPNLPS SP LQPF S GQH ES+AAYF
Sbjct: 1 MECSRLNMRARCSGSTPSEESALDAERNCCSHPNLPSLSPRTLQPFVSAGQHCESSAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWP+ RLNDA E+RANYF NLQKGVLPETL RLP GQ+ATTLLELMTIRAFHSKILR +
Sbjct: 61 SWPSW-RLNDAVEERANYFSNLQKGVLPETLNRLPRGQRATTLLELMTIRAFHSKILRCY 119
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRI+RGVLTDIPAILVFV+RKVH+QWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 120 SLGTAIGFRIQRGVLTDIPAILVFVSRKVHKQWLSPIQCLPNALEGPGGVWCDVDVVEFS 179
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
Y+GAP PTPKE+LYTE+VD LRG D CIGSG QVASQETYGTLGAIV+S+TG +QVGFLT
Sbjct: 180 YFGAPEPTPKEQLYTEIVDDLRGGDLCIGSGFQVASQETYGTLGAIVKSQTGTRQVGFLT 239
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDDLWYGIFAG NPETFVRAD
Sbjct: 240 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDDLWYGIFAGMNPETFVRAD 299
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA+DF+++ VTTSVKGVG+IGDV IIDLQ PI SLIG+QVMKVGRSSGLTTGT++
Sbjct: 300 GAFIPFADDFDMSTVTTSVKGVGQIGDVKIIDLQCPIGSLIGKQVMKVGRSSGLTTGTIL 359
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AY LEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLI++ G+NGEKPRP+GIIWGGTAN
Sbjct: 360 AYGLEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIMKGENGEKPRPIGIIWGGTAN 419
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLL+LLEL LI T+EG + A+Q+QR ASA I ST+G+
Sbjct: 420 RGRLKLKVGQPPENWTSGVDLGRLLNLLELGLITTDEGLKVAIQEQRIASATTIGSTIGD 479
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPT---PPFIHTEFHVEDGIESSSN 537
S P + +K E +L +Q + + E E G + P + T FH+EDGI + +
Sbjct: 480 SSPLDGMLPSDKVEE-----SLGLQIEHIPLEVELGNSEINPRLVETNFHLEDGIMVAPS 534
Query: 538 VGHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTS 597
V HQFIPSFT +SP+H++N + ++L++LRNG +ED VSL LG+ E K+R S+ S
Sbjct: 535 VEHQFIPSFTRQSPLHKSNLSDKVVLENLASLRNGCNEDVCVSLHLGDNEAKKRS-SNAS 593
Query: 598 LNVQESK 604
+++E K
Sbjct: 594 TSIEEPK 600
>gi|124301256|gb|ABN04842.1| Peptidase, trypsin-like serine and cysteine proteases [Medicago
truncatula]
Length = 546
Score = 870 bits (2247), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/552 (81%), Positives = 480/552 (86%), Gaps = 7/552 (1%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M +NR L +SGS+QSEESALDLERNY HP SSSP +Q FA G QHSE NAAYF
Sbjct: 1 MNRNRLGLSAHHSGSTQSEESALDLERNYYGHP---SSSPLHMQTFAVGVQHSEGNAAYF 57
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPTL+R NDAAEDRANYFGNLQKGVLPETLGRLP+GQQATTLLELMTIRAFHSKILRRF
Sbjct: 58 SWPTLNRWNDAAEDRANYFGNLQKGVLPETLGRLPSGQQATTLLELMTIRAFHSKILRRF 117
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIR GVLTDIPAILVFVA KVHRQWL+HVQCLPAALEGPGGVWCDVDVVEFS
Sbjct: 118 SLGTAIGFRIRGGVLTDIPAILVFVAHKVHRQWLNHVQCLPAALEGPGGVWCDVDVVEFS 177
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
YYGAPAPTPKE+LYTEL DGLRGSD C+GSGSQVASQETYGTLGAIVRSRTGN++VGFLT
Sbjct: 178 YYGAPAPTPKEQLYTELADGLRGSDSCVGSGSQVASQETYGTLGAIVRSRTGNREVGFLT 237
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD
Sbjct: 238 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 297
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFAEDFN+NNV TS++GVG+IG+VH IDLQSPINSLIGRQV+KVGRSSGLTTGT+M
Sbjct: 298 GAFIPFAEDFNMNNVITSIRGVGDIGEVHRIDLQSPINSLIGRQVIKVGRSSGLTTGTIM 357
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILLTGQN EKPRPVGIIWGGTAN
Sbjct: 358 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLTGQNREKPRPVGIIWGGTAN 417
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKL+VGQPP NWTSGVDLGRLLDLLELDL+ TNE Q + Q+Q N S A I STVGE
Sbjct: 418 RGRLKLRVGQPPENWTSGVDLGRLLDLLELDLVTTNETLQDSGQEQMNGSTAGIGSTVGE 477
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S P KEK E EPF LN++ V+ E P EFH+ + IE+ NV H
Sbjct: 478 SSPT--VPIKEKLEESFEPFCLNMEHVPVE-EPSTIVKPSLRPCEFHIRNEIETVPNVEH 534
Query: 541 QFI-PSFTGRSP 551
QFI SF G+SP
Sbjct: 535 QFIRTSFAGKSP 546
>gi|357451853|ref|XP_003596203.1| hypothetical protein MTR_2g069500 [Medicago truncatula]
gi|355485251|gb|AES66454.1| hypothetical protein MTR_2g069500 [Medicago truncatula]
Length = 603
Score = 865 bits (2235), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 444/605 (73%), Positives = 507/605 (83%), Gaps = 6/605 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
ME+ R + R + SGS+ SEESALDLERN H NLPS SP LQPFAS GQH ESNAAYF
Sbjct: 1 MERPRLNSRVRCSGSTPSEESALDLERNCYGHSNLPSLSPPTLQPFASAGQHGESNAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWP SRL DAAE+RANYF NLQKGVLPETLGRLP GQQATTLLELMTIRAFHSKILR +
Sbjct: 61 SWP--SRLPDAAEERANYFLNLQKGVLPETLGRLPKGQQATTLLELMTIRAFHSKILRCY 118
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRGVLTDIPAILVFV+RKVH+QWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 119 SLGTAIGFRIRRGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 178
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
Y+GAP P PKE+ YTE+VD LRG DPCIGSGSQVASQETYGTLGAIVRS+TG++QVGFLT
Sbjct: 179 YFGAPEPVPKEQHYTEIVDDLRGGDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLT 238
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFVRAD
Sbjct: 239 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 298
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA+DF++ VTTSV+GVG+IGDV IIDLQSPI++LIG+QV+KVGRSSGLTTG V+
Sbjct: 299 GAFIPFADDFDMCTVTTSVRGVGDIGDVKIIDLQSPISTLIGKQVVKVGRSSGLTTGIVL 358
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLI+ G NGEKPRP+GIIWGGTAN
Sbjct: 359 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIMFKGDNGEKPRPIGIIWGGTAN 418
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLK+G PP NWTSGVDLGRLL+LLELDLI ++EG + AVQ+QR ASA + S VG+
Sbjct: 419 RGRLKLKIGLPPENWTSGVDLGRLLNLLELDLITSDEGLRVAVQEQRTASATFMGSIVGD 478
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGE-SEQGPTPPFIHTEFHVEDGIE-SSSNV 538
S + K++ ++ EP L IQ + E + Q P + EF +EDGI+ ++
Sbjct: 479 SSTPDGMHQKDRVEDKFEPLGLQIQSIPLGVEPNSQEMKPSTMEAEFKLEDGIKVGGPSI 538
Query: 539 GHQFIPSFTGRSPMHQNNAQEN-KGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTS 597
HQFIPSF GRSP+H++ + +++LS+LRN +ED VSLQLG+ E KRR+ S+ S
Sbjct: 539 EHQFIPSFIGRSPLHKHTVHDKAAAAENLSSLRNDCNEDLCVSLQLGDNEAKRRR-SEAS 597
Query: 598 LNVQE 602
+ +E
Sbjct: 598 TSTEE 602
>gi|15241646|ref|NP_199316.1| trypsin-like protein [Arabidopsis thaliana]
gi|79329912|ref|NP_001032013.1| trypsin-like protein [Arabidopsis thaliana]
gi|10177495|dbj|BAB10886.1| unnamed protein product [Arabidopsis thaliana]
gi|222423925|dbj|BAH19926.1| AT5G45030 [Arabidopsis thaliana]
gi|332007808|gb|AED95191.1| trypsin-like protein [Arabidopsis thaliana]
gi|332007809|gb|AED95192.1| trypsin-like protein [Arabidopsis thaliana]
Length = 607
Score = 853 bits (2204), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/614 (72%), Positives = 502/614 (81%), Gaps = 18/614 (2%)
Query: 1 MEKNRWDLRFQNSGSSQSEESA-LDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAA- 58
ME R DLRF +S SSQS ESA LDL++N +H L SSSP LQPF SG QH E++AA
Sbjct: 1 MEGKRLDLRFHHSTSSQSVESAALDLDKNVYNHIKLASSSP--LQPFPSGAQHPETSAAA 58
Query: 59 -YFSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKIL 117
YFSWPT SRLND+AEDRANYF NLQKGVLPE+ LPTG++ATTLLELM IRAFHSK L
Sbjct: 59 AYFSWPTSSRLNDSAEDRANYFANLQKGVLPESFDGLPTGKKATTLLELMMIRAFHSKNL 118
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
RRFSLGTAIGFRIRRGVLT+I AILVFVARKVH+QWL+ +QCLP ALEGPGGVWCDVDVV
Sbjct: 119 RRFSLGTAIGFRIRRGVLTNIAAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVDVV 178
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVG 237
EF YYGAPA TPKE++YTELVD LRGS IGSGSQVASQETYGTLGAIV+S+TG +QVG
Sbjct: 179 EFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQVG 238
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 297
FLTNRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV
Sbjct: 239 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 298
Query: 298 RADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTG 357
RADGAFIPFAEDFN NNVTT+VKG+GEIGD+H DLQSP+NSLIGR+V+KVGRSSGLTTG
Sbjct: 299 RADGAFIPFAEDFNTNNVTTTVKGIGEIGDIHATDLQSPVNSLIGRKVVKVGRSSGLTTG 358
Query: 358 TVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG--QNGEKPRPVGIIW 415
T+MAYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILL + EKPRPVGIIW
Sbjct: 359 TIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGIIW 418
Query: 416 GGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNA-SAAAI 474
GGTANRGRLKLKVG+ P NWTSGVDLGR+L+LLELDLI +NEG QAAV +QRN AA+
Sbjct: 419 GGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCAAV 478
Query: 475 ESTVGESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIES 534
+STV ES P S+ KT E EP NLN+QQ L++ ++ IH EF +ED +ES
Sbjct: 479 DSTVVESSPGVCNISRCKTGENFEPINLNVQQVLIEDDNSN------IHPEFQIEDVLES 532
Query: 535 SSNV-GHQFIPSFTGR-SPMHQN-NAQENKGSKSLSALRNGPDEDNY-VSLQLGEPEPKR 590
+ + HQFIPS + S +HQ N EN SK+LS+L+ D SLQLGE + K+
Sbjct: 533 VAVIEEHQFIPSSSNNGSALHQKPNGPENLESKNLSSLKTSSSGDEIGFSLQLGESDTKK 592
Query: 591 RKHSDTSLNVQESK 604
RK +D+ QE +
Sbjct: 593 RKRTDSPDGSQEDE 606
>gi|20466342|gb|AAM20488.1| putative protein [Arabidopsis thaliana]
gi|25084087|gb|AAN72171.1| putative protein [Arabidopsis thaliana]
Length = 607
Score = 850 bits (2197), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/614 (72%), Positives = 501/614 (81%), Gaps = 18/614 (2%)
Query: 1 MEKNRWDLRFQNSGSSQSEESA-LDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAA- 58
ME R DLRF +S SSQS ESA LDL++N +H L SSSP LQPF SG QH E++AA
Sbjct: 1 MEGKRLDLRFHHSTSSQSVESAALDLDKNVYNHIKLASSSP--LQPFPSGAQHPETSAAA 58
Query: 59 -YFSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKIL 117
YFSWPT SRLND+AEDRANYF NLQKGVLPE+ LPTG++ATTLLELM IRAFHSK L
Sbjct: 59 AYFSWPTSSRLNDSAEDRANYFANLQKGVLPESFDGLPTGKKATTLLELMMIRAFHSKNL 118
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
RRFSLGTAIGFRIRRGVLT+I AILVFVARKVH+QWL+ +QCLP ALEGPGGVWCDVDVV
Sbjct: 119 RRFSLGTAIGFRIRRGVLTNIAAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVDVV 178
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVG 237
EF YYGAPA TPKE++YTELVD LRGS IGSGSQVASQE YGTLGAIV+S+TG +QVG
Sbjct: 179 EFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQERYGTLGAIVKSKTGIRQVG 238
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 297
FLTNRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV
Sbjct: 239 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 298
Query: 298 RADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTG 357
RADGAFIPFAEDFN NNVTT+VKG+GEIGD+H DLQSP+NSLIGR+V+KVGRSSGLTTG
Sbjct: 299 RADGAFIPFAEDFNTNNVTTTVKGIGEIGDIHATDLQSPVNSLIGRKVVKVGRSSGLTTG 358
Query: 358 TVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG--QNGEKPRPVGIIW 415
T+MAYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILL + EKPRPVGIIW
Sbjct: 359 TIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGIIW 418
Query: 416 GGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNA-SAAAI 474
GGTANRGRLKLKVG+ P NWTSGVDLGR+L+LLELDLI +NEG QAAV +QRN AA+
Sbjct: 419 GGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCAAV 478
Query: 475 ESTVGESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIES 534
+STV ES P S+ KT E EP NLN+QQ L++ ++ IH EF +ED +ES
Sbjct: 479 DSTVVESSPGVCNISRCKTGENFEPINLNVQQVLIEDDNSN------IHPEFQIEDVLES 532
Query: 535 SSNV-GHQFIPSFTGR-SPMHQN-NAQENKGSKSLSALRNGPDEDNY-VSLQLGEPEPKR 590
+ + HQFIPS + S +HQ N EN SK+LS+L+ D SLQLGE + K+
Sbjct: 533 VAVIEEHQFIPSSSNNGSALHQKPNGPENLESKNLSSLKTSSSGDEIGFSLQLGESDTKK 592
Query: 591 RKHSDTSLNVQESK 604
RK +D+ QE +
Sbjct: 593 RKRTDSPDGSQEDE 606
>gi|449453788|ref|XP_004144638.1| PREDICTED: uncharacterized protein LOC101217211 [Cucumis sativus]
gi|449504216|ref|XP_004162286.1| PREDICTED: uncharacterized protein LOC101225003 [Cucumis sativus]
Length = 601
Score = 843 bits (2178), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/604 (71%), Positives = 495/604 (81%), Gaps = 3/604 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
ME+ R + R SGS+ SEESALDLERN C H +LPS S LQPFAS GQH N AYF
Sbjct: 1 MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPT RL+ E+RANYF NLQKGVLP+ L LP GQ+A TLLELMTIRAFHSKILR +
Sbjct: 61 SWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCY 120
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIR+GVLTDIPAILVFV+RKVH+QWLS +QCLP ALEGPGGVWCDVDVVEFS
Sbjct: 121 SLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFS 180
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
Y+GAP P PKE+LYTE+VD LRGSDPCIGSGSQVASQETYGTLGAIVRS+TG +QVGFLT
Sbjct: 181 YFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFVRAD
Sbjct: 241 NRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD 300
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA+DF+++ VTTSVKGVG++GDV IDLQSPI++LIG+QV+KVGRSSGLTTGTV+
Sbjct: 301 GAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVL 360
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLI+L G+N + +P+GIIWGGTAN
Sbjct: 361 AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRDTLQPIGIIWGGTAN 420
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGRLKLKVGQPP NWTSGVDLGRLL+LLELDLI ++EG +AAVQ+Q SA I S VG+
Sbjct: 421 RGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGD 480
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGH 540
S P + KEK+ E+ E IQ + E P + TEFH+E G+ + +V H
Sbjct: 481 SSPPDTTLPKEKSEEKSEQLGFQIQHMPTEVEPS-AKDRPLLETEFHLEPGMNRAPSVEH 539
Query: 541 QFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNV 600
QFIPS SP HQN+ + S++LS LR+ ED VSLQLG+ E KRR+ SD S+++
Sbjct: 540 QFIPSLFSCSPSHQNSTLDRAVSQNLSLLRSDC-EDLCVSLQLGDHEAKRRR-SDASVSM 597
Query: 601 QESK 604
+E K
Sbjct: 598 EELK 601
>gi|297794835|ref|XP_002865302.1| hypothetical protein ARALYDRAFT_917056 [Arabidopsis lyrata subsp.
lyrata]
gi|297311137|gb|EFH41561.1| hypothetical protein ARALYDRAFT_917056 [Arabidopsis lyrata subsp.
lyrata]
Length = 614
Score = 835 bits (2158), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 444/614 (72%), Positives = 498/614 (81%), Gaps = 20/614 (3%)
Query: 1 MEKNRWDLRFQNSGSSQSEE---SALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNA 57
ME R DLRF +S SS S+ +ALDL++N +H L SSSP QPF SGGQH E++A
Sbjct: 1 MEGKRLDLRFHHSVSSSSQSVESAALDLDKNGYNHIKLASSSP--FQPFPSGGQHPETSA 58
Query: 58 A--YFSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSK 115
A YFSWPT RLND+AEDRANYF NLQKGVLPET LPTG++ATTLLELM IRAFHSK
Sbjct: 59 AAAYFSWPTSCRLNDSAEDRANYFANLQKGVLPETFDGLPTGKKATTLLELMMIRAFHSK 118
Query: 116 ILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVD 175
LRRFSLGTAIGFRIRRGVLT+I AILVFVARKVH+QWL+ +QCLP ALEGPGGVWCDVD
Sbjct: 119 NLRRFSLGTAIGFRIRRGVLTNIAAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVD 178
Query: 176 VVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQ 235
VVEF YYGAPA TPKE++YTELVD LRGS IGSGSQVASQETYGTLGAIV+S+TG +Q
Sbjct: 179 VVEFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQ 238
Query: 236 VGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPET 295
VGFLTNRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPET
Sbjct: 239 VGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPET 298
Query: 296 FVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLT 355
FVRADGAFIPFAEDFN+NNVTT+VKG+GEIG++H DLQSPINSLIGR+V+KVGRSSGLT
Sbjct: 299 FVRADGAFIPFAEDFNMNNVTTTVKGIGEIGNIHATDLQSPINSLIGRKVVKVGRSSGLT 358
Query: 356 TGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG--QNGEKPRPVGI 413
TGT+MAYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILL + EKPRPVGI
Sbjct: 359 TGTIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGI 418
Query: 414 IWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNA-SAA 472
IWGGTANRGRLKLKVG+ P NWTSGVDLGR+L+LLELDLI +NEG QAAV +QRN A
Sbjct: 419 IWGGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCA 478
Query: 473 AIESTVGESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGI 532
I+STV ES P S+ KT E EP NLN+QQ L + +S IH EF +ED +
Sbjct: 479 GIDSTVVESSPGVCNISRCKTGENFEPINLNVQQVLREEDSSN------IHPEFQIEDVL 532
Query: 533 ESSSNV-GHQFIPSFT--GRSPMHQNNAQENKGSKSLSALRNGPDEDNY-VSLQLGEPEP 588
ES++ + HQFIPS + G S + N EN SK+LS+L+ D SLQLGE +
Sbjct: 533 ESAAMIEEHQFIPSSSNNGYSLHQKINGPENLESKNLSSLKTNSSGDEIGFSLQLGESDT 592
Query: 589 KRRKHSDTSLNVQE 602
K+RK +D+ QE
Sbjct: 593 KKRKRTDSPDGSQE 606
>gi|357152457|ref|XP_003576125.1| PREDICTED: uncharacterized protein LOC100833303 [Brachypodium
distachyon]
Length = 598
Score = 825 bits (2132), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/602 (73%), Positives = 494/602 (82%), Gaps = 12/602 (1%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D+ ++GSSQSE ALD+ERN C+H + P PLQP AS GQHSES+ AYFSWPT +
Sbjct: 5 DIWKAHAGSSQSEGPALDMERNGCNH----NCCPPPLQPIASAGQHSESSVAYFSWPTST 60
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
++ +AE RANYFGNLQKGVLP LGRLP GQQATTLL+LM IRAFHSKILRRFSLGTAI
Sbjct: 61 LMHGSAEGRANYFGNLQKGVLPGHLGRLPKGQQATTLLDLMIIRAFHSKILRRFSLGTAI 120
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTD PAILVFVARKV+++WL QCLPAALEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 121 GFRIRKGTLTDTPAILVFVARKVNKKWLRPTQCLPAALEGPGGVWCDVDVVEFSYYGAPA 180
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
PTPKE+LY ELVDGLRGSDP IGSGSQVAS ETYGTLGAIV+SRTG++QVGFLTNRHVAV
Sbjct: 181 PTPKEQLYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGSKQVGFLTNRHVAV 240
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 241 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 300
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF++ NV+TSVKGVG IGD+ IDLQSPI+SLIG+QV+KVGRSSGLTTGTVMAYALEY
Sbjct: 301 ADDFDITNVSTSVKGVGIIGDIKAIDLQSPISSLIGKQVVKVGRSSGLTTGTVMAYALEY 360
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTDFLVVGENQQTFDLEGDSGSLI+LTGQ+GEKP+P+GIIWGGTANRGRLKL
Sbjct: 361 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKL 420
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQR---NASAAAIESTVGESPP 483
K GQ P NWTSGVDLGRLLDLLELDLI T+EG Q A+++QR A+AAA ST ES P
Sbjct: 421 KSGQGPENWTSGVDLGRLLDLLELDLITTSEGLQEALEEQRISLAAAAAAANSTATESSP 480
Query: 484 AEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV-GHQF 542
Q EK + EP +NIQQ DG S PF EFHV D +E +NV QF
Sbjct: 481 VATPQENEKVDKIYEPLGINIQQLPRDG-SANLTDQPFGSDEFHV-DTVEGMNNVEERQF 538
Query: 543 IPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNVQE 602
IP+ G SPM N + N G +LS L N P ED SL LGE EPKR + SD++L++
Sbjct: 539 IPNLIGMSPMRDNAREGNGGLDNLSELENSP-EDICFSLHLGEREPKRLR-SDSTLDIDL 596
Query: 603 SK 604
K
Sbjct: 597 QK 598
>gi|225462187|ref|XP_002267587.1| PREDICTED: uncharacterized protein LOC100261226 [Vitis vinifera]
Length = 603
Score = 818 bits (2112), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/595 (68%), Positives = 483/595 (81%), Gaps = 3/595 (0%)
Query: 1 MEKNRWDLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYF 60
M++ + +LR + SGS+ SEESA + ERN C H +LPSSS LQPFAS GQHSESNAAYF
Sbjct: 1 MDQTKLNLRLRCSGSTLSEESAPNQERNCCCHSHLPSSSLPTLQPFASAGQHSESNAAYF 60
Query: 61 SWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRF 120
SWPT SRLNDAAE+RANYF NLQK VL ET G LP GQQAT+LLE+MTIRAFHSKILR +
Sbjct: 61 SWPTSSRLNDAAEERANYFSNLQKAVLSETPGPLPKGQQATSLLEVMTIRAFHSKILRCY 120
Query: 121 SLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS 180
SLGTAIGFRIRRG+LTDIPAILVFV+RKVH+QWL+ +QC P LEGPGG+WCDVDVVEF+
Sbjct: 121 SLGTAIGFRIRRGMLTDIPAILVFVSRKVHKQWLNPIQCFPNVLEGPGGLWCDVDVVEFA 180
Query: 181 YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
Y+GAP PKE+ YTE++D LRG DPCIGSGSQVASQ+ +GTLGAIVRS+TGN+QVGFLT
Sbjct: 181 YFGAPELAPKEQYYTEIMDDLRGGDPCIGSGSQVASQDGFGTLGAIVRSQTGNRQVGFLT 240
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
NRHVAV+LDYP+QKMFHPLPP+LGPGVYLGAVERATSFITDDLW+GIFAG NPETFVRAD
Sbjct: 241 NRHVAVNLDYPSQKMFHPLPPTLGPGVYLGAVERATSFITDDLWFGIFAGINPETFVRAD 300
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
GAFIPFA+DF+++ +TT VKGVGEIGDV IDLQSP+NS+IG+QV+KVGRSSGLTTGT+
Sbjct: 301 GAFIPFADDFDMSTITTLVKGVGEIGDVKKIDLQSPMNSIIGKQVVKVGRSSGLTTGTIF 360
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEY DE+G+C TD +VVGENQQTFDLEGDSGSLI+LTGQ+GEK RP+GIIWGG N
Sbjct: 361 AYALEYIDERGMCLLTDLIVVGENQQTFDLEGDSGSLIVLTGQDGEKARPIGIIWGGNGN 420
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGE 480
RGR+KLK G P NWTS VD+GRLL+LLELDLI T+EG + A+Q+Q ASA AI STVG+
Sbjct: 421 RGRVKLKAGLPLENWTSAVDIGRLLNLLELDLITTSEGLRVALQEQMAASATAIGSTVGD 480
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQD-LVDGESEQGPTPPFIHTEFHVEDGIESSSNVG 539
S P ++ K++ E+ E IQ D DG P + EF +EDG+
Sbjct: 481 SSPQDKMLPKDRAEEKFESEGFQIQHDPWDDGLGSPDLNRPLVEAEFLLEDGVRVCPCFE 540
Query: 540 HQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDN--YVSLQLGEPEPKRRK 592
HQFIPSF P+H+N Q ++LS+L++ DED+ +SLQLG+ EPKR +
Sbjct: 541 HQFIPSFPEAPPLHENIEQARVTPENLSSLKHDTDEDDGAAISLQLGDHEPKRTR 595
>gi|297826993|ref|XP_002881379.1| hypothetical protein ARALYDRAFT_902611 [Arabidopsis lyrata subsp.
lyrata]
gi|297327218|gb|EFH57638.1| hypothetical protein ARALYDRAFT_902611 [Arabidopsis lyrata subsp.
lyrata]
Length = 577
Score = 814 bits (2103), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/596 (72%), Positives = 481/596 (80%), Gaps = 25/596 (4%)
Query: 1 MEKNRWDLRF-QNSGSSQSEESALDLERNY-CHHPNLPSSSPSPL-QPFASGGQHSESNA 57
M W RF Q + SS+SE+SALDLERN+ C+H +LPSSS QPF QH+ESNA
Sbjct: 1 MTLGAWGQRFIQAAASSESEDSALDLERNHHCNHLSLPSSSTPSPLQPFTFNIQHAESNA 60
Query: 58 AYFSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKIL 117
YFSWPTLSRLNDA EDRANYFGNLQKGVLPET+GRLP+GQQATTLLELMTIRAFHSKIL
Sbjct: 61 PYFSWPTLSRLNDAVEDRANYFGNLQKGVLPETVGRLPSGQQATTLLELMTIRAFHSKIL 120
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
RRFSLGTA+GFRI RGVLT++PAILVFVARKVHRQWL+ +QCLP+ALEGPGGVWCDVDVV
Sbjct: 121 RRFSLGTAVGFRISRGVLTNVPAILVFVARKVHRQWLNPMQCLPSALEGPGGVWCDVDVV 180
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVG 237
EF YYGAPA TP E++Y ELVDGLRGSDPCIGSGSQVASQETYGTLGAIV+SRTGN QVG
Sbjct: 181 EFQYYGAPAATPNEQVYNELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNHQVG 240
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 297
FLTNRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDD WYGIFAGTNPETFV
Sbjct: 241 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDQWYGIFAGTNPETFV 300
Query: 298 RADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTG 357
RADGAFIPFAEDFN +NVTT +KG+GEIG+VH+IDLQSPI+SLIG+QV+KVGRSSG TTG
Sbjct: 301 RADGAFIPFAEDFNTSNVTTMIKGIGEIGNVHVIDLQSPIDSLIGKQVVKVGRSSGYTTG 360
Query: 358 TVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGG 417
T+MAYALEYNDEKGICF TDFLV+GENQQTFDLEGDSGSLILLTG NG+KPRPVGIIWGG
Sbjct: 361 TIMAYALEYNDEKGICFLTDFLVIGENQQTFDLEGDSGSLILLTGPNGQKPRPVGIIWGG 420
Query: 418 TANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIEST 477
TANRG+LKL GQ P NWTSGVDLGRLLDLLELDLI +N +AA +++RN S A++ST
Sbjct: 421 TANRGKLKLIAGQEPENWTSGVDLGRLLDLLELDLITSNHELEAAAREERNTSVTALDST 480
Query: 478 VGESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSN 537
V +S P + S EK E E PFI EF +E+ I+ +
Sbjct: 481 VSQSSPPDPVPSGEKQDESFE---------------------PFIPHEFRIEEAIKPTPE 519
Query: 538 V-GHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRK 592
V H FI + QE +L AL+N +E+ VSL LGEP+ K+ K
Sbjct: 520 VEEHIFIAPISVNESTSAIKGQEKPKLDNLMALKNSSEEEVNVSLHLGEPKLKKPK 575
>gi|125561508|gb|EAZ06956.1| hypothetical protein OsI_29197 [Oryza sativa Indica Group]
Length = 590
Score = 812 bits (2097), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/604 (73%), Positives = 495/604 (81%), Gaps = 24/604 (3%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D+ ++GSSQSE SALD+ERN C+H + PSPLQP ASGGQHSES+AAYFSWPT +
Sbjct: 5 DIWKAHAGSSQSEGSALDMERNGCNH----NCCPSPLQPIASGGQHSESSAAYFSWPTST 60
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
++ +AE RANYFGNLQKGVLP LGRLPTGQ+ATTLL+LM IRAFHSKILRRFSLGTAI
Sbjct: 61 LMHGSAEGRANYFGNLQKGVLPGHLGRLPTGQRATTLLDLMIIRAFHSKILRRFSLGTAI 120
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRI++G LTD PAILVFVARKVHR+WLS QCLPA LEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 121 GFRIKKGTLTDTPAILVFVARKVHRKWLSTTQCLPAHLEGPGGVWCDVDVVEFSYYGAPA 180
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
PTPKE+LY ELVDGLRGSDP IGSGSQVAS ETYGTLGAIV+SRTGN+QVGFLTNRHVAV
Sbjct: 181 PTPKEQLYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAV 240
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 241 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 300
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+D+++ +V TSVKGVG IGDV IDLQSPI+SLIGRQV+KVGRSSGLTTGTV+AYALEY
Sbjct: 301 ADDYDITSVNTSVKGVGVIGDVKAIDLQSPISSLIGRQVVKVGRSSGLTTGTVVAYALEY 360
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTDFLVVGENQQTFDLEGDSGSLI+LTG++GEKP+P+GIIWGGTANRGRLKL
Sbjct: 361 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLIILTGKDGEKPQPIGIIWGGTANRGRLKL 420
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQR---NASAAAIESTVGESPP 483
K GQ P NWTSGVDLGRLLDLLELDLI T+EG Q A+++QR A+AAA ST GES P
Sbjct: 421 KSGQGPENWTSGVDLGRLLDLLELDLITTSEGLQEALEEQRIILAAAAAAANSTAGESSP 480
Query: 484 AEREQSKEKTAERLEPFNLNIQQDLVDGE-SEQGPTPPFIHTEFHVEDGIESSSNV-GHQ 541
Q EK + EP +NIQQ D + GP EFHV D +E +NV Q
Sbjct: 481 VAGPQENEKVDKIYEPLGINIQQLPRDNSATSTGP------DEFHV-DTVEGVTNVEERQ 533
Query: 542 FIPSFTGRSPMHQNNAQENKGS-KSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNV 600
F+ G SP + QE G +L+ L N P ED SL LGE EPKR + SD+SL++
Sbjct: 534 FL---IGMSPARE--GQEANGDLNNLAELENSP-EDICFSLHLGEREPKRLR-SDSSLDI 586
Query: 601 QESK 604
K
Sbjct: 587 DLQK 590
>gi|115476358|ref|NP_001061775.1| Os08g0407200 [Oryza sativa Japonica Group]
gi|37572952|dbj|BAC98602.1| unknown protein [Oryza sativa Japonica Group]
gi|113623744|dbj|BAF23689.1| Os08g0407200 [Oryza sativa Japonica Group]
gi|125603365|gb|EAZ42690.1| hypothetical protein OsJ_27258 [Oryza sativa Japonica Group]
gi|215695285|dbj|BAG90476.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704499|dbj|BAG93933.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767959|dbj|BAH00188.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 590
Score = 811 bits (2094), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/604 (73%), Positives = 495/604 (81%), Gaps = 24/604 (3%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D+ ++GSSQSE SALD+ERN C+H + PSPLQP ASGGQHSES+AAYFSWPT +
Sbjct: 5 DIWKAHAGSSQSEGSALDMERNGCNH----NCCPSPLQPIASGGQHSESSAAYFSWPTST 60
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
++ +AE RANYFGNLQKGVLP LGRLPTGQ+ATTLL+LM IRAFHSKILRRFSLGTAI
Sbjct: 61 LMHGSAEGRANYFGNLQKGVLPGHLGRLPTGQRATTLLDLMIIRAFHSKILRRFSLGTAI 120
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRI++G LTD PAILVFVARKVHR+WLS QCLPA LEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 121 GFRIKKGTLTDTPAILVFVARKVHRKWLSPTQCLPAHLEGPGGVWCDVDVVEFSYYGAPA 180
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
PTPKE+LY ELVDGLRGSDP IGSGSQVAS ETYGTLGAIV+SRTGN+QVGFLTNRHVAV
Sbjct: 181 PTPKEQLYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAV 240
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 241 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 300
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+D+++ +V TSVKGVG IGDV IDLQSPI+SLIGRQV+KVGRSSGLTTGTV+AYALEY
Sbjct: 301 ADDYDITSVNTSVKGVGVIGDVKAIDLQSPISSLIGRQVVKVGRSSGLTTGTVVAYALEY 360
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTDFLVVGENQQTFDLEGDSGSLI+LTG++GEKP+P+GIIWGGTANRGRLKL
Sbjct: 361 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLIILTGKDGEKPQPIGIIWGGTANRGRLKL 420
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQR---NASAAAIESTVGESPP 483
K GQ P NWTSGVDLGRLLDLLELDLI T+EG Q A+++QR A+AAA ST GES P
Sbjct: 421 KSGQGPENWTSGVDLGRLLDLLELDLITTSEGLQEALEEQRIILAAAAAAANSTAGESSP 480
Query: 484 AEREQSKEKTAERLEPFNLNIQQDLVDGE-SEQGPTPPFIHTEFHVEDGIESSSNV-GHQ 541
Q EK + EP +NIQQ D + GP EFHV D +E +NV Q
Sbjct: 481 VAGPQENEKVDKIYEPLGINIQQLPRDNSATSTGP------DEFHV-DTVEGVTNVEERQ 533
Query: 542 FIPSFTGRSPMHQNNAQENKGS-KSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNV 600
F+ G SP + QE G +L+ L N P ED SL LGE EPKR + SD+SL++
Sbjct: 534 FL---IGMSPARE--GQEANGDLNNLAELENSP-EDICFSLHLGEREPKRLR-SDSSLDI 586
Query: 601 QESK 604
K
Sbjct: 587 DLQK 590
>gi|226858186|gb|ACO87664.1| unknown [Brachypodium sylvaticum]
Length = 598
Score = 808 bits (2087), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/589 (73%), Positives = 483/589 (82%), Gaps = 13/589 (2%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D+ ++GSSQSE ALD+ERN C+H P S LQP AS GQHSES+ AYFSWPT +
Sbjct: 5 DIWKAHAGSSQSEGPALDMERNGCNHNCCPPS----LQPIASAGQHSESSVAYFSWPTST 60
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
++ +AE RANYFGNLQKGVLP LGRLP GQQATTLL+LM IRAFHSKILRRFSLGTAI
Sbjct: 61 LMHGSAEGRANYFGNLQKGVLPGHLGRLPKGQQATTLLDLMIIRAFHSKILRRFSLGTAI 120
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTD PAILVFVARKV+++WL QCLPAALEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 121 GFRIRKGTLTDTPAILVFVARKVNKKWLGPTQCLPAALEGPGGVWCDVDVVEFSYYGAPA 180
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
PTPKE+LY ELVDGLRGSDP IGSGSQVAS ETYGTLGAIV+SRTG++QVGFLTNRHVAV
Sbjct: 181 PTPKEQLYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGSKQVGFLTNRHVAV 240
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 241 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 300
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF++ NV TSVKGVG IGD+ IDLQSPI+SLIG+QV+KVGRSSGLTTGTVMAYALEY
Sbjct: 301 ADDFDITNVGTSVKGVGIIGDIKAIDLQSPISSLIGKQVVKVGRSSGLTTGTVMAYALEY 360
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTDFLVVGENQQTFDLEGDSGSLI+LTGQ+GEKP+P+GIIWGGTANRGRLKL
Sbjct: 361 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKL 420
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQR---NASAAAIESTVGESPP 483
K GQ P NWTSGVDLGRLLDLLELDLI T+EG Q A+++QR A+A A ST ES P
Sbjct: 421 KSGQGPENWTSGVDLGRLLDLLELDLITTSEGLQEALEEQRISLAAAATAANSTATESSP 480
Query: 484 AEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTP-PFIHTEFHVEDGIESSSNV-GHQ 541
Q EK + EP +NIQQ DG + PT F EFHV D +E +NV Q
Sbjct: 481 VATPQENEKVDKIYEPLGINIQQLPRDGSAN--PTDQSFGSDEFHV-DTLEGMNNVEERQ 537
Query: 542 FIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKR 590
FIP+ G SPM N + N G +L+ + N P ED SL LGE EPKR
Sbjct: 538 FIPNLIGMSPMRDNAREGNGGLDNLAEMDNSP-EDICFSLHLGEREPKR 585
>gi|18403763|ref|NP_565798.1| trypsin-like protein [Arabidopsis thaliana]
gi|20197214|gb|AAM14975.1| expressed protein [Arabidopsis thaliana]
gi|23297468|gb|AAN12976.1| unknown protein [Arabidopsis thaliana]
gi|330253980|gb|AEC09074.1| trypsin-like protein [Arabidopsis thaliana]
Length = 579
Score = 802 bits (2071), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/598 (72%), Positives = 482/598 (80%), Gaps = 27/598 (4%)
Query: 1 MEKNRWDLRF-QNSGSSQSEESALDLERNY-CHHPNLPSSSPSPL-QPFASGGQHSESNA 57
M W RF Q + SS+SE+SALDLERN+ C+H +LPSSS QPF QH+ESNA
Sbjct: 1 MNLGAWGQRFIQAAASSESEDSALDLERNHHCNHLSLPSSSSPSPLQPFTLNIQHAESNA 60
Query: 58 AYFSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKIL 117
YFSWPTLSRLND EDRANYFGNLQKGVLPET+GRLP+GQQATTLLELMTIRAFHSKIL
Sbjct: 61 PYFSWPTLSRLNDTVEDRANYFGNLQKGVLPETVGRLPSGQQATTLLELMTIRAFHSKIL 120
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
RRFSLGTA+GFRI RGVLT++PAILVFVARKVHRQWL+ +QCLP+ALEGPGGVWCDVDVV
Sbjct: 121 RRFSLGTAVGFRISRGVLTNVPAILVFVARKVHRQWLNPMQCLPSALEGPGGVWCDVDVV 180
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVG 237
EF YYGAPA TPKE++Y ELVDGLRGSDPCIGSGSQVASQETYGTLGAIV+SRTGN QVG
Sbjct: 181 EFQYYGAPAATPKEQVYNELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNHQVG 240
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 297
FLTNRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDD WYGIFAGTNPETFV
Sbjct: 241 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDQWYGIFAGTNPETFV 300
Query: 298 RADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTG 357
RADGAFIPFAEDFN +NVTT +KG+GEIGDVH+IDLQSPI+SLIG+QV+KVGRSSG TTG
Sbjct: 301 RADGAFIPFAEDFNTSNVTTLIKGIGEIGDVHVIDLQSPIDSLIGKQVVKVGRSSGYTTG 360
Query: 358 TVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGG 417
T+MAYALEYNDEKGICF TDFLV+GENQQTFDLEGDSGSLILLTG NG+KPRPVGIIWGG
Sbjct: 361 TIMAYALEYNDEKGICFLTDFLVIGENQQTFDLEGDSGSLILLTGPNGQKPRPVGIIWGG 420
Query: 418 TANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGF--QAAVQDQRNASAAAIE 475
TANRGRLKL GQ P NWTSGVDLGRLLDLLELDLI +N AA +++RN S A++
Sbjct: 421 TANRGRLKLIAGQEPENWTSGVDLGRLLDLLELDLITSNHELEAAAAAREERNTSVTALD 480
Query: 476 STVGESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESS 535
STV +S P + S +K E EPF PP EFH+E+ I+ +
Sbjct: 481 STVSQSSPPDPVPSGDKQDESFEPF-----------------IPP----EFHIEEAIKPT 519
Query: 536 SNV-GHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRK 592
V H FI + QE +L AL+N +E+ +SL LGEP+ K+ K
Sbjct: 520 LEVEEHIFIAPISVNESTSAIKGQEIPKLDNLMALKNSSEEEVNISLHLGEPKLKKPK 577
>gi|16604659|gb|AAL24122.1| unknown protein [Arabidopsis thaliana]
Length = 579
Score = 800 bits (2065), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/598 (72%), Positives = 481/598 (80%), Gaps = 27/598 (4%)
Query: 1 MEKNRWDLRF-QNSGSSQSEESALDLERNY-CHHPNLPSSSPSPL-QPFASGGQHSESNA 57
M W RF Q + SS+SE+SALDLERN+ C+H +LPSSS QPF QH+ESNA
Sbjct: 1 MNLGAWGQRFIQAAASSESEDSALDLERNHHCNHLSLPSSSSPSPLQPFTLNIQHAESNA 60
Query: 58 AYFSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKIL 117
YFSWPTLSRLND EDRANYFGNLQKGVLPET+GRLP+GQQATTLLELMTIRAFHSKIL
Sbjct: 61 PYFSWPTLSRLNDTVEDRANYFGNLQKGVLPETVGRLPSGQQATTLLELMTIRAFHSKIL 120
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
RRFSLGTA+GFRI RGVLT++PAILVFVARKVHRQWL+ +QCLP+ALEGPGGVWCDVDVV
Sbjct: 121 RRFSLGTAVGFRISRGVLTNVPAILVFVARKVHRQWLNPMQCLPSALEGPGGVWCDVDVV 180
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVG 237
EF YYGAPA TPKE++Y ELVDGLRGSDPCIGSGSQVASQETYGTLGAIV+SRTGN QVG
Sbjct: 181 EFQYYGAPAATPKEQVYNELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNHQVG 240
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 297
FLTNRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDD WYGIFAGTNPETFV
Sbjct: 241 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDQWYGIFAGTNPETFV 300
Query: 298 RADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTG 357
RADGAFIPFAED N +NVTT +KG+GEIGDVH+IDLQSPI+SLIG+QV+KVGRSSG TTG
Sbjct: 301 RADGAFIPFAEDVNTSNVTTLIKGIGEIGDVHVIDLQSPIDSLIGKQVVKVGRSSGYTTG 360
Query: 358 TVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGG 417
T+MAYALEYNDEKGICF TDFLV+GENQQTFDLEGDSGSLILLTG NG+KPRPVGIIWGG
Sbjct: 361 TIMAYALEYNDEKGICFLTDFLVIGENQQTFDLEGDSGSLILLTGPNGQKPRPVGIIWGG 420
Query: 418 TANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGF--QAAVQDQRNASAAAIE 475
TANRGRLKL GQ P NWTSGVDLGRLLDLLELDLI +N AA +++RN S A++
Sbjct: 421 TANRGRLKLIAGQEPENWTSGVDLGRLLDLLELDLITSNHELEAAAAAREERNTSVTALD 480
Query: 476 STVGESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESS 535
STV +S P + S +K E EPF PP EFH+E+ I+ +
Sbjct: 481 STVSQSSPPDPVPSGDKQDESFEPF-----------------IPP----EFHIEEAIKPT 519
Query: 536 SNV-GHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKRRK 592
V H FI + QE +L AL+N +E+ +SL LGEP+ K+ K
Sbjct: 520 LEVEEHIFIAPISVNESTSAIKGQEIPKLDNLMALKNSSEEEVNISLHLGEPKLKKPK 577
>gi|159137849|gb|ABW89000.1| narrow leaf 1 [Oryza sativa Japonica Group]
gi|222629546|gb|EEE61678.1| hypothetical protein OsJ_16147 [Oryza sativa Japonica Group]
Length = 582
Score = 783 bits (2022), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/601 (67%), Positives = 469/601 (78%), Gaps = 28/601 (4%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D + Q SG +QSEES+LD++ H + P S PS +QP ASG H+E++AAYF WPT +
Sbjct: 5 DDKAQLSGLAQSEESSLDVD-----HQSFPCS-PS-IQPVASGCTHTENSAAYFLWPTSN 57
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYFGNLQKG+LP GRLP GQQA +LL+LMTIRAFHSKILRRFSLGTA+
Sbjct: 58 LQHCAAEGRANYFGNLQKGLLPRHPGRLPKGQQANSLLDLMTIRAFHSKILRRFSLGTAV 117
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTDIPAILVFVARKVH++WL+ QCLPA LEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 118 GFRIRKGDLTDIPAILVFVARKVHKKWLNPAQCLPAILEGPGGVWCDVDVVEFSYYGAPA 177
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
TPKE++++ELVD L GSD CIGSGSQVAS ET+GTLGAIV+ RTGN+QVGFLTN HVAV
Sbjct: 178 QTPKEQMFSELVDKLCGSDECIGSGSQVASHETFGTLGAIVKRRTGNKQVGFLTNHHVAV 237
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 238 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 297
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF+++ VTT V+GVG+IGDV +IDLQ P+NSLIGRQV KVGRSSG TTGTVMAYALEY
Sbjct: 298 ADDFDISTVTTVVRGVGDIGDVKVIDLQCPLNSLIGRQVCKVGRSSGHTTGTVMAYALEY 357
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTD LVVGEN+QTFDLEGDSGSLI+LT Q+GEKPRP+GIIWGGTANRGRLKL
Sbjct: 358 NDEKGICFFTDILVVGENRQTFDLEGDSGSLIILTSQDGEKPRPIGIIWGGTANRGRLKL 417
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGESPPAER 486
P NWTSGVDLGRLLD LELD+I TNE Q AVQ QR A AA+ S VGES
Sbjct: 418 TSDHGPENWTSGVDLGRLLDRLELDIIITNESLQDAVQQQRFALVAAVTSAVGESSGVPV 477
Query: 487 EQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV----GHQF 542
+EK E EP + IQQ + G +G E+S+ V HQF
Sbjct: 478 AIPEEKIEEIFEPLGIQIQQLPRHDVAASG------------TEGEEASNTVVNVEEHQF 525
Query: 543 IPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKR-RKHSDTSLNVQ 601
I +F G SP+ + +S++ L N +E+ +SL LG+ EPKR R S +SL+++
Sbjct: 526 ISNFVGMSPVR----DDQDAPRSITNLNNPSEEELAMSLHLGDREPKRLRSDSGSSLDLE 581
Query: 602 E 602
+
Sbjct: 582 K 582
>gi|116309879|emb|CAH66916.1| OSIGBa0126B18.9 [Oryza sativa Indica Group]
gi|125549723|gb|EAY95545.1| hypothetical protein OsI_17391 [Oryza sativa Indica Group]
Length = 588
Score = 781 bits (2016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/607 (66%), Positives = 471/607 (77%), Gaps = 34/607 (5%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D + Q SG +QSEES+LD++ H + P S PS +QP ASG H+E++AAYF WPT +
Sbjct: 5 DDKAQLSGLAQSEESSLDVD-----HQSFPCS-PS-IQPVASGCTHTENSAAYFLWPTSN 57
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYFGNLQKG+LP GRLP GQQA +LL+LMTIRAFHSKILRRFSLGTA+
Sbjct: 58 LQHCAAEGRANYFGNLQKGLLPRHPGRLPKGQQANSLLDLMTIRAFHSKILRRFSLGTAV 117
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTDIPAILVFVARKVH++WL+ QCLPA LEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 118 GFRIRKGDLTDIPAILVFVARKVHKKWLNPAQCLPAILEGPGGVWCDVDVVEFSYYGAPA 177
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
TPKE++++ELVD L GSD CIGSGSQVAS ET+GTLGAIV+ RTGN+QVGFLTNRHVAV
Sbjct: 178 QTPKEQMFSELVDKLCGSDECIGSGSQVASHETFGTLGAIVKRRTGNKQVGFLTNRHVAV 237
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 238 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 297
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF+++ VTT V+GVG+IGDV +IDLQ P+NSLIGRQV KVGRSSG TTGTVMAYALEY
Sbjct: 298 ADDFDISTVTTVVRGVGDIGDVKVIDLQCPLNSLIGRQVCKVGRSSGHTTGTVMAYALEY 357
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTD LVVGEN+QTFDLEGDSGSLI+LT Q+GEKPRP+GIIWGGTANRGRLKL
Sbjct: 358 NDEKGICFFTDILVVGENRQTFDLEGDSGSLIILTSQDGEKPRPIGIIWGGTANRGRLKL 417
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQA------AVQDQRNASAAAIESTVGE 480
P NWTSGVDLGRLLD LELD+I TNE Q AVQ QR A AA+ S VGE
Sbjct: 418 TSDHGPENWTSGVDLGRLLDRLELDIIITNESLQEFAYYKDAVQQQRFALVAAVTSAVGE 477
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV-- 538
S A +EK E EP + IQQ + G +G E+S+ V
Sbjct: 478 SSGAPVAIPEEKVEEIFEPLGIQIQQLPRHDVAASG------------TEGEEASNTVVN 525
Query: 539 --GHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKR-RKHSD 595
HQFI +F G SP+ + +S++ L N +E+ +SL LG+ EPKR R S
Sbjct: 526 VEEHQFISNFVGMSPVR----DDQDAPRSITNLNNPSEEELAMSLHLGDREPKRLRSDSG 581
Query: 596 TSLNVQE 602
+SL++++
Sbjct: 582 SSLDLEK 588
>gi|242077610|ref|XP_002448741.1| hypothetical protein SORBIDRAFT_06g032440 [Sorghum bicolor]
gi|241939924|gb|EES13069.1| hypothetical protein SORBIDRAFT_06g032440 [Sorghum bicolor]
Length = 579
Score = 779 bits (2011), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/604 (70%), Positives = 478/604 (79%), Gaps = 35/604 (5%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D+ ++GSSQSE S LD+ERN C H + PSPLQP AS GQHSES+AAYFSWPT +
Sbjct: 5 DIWKAHAGSSQSEGSGLDMERNGCSH----NCCPSPLQPIASAGQHSESSAAYFSWPTST 60
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
++ +AE RANYFGNLQKGVLP LGRLP GQQATTLL+LM IRAFHSKILRRFSLGTAI
Sbjct: 61 LMHGSAEGRANYFGNLQKGVLPGHLGRLPKGQQATTLLDLMIIRAFHSKILRRFSLGTAI 120
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTD PAILVFVARKVHR+WLS QCLPAALEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 121 GFRIRKGTLTDTPAILVFVARKVHRKWLSPTQCLPAALEGPGGVWCDVDVVEFSYYGAPA 180
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
PTPKE+LY ELVDGLRGSDP +GSGSQVAS ETYGTLGAIV+SRTGN+QVGFLTNRHVAV
Sbjct: 181 PTPKEQLYDELVDGLRGSDPIVGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAV 240
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 241 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 300
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF++ +V+TSVKGVG IGDV IDLQSPI SLIGRQV+KVGRSSGLTTGTV+AYALEY
Sbjct: 301 ADDFDITSVSTSVKGVGVIGDVKAIDLQSPIGSLIGRQVVKVGRSSGLTTGTVVAYALEY 360
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTDFLVVGENQQTFDLEGDSGSLI+LTGQ+GEKP+P+GIIWGGTANRGRLKL
Sbjct: 361 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKL 420
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRN----ASAAAIESTVGESP 482
K GQ P NWTSGVDLGRLLDLLELDLI T+EG QAA+ +Q+ A+A A ST ES
Sbjct: 421 KSGQGPENWTSGVDLGRLLDLLELDLITTSEGLQAAIDEQKKTLAAAAAVATNSTATESS 480
Query: 483 PAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGHQF 542
P Q +K + EP +NI DG S++ ++
Sbjct: 481 PVGGPQENDKIDKIYEPLGINIIP----------------------RDGSAISTDQPNEN 518
Query: 543 IPSFTGRSPMHQNNAQENKGSKSLSAL--RNGPDEDNYVSLQLGEPEPKRRKHSDTSLNV 600
+ SPM +N + N +L L N PD + ++L LGE EPKR + +D+ L++
Sbjct: 519 MEELNLMSPM-RNGEESNGELNNLLDLESENSPDGIS-IALNLGEREPKRLR-TDSMLDI 575
Query: 601 QESK 604
K
Sbjct: 576 DLQK 579
>gi|293335623|ref|NP_001168357.1| uncharacterized protein LOC100382125 [Zea mays]
gi|223942135|gb|ACN25151.1| unknown [Zea mays]
gi|223947737|gb|ACN27952.1| unknown [Zea mays]
gi|413919905|gb|AFW59837.1| hypothetical protein ZEAMMB73_955518 [Zea mays]
gi|413919906|gb|AFW59838.1| hypothetical protein ZEAMMB73_955518 [Zea mays]
Length = 581
Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/605 (70%), Positives = 479/605 (79%), Gaps = 35/605 (5%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D+ ++GSSQSE S LD+ERN C+H + PSPLQP AS GQHSES+AAYFSWPT +
Sbjct: 5 DIWKAHAGSSQSEASGLDMERNGCNH----NCCPSPLQPIASAGQHSESSAAYFSWPTST 60
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
++ +AE RANYFGNLQKGVLP LGRLP GQQATTLL+LM IRAFHSKILRRFSLGTAI
Sbjct: 61 LMHGSAEGRANYFGNLQKGVLPGHLGRLPNGQQATTLLDLMIIRAFHSKILRRFSLGTAI 120
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTD PAILVFVARKVHR+WLS QCLP ALEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 121 GFRIRKGTLTDTPAILVFVARKVHRKWLSPTQCLPGALEGPGGVWCDVDVVEFSYYGAPA 180
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
PTPKE+LY ELVDGLRGSDP IGSGSQVAS ETYGTLGAIV+SRTGN+QVGFLTNRHVAV
Sbjct: 181 PTPKEQLYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAV 240
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 241 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 300
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF + +V+TSVKGVG IG+V IDLQSPI SLIGRQV+KVGRSSG+TTGTV+AYALEY
Sbjct: 301 ADDFEIASVSTSVKGVGVIGNVKAIDLQSPIGSLIGRQVVKVGRSSGMTTGTVVAYALEY 360
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTDFLVVGENQQTFDLEGDSGSLI+LTGQ+GEKP+P+GIIWGGTANRGRLKL
Sbjct: 361 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKL 420
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQR----NASAAAIESTVGESP 482
K GQ P NWTSGVDLGRLLDLLELDLI T+EG QAA+++QR A+AAA ST ES
Sbjct: 421 KSGQGPENWTSGVDLGRLLDLLELDLITTSEGLQAALEEQRITLAAAAAAATNSTATESS 480
Query: 483 PAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGHQF 542
P Q +K + EP +NI DG S++ ++
Sbjct: 481 PVAGPQEDDKIDKIYEPLGINIIP----------------------RDGSAISTDQPNED 518
Query: 543 IPSFTGRSPMHQNNAQENKGSKSLSAL--RNGPDEDNYVSLQLGEPEPKR-RKHSDTSLN 599
+ SPM +N + N +L L N PD + ++L LGE EP+R R SD+ L+
Sbjct: 519 VEELNLMSPM-RNGEEGNGDFNNLMDLESENSPDGIS-IALNLGEREPERLRSVSDSMLD 576
Query: 600 VQESK 604
+ K
Sbjct: 577 IDLQK 581
>gi|38344253|emb|CAD41791.2| OSJNBa0008M17.6 [Oryza sativa Japonica Group]
Length = 588
Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/607 (66%), Positives = 469/607 (77%), Gaps = 34/607 (5%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D + Q SG +QSEES+LD++ H + P S PS +QP ASG H+E++AAYF WPT +
Sbjct: 5 DDKAQLSGLAQSEESSLDVD-----HQSFPCS-PS-IQPVASGCTHTENSAAYFLWPTSN 57
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYFGNLQKG+LP GRLP GQQA +LL+LMTIRAFHSKILRRFSLGTA+
Sbjct: 58 LQHCAAEGRANYFGNLQKGLLPRHPGRLPKGQQANSLLDLMTIRAFHSKILRRFSLGTAV 117
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTDIPAILVFVARKVH++WL+ QCLPA LEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 118 GFRIRKGDLTDIPAILVFVARKVHKKWLNPAQCLPAILEGPGGVWCDVDVVEFSYYGAPA 177
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
TPKE++++ELVD L GSD CIGSGSQVAS ET+GTLGAIV+ RTGN+QVGFLTN HVAV
Sbjct: 178 QTPKEQMFSELVDKLCGSDECIGSGSQVASHETFGTLGAIVKRRTGNKQVGFLTNHHVAV 237
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 238 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 297
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF+++ VTT V+GVG+IGDV +IDLQ P+NSLIGRQV KVGRSSG TTGTVMAYALEY
Sbjct: 298 ADDFDISTVTTVVRGVGDIGDVKVIDLQCPLNSLIGRQVCKVGRSSGHTTGTVMAYALEY 357
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTD LVVGEN+QTFDLEGDSGSLI+LT Q+GEKPRP+GIIWGGTANRGRLKL
Sbjct: 358 NDEKGICFFTDILVVGENRQTFDLEGDSGSLIILTSQDGEKPRPIGIIWGGTANRGRLKL 417
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQA------AVQDQRNASAAAIESTVGE 480
P NWTSGVDLGRLLD LELD+I TNE Q AVQ QR A AA+ S VGE
Sbjct: 418 TSDHGPENWTSGVDLGRLLDRLELDIIITNESLQEFAYYKDAVQQQRFALVAAVTSAVGE 477
Query: 481 SPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV-- 538
S +EK E EP + IQQ + G +G E+S+ V
Sbjct: 478 SSGVPVAIPEEKIEEIFEPLGIQIQQLPRHDVAASG------------TEGEEASNTVVN 525
Query: 539 --GHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKR-RKHSD 595
HQFI +F G SP+ + +S++ L N +E+ +SL LG+ EPKR R S
Sbjct: 526 VEEHQFISNFVGMSPVR----DDQDAPRSITNLNNPSEEELAMSLHLGDREPKRLRSDSG 581
Query: 596 TSLNVQE 602
+SL++++
Sbjct: 582 SSLDLEK 588
>gi|414584860|tpg|DAA35431.1| TPA: hypothetical protein ZEAMMB73_495650 [Zea mays]
Length = 581
Score = 774 bits (1999), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/601 (70%), Positives = 475/601 (79%), Gaps = 37/601 (6%)
Query: 12 NSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLSRLNDA 71
++GSSQSE S LD+ERN C+H + PSPLQP AS GQHSES+AAYFSWPT + ++ +
Sbjct: 10 HAGSSQSEGSGLDMERNGCNH----NYCPSPLQPIASAGQHSESSAAYFSWPTSTLMHGS 65
Query: 72 AEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAIGFRIR 131
AE RANYFGNLQKGVLP LGRLP GQQATTLL+LM IRAFHSKILRRFSLGTAIGFRIR
Sbjct: 66 AEGRANYFGNLQKGVLPGHLGRLPKGQQATTLLDLMIIRAFHSKILRRFSLGTAIGFRIR 125
Query: 132 RGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPAPTPKE 191
+G LTD PAILVFVARKVHR+WLS QCLP ALEGPGGVWCDVDVVEFSYYGAPAPTPKE
Sbjct: 126 KGTLTDTPAILVFVARKVHRKWLSATQCLPTALEGPGGVWCDVDVVEFSYYGAPAPTPKE 185
Query: 192 ELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYP 251
+LY ELVDGLRGSDP +GSGSQVAS ETYGTLGAIV+S+TGN+QVGFLTNRHVAVDLDYP
Sbjct: 186 QLYDELVDGLRGSDPIVGSGSQVASLETYGTLGAIVKSQTGNKQVGFLTNRHVAVDLDYP 245
Query: 252 NQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFN 311
NQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPFA+DF+
Sbjct: 246 NQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFD 305
Query: 312 LNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKG 371
+ +V+TSVKGVG IGDV IDLQS I SLIGRQV+KVGRSSGLTTGTV+AYALEYNDEKG
Sbjct: 306 ITSVSTSVKGVGVIGDVKAIDLQSSIGSLIGRQVVKVGRSSGLTTGTVVAYALEYNDEKG 365
Query: 372 ICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQP 431
ICFFTDFLVVGENQQTFDLEGDSGSLI+LTGQ+GEKP+P+GIIWGGTANRGRLKLK GQ
Sbjct: 366 ICFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKLKSGQG 425
Query: 432 PVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQR----NASAAAIESTVGESPPAERE 487
P NWTSGVDLGRLLDLLELDLI T+EG QAA+++QR A+AAA ST ES P
Sbjct: 426 PENWTSGVDLGRLLDLLELDLITTSEGLQAALEEQRITLAAAAAAATNSTATESSPVAGP 485
Query: 488 QSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGHQFIPSFT 547
Q +K + EP +NI P D S++ ++ +
Sbjct: 486 QENDKIDKIYEPLGINI-------------IP---------RDSSSISTDQPNENVEELN 523
Query: 548 GRSPMHQNNAQENKGSKSLSA---LRNGPDEDNYVSLQLGEPEPKR-RKHSDTSLNVQES 603
SPM N QE G + L N PD ++L LGE EPKR R D++L++
Sbjct: 524 LMSPMR--NGQEGNGDLNNLMDLELENSPD-GICIALNLGEREPKRLRSDFDSTLDMDLQ 580
Query: 604 K 604
K
Sbjct: 581 K 581
>gi|413919907|gb|AFW59839.1| hypothetical protein ZEAMMB73_955518 [Zea mays]
Length = 555
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/502 (79%), Positives = 439/502 (87%), Gaps = 8/502 (1%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D+ ++GSSQSE S LD+ERN C+H + PSPLQP AS GQHSES+AAYFSWPT +
Sbjct: 5 DIWKAHAGSSQSEASGLDMERNGCNH----NCCPSPLQPIASAGQHSESSAAYFSWPTST 60
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
++ +AE RANYFGNLQKGVLP LGRLP GQQATTLL+LM IRAFHSKILRRFSLGTAI
Sbjct: 61 LMHGSAEGRANYFGNLQKGVLPGHLGRLPNGQQATTLLDLMIIRAFHSKILRRFSLGTAI 120
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTD PAILVFVARKVHR+WLS QCLP ALEGPGGVWCDVDVVEFSYYGAPA
Sbjct: 121 GFRIRKGTLTDTPAILVFVARKVHRKWLSPTQCLPGALEGPGGVWCDVDVVEFSYYGAPA 180
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
PTPKE+LY ELVDGLRGSDP IGSGSQVAS ETYGTLGAIV+SRTGN+QVGFLTNRHVAV
Sbjct: 181 PTPKEQLYDELVDGLRGSDPSIGSGSQVASLETYGTLGAIVKSRTGNKQVGFLTNRHVAV 240
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 241 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 300
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF + +V+TSVKGVG IG+V IDLQSPI SLIGRQV+KVGRSSG+TTGTV+AYALEY
Sbjct: 301 ADDFEIASVSTSVKGVGVIGNVKAIDLQSPIGSLIGRQVVKVGRSSGMTTGTVVAYALEY 360
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTDFLVVGENQQTFDLEGDSGSLI+LTGQ+GEKP+P+GIIWGGTANRGRLKL
Sbjct: 361 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLIILTGQDGEKPQPIGIIWGGTANRGRLKL 420
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQR----NASAAAIESTVGESP 482
K GQ P NWTSGVDLGRLLDLLELDLI T+EG QAA+++QR A+AAA ST ES
Sbjct: 421 KSGQGPENWTSGVDLGRLLDLLELDLITTSEGLQAALEEQRITLAAAAAAATNSTATESS 480
Query: 483 PAEREQSKEKTAERLEPFNLNI 504
P Q +K + EP +NI
Sbjct: 481 PVAGPQEDDKIDKIYEPLGINI 502
>gi|148906346|gb|ABR16328.1| unknown [Picea sitchensis]
Length = 683
Score = 758 bits (1956), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/604 (68%), Positives = 478/604 (79%), Gaps = 34/604 (5%)
Query: 13 SGSSQSEESALDLER----NYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLSRL 68
SGS QSEESALD E+ N HP S SP PLQ FASGGQHSES+AA F WP +RL
Sbjct: 87 SGSMQSEESALDREQTVTGNSGRHPR--SDSP-PLQAFASGGQHSESSAACFRWPPSNRL 143
Query: 69 NDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAIGF 128
N AE+RA YFG +QK V ETL LP+G QATTLL+LMTIRAFHSKILRR+SLGTAIGF
Sbjct: 144 NGTAEERAAYFGGVQKEVDSETLEHLPSGHQATTLLDLMTIRAFHSKILRRYSLGTAIGF 203
Query: 129 RIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPAPT 188
RIR GVLT+IPAILVFVARKVH+QWL VQ LP+ LEGPGGVWCDVDVVEFSYYGAPA T
Sbjct: 204 RIREGVLTNIPAILVFVARKVHKQWLLDVQRLPSVLEGPGGVWCDVDVVEFSYYGAPAAT 263
Query: 189 PKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDL 248
PKE+LYTELV+GLRGSD IGSGSQVASQETYGTLGAIV+SRTG++QVGFLTNRHVAVDL
Sbjct: 264 PKEQLYTELVEGLRGSDQTIGSGSQVASQETYGTLGAIVKSRTGSRQVGFLTNRHVAVDL 323
Query: 249 DYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAE 308
DYPNQKMFHPLPP+LGPGVYLGAVERATSFITDDLWYGIFAG NPETFVRADGAFIPFA+
Sbjct: 324 DYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDLWYGIFAGMNPETFVRADGAFIPFAD 383
Query: 309 DFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYND 368
F+++NVTT+VKGVG++G+V ++DLQ+P+ SLIG+QV+KVGRSSGLT GT+MAYALEYND
Sbjct: 384 SFDVSNVTTTVKGVGDMGEVMLVDLQAPVGSLIGKQVVKVGRSSGLTRGTIMAYALEYND 443
Query: 369 EKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKV 428
EKGICFFTDFLVVGEN+Q FDLEGDSGSLIL+T ++GEKPRPVGIIWGGTANRGRLKLK
Sbjct: 444 EKGICFFTDFLVVGENKQAFDLEGDSGSLILVTEESGEKPRPVGIIWGGTANRGRLKLKN 503
Query: 429 GQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQ-RNASAAAIESTVGESPP---- 483
G P NWTSGVDLGRLLDLL+L++I G + AV++Q R +SA AI+STVGES P
Sbjct: 504 GSGPENWTSGVDLGRLLDLLQLEMITGAGGLREAVEEQKRWSSAVAIDSTVGESSPRGYR 563
Query: 484 ------AEREQSKEKTAERLEPFN------LNIQQDLVDGESEQGPTPPFIHTEFHVEDG 531
AE+E+++E L F+ + Q + +E P F +EF +
Sbjct: 564 IGPLTLAEKEKTEEVCP--LMQFDNDDMSSFHTQHLGIQSGAEVNPI--FRQSEFMTKLA 619
Query: 532 IESSSNVGHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPD---EDNYVSLQLGEPEP 588
E S++V HQF+ F RS H A+ K ++LSALR+G D ED + L LG+ E
Sbjct: 620 -EPSTSVEHQFMKDFH-RSLGHPEQAKSPK-CENLSALRDGKDGSSEDISIGLHLGDREA 676
Query: 589 KRRK 592
KRR+
Sbjct: 677 KRRR 680
>gi|297791289|ref|XP_002863529.1| hypothetical protein ARALYDRAFT_917030 [Arabidopsis lyrata subsp.
lyrata]
gi|297309364|gb|EFH39788.1| hypothetical protein ARALYDRAFT_917030 [Arabidopsis lyrata subsp.
lyrata]
Length = 578
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/613 (67%), Positives = 465/613 (75%), Gaps = 54/613 (8%)
Query: 1 MEKNRWDLRFQNSGSSQSEE--SALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAA 58
ME R DLRF +S SS +ALDL++N +H L SSSP LQPF SGGQH E++AA
Sbjct: 1 MEGKRLDLRFHHSVSSSQSVESAALDLDKNGYNHIKLASSSP--LQPFPSGGQHPETSAA 58
Query: 59 --YFSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKI 116
YFSWPT SRLND+AEDRANYF NLQKGVLPET LPT I
Sbjct: 59 AAYFSWPTSSRLNDSAEDRANYFANLQKGVLPETFDGLPT-------------------I 99
Query: 117 LRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDV 176
L VLT+I AILVFVARKVH+QWL+ QCLP ALEGPGGVWCDVDV
Sbjct: 100 L----------------VLTNIAAILVFVARKVHKQWLNPPQCLPTALEGPGGVWCDVDV 143
Query: 177 VEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQV 236
VEF YYGAPA TPKE++YTELVD LRGS IGSGSQVASQETYGTLGAIV+S+TG +QV
Sbjct: 144 VEFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQV 203
Query: 237 GFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETF 296
GFLTNRHVAVDLDYP+QKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETF
Sbjct: 204 GFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETF 263
Query: 297 VRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTT 356
VRADGAFIPFAEDFN+NNVTT+VKG+GEIG++H DLQSPINSLIGR+V+KVGRSSGLTT
Sbjct: 264 VRADGAFIPFAEDFNMNNVTTTVKGIGEIGNIHATDLQSPINSLIGRKVVKVGRSSGLTT 323
Query: 357 GTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG--QNGEKPRPVGII 414
GT+MAYALEYNDEKGICF TDFLVVGENQQTFDLEGDSGSLILL + EKPRPVGII
Sbjct: 324 GTIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGII 383
Query: 415 WGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNA-SAAA 473
WGGTANRGRLKLKVG+ P NWTSGVDLGR+L+LLELDLI +NEG QAAV +QRN+ A
Sbjct: 384 WGGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNSIMCAG 443
Query: 474 IESTVGESPPAEREQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIE 533
I+STV ES P S+ KT E EP NLN+QQ L + +S IH EF +ED +E
Sbjct: 444 IDSTVVESSPGVCNISRCKTGENFEPINLNVQQVLREEDSSN------IHPEFQIEDVLE 497
Query: 534 SSSNV-GHQFIPSFT--GRSPMHQNNAQENKGSKSLSALRNGPDEDNY-VSLQLGEPEPK 589
S++ + HQFIPS + G S + N EN SK+LS+L+ D SLQLGE + K
Sbjct: 498 SAAMIEEHQFIPSSSNNGYSLHQKINGPENLESKNLSSLKTNSSGDEIGFSLQLGESDTK 557
Query: 590 RRKHSDTSLNVQE 602
+RK +D+ QE
Sbjct: 558 KRKRTDSPDGSQE 570
>gi|357165942|ref|XP_003580546.1| PREDICTED: uncharacterized protein LOC100839778 [Brachypodium
distachyon]
Length = 639
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/598 (65%), Positives = 466/598 (77%), Gaps = 17/598 (2%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D R Q G +QSEES+LD+E YC+H SPS +QP ASG H+E++AAYF WPT +
Sbjct: 5 DDRMQLLGLTQSEESSLDVE-GYCYHNETFPCSPS-MQPIASGCVHTENSAAYFLWPTSN 62
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYFGNLQKG+LP G+LP GQQA +LL+LMT+RAFHSKILRRFSLGTA+
Sbjct: 63 LQHCAAEGRANYFGNLQKGLLPVLPGKLPKGQQANSLLDLMTVRAFHSKILRRFSLGTAV 122
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRI++GVLTDIPAI+VFVARKVH++WL+ QCLPA L GPGGVWCDVDVVEFSYYGAPA
Sbjct: 123 GFRIKKGVLTDIPAIIVFVARKVHKKWLNPNQCLPAILAGPGGVWCDVDVVEFSYYGAPA 182
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
TPKE++++ELV+ L GSD IGSGSQVASQ+T+GTLGAIV+ RT N+QVGFLTNRHVAV
Sbjct: 183 QTPKEQMFSELVNKLCGSDEYIGSGSQVASQDTFGTLGAIVKRRTNNRQVGFLTNRHVAV 242
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 243 DLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 302
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A+DF+++ VTT V+ VGEIGDV +IDLQ PINSLIGRQV KVGRSSG TTGTVMAYALEY
Sbjct: 303 ADDFDISTVTTIVREVGEIGDVKVIDLQCPINSLIGRQVCKVGRSSGHTTGTVMAYALEY 362
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGICFFTD LVVGEN+QTFDLEGDSGSLILLT Q+GEKP P+GIIWGGTANRGR+KL
Sbjct: 363 NDEKGICFFTDLLVVGENRQTFDLEGDSGSLILLTSQDGEKPLPIGIIWGGTANRGRIKL 422
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGESPPAER 486
P NWT+GVDLGRLLD LELDLI TNE + AVQ RNA AA+ S VGES
Sbjct: 423 TSDHGPENWTTGVDLGRLLDRLELDLIITNESLKDAVQQHRNALVAAVISAVGESSTVAA 482
Query: 487 EQSKEKTAERLEPFNLNIQQDLVDGESEQGPTPPFIHTEFHVEDGIESSSNV-GHQFIPS 545
+EK E EP + IQ Q P + ED +S++V HQFI +
Sbjct: 483 TAPEEKAEEVFEPLGIKIQ---------QLPRHDVTISATEGEDTANTSADVEEHQFISN 533
Query: 546 FTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKR-RKHSDTSLNVQE 602
F SP ++ +++ L N +E+ +SL +G+ EPKR R ++++L++++
Sbjct: 534 FGSMSPAR----RDQDTPRNIGNLNNPSEEELTMSLHVGDREPKRLRSDAESNLDLEK 587
>gi|293336302|ref|NP_001169250.1| uncharacterized protein LOC100383111 [Zea mays]
gi|223975799|gb|ACN32087.1| unknown [Zea mays]
gi|414585456|tpg|DAA36027.1| TPA: hypothetical protein ZEAMMB73_252293 [Zea mays]
Length = 582
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/601 (64%), Positives = 464/601 (77%), Gaps = 28/601 (4%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D R Q SG +QS+ES LD+E +C+H SSPS +QP ASG H+E++AAYF WPT +
Sbjct: 5 DGRTQLSGFAQSDESTLDVE-GHCYHQQSFPSSPS-MQPIASGCTHTENSAAYFLWPTSN 62
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYF NL KG+LP++ GRLP GQQA +LL+LMTIRAFHSK+LR FSLGTA+
Sbjct: 63 LQHCAAEGRANYFANLSKGLLPKS-GRLPKGQQANSLLDLMTIRAFHSKVLRCFSLGTAV 121
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+G LTDIPAIL FVARKVH++WL+ QCLPA +EGPGG+WCDVDVVEFSYYGAPA
Sbjct: 122 GFRIRKGALTDIPAILCFVARKVHKKWLNPDQCLPAIVEGPGGIWCDVDVVEFSYYGAPA 181
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
PK +++TELVD L GSD CIGSGSQVASQ+T+GTLGAIV+ RTGN+Q+GFLTNRHVAV
Sbjct: 182 QNPKVQMFTELVDKLCGSDECIGSGSQVASQDTFGTLGAIVKRRTGNKQIGFLTNRHVAV 241
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKM+HPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 242 DLDYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 301
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A DF+++ VTT+V+GVG+IGDV +IDLQSP+NSLIGRQV K+GRSSG TTGTV+AYALEY
Sbjct: 302 AHDFDISTVTTTVRGVGDIGDVKVIDLQSPLNSLIGRQVCKIGRSSGHTTGTVVAYALEY 361
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGI FFTD LVVGEN+QTFDLEGDSGSLI+LTGQ+ EKP P+GIIWGGTANRGRLKL
Sbjct: 362 NDEKGISFFTDLLVVGENRQTFDLEGDSGSLIILTGQDNEKPCPIGIIWGGTANRGRLKL 421
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGESPPAER 486
+ P NWTSGVDLGRLLD LELDLI TNE + AVQ QR A AA S VGES A
Sbjct: 422 RCDHGPENWTSGVDLGRLLDRLELDLIITNESLKDAVQQQRLALVAAANSAVGESSTAAV 481
Query: 487 EQSKEKTAERLEPFNLNIQQ----DLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGHQF 542
+EK E EP + I+Q D+ + +G I+ E QF
Sbjct: 482 PAPEEKV-EIFEPLGIKIEQLPRHDV--SATTEGDEAAVINVE-------------ERQF 525
Query: 543 IPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGEPEPKR-RKHSDTSLNVQ 601
I +F G SP+ + + ++ L N +E+ +SL LG+ E KR R +++ L+++
Sbjct: 526 ISNFVGMSPVR----DDQDAPRQIANLNNPSEEELAMSLHLGDREAKRLRTDTESELDLE 581
Query: 602 E 602
+
Sbjct: 582 K 582
>gi|242074316|ref|XP_002447094.1| hypothetical protein SORBIDRAFT_06g028460 [Sorghum bicolor]
gi|241938277|gb|EES11422.1| hypothetical protein SORBIDRAFT_06g028460 [Sorghum bicolor]
Length = 607
Score = 735 bits (1897), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/623 (62%), Positives = 464/623 (74%), Gaps = 47/623 (7%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D R Q SG +QS+ES LD+E +C+H SPS +QP ASG H+E++AAYF WPT +
Sbjct: 5 DDRAQLSGFAQSDESTLDVE-GHCYHQQSFPCSPS-MQPIASGCTHTENSAAYFLWPTSN 62
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYF NL KG+LP++ G+LP GQQA +LL+LMTIRAFHSKILR FSLGTA+
Sbjct: 63 LQHCAAEGRANYFANLSKGLLPKS-GKLPKGQQANSLLDLMTIRAFHSKILRCFSLGTAV 121
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+GVLTDIPAIL FVARKVH++WL+ QCLPA +EGPGG+WCDVDVVEFSYYGAPA
Sbjct: 122 GFRIRKGVLTDIPAILCFVARKVHKKWLNPTQCLPAIVEGPGGIWCDVDVVEFSYYGAPA 181
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQV-----------------------ASQETYGTL 223
TPKE+++TELVD L GSD CIGSGSQV ASQ+T+GTL
Sbjct: 182 QTPKEQMFTELVDKLCGSDECIGSGSQVLAKIDLNYLKVADKDSWNDAMAVASQDTFGTL 241
Query: 224 GAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDL 283
GAIV+ RTGN+Q+GFLTNRHVAVDLDYPNQKM+HPLPP+LGPGVYLGAVERATSFITDD+
Sbjct: 242 GAIVKRRTGNKQIGFLTNRHVAVDLDYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDV 301
Query: 284 WYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGR 343
WYGI+AGTNPETFVRADGAFIPFA DF+++ V+T+V+GVG+IGDV IDLQ P+NSLIGR
Sbjct: 302 WYGIYAGTNPETFVRADGAFIPFAHDFDISTVSTTVRGVGDIGDVKFIDLQCPLNSLIGR 361
Query: 344 QVMKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQ 403
QV K+GRSSG TTGTVMAYALEYNDEKGI FFTD LVVGEN+QTFDLEGDSGSLI+LTGQ
Sbjct: 362 QVCKIGRSSGHTTGTVMAYALEYNDEKGISFFTDLLVVGENRQTFDLEGDSGSLIILTGQ 421
Query: 404 NGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAV 463
+ EKPRP+GIIWGGTANRGRLKL+ P NWTSGVDLGRLLD LELDLI T+E + AV
Sbjct: 422 DSEKPRPIGIIWGGTANRGRLKLRCDHGPENWTSGVDLGRLLDRLELDLIITSESLKDAV 481
Query: 464 QDQRNASAAAIESTVGESPPAEREQSKEKTAERLEPFNLNIQQ---DLVDGESEQGPTPP 520
Q QR A AA S VGES A +EK E EP + I+Q V +G
Sbjct: 482 QQQRLAMVAAANSAVGESSTAAVPVPEEKVEELYEPLGIKIEQLPRHDVSASGTEGEEAA 541
Query: 521 FIHTEFHVEDGIESSSNVGHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVS 580
++ E QFI +F G SP+ + + ++ L N +E+ +S
Sbjct: 542 VVNVE-------------ERQFISNFVGMSPVR----GDQDAPRQIANLNNPSEEELAMS 584
Query: 581 LQLGEPEPKR-RKHSDTSLNVQE 602
L LG+ EPKR R +++ L++++
Sbjct: 585 LHLGDREPKRLRTDTESDLDLEK 607
>gi|413919513|gb|AFW59445.1| hypothetical protein ZEAMMB73_623071 [Zea mays]
Length = 566
Score = 724 bits (1869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/583 (65%), Positives = 448/583 (76%), Gaps = 26/583 (4%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D R Q SG +QS+ES LD+E + CH P+ P S PS +QP SG H+E++AAYF WPT +
Sbjct: 5 DDRAQLSGFAQSDESTLDVEGHCCHQPSFPCS-PS-MQPIVSGCTHTENSAAYFLWPTSN 62
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYF NL KG+LP+ RLP GQQA +LL+LMTIRAFHSK+LR F LGTA+
Sbjct: 63 LQHCAAEGRANYFANLSKGLLPKIGRRLPKGQQANSLLDLMTIRAFHSKVLRCFGLGTAV 122
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+GVLTDIPAIL FVARKVH++WL CLPA L GPGG+WCDVDVVEFSYYGAPA
Sbjct: 123 GFRIRKGVLTDIPAILCFVARKVHKKWLDPAHCLPAILAGPGGIWCDVDVVEFSYYGAPA 182
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
TPK +++TELVD L GSD CIGSGSQVASQ+T+GTLGAIV+ RTGN+ VGF+TNRHVAV
Sbjct: 183 QTPKVQIFTELVDKLCGSDECIGSGSQVASQDTFGTLGAIVKRRTGNKLVGFVTNRHVAV 242
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKM+HPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 243 DLDYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 302
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A DF+++ VTT+V+GVG+IGDV +IDLQ P+N LIGR+V K+GRSSG TTGTVMAYALEY
Sbjct: 303 AHDFDISTVTTTVRGVGDIGDVKVIDLQCPLNRLIGRRVCKIGRSSGHTTGTVMAYALEY 362
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGI FFTD LVVGEN+QTFDLEGDSGSLI+LTGQ+ EKPRP+GIIWGGTANRGRLKL
Sbjct: 363 NDEKGISFFTDLLVVGENRQTFDLEGDSGSLIILTGQDSEKPRPIGIIWGGTANRGRLKL 422
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGESPPAER 486
+ P NWTSGVDLGRLLD LELDLI T+E + AVQ QR A AAA S GES A
Sbjct: 423 RCDHGPQNWTSGVDLGRLLDRLELDLIITSESLKDAVQQQRRALAAAANSAAGESSTAAA 482
Query: 487 EQSKEKTAERLEPFNLNIQQ----DLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGHQF 542
+EK E EP + I+Q D+ E+E+ +VE+ QF
Sbjct: 483 PVLEEKVEEIFEPLGIKIEQLRRHDVSASEAEEA-------AGINVEE---------RQF 526
Query: 543 IPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGE 585
I +F GRSP+ + + ++ L N +E+ + L LG+
Sbjct: 527 ISNFVGRSPVRDDQG----APRQIANLNNPSEEELAMLLHLGD 565
>gi|15230650|ref|NP_187901.1| trypsin-like protein [Arabidopsis thaliana]
gi|15795124|dbj|BAB02502.1| unnamed protein product [Arabidopsis thaliana]
gi|45773814|gb|AAS76711.1| At3g12950 [Arabidopsis thaliana]
gi|52627109|gb|AAU84681.1| At3g12950 [Arabidopsis thaliana]
gi|332641744|gb|AEE75265.1| trypsin-like protein [Arabidopsis thaliana]
Length = 558
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/564 (64%), Positives = 435/564 (77%), Gaps = 38/564 (6%)
Query: 46 FASGGQHSESNAA-YFSWPTLSRLNDAAEDRANYFGNLQKG------VLPETLGRLPTGQ 98
+ S GQH E AA YFSWPT SRL++AAE+RANYF NLQK V PE + P GQ
Sbjct: 4 YGSTGQHCEFTAASYFSWPTSSRLSNAAEERANYFSNLQKEEDDDDEVSPEPVSTEPKGQ 63
Query: 99 QATTLLELMTIRAFHSKILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQ 158
+ATTLLELMTIRAFHSK+LR +SLGTAIGFRIRRGVLTDIPAI+VFV+RKVH+QWLS +Q
Sbjct: 64 RATTLLELMTIRAFHSKMLRCYSLGTAIGFRIRRGVLTDIPAIIVFVSRKVHKQWLSPLQ 123
Query: 159 CLPAALEGPGGVWCDVDVVEFSYYGAP--APTPKEELYTELVDGLRGSDPCIGSGSQVAS 216
CLP ALEG GG+WCDVDVVEFSY+G P PTPK+ T++VD L+GSDP IGSGSQVAS
Sbjct: 124 CLPTALEGAGGIWCDVDVVEFSYFGEPDHQPTPKQTFTTDIVDHLQGSDPFIGSGSQVAS 183
Query: 217 QETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERAT 276
QET GTLGAIVRS+TG +QVGF+TNRHVAV+LDYP+QKMFHPLPP+LGPGVYLGAVERAT
Sbjct: 184 QETCGTLGAIVRSQTGGRQVGFVTNRHVAVNLDYPSQKMFHPLPPALGPGVYLGAVERAT 243
Query: 277 SFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVK-GVGEIGDVHIIDLQS 335
SFITDDLW+GIFAGTNPETFVRADGAFIPFA+D++L+ VTTSVK GVGEIG+V I+LQS
Sbjct: 244 SFITDDLWFGIFAGTNPETFVRADGAFIPFADDYDLSRVTTSVKGGVGEIGEVKAIELQS 303
Query: 336 PINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQT-FDLEGDS 394
P+ SL+G+QV+KVGRSSGLTTGTV+AYALEYNDE+G+CF TDFLVVGEN ++ FDLEGDS
Sbjct: 304 PVGSLVGKQVVKVGRSSGLTTGTVLAYALEYNDERGVCFLTDFLVVGENHRSPFDLEGDS 363
Query: 395 GSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIA 454
GSLI++ G+ EK RP+GIIWGGT +RGRLKLKVG+ P +WT+GVDLGRLL L+LDLI
Sbjct: 364 GSLIVMKGE--EKARPIGIIWGGTGSRGRLKLKVGECPESWTTGVDLGRLLTHLQLDLIT 421
Query: 455 TNEGFQAAVQDQRNASAAAIESTVGESPPAEREQSKEKTA--ERLEPFNLNIQQDLVDGE 512
T+EG +AAVQ+QR AS + S V +S P KEK + E+LE +Q +D
Sbjct: 422 TDEGLKAAVQEQRAASTTGMSSMVADSSPPYVNLKKEKRSPEEKLEASLGPLQVQHID-- 479
Query: 513 SEQGPTPPFIHTEFHVEDGIES---SSNVGHQFIPSFTGRSPMHQNNAQENKGSKSLSAL 569
+E+ IE+ + +V HQF+P+F+G+ + E ++
Sbjct: 480 ---------------LEERIETKGGAPSVEHQFMPTFSGQC--SASAWPETAREDLVAGF 522
Query: 570 RNGP-DEDNYVSLQLGEPEPKRRK 592
NG D D V L+LG+ KRR+
Sbjct: 523 TNGSCDGDLCVGLRLGDDGAKRRR 546
>gi|297834104|ref|XP_002884934.1| hypothetical protein ARALYDRAFT_478657 [Arabidopsis lyrata subsp.
lyrata]
gi|297330774|gb|EFH61193.1| hypothetical protein ARALYDRAFT_478657 [Arabidopsis lyrata subsp.
lyrata]
Length = 558
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/568 (63%), Positives = 437/568 (76%), Gaps = 40/568 (7%)
Query: 43 LQPFASGGQHSESNAA-YFSWPTLSRLNDAAEDRANYFGNLQKG------VLPETLGRLP 95
+ + S GQH E AA YFSWPT SRL++AAE+RANYF NLQK V PE P
Sbjct: 1 MHQYGSTGQHCEFTAASYFSWPTSSRLSNAAEERANYFSNLQKEEEEDEEVSPEPASTDP 60
Query: 96 TGQQATTLLELMTIRAFHSKILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLS 155
GQ+ATTLLELMTIRAFHSKILR +SLGTAIGFRIRRGVLTDIPAI+VFV+RKVH+QWLS
Sbjct: 61 KGQRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRRGVLTDIPAIIVFVSRKVHKQWLS 120
Query: 156 HVQCLPAALEGPGGVWCDVDVVEFSYYGAP--APTPKEELYTELVDGLRGSDPCIGSGSQ 213
+QCLP ALEG GG+WCDVDVVEFSY+G P PTPK+ T++VD L+GSDP IGSGSQ
Sbjct: 121 PLQCLPTALEGAGGIWCDVDVVEFSYFGEPDHQPTPKQTFTTDIVDHLQGSDPFIGSGSQ 180
Query: 214 VASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVE 273
VASQET GTLGAIVRS+TG++QVGF+TNRHVAV+LDYP+QKMFHPLPP+LGPGVYLGAVE
Sbjct: 181 VASQETCGTLGAIVRSQTGSRQVGFVTNRHVAVNLDYPSQKMFHPLPPALGPGVYLGAVE 240
Query: 274 RATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVK-GVGEIGDVHIID 332
RATSFITDDLW+GIFAGTNPETFVRADGAFIPFA+D++L+ VTTSVK GVGEIG+V I+
Sbjct: 241 RATSFITDDLWFGIFAGTNPETFVRADGAFIPFADDYDLSRVTTSVKGGVGEIGEVKAIE 300
Query: 333 LQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQT-FDLE 391
LQSP+ SL+G+QV+KVGRSSGLTTGTV+AYALEYNDEKG+CF TDFLVVGEN ++ FDLE
Sbjct: 301 LQSPVGSLVGKQVVKVGRSSGLTTGTVLAYALEYNDEKGVCFLTDFLVVGENHRSPFDLE 360
Query: 392 GDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELD 451
GDSGSLI++ G+ EK RP+GIIWGGT +RGRLKLKVG+ P +WT+GVDLGRLL L+LD
Sbjct: 361 GDSGSLIVMKGE--EKARPIGIIWGGTGSRGRLKLKVGECPESWTTGVDLGRLLTHLQLD 418
Query: 452 LIATNEGFQAAVQDQRNASAAAIESTVGESPP--AEREQSKEKTAERLEPFNLNIQQDLV 509
LI T+EG +AAVQ+QR AS + S V +S P ++ K E++E +Q +
Sbjct: 419 LITTDEGLKAAVQEQRAASTTGMSSMVADSSPPYVNLKKGKRNPEEKVEASLGPLQVQHI 478
Query: 510 DGESEQGPTPPFIHTEFHVEDGIES---SSNVGHQFIPSFTGRSPMHQNNAQENKGSKSL 566
D +E+ IE+ + +V HQF+P+F+G+ +A + L
Sbjct: 479 D-----------------LEERIETKGGAPSVEHQFMPTFSGQC---SASAWPETAREDL 518
Query: 567 SA-LRNGP-DEDNYVSLQLGEPEPKRRK 592
+ L NG D D V L+LG+ KRR+
Sbjct: 519 AVGLTNGSCDGDLCVGLRLGDDGAKRRR 546
>gi|296082780|emb|CBI21785.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/494 (68%), Positives = 400/494 (80%), Gaps = 3/494 (0%)
Query: 107 MTIRAFHSKILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEG 166
MTIRAFHSKILR +SLGTAIGFRIRRG+LTDIPAILVFV+RKVH+QWL+ +QC P LEG
Sbjct: 1 MTIRAFHSKILRCYSLGTAIGFRIRRGMLTDIPAILVFVSRKVHKQWLNPIQCFPNVLEG 60
Query: 167 PGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAI 226
PGG+WCDVDVVEF+Y+GAP PKE+ YTE++D LRG DPCIGSGSQVASQ+ +GTLGAI
Sbjct: 61 PGGLWCDVDVVEFAYFGAPELAPKEQYYTEIMDDLRGGDPCIGSGSQVASQDGFGTLGAI 120
Query: 227 VRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG 286
VRS+TGN+QVGFLTNRHVAV+LDYP+QKMFHPLPP+LGPGVYLGAVERATSFITDDLW+G
Sbjct: 121 VRSQTGNRQVGFLTNRHVAVNLDYPSQKMFHPLPPTLGPGVYLGAVERATSFITDDLWFG 180
Query: 287 IFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVM 346
IFAG NPETFVRADGAFIPFA+DF+++ +TT VKGVGEIGDV IDLQSP+NS+IG+QV+
Sbjct: 181 IFAGINPETFVRADGAFIPFADDFDMSTITTLVKGVGEIGDVKKIDLQSPMNSIIGKQVV 240
Query: 347 KVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGE 406
KVGRSSGLTTGT+ AYALEY DE+G+C TD +VVGENQQTFDLEGDSGSLI+LTGQ+GE
Sbjct: 241 KVGRSSGLTTGTIFAYALEYIDERGMCLLTDLIVVGENQQTFDLEGDSGSLIVLTGQDGE 300
Query: 407 KPRPVGIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQ 466
K RP+GIIWGG NRGR+KLK G P NWTS VD+GRLL+LLELDLI T+EG + A+Q+Q
Sbjct: 301 KARPIGIIWGGNGNRGRVKLKAGLPLENWTSAVDIGRLLNLLELDLITTSEGLRVALQEQ 360
Query: 467 RNASAAAIESTVGESPPAEREQSKEKTAERLEPFNLNIQQD-LVDGESEQGPTPPFIHTE 525
ASA AI STVG+S P ++ K++ E+ E IQ D DG P + E
Sbjct: 361 MAASATAIGSTVGDSSPQDKMLPKDRAEEKFESEGFQIQHDPWDDGLGSPDLNRPLVEAE 420
Query: 526 FHVEDGIESSSNVGHQFIPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDN--YVSLQL 583
F +EDG+ HQFIPSF P+H+N Q ++LS+L++ DED+ +SLQL
Sbjct: 421 FLLEDGVRVCPCFEHQFIPSFPEAPPLHENIEQARVTPENLSSLKHDTDEDDGAAISLQL 480
Query: 584 GEPEPKRRKHSDTS 597
G+ EPKR + +S
Sbjct: 481 GDHEPKRTRLDPSS 494
>gi|302781773|ref|XP_002972660.1| hypothetical protein SELMODRAFT_98342 [Selaginella moellendorffii]
gi|302812925|ref|XP_002988149.1| hypothetical protein SELMODRAFT_127331 [Selaginella moellendorffii]
gi|300144255|gb|EFJ10941.1| hypothetical protein SELMODRAFT_127331 [Selaginella moellendorffii]
gi|300159261|gb|EFJ25881.1| hypothetical protein SELMODRAFT_98342 [Selaginella moellendorffii]
Length = 454
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/417 (73%), Positives = 355/417 (85%), Gaps = 5/417 (1%)
Query: 27 RNYCHHPNLPSSSPS----PLQPFASGGQHSESNAAYFSWPTLSRLNDAAEDRANYFGNL 82
+++ ++P S P PLQ ASGGQHSES+AAY WP +R+N AE+RA YF L
Sbjct: 18 KDWTYYPGSTSRHPRSESPPLQAVASGGQHSESSAAYVLWPP-ARINGTAEERAAYFSGL 76
Query: 83 QKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAIGFRIRRGVLTDIPAIL 142
QK +T R+P+GQQA+TLL+LMTIRAFHSK+LRR+SLGTA+GFR R GVLT+IPAI+
Sbjct: 77 QKDAEMDTQQRVPSGQQASTLLDLMTIRAFHSKVLRRYSLGTALGFRTRAGVLTNIPAII 136
Query: 143 VFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLR 202
VFVARKVH+QWL VQ LP ALEGPGGVWCDVDVVEFSYYGA TPKE++Y+ELV+GLR
Sbjct: 137 VFVARKVHKQWLLDVQRLPTALEGPGGVWCDVDVVEFSYYGASTVTPKEQIYSELVEGLR 196
Query: 203 GSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPS 262
G+DPCIGSGSQVASQETYGTLGAIVRS+TG +QVGFLTNRHVAVDLDYPNQKMFHPLPP+
Sbjct: 197 GNDPCIGSGSQVASQETYGTLGAIVRSQTGARQVGFLTNRHVAVDLDYPNQKMFHPLPPN 256
Query: 263 LGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGV 322
LGPGVYLGAVERATSFITDDLWYGIFAG NPETFVRADGAFIPFAE F+ + V+ V +
Sbjct: 257 LGPGVYLGAVERATSFITDDLWYGIFAGMNPETFVRADGAFIPFAESFDTSKVSVRVHSL 316
Query: 323 GEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVG 382
GE+G+V +DLQ+PI S++G+ V+KVGRSSGLT G +MAYA+EYNDEKGICFFTDFL+VG
Sbjct: 317 GELGEVFRVDLQAPIESIVGQHVVKVGRSSGLTKGIIMAYAVEYNDEKGICFFTDFLIVG 376
Query: 383 ENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTSGV 439
EN+Q FDLEGDSGSLI +T + E PRPVGIIWGGTANRGRLKL+ G P NWTSGV
Sbjct: 377 ENKQAFDLEGDSGSLISMTWERCENPRPVGIIWGGTANRGRLKLRSGHGPENWTSGV 433
>gi|413919512|gb|AFW59444.1| hypothetical protein ZEAMMB73_623071 [Zea mays]
Length = 516
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 344/583 (59%), Positives = 407/583 (69%), Gaps = 76/583 (13%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D R Q SG +QS+ES LD+E + CH P+ P S PS +QP SG H+E++AAYF WPT +
Sbjct: 5 DDRAQLSGFAQSDESTLDVEGHCCHQPSFPCS-PS-MQPIVSGCTHTENSAAYFLWPTSN 62
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYF NL KG+LP+ RLP GQQA +LL+LMTIRAFHSK
Sbjct: 63 LQHCAAEGRANYFANLSKGLLPKIGRRLPKGQQANSLLDLMTIRAFHSK----------- 111
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GPGG+WCDVDVVEFSYYGAPA
Sbjct: 112 ---------------------------------------GPGGIWCDVDVVEFSYYGAPA 132
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
TPK +++TELVD L GSD CIGSGSQVASQ+T+GTLGAIV+ RTGN+ VGF+TNRHVAV
Sbjct: 133 QTPKVQIFTELVDKLCGSDECIGSGSQVASQDTFGTLGAIVKRRTGNKLVGFVTNRHVAV 192
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPF 306
DLDYPNQKM+HPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPF
Sbjct: 193 DLDYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPF 252
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
A DF+++ VTT+V+GVG+IGDV +IDLQ P+N LIGR+V K+GRSSG TTGTVMAYALEY
Sbjct: 253 AHDFDISTVTTTVRGVGDIGDVKVIDLQCPLNRLIGRRVCKIGRSSGHTTGTVMAYALEY 312
Query: 367 NDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
NDEKGI FFTD LVVGEN+QTFDLEGDSGSLI+LTGQ+ EKPRP+GIIWGGTANRGRLKL
Sbjct: 313 NDEKGISFFTDLLVVGENRQTFDLEGDSGSLIILTGQDSEKPRPIGIIWGGTANRGRLKL 372
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGESPPAER 486
+ P NWTSGVDLGRLLD LELDLI T+E + AVQ QR A AAA S GES A
Sbjct: 373 RCDHGPQNWTSGVDLGRLLDRLELDLIITSESLKDAVQQQRRALAAAANSAAGESSTAAA 432
Query: 487 EQSKEKTAERLEPFNLNIQQ----DLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGHQF 542
+EK E EP + I+Q D+ E+E+ +VE+ QF
Sbjct: 433 PVLEEKVEEIFEPLGIKIEQLRRHDVSASEAEEA-------AGINVEE---------RQF 476
Query: 543 IPSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGE 585
I +F GRSP+ + + ++ L N +E+ + L LG+
Sbjct: 477 ISNFVGRSPVRDDQG----APRQIANLNNPSEEELAMLLHLGD 515
>gi|168064147|ref|XP_001784026.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664412|gb|EDQ51132.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 580 bits (1495), Expect = e-163, Method: Compositional matrix adjust.
Identities = 276/407 (67%), Positives = 340/407 (83%), Gaps = 4/407 (0%)
Query: 58 AYFSWPTLSRLNDAAEDRANYFGNLQK--GVLPETLGRLPTGQQATTLLELMTIRAFHSK 115
AY WP +L ++++RA F L+K GV+ G P GQQA+TLLELMTIRA+HSK
Sbjct: 1 AYLLWPGSDQLLGSSDERAACFIGLEKSGGVMYND-GVTPRGQQASTLLELMTIRAYHSK 59
Query: 116 ILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVD 175
LR+ LGTA+GFR RRG LT IPAI+VFVARKVH QWL +Q LP+++EGPGG+WCDVD
Sbjct: 60 SLRQCGLGTALGFRTRRGELTSIPAIIVFVARKVHTQWLHELQVLPSSVEGPGGLWCDVD 119
Query: 176 VVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQ 235
VVEFSY+G P PK++L +E++DGLRG D IGSG+QVASQETYGTLGA+V+S+TG +Q
Sbjct: 120 VVEFSYFGVPTMVPKKQLSSEILDGLRGMDATIGSGTQVASQETYGTLGALVQSQTGLRQ 179
Query: 236 VGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPET 295
+GF+TNRHVAVDLDYP QKMFHPLPP+LGPGVYLGAV+RATSF+ DDLWYGIFAG NPET
Sbjct: 180 LGFITNRHVAVDLDYPCQKMFHPLPPNLGPGVYLGAVKRATSFVKDDLWYGIFAGMNPET 239
Query: 296 FVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLT 355
FVRADGAFIPF+E F+++ VTTS+KG+G +GDV+ +DLQS I+S++GR+V+KVGRSSG+T
Sbjct: 240 FVRADGAFIPFSETFDISKVTTSIKGIGSMGDVYRVDLQSQISSIVGRKVVKVGRSSGVT 299
Query: 356 TGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQN-GEKPRPVGII 414
G +M YA+EYNDE GICF TDFL+VGE ++ FDLEGDSGSLILL+ +N EK +PVG+I
Sbjct: 300 KGVIMGYAVEYNDENGICFLTDFLIVGEKKKNFDLEGDSGSLILLSSENETEKAQPVGLI 359
Query: 415 WGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQA 461
WGGTANRGRLKL+ P NWTSGVDLGRLLD+L+LD+I T++ +
Sbjct: 360 WGGTANRGRLKLRNEHGPENWTSGVDLGRLLDILQLDIITTDQNLRG 406
>gi|168009441|ref|XP_001757414.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691537|gb|EDQ77899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 409
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 266/413 (64%), Positives = 317/413 (76%), Gaps = 5/413 (1%)
Query: 62 WPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFS 121
WPT N AE RA +F +LQK + P G QA TLL+LMTIRA HSK LR FS
Sbjct: 1 WPTPRLQNGRAEQRATHFSSLQKKT--SCPSKRPRGHQAATLLDLMTIRALHSKTLRCFS 58
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
LGTA+GFRIR GV TDIPAI+VFVARKVHR WL Q LP LEGPGGVWCDVDVVEFS
Sbjct: 59 LGTALGFRIRGGVQTDIPAIIVFVARKVHRHWLQEAQELPLILEGPGGVWCDVDVVEFSL 118
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
G+ P++ +YT+LV+GLRG D IGSGSQVA E YGTL AIVRSRTG QVGFLTN
Sbjct: 119 LGSQ--RPQDPVYTDLVEGLRGGDATIGSGSQVACFELYGTLSAIVRSRTGLCQVGFLTN 176
Query: 242 RHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADG 301
RHVAV LD+P QK+FHPLPP LGPGVYLGAVER T+FI DDLWYG+FA TNPE+FVRADG
Sbjct: 177 RHVAVSLDHPVQKLFHPLPPHLGPGVYLGAVERTTTFIRDDLWYGVFASTNPESFVRADG 236
Query: 302 AFIPFAEDFNLNN-VTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
AFIPF + ++ N ++ VK VGEIG+V +DLQ+P+NSLIG+ V+KVGRSSG T G ++
Sbjct: 237 AFIPFDSNLDVRNFISPFVKSVGEIGEVISVDLQAPLNSLIGKHVIKVGRSSGFTEGCIL 296
Query: 361 AYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
AYALEYN++KG CFF DFL+V ++ F+LEGD+GSLIL+ G+ GEKPRPVG++WGGT
Sbjct: 297 AYALEYNNDKGHCFFNDFLIVSDDNNAFELEGDTGSLILVRGEAGEKPRPVGVVWGGTTQ 356
Query: 421 RGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAA 473
+GRLKL + P NWTSGVDL RLL+ L+L ++ +NE A++ QR AA+
Sbjct: 357 QGRLKLHKWKEPENWTSGVDLSRLLESLDLSIVTSNEALCEALEVQRQCRAAS 409
>gi|167999079|ref|XP_001752245.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696640|gb|EDQ82978.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 527 bits (1357), Expect = e-147, Method: Compositional matrix adjust.
Identities = 263/422 (62%), Positives = 320/422 (75%), Gaps = 5/422 (1%)
Query: 53 SESNAAYFSWPTLSRLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAF 112
+E +A + WPT N E RA +F LQK + + P G QA TLL+LMTIRAF
Sbjct: 1 NEGSAHFVEWPTSQLQNGPVELRAIHFCTLQKQM--SCSSKWPHGYQAATLLDLMTIRAF 58
Query: 113 HSKILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWC 172
HSK LR +SLG+A+GFRIR GV TDIPAI+VFVARKVHR WL Q LP LEGPGG+WC
Sbjct: 59 HSKSLRCYSLGSALGFRIRGGVQTDIPAIIVFVARKVHRHWLYEAQELPLILEGPGGIWC 118
Query: 173 DVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTG 232
DVDVVEFS G P P P E ++TELV+GL+G D IGSGSQVA E YGTLGAIVRSRTG
Sbjct: 119 DVDVVEFSLLG-PQP-PLEPVHTELVEGLQGRDATIGSGSQVACYELYGTLGAIVRSRTG 176
Query: 233 NQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTN 292
QVGFLTNRHVAV LD+P QK+F+PLPP LGPGVYLGAVER T+FI DDLWYG+FA N
Sbjct: 177 LCQVGFLTNRHVAVSLDHPVQKLFYPLPPHLGPGVYLGAVERTTTFIRDDLWYGVFASMN 236
Query: 293 PETFVRADGAFIPFAEDFNLNN-VTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRS 351
PE+F RADGAFIPF + ++ N V+ SV+GVGEIG+V +DL +P+NSLIG+ V+KVGRS
Sbjct: 237 PESFARADGAFIPFDNNLDVRNFVSPSVRGVGEIGEVMSVDLHAPLNSLIGKHVIKVGRS 296
Query: 352 SGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPV 411
SG+T G + AYA+EYN + G CFF DFL+V ++ Q F+ EGDSGSLIL+TG+ KPRP+
Sbjct: 297 SGVTKGCIFAYAVEYNSDIGHCFFNDFLIVSDDGQAFESEGDSGSLILVTGEAEGKPRPI 356
Query: 412 GIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASA 471
G++WGGT ++GRLK + + P WTSGVDL RLLD LEL ++++NE A++ QR A
Sbjct: 357 GMVWGGTTHQGRLKFQSWKEPEKWTSGVDLSRLLDSLELSIVSSNEALCEALEMQRQCLA 416
Query: 472 AA 473
A+
Sbjct: 417 AS 418
>gi|302813186|ref|XP_002988279.1| hypothetical protein SELMODRAFT_42830 [Selaginella moellendorffii]
gi|300144011|gb|EFJ10698.1| hypothetical protein SELMODRAFT_42830 [Selaginella moellendorffii]
Length = 358
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 231/344 (67%), Positives = 281/344 (81%), Gaps = 3/344 (0%)
Query: 96 TGQQATTLLELMTIRAFHSKILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLS 155
TG+QA TL ELM IRA H K+ RR LGTA+GFR R +TD PAI+VFVARK+H QW+
Sbjct: 1 TGRQAGTLRELMAIRAIHGKMFRRLGLGTALGFRTRDRQVTDRPAIIVFVARKLHAQWVL 60
Query: 156 HVQCLPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVA 215
Q LP+ ++GPG +WCDVDVVEFSY+GA + PKE++Y+ELV+ LRG D C+G GSQVA
Sbjct: 61 DGQMLPSTVQGPGDLWCDVDVVEFSYHGASSAAPKEQVYSELVECLRGDDQCVGPGSQVA 120
Query: 216 SQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERA 275
S E YGT+GA+VRSRTG Q+GFLTNRHVAVDLD+P QKMFHPLPP+LGPGVYLG VERA
Sbjct: 121 SLEVYGTMGAVVRSRTGEHQIGFLTNRHVAVDLDFPYQKMFHPLPPNLGPGVYLGTVERA 180
Query: 276 TSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQS 335
TSF+TDDLWYG+FA ET VRADGAF+PFA F+ ++VT S+KGVGE+G++ I+L
Sbjct: 181 TSFVTDDLWYGMFATCCSETVVRADGAFVPFAASFDSSSVTASIKGVGEVGELFTINLDD 240
Query: 336 PINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSG 395
PI +L+G+ +KVGRSSGLT GTV+AY +EY+D+KG+CFFTD LVVG+ Q FD EGDSG
Sbjct: 241 PIANLVGKAAIKVGRSSGLTRGTVVAYGVEYHDDKGVCFFTDLLVVGDGGQ-FDSEGDSG 299
Query: 396 SLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTSGV 439
S+ILL +G+KPRPVG+IWGGT+NRGRLKL+ G P NWTSGV
Sbjct: 300 SMILLC--DGDKPRPVGMIWGGTSNRGRLKLRQGHEPQNWTSGV 341
>gi|302760907|ref|XP_002963876.1| hypothetical protein SELMODRAFT_80513 [Selaginella moellendorffii]
gi|300169144|gb|EFJ35747.1| hypothetical protein SELMODRAFT_80513 [Selaginella moellendorffii]
Length = 372
Score = 483 bits (1243), Expect = e-133, Method: Compositional matrix adjust.
Identities = 229/346 (66%), Positives = 280/346 (80%), Gaps = 3/346 (0%)
Query: 94 LPTGQQATTLLELMTIRAFHSKILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQW 153
+ TG+QA TL ELM IRA H K+ RR LGTA+GFR R +TD PAI+VFVARK+H QW
Sbjct: 1 MGTGRQARTLRELMAIRAIHGKMFRRLGLGTALGFRTRDRQVTDRPAIIVFVARKLHAQW 60
Query: 154 LSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQ 213
+ Q LP+ ++GPG +WCDVDVVEFSY+G + PKE++Y+ELV+ LRG D IG GSQ
Sbjct: 61 VLDGQMLPSTVQGPGDLWCDVDVVEFSYHGTSSAAPKEQVYSELVECLRGDDQSIGPGSQ 120
Query: 214 VASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVE 273
VAS E YGT+GA+VRSRTG Q+GFLTNRHVAVDLD+P QKMFHPLPP+LGPGVYLG VE
Sbjct: 121 VASLEVYGTMGAVVRSRTGEHQIGFLTNRHVAVDLDFPYQKMFHPLPPNLGPGVYLGTVE 180
Query: 274 RATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDL 333
RATSF+TDDLWYG+FA ET VRADGAF+PFA F+ ++VT ++KGVGE+G++ I+L
Sbjct: 181 RATSFVTDDLWYGMFATCCSETVVRADGAFVPFAASFDSSSVTATIKGVGEVGELFTINL 240
Query: 334 QSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGD 393
PI +L+G+ +KVGRSSGLT GTV+AY +EY+D+KG+CFFTD LVVG+ Q FD EGD
Sbjct: 241 DDPIANLVGKAAIKVGRSSGLTRGTVVAYGVEYHDDKGVCFFTDLLVVGDGGQ-FDSEGD 299
Query: 394 SGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTSGV 439
SGS+ILL +G+KPRPVG+IWGGT+NRGRLKL+ G P NWTSGV
Sbjct: 300 SGSMILLC--DGDKPRPVGMIWGGTSNRGRLKLRQGHEPENWTSGV 343
>gi|413919514|gb|AFW59446.1| hypothetical protein ZEAMMB73_623071 [Zea mays]
Length = 302
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/287 (72%), Positives = 242/287 (84%), Gaps = 2/287 (0%)
Query: 7 DLRFQNSGSSQSEESALDLERNYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLS 66
D R Q SG +QS+ES LD+E + CH P+ P S PS +QP SG H+E++AAYF WPT +
Sbjct: 5 DDRAQLSGFAQSDESTLDVEGHCCHQPSFPCS-PS-MQPIVSGCTHTENSAAYFLWPTSN 62
Query: 67 RLNDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAI 126
+ AAE RANYF NL KG+LP+ RLP GQQA +LL+LMTIRAFHSK+LR F LGTA+
Sbjct: 63 LQHCAAEGRANYFANLSKGLLPKIGRRLPKGQQANSLLDLMTIRAFHSKVLRCFGLGTAV 122
Query: 127 GFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPA 186
GFRIR+GVLTDIPAIL FVARKVH++WL CLPA L GPGG+WCDVDVVEFSYYGAPA
Sbjct: 123 GFRIRKGVLTDIPAILCFVARKVHKKWLDPAHCLPAILAGPGGIWCDVDVVEFSYYGAPA 182
Query: 187 PTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAV 246
TPK +++TELVD L GSD CIGSGSQVASQ+T+GTLGAIV+ RTGN+ VGF+TNRHVAV
Sbjct: 183 QTPKVQIFTELVDKLCGSDECIGSGSQVASQDTFGTLGAIVKRRTGNKLVGFVTNRHVAV 242
Query: 247 DLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNP 293
DLDYPNQKM+HPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNP
Sbjct: 243 DLDYPNQKMYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNP 289
>gi|215695330|dbj|BAG90521.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 342
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 240/356 (67%), Positives = 274/356 (76%), Gaps = 20/356 (5%)
Query: 255 MFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNN 314
MFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPFA+D+++ +
Sbjct: 1 MFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDYDITS 60
Query: 315 VTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICF 374
V TSVKGVG IGDV IDLQSPI+SLIGRQV+KVGRSSGLTTGTV+AYALEYNDEKGICF
Sbjct: 61 VNTSVKGVGVIGDVKAIDLQSPISSLIGRQVVKVGRSSGLTTGTVVAYALEYNDEKGICF 120
Query: 375 FTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVN 434
FTDFLVVGENQQTFDLEGDSGSLI+LTG++GEKP+P+GIIWGGTANRGRLKLK GQ P N
Sbjct: 121 FTDFLVVGENQQTFDLEGDSGSLIILTGKDGEKPQPIGIIWGGTANRGRLKLKSGQGPEN 180
Query: 435 WTSGVDLGRLLDLLELDLIATNEGFQAAVQDQR---NASAAAIESTVGESPPAEREQSKE 491
WTSGVDLGRLLDLLELDLI T+EG Q A+++QR A+AAA ST GES P Q E
Sbjct: 181 WTSGVDLGRLLDLLELDLITTSEGLQEALEEQRIILAAAAAAANSTAGESSPVAGPQENE 240
Query: 492 KTAERLEPFNLNIQQDLVDGE-SEQGPTPPFIHTEFHVEDGIESSSNVGH-QFIPSFTGR 549
K + EP +NIQQ D + GP EFHV D +E +NV QF+ G
Sbjct: 241 KVDKIYEPLGINIQQLPRDNSATSTGP------DEFHV-DTVEGVTNVEERQFL---IGM 290
Query: 550 SPMHQNNAQENKGS-KSLSALRNGPDEDNYVSLQLGEPEPKRRKHSDTSLNVQESK 604
SP + QE G +L+ L N P ED SL LGE EPKR + SD+SL++ K
Sbjct: 291 SPARE--GQEANGDLNNLAELENSP-EDICFSLHLGEREPKRLR-SDSSLDIDLQK 342
>gi|413919515|gb|AFW59447.1| hypothetical protein ZEAMMB73_623071 [Zea mays]
Length = 316
Score = 368 bits (944), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 206/335 (61%), Positives = 245/335 (73%), Gaps = 24/335 (7%)
Query: 255 MFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNN 314
M+HPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPFA DF+++
Sbjct: 1 MYHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFAHDFDIST 60
Query: 315 VTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICF 374
VTT+V+GVG+IGDV +IDLQ P+N LIGR+V K+GRSSG TTGTVMAYALEYNDEKGI F
Sbjct: 61 VTTTVRGVGDIGDVKVIDLQCPLNRLIGRRVCKIGRSSGHTTGTVMAYALEYNDEKGISF 120
Query: 375 FTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVN 434
FTD LVVGEN+QTFDLEGDSGSLI+LTGQ+ EKPRP+GIIWGGTANRGRLKL+ P N
Sbjct: 121 FTDLLVVGENRQTFDLEGDSGSLIILTGQDSEKPRPIGIIWGGTANRGRLKLRCDHGPQN 180
Query: 435 WTSGVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGESPPAEREQSKEKTA 494
WTSGVDLGRLLD LELDLI T+E + AVQ QR A AAA S GES A +EK
Sbjct: 181 WTSGVDLGRLLDRLELDLIITSESLKDAVQQQRRALAAAANSAAGESSTAAAPVLEEKVE 240
Query: 495 ERLEPFNLNIQQ----DLVDGESEQGPTPPFIHTEFHVEDGIESSSNVGHQFIPSFTGRS 550
E EP + I+Q D+ E+E+ +VE+ QFI +F GRS
Sbjct: 241 EIFEPLGIKIEQLRRHDVSASEAEEAAG-------INVEE---------RQFISNFVGRS 284
Query: 551 PMHQNNAQENKGSKSLSALRNGPDEDNYVSLQLGE 585
P+ + + ++ L N +E+ + L LG+
Sbjct: 285 PVRDDQG----APRQIANLNNPSEEELAMLLHLGD 315
>gi|115460532|ref|NP_001053866.1| Os04g0615000 [Oryza sativa Japonica Group]
gi|113565437|dbj|BAF15780.1| Os04g0615000 [Oryza sativa Japonica Group]
Length = 207
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 172/207 (83%), Positives = 188/207 (90%)
Query: 255 MFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNN 314
MFHPLPP+LGPGVYLGAVERATSFITDD+WYGI+AGTNPETFVRADGAFIPFA+DF+++
Sbjct: 1 MFHPLPPNLGPGVYLGAVERATSFITDDVWYGIYAGTNPETFVRADGAFIPFADDFDIST 60
Query: 315 VTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICF 374
VTT V+GVG+IGDV +IDLQ P+NSLIGRQV KVGRSSG TTGTVMAYALEYNDEKGICF
Sbjct: 61 VTTVVRGVGDIGDVKVIDLQCPLNSLIGRQVCKVGRSSGHTTGTVMAYALEYNDEKGICF 120
Query: 375 FTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVN 434
FTD LVVGEN+QTFDLEGDSGSLI+LT Q+GEKPRP+GIIWGGTANRGRLKL P N
Sbjct: 121 FTDILVVGENRQTFDLEGDSGSLIILTSQDGEKPRPIGIIWGGTANRGRLKLTSDHGPEN 180
Query: 435 WTSGVDLGRLLDLLELDLIATNEGFQA 461
WTSGVDLGRLLD LELD+I TNE Q
Sbjct: 181 WTSGVDLGRLLDRLELDIIITNESLQG 207
>gi|218195570|gb|EEC77997.1| hypothetical protein OsI_17387 [Oryza sativa Indica Group]
Length = 999
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 161/187 (86%), Positives = 176/187 (94%)
Query: 107 MTIRAFHSKILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEG 166
MTIRAFHSKILRRFSLGTA+GFRIR+G LTDIPAILVFVARKVH++WL+ QCLPA LEG
Sbjct: 1 MTIRAFHSKILRRFSLGTAVGFRIRKGDLTDIPAILVFVARKVHKKWLNPAQCLPAILEG 60
Query: 167 PGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAI 226
PGGVWCDVDVVEFSYYGAPA TPKE++++ELVD L GSD CIGSGSQVAS ET+GTLGAI
Sbjct: 61 PGGVWCDVDVVEFSYYGAPAQTPKEQMFSELVDKLCGSDECIGSGSQVASHETFGTLGAI 120
Query: 227 VRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG 286
V+ RTGN+QVGFLTN HVAVDLDYPNQKMFHPLPP+LGPGVYLGAVERATSFITDD+WYG
Sbjct: 121 VKRRTGNKQVGFLTNHHVAVDLDYPNQKMFHPLPPNLGPGVYLGAVERATSFITDDVWYG 180
Query: 287 IFAGTNP 293
I+AGTNP
Sbjct: 181 IYAGTNP 187
>gi|224286426|gb|ACN40920.1| unknown [Picea sitchensis]
Length = 170
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 109/157 (69%), Positives = 120/157 (76%), Gaps = 7/157 (4%)
Query: 13 SGSSQSEESALDLER----NYCHHPNLPSSSPSPLQPFASGGQHSESNAAYFSWPTLSRL 68
SGS QSEESALD E+ N HP S SP PLQ FASGGQ SES+AA F WP +RL
Sbjct: 14 SGSMQSEESALDREQTVTGNSGRHPR--SDSP-PLQAFASGGQRSESSAACFRWPPSNRL 70
Query: 69 NDAAEDRANYFGNLQKGVLPETLGRLPTGQQATTLLELMTIRAFHSKILRRFSLGTAIGF 128
N AE+RA YFG +QK V ETL LP+G QAT LL+LMTIRAFHSKILRR+SLGTAIGF
Sbjct: 71 NGTAEERAAYFGGIQKEVDSETLEHLPSGHQATALLDLMTIRAFHSKILRRYSLGTAIGF 130
Query: 129 RIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALE 165
RIR GVLT+I AILVFVARKVH+QWL VQ LP+ LE
Sbjct: 131 RIREGVLTNILAILVFVARKVHKQWLLDVQRLPSVLE 167
>gi|357449481|ref|XP_003595017.1| Elongation factor 1-alpha [Medicago truncatula]
gi|355484065|gb|AES65268.1| Elongation factor 1-alpha [Medicago truncatula]
Length = 591
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 66/106 (62%), Positives = 72/106 (67%), Gaps = 13/106 (12%)
Query: 164 LEGPGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTL 223
L+GPGGVWCDVD+VE Y+ A P PKE+ YTE+VD RG DPCIGSGSQVASQ+TY TL
Sbjct: 481 LQGPGGVWCDVDMVEILYFSALDPVPKEQNYTEIVDDSRGGDPCIGSGSQVASQKTYRTL 540
Query: 224 GAIVRSRTGNQQVGFL-TNRHVAVDLDYPNQKMFHPLPPSLGPGVY 268
VGFL T H VDLDY NQKMFHPLP L VY
Sbjct: 541 ------------VGFLRTYCHAVVDLDYSNQKMFHPLPHILSLEVY 574
>gi|388511095|gb|AFK43612.1| unknown [Medicago truncatula]
Length = 99
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 47/79 (59%), Positives = 61/79 (77%), Gaps = 1/79 (1%)
Query: 524 TEFHVEDGIESSSNVGHQFI-PSFTGRSPMHQNNAQENKGSKSLSALRNGPDEDNYVSLQ 582
EFH+ + IE+ NV HQFI SF G+SP+HQ+ +E+ KSLS LRN PDEDN+VSL
Sbjct: 20 CEFHIRNEIETVPNVEHQFIRTSFAGKSPVHQSFLKEDMQFKSLSELRNEPDEDNFVSLH 79
Query: 583 LGEPEPKRRKHSDTSLNVQ 601
LGEPE KRRKHS++SL+++
Sbjct: 80 LGEPEAKRRKHSNSSLSLK 98
>gi|357452683|ref|XP_003596618.1| Elongation factor 1-alpha [Medicago truncatula]
gi|355485666|gb|AES66869.1| Elongation factor 1-alpha [Medicago truncatula]
Length = 608
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 44/62 (70%), Gaps = 5/62 (8%)
Query: 194 YTELVDGLRGSDPCIGSGSQVASQ-----ETYGTLGAIVRSRTGNQQVGFLTNRHVAVDL 248
YTE+VD LRG +PCIGS SQ++ + +T G RS+TG++QVGF T +HVA+DL
Sbjct: 547 YTEIVDDLRGGNPCIGSRSQMSEKSLVRSQTERNFGCTGRSQTGSRQVGFRTYQHVAIDL 606
Query: 249 DY 250
DY
Sbjct: 607 DY 608
>gi|323701635|ref|ZP_08113307.1| hypothetical protein DesniDRAFT_0519 [Desulfotomaculum nigrificans
DSM 574]
gi|333922305|ref|YP_004495885.1| hypothetical protein Desca_0068 [Desulfotomaculum carboxydivorans
CO-1-SRB]
gi|323533408|gb|EGB23275.1| hypothetical protein DesniDRAFT_0519 [Desulfotomaculum nigrificans
DSM 574]
gi|333747866|gb|AEF92973.1| hypothetical protein Desca_0068 [Desulfotomaculum carboxydivorans
CO-1-SRB]
Length = 334
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 133/329 (40%), Gaps = 68/329 (20%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G +G++ T+ PAI+VFV++K + LS Q +P + G + DV+E
Sbjct: 22 VGVGVGYKHVGMSRTERPAIIVFVSKKEAPENLSREQTVPIKING-----LETDVIEIG- 75
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
+ E +R + P I G + T GT GA+VR R +++ L+N
Sbjct: 76 --------EVRFLEERTQLVRPAQPGISIGHY---RITAGTFGAVVRDRHTGEKL-ILSN 123
Query: 242 RHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADG 301
H+ + N P L PG Y G + T + I G P T A+G
Sbjct: 124 NHILANATSGNDGRAAIGDPILQPGEYDGG-SKDDRIATLLRYIPIQKGEVPATCPVANG 182
Query: 302 A------FI-----------------------PFAEDFNLNNVTTSVKGVGEIGDVHIID 332
A F+ A + +T + G+G +
Sbjct: 183 AARLANMFVHAVRPNYQLKFFKRGGAANIVDCAVARPLRPDLITEEILGLGLV------- 235
Query: 333 LQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYN---DEKGICFFTDFLVVGENQQTFD 389
Q + +G +V+K GR+SG+T GTV A + + D+ F+D +V Q
Sbjct: 236 -QGVAEAKLGMKVVKSGRTSGITRGTVTAVGVTLDVKLDDNTSAHFSDQVVTDMKSQG-- 292
Query: 390 LEGDSGSLILLTGQNGEKPRPVGIIWGGT 418
GDSGSL+L G + VG+++ G+
Sbjct: 293 --GDSGSLVLTEGN-----KAVGLLFAGS 314
>gi|419714426|ref|ZP_14241842.1| hypothetical protein S7W_08218 [Mycobacterium abscessus M94]
gi|382945545|gb|EIC69839.1| hypothetical protein S7W_08218 [Mycobacterium abscessus M94]
Length = 728
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIG+ V+ G SSGL G VMA Y G
Sbjct: 233 TSTAYGIGDIGP--MVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSVGGSE 290
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + + GDSG + LT +N +P P+ + WGG A
Sbjct: 291 YVSDFLIAPDPQGSQTVPGDSGMVWHLT-ENRARPAPLAVEWGGQA 335
>gi|333977577|ref|YP_004515522.1| hypothetical protein Desku_0073 [Desulfotomaculum kuznetsovii DSM
6115]
gi|333821058|gb|AEG13721.1| hypothetical protein Desku_0073 [Desulfotomaculum kuznetsovii DSM
6115]
Length = 334
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 91/338 (26%), Positives = 143/338 (42%), Gaps = 57/338 (16%)
Query: 108 TIRAFHSKILRRFSL-GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEG 166
++ K+LR ++ G +G + G T+ PA+++FV +KV L VQ +PA ++G
Sbjct: 7 VLKKSREKLLRLPNVTGVGVGLKQVSGETTNRPALIIFVKKKVPSDGLVRVQQVPAYIDG 66
Query: 167 PGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAI 226
D++E + L + R + P + G S GT GA+
Sbjct: 67 -----LPTDIIEIG---------EVRLLSLRTGKERPAQPGMSIGHYKISA---GTFGAV 109
Query: 227 VRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLG--AVERATSFIT-DDL 283
V+ R + + L+N H+ + P L PG + G A +R + + L
Sbjct: 110 VKDRVTKEPL-ILSNNHILANATDGKDGRAAVGDPILQPGPHDGGQAGDRIGTLLRFSPL 168
Query: 284 WYGIFAGTNP--ETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSP--INS 339
I P E VRA + + +G G I D + SP IN
Sbjct: 169 LRSIQEAECPVAEALVRAGNLLVRLVRPHYQLKMFQYYRG-GNIIDAAVARPDSPGLIND 227
Query: 340 LI--------------GRQVMKVGRSSGLTTGTVMAYALEY-----NDEKGICFFTDFLV 380
I G+ VMK GR++G++ GTV A + NDEKG +FTD +V
Sbjct: 228 EILEIGKVEGVARVDPGQGVMKSGRTTGISEGTVTAVGVTLEVEIGNDEKG--WFTDQVV 285
Query: 381 VGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGT 418
+ + GDSGSL+L + EK R VG+++ G+
Sbjct: 286 TDMSSRP----GDSGSLVL----DREK-RAVGLLFAGS 314
>gi|420864658|ref|ZP_15328047.1| hypothetical protein MA4S0303_3019 [Mycobacterium abscessus
4S-0303]
gi|420869447|ref|ZP_15332829.1| hypothetical protein MA4S0726RA_2952 [Mycobacterium abscessus
4S-0726-RA]
gi|420873892|ref|ZP_15337268.1| hypothetical protein MA4S0726RB_2542 [Mycobacterium abscessus
4S-0726-RB]
gi|420990095|ref|ZP_15453251.1| hypothetical protein MA4S0206_3037 [Mycobacterium abscessus
4S-0206]
gi|421042016|ref|ZP_15505024.1| hypothetical protein MA4S0116R_2995 [Mycobacterium abscessus
4S-0116-R]
gi|421044246|ref|ZP_15507246.1| hypothetical protein MA4S0116S_2090 [Mycobacterium abscessus
4S-0116-S]
gi|392063374|gb|EIT89223.1| hypothetical protein MA4S0303_3019 [Mycobacterium abscessus
4S-0303]
gi|392065367|gb|EIT91215.1| hypothetical protein MA4S0726RB_2542 [Mycobacterium abscessus
4S-0726-RB]
gi|392068917|gb|EIT94764.1| hypothetical protein MA4S0726RA_2952 [Mycobacterium abscessus
4S-0726-RA]
gi|392184374|gb|EIV10025.1| hypothetical protein MA4S0206_3037 [Mycobacterium abscessus
4S-0206]
gi|392222944|gb|EIV48467.1| hypothetical protein MA4S0116R_2995 [Mycobacterium abscessus
4S-0116-R]
gi|392233699|gb|EIV59197.1| hypothetical protein MA4S0116S_2090 [Mycobacterium abscessus
4S-0116-S]
Length = 728
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIG+ V+ G SSGL G VMA Y G
Sbjct: 233 TSTAYGIGDIGP--MVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSVGGSE 290
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + GDSG + LT +N +P P+ + WGG A
Sbjct: 291 YVSDFLIAPDPQGPQTVPGDSGMVWHLT-ENRARPAPLAVEWGGQA 335
>gi|419709529|ref|ZP_14236997.1| hypothetical protein OUW_08328 [Mycobacterium abscessus M93]
gi|382943410|gb|EIC67724.1| hypothetical protein OUW_08328 [Mycobacterium abscessus M93]
Length = 728
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIG+ V+ G SSGL G VMA Y G
Sbjct: 233 TSTAYGIGDIGP--MVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSVGGSE 290
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + GDSG + LT +N +P P+ + WGG A
Sbjct: 291 YVSDFLIAPDPQGPQTVPGDSGMVWHLT-ENRARPAPLAVEWGGQA 335
>gi|271966485|ref|YP_003340681.1| hypothetical protein [Streptosporangium roseum DSM 43021]
gi|270509660|gb|ACZ87938.1| hypothetical protein Sros_5160 [Streptosporangium roseum DSM 43021]
Length = 523
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 90/342 (26%), Positives = 132/342 (38%), Gaps = 73/342 (21%)
Query: 115 KILRRFS-----LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGG 169
KIL F G IGFR R G TD P ++V VA+K +S+ + LP +E G
Sbjct: 17 KILDSFGADPNVTGAGIGFRRRDGQWTDEPVVVVLVAKKRPEALVSNRRLLPRTVEVDGS 76
Query: 170 VWCDVDVVEFSYYGAP-APTPKEELYTELVDGLRGSDPCIGSGSQVASQ---ETYGTLGA 225
C+VDV+E + P +E+ V G+ G G +++ +T GTLG
Sbjct: 77 -PCEVDVIEAGPFRMDRVSDPAQEVTPAAVVGVTGRMRPPRPGCSISNPLDGDTAGTLGL 135
Query: 226 IVRSRTGNQQVGFLTNRHVAVDL--DYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDL 283
V +T + V ++N HV + +K+ PGV+ G + T
Sbjct: 136 FVLDKT-DGTVCLMSNNHVMARMGEGVKGEKIIQ-------PGVHDGGTAAKDTIATLKR 187
Query: 284 WYGI-FAGTNPETFVRADGAFIPFAEDFNLN-----------NVTTSVKGVGEIGDVH-- 329
W I AGT + D A + NL+ V G+ GD H
Sbjct: 188 WVPITTAGT------KIDAAIAQLVDQMNLSLQPALDRMPPLGVKHPAVGIFTGGDDHGT 241
Query: 330 --IIDLQSPINSL---------IGR----------------QVMKVGRSSGLTTGTVMAY 362
I + +N+L GR + KVGR+SG T+ + A
Sbjct: 242 GVITRIDLALNALNVVPAVSAPDGRVAAAPPEAVKVPEPFMNIEKVGRTSGYTSSMITAI 301
Query: 363 ALE--YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTG 402
+E G+ +TD + F L GDSGS + G
Sbjct: 302 GVESLILTPIGMVLYTDLALTDR----FGLAGDSGSAVFHGG 339
>gi|418421347|ref|ZP_12994521.1| hypothetical protein MBOL_30670 [Mycobacterium abscessus subsp.
bolletii BD]
gi|363996427|gb|EHM17642.1| hypothetical protein MBOL_30670 [Mycobacterium abscessus subsp.
bolletii BD]
Length = 728
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIGR V+ G SSGL G VMA Y G
Sbjct: 233 TSTAYGIGDIGP--MVDTGDMTNGLDLIGRPVVAHGASSGLVAGKVMALFYRYKSVGGSE 290
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + GDSG + LT ++ +P P+ + WGG A
Sbjct: 291 YVSDFLIAPDPQGPQTVPGDSGMVWHLT-EDRARPGPLAVEWGGQA 335
>gi|169630314|ref|YP_001703963.1| hypothetical protein MAB_3233 [Mycobacterium abscessus ATCC 19977]
gi|420910850|ref|ZP_15374162.1| hypothetical protein MA6G0125R_2366 [Mycobacterium abscessus
6G-0125-R]
gi|420917303|ref|ZP_15380606.1| hypothetical protein MA6G0125S_3405 [Mycobacterium abscessus
6G-0125-S]
gi|420922468|ref|ZP_15385764.1| hypothetical protein MA6G0728S_3090 [Mycobacterium abscessus
6G-0728-S]
gi|420928131|ref|ZP_15391411.1| hypothetical protein MA6G1108_3333 [Mycobacterium abscessus
6G-1108]
gi|420967738|ref|ZP_15430942.1| hypothetical protein MM3A0810R_3493 [Mycobacterium abscessus
3A-0810-R]
gi|420978471|ref|ZP_15441648.1| hypothetical protein MA6G0212_3393 [Mycobacterium abscessus
6G-0212]
gi|420983854|ref|ZP_15447021.1| hypothetical protein MA6G0728R_3335 [Mycobacterium abscessus
6G-0728-R]
gi|421008973|ref|ZP_15472083.1| hypothetical protein MA3A0119R_3393 [Mycobacterium abscessus
3A-0119-R]
gi|421013827|ref|ZP_15476905.1| hypothetical protein MA3A0122R_3404 [Mycobacterium abscessus
3A-0122-R]
gi|421018771|ref|ZP_15481828.1| hypothetical protein MA3A0122S_2998 [Mycobacterium abscessus
3A-0122-S]
gi|421024437|ref|ZP_15487481.1| hypothetical protein MA3A0731_3523 [Mycobacterium abscessus
3A-0731]
gi|421030220|ref|ZP_15493251.1| hypothetical protein MA3A0930R_3458 [Mycobacterium abscessus
3A-0930-R]
gi|421035683|ref|ZP_15498701.1| hypothetical protein MA3A0930S_3391 [Mycobacterium abscessus
3A-0930-S]
gi|169242281|emb|CAM63309.1| Conserved hypothetical protein [Mycobacterium abscessus]
gi|392110194|gb|EIU35964.1| hypothetical protein MA6G0125S_3405 [Mycobacterium abscessus
6G-0125-S]
gi|392112844|gb|EIU38613.1| hypothetical protein MA6G0125R_2366 [Mycobacterium abscessus
6G-0125-R]
gi|392127121|gb|EIU52871.1| hypothetical protein MA6G0728S_3090 [Mycobacterium abscessus
6G-0728-S]
gi|392129249|gb|EIU54996.1| hypothetical protein MA6G1108_3333 [Mycobacterium abscessus
6G-1108]
gi|392162749|gb|EIU88438.1| hypothetical protein MA6G0212_3393 [Mycobacterium abscessus
6G-0212]
gi|392168850|gb|EIU94528.1| hypothetical protein MA6G0728R_3335 [Mycobacterium abscessus
6G-0728-R]
gi|392197121|gb|EIV22737.1| hypothetical protein MA3A0119R_3393 [Mycobacterium abscessus
3A-0119-R]
gi|392200682|gb|EIV26287.1| hypothetical protein MA3A0122R_3404 [Mycobacterium abscessus
3A-0122-R]
gi|392207401|gb|EIV32978.1| hypothetical protein MA3A0122S_2998 [Mycobacterium abscessus
3A-0122-S]
gi|392211234|gb|EIV36800.1| hypothetical protein MA3A0731_3523 [Mycobacterium abscessus
3A-0731]
gi|392223440|gb|EIV48962.1| hypothetical protein MA3A0930R_3458 [Mycobacterium abscessus
3A-0930-R]
gi|392224178|gb|EIV49699.1| hypothetical protein MA3A0930S_3391 [Mycobacterium abscessus
3A-0930-S]
gi|392250245|gb|EIV75719.1| hypothetical protein MM3A0810R_3493 [Mycobacterium abscessus
3A-0810-R]
Length = 728
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIG+ V+ G SSGL G VMA Y G
Sbjct: 233 TSTAYGIGDIGP--MVDTGDMTNGLDLIGQPVVAHGASSGLVGGKVMALFYRYKSVGGSE 290
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + GDSG + LT +N +P P+ + WGG A
Sbjct: 291 YVSDFLIAPDPQGPQTVPGDSGMVWHLT-ENRARPAPLAVEWGGQA 335
>gi|414154359|ref|ZP_11410678.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
= DSM 18033]
gi|411454150|emb|CCO08582.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
= DSM 18033]
Length = 335
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 129/329 (39%), Gaps = 67/329 (20%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G +G + T+ PAI++FV +K Q LS +P + G DV+E
Sbjct: 22 VGVGVGHKYVDMQRTEQPAIIIFVKKKEEPQNLSREHLVPYQING-----LTTDVIEVGE 76
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
L E +R + P + G + T GT GA+VR R +++ L+N
Sbjct: 77 V--------RLLDEERTKHVRPAQPGLSIGH---YRVTAGTFGAVVRDRQTGERL-ILSN 124
Query: 242 RHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADG 301
H+ + P L PG Y G R T + + G P T A+G
Sbjct: 125 NHILANATNGKDGRAAIGDPILQPGEYDGGT-REDRIATLLRYIPLQKGEAPATCPVANG 183
Query: 302 A------------------FIPFAEDFNLNN-----------VTTSVKGVGEIGDVHIID 332
A FI N+ + +T + G IG V ++
Sbjct: 184 AARFLNIFVHTVRPNYDLRFIKRGGTPNIVDCAVARPVRPELITDDILG---IGKVQGVE 240
Query: 333 LQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYN---DEKGICFFTDFLVVGENQQTFD 389
P G QV+K GR++G+T GTV A D++ +F D +V Q
Sbjct: 241 RAKP-----GMQVVKSGRTTGITRGTVTAVGATMEVKLDDENTAYFADQVVTDMKSQG-- 293
Query: 390 LEGDSGSLILLTGQNGEKPRPVGIIWGGT 418
GDSGSL+L ++ R VG+++ G+
Sbjct: 294 --GDSGSLVL-----NQENRAVGLLFAGS 315
>gi|418247622|ref|ZP_12874008.1| hypothetical protein MAB47J26_03320 [Mycobacterium abscessus 47J26]
gi|420932347|ref|ZP_15395622.1| hypothetical protein MM1S1510930_3180 [Mycobacterium massiliense
1S-151-0930]
gi|420939252|ref|ZP_15402521.1| hypothetical protein MM1S1520914_3384 [Mycobacterium massiliense
1S-152-0914]
gi|420952865|ref|ZP_15416108.1| hypothetical protein MM2B0626_3102 [Mycobacterium massiliense
2B-0626]
gi|420957036|ref|ZP_15420272.1| hypothetical protein MM2B0107_2440 [Mycobacterium massiliense
2B-0107]
gi|420962692|ref|ZP_15425916.1| hypothetical protein MM2B1231_3167 [Mycobacterium massiliense
2B-1231]
gi|420992988|ref|ZP_15456134.1| hypothetical protein MM2B0307_2407 [Mycobacterium massiliense
2B-0307]
gi|420998760|ref|ZP_15461896.1| hypothetical protein MM2B0912R_3420 [Mycobacterium massiliense
2B-0912-R]
gi|421003282|ref|ZP_15466405.1| hypothetical protein MM2B0912S_3107 [Mycobacterium massiliense
2B-0912-S]
gi|353452115|gb|EHC00509.1| hypothetical protein MAB47J26_03320 [Mycobacterium abscessus 47J26]
gi|392137106|gb|EIU62843.1| hypothetical protein MM1S1510930_3180 [Mycobacterium massiliense
1S-151-0930]
gi|392144767|gb|EIU70492.1| hypothetical protein MM1S1520914_3384 [Mycobacterium massiliense
1S-152-0914]
gi|392156377|gb|EIU82080.1| hypothetical protein MM2B0626_3102 [Mycobacterium massiliense
2B-0626]
gi|392179090|gb|EIV04742.1| hypothetical protein MM2B0307_2407 [Mycobacterium massiliense
2B-0307]
gi|392184901|gb|EIV10551.1| hypothetical protein MM2B0912R_3420 [Mycobacterium massiliense
2B-0912-R]
gi|392193854|gb|EIV19475.1| hypothetical protein MM2B0912S_3107 [Mycobacterium massiliense
2B-0912-S]
gi|392245605|gb|EIV71082.1| hypothetical protein MM2B1231_3167 [Mycobacterium massiliense
2B-1231]
gi|392251846|gb|EIV77317.1| hypothetical protein MM2B0107_2440 [Mycobacterium massiliense
2B-0107]
Length = 726
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIG+ V+ G SSGL G VMA Y G
Sbjct: 231 TSTAYGIGDIGP--MVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSMGGSE 288
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + GDSG + LT ++ +P P+ + WGG A
Sbjct: 289 YVSDFLIAPDPQGPQTVPGDSGMVWHLT-EDRARPAPLAVEWGGQA 333
>gi|365871159|ref|ZP_09410700.1| hypothetical protein MMAS_31020 [Mycobacterium massiliense CCUG
48898 = JCM 15300]
gi|421050237|ref|ZP_15513231.1| hypothetical protein MMCCUG48898_3242 [Mycobacterium massiliense
CCUG 48898 = JCM 15300]
gi|363994962|gb|EHM16180.1| hypothetical protein MMAS_31020 [Mycobacterium massiliense CCUG
48898 = JCM 15300]
gi|392238840|gb|EIV64333.1| hypothetical protein MMCCUG48898_3242 [Mycobacterium massiliense
CCUG 48898]
Length = 727
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIG+ V+ G SSGL G VMA Y G
Sbjct: 232 TSTAYGIGDIGP--MVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSMGGSE 289
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + GDSG + LT ++ +P P+ + WGG A
Sbjct: 290 YVSDFLIAPDPQGPQTVPGDSGMVWHLT-EDRARPAPLAVEWGGQA 334
>gi|414582515|ref|ZP_11439655.1| hypothetical protein MA5S1215_2581 [Mycobacterium abscessus
5S-1215]
gi|420880944|ref|ZP_15344311.1| hypothetical protein MA5S0304_2543 [Mycobacterium abscessus
5S-0304]
gi|420884687|ref|ZP_15348047.1| hypothetical protein MA5S0421_2798 [Mycobacterium abscessus
5S-0421]
gi|420890907|ref|ZP_15354254.1| hypothetical protein MA5S0422_3719 [Mycobacterium abscessus
5S-0422]
gi|420896690|ref|ZP_15360029.1| hypothetical protein MA5S0708_2471 [Mycobacterium abscessus
5S-0708]
gi|420901021|ref|ZP_15364352.1| hypothetical protein MA5S0817_2089 [Mycobacterium abscessus
5S-0817]
gi|420904996|ref|ZP_15368314.1| hypothetical protein MA5S1212_2226 [Mycobacterium abscessus
5S-1212]
gi|420973119|ref|ZP_15436311.1| hypothetical protein MA5S0921_3501 [Mycobacterium abscessus
5S-0921]
gi|392078167|gb|EIU03994.1| hypothetical protein MA5S0422_3719 [Mycobacterium abscessus
5S-0422]
gi|392080450|gb|EIU06276.1| hypothetical protein MA5S0421_2798 [Mycobacterium abscessus
5S-0421]
gi|392085853|gb|EIU11678.1| hypothetical protein MA5S0304_2543 [Mycobacterium abscessus
5S-0304]
gi|392096002|gb|EIU21797.1| hypothetical protein MA5S0708_2471 [Mycobacterium abscessus
5S-0708]
gi|392098382|gb|EIU24176.1| hypothetical protein MA5S0817_2089 [Mycobacterium abscessus
5S-0817]
gi|392102900|gb|EIU28686.1| hypothetical protein MA5S1212_2226 [Mycobacterium abscessus
5S-1212]
gi|392117667|gb|EIU43435.1| hypothetical protein MA5S1215_2581 [Mycobacterium abscessus
5S-1215]
gi|392164670|gb|EIU90358.1| hypothetical protein MA5S0921_3501 [Mycobacterium abscessus
5S-0921]
Length = 716
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIG+ V+ G SSGL G VMA Y G
Sbjct: 221 TSTAYGIGDIGP--MVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSMGGSE 278
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + GDSG + LT ++ +P P+ + WGG A
Sbjct: 279 YVSDFLIAPDPQGPQTVPGDSGMVWHLT-EDRARPAPLAVEWGGQA 323
>gi|420942606|ref|ZP_15405862.1| hypothetical protein MM1S1530915_2728 [Mycobacterium massiliense
1S-153-0915]
gi|420948873|ref|ZP_15412123.1| hypothetical protein MM1S1540310_2737 [Mycobacterium massiliense
1S-154-0310]
gi|392147703|gb|EIU73421.1| hypothetical protein MM1S1530915_2728 [Mycobacterium massiliense
1S-153-0915]
gi|392155903|gb|EIU81609.1| hypothetical protein MM1S1540310_2737 [Mycobacterium massiliense
1S-154-0310]
Length = 716
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 316 TTSVKGVGEIGDVHIIDLQSPIN--SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGIC 373
T++ G+G+IG ++D N LIG+ V+ G SSGL G VMA Y G
Sbjct: 221 TSTAYGIGDIGP--MVDTGDMTNGLDLIGQPVVAHGASSGLVAGKVMALFYRYKSMGGSE 278
Query: 374 FFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
+ +DFL+ + Q + GDSG + LT ++ +P P+ + WGG A
Sbjct: 279 YVSDFLIAPDPQGPQTVPGDSGMVWHLT-EDRARPAPLAVEWGGQA 323
>gi|334338755|ref|YP_004543735.1| hypothetical protein [Desulfotomaculum ruminis DSM 2154]
gi|334090109|gb|AEG58449.1| hypothetical protein Desru_0150 [Desulfotomaculum ruminis DSM 2154]
Length = 334
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 86/328 (26%), Positives = 129/328 (39%), Gaps = 66/328 (20%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G +G++ T+ PAI+VFV +K + LS +P + G + DV+E
Sbjct: 22 VGVGVGYKHVGLERTERPAIIVFVKKKETSENLSRENLVPYKING-----LETDVIEIG- 75
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
+ L +E +R + P + G + T GT GA+VR R +++ L+N
Sbjct: 76 --------EVRLLSERTQVIRPAQPGVSIGHY---RITAGTFGAVVRDRDTGEKL-ILSN 123
Query: 242 RHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVER---ATSFITDDLWYGIFAGTNPETFVR 298
H+ + N P L PG Y G + AT L G T P V
Sbjct: 124 NHILANASNGNDGRAAVGDPILQPGEYDGGTKDNRIATLLRYIPLQKGESLATCPVANVA 183
Query: 299 A--------------DGAFIPFAEDFNL-----------NNVTTSVKGVGEIGDVHIIDL 333
A D F NL N + V G+G I
Sbjct: 184 ARLANILVHTLRPNYDLRFFKRGRAENLVDCAVARPVRENVIFEEVLGIGRI-------- 235
Query: 334 QSPINSLIGRQVMKVGRSSGLTTGTVMAYA--LEYN-DEKGICFFTDFLVVGENQQTFDL 390
+ + G V+K GR++G+T GTV A LE D++ F+ +V Q
Sbjct: 236 EGLAEARPGMPVVKSGRTTGITKGTVTAVGATLEVKLDDESTAHFSGQVVTNMKSQG--- 292
Query: 391 EGDSGSLILLTGQNGEKPRPVGIIWGGT 418
GDSGSL+L G R VG+++ G+
Sbjct: 293 -GDSGSLVLTEGN-----RAVGLLFAGS 314
>gi|398353752|ref|YP_006399216.1| hypothetical protein USDA257_c39150 [Sinorhizobium fredii USDA 257]
gi|390129078|gb|AFL52459.1| hypothetical protein USDA257_c39150 [Sinorhizobium fredii USDA 257]
Length = 766
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 123/311 (39%), Gaps = 63/311 (20%)
Query: 139 PAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEE------ 192
P+ILVFV + V ++ L + +P L P G V V+E PKEE
Sbjct: 79 PSILVFVEQWVSKKDLEPGEIVPKTLYLPDGRRVPVCVIE---------APKEEKNEKRP 129
Query: 193 LYTEL-VDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYP 251
L T V+ + G P I S Q T+ +V + V LTNRHVA +
Sbjct: 130 LTTVFPVNNIGGGWPVI---SHNQGQSYAATIACLV---SDGHTVYALTNRHVAGE---A 180
Query: 252 NQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNP-----ETFVRADGAFIPF 306
+ ++ L G ER L +F P + +V D I
Sbjct: 181 GEIIYSRLG---------GKQERIGVSSEKHLTRALFTTHYPGWPGRDVYVNLDVGLIDI 231
Query: 307 AEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEY 366
NL+ T ++ +G++G + + + + +LIGR V G +SGL G + A Y
Sbjct: 232 D---NLDRWTAEIRDIGQMGKMVDLSVHTISLALIGRDVRGTGAASGLMQGEIAALFYRY 288
Query: 367 NDEKGICFFTDFLVVGE-----NQQTFDLE---GDSGSLILL----------TGQNGEKP 408
G + D L+ ++ T E GDSG+L LL + G+KP
Sbjct: 289 KTNGGFEYVADLLIGPRPADDGDRNTVPFETHPGDSGTLWLLEPDKNDRSGKSPSKGKKP 348
Query: 409 ---RPVGIIWG 416
P+ + WG
Sbjct: 349 PDYLPLAMQWG 359
>gi|425465752|ref|ZP_18845059.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389831923|emb|CCI24872.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 321
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 89/206 (43%), Gaps = 28/206 (13%)
Query: 219 TYGTLGAIVRSRTGN-QQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATS 277
T GTLG +V+ G+ ++ L+N HV D + P L G + + T
Sbjct: 123 TAGTLGCLVKKTAGDDNEIFILSNNHVLADSNQAQIDDNIIEPGKLDQGTE--PIAKLTD 180
Query: 278 FITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPI 337
F T IF P F+ A A+ N N+V S+ +G + Q P+
Sbjct: 181 FET------IFLDDKPN-FIDA-----AIAKVINNNDVRPSILTIGNVQ-------QPPM 221
Query: 338 NSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKG--ICFFTDFLVVGENQQTFDLEGDSG 395
S + + V K GR++G T G +M A + G I F D L + F GDSG
Sbjct: 222 TSALYQSVRKHGRTTGHTIGVIMDIAADVRVRFGQKIANFEDQLAIQGVNGLFSQGGDSG 281
Query: 396 SLILLTGQNGEKPRPVGIIWGGTANR 421
SLI+ + RPVG+++ G N+
Sbjct: 282 SLIV----DAMTRRPVGLLFAGGGNQ 303
>gi|166366703|ref|YP_001658976.1| hypothetical protein MAE_39620 [Microcystis aeruginosa NIES-843]
gi|440756156|ref|ZP_20935357.1| hypothetical protein O53_4564 [Microcystis aeruginosa TAIHU98]
gi|166089076|dbj|BAG03784.1| hypothetical protein MAE_39620 [Microcystis aeruginosa NIES-843]
gi|440173378|gb|ELP52836.1| hypothetical protein O53_4564 [Microcystis aeruginosa TAIHU98]
Length = 321
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 89/206 (43%), Gaps = 28/206 (13%)
Query: 219 TYGTLGAIVRSRTGN-QQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATS 277
T GTLG +V+ G+ ++ L+N HV D + P L G + + T
Sbjct: 123 TAGTLGCLVKKTAGDDNEIFILSNNHVLADSNQAQIDDNIIEPGKLDQGTE--PIAKLTD 180
Query: 278 FITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPI 337
F T IF P F+ A A+ N N+V S+ +G + Q P+
Sbjct: 181 FET------IFLDDKPN-FIDA-----AIAKVINNNDVRPSILTIGNVQ-------QPPM 221
Query: 338 NSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKG--ICFFTDFLVVGENQQTFDLEGDSG 395
S + + V K GR++G T G +M A + G I F D L + F GDSG
Sbjct: 222 TSALYQSVRKHGRTTGHTIGVIMDIAADVRVRFGQKIANFEDQLAIQGVNGLFSQGGDSG 281
Query: 396 SLILLTGQNGEKPRPVGIIWGGTANR 421
SLI+ + RPVG+++ G N+
Sbjct: 282 SLIV----DAMTRRPVGLLFAGGGNQ 303
>gi|331271091|ref|YP_004385800.1| hypothetical protein CbC4_6003 [Clostridium botulinum BKT015925]
gi|329127586|gb|AEB77528.1| hypothetical protein CbC4_6003 [Clostridium botulinum BKT015925]
Length = 313
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 123/297 (41%), Gaps = 71/297 (23%)
Query: 120 FSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEF 179
+ +G A+G++I+ G +T+ I VFV++KV L + +P +G + DVVE
Sbjct: 34 YIVGIALGYKIKNGFITNKKCIKVFVSKKVPLSNLYEHEVIPKFFKG-----IETDVVES 88
Query: 180 SYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQ-VASQETYGTLGAIVRSRTGNQQVGF 238
+ A T K P IG S V++ G++G +V T +
Sbjct: 89 GKFSAAEFTGKVR-------------PVIGGYSIGVSNILRVGSMGCLV---TDGRYKYI 132
Query: 239 LTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPET-FV 297
LTN H+ DL+ K+ P+ + PG Y G NP T V
Sbjct: 133 LTNNHIIADLN--KVKIGTPI---IQPGRY--------------------DGGNPNTDIV 167
Query: 298 RADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDL-------------QSPINSLIGRQ 344
+IP + + TS + +ID Q P+ +IG++
Sbjct: 168 AILSKYIPLKTE----GIITSPTNYMDCAIAKLIDESLVSPKIAIVGAPQEPMIPIIGKE 223
Query: 345 VMKVGRSSGLTTGTVMAYALEYNDEKG--ICFFTDFLVVGENQQTFDLEGDSGSLIL 399
V KVGRS+ +TTG + ++ + G I F + +V ++ GDSGS++L
Sbjct: 224 VKKVGRSTEMTTGRITDIDGTFHIKFGSKIFLFEEQIVTTCMCES----GDSGSILL 276
>gi|326330454|ref|ZP_08196762.1| hypothetical protein NBCG_01888 [Nocardioidaceae bacterium Broad-1]
gi|325951729|gb|EGD43761.1| hypothetical protein NBCG_01888 [Nocardioidaceae bacterium Broad-1]
Length = 332
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 123/316 (38%), Gaps = 61/316 (19%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G +G +I G TD P+++V V++K+ + +S +P ++G DV+E +
Sbjct: 39 VGVGVGLKITDGEQTDTPSVMVLVSQKMPTELVSDADTVPDTVDG-----TPTDVLEVGH 93
Query: 182 YGAPAPTPKEELYTELVDG------LRGSDPCIGSGSQVASQETYGTLGAIVRSRTG-NQ 234
A ++ + T+ VD +R + P G + T G +R+ G
Sbjct: 94 LFAGGS--QQLMETQEVDAQTLALRIRPARPGFSVGHYKITAGTIGAGAYDLRTFPGIPP 151
Query: 235 QVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPE 294
+ L+N HV N P L PG + G GT P
Sbjct: 152 RYYVLSNNHV-----LANSNDASIGDPILQPGPFDG-------------------GTAPA 187
Query: 295 TFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPIN---------SLIGRQV 345
+ F+P D + N V +V V H+ID N + +G +
Sbjct: 188 DVIGRLARFVPIRFDGSCNYVDAAVAEV----PFHVIDRDVYWNGYPATAAKAATVGMLL 243
Query: 346 MKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFL--VVGENQQTFDLEGDSGSLILLTGQ 403
K GR++ TTG V A A N G F ++ N GDSGS++L
Sbjct: 244 KKTGRTTNFTTGRVTAVAATVNVNYGAGKVAKFCNQIITTNMSA---GGDSGSMVLDLQN 300
Query: 404 NGEKPRPVGIIWGGTA 419
N PVG+++ G++
Sbjct: 301 N-----PVGLLFAGSS 311
>gi|427382731|ref|ZP_18879451.1| hypothetical protein HMPREF9447_00484 [Bacteroides oleiciplenus YIT
12058]
gi|425729976|gb|EKU92827.1| hypothetical protein HMPREF9447_00484 [Bacteroides oleiciplenus YIT
12058]
Length = 435
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 85/210 (40%), Gaps = 31/210 (14%)
Query: 221 GTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHP--LPPSLGPGVYLGAVERATSF 278
GTLG V+ N +V LTNRHV V + ++HP P Y
Sbjct: 112 GTLGCFVKD--ANDRVYGLTNRHVGVSV---GSVLYHPKKTPVHCCSEKYCNH-----DC 161
Query: 279 ITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPI- 337
D+ I + D A I A D N EI D+ ++ +S I
Sbjct: 162 CIIDVKGNIGSVKKISQLTTTDSAIIELATDVKWKN---------EIVDIGVVKGESTIA 212
Query: 338 -NSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGS 396
L+G+ V K GR++ LTTG + + Y + + + +V+ F GDSGS
Sbjct: 213 PEELLGQTVRKRGRTTCLTTGKI---DICYYESVSSYQYREQIVIKNEGGIFAQGGDSGS 269
Query: 397 LILLTGQNGEKPRPVGIIWGGTANRGRLKL 426
+++ + + + ++WGG N G L
Sbjct: 270 VVV-----DKDDKVLALLWGGMGNDGVCNL 294
>gi|83595940|gb|ABC25300.1| hypothetical protein [uncultured marine bacterium Ant24C4]
Length = 396
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 113/261 (43%), Gaps = 34/261 (13%)
Query: 177 VEFSYYGAP-APTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQ 235
+ +S+ G P +P + + + V G C GS GTLGAIV+ ++G
Sbjct: 131 INYSHGGVPQVKSPSTQPHVQPVTEKGGIIAC-GSSINPVDIVGAGTLGAIVKDKSG--A 187
Query: 236 VGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGI--FAGTNP 293
LTN HV+ +Y P P L PG L A A T + F P
Sbjct: 188 FYGLTNNHVSGGCNYS-----APEIPILCPGP-LDAKNCAIDPFTIGRHKNLLQFVDGLP 241
Query: 294 ETF---VRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGR 350
E +D A ++ + +S +G+ + HI P+ + +V K GR
Sbjct: 242 ENVDISKNSDAAIFALSKP----DRVSSYQGLSQDTPKHI---GVPMGMM---KVTKHGR 291
Query: 351 SSGLTTGTVMAY-------ALEYNDEKGICFFTD-FLVVGENQQTFDLEGDSGSLILLTG 402
++GLT G ++ A Y + K + +F D +L+ EN + F GDSGSL++ T
Sbjct: 292 TTGLTRGKIIGISASPIDVAYSYGNMKKVVYFDDVWLIKKENDKPFSEPGDSGSLVIGTD 351
Query: 403 QNGEKPRPVGIIWGGTANRGR 423
G+K +G+++ G + G
Sbjct: 352 STGQK-IALGLVFAGNPHFGH 371
>gi|331269877|ref|YP_004396369.1| hypothetical protein CbC4_1696 [Clostridium botulinum BKT015925]
gi|329126427|gb|AEB76372.1| hypothetical protein CbC4_1696 [Clostridium botulinum BKT015925]
Length = 313
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 126/305 (41%), Gaps = 49/305 (16%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFS- 180
+G +G +++ G+ T I VFV RK+ + L +P + G+ DV+ ++ +
Sbjct: 29 VGVGLGIKLKNGIDTGQNCIKVFVTRKLPQNSLCKNALVPTLYQ---GIITDVEEIQNNN 85
Query: 181 -YYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFL 239
YY + +T+ V G G AS +G+LG IV+ G + F
Sbjct: 86 LYYPKNNFSSMNNPFTKRVRPTPG-----GYAIGPASNVLFGSLGCIVKDDMGKHYL-FS 139
Query: 240 TNRHVAVDLDYP-NQKMFHPLPPSLG--PGVYLGAVERATSFITDDLWYGIFAGTNPETF 296
+ + D P ++ P P G P +G + + P F
Sbjct: 140 SAHVLTADYTVPLGTEIIQPSYPFHGHAPNDTIGTLYKYI----------------PLNF 183
Query: 297 VRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTT 356
A+ A A +L+ V+ V +G+I V + P+ L V K G +GLT
Sbjct: 184 TGANFADAGIALVSDLSKVSNKVALIGDIKGVSL-----PVLRL---SVKKTGYKTGLTK 235
Query: 357 GTVMAYALE--YNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGII 414
GT+ + + Y+ E G F + L++ N GDSGS IL N + +GI+
Sbjct: 236 GTIKSIGVTRLYSYEHGAVLFKN-LILTSNMSN---PGDSGS-ILFDNSN----KAIGIL 286
Query: 415 WGGTA 419
+GG A
Sbjct: 287 FGGDA 291
>gi|390573926|ref|ZP_10254079.1| hypothetical protein WQE_35945 [Burkholderia terrae BS001]
gi|389934138|gb|EIM96113.1| hypothetical protein WQE_35945 [Burkholderia terrae BS001]
Length = 833
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 127/327 (38%), Gaps = 35/327 (10%)
Query: 139 PAILVFVARKVHRQWLSHVQC-----LPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEEL 193
PA++V V V H + +P L P G V VV A P +
Sbjct: 169 PAVIVLVRDWVDTTEFGHGKVDPDHMVPRTLYMPDGRAVPVCVVAVEPTVPAASAPADAR 228
Query: 194 YTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQ 253
+ G G P I + E ++G +V T LTNRHV + P +
Sbjct: 229 WPSTYIG--GGCPLIADAQGI---ERTASVGCLV---TDGHTTYALTNRHVCGEPGSPVK 280
Query: 254 KMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLN 313
+ +G A +R + + + FAG+ +F+ D I E + N
Sbjct: 281 ALLRGAVAEVGI-----ASDRQLTREPFTVVFPEFAGS--RSFLTLDIGLI---EVHDAN 330
Query: 314 NVTTSVKGV-GEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGI 372
+ ++ G+ G IG+V I+ S LI + + G +SG GT+ A + G
Sbjct: 331 DWSSQPFGIEGSIGNVADINELSLSLQLIDQPLTAFGSASGALDGTIKALFYRHKSLAGY 390
Query: 373 CFFTDFLVVGENQQTFDLEGDSGSLILL------TGQNGEKPRPVGIIWGGTANRGRLKL 426
+ + FL+ N GDSG+L L TG + P+ I WGG + L
Sbjct: 391 DYVSQFLIAPANGSPQTQPGDSGTLWYLTSPANTTGDGERRLTPLAIEWGGQS----LAS 446
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLI 453
G+ +N+ L LL++DL+
Sbjct: 447 DDGE-RLNYALATGLSTACQLLDVDLV 472
>gi|170699116|ref|ZP_02890171.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
gi|170135991|gb|EDT04264.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
Length = 313
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 96/227 (42%), Gaps = 37/227 (16%)
Query: 207 CIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPG 266
C GS ++ + GTLGAIV+ G+ LTN HV ++ + P L PG
Sbjct: 73 CCGSSISPGNEASAGTLGAIVKKSDGSLY--GLTNNHVTGGCNHSAIDL-----PILAPG 125
Query: 267 VYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLN-NVTTSVKGVGEI 325
V+ A + F G + E G A + ++N N ++ + E
Sbjct: 126 VFDVAAKTIIPFTI---------GFHSEVLPFVTGT----AGNVSINDNTDAALFRIAEP 172
Query: 326 GDVHIIDLQ---SPINSL---IGRQVMKVGRSSGLTTGTVMAYAL---------EYNDEK 370
DV Q +P NS+ +G +V KVGR++G TTG ++ L + N +
Sbjct: 173 ADVSSRQGQQYDTPANSVAPTVGMKVQKVGRTTGHTTGVIVGQQLRPIRVHAQSQRNKFQ 232
Query: 371 GICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGG 417
I + +V + + F GDSGSL++ G VGII G
Sbjct: 233 AIITMPNVYLVHGDYRPFSDSGDSGSLVVTNDGTGTN-YAVGIIMSG 278
>gi|253682715|ref|ZP_04863512.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562427|gb|EES91879.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 318
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 70/296 (23%), Positives = 121/296 (40%), Gaps = 73/296 (24%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G IG+++++ VLT I VF + K+ L +P+ +G DV+E
Sbjct: 41 VGVGIGYKVQKEVLTSEKCIAVFASEKIPNNELKREDLVPSVYKG-----IKTDVIETGI 95
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGS-GSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
+ +L + +R P +G G + + YGT+G +V N L+
Sbjct: 96 FST----------MKLSNRIR---PVLGGYGIAPVTTKYYGTMGCLVTDGIEN---FILS 139
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGA------VERATSFITDDLWYGIFAGTNPE 294
+ H+ DL+ N K+ P+ L P + G V + FI I PE
Sbjct: 140 SNHILADLN--NIKLGTPI---LQPAIINGGNPEKDQVAVLSKFIP---LRCINGTKRPE 191
Query: 295 TFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGL 354
++ + A+ N N V++ +K +G+ V +G+ V KVG S+ L
Sbjct: 192 NYMD-----VAIAKVINNNFVSSDIKFIGKPKGVR--------GHRLGQLVKKVGASTEL 238
Query: 355 TTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLE-----------GDSGSLIL 399
TTG + + ++V EN++ F ++ GDSGS++L
Sbjct: 239 TTGIIQ-------------YINVTIIVDENKKQFLMKKQLVTNAMAKPGDSGSILL 281
>gi|398802706|ref|ZP_10561909.1| S1/P1 Nuclease [Polaromonas sp. CF318]
gi|398098944|gb|EJL89217.1| S1/P1 Nuclease [Polaromonas sp. CF318]
Length = 757
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 93/228 (40%), Gaps = 27/228 (11%)
Query: 239 LTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLG-AVERATSFITDDLWYGIFAGTNPETFV 297
LTNRHV + P G V +G A ER + + Y FAG +T++
Sbjct: 179 LTNRHVCGEPGEPVHARLR------GEEVEVGHASERQLTRLPFTEVYPSFAGK--QTYL 230
Query: 298 RADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTG 357
D + E + + T+SV G+GEIG + ++ Q+ LI V G +SG G
Sbjct: 231 NLD---VGLVEVDDARDWTSSVYGIGEIGALADLNEQNLGLQLIDHPVSAFGAASGHLEG 287
Query: 358 TVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKP--------- 408
+ A Y G + D L+ ++ GDSG++ L + +
Sbjct: 288 RIKALFYRYKSVGGYDYVADLLIAPQDPAHQTQPGDSGTVWHLKAEEEKDSKGVPGKVSY 347
Query: 409 RPVGIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATN 456
RP+ + WG V N+ +L + LL+++L++ +
Sbjct: 348 RPLAVEWGAQT------FSVDGGAYNFALATNLSNVCKLLDVELVSAH 389
>gi|420256689|ref|ZP_14759520.1| hypothetical protein PMI06_09988 [Burkholderia sp. BT03]
gi|398042752|gb|EJL35726.1| hypothetical protein PMI06_09988 [Burkholderia sp. BT03]
Length = 749
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 125/327 (38%), Gaps = 35/327 (10%)
Query: 139 PAILVFVARKVHRQWLSHVQC-----LPAALEGPGGVWCDVDVVEFSYYGAPAPTPKEEL 193
PA++V V V H + +P L P G V VV A P +
Sbjct: 85 PAVIVLVRDWVDTTEFGHGKVDPDHMVPRTLYMPDGRAVPVCVVAVEPTVPAAGAPADAR 144
Query: 194 YTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQ 253
+ G G P I + E ++G +V T LTNRHV + P +
Sbjct: 145 WPSTYIG--GGCPLIADAQGI---ERTASVGCLV---TDGHTTYALTNRHVCGEPGSPVK 196
Query: 254 KMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLN 313
+ +G A +R + + + FAG+ +F+ D I E + N
Sbjct: 197 ALLRGAVAEVGI-----ASDRQLTREPFTVVFPEFAGS--RSFLTLDIGLI---EVHDAN 246
Query: 314 NVTTSVKGV-GEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGI 372
+ ++ G+ G IG+V I+ S LI + V G +SG GT+ A + G
Sbjct: 247 DWSSQPFGIEGGIGNVADINELSLSLQLIDQPVTAFGSASGALDGTIKALFYRHKSLAGY 306
Query: 373 CFFTDFLVVGENQQTFDLEGDSGSLILLT------GQNGEKPRPVGIIWGGTANRGRLKL 426
+ + FL+ N GDSG+L LT G + P+ I WGG +
Sbjct: 307 DYVSQFLIAPANGSPQTQPGDSGTLWYLTSAASTAGDGERRLTPLAIEWGGQSLASDDGA 366
Query: 427 KVGQPPVNWTSGVDLGRLLDLLELDLI 453
+ +N+ L LL++DL+
Sbjct: 367 R-----LNYALATGLSTACQLLDVDLV 388
>gi|331269221|ref|YP_004395713.1| hypothetical protein CbC4_1036 [Clostridium botulinum BKT015925]
gi|329125771|gb|AEB75716.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 302
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 128/305 (41%), Gaps = 52/305 (17%)
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
+R +G +G++I G IP I V V+ K+ + + +P +G DVV
Sbjct: 20 KRNVVGVGLGYKITNGFCKFIPCIKVLVSTKIPPNEIPPNESIPEHFKG-----LITDVV 74
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVG 237
+ A + T K ++ G IG S + S G++ +V T +
Sbjct: 75 QSGNISASSLTTKAR---PVLGGYS-----IGPSSGIRS----GSMACLV---TDGKHYY 119
Query: 238 FLTNRHVAVDLDYPNQKMFHPLP---PSLGPGVYLGAVERATSFITDDLWYGIFAGTNPE 294
L+N HV V Y N LP P L PG+ G T + + T+ E
Sbjct: 120 ILSNNHVLV---YGNV-----LPIGTPVLQPGIEDGGQPLDDKVATLSKYAQLKFITHKE 171
Query: 295 TFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGL 354
T + D +L V++ + +G I + SP+ +G V KVGRS+GL
Sbjct: 172 TPTNYIDCALAQVNDKSL--VSSKLAIIGSIKGI-----TSPV---LGESVKKVGRSTGL 221
Query: 355 TTGTVMAY--ALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVG 412
TTG +++ + N + G C F + + + + GDSGSL++ + + VG
Sbjct: 222 TTGKILSIGSTVSVNFKAGKCLFKNQITTTKMAE----AGDSGSLLVNSSHHA-----VG 272
Query: 413 IIWGG 417
+++ G
Sbjct: 273 LLFSG 277
>gi|147676419|ref|YP_001210634.1| hypothetical protein PTH_0084 [Pelotomaculum thermopropionicum SI]
gi|146272516|dbj|BAF58265.1| hypothetical protein PTH_0084 [Pelotomaculum thermopropionicum SI]
Length = 335
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 137/342 (40%), Gaps = 66/342 (19%)
Query: 110 RAFHSKILRRFSL----GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALE 165
RAF + SL G +G++ G T PA +++V +K+ L+ +P ++
Sbjct: 6 RAFKKTRAKLLSLENVVGIGVGYKQTGGENTGEPAFIIYVEKKMPAAGLARGSVIPKRID 65
Query: 166 GPGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGA 225
G DV+E E PC S Q T GTLGA
Sbjct: 66 G-----LITDVIEIGRVKMLGVRTSRE------------RPCQPGVSVGHYQSTAGTLGA 108
Query: 226 IVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWY 285
+VR R +++ L+N HV + ++ P L PG Y G + + D
Sbjct: 109 VVRDRE-TKKLMILSNNHVLANGSSESEAKAKQGDPILQPGPYDGGTLKDRIGVLDRYVP 167
Query: 286 GIFAGTNPETFVRADGA------FIPFAEDFNL---------NNVTTS---------VKG 321
+ + + V A A F +++ + N V + VK
Sbjct: 168 LVKSAVKADCPVAAAVARGGTRLLNIFKQNYEVRFYKRLYGENTVDCALARLDSEDLVKA 227
Query: 322 -VGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMA----YALEYNDEKGICFFT 376
+ +IGD+ + P G V K GR++GLT+G V + +E D++ + +F+
Sbjct: 228 TILDIGDITGVSEAGP-----GDLVQKSGRTTGLTSGVVKSVNTTLQVEMKDDEKL-WFS 281
Query: 377 DFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGT 418
D +V Q GDSGSL++ ++ + VG+++ G+
Sbjct: 282 DQVVADMVSQ----PGDSGSLVV-----DQERKVVGLLFAGS 314
>gi|302388636|ref|YP_003824457.1| hypothetical protein Toce_0037 [Thermosediminibacter oceani DSM
16646]
gi|302199264|gb|ADL06834.1| conserved hypothetical protein [Thermosediminibacter oceani DSM
16646]
Length = 334
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 82/334 (24%), Positives = 134/334 (40%), Gaps = 51/334 (15%)
Query: 109 IRAFHSKILR-RFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGP 167
+R + K+LR +GT +G++I G +T+ PA++V V +K + L Q +P L+
Sbjct: 8 LRRYERKLLRLENVVGTGLGYKIIEGRITNEPAVIVLVRKKKPERELPASQVVPKKLD-- 65
Query: 168 GGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIV 227
D++E L T R + P + G + T GT GA+V
Sbjct: 66 ---EVYTDIIEVG---------DVRLLTARTQKTRPAMPGMSIGHY---KITAGTFGAVV 110
Query: 228 RSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLG-----AVERATSFI--T 280
R + + + L+N HV + P + PG Y G + FI
Sbjct: 111 RDQITGEPL-ILSNNHVLANASNGRDGRAAVGDPIMQPGPYDGGGPEDVIAHLYRFIPVE 169
Query: 281 DDLWYG----IFAGTNPETF----VRAD--GAFIPFAEDFNLNNVTTSVKGVGEIGDVHI 330
D+ + G N F +R D AF+ +NL + + + I
Sbjct: 170 KDVTHSRCPIARRGENLLNFFVRMIRPDYRVAFMKHRAAYNLVDAAVAKPINPDYISPEI 229
Query: 331 IDL---QSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGI---CFFTDFLVVGEN 384
+DL + IG ++K GR+SG++ V A ++ G F D ++ G
Sbjct: 230 LDLGEIRGIAEPRIGMTLVKSGRTSGVSKSEVKALNVKIRVMMGAGEEATFYDQILTGPM 289
Query: 385 QQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGT 418
Q GDSGSL+L E VG+++ G+
Sbjct: 290 AQ----PGDSGSLVL-----NENMEAVGLLFAGS 314
>gi|168041453|ref|XP_001773206.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675565|gb|EDQ62059.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 188
Score = 47.4 bits (111), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 28/38 (73%)
Query: 386 QTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRGR 423
+ F+L DS SLIL+ + GE+PR VG++WGG A+ GR
Sbjct: 49 RAFELGSDSQSLILVREEAGERPRLVGVVWGGCASNGR 86
>gi|378551300|ref|ZP_09826516.1| hypothetical protein CCH26_14474 [Citricoccus sp. CH26A]
Length = 374
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 132/348 (37%), Gaps = 76/348 (21%)
Query: 105 ELMTIR----AFHSKILRR-FSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQC 159
EL I+ A +L R +G IG ++ G T P+ILVFV H++ + +
Sbjct: 9 ELAVIKPVKEAIEDDLLARPGVVGVDIGEKVSHGKKTGEPSILVFVE---HKKPVKALPP 65
Query: 160 LPAALEGPGGVWCDVDVVEFSYYGA-----PAPTPKEELYTELVDG--------LRGSDP 206
GV DV + A PA Y L G +R P
Sbjct: 66 EEVVPPEVDGVKTDVQEMVIELQAARQLLVPAQQVDPAAYPRLAGGISMGPARSIRMEPP 125
Query: 207 CIGSGSQVASQETY---GTLGAIVRSRTGNQQVGFLTNRHVAVDLD--YPNQKMFHPLPP 261
+VA Y GTLGA+VR R + +TN HVA D +M P P
Sbjct: 126 ------EVAEAGEYVFVGTLGAMVRDRASGATLA-MTNFHVACVDDGWAAGDRMIQPGRP 178
Query: 262 SLGPGVY--LGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSV 319
G G++ RA ++++ DGA + E +NV
Sbjct: 179 DGGDATTQQFGSLARA--VLSEN----------------TDGAVVTVDEGKEWDNV---- 216
Query: 320 KGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMA----YALEYNDEKGICFF 375
V +IGDV + IG V K GR++ T GTV + +L+Y D G
Sbjct: 217 --VMDIGDV-----AGSAEASIGLAVQKRGRTTQHTFGTVASAEATLSLDYGDGMGTRTL 269
Query: 376 ---TDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
L Q F GDSGS++L +N VG+++ G+ +
Sbjct: 270 RHQVRILTDTARSQRFSEGGDSGSVVLDMDRN-----VVGLLFAGSTD 312
>gi|258650626|ref|YP_003199782.1| hypothetical protein Namu_0364 [Nakamurella multipartita DSM 44233]
gi|258553851|gb|ACV76793.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233]
Length = 765
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 73/167 (43%), Gaps = 15/167 (8%)
Query: 294 ETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSG 353
T++ D A + E +L + T+ G+ +G + + ++ LI QV G +SG
Sbjct: 245 RTYLTLDAALV---EVNDLADWTSQTYGLPPVGALADLSERNIGMQLINAQVTAYGAASG 301
Query: 354 LTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKP----- 408
TG + A + G TDFL+ + Q GDSG++ L + E+P
Sbjct: 302 RLTGRIAALFYRHRSMGGYDEITDFLIAPDPGQPSSQPGDSGTVWHLI-EPSEQPDDPAR 360
Query: 409 --RPVGIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLI 453
RP+ + WGG R P N+ L +L LL+++L+
Sbjct: 361 RLRPIALQWGGQGVRPADP----GPGYNFALAAGLTAILRLLDVELV 403
>gi|253771263|ref|YP_003034130.1| hypothetical protein CLG_A0037 [Clostridium botulinum D str. 1873]
gi|253721415|gb|ACT33707.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 319
Score = 47.0 bits (110), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 117/312 (37%), Gaps = 73/312 (23%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G +G+++ G T I VFV +KV+ L +PA +G D V+ Y
Sbjct: 43 VGVGLGYKVTSGFCTFQKCIKVFVTKKVYENELPEADLVPAIYKG-----IITDTVDSGY 97
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
+ + T K P I S GTLG +V T FL+N
Sbjct: 98 FQPQSLTEKIR-------------PVICGYSLGPVNALGGTLGCLV---TDGFSRFFLSN 141
Query: 242 RHVAVDLDYPNQKMFHP-LPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
HV D + + + P L PS G G +P V
Sbjct: 142 NHVLADFN--SLSINTPILQPSANDG-----------------------GKSPADVVGNL 176
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIID--LQSPINSLIG-----------RQVMK 347
FIP T V + +ID + SP +L+G V K
Sbjct: 177 SNFIPLERVTAFKRPTNYV----DCAIARLIDKSIASPAIALVGPPKGTKQPQLNSSVKK 232
Query: 348 VGRSSGLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDLE-GDSGSLILLTGQNGE 406
VG++S LTTGT+ A + Y + GI + L + TF + GDSGS +LL N
Sbjct: 233 VGKTSELTTGTITAINVTYTADYGI---KEVLFKNQIVTTFLSQPGDSGS-VLLDNDN-- 286
Query: 407 KPRPVGIIWGGT 418
+G+I GG+
Sbjct: 287 --YVLGLIIGGS 296
>gi|399021530|ref|ZP_10723627.1| hypothetical protein PMI16_04605 [Herbaspirillum sp. CF444]
gi|398091303|gb|EJL81750.1| hypothetical protein PMI16_04605 [Herbaspirillum sp. CF444]
Length = 351
Score = 46.6 bits (109), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 62/140 (44%), Gaps = 16/140 (11%)
Query: 290 GTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVG--------EIGDVHIIDLQSPINS-L 340
G +P + A F+P A + V ++ E G+ + + +P+ +
Sbjct: 185 GNDPADVIGALSYFVPLAAPGGTSPVDAAIAAFDDTKNDPRMERGENKVEKMVAPVTAPY 244
Query: 341 IGRQVMKVGRSSGLTTGTVMAYALEYNDE---KGICFFTDFLVVGENQQTFDLEGDSGSL 397
+G +V K GR++G+T G V A AL + G+ + V F L GDSGS+
Sbjct: 245 VGMEVQKSGRTTGVTKGKVTAIALTIATDYAGYGVVTIQNTFSVKHVSGYFSLPGDSGSV 304
Query: 398 ILLTGQNGEKPRPVGIIWGG 417
I QN PVG+++ G
Sbjct: 305 ITTASQN----NPVGLLFAG 320
>gi|395448531|ref|YP_006388784.1| hypothetical protein YSA_09065 [Pseudomonas putida ND6]
gi|388562528|gb|AFK71669.1| hypothetical protein YSA_09065 [Pseudomonas putida ND6]
Length = 409
Score = 46.6 bits (109), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 100/230 (43%), Gaps = 41/230 (17%)
Query: 208 IGSGSQVASQETY--GTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPNQKM--FHPLP--- 260
I GS V + + + GTLG + R G + VGF +N HV + ++ M P P
Sbjct: 166 ISCGSSVTTSQVFDAGTLGFLARLADG-RLVGF-SNNHVTGECNHTPHGMHILSPSPMDA 223
Query: 261 -PSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSV 319
P+ P V +G T F L G N T D A E + +S+
Sbjct: 224 SPASPPPVAIG-----THFALAPLNSG---DPNQITLQETDAAIFLVTEP----DKVSSM 271
Query: 320 KGVGEIGDVHIIDLQSPINSL-IGRQVMKVGRSSGLTTGTVMA-----YALEY--NDEKG 371
+G G D S +L G +V KVGR++GL GTV+ + L Y N +
Sbjct: 272 QGNG------FYDTPSETVALRAGLRVKKVGRTTGLRAGTVLGQMVAPFYLPYKSNRFQS 325
Query: 372 ICFFTDFLVV-GENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTAN 420
I +F+ V G+ TF GDSGSL++ + R VG+++ G N
Sbjct: 326 IVYFSGVWAVQGDGGNTFSEGGDSGSLVVTE----DGTRSVGVVFAGGNN 371
>gi|443289395|ref|ZP_21028489.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
08]
gi|385887548|emb|CCH16563.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
08]
Length = 528
Score = 45.8 bits (107), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 57/123 (46%), Gaps = 17/123 (13%)
Query: 123 GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALE-GPGGVWCDVDVVEFSY 181
G A G R G TD PA++V+V RKV RQ+L + LP + GP + +VDVVE
Sbjct: 35 GLAYGRREVSGRRTDEPALVVYVVRKVPRQFLPTTRLLPRRVYFGPD--FVEVDVVETGP 92
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
+ A T +E P S T GTLGA+V T + + L+N
Sbjct: 93 FFAQEFTARER-------------PAPNGVSIAHIDVTAGTLGALVTDNT-DGSLCILSN 138
Query: 242 RHV 244
HV
Sbjct: 139 NHV 141
>gi|357040054|ref|ZP_09101844.1| hypothetical protein DesgiDRAFT_2960 [Desulfotomaculum gibsoniae
DSM 7213]
gi|355357034|gb|EHG04813.1| hypothetical protein DesgiDRAFT_2960 [Desulfotomaculum gibsoniae
DSM 7213]
Length = 333
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 75/326 (23%), Positives = 138/326 (42%), Gaps = 63/326 (19%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G +G++ T+ PAI++FV +KV L Q LP ++G + DV+E
Sbjct: 22 VGVGVGYKQVGLTQTNKPAIIIFVEKKVPAANLQRSQKLPPKIDG-----LETDVIEIGR 76
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
L+D + P + S + + GT GA+VR + +++ L+N
Sbjct: 77 -------------VRLLDRVMKMRPALPGSSVGHYKISAGTFGAVVRDKNTGEKL-ILSN 122
Query: 242 RHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWY--------------GI 287
H+ + + L PG Y G +A+ I + + + +
Sbjct: 123 NHILANGTNGSDGRASVGDAILQPGPYDGG--KASDKIAELIRFIPLIRTAQPSECPVAV 180
Query: 288 FAGTNPETFVRA-----DGAFIPFAEDFNLNNVTTS--VKGVGEIGDVHIIDLQSPINSL 340
F+R + F ++ N+ + + +K G IG+ +++L +
Sbjct: 181 GVAGIGNRFIRLIRPAYEMRFYKYSRSTNIVDCAVARPIK-TGLIGE-ELVELGAVTGVE 238
Query: 341 IGRQ---VMKVGRSSGLTTGTVMAYALEY-----NDEKGICFFTDFLVVGENQQTFDLEG 392
R+ V K GR++G+T+G V A + +DE G +F+D +V Q G
Sbjct: 239 EAREGMWVQKSGRTTGVTSGLVTAMGVTLKVSLSDDESG--WFSDQVVADVMCQ----PG 292
Query: 393 DSGSLILLTGQNGEKPRPVGIIWGGT 418
DSGSLI+ G++ + VG+++ G+
Sbjct: 293 DSGSLII-----GKENKAVGLLFAGS 313
>gi|416354626|ref|ZP_11681687.1| hypothetical protein CBCST_10406 [Clostridium botulinum C str.
Stockholm]
gi|338195372|gb|EGO87663.1| hypothetical protein CBCST_10406 [Clostridium botulinum C str.
Stockholm]
Length = 259
Score = 45.4 bits (106), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 112/274 (40%), Gaps = 62/274 (22%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G IG+++++ VLT I VF ++K+ L +P+ +G DV+E
Sbjct: 41 VGVGIGYKVQKEVLTSEKCIAVFASKKIPNNELKREDLVPSVYKG-----IKTDVIETGI 95
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGS-GSQVASQETYGTLGAIVRSRTGNQQVGFLT 240
+ +L + +R P +G G + + YGT+G +V N L+
Sbjct: 96 FST----------MKLSNRIR---PVLGGYGIAPVTTKYYGTMGCLVTDGIENF---ILS 139
Query: 241 NRHVAVDLDYPNQKMFHPLPPSLGPGVYLGA------VERATSFITDDLWYGIFAGTNPE 294
+ H+ DL+ N K+ P+ L P + G V + FI I PE
Sbjct: 140 SNHILADLN--NIKLGTPI---LQPAIVNGGNPEKDQVAVLSKFIP---LRSINGTKRPE 191
Query: 295 TFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGL 354
++ + A+ N N V++ +K +G+ V +G+ V KVG S+ L
Sbjct: 192 NYMD-----VAIAKVINNNFVSSDIKFIGKPKGVR--------GHRLGQLVKKVGASTEL 238
Query: 355 TTGTVMAYALEYNDEKGICFFTDFLVVGENQQTF 388
TTG + + ++V EN++ F
Sbjct: 239 TTGIIQ-------------YMNVTIIVDENKKQF 259
>gi|253682482|ref|ZP_04863279.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562194|gb|EES91646.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 305
Score = 45.4 bits (106), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 72/290 (24%), Positives = 122/290 (42%), Gaps = 63/290 (21%)
Query: 123 GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYY 182
G +G++++ G T I VFV KV + + +P+ + G+ DV+ + S
Sbjct: 30 GIGLGYKVKNGFDTHKKCIKVFVDVKVSKNNIPLHDLIPSYYD---GIETDVEQIGIS-- 84
Query: 183 GAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNR 242
+ K+++ VDG P IGS S GT G +V T + + L+N
Sbjct: 85 --TMCSLKDKV--RPVDGGYNISPLIGSPS--------GTFGCLV---TDGRFMYLLSNC 129
Query: 243 HV-----AVDLDYPNQKMFHPLPPSLGPGVYLGA------VERATSFITDDLWYGIFAGT 291
HV A LD P L PG G + + +I I +
Sbjct: 130 HVLATNGATPLD----------CPILQPGRKYGGKDPEDKIAILSKYIEPKY---ITPTS 176
Query: 292 NPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRS 351
+PE FV A+ +L+ V+ +K +G I + +++G V KVG +
Sbjct: 177 SPENFVDC-----AIAKITDLSKVSNKIKFLGNI--------KGTAPAILGESVQKVGCT 223
Query: 352 SGLTTGTVMAYALEYNDE--KGICFFTDFLVVGENQQTFDLEGDSGSLIL 399
+ LT G ++A + + KG C F + ++ + + +GDSGS++L
Sbjct: 224 TELTKGKIIALGVTITIQRPKGNCIFKNQILTNKMGE----KGDSGSILL 269
>gi|331271090|ref|YP_004385799.1| hypothetical protein CbC4_6002 [Clostridium botulinum BKT015925]
gi|329127585|gb|AEB77527.1| hypothetical protein CbC4_6002 [Clostridium botulinum BKT015925]
Length = 313
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 75/303 (24%), Positives = 125/303 (41%), Gaps = 83/303 (27%)
Query: 120 FSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEF 179
+ +G A+G++I+ G +T+ I VFV++KV L + +P + + DVVE
Sbjct: 34 YVVGIALGYKIKNGFITNKKCIKVFVSKKVPLSNLYEHEVIPKFFK-----CIETDVVES 88
Query: 180 SYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQ-VASQETYGTLGAIVRSRTGNQQVGF 238
+ A T K P IG S V++ G+LG +V T +
Sbjct: 89 GEFSAAEFTGKVR-------------PVIGGYSIGVSNVRGVGSLGCLV---TDGRYKYI 132
Query: 239 LTNRHVAVDLDYPNQKMFHPLP---PSLGPGVYLGAVERATSFITDDLWYGIFAGTNPET 295
L+N HV DL+ +P P + PG+ DD G P T
Sbjct: 133 LSNNHVIADLN--------KIPIGTPIIQPGL-------------DD-------GGKPST 164
Query: 296 -FVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIID--LQSPINSLIG---------- 342
V +IP + + TS + +I+ + SP +++G
Sbjct: 165 DIVALLSKYIPLKTE----GIITSPTNYTDCAIAKLINESIASPKIAIVGAPEGTMIPII 220
Query: 343 -RQVMKVGRSSGLTTGTVM----AYALEYNDEKGICFFTDFLVVGENQQTFDLE-GDSGS 396
+ V KVGRS+ +TTG + + + ++ ++ FF + +V T+ E GDSGS
Sbjct: 221 DKGVRKVGRSTEMTTGRITDIDGTFHIRFDSKR--VFFEEQIV-----TTYMCEDGDSGS 273
Query: 397 LIL 399
++L
Sbjct: 274 ILL 276
>gi|134297959|ref|YP_001111455.1| hypothetical protein Dred_0080 [Desulfotomaculum reducens MI-1]
gi|134050659|gb|ABO48630.1| conserved hypothetical protein [Desulfotomaculum reducens MI-1]
Length = 336
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 70/328 (21%), Positives = 130/328 (39%), Gaps = 65/328 (19%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G +G++ T AI++FV +K LS + +P + G + DV+E
Sbjct: 22 VGVGVGYKHVGMERTQQKAIIIFVTKKEDLGNLSREELVPFKING-----LETDVIEVGD 76
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
K+ + R + P + G + T GT GA+VR R+ + + L+N
Sbjct: 77 IRFLEEDRKKHV--------RPAQPGMSVGHY---RVTAGTFGAMVRDRSTGEPL-ILSN 124
Query: 242 RHVAVD-------LDYPNQKMFHP------------------LPPSLGPGVYLGAVERAT 276
H+ + P +F P +P G +
Sbjct: 125 NHILANGTDGKDGRSAPGDLIFQPGEYDGGTKADRIATLIRFIPIQKGEAPASCPIANGV 184
Query: 277 SFITDDLWYGIFAGTNPETFVR---ADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDL 333
+ I + L + I + + F R A+ A + + ++ + G+G++
Sbjct: 185 ARIANMLVHTIRPNYDLKFFKREGVANHVDCAVARPLSPDLISDEILGIGKV-------- 236
Query: 334 QSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYN---DEKGICFFTDFLVVGENQQTFDL 390
Q I++ G +V K GR++G+T+G V A D+ +F++ ++ Q
Sbjct: 237 QGIIDAKPGMKVKKSGRTTGITSGVVTAIGTTMQVKMDDNNNAYFSNQVICDMKSQG--- 293
Query: 391 EGDSGSLILLTGQNGEKPRPVGIIWGGT 418
GDSGSL+L G + VG+++ G+
Sbjct: 294 -GDSGSLVLTEGN-----KAVGLLFAGS 315
>gi|416365266|ref|ZP_11682761.1| hypothetical protein CBCST_17192 [Clostridium botulinum C str.
Stockholm]
gi|338194035|gb|EGO86591.1| hypothetical protein CBCST_17192 [Clostridium botulinum C str.
Stockholm]
Length = 305
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 71/290 (24%), Positives = 121/290 (41%), Gaps = 63/290 (21%)
Query: 123 GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYY 182
G +G++++ G T I +FV KV + +P+ + G+ DV+ + S
Sbjct: 30 GIGLGYKVKNGFDTHKKCIKIFVDVKVSENNIPLHDLIPSYYD---GIETDVEQIGIS-- 84
Query: 183 GAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNR 242
+ K+++ VDG P IGS S GT G +V T + + L+N
Sbjct: 85 --TMCSLKDKV--RPVDGGYNISPLIGSPS--------GTFGCLV---TDGRFMYLLSNC 129
Query: 243 HV-----AVDLDYPNQKMFHPLPPSLGPGVYLGA------VERATSFITDDLWYGIFAGT 291
HV A LD P L PG G + + +I I +
Sbjct: 130 HVLATNGATPLD----------CPILQPGRKYGGKDPEDKIAILSKYIEPKY---ITPTS 176
Query: 292 NPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRS 351
+PE FV A+ +L+ V+ +K +G I + +++G V KVG +
Sbjct: 177 SPENFVDC-----AIAKVTDLSKVSNKIKFLGNI--------KGTAPAILGESVQKVGCT 223
Query: 352 SGLTTGTVMAYALEYNDE--KGICFFTDFLVVGENQQTFDLEGDSGSLIL 399
+ LT G ++A + + KG C F + ++ + + +GDSGS++L
Sbjct: 224 TELTKGKIIALGVTITIQRPKGNCIFKNQILTNKMGE----KGDSGSILL 269
>gi|225166828|ref|YP_002650813.1| conserved hypothetical protein [Clostridium botulinum]
gi|253771431|ref|YP_003034186.1| hypothetical protein CLG_0045 [Clostridium botulinum D str. 1873]
gi|225007492|dbj|BAH29588.1| conserved hypothetical protein [Clostridium botulinum]
gi|253721408|gb|ACT33701.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 306
Score = 42.7 bits (99), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 125/325 (38%), Gaps = 92/325 (28%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDV---DVVE 178
+G +G++I+ G T + VFV K LP +CD+ D+V
Sbjct: 29 VGVGLGYKIKNGFNTFQKCLSVFVTNK-----------LP---------FCDIPSNDMVP 68
Query: 179 FSYYGAPAPTPKEELY--TELVDGLR----GSDPCIGSGSQVASQETYGTLGAIVRSRTG 232
YYG P + +L +R G D IG V GTLG IV T
Sbjct: 69 SYYYGIPTDVINTGAFHLQKLTQKIRPVPGGYD--IGPALIVEG----GTLGCIV---TD 119
Query: 233 NQQVGFLTNRHV-----AVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGI 287
+ LT H V + YP + PS +
Sbjct: 120 GKYYHILTCNHSLTAKEVVTVTYPITQ------PSC-----------------------V 150
Query: 288 FAGTNPETFVRADGAFIPF----AEDFNLNNVTTSVKGVGEIGDVHI-IDLQSPINSL-- 340
+ G PE + +IP + N+N V ++ + + + I+ I +
Sbjct: 151 YGGNYPEDIIARISKYIPINNSTTTNENINYVDCAIAKINKRSQISTKINFLGRIKGMTK 210
Query: 341 --IGRQVMKVGRSSGLTTGTVMAY--ALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGS 396
+G V KVG ++ LT GTV + LE+N+ +G F D ++ + + EGDSGS
Sbjct: 211 ASLGLNVQKVGANTELTEGTVTSVGATLEFNEPQGKFIFVDQIITNKMSE----EGDSGS 266
Query: 397 LILLTGQNGEKPRPVGIIWGGTANR 421
+++ + + VG++ GG + +
Sbjct: 267 ILV-----DKNIQAVGMLMGGGSTK 286
>gi|297623499|ref|YP_003704933.1| hypothetical protein [Truepera radiovictrix DSM 17093]
gi|297164679|gb|ADI14390.1| conserved hypothetical protein [Truepera radiovictrix DSM 17093]
Length = 323
Score = 42.7 bits (99), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 90/234 (38%), Gaps = 29/234 (12%)
Query: 188 TPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVD 247
TP++E+ +V G + + G+ + + GTLGA + G L+N HV
Sbjct: 94 TPEQEVLDPVVLGAQIQN---GAADERSGGYGVGTLGAFYPAPEGGTL--LLSNNHVIAA 148
Query: 248 LDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFA 307
+ P+++ +G +Y R W + +P RAD A
Sbjct: 149 ENTPDEEHAR-----VGDPIYQAQRGRGRVVARLSAWVPL----SPTAPNRADIASAALL 199
Query: 308 EDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYN 367
+ N +G G + + +G++V KVGR+SGLT GTV A
Sbjct: 200 PETVFENAFLPPRGRPAPGATQLAAPR------VGQRVFKVGRTSGLTFGTVSAVGARVP 253
Query: 368 DEK----GICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGG 417
F ++ G N TF GDSGS G K R VG ++ G
Sbjct: 254 RVAYGFGSAAFEGSVIIEGLNGSTFSAPGDSGS-----GIYDLKGRLVGFLYAG 302
>gi|402772295|ref|YP_006591832.1| protease [Methylocystis sp. SC2]
gi|401774315|emb|CCJ07181.1| Putative protease [Methylocystis sp. SC2]
Length = 495
Score = 42.7 bits (99), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 81/196 (41%), Gaps = 22/196 (11%)
Query: 233 NQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTN 292
N + GF+TN H + N FH L G +G + + T G
Sbjct: 232 NGRDGFITNSHCTKNRGVSNDDDFHQPNDPLLSGNKIGDEDADPPYFT--------GGQC 283
Query: 293 P--ETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIG-----DVHIIDLQSPINSLIGRQV 345
P +D A+ + D + + VG + V I ++P +S++G ++
Sbjct: 284 PSGRKCRFSDSAYADYRIDRGRFEIARTTNNVGSLTINSFPGVFRIMSETP-DSMVGMRL 342
Query: 346 MKVGRSSGLTTGTVMAYALEYN----DEKGICFFTDFLVVGENQQTFDLEGDSGSLILLT 401
KVGR++G G V A ++ N D + +C + V G N+ T + GDSGS +
Sbjct: 343 NKVGRTTGWAFGDVRATCIDVNVADTDVRLLCQSSVARVSGTNKLTDN--GDSGSPVFSI 400
Query: 402 GQNGEKPRPVGIIWGG 417
+ GI+WGG
Sbjct: 401 LPTASQASLHGILWGG 416
>gi|258513478|ref|YP_003189700.1| hypothetical protein Dtox_0114 [Desulfotomaculum acetoxidans DSM
771]
gi|257777183|gb|ACV61077.1| conserved hypothetical protein [Desulfotomaculum acetoxidans DSM
771]
Length = 164
Score = 42.7 bits (99), Expect = 0.60, Method: Composition-based stats.
Identities = 51/181 (28%), Positives = 77/181 (42%), Gaps = 25/181 (13%)
Query: 106 LMTIRAFHSKILRRFSL-GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAAL 164
L ++ KILRR ++ G +G ++ RG T AI+VFV +K+ + + + LP +
Sbjct: 5 LNVMKVHRKKILRRKNVVGVGVGTKLTRGEDTGKTAIVVFVKKKLPQAEIYGTEVLPKKI 64
Query: 165 EGPGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLG 224
+VDVVE T D R + P + S + T GTLG
Sbjct: 65 ND-----LEVDVVEIGTVRLLGRT----------DRGRPAQPGV---SIAHYKSTAGTLG 106
Query: 225 AIVRS-RTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDL 283
AIVR TG + + L+N HV + P L PG ++ ++ DL
Sbjct: 107 AIVRDLETGEKFI--LSNNHVLANATNGRDGRSQLGDPILQPGGWVSLLKEKPRI---DL 161
Query: 284 W 284
W
Sbjct: 162 W 162
>gi|379059056|ref|ZP_09849582.1| Equine arteritis virus peptidase S32 [Serinicoccus profundi MCCC
1A05965]
Length = 440
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 124/318 (38%), Gaps = 76/318 (23%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G IG +I G T +I+V+V +KV ++ Q +PA L+G
Sbjct: 29 VGVDIGEKISDGKPTGEMSIVVYVEKKVAPSKVARSQKVPAELDG--------------- 73
Query: 182 YGAPAPTPKEELYTELVD--GLRGSDP--------CIGSGSQV--ASQETYGTLGAIVRS 229
PT +EL EL GL DP I G + + + GT GA+VR
Sbjct: 74 ----IPTDVQELVIELQGGPGLYAGDPLSDTSKHTTIRGGISIGPSRHQNAGTAGALVRD 129
Query: 230 RTGNQQVGFLTNRHVA-VDLDYPNQKMFHPLPPSLGPGVYLG---AVERATSFITDDLWY 285
T V LTN HVA VD + + L PG + AV++ + L
Sbjct: 130 TT-TGAVSLLTNFHVACVDTSWTAGETV------LQPGRFDSGNPAVDQVGT-----LTR 177
Query: 286 GIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQV 345
G+ + VR DG + E ++ V S V G V
Sbjct: 178 GVISEQVDGAVVRLDGDEVWADEVVDIGGVVGSTPAVA------------------GMAV 219
Query: 346 MKVGRSSGLTTGTVMA----YALEYNDEKGICFFTDFLVVGENQQT--FDLEGDSGSLIL 399
K GR++ T G V++ L+Y D G+ + + T F GDSGS+++
Sbjct: 220 QKRGRTTEHTHGEVVSVDATVTLDYGDGVGMRTLRRQVSIRPAAGTARFSDRGDSGSVVM 279
Query: 400 LTGQNGEKPRPVGIIWGG 417
G+ + VG+++ G
Sbjct: 280 NAGR-----QVVGLLFAG 292
>gi|331270132|ref|YP_004396624.1| hypothetical protein CbC4_1955 [Clostridium botulinum BKT015925]
gi|329126682|gb|AEB76627.1| hypothetical protein CbC4_1955 [Clostridium botulinum BKT015925]
Length = 322
Score = 42.4 bits (98), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 143/348 (41%), Gaps = 67/348 (19%)
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
+R G +G++ G T I VFV++K+ ++ +PA + DVV
Sbjct: 25 KRNVQGIGLGYKKINGKCTFRKCIRVFVSKKLPSNDIAKEDLIPAYFN-----YIPTDVV 79
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVG 237
E + A ++G C G YGTLG +V+++ + V
Sbjct: 80 ESGVFTTCA-----------LNGRIRPTQC-GYSIGPVGIGIYGTLGCLVKNKR-EKAVY 126
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFI-----TDDLWYGIFAGTN 292
L+ HV P +KM P + PGV G R T+ + G F+
Sbjct: 127 LLSASHVL----NPLEKMSFG-TPIVQPGVLDGGNIRNDVIANLVRSTNIKYIGTFS--K 179
Query: 293 PETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSS 352
PE V A A + D +L V+T++ VG+ D++ + IG +V KVGR++
Sbjct: 180 PENTVDAAVAKV---SDISL--VSTTMAIVGK-------DVKQIASPKIGEKVFKVGRTT 227
Query: 353 GLTTGTVMAYALEYNDEKGICFFTDFLVVGENQQTFDL---EGDSGSLILLTGQNGEKPR 409
G T G + D I + + + Q D+ +GDSGS++L E
Sbjct: 228 GYTEGEITE-----TDVTQIINSSGKKALFKGQIAADVKSDKGDSGSVLL-----NENMN 277
Query: 410 PVGIIWGGTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATNE 457
P+G++ G + Q V ++ D+ ++ L +++I T+E
Sbjct: 278 PIGLLMGAS-----------QSTV-YSVFNDMKKVTSALNVEIITTSE 313
>gi|327401310|ref|YP_004342149.1| hypothetical protein Arcve_1431 [Archaeoglobus veneficus SNP6]
gi|327316818|gb|AEA47434.1| hypothetical protein Arcve_1431 [Archaeoglobus veneficus SNP6]
Length = 345
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 69/300 (23%), Positives = 120/300 (40%), Gaps = 51/300 (17%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G IG+R+R +T I VFV +K+ + L+ + +P L+G DV+E
Sbjct: 69 VGVGIGYRVREYKVTPELCIQVFVTKKLRKDMLTERELVPQDLDG-----IRTDVIE--- 120
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
G + +Y P S + T GT G IV+ + + L+N
Sbjct: 121 TGVIEALTYKSMYR----------PAFPGCSIGHYRITAGTFGCIVQDKK-DHDFLILSN 169
Query: 242 RHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADG 301
HV + + N P L PG Y G +R + + +G N D
Sbjct: 170 NHVLANSNNANIG-----DPILQPGPYDGGTQRNI-IAKLKKFVPLLSGYN-----LVDA 218
Query: 302 AFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMA 361
A A+ ++ V S+ +G V + P++ L +V K GR++ G +++
Sbjct: 219 A---VAKPLDMRYVKASIAKIGMPTGV-----REPLHGL---RVQKTGRTTQYNRGRIIS 267
Query: 362 Y--ALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTA 419
++ G+ + ++ GDSGSL+L G R VG+++ G++
Sbjct: 268 TDATVKVGYGPGVTYLFKNQILTTRMAA---GGDSGSLLL-----GMCKRAVGLLFAGSS 319
>gi|190891805|ref|YP_001978347.1| hypothetical protein RHECIAT_CH0002212 [Rhizobium etli CIAT 652]
gi|190697084|gb|ACE91169.1| hypothetical protein RHECIAT_CH0002212 [Rhizobium etli CIAT 652]
Length = 783
Score = 42.0 bits (97), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 72/160 (45%), Gaps = 22/160 (13%)
Query: 311 NLNNVTTSVKGVGEIG---DVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYN 367
++ + T+++ G+ +I DV+ +L + L+ + V+ VG +SGL G + A Y
Sbjct: 244 DMRDWTSNIYGLPKIKPLFDVYEQNLS--LRRLMDQPVVAVGGASGLLQGKIKAMFYRYR 301
Query: 368 DEKGICFFTDFLVVGENQQTFDLEGDSGSL--ILLTGQNG---EKP------RPVGIIWG 416
G + +DFL+ GDSG+L + + G +G E+P RP+ I WG
Sbjct: 302 SVGGFDYVSDFLIAPIPGGKVPRHGDSGALWHVQMPGPDGKQDERPLAQRDLRPLAIEWG 361
Query: 417 GTANRGRLKLKVGQPPVNWTSGVDLGRLLDLLELDLIATN 456
G ++ L + LL+++L+ N
Sbjct: 362 AQV------FADGGERSTYSVASSLSNICKLLDVELVMEN 395
>gi|86139781|ref|ZP_01058347.1| hypothetical protein MED193_12148 [Roseobacter sp. MED193]
gi|85823410|gb|EAQ43619.1| hypothetical protein MED193_12148 [Roseobacter sp. MED193]
Length = 516
Score = 42.0 bits (97), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 55/122 (45%), Gaps = 13/122 (10%)
Query: 123 GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYY 182
G IGFR RRG TD + + V RK+ L Q LP+ + G +DV+E +Y
Sbjct: 38 GIDIGFRWRRGQRTDEICLRMHVQRKLPIDALLPSQVLPSHVAG-----IALDVIEAAYQ 92
Query: 183 GAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNR 242
+ P + T + G C G S E GT+G +V RT + G L+N
Sbjct: 93 PSLEPGASRQAATPQPYTMGGL--CCGR-----SGEGAGTIGLVVIDRTTGKP-GILSNW 144
Query: 243 HV 244
HV
Sbjct: 145 HV 146
>gi|253680830|ref|ZP_04861633.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562679|gb|EES92125.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 325
Score = 42.0 bits (97), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 131/308 (42%), Gaps = 65/308 (21%)
Query: 126 IGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAP 185
+G++ +G+LT+ I VFV++K+ L P+A D++ Y G
Sbjct: 50 LGYKEIQGILTNEKCIKVFVSQKISSNNL------PSA-----------DLIPPIYNGIK 92
Query: 186 APTPKEELYTELVDGLRGSDPCIGSGSQV--ASQETYGTLGAIVRSRTGNQQVGFLTNRH 243
K ++T GL + +G + A + GTLG IV++ + + L H
Sbjct: 93 TDVVKSGIFTSC--GLTEKIRPVPNGYSIGPAGYKMAGTLGCIVQNPS-ERAYYILGTNH 149
Query: 244 VAVDLDYPNQKMFHPLPPSLGPGVYLGA------VERATSFITDDLWYGIFAGTNPETFV 297
V L K+ P+ L PGV G + T +I + + F T PE ++
Sbjct: 150 VLAQLG--KAKISTPI---LQPGVLDGGSVNTDIIANLTKYI--PIKFKTFFKT-PENYI 201
Query: 298 RADGAFIPFAEDFNLNNVTTSVKGVG-EIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTT 356
A AE N++ V+ V + + D+ I + IG++V KVGR++G TT
Sbjct: 202 DA-----AIAEISNISLVSPKVAIINNKFKDIGIPE--------IGQEVFKVGRTTGYTT 248
Query: 357 GTVMAY----ALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVG 412
G + + ++Y D G F D ++ + GDSGS++ N P+G
Sbjct: 249 GRITSIDATAIIKYPD--GTALFKDQILASTEVKV----GDSGSILATKNLN-----PLG 297
Query: 413 IIWGGTAN 420
++ + N
Sbjct: 298 MLSSASEN 305
>gi|331269225|ref|YP_004395717.1| hypothetical protein CbC4_1040 [Clostridium botulinum BKT015925]
gi|329125775|gb|AEB75720.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length = 314
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 71/294 (24%), Positives = 119/294 (40%), Gaps = 60/294 (20%)
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
++ +G +G++I T I VFV+ KV + L +PA +G + DVV
Sbjct: 32 KKNVVGVGVGYKIINNFYTSKKCITVFVSEKVDQNNLPLKDLIPAVYKG-----IETDVV 86
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVG 237
+ Y+ + T K +R G + AS T G+ G +V G ++
Sbjct: 87 QSGYFVGASLTQK----------IRPVQGGYSVGPESASNIT-GSQGCVVTD--GTRRYM 133
Query: 238 FLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVER-ATSFITDDLWYGIFAGTNPETF 296
N +A + P L PSLG G G + A +++T
Sbjct: 134 LSCNHIIAHENMLPRNTQI--LQPSLGDG---GKTTKDAVAYLTK--------------- 173
Query: 297 VRADGAFIPFAEDFNL----NNVTTSVKGVGEIG----DVHII-DLQSPINSLIGRQVMK 347
+IP + L N+V ++ E G ++II DL+ +GR+V+K
Sbjct: 174 ------YIPLKKKTTLNSPENDVDCAIAREYEPGILSSKIYIIGDLKGVSAPNLGRKVVK 227
Query: 348 VGRSSGLTTG--TVMAYALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLIL 399
GR++ T G T + ++ E GI F ++ Q EGDSG++++
Sbjct: 228 SGRTTAYTEGSITTIGATVQVKLELGIYIFKHQIITTSMGQ----EGDSGAVLV 277
>gi|401662288|emb|CCG27838.1| putative serine protease [Aeropyrum spring-shaped virus]
Length = 326
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 62/145 (42%), Gaps = 16/145 (11%)
Query: 129 RIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYYGAPAPT 188
RIRRG + D P I V+V +K+ R L +P +EG DVVE A A
Sbjct: 34 RIRRGRVVDEPVIRVYVKKKLPRNLLRPQDLVPEEVEG-----IRTDVVEIGEVEAWALL 88
Query: 189 PKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDL 248
+ L G P I S Q T GTLG V++ N ++ F +N HV
Sbjct: 89 QPRAAASPLYTGR--YRPVIAGVSIGHYQITAGTLGWYVKA--PNAEILFASNAHVFT-- 142
Query: 249 DYPN---QKMFHPLPPSLGPGVYLG 270
PN Q+ + P L PG Y G
Sbjct: 143 --PNASGQEGQYEGDPILQPGPYDG 165
>gi|228994928|ref|ZP_04154706.1| hypothetical protein bpmyx0001_55800 [Bacillus pseudomycoides DSM
12442]
gi|228764830|gb|EEM13606.1| hypothetical protein bpmyx0001_55800 [Bacillus pseudomycoides DSM
12442]
Length = 329
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 136/326 (41%), Gaps = 47/326 (14%)
Query: 105 ELMTIRAFHSKIL--RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPA 162
+L+ I+ + +L + +G +GF+ G TD AI FV +K + + +P
Sbjct: 7 KLLDIKEANENVLLNKPNVIGVDVGFKYVEGKRTDEIAIRTFVTKK---ENVGPEHEIPR 63
Query: 163 ALEGPGGVWCDVDVVEFSYYGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGT 222
+EG + VE P P E T D L G +G + GT
Sbjct: 64 TIEGVKTDVIEEKKVELQVLKIPVGAPVLENETGKFDPLVGG-ISVGPCRAINGFIFVGT 122
Query: 223 LGAIVRSRTGNQQVGFLTNRHV-AVDLDYPN-QKMFHPLPPSLG--PGVYLGAVERATSF 278
LGAIV+ + + L+N HV VD ++ + +M P G G +GA++
Sbjct: 123 LGAIVQKE--DNKFYALSNFHVMGVDNNWKSGDEMTQPGRVDGGQCSGDIIGALDSVC-- 178
Query: 279 ITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPIN 338
L I + P D A ++ + + EI ++I ++ ++
Sbjct: 179 ----LGDKINSQNKP-----VDAAI----------SIIKNRRTSPEI--LNIGKVKGKVS 217
Query: 339 SLIGRQVMKVGRSSGLTTGTVMAY----ALEYNDEKGICFFTDFLVVGENQQ---TFDLE 391
IG V K GR++GLT GT+ +++Y G+ + + + + F
Sbjct: 218 PTIGASVRKQGRTTGLTHGTITGLGRTSSIDYGSGIGVVTLKNQITIEPDTTKNPKFSDH 277
Query: 392 GDSGSLILLTGQNGEKPRPVGIIWGG 417
GDSGS+I+ E+ R +G+++GG
Sbjct: 278 GDSGSVIV-----DEQNRVIGLLFGG 298
>gi|416347989|ref|ZP_11680104.1| hypothetical protein CBCST_00400 [Clostridium botulinum C str.
Stockholm]
gi|338197134|gb|EGO89308.1| hypothetical protein CBCST_00400 [Clostridium botulinum C str.
Stockholm]
Length = 306
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 125/325 (38%), Gaps = 92/325 (28%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDV---DVVE 178
+G +G++I+ G T + VFV K LP +CD+ D+V
Sbjct: 29 VGVGLGYKIKNGFNTFQKCLSVFVTNK-----------LP---------FCDIPSNDMVP 68
Query: 179 FSYYGAPAPTPKEELY--TELVDGLR----GSDPCIGSGSQVASQETYGTLGAIVRSRTG 232
YYG P + +L +R G D IG V GTLG IV T
Sbjct: 69 SYYYGIPTDVINTGAFHLQKLTQKIRPVPGGYD--IGPALIVEG----GTLGCIV---TD 119
Query: 233 NQQVGFLTNRHV-----AVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGI 287
+ LT H V + YP + PS +
Sbjct: 120 GKYYHILTCNHSLTAKEVVTVTYPITQ------PSC-----------------------V 150
Query: 288 FAGTNPETFVRADGAFIPF----AEDFNLNNVTTSVKGVGEIGDVHI-IDLQSPINSL-- 340
+ G PE + +IP + N+N V ++ + + + I+ I +
Sbjct: 151 YGGNYPEDIIARISKYIPINNSTTTNENINYVDCAIAKINKRSQISTKINFLGRIKGITK 210
Query: 341 --IGRQVMKVGRSSGLTTGTVMAY--ALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGS 396
+G V KVG ++ LT GTV + LE+N+ +G F D ++ + + +GDSG+
Sbjct: 211 ASLGLNVQKVGANTELTEGTVTSVGATLEFNEPRGKSIFVDQIITNKMSE----KGDSGA 266
Query: 397 LILLTGQNGEKPRPVGIIWGGTANR 421
+++ + + VG++ GG + +
Sbjct: 267 ILV-----DKNIQAVGLLMGGGSTK 286
>gi|422630026|ref|ZP_16695226.1| hypothetical protein PSYPI_09900 [Pseudomonas syringae pv. pisi
str. 1704B]
gi|330939286|gb|EGH42683.1| hypothetical protein PSYPI_09900 [Pseudomonas syringae pv. pisi
str. 1704B]
Length = 339
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 77/299 (25%), Positives = 121/299 (40%), Gaps = 54/299 (18%)
Query: 141 ILVFVARKVHRQWLSHVQCLPA-----ALEGPGGVWCDVDVVEFSYYGAPAPTPKEELYT 195
I ++ RKV ++ L Q LP+ + P G+ V G A P+ +
Sbjct: 39 ISIYTKRKVIKKDL---QVLPSNIWRQGIAYPQGLMDSV--------GKEATKPQGATFA 87
Query: 196 -ELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDY--PN 252
+ G + C GS + + GT+GA+VR G + LTN HV+ + PN
Sbjct: 88 LHQIAGGHATYAC-GSSISPGNDASAGTMGALVRLPDG--LLYGLTNNHVSALCSHVAPN 144
Query: 253 QKMFHPLPPSLGPGVY----LGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAE 308
+ P +GP LG RA L N + D A A+
Sbjct: 145 TPILAPGVLDVGPNAIAPFTLGFHSRALEMRVGSLG-------NVDFSNNLDAAVFRIAD 197
Query: 309 DFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMA------- 361
+ N+ +S++G + ++D P+ G +V KVGR++ T G +++
Sbjct: 198 EANV----SSMQGGAYDTPLVVLD---PVE---GMRVQKVGRTTRHTQGQIVSRELRPLN 247
Query: 362 ---YALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGG 417
+A Y I F F + G+N + F GDSGSLI+ G VG+I+ G
Sbjct: 248 VSYHAQSYGFNGMIWFGNVFAIHGDNAE-FSKGGDSGSLIVAVDDAGLVLGAVGLIFAG 305
>gi|253771267|ref|YP_003034112.1| hypothetical protein CLG_A0018 [Clostridium botulinum D str. 1873]
gi|253721419|gb|ACT33711.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 308
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 70/309 (22%), Positives = 123/309 (39%), Gaps = 59/309 (19%)
Query: 118 RRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVV 177
+R +G +G++++ G T+ + VFV+RK ++ +P+ +G DV
Sbjct: 33 KRNVVGLGLGYKVKNGFYTNQLCVQVFVSRKYSENEINIKDKIPSMYKGI-----LTDVK 87
Query: 178 EFSYYGAPAPTPKEELYTELVDGLRGSDPCIG--SGSQVASQETYGTLGAIVRSRTGNQQ 235
E Y+ A + K P +G S S E YGT G +V + N+
Sbjct: 88 ETGYFKACSLNKKIR-------------PVLGGYSISVYKGNEIYGTAGCVVTNGV-NKF 133
Query: 236 VGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPET 295
V L+ HV ++ K++ P VY G + + + +F G P
Sbjct: 134 V--LSTNHVLTKIN----KLYMHFPIIQPACVYGGTYSDTIATLHRYIPLHLFNGGEPPI 187
Query: 296 FVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLT 355
A I E + IG V + +SP +G V KVG S LT
Sbjct: 188 LGLLTNANIMNPE-------------IAFIGKVTCV--KSP---KLGIPVRKVGAMSELT 229
Query: 356 TGTVMA----YALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQNGEKPRPV 411
G + + + + Y + + + FF D ++ ++GDSGS+++ + +
Sbjct: 230 EGIITSINANHTVTYTNGE-VAFFKDQILTSN----MAVKGDSGSILI-----DKNNCAI 279
Query: 412 GIIWGGTAN 420
G+++ T N
Sbjct: 280 GLLFATTNN 288
>gi|448319038|ref|ZP_21508546.1| hypothetical protein C492_21210 [Natronococcus jeotgali DSM 18795]
gi|445597027|gb|ELY51106.1| hypothetical protein C492_21210 [Natronococcus jeotgali DSM 18795]
Length = 443
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 13/86 (15%)
Query: 340 LIGRQVMKVGRSSGLTTGTVMA----YALEYNDEKGICFFTDFLVVGENQQTFDLEGDSG 395
L G V K GR++G+T+ TV A A+E+ E+G D L+ G + GDSG
Sbjct: 224 LRGETVTKTGRTTGVTSATVEATSASVAVEFGAERGTVTLRDQLIAGYLSEG----GDSG 279
Query: 396 SLILLTGQNGEKPRPVGIIWGGTANR 421
S + L ++GE VG+++ G+A +
Sbjct: 280 SPVFL--EDGEL---VGLLFAGSAQQ 300
>gi|448637439|ref|ZP_21675677.1| hypothetical protein C436_02871 [Haloarcula sinaiiensis ATCC 33800]
gi|445764286|gb|EMA15441.1| hypothetical protein C436_02871 [Haloarcula sinaiiensis ATCC 33800]
Length = 429
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 76/303 (25%), Positives = 115/303 (37%), Gaps = 43/303 (14%)
Query: 123 GTAIGFRIRRGVLTD-IPAILVFVARKVHRQWLSHVQCLPAALEGPGGVW-------CDV 174
GT IG + R G + + +++VFV RKV L + +P +E G + ++
Sbjct: 24 GTGIGPKQRAGEMDEEAESVIVFVERKVAEADLDDNEVIPEEIEIDGKTYKTDVQESGEI 83
Query: 175 DVVEFSYYGAPAPTPKE--------ELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAI 226
+E AP E E+ L R P S T GTLG
Sbjct: 84 KALELELTAPEAPMELEGRDRAEIKEIPASLSRTRRWR-PAPAGVSVGHPDITAGTLGTQ 142
Query: 227 VRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYG 286
RT ++++ FLTN HVA D N+ L PG Y G I L +
Sbjct: 143 PL-RTQDEKLVFLTNSHVAADSGRANRGDM-----VLQPGPYDGGTA-PDDEIGSLLGFN 195
Query: 287 IFAGTNPETFV--RADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQ 344
+ F R D A + D ++ T + + E DL+ ++ +G
Sbjct: 196 VIDADTSSPFPKNRTDSAIVEVTPD----HLQTDIWELHE-------DLRGFTDAEVGAI 244
Query: 345 VMKVGRSSGLTTGTVMAYALEYNDE--KGICFFTDFLVVGENQQTFDLEGDSGSLILLTG 402
K GR++G+T A +N G+ D V + GDSGSLI +
Sbjct: 245 HTKSGRTTGVTQAKCTARHANFNVRYSHGVAKMVDCDVFNAMAKG----GDSGSLIGMER 300
Query: 403 QNG 405
++G
Sbjct: 301 EDG 303
>gi|343500347|ref|ZP_08738242.1| hypothetical protein VITU9109_14061 [Vibrio tubiashii ATCC 19109]
gi|418477654|ref|ZP_13046779.1| hypothetical protein VT1337_04732 [Vibrio tubiashii NCIMB 1337 =
ATCC 19106]
gi|342820593|gb|EGU55413.1| hypothetical protein VITU9109_14061 [Vibrio tubiashii ATCC 19109]
gi|384574609|gb|EIF05071.1| hypothetical protein VT1337_04732 [Vibrio tubiashii NCIMB 1337 =
ATCC 19106]
Length = 445
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 90/221 (40%), Gaps = 55/221 (24%)
Query: 219 TYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPN--QKMFHPLPPSLGPGVYLGAVERAT 276
T GT+GA V + T V L+N HV + + N + M P P + G E+
Sbjct: 153 TAGTIGARVTNGT---NVFALSNNHVFANSNDTNVPENMLQPGP-------FDGGTEQND 202
Query: 277 SFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIID---- 332
+F + D I F N+ + ++ GE+ D
Sbjct: 203 TFAS-----------------LTDYEPILFDGSANIMDAAVALTSTGELTTSTPADGYGT 245
Query: 333 LQSPIN-SLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICF-----------FTDFLV 380
S +N ++IG V K GR++G T GTV A N +C+ F +V
Sbjct: 246 PDSTVNEAVIGMSVKKYGRTTGFTQGTVDAINASVN----VCYEGSSTCTKLALFVGQIV 301
Query: 381 VGENQQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANR 421
V TF GDSGSLI+ + N PVG+++ G+++
Sbjct: 302 V--TPGTFSAGGDSGSLIVSSNGN----NPVGLLFAGSSSH 336
>gi|302342875|ref|YP_003807404.1| glucose inhibited division protein A [Desulfarculus baarsii DSM
2075]
gi|301639488|gb|ADK84810.1| glucose inhibited division protein A [Desulfarculus baarsii DSM
2075]
Length = 630
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 2/82 (2%)
Query: 225 AIVRSRTGNQQVGFLTNRHVAVDLDYPNQKMFHP-LPPSLGPGVYLGAVERATSFITDDL 283
A+V S G + F+ A++ DY + + P L + PG+YL TS +
Sbjct: 324 AMVHSLPGCEN-AFIVRPGYAIEYDYADPQDLKPTLESKIAPGLYLAGQINGTSGYEEAA 382
Query: 284 WYGIFAGTNPETFVRADGAFIP 305
G++AG N VR +GAF P
Sbjct: 383 AQGLWAGINAALAVRGEGAFAP 404
>gi|331271154|ref|YP_004385863.1| hypothetical protein CbC4_6070 [Clostridium botulinum BKT015925]
gi|329127649|gb|AEB77591.1| hypothetical protein CbC4_6070 [Clostridium botulinum BKT015925]
Length = 302
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 112/281 (39%), Gaps = 46/281 (16%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G +G++I GV T I VFV K+ + L+ + +P +G D+VE +
Sbjct: 27 IGVGLGYKISNGVNTLTKCIKVFVKNKISKDKLNENEMIPKCYKGI-----PTDIVECGF 81
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
+ +T+ + + G IG G+ + + GT+G +V+ ++ L
Sbjct: 82 ATSCG-------FTKRIRPVYGGYS-IGPGNALLN----GTMGCVVKD---HRYYYILGC 126
Query: 242 RHVAVDLDYPNQKMFHPLPPSLGPGV-YLGAVERATSFITDDLWYGIFAGTNPETFVRAD 300
HV D + P L G + T FI I G+ E +V
Sbjct: 127 NHVLADENIEKIGAAIIQPSKLDSGTPSHDTIAHLTKFIP------IKFGSGEENYVDCA 180
Query: 301 GAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVM 360
A I +D +L VT + +G I + L G V K GR++ T G +
Sbjct: 181 MARI---DDKSL--VTPEIVIIGSIKGTSDVKL--------GESVRKCGRTTEFTIGRIS 227
Query: 361 AY--ALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLIL 399
A L N +KG C F + + +GDSG++++
Sbjct: 228 AINTTLNINFKKGKCLFKNQIA----TSIMSSKGDSGAILV 264
>gi|253682406|ref|ZP_04863203.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
gi|253562118|gb|EES91570.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 317
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 75/316 (23%), Positives = 130/316 (41%), Gaps = 73/316 (23%)
Query: 122 LGTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSY 181
+G G++I+ G T+ I VFV++K+ L+ +P+ +G D+ E
Sbjct: 35 VGIGCGYKIKNGFYTNQLCIQVFVSKKLPLNELNINDLIPSTYKG-----IPTDIKETGG 89
Query: 182 YGAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTN 241
+ A + T K +R + P S S + E GTLG +V+ N+ + L+N
Sbjct: 90 FTACSLTQK----------IRPT-PGGYSISNEYNNEYSGTLGCLVKD---NKDLFLLSN 135
Query: 242 RHVAVDLDYPNQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPE-----TF 296
HV +F+ P LG + +E + F G NP+ T
Sbjct: 136 SHVLA--------IFNQAP--LGTKI----IEPSNEF-----------GGNPKTDTIATL 170
Query: 297 VRADGAFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPI--------NSLIGRQVMKV 348
VR I F E++N+ T G+ +I D ++ + + N + + + KV
Sbjct: 171 VRYIK--IRFIENYNMPFNYTDC-GIAKIIDKSLVSPEIALTGIPKGVSNPKLNQPIKKV 227
Query: 349 GRSSGLTTGTVMA----YALEYNDEKGICFFTDFLVVGENQQTFDLEGDSGSLILLTGQN 404
G S LTTG + + + Y+D K F + + + GDSG+++L N
Sbjct: 228 GAISELTTGVITSIHNTLTVNYHDIKKSAIFKEQIFTSFMAE----HGDSGAILLDQSNN 283
Query: 405 GEKPRPVGIIWGGTAN 420
+G++ G+ N
Sbjct: 284 -----VIGLLMSGSKN 294
>gi|134096198|ref|YP_001101273.1| hypothetical protein HEAR3043 [Herminiimonas arsenicoxydans]
gi|133740101|emb|CAL63152.1| Conserved hypothetical protein [Herminiimonas arsenicoxydans]
Length = 359
Score = 39.3 bits (90), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 138/347 (39%), Gaps = 60/347 (17%)
Query: 95 PTGQQATTLLELMTIRAFHSKILRRFSLGTAIGFRIRRGVLTDIPAILVFVARKVHRQWL 154
PT + +L + + K LR TAI F +T VF + V +
Sbjct: 30 PTDEAKDSLFDSAAMSVLAEKTLRSRGGITAIAFNNANNTVT------VFTDKSVPAK-- 81
Query: 155 SHVQCLPAALEGPGGVWCDVDVVEFSYY---GAPAPTPKEELYTELVDGLRGSDPCIGSG 211
+ LP A+ + VE +Y A A P G C GS
Sbjct: 82 -EQKILPQAV---------LQQVEINYMHSGTAQAGVPANSAVPAPFSIHNGRYAC-GSS 130
Query: 212 SQVASQETYGTLGAIVRSRTGNQQVGFLTNRHVAVDLDYPN--QKMFHP-LPPSLGPGV- 267
A GTLG +VR +G+ + LTN HV+ +Y + +K+ P P + G+
Sbjct: 131 IHPAKVLGAGTLGCLVRDPSGD--IFALTNNHVSGMCNYASNGEKIIAPGHPDIIANGID 188
Query: 268 --YLGAVERATSFITDDLWYGIFAGTNPETFVRADGAFIPFAEDFNLNNVTTSVKGVGEI 325
+G R+ + +G+ N + D A + ++ +N+ S++G
Sbjct: 189 PFTIGYHSRSLPMV-----HGL--PDNVDIATNNDAALLKLSD----SNLVCSMQGQSYD 237
Query: 326 GDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMA-----YALEYNDE---KGICFFTD 377
++Q+ G V KVGR++GLT G ++ + + Y+ + FF
Sbjct: 238 TPSLTFEMQA------GFSVQKVGRTTGLTHGQIIGEIIAPHPVSYSVPGFGNHVSFFER 291
Query: 378 FLVVGEN--QQTFDLEGDSGSLILLTGQNGEKPRPVGIIWGGTANRG 422
+ N F GDSGSL+ T NG++ +GI++ G N+G
Sbjct: 292 VFAIHSNDPDTPFSQPGDSGSLV-TTEMNGDR-YAIGIVFAGN-NQG 335
>gi|393726247|ref|ZP_10346174.1| hypothetical protein SPAM2_21549 [Sphingomonas sp. PAMC 26605]
Length = 736
Score = 39.3 bits (90), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 67/147 (45%), Gaps = 10/147 (6%)
Query: 316 TTSVKGV-GEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMAYALEYNDEKGICF 374
T+ V G+ GE+G V ++ + LI +++ G SG G + A + G +
Sbjct: 234 TSRVFGLEGELGAVVDLNEDNLGTQLIDQRMEAFGAVSGHLVGRIKALFYRHKALAGYEY 293
Query: 375 FTDFLVVGENQQTFDLEGDSG---SLILLTGQNGEKP-RPVGIIWGGTANRGRLKLKVGQ 430
++FL+ E+ Q GDSG L+ +G++ +P+ + WGG G
Sbjct: 294 VSEFLIAPEDGQAQTCPGDSGMVWHLVQTDAASGDRTLQPLAVEWGGQGLIGS-----DD 348
Query: 431 PPVNWTSGVDLGRLLDLLELDLIATNE 457
+N++ L LL++DL+ T +
Sbjct: 349 RTLNFSLATGLATACQLLDVDLVRTGD 375
>gi|311281607|ref|YP_003943838.1| hypothetical protein Entcl_4324 [Enterobacter cloacae SCF1]
gi|308750802|gb|ADO50554.1| protein of unknown function DUF638 hemagglutinin/hemolysin
[Enterobacter cloacae SCF1]
Length = 677
Score = 39.3 bits (90), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 8/87 (9%)
Query: 438 GVDLGRLLDLLELDLIATNEGFQAAVQDQRNASAAAIESTVGESPPAEREQ-SKEKTAER 496
G D G++ D LE+ AT G QAA NA AA+E E A +++ S+E
Sbjct: 151 GFDAGKVKDKLEIQKEATALGIQAA-----NAYKAAMEHEAAEKNAALKDEISREHPGAT 205
Query: 497 LEPFNLNIQQD--LVDGESEQGPTPPF 521
E N ++ D +D E E GP F
Sbjct: 206 EEALNAAVKNDSRYIDAEKEYGPGSDF 232
>gi|226313997|ref|YP_002773893.1| hypothetical protein BBR47_44120 [Brevibacillus brevis NBRC 100599]
gi|226096947|dbj|BAH45389.1| hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 367
Score = 38.9 bits (89), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 75/167 (44%), Gaps = 34/167 (20%)
Query: 341 IGRQVMKVGRSSGLTTGTVMAYA----LEYNDEKGI--CFFTDFLVVGENQQTFDLEGDS 394
IGR++ KVGRSSGL GTV + + Y + G+ F + V+ + L GDS
Sbjct: 215 IGRRLKKVGRSSGLAWGTVESIHTDIDVSYGNYGGLGTIRFQNQTVI-RSTVPISLPGDS 273
Query: 395 GSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTS---GVDLGRLLDLLELD 451
GS+ L G + + G+AN GRL + PV W GV + R
Sbjct: 274 GSVWLTAGNYAA-----AVNFAGSAN-GRLSISY---PVVWALQAFGVGIAR-------- 316
Query: 452 LIATNEGFQAAVQDQRNASAAAIESTVGESPPAE--REQSKEKTAER 496
AT ++ V+ +R ++ G PAE R Q+K+ ++R
Sbjct: 317 --ATGRAGRSVVKAKR---VRRTDTRTGPLSPAELNRVQTKKAASKR 358
>gi|253771307|ref|YP_003034126.1| hypothetical protein CLG_A0033 [Clostridium botulinum D str. 1873]
gi|253721459|gb|ACT33751.1| conserved hypothetical protein [Clostridium botulinum D str. 1873]
Length = 313
Score = 38.9 bits (89), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 118/300 (39%), Gaps = 45/300 (15%)
Query: 123 GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYY 182
G +G++I+ G T I+V+V+ K+ + +P +G + ++
Sbjct: 30 GVGLGYKIKNGFYTCQKCIVVYVSNKLSSNEIYEQDLIPEIYKGIATDVVQIGIMSIDRD 89
Query: 183 GAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNR 242
+ + + T+ + ++G G V + T+G +V T N L+N
Sbjct: 90 SLCSNFNQNDSLTKKIRPVQG-----GYSISVITINGAATMGCVV---TDNHDNYMLSNN 141
Query: 243 HVAVDLDYPNQKMFHPLPPSL-GPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADG 301
HV DL+ P+ ++ PGV G DD+ G + P +F +
Sbjct: 142 HVLADLNTV------PIGTAVVQPGVLDGGKS------PDDIV-GALSQYTPISFEETNL 188
Query: 302 AFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMA 361
A N NV+ + V V I+ G+ V KVGR++ LTTG +
Sbjct: 189 VDCAIARVLNKRNVSPKIALVNAPKGV--------ISPKFGQSVKKVGRTTALTTGKITG 240
Query: 362 YALEYN-DEKGICFFTDFLVVGENQQTFDLE---GDSGSLILLTGQNGEKPRPVGIIWGG 417
+ + KG D ++ NQ D+ GDSGS++L + +G+I G
Sbjct: 241 VKTTFRFNIKG----QD--IIFRNQILADIMTSPGDSGSILL-----SDNDYAIGLIMTG 289
>gi|416350183|ref|ZP_11680798.1| hypothetical protein CBCST_04706 [Clostridium botulinum C str.
Stockholm]
gi|338196342|gb|EGO88540.1| hypothetical protein CBCST_04706 [Clostridium botulinum C str.
Stockholm]
Length = 313
Score = 38.9 bits (89), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 118/300 (39%), Gaps = 45/300 (15%)
Query: 123 GTAIGFRIRRGVLTDIPAILVFVARKVHRQWLSHVQCLPAALEGPGGVWCDVDVVEFSYY 182
G +G++I+ G T I+V+V+ K+ + +P +G + ++
Sbjct: 30 GVGLGYKIKNGFYTCQKCIVVYVSNKLSSNEIYEQDLIPEIYKGIATDVVQIGIMSIDRD 89
Query: 183 GAPAPTPKEELYTELVDGLRGSDPCIGSGSQVASQETYGTLGAIVRSRTGNQQVGFLTNR 242
+ + + T+ + ++G G V + T+G +V T N L+N
Sbjct: 90 SLCSNFNQNDSLTKKIRPVQG-----GYSISVITINGAATMGCVV---TDNHDNYMLSNN 141
Query: 243 HVAVDLDYPNQKMFHPLPPSL-GPGVYLGAVERATSFITDDLWYGIFAGTNPETFVRADG 301
HV DL+ P+ ++ PGV G DD+ G + P +F +
Sbjct: 142 HVLADLNTV------PIGTAVVQPGVLDGGKS------PDDIV-GALSQYTPISFEETNL 188
Query: 302 AFIPFAEDFNLNNVTTSVKGVGEIGDVHIIDLQSPINSLIGRQVMKVGRSSGLTTGTVMA 361
A N NV+ + V V I+ G+ V KVGR++ LTTG +
Sbjct: 189 VDCAIARVLNKRNVSPKIALVNAPKGV--------ISPKFGQSVKKVGRTTALTTGKITG 240
Query: 362 YALEYN-DEKGICFFTDFLVVGENQQTFDLE---GDSGSLILLTGQNGEKPRPVGIIWGG 417
+ + KG D ++ NQ D+ GDSGS++L + +G+I G
Sbjct: 241 VKTTFRFNIKG----QD--IIFRNQILADIMTSPGDSGSILL-----SDNDYAIGLIMTG 289
>gi|398815593|ref|ZP_10574260.1| hypothetical protein PMI05_02691 [Brevibacillus sp. BC25]
gi|398034383|gb|EJL27653.1| hypothetical protein PMI05_02691 [Brevibacillus sp. BC25]
Length = 367
Score = 38.5 bits (88), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 71/162 (43%), Gaps = 20/162 (12%)
Query: 341 IGRQVMKVGRSSGLTTGTVMAYA----LEYNDEKGI--CFFTDFLVVGENQQTFDLEGDS 394
IGR++ KVGRSSGL GTV + + Y + G+ F + V+ + L GDS
Sbjct: 215 IGRRLKKVGRSSGLAWGTVESIHTDIDVSYGNYGGLGTVRFQNQTVI-RSTVPISLPGDS 273
Query: 395 GSLILLTGQNGEKPRPVGIIWGGTANRGRLKLKVGQPPVNWTS---GVDLGRLLDLLELD 451
GS+ L G + + G+AN GRL + PV W GV + R
Sbjct: 274 GSVWLTAGNYAA-----AVNFAGSAN-GRLSISY---PVVWALQAFGVGVARAAGRTGRS 324
Query: 452 LIATNEGFQAAVQDQRNASAAAIESTVGESPPAEREQSKEKT 493
+A +G + R SA + + ++R+ K+KT
Sbjct: 325 -VAKAKGVRRNNARTRPLSATELSRVQTKKAASKRQPGKKKT 365
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.134 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,189,834,508
Number of Sequences: 23463169
Number of extensions: 468291623
Number of successful extensions: 818220
Number of sequences better than 100.0: 159
Number of HSP's better than 100.0 without gapping: 73
Number of HSP's successfully gapped in prelim test: 86
Number of HSP's that attempted gapping in prelim test: 817921
Number of HSP's gapped (non-prelim): 183
length of query: 604
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 455
effective length of database: 8,863,183,186
effective search space: 4032748349630
effective search space used: 4032748349630
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 80 (35.4 bits)