BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 043883
(348 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 180/339 (53%), Positives = 240/339 (70%), Gaps = 15/339 (4%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+ V L++ G ASQA R+ + ++ E+ E W A+YGR YK+++E +RFEIF++N+
Sbjct: 9 LMFVALLVVGLWASQAWSRSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+E FN +GNR Y L +N+FADLT +EF S+ G+K S S + F Y + + V
Sbjct: 69 IESFNK--LGNRPYKLDINEFADLTNEEFKVSKNGYKRS--SGVGLTEKSSFRYANVTAV 124
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P S++W + GAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 125 PTSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGE 184
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MDDAF++I QN G+T +A Y Y+G + G C++ KA + AA+IT YEDVP N
Sbjct: 185 DQGCEGGLMDDAFEFIKQNGGLTTEANYPYQG-TDGTCNTNKAGNDAAKITGYEDVPANS 243
Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E++LLKAVA+QPVSVAIDAS A QFYSGGVF G C T L+HGVTAVGYGTS++G KYWL
Sbjct: 244 EDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTKYWL 303
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+KNSWG WGEDGY R++RDI+ +G CGIAM S+P +
Sbjct: 304 VKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 181/341 (53%), Positives = 239/341 (70%), Gaps = 17/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + + ASQAT R E S+ E+ E W QYGR YK++ E SKR++IFKDN+
Sbjct: 8 QYICLALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A ++SY L +N+FADLT +EF AS+ FK H S +A T F Y++ +
Sbjct: 68 ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+I QN G+T +A Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QP++VAIDA S QFYS GVF G C T L+HGV+AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 301 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 180/341 (52%), Positives = 236/341 (69%), Gaps = 17/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + + AS A R E S+ E+ E W AQYGR YK++ E SKR++IFKDN+
Sbjct: 8 RYICLALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A N+SY L +N+FADLT +EF AS+ FK H S +A T F Y+
Sbjct: 68 ARIESFNKAM--NKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYEHVX 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+I QN G+T +A Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240
Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QP++VAIDA QFYS GVF G C T L+HGV+AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+QRD+ + +G CGIAM AS+P +
Sbjct: 301 WLVKNSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 185/342 (54%), Positives = 230/342 (67%), Gaps = 19/342 (5%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
+A +F + +L I Q T RT + SI E+ EQW YG+ YK E KR IF +
Sbjct: 12 LALFFCLGLLAI------QVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTE 65
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
NL +E NNA N+ Y L +N+FADLT +EFIAS+ FK SS ++ T F Y++
Sbjct: 66 NLKYIEASNNAG-NNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRT--TTFKYEN 122
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+ VP +V+W +KGAVTPVK QGQC A+AA EGI+ I +LVSLSEQ+LVDC T
Sbjct: 123 TSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDT 182
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N + GC GG MDDAFK+IIQN GI+ +A Y Y+G+ G C + +A AA IT YEDVP
Sbjct: 183 NGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVD-GTCKANEASTSAATITGYEDVP 241
Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N+E +L KAVANQP+SVAIDAS QFY GVF G C T L+HGVTAVGYG S +G K
Sbjct: 242 ANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTK 301
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YWL+KNSWG DWGE+GY R+QR ID +G CGIAM AS+P +
Sbjct: 302 YWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 180/341 (52%), Positives = 238/341 (69%), Gaps = 17/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + + ASQAT R+ E S+ E+ E W QYGR YK++ E SKR++IFKDN+
Sbjct: 8 QYICLALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A ++SY L +N+FADLT +EF AS+ FK H S +A T F Y++ +
Sbjct: 68 ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+I QN G+T +A Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240
Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QP++VAIDAS QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSW WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 301 WLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 353 bits (906), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 180/341 (52%), Positives = 242/341 (70%), Gaps = 16/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+ + V L++ G SQA R+ + ++ E+ E W +YGR YK+++E +RFEIF++N+
Sbjct: 7 RKLMFVALLVVGLWVSQAWSRSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN GNR Y L +N+FADLT +EF AS+ G+K S S+ + + F Y + +
Sbjct: 67 EFIESFNKP--GNRPYKLDINEFADLTNEEFKASRNGYKRS--SNVGLSEKSSFRYGNVT 122
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP S++W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 123 AVPTSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTS 182
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAF++I QN G+T +A Y Y+G + G C++ KA + AA+IT YEDVP
Sbjct: 183 GEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQG-TDGTCNTNKAGNDAAKITGYEDVPA 241
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N E++LLKAVA+QPVSVAIDA SA QFYSGGVF G C T L+HGVTAVGYGTS +G KY
Sbjct: 242 NSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTS-DGTKY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGEDGY R++RDI+ +G CGIAM +S+P +
Sbjct: 301 WLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 353 bits (905), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 180/341 (52%), Positives = 237/341 (69%), Gaps = 17/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + + ASQAT R E S+ E+ E W QYGR YK++ E SKR++IFKDN+
Sbjct: 8 QYICLALLFVLAAWASQATARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A ++SY L +N+FADLT +EF AS+ FK H S +A T F Y++ +
Sbjct: 68 ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+I QN G+T +A Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240
Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QP++VAIDAS QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSW WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 301 WLVKNSWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 186/343 (54%), Positives = 231/343 (67%), Gaps = 21/343 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
+A +F + +L I Q T RT + SI E+ EQW YG+ YK E KR IF +
Sbjct: 12 LALFFCLGLLAI------QVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTE 65
Query: 61 NLVAVERFNNAAIGNRS-YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL +E NNA GN+ Y L +N+FADLT +EFIAS+ FK SS ++ T F Y+
Sbjct: 66 NLKYIEASNNA--GNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRT--TTFKYE 121
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
++ VP +V+W +KGAVTPVK QGQC A+AA EGI+ I +LVSLSEQ+LVDC
Sbjct: 122 NTSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCD 181
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
TN + GC GG MDDAFK+IIQN GI+ +A Y Y+G+ G C + +A AA IT YEDV
Sbjct: 182 TNGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVD-GTCKANEASTSAATITGYEDV 240
Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P N+E +L KAVANQP+SVAIDAS QFY GVF G C T L+HGVTAVGYG S +G
Sbjct: 241 PANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGT 300
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KYWL+KNSWG DWGE+GY R+QR ID +G CGIAM AS+P +
Sbjct: 301 KYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 182/343 (53%), Positives = 236/343 (68%), Gaps = 18/343 (5%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K L+ +L+++ ASQ+ R+ E S+ + + W QYGR YK + E KRF+IFK+N+
Sbjct: 8 KLVLMAMLLVT-LWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF--KMSDHSSSLKANGTPFLYKS 120
+E FNN GN+ Y L +N F DLT +EF AS G+ MS H SS + F Y++
Sbjct: 67 EFIESFNNN--GNKPYKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTK--SFRYEN 122
Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+ VPPS++W KGAVT +K QGQC AVAA+EGI + L+SLSEQ+LVDC
Sbjct: 123 VTAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCD 182
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
T+ + GC GG MDDAF++II+N G+T +A Y YEG+ G C++ KA +HAA+IT YE+V
Sbjct: 183 TSGMDQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVD-GSCNTRKAANHAAKITGYENV 241
Query: 233 PPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P DEE+L KAVANQPVSVAIDA SA Q YS G+F G C T L+HGVT VGYGTS++G
Sbjct: 242 PAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDDGT 301
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KYWL+KNSWG WGEDGY R++RDID +G CGIAM S+P +
Sbjct: 302 KYWLVKNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 178/341 (52%), Positives = 236/341 (69%), Gaps = 17/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + + ASQAT R E S+ E+ E W AQYGR YK++ E SKR++IFKDN+
Sbjct: 8 QYICLALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A ++SY L +N+FADLT +EF S+ FK H S +A T F Y++ +
Sbjct: 68 ARIESFNKAM--DKSYKLSINEFADLTNEEFGTSRNRFKA--HICSTEA--TSFKYENVT 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +++W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+I QN G+T +A Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGCNGGLMDDAFKFIKQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240
Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAV +QP++VAIDA QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 301 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 350 bits (897), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 183/342 (53%), Positives = 232/342 (67%), Gaps = 16/342 (4%)
Query: 4 YFLIVVLII--SGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
Y+ I + I G CA Q T R+ S+ E+ EQW +QY + YK+ E +R +IF N
Sbjct: 8 YYSIALTFIFCLGLCAIQVTSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTAN 67
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+ +E FNN A N+ Y L +N+FADLT +EFIAS+ FK H S A T F Y++
Sbjct: 68 VNYIEVFNNDA-NNKLYKLGINQFADLTNEEFIASRNKFK--GHMCSSIAKTTTFKYENV 124
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P +V+W +KGAVTPVK QGQC AVAA EGI + +LVSLSEQ+LVDC T
Sbjct: 125 SAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDT 184
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ GC GG MDDAFK+IIQN G++ +A Y Y+G+ G C++ KA HAA IT YEDVP
Sbjct: 185 KGVDQGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVD-GTCNANKASIHAATITGYEDVP 243
Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N+E++L KAVANQP+SVAIDAS QFY GVF+G C T L+HGVTAVGYG +G K
Sbjct: 244 ANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTK 303
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YWL+KNSWG DWGE+GY R+QR +D +G CGIAM AS+P +
Sbjct: 304 YWLVKNSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 180/341 (52%), Positives = 234/341 (68%), Gaps = 17/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + ASQAT R E S+ E+ E W AQYGR YK++ E SKR++IFKDN+
Sbjct: 8 QYICLALLFFLAAWASQATARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A ++SY L +N+FADLT +EF AS+ FK H S +A T F Y+ +
Sbjct: 68 ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYEHVA 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+I QN G+ +A Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGCNGGLMDDAFKFIEQNHGLATEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240
Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QP++VAIDA QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE GY R+QRD+ +G CGIAM AS+P +
Sbjct: 301 WLVKNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 179/312 (57%), Positives = 224/312 (71%), Gaps = 18/312 (5%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E+ E W AQYGR YK E +R IFK+N+ +E FN +G + Y L +N+FADLT +
Sbjct: 2 ERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNK--VGKKPYKLSVNEFADLTNE 59
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------ 144
EF AS+ G+KMS H SS ++ PF Y++ S VP +++W +KGAVTP+K QGQC
Sbjct: 60 EFQASRNGYKMSAHLSS--SSTKPFRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAF 117
Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
AVAA EGI + +L+SLSEQ+LVDC T+ + GC GG MDDAF +IIQNKG+T +A
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFY 261
Y Y+G + G C+S KA AA+IT YEDVP N E +LLKAVANQPVSVAIDA SA QFY
Sbjct: 178 YPYQG-ADGACNSGKA---AAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFY 233
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
S GVF G C T L+HGVTAVGYG S++G KYWL+KNSWG WGE+GY R++RDID +G
Sbjct: 234 SSGVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGL 293
Query: 322 CGIAMFASFPVS 333
CGIAM AS+P +
Sbjct: 294 CGIAMEASYPTA 305
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 179/341 (52%), Positives = 231/341 (67%), Gaps = 15/341 (4%)
Query: 4 YFLIVVLIIS-GSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
Y + + L+ G A Q T RT +GS+ E+ E+W YG+ YK+ E KRF+IF +N+
Sbjct: 8 YHISLALVFCLGLWAIQVTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTENM 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FNN N SY L +N+FADLT +EF+AS+ FK SS ++ T F Y++ S
Sbjct: 68 KYIEAFNNGD-NNESYKLGINQFADLTNEEFVASRNKFKGHMCSSIIRT--TTFKYENVS 124
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P +V+W +KGAVTPVK QGQC AVAA EGI+ + +LVSLSEQ+LVDC T
Sbjct: 125 AIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTK 184
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+IIQN G+ +A Y Y+G+ G C++ KA A IT YEDVP
Sbjct: 185 GVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVD-GTCNANKASIQATTITGYEDVPA 243
Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVANQP+SVAIDAS QFY GVF G C T L+HGVTAVGYG S +G KY
Sbjct: 244 NNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKY 303
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG DWGE+GY +QR ++ +G CGIAM AS+P +
Sbjct: 304 WLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 344
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 180/337 (53%), Positives = 230/337 (68%), Gaps = 14/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L G A Q T RT + S+ E+ QW +QYG+ YK+ E RF+IFK+N+ +E
Sbjct: 12 LALLFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNA +SY L +N+FADLT +EFIAS+ FK SS ++ T F Y++ S +P
Sbjct: 72 TFNNAD-DTKSYKLGINQFADLTNEEFIASRNKFKGHMCSSIMRT--TSFKYENVSGIPS 128
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTPVK QGQC AVAA EGI+ + +L+SLSEQ+LVDC T +
Sbjct: 129 TVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQ 188
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK+IIQN G++ +A Y YEG+ G C++ KA A IT YEDVP N E+
Sbjct: 189 GCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVD-GTCNANKASVQAVTITGYEDVPANSEQ 247
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQP+SVAIDAS QFY GVF G C T L+HGVTAVGYG S +G KYWL+K
Sbjct: 248 ALQKAVANQPISVAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGTKYWLVK 307
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG DWGE+GY +QR I+ +G CGIAM AS+P +
Sbjct: 308 NSWGTDWGEEGYIMMQRGIEAAEGICGIAMQASYPTA 344
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 175/337 (51%), Positives = 231/337 (68%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L SG A Q T RT + S+ E+ E+W +Y + YK+ E +RF+IFK+N+ +E
Sbjct: 12 LALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA N+ YTL +N+FADLT +EFIA + FK H S T F Y++ + +P
Sbjct: 72 AFNNAA--NKPYTLGINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTAIPS 127
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ++VDC T +
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQ 187
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GGFMD AFK+IIQN G+ N+ Y Y+ + G C++ A +H A IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVD-GKCNAKAAANHVATITGYEDVPVNNEK 246
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY R+QR + +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 178/341 (52%), Positives = 231/341 (67%), Gaps = 15/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+F+I LI+ G+ A QAT RT E S+ E+ EQW QYGR YK+ AE S RF+IF DN+
Sbjct: 26 KHFMIAALILLGAWACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNV 85
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN G +SY L +N+FAD T +EF AS+ G+KM+ SS + T F Y++ +
Sbjct: 86 KFIEEFNKD--GRQSYKLAVNEFADQTNEEFQASRNGYKMA--VSSRPSQTTLFRYENVT 141
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP S++W +KGAVTPVK QGQC +AA EGI +K +L+SLSEQ+LVDC
Sbjct: 142 AVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKT 201
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG+M+D F++I++NKGI +A Y Y + G C+S + AA+I+ YE VP
Sbjct: 202 GEDQGCEGGYMEDGFEFIVKNKGIALEASYPYTA-ADGTCNSKEEASRAAKISGYEKVPA 260
Query: 235 NDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N E +LLKAVANQPVSV+IDAS A QFYS GVF G C T L+HGVTAVGYG + +G KY
Sbjct: 261 NSETALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKY 320
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WG+ GY +QR + G CGIAM AS+P +
Sbjct: 321 WLVKNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 177/327 (54%), Positives = 222/327 (67%), Gaps = 14/327 (4%)
Query: 17 ASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
A Q T RT D+ +I EK EQW YG+ YK+ E R +IFK+N+ +E NNA N
Sbjct: 23 AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAG-NN 81
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
+ Y L +N+FADLT +EFIAS+ FK H S + F Y+++ VP +V+W +KGAV
Sbjct: 82 KLYKLGINQFADLTNEEFIASRNKFK--GHMCSSITKTSTFKYENASVPSTVDWRKKGAV 139
Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
TPVK QGQC AVAA EGI+ + +LVSLSEQ+LVDC T + GC GG MDDA
Sbjct: 140 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDA 199
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
FK+IIQN G+ +A Y Y+G+ G C + KA HA IT YEDVP N+E++L KAVANQP
Sbjct: 200 FKFIIQNHGLNTEAQYPYQGVD-GTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258
Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
+SVAIDAS QFY GVF G C T L+HGVTAVGYG +G KYWL+KNSWG DWGE+
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEE 318
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
GY ++QR +D +G CGIAM AS+P +
Sbjct: 319 GYIKMQRGVDAAEGLCGIAMEASYPTA 345
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 175/337 (51%), Positives = 230/337 (68%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L+ A Q T R+ + S+ E+ EQW +YG+ YK+ E KRF IFK+N+ +E
Sbjct: 559 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 618
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA N+ Y L +N+FADLT +EFIA + FK H S T F Y++ + VP
Sbjct: 619 AFNNAA--NKRYKLAINQFADLTNEEFIAPRNRFK--GHMCSSIIRTTTFKYENVTAVPS 674
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ+LVDC T +
Sbjct: 675 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQ 734
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK++IQN G+ +A Y Y+G+ G C++ +A + IT YEDVP N+E+
Sbjct: 735 GCEGGLMDDAFKFVIQNHGLNTEANYPYKGVD-GKCNANEAANDVVTITGYEDVPANNEK 793
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 794 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVK 853
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY R+QR +D +G CGIAM AS+P +
Sbjct: 854 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 890
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 230/337 (68%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L SG Q T RT + S+ E+ E+W +Y + YK+ E +RF+IFK+N+ +E
Sbjct: 12 LALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA N+ YTL +N+FADLT +EFIA + FK H S T F Y++ + +P
Sbjct: 72 AFNNAA--NKPYTLGINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTAIPS 127
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ++VDC T +
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQ 187
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GGFMD AFK+IIQN G+ N+ Y Y+ + G C++ A +H A IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVD-GKCNAKAAANHVATITGYEDVPVNNEK 246
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY R+QR + +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 178/337 (52%), Positives = 224/337 (66%), Gaps = 14/337 (4%)
Query: 7 IVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
+ + G A Q T RT D+ I EK EQW YG+ YK+ E R +IFK+N+ +
Sbjct: 13 LALFFCLGLFAIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYI 72
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
E NNA N+ Y L +N+FADLT +EFIAS+ FK H S + F Y+++ VP
Sbjct: 73 EASNNAG-NNKLYKLGINQFADLTNEEFIASRNKFK--GHMCSSITKTSTFKYENASVPS 129
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTPVK QGQC AVAA EGI+ + +LVSLSEQ+LVDC T +
Sbjct: 130 TVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQ 189
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK+IIQN G+ +A Y Y+G+ G C + KA HA IT YEDVP N+E+
Sbjct: 190 GCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVD-GTCSANKASIHAVTITGYEDVPANNEQ 248
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQP+SVAIDAS QFY GVF G C T L+HGVTAVGYG +G KYWL+K
Sbjct: 249 ALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVK 308
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG DWGE+GY ++QR +D +G CGIAM AS+P +
Sbjct: 309 NSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEASYPTA 345
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 174/338 (51%), Positives = 229/338 (67%), Gaps = 16/338 (4%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F +V+ + G A Q + RT + S+ E+ EQW A+YGR YK+ E KRF IFK+N+
Sbjct: 12 FALVLCL--GLWAFQVSSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNY 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
+E NNA G++ Y L +N+FADLT +EFIA++ FK H SS T F Y++ P
Sbjct: 70 IEASNNA--GDKPYKLGVNQFADLTNEEFIATRNKFK--GHMSSSITRTTTFKYENVTAP 125
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
+V+W ++GAVTPVK QG C AVAA EGI+ + LVSLSEQ+LVDC T+ +
Sbjct: 126 STVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGAD 185
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MDDAFK+IIQN G+ +A Y Y+G+ G C++ + H A IT YEDVP N+E
Sbjct: 186 QGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVD-GTCNTNEEATHVATITGYEDVPSNNE 244
Query: 238 ESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
++L +AVANQP+S+AIDAS F Y GVF G C T L+HGV VGYG S++G KYWL+
Sbjct: 245 QALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KNSWG DWGE+GY R+QRD+D P+G CG+AM S+P +
Sbjct: 305 KNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 176/340 (51%), Positives = 234/340 (68%), Gaps = 15/340 (4%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ + +L+ A Q T R+ + S+ E+ EQW +YG+ YK+ E KRF IFK+N+
Sbjct: 9 HISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVN 68
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E FNNAA N+ Y L +N+FADLT +EFIA + FK SS ++ T F Y++ +
Sbjct: 69 YIEAFNNAA--NKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRT--TTFKYENVTA 124
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP +V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ+LVDC T
Sbjct: 125 VPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKG 184
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
+ GC GG MDDAFK++IQN G+ +A Y Y+G+ G C+ +A + AA IT YEDVP N
Sbjct: 185 VDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVD-GKCNVNEAANDAATITGYEDVPAN 243
Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
+E++L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YW
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYW 303
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
L+KNSWG +WGE+GY R+QR ++ +G CGIAM AS+P +
Sbjct: 304 LVKNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYPTA 343
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 178/341 (52%), Positives = 237/341 (69%), Gaps = 18/341 (5%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
++ + +L I G+ S++T RT + + E+ EQW QYGR YK+ E + R+ IFK+N+
Sbjct: 8 QFVCLALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKENV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
++ FN+ +SY L +N+FADLT +EF AS+ FK H S +A PF Y++ S
Sbjct: 68 ARIDAFNSQT--GKSYKLGVNQFADLTNEEFKASRNRFK--GHMCSPQAG--PFRYENVS 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W ++GAVTPVK QGQC AVAA+EGIN + +L+SLSEQ++VDC T
Sbjct: 122 AVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTK 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+I QNKG+T +A Y Y+G + G C++ KA HAA+IT +EDVP
Sbjct: 182 GEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKG-TDGTCNTNKAAIHAAKITGFEDVPA 240
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N E +L+KAVA QPVSVAIDA S QFYS G+F G C+T L+HGVTAVGYG S +G KY
Sbjct: 241 NSEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKY 299
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+Q+DI +G CGIAM AS+P +
Sbjct: 300 WLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 175/337 (51%), Positives = 232/337 (68%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L+ A Q T R+ + S+ E+ EQW +YG+ YK+ E KRF IFK+N+ +E
Sbjct: 30 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 89
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA N+ Y L +N+FADLT +EFIA + FK SS ++ T F Y++ + VP
Sbjct: 90 AFNNAA--NKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRT--TTFKYENVTAVPS 145
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ+LVDC T +
Sbjct: 146 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQ 205
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK++IQN G+ +A Y Y+G+ G C++ +A + IT YEDVP N+E+
Sbjct: 206 GCEGGLMDDAFKFVIQNHGLNTEANYPYKGVD-GKCNANEAANDVVTITGYEDVPANNEK 264
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 265 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVK 324
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY R+QR +D +G CGIAM AS+P +
Sbjct: 325 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 170/335 (50%), Positives = 224/335 (66%), Gaps = 16/335 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ + +I CA +A RT ++ + E+ EQW A +G+ YK S E ++++IF +N+ +E
Sbjct: 11 LALFLIFAFCAFEANARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMENVQRIE 70
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNA G + Y L +N FADLT +EF A + H S + T F Y++ + VP
Sbjct: 71 AFNNA--GXKPYKLGINHFADLTNEEFKAIN---RFKGHVCSKRTRTTTFRYENVTAVPA 125
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
S++W +KGAVTP+K QGQC AVAA EGI ++ +L+SLSEQ+LVDC T +
Sbjct: 126 SLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTKGVDQ 185
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK+I+QNKG+ +A+Y YEG G C++ +HA I YEDVP N E
Sbjct: 186 GCEGGLMDDAFKFILQNKGLATEAIYPYEGFD-GTCNAKADGNHAGSIKGYEDVPANSES 244
Query: 239 SLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+LLKAVANQPVSVAI+AS QFYSGGVF G C T L+HGVT+VGYG ++G KYWL+K
Sbjct: 245 ALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYWLVK 304
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
NSWG WGE GY R+QRD+ +G CGIAM AS+P
Sbjct: 305 NSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYP 339
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 176/340 (51%), Positives = 234/340 (68%), Gaps = 18/340 (5%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
+++ + +L + G+ S++ RT + S+ E+ EQW AQYGR YK+ AE R+ IFK+N
Sbjct: 7 SQFICLALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKETRYNIFKEN 66
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+ ++ FN+ +SY L +N+FADL+ +EF AS+ FK H S +A PF Y++
Sbjct: 67 VARIDAFNSQT--GKSYKLGVNQFADLSNEEFKASRNRFK--GHMCSPQAG--PFRYENV 120
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S VP +++W +KGAVTPVK QGQC AVAA+EGIN + +L+SLSEQ++VDC T
Sbjct: 121 SAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSEQEVVDCDT 180
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ GC GG MDDAFK+I QNKG+T +A Y Y G + G C++ K HAA+IT +EDVP
Sbjct: 181 KGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTG-TDGTCNTQKEATHAAKITGFEDVP 239
Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N E +L+KAVA QPVSVAIDA QFYS G+F G C T L+HGVTAVGYG S +G K
Sbjct: 240 ANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYGIS-DGTK 298
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG WGE+GY R+Q+DI +G CGIAM AS+P
Sbjct: 299 YWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYP 338
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 230/337 (68%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L SG A Q T RT + S+ E+ E+W +Y + YK+ E +RF+IFK+N+ +E
Sbjct: 12 LALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA N+ YTL +N+FADLT +EFIA + FK H S T F Y++ + +P
Sbjct: 72 AFNNAA--NKPYTLGINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTAIPS 127
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ++VDC T +
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQ 187
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GGFMD AFK+IIQN G+ N+ Y Y+ + G C++ A +H A IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVD-GKCNAKAAANHVATITGYEDVPVNNEK 246
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY R+QR + +G GIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYPTA 343
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 178/337 (52%), Positives = 235/337 (69%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L+ A Q T RT + S+ E+ EQW +YG+ YK+ E KRF +FK+N+ +E
Sbjct: 12 LAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA N+SY L +N+FADLT +EFIA + GFK SS ++ T F +++ + P
Sbjct: 72 AFNNAA--NKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRT--TTFKFENVTATPS 127
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ+LVDC T +
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDQ 187
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK+IIQN G+ +A Y Y+G+ G C++ +A +AA IT YEDVP N+E
Sbjct: 188 GCEGGLMDDAFKFIIQNHGLNTEANYPYKGVD-GKCNANEAAKNAATITGYEDVPANNEM 246
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S++G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVK 306
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY R+QR +D +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 173/338 (51%), Positives = 229/338 (67%), Gaps = 16/338 (4%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F +V+ + G A Q + RT + S+ E+ EQW A+YG+ YK+ E KRF IF++N+
Sbjct: 12 FALVLCL--GLWAFQVSSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKY 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
+E NNA GN+ Y L +N+F DLT +EFIA++ FK H SS T F Y++ P
Sbjct: 70 IEASNNA--GNKPYKLGVNQFTDLTNKEFIATRNKFK--GHMSSSITRTTTFKYENVTAP 125
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
+V+W ++GAVTPVK QG C AVAA EGI+ + LVSLSEQ+LVDC T+ +
Sbjct: 126 STVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGAD 185
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MDDAFK+IIQN G+ +A Y Y+G+ G C++ + H A IT YEDVP N+E
Sbjct: 186 QGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVD-GTCNTNEEVTHVATITGYEDVPSNNE 244
Query: 238 ESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
++L +AVANQP+SVAIDAS F Y GVF G C T L+HGV VGYG S++G KYWL+
Sbjct: 245 QALQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KNSWG+DWGE+GY R+QRD++ P+G CGIAM S+P +
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 176/327 (53%), Positives = 219/327 (66%), Gaps = 14/327 (4%)
Query: 17 ASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
A Q T RT D+ I EK EQW YG+ YK+ E R +IFK+N+ +E NNA N
Sbjct: 23 AIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAG-NN 81
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
+ Y L +N+FAD+T +EFIAS+ FK H S + F Y+++ VP +V+W +KGAV
Sbjct: 82 KLYKLGINQFADITNEEFIASRNKFK--GHMCSSITKTSTFKYENASVPSTVDWRKKGAV 139
Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
TPVK QGQC AVAA EGI+ + +LVSLSEQ+LVDC T + GC GG MDDA
Sbjct: 140 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDA 199
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
FK+IIQN G+ +A Y Y+G+ G C + + AA I YEDVP N+E +L KAVANQP
Sbjct: 200 FKFIIQNHGLHTEAQYPYQGVD-GTCSANETSTPAATIAGYEDVPANNENALQKAVANQP 258
Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
+SVAIDAS QFY GVF G C T L+HGVTAVGYG S +G KYWL+KNSWG DWGE+
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEE 318
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
GY R+QR +D QG CGIAM AS+P +
Sbjct: 319 GYIRMQRSVDAAQGLCGIAMMASYPTA 345
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 179/339 (52%), Positives = 231/339 (68%), Gaps = 15/339 (4%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+ + L+I G ASQA RT E S++E+ E W YGRTYK+ AE +RF+IFK+N+
Sbjct: 7 IICITLLIMGVWASQALSRTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEY 66
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+E N+A GNR Y L +N+FAD T +EF AS+ G+ MS S + T F Y++ + V
Sbjct: 67 IESVNSA--GNRRYKLSINEFADQTNEEFKASRNGYNMSSRPRSSEI--TSFRYENVAAV 122
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P S++W +KGAVTP+K QGQC AVAA+EG+ +K L+SLSEQ+LVDC T+
Sbjct: 123 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 182
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MD AF++II N G+T +A Y Y+G+ C+ KA AA+I NYEDVP N
Sbjct: 183 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDA-TCNKKKAASSAAKIKNYEDVPANS 241
Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +LLKAVA PVSVAIDA S QFYS GVF G C T L+HGVTAVGYG +++G KYWL
Sbjct: 242 EAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWL 301
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+KNSWG WGEDGY ++RDI +G CGIAM AS+P +
Sbjct: 302 VKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 178/344 (51%), Positives = 235/344 (68%), Gaps = 18/344 (5%)
Query: 1 MAKYFLIVVLIISGSCASQ-ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+++ F +VV++ G+ ASQ A R+ + S+ E+ E+W A YGR YK+ E KR++IF+
Sbjct: 4 VSQCFCLVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIFE 63
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+N+ +E N A N+ Y L +N+FADLT +EF AS+ FK H S K+ T F Y
Sbjct: 64 ENVALIESSNKDA--NKPYKLSVNQFADLTNEEFKASRNRFK--GHICSTKS--TSFKYG 117
Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
+ S VP +++W KGAVTPVK QGQC AVAA EGI + L+SLSEQ+LVDC
Sbjct: 118 NVSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDC 177
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
T+ + GC GG MD+AF +I N G+ ++A Y Y+G+ G C++ K HAA+I +ED
Sbjct: 178 DTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGVD-GTCNTNKQAIHAAEINGFED 236
Query: 232 VPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N EE+LL AVA+QPVSVAIDA S QFYS GVF G C T L+HGVTAVGYGTS++G
Sbjct: 237 VPANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDDG 296
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KYWL+KNSWG WGE+GY R+QRD+D +G CGIAM AS+P +
Sbjct: 297 TKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 174/340 (51%), Positives = 232/340 (68%), Gaps = 15/340 (4%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
Y + +L+ G A Q T RT + S+ E+ +QW QY + Y + E KRF+IFK+N+
Sbjct: 9 YISLALLMCLGLWAVQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKENVN 68
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E N G R Y L +N+F DLT +EFIA + FK SS ++ N + Y++ +
Sbjct: 69 YIETSNKE--GGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRTN--TYKYENVTT 124
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP +V+W +KGAVTPVK QGQC AVAA EGI+ + +L+SLSEQ+LVDC T
Sbjct: 125 VPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKG 184
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
+ GC GG MDDAFK+IIQN G+ +A Y Y+G+ G C++ +A +AA IT+YEDVP N
Sbjct: 185 VDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVD-GTCNANEASINAATITSYEDVPTN 243
Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
+E++L KAVANQP+SVAIDAS QFY+ GVF G C T L+HGVTAVGYG S++G KYW
Sbjct: 244 NEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDDGTKYW 303
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
L+KNSWG WGE+GY R+QR +D +G CGIAM AS+P++
Sbjct: 304 LVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 228/337 (67%), Gaps = 17/337 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + ++ + A AT RT + +A + EQW AQYGR YK E +KR+ IFK+N+
Sbjct: 8 LLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENVEY 67
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+E FN A G + Y L +N FADLT +EFIAS+ G+ + SS TPF Y++ S V
Sbjct: 68 IESFNKA--GTKPYKLGINAFADLTNKEFIASRNGYILPHECSS----NTPFRYENVSAV 121
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +V+W +KGAVTPVK QGQC AVAA+EGI + L+SLSEQ+LVDC
Sbjct: 122 PTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGI 181
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MDDAF +II NKG+T ++ Y Y+G + G C K+ + AA+I+ YEDVP N
Sbjct: 182 DQGCEGGLMDDAFTFIINNKGLTTESNYPYQG-TDGSCKKSKSSNSAAKISGYEDVPANS 240
Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +L KAVANQPVSVAIDA S QFYS GVF G C T L+HGVTAVGYG +E+G KYWL
Sbjct: 241 ESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWL 300
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+KNSWG WGE GY R+Q+DI+ +G CGIAM +S+P
Sbjct: 301 VKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYP 337
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 336 bits (862), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 178/322 (55%), Positives = 220/322 (68%), Gaps = 17/322 (5%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
AT RT + + + EQW AQYGR YK AE +KRF IFK+N+ +E FN A G + Y
Sbjct: 23 ATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKA--GTKPYK 80
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPV 138
L +N FADLT QEF AS+ G+K+ SS TPF Y++ S VP +V+W KGAVTPV
Sbjct: 81 LGINAFADLTNQEFKASRNGYKLPHDCSS----NTPFRYENVSSVPTTVDWRTKGAVTPV 136
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QGQC AVAA+EGI + L+SLSEQ+LVDC + GC GG MDDAF +
Sbjct: 137 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSF 196
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
II NKG+T ++ Y Y+G + G C K+ + AA+I+ YEDVP N E +L KAVANQPVSV
Sbjct: 197 IINNKGLTTESNYPYQG-TDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSV 255
Query: 252 AIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
AIDA S QFYS GVF G C T L+HGVTAVGYG +E+G KYWL+KNSWG WGE GY
Sbjct: 256 AIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYI 315
Query: 310 RLQRDIDQPQGQCGIAMFASFP 331
R+Q+DI+ +G CGIAM +S+P
Sbjct: 316 RMQKDIEAKEGLCGIAMQSSYP 337
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 336 bits (862), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 177/341 (51%), Positives = 227/341 (66%), Gaps = 16/341 (4%)
Query: 4 YFLIVVLIIS-GSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
Y + + L+ G A Q T RT + S+ E+ QW +QYG+ YK+ E RF+IF +N+
Sbjct: 8 YHISLALVFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
VE N A +SY L +N+FADLT +EF+AS+ FK H S T F Y++ S
Sbjct: 68 NYVEASN--ADDTKSYKLGINQFADLTNEEFVASRNKFK--GHMCSSITRTTTFKYENVS 123
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P +V+W +KGAVTPVK QGQC AVAA EGI+ + +L+SLSEQ+LVDC T
Sbjct: 124 AIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTK 183
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+IIQN G++ +A Y YEG+ G C++ KA A IT YEDVP
Sbjct: 184 GVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVD-GTCNANKASVQAVTITGYEDVPA 242
Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N E++L KAVANQP+SVAIDAS QFY GVF G C T L+HGVTAVGYG S +G KY
Sbjct: 243 NSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG DWGE+GY +QR ++ +G CGIAM AS+P +
Sbjct: 303 WLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 343
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 336 bits (862), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 226/341 (66%), Gaps = 16/341 (4%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
+ +YF + + ++ CA + RT ++ + E+ EQW A +G+ Y S E ++++ FK+
Sbjct: 7 LFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQKYQTFKE 66
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
N+ +E FN+A GN+ Y L +N FADLT +EF A + H S F Y++
Sbjct: 67 NVQRIEAFNHA--GNKPYKLGINHFADLTNEEFKAIN---RFKGHVCSKITRTPTFRYEN 121
Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+ VP +++W ++GAVTP+K QGQC AVAA EGI + +L+SLSEQ+LVDC
Sbjct: 122 MTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCD 181
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
T + GC GG MDDAFK+I+QNKG+ +A+Y YEG+ G C++ +HA I YEDV
Sbjct: 182 TKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGVD-GTCNAKAEGNHATSIKGYEDV 240
Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P N E +LLKAVANQPVSVAI+AS QFYSGGVF G C T L+HGVTAVGYG S++G
Sbjct: 241 PANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVSDDGT 300
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
KYWL+KNSWG WG+ GY R+QRD+ +G CGIAM AS+P
Sbjct: 301 KYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLCGIAMLASYP 341
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 171/337 (50%), Positives = 223/337 (66%), Gaps = 16/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L++ G A +A RT ++ S+ E+ EQW QYG+ Y +S E R IFK+N+ +E
Sbjct: 12 LALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNA GN+ Y L +N+FADLT +EF A + H S F Y+ S VP
Sbjct: 72 AFNNA--GNKPYKLGINQFADLTNEEFKARN---RFKGHMCSNSTRTPTFKYEDVSSVPA 126
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
S++W +KGAVTP+K QGQC AVAA EGI + +L+SLSEQ+LVDC T +
Sbjct: 127 SLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQ 186
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK+I+QNKG+ +A Y Y+G+ C++ AA I +EDVP N E
Sbjct: 187 GCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVD-ATCNANAEAKDAASIKGFEDVPANSES 245
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+LLKAVANQP+SVAIDAS QFYS G+F G C T L+HGVTAVGYG S++G KYWL+K
Sbjct: 246 ALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSDDGTKYWLVK 305
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG+ WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 306 NSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 172/337 (51%), Positives = 224/337 (66%), Gaps = 17/337 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L++ G + +A RT ++ S+ E+ EQW AQYG+ YK+S E R +IFK+N+ +E
Sbjct: 12 LTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNA GN+SY L +N+FADLT +EF A + H S F Y+ + VP
Sbjct: 72 AFNNA--GNKSYKLGINQFADLTNEEFKARN---RFKGHMCSNSTRTPTFKYEHVTSVPA 126
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
S++W +KGAVTP+K QGQC AVAA EGI + +L+SLSEQ+LVDC T +
Sbjct: 127 SLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQ 186
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK+I+QNKG+ +A Y Y+G+ C++ AA I +EDVP N E
Sbjct: 187 GCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVD-ATCNANAEAKDAASIKGFEDVPANSES 245
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+LLKAVANQP+SVAIDAS QFYS GVF G C T L+HGVTAVGYG S+ G KYWL+K
Sbjct: 246 ALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYG-SDGGTKYWLVK 304
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG+ WGE GY R+QRD+ +G CG AM AS+P +
Sbjct: 305 NSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 172/337 (51%), Positives = 225/337 (66%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ + G A Q RT + S+ E+ EQW A+YG+ YK+ E KRF +FK+N+ +E
Sbjct: 12 LALFFCLGFLAFQVASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PP 125
FNNAA N+ Y L +N+FADLT +EFI + F + H+ S T F Y++ V P
Sbjct: 72 AFNNAA--NKPYKLGINQFADLTSEEFIVPRNRF--NGHTRSSNTRTTTFKYENVTVLPD 127
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
S++W +KGAVTP+K QG C A+AA EGI+ I +LVSLSEQ++VDC T ++
Sbjct: 128 SIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDH 187
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG+MD AFK+IIQN GI +A Y Y+G+ G C+ + HAA IT YEDVP N+E+
Sbjct: 188 GCEGGYMDGAFKFIIQNHGINTEASYPYKGVD-GKCNIKEEAVHAATITGYEDVPINNEK 246
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY G+F G C T L+HGVTAVGYG + EG KYWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLVK 306
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY +QR + +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIMMQRGVKAVEGICGIAMMASYPTA 343
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 170/337 (50%), Positives = 230/337 (68%), Gaps = 18/337 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ ++ + G+ SQA RT + S+ EK E+W +++GR Y + E R++IFK+N+ +E
Sbjct: 12 LALIFLLGALVSQAMARTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FN A+ +SY L +N+FADLT +EF S+ FK H S +A PF Y++ + P
Sbjct: 72 SFNKAS--GKSYKLGINQFADLTNEEFKTSRNRFK--GHMCSSQAG--PFRYENLTAAPS 125
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
S++W +KGAVT +K QGQC AVAAVEGI + ++L+SLSEQ+LVDC T +
Sbjct: 126 SMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK+I QN+G+T +A Y YEG S G C++ + +HAA+I +EDVP N+E
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEG-SDGTCNTKQEANHAAKINGFEDVPANNEG 244
Query: 239 SLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L+KAVA QPVSVAIDA QFYS G+F G C T L+HGV AVGYG S G+ YWL+K
Sbjct: 245 ALMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVK 303
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG WGE+GY R+Q+DID +G CGIAM AS+P +
Sbjct: 304 NSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 176/322 (54%), Positives = 219/322 (68%), Gaps = 17/322 (5%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
AT RT + + + EQW AQYGR Y+ E +KRF IFK+N+ +E FN A G + Y
Sbjct: 25 ATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKA--GTKPYK 82
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPV 138
L +N FADLT QEF AS+ G+K+ SS TPF Y++ S VP +V+W KGAVTPV
Sbjct: 83 LGINAFADLTNQEFKASRNGYKLPHDCSS----NTPFRYENVSSVPTTVDWRTKGAVTPV 138
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QGQC AVAA+EGI + L+SLSEQ+LVDC + GC GG MDDAF +
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSF 198
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
II NKG+T ++ Y Y+G + G C K+ + AA+I+ YEDVP N E +L KAVANQPVSV
Sbjct: 199 IINNKGLTTESNYPYQG-TDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSV 257
Query: 252 AIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
AIDA S QFYS GVF G C T L+HGVTAVGYG +E+G KYWL+KNSWG WGE GY
Sbjct: 258 AIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYI 317
Query: 310 RLQRDIDQPQGQCGIAMFASFP 331
R+Q+DI+ +G CGIAM +S+P
Sbjct: 318 RMQKDIEAKEGLCGIAMQSSYP 339
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 172/327 (52%), Positives = 221/327 (67%), Gaps = 15/327 (4%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
A Q T RT + + E+ QW +QYG+ YK+S E KRF+IF +N+ +E FN N+
Sbjct: 22 AIQVTSRTLQD-DMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGD-NNK 79
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAV 135
YTL +N+FADLT EF +S+ FK H S + F Y+ +S +P SV+W +KGAV
Sbjct: 80 LYTLGVNQFADLTNDEFTSSRNKFK--GHMCSSITRTSTFKYENASAIPSSVDWRKKGAV 137
Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
TPVK QGQC AVAA EGI+ + +L+SLSEQ+LVDC T + GC GG MDDA
Sbjct: 138 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDA 197
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
FK+IIQN G+ +A Y Y+G+ G C++ K +A IT YEDVP N+E++L KAVANQP
Sbjct: 198 FKFIIQNHGLNTEANYPYQGVD-GTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQP 256
Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
+SVAIDAS QFY GVF G C T L+HGVTAVGYG S +G KYWL+KNSWG +WGE+
Sbjct: 257 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEE 316
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
GY +QR +D +G CGIAM AS+P +
Sbjct: 317 GYIMMQRGVDAAEGLCGIAMQASYPTA 343
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 177/337 (52%), Positives = 232/337 (68%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L+ + A Q T T + S+ E+ EQW ++G+ YK+ E KRF IF +N+ VE
Sbjct: 108 LAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNYVE 167
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA N+ Y L +N+F DLT QEFIA + FK SS ++ T F Y++ + VP
Sbjct: 168 AFNNAA--NKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIRT--TTFKYENVTTVPS 223
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W + GAVTPVK QGQC AVAA EGI+A+ +L+SLSEQ+LVDC T +
Sbjct: 224 TVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGVDQ 283
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDA+K+IIQN G+ +A Y Y+G+ G C++ +A +HAA IT YEDVP N+E+
Sbjct: 284 GCEGGLMDDAYKFIIQNHGLNTEANYPYKGVD-GKCNANEAANHAATITGYEDVPANNEK 342
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS+ QFY G F G C T L+HGVTAVGYG S+ G KYWL+K
Sbjct: 343 ALQKAVANQPVSVAIDASSSDFQFYKSGAFTGSCGTELDHGVTAVGYGVSDHGTKYWLVK 402
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY R+QR +D +G CGIAM AS+P +
Sbjct: 403 NSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQASYPTA 439
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 171/337 (50%), Positives = 230/337 (68%), Gaps = 18/337 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ ++ G+ ASQA RT + SI EK E+W ++ R Y ++ E R++IFK+N+ +E
Sbjct: 12 LALIFFLGALASQAIARTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FN A+ +SY L +N+FADLT +EF S+ FK H S +A PF Y++ + VP
Sbjct: 72 SFNKAS--EKSYKLGINQFADLTNEEFKTSRNRFK--GHMCSSQAG--PFRYENITAVPS 125
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
S++W ++GAVT +K QGQC AVAAVEGI + ++L+SLSEQ+LVDC T +
Sbjct: 126 SMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MDDAFK+I QN+G+T +A Y YEG S G C++ + +HAA+I +EDVP N+E
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEG-SDGTCNTKQEANHAAKINGFEDVPANNEG 244
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L+KAVA QPVSVAIDA QFYS G+F G C T L+HGV AVGYG S G+ YWL+K
Sbjct: 245 ALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVK 303
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG WGE+GY R+Q+DID +G CGIAM AS+P +
Sbjct: 304 NSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 176/349 (50%), Positives = 229/349 (65%), Gaps = 22/349 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
++ L +VL++S C SQ R E S++E+ EQW +YG+ YK++AE KR IFKDN+
Sbjct: 8 QHILALVLLLS-ICTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A GN+ Y L +N AD T +EF+AS G+K S TPF Y + +
Sbjct: 67 EFIESFN--AAGNKPYKLSINHLADQTNEEFVASHNGYKYKGSHSQ-----TPFKYGNVT 119
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P +V+W + GAVT VK QGQC VAA EGI I L+SLSEQ+LVDC +
Sbjct: 120 DIPTAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV 179
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
D+ GC GG M+D F++II+N GI+++A Y Y + G CD+ K AAQI YE VP
Sbjct: 180 DH--GCDGGLMEDGFEFIIKNGGISSEANYPYTAVD-GTCDASKEASPAAQIKGYETVPA 236
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI-K 291
N EE+L +AVANQPVSV+IDA S QFYS GVF G C T L+HGVT VGYGT+++G +
Sbjct: 237 NSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHE 296
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
YW++KNSWG WGE+GY R+QR ID +G CGIAM AS+P+ K S PS
Sbjct: 297 YWIVKNSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMGKSSDSPS 345
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 169/341 (49%), Positives = 229/341 (67%), Gaps = 19/341 (5%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L I + ASQAT R+ E S+ E+ E W A+YGR YK++ E KRF+IFKDN+
Sbjct: 8 QYVSMALLFILAAWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A +++Y L +N+FADLT +EF + + FK + + + T F Y++ +
Sbjct: 68 ARIESFNKAM--DKTYKLSINEFADLTNEEFRSLRNRFK-----AHICSEATTFKYENVT 120
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +++W +KGAVTP+K Q QC AVAA EGI I +L+SLSEQ+LVDC T
Sbjct: 121 AVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTG 180
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
N GC GG MDDAF++I + G+ ++A Y YEG G C+S K AA+I YEDVP
Sbjct: 181 GENQGCSGGLMDDAFRFI-KIHGLASEATYPYEG-DDGTCNSKKEAHPAAKIKGYEDVPA 238
Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QPV+VAIDA QFY+ GVF G C T L+HGV AVGYG ++G+ Y
Sbjct: 239 NNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMY 298
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 299 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 330 bits (846), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 170/337 (50%), Positives = 226/337 (67%), Gaps = 17/337 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L G AS A R+ +E S+ E +QW A+YGR YK + E ++R IF++NL ++
Sbjct: 12 LALLFTIGVLASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQ 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FN A N+ Y L +N+FADLT +EF S+ FK H + N F Y++ + VP
Sbjct: 72 TFNKA--NNKPYKLGVNEFADLTNEEFTTSRNKFK--SHVCATVTN--VFRYENVTAVPA 125
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+++W +KGAVTP+K QGQC AVAA+EGI +K +L+SLSEQ+LVDC TN +
Sbjct: 126 TMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQ 185
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MD AF +I QN G++ + Y Y G + G C++ K +HAA IT +EDVP N E
Sbjct: 186 GCEGGLMDYAFDFIQQNHGLSTETNYPYSG-TDGTCNANKEANHAATITGHEDVPANSES 244
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+LLKAVANQP+SVAIDAS QFYS GVF G C T L+HGVTAVGYGT+ +G KYWL+K
Sbjct: 245 ALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGTAADGTKYWLVK 304
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG WGE+GY ++QR + +G CGIAM AS+P +
Sbjct: 305 NSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTA 341
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 167/339 (49%), Positives = 225/339 (66%), Gaps = 16/339 (4%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F +LI+ G A + R E S++ + EQW +G+ Y ++AE +RFEIFKDN+
Sbjct: 10 FFAFILIL-GMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEY 68
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+E FN A GN+ Y L +NKFADLT +E ++ G++ + +K T F Y++ + V
Sbjct: 69 IESFNTA--GNKPYKLSVNKFADLTNEELKVARNGYRRPLQTRPMKV--TSFKYENVTAV 124
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +++W +KGAVTP+K QGQC VAA EGIN + +LVSLSEQ+LVDC T
Sbjct: 125 PATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGE 184
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG M+D F++II+N GIT +A Y Y+ + G C+S K A+IT YE VP N
Sbjct: 185 DQGCEGGLMEDGFEFIIKNHGITTEANYPYQA-ADGTCNSKKEASRIAKITGYESVPANS 243
Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +LLKAVA+QP+SV+IDA S QFYS GVF G C T L+HGVTAVGYG + +G KYWL
Sbjct: 244 EAALLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGETSDGTKYWL 303
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+KNSWG WGE+GY R+QRD + +G CGIAM +S+P +
Sbjct: 304 VKNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 330 bits (846), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 174/341 (51%), Positives = 227/341 (66%), Gaps = 21/341 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
++ L +VL++S C SQ R E S++E+ EQW +YG+ YK++AE KR IFKDN+
Sbjct: 8 QHILALVLLLS-ICTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A GNR Y L +N AD T +EF+AS G+K S TPF Y++ +
Sbjct: 67 EFIESFN--AAGNRPYKLSINHLADQTNEEFVASHNGYKHKGSHSQ-----TPFKYENVT 119
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W E GAVT VK QGQC VAA EGI I + L+SLSEQ+LVDC +
Sbjct: 120 GVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV 179
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
D +GC GG+M+ F++II+N GI+++A Y Y + G CD+ K AAQI YE VP
Sbjct: 180 D--HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVD-GTCDANKEASPAAQIKGYETVPA 236
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N E++L KAVANQPVSV IDA SA QFYS GVF G C T L+HGVTAVGYG++++G +Y
Sbjct: 237 NSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQY 296
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
W++KNSWG WGE+GY R+QR D +G CGIAM AS+P +
Sbjct: 297 WIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 175/337 (51%), Positives = 231/337 (68%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L G A Q T RT + S+ E+ E+W A+Y + YK+ E KRF+IFK+N+ +E
Sbjct: 12 LALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA N+ Y L +N+FADLT +EFIA + FK H S T F Y++ + +P
Sbjct: 72 AFNNAA--NKPYKLGINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTALPS 127
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ++VDC T +
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQ 187
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GGFMD AFK+IIQN G+ +A Y Y+ + G C++ +A +HAA IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNTEANYPYKAVD-GKCNANEAANHAATITGYEDVPVNNEK 246
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVK 306
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY +QR + +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYPTA 343
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 170/312 (54%), Positives = 221/312 (70%), Gaps = 18/312 (5%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E+ EQW QYGR YK+ E + R+ IFK+N+ ++ FN+ +SY L +N+FADLT +
Sbjct: 3 ERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQT--GKSYKLGVNQFADLTNE 60
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------ 144
EF AS+ FK H S +A PF Y++ S VP +V+W ++GAVTPVK QGQC
Sbjct: 61 EFKASRNRFK--GHMCSPQAG--PFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAF 116
Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
AVAA+EGIN + +L+SLSEQ++VDC T + GC GG MDDAFK+I QNKG+T +A
Sbjct: 117 SAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEAN 176
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFY 261
Y Y+G + G C++ K+ HAA+IT +EDVP N E +L+KAVA QPVSVAIDA S QFY
Sbjct: 177 YPYKG-TDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFY 235
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
S G+F G C+T L+HGVTAVGYG S +G KYWL+KNSWG WGE+GY R+Q+DI +G
Sbjct: 236 SSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGL 294
Query: 322 CGIAMFASFPVS 333
CGIAM AS+P +
Sbjct: 295 CGIAMQASYPTA 306
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 176/337 (52%), Positives = 227/337 (67%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L G A Q T RT + S+ E+ QW A+Y + YK+ E KRF IFK+N+ +E
Sbjct: 12 LALLFCMGFLAFQVTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
FN+A N+SY L +N+FADLT +EFIA + FK H S T F Y++ V PS
Sbjct: 72 TFNSA--DNKSYKLDINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTVIPS 127
Query: 127 -VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ++VDC T +
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQ 187
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GGFMD AFK+IIQN G+ + Y Y+ + G C++ A +HAA IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNTEPNYPYKA-ADGKCNAKAAANHAATITGYEDVPVNNEK 246
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY R+QR + +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 231/337 (68%), Gaps = 15/337 (4%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L G A Q T RT + S+ E+ E+W A+Y + YK+ E KRF+IFK+N+ +E
Sbjct: 12 LALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNAA ++ Y L +N+FADLT +EFIA + FK H S T F Y++ + +P
Sbjct: 72 AFNNAA--DKPYKLGINQFADLTNEEFIAPRNKFK--GHMCSSITRTTTFKYENVTALPS 127
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ++VDC T +
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQ 187
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GGFMD AFK+IIQN G+ +A Y Y+ + G C++ +A +HAA IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNTEANYPYKAVD-GKCNANEAANHAATITGYEDVPVNNEK 246
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L KAVANQPVSVAIDAS QFY GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVK 306
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
NSWG +WGE+GY +QR + +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYPTA 343
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 168/305 (55%), Positives = 217/305 (71%), Gaps = 12/305 (3%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E+ EQW AQYGR YK+ AE R+ IFK+N+ ++ FN+ +SY L +N+FADL+ +
Sbjct: 3 ERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQT--GKSYNLGVNQFADLSNE 60
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCAVAAVE 150
EF AS+ FK H S +A PF Y++ S VP +++W +KGAVTPVK QGQC VAA+E
Sbjct: 61 EFKASRNRFK--GHMCSPQAG--PFRYENVSAVPATMDWRKKGAVTPVKDQGQC-VAAME 115
Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
GIN + +L+SLSEQ++VDC T + GC GG MDDAFK+I QNKG+T +A Y Y G +
Sbjct: 116 GINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTG-T 174
Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNG 268
G C++ K HAA+IT ++DVP N E +L+KAVA QPVSVAIDA QFYS G+F G
Sbjct: 175 DGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTG 234
Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
C T L+HGVTAVGYG S +G KYWL+KNSWG WGE+GY R+Q+DI +G CGIAM A
Sbjct: 235 SCGTELDHGVTAVGYGGS-DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQA 293
Query: 329 SFPVS 333
S+P +
Sbjct: 294 SYPTA 298
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 174/342 (50%), Positives = 229/342 (66%), Gaps = 23/342 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
++ L +VL++S C SQ R E S++E+ EQW +YG+ YK++AE KR IFKDN+
Sbjct: 8 QHILALVLLLS-ICTSQVMSRYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM-SDHSSSLKANGTPFLYKS- 120
+E FN A GN+ Y L +N AD T +EF+AS G+K + HS TPF Y++
Sbjct: 67 EFIESFN--AAGNKPYKLGINHLADQTNEEFVASHNGYKHKASHSQ------TPFKYENV 118
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
+ VP +V+W E GAVT VK QGQC VAA EGI I + L+SLSEQ+LVDC +
Sbjct: 119 TGVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDS 178
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
D +GC GG+M+ F++II+N GI+++A Y Y + G CD+ K AAQI YE VP
Sbjct: 179 VD--HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVD-GTCDANKEASPAAQIKGYETVP 235
Query: 234 PNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N E++L KAVANQPVSV IDA SA QFYS GVF G C T L+HGVTAVGYG++++G +
Sbjct: 236 ANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQ 295
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YW++KNSWG WGE+GY R+QR D +G CGIAM AS+P +
Sbjct: 296 YWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 16/339 (4%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F +LI+ G A + R E ++ + EQW A YG+ Y ++AE +RF+IFK+N+
Sbjct: 10 FFAFILIL-GMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEY 68
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+E FN A GN+ Y L +NKFAD T ++F ++ G++ + +K T F Y++ + V
Sbjct: 69 IESFNTA--GNKPYKLSVNKFADQTNEKFKGARNGYRRPFQTRPMKV--TSFKYENVTAV 124
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +++W +KGAVTP+K QGQC VAA EGIN + +LVSLSEQ+LVDC
Sbjct: 125 PATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGE 184
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG M+D F++II+N GIT +A Y Y+ + G C+S K H A+IT YE VP N
Sbjct: 185 DQGCEGGLMEDGFEFIIKNHGITTEANYPYQA-ADGTCNSKKQASHIAKITGYESVPANS 243
Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E LLK VANQP+SV+IDA S QFYS GVF G C T L+HGVTAVGYG + +G KYWL
Sbjct: 244 EAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWL 303
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+KNSW WGE+GY R+QRDID +G CGIAM +S+P +
Sbjct: 304 VKNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 167/346 (48%), Positives = 222/346 (64%), Gaps = 16/346 (4%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
K I + + CA QA R E + + E+W A++G+ YK+ E +RF+IFK N
Sbjct: 7 GKILPIALFFVLAMCADQAASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSN 66
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+V +E FN A GN+SY L +NKFADLT +EF A G+K +S TPF Y++
Sbjct: 67 VVFIESFNTA--GNKSYMLGINKFADLTNEEFRAFWNGYKRPLGASR---KITPFKYENV 121
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+ +P S++W KGAVTP+K QG C AVAA EGI+ ++ +LVSLSEQ+LVDC
Sbjct: 122 TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDV 181
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ GC GG M DAFK+I ++ G+T++A Y Y+G G CD+ K A +IT Y+ VP
Sbjct: 182 KGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRD-GKCDTKKEASRAVKITGYQAVP 240
Query: 234 PNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N E +LLKAVANQPVSVAIDA +L QFY G+F G C +NHGV AVGYG S G K
Sbjct: 241 KNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNSGSK 300
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
YW++KNSWG +WGE GY R++RD+ +G CGIAM S+P ++ A
Sbjct: 301 YWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTAQVQA 346
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 16/339 (4%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F +LI+ G A + R E ++ + EQW A YG+ Y ++AE +RF+IFK+N+
Sbjct: 10 FFAFILIL-GMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEY 68
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+E FN A GN+ Y L +NKFAD T ++F ++ G++ + +K T F Y++ + V
Sbjct: 69 IESFNTA--GNKPYKLSVNKFADQTNEKFKGARNGYRRPFQTRPMKV--TSFKYENVTAV 124
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +++W +KGAVT +K QGQC VAA EGIN + +LVSLSEQ+LVDC
Sbjct: 125 PATMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGE 184
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG M+D F++II+N GIT +A Y Y+ + G C+S K H A+IT YE VP N
Sbjct: 185 DQGCEGGLMEDGFEFIIKNHGITTEANYPYQA-ADGTCNSKKQASHIAKITGYESVPANS 243
Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E LLK VANQP+SV+IDA S QFYS GVF G C T L+HGVTAVGYG + +G KYWL
Sbjct: 244 EAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWL 303
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+KNSWG WGE+GY R+QRDID +G CGIAM +S+P +
Sbjct: 304 VKNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 172/343 (50%), Positives = 223/343 (65%), Gaps = 22/343 (6%)
Query: 3 KYFLIVVLIISGSCASQATY-RTFDEG-SIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
K + L+I ASQ R+ E S+ E+ EQW AQ+GR YK +AE + RFEIF+
Sbjct: 8 KLLPALALLIVAIWASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRFEIFRA 67
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
N+ +E FN N + L +N+FADLT +EF T K S +S+ F Y++
Sbjct: 68 NVERIESFNAE---NHKFKLGVNQFADLTNEEFKTRNT-LKPSKMAST-----KSFKYEN 118
Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+ VP +++W KGAVTP+K QGQC AVAA EGI + +L+SLSEQ++VDC
Sbjct: 119 VTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQEVVDCD 178
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
++ GC GG MDDAF+YII+NKGIT +A Y Y+ + G C++ KA HAA IT YEDV
Sbjct: 179 VTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKA-ADGTCNTKKAASHAASITGYEDV 237
Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
N E +LLKA ANQP++VAIDA A Q YS GVF G C T L+HGVT VGYG + +G
Sbjct: 238 TVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGATSDGT 297
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KYWL+KNSWG WGEDGY R++RD+D +G CGIAM AS+P +
Sbjct: 298 KYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 226/341 (66%), Gaps = 16/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
++ LI + + A QA+ R E ++ E+ E+W A++G+ YK+ E +RF+IFK+N+
Sbjct: 8 QFLLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E N A GN SY L +N+FADLT +EF AS G+K +S + TPF Y++ +
Sbjct: 68 EFIESSN--AAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIV---TPFKYENVT 122
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P S++W KGAVT +K Q +C AVAA EG++ ++ +LVSLSEQ+LVDC
Sbjct: 123 ALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVK 182
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG M+DAFK+I +N GIT +A Y+Y G G CD+ K H A+IT Y+ VP
Sbjct: 183 GEDKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRD-GKCDTKKEASHVAKITGYQVVPE 241
Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N E +LLKAVA+QPVSV+IDA ++ QFY G++ G C + LNHGV AVGYGTS G KY
Sbjct: 242 NSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSSGSKY 301
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
W++KNSWG +WGE GY R++RDI +G CGIAM S+P +
Sbjct: 302 WIVKNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 174/349 (49%), Positives = 229/349 (65%), Gaps = 23/349 (6%)
Query: 1 MAKYFLIVVLIISGSCASQA---TYRTFDEGSIAEKFEQWKAQYGRTYKESAEN--SKRF 55
+ + FL V L++S + Q + DE S+ + E+W +Q+GR Y + E+ +KRF
Sbjct: 3 LLQIFLFVALVLSFCFSIQLAGLSRPLLDEDSM--RHEEWMSQHGRVYADEQEDHKNKRF 60
Query: 56 EIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP 115
+FK+N+ +E FN+ +++ L +N+FADLT +EF AS GFK SS TP
Sbjct: 61 NVFKENVERIEEFNDG----KTFKLAINQFADLTNEEFRASYNGFKGPMVLSSQITKPTP 116
Query: 116 FLYK--SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQ 166
F Y+ SS +P SV+W +KGAVTPVK QGQC AVAA+EGI I +L+SLSEQ
Sbjct: 117 FRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQ 176
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+LVDC T ++GC GG MD AF++II N G+T ++ Y Y+G G C+ K A I
Sbjct: 177 ELVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKG-EDGTCNFNKTNPIAVSI 235
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYG 284
T YEDVP NDE++L+KAVA+QPVSVAI+A S QFYS GVF G C T L+H VTAVGYG
Sbjct: 236 TGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYG 295
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
SE+G KYW++KNSWG WGE GY +Q+DI QG CGIAM AS+P +
Sbjct: 296 ESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 174/341 (51%), Positives = 222/341 (65%), Gaps = 17/341 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
Y + + G + QAT RT + E EQW Q+G+ YK + E KRF IFK+N+
Sbjct: 8 HYIPFALFLCLGLLSFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FNN +GN+SY L LN FADLT EFIA++ F H S + T F YK+ S
Sbjct: 68 NYIEAFNN--VGNKSYKLGLNHFADLTNHEFIAARNKFNGYLHGSII----TTFKYKNVS 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W ++GAVTPVK QGQC AVA+ EGI+ + LVSLSEQ+LVDC TN
Sbjct: 122 DVPSAVDWRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTN 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAF++IIQN G++ +A Y Y+G+ G C+ + AA I+ YE+VP
Sbjct: 182 GEDQGCEGGLMDDAFEFIIQNNGLSTEAEYPYQGVD-GTCNKTEVGSSAATISGYENVPV 240
Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
NDE++L KAVANQPVSVAIDAS QFY GVF G C T L+HGV VGYG E+ +Y
Sbjct: 241 NDEQALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+QR +D +G CGIAM S+P +
Sbjct: 301 WLVKNSWGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 170/339 (50%), Positives = 218/339 (64%), Gaps = 17/339 (5%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L + L+ G A + S+AE+ +W A++GRTYK++AE +R IFK N+ +
Sbjct: 7 LWMALLALGLGACSPAAAELGDASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYI 66
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVP 124
E FN G R Y L N+FADLT +EF A TGFK S + NG F + S S VP
Sbjct: 67 ESFN---AGKRKYQLAANQFADLTHEEFKAMHTGFKPSGTGAKKAGNG--FRHGSLSSVP 121
Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
SV+W KGAVTPVK QG C VAAVEGI I +L+SLSEQQLVDC + +
Sbjct: 122 DSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKD 181
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MD AF++I+ N GIT++A Y YE + +C++ A A I ++EDVP NDE
Sbjct: 182 QGCQGGDMDAAFEFIVNNGGITSEANYPYEEVQR-LCNAHNASFVVATIESHEDVPTNDE 240
Query: 238 ESLLKAVANQPVSVAIDASA---LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
++L KAVANQPVSV IDA + Q YSGGVF+G C T L+H VT VGYGT+ +G KYWL
Sbjct: 241 KALRKAVANQPVSVGIDAGSSLDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSDGTKYWL 300
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KNSWG+ WGE+GY R++RD+ +G CGIAM AS+P +
Sbjct: 301 AKNSWGETWGENGYIRMERDVAAKEGLCGIAMQASYPTA 339
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 170/343 (49%), Positives = 222/343 (64%), Gaps = 21/343 (6%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F ++ +I+S + + E S EK EQW +++ R Y + +E + RFEIFK NL
Sbjct: 6 FFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKF 65
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF----KMSDHSSSLKANGTPFLYKS 120
VE FN N++YTL +N+F+DLT +EF A TG M+ S++ F Y++
Sbjct: 66 VESFNMNT--NKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRMSTTDSHETVSFRYEN 123
Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+ S++W E+GAVT VK+Q QC AVAAVEG+ I LVSLSEQQL+DC+
Sbjct: 124 VGETGESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDCS 183
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
T N+GC GG M AF YI++N+GIT + Y Y+G + C+S AA I+ YE V
Sbjct: 184 TE--NDGCDGGIMWKAFDYIVENQGITAEDNYPYQG-AQQTCESNHVA--AATISGYETV 238
Query: 233 PPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P NDEE+LLKAV+ QPVSVAI+ S +F YSGG+FNG C T LNH VT VGYG SEEGI
Sbjct: 239 PQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEEGI 298
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KYWL+KNSWG+ WGEDGY R+ RD+D PQG CG+A A +PV+
Sbjct: 299 KYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 167/328 (50%), Positives = 217/328 (66%), Gaps = 15/328 (4%)
Query: 16 CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
C SQ R + S+ E+ EQW +YG+ YK+SAE KRF IF++N+ +E FN A GN
Sbjct: 20 CTSQVKSRKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFN--AAGN 77
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGA 134
+ Y L +N AD T +EF+AS G+K S TPF Y++ + +P +V+W +KG
Sbjct: 78 KPYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGD 137
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
T +K QGQC AVAA EGI I LVSLSEQ+LVDC + D+ GC GG M+
Sbjct: 138 ATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSVDH--GCDGGLMEH 195
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
F++II+N GI+++A Y Y ++ G CD+ K AQI YE VP N EE L KAVANQ
Sbjct: 196 GFEFIIKNGGISSEANYPYTAVN-GTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQ 254
Query: 248 PVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
PVSV+IDA SA QFYS GVF G C T L+HGVTAVGYG++++GI+YW++KNSWG WGE
Sbjct: 255 PVSVSIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGE 314
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+GY R+ R ID +G CGIAM AS+P +
Sbjct: 315 EGYIRMLRGIDAQEGLCGIAMDASYPTA 342
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 163/324 (50%), Positives = 220/324 (67%), Gaps = 12/324 (3%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
S+AT RT ++ ++ + EQW A +GR Y + E RF+IFK+N+ ++ N A ++S
Sbjct: 39 SRATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHN--ARSDQS 96
Query: 78 YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTP 137
YTL +NKFADLT EF AS+ G+K S S +G S VP V+W ++GAVTP
Sbjct: 97 YTLEVNKFADLTNDEFRASRNGYKKQPDSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTP 156
Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG C AVAA+EGIN ++ +LVSLSEQ+LVDC + + GC GG M++AF+
Sbjct: 157 VKDQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQ 216
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
+I + KG+ ++VY Y G GIC++ KA AA+I+ +E VP N+E++LL+AVANQPVS
Sbjct: 217 FIEKRKGLAAESVYPYTG-EDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVS 275
Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
+AIDAS QFYSGGVF G C T L+H +TAVGYG + +G KYWL+KNSWG WGE+GY
Sbjct: 276 IAIDASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGY 335
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
R++RD +G CGIAM S+PV
Sbjct: 336 IRIKRDSLAKEGLCGIAMDPSYPV 359
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 215/326 (65%), Gaps = 21/326 (6%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
SQ R E S+ E+ EQW +YG+ YK++AE KRF+IFKDN+ +E FN A GN+
Sbjct: 22 SQVMCRKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFN--ADGNKP 79
Query: 78 YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVT 136
Y L +N ADLT +EF AS+ GFK S+ T F Y++ + +P +++W KGAVT
Sbjct: 80 YKLGVNHLADLTVEEFKASRNGFKRPHEFST-----TTFKYENVTAIPAAIDWRTKGAVT 134
Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
P+K QGQC +AA EGI+ I +LVSLSEQ+LVDC T + GC GG+M+D F
Sbjct: 135 PIKDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGF 194
Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
++II+N GIT++ Y Y+ + G C+ KA AQI YE VPPN E +L KAVANQPV
Sbjct: 195 EFIIKNGGITSETNYPYKAVD-GKCN--KATSPVAQIKGYEKVPPNSETALQKAVANQPV 251
Query: 250 SVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
SV+IDA + FYS G++NG C T L+HGVTAVGYGT+ G YW++KNSWG WGE G
Sbjct: 252 SVSIDADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA-NGTDYWIVKNSWGTQWGEKG 310
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVS 333
Y R+QR I G CGIA+ +S+P S
Sbjct: 311 YVRMQRGIAAKHGLCGIALDSSYPTS 336
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 169/341 (49%), Positives = 224/341 (65%), Gaps = 38/341 (11%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + + ASQAT R E S+ E+ E W QYGR YK++ E SKR++IFKDN+
Sbjct: 8 QYICLALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A ++SY L +N+FADLT +EF AS+ FK H S +A T F Y++ +
Sbjct: 68 ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC TN Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGC------------------TN---YPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 219
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QP++VAIDA S QFYS GVF G C T L+HGV+AVGYGTS++G+KY
Sbjct: 220 NNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKY 279
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 280 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 168/341 (49%), Positives = 223/341 (65%), Gaps = 38/341 (11%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + + ASQAT R+ E S+ E+ E W QYGR YK++ E SKR++IFKDN+
Sbjct: 8 QYICLALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A ++SY L +N+FADLT +EF AS+ FK H S +A T F Y++ +
Sbjct: 68 ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +V+W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC TN Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGC------------------TN---YPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 219
Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QP++VAIDAS QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 220 NNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 279
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSW WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 280 WLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 162/294 (55%), Positives = 204/294 (69%), Gaps = 14/294 (4%)
Query: 50 ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
E KR IF N+ +E +N+A+ N+ Y L +NKFADLT +EFIAS+ FK SS +
Sbjct: 3 EREKRLRIFNKNVNYIEA-SNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSII 61
Query: 110 KANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLV 161
+ T F Y+ +S +P +V+W +KGAVTPVK QGQC AVAA EGI+ + +LV
Sbjct: 62 RT--TTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLV 119
Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
SLSEQ+L+DC T + GC GG MDDAFK+IIQN G++ + Y YEG+ G C++ KA
Sbjct: 120 SLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVD-GTCNANKASI 178
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVT 279
HA IT YEDVP N+E +L KAVANQP+SVAIDAS QFY+ GVF G C T L+HGVT
Sbjct: 179 HAVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVT 238
Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
AVGYG +G KYWL+KNSWG DWGE+GY R+QR I +G CGIAM AS+P +
Sbjct: 239 AVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 166/341 (48%), Positives = 221/341 (64%), Gaps = 36/341 (10%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L + + ASQAT R E S+ E+ E W AQYGR YK++ E SKR++IFKDN+
Sbjct: 8 QYICLALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN A ++SY L +N+FADLT +EF S+ FK H S +A T F Y++ +
Sbjct: 68 ARIESFNKAM--DKSYKLSINEFADLTNEEFGTSRNRFKA--HICSTEA--TSFKYENVT 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +++W +KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC N A Y Y G + G C+ KA AA+I YEDVP
Sbjct: 182 GEDQGC-------------------NGANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 221
Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAV +QP++VAIDA QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 222 NNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 281
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 282 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 164/342 (47%), Positives = 218/342 (63%), Gaps = 25/342 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGS-IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
++ VL + C + R +E S + + EQW AQY R YK++AE ++RFE+FK N+
Sbjct: 8 ILAVLSFAFFCGAALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKF 67
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQ 122
+E FN GNR + L +N+FADLT EF ++T GFK SL T F Y++
Sbjct: 68 IESFNTG--GNRKFWLGINQFADLTNDEFRTTKTNKGFK-----PSLDKVSTGFRYENVS 120
Query: 123 V---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
V P +++W GAVTP+K QGQC AVAA EGI I +L+SLSEQ+LVDC
Sbjct: 121 VDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCD 180
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ + GC GG MDDAFK+II+N G+T ++ Y Y + G C S + AA I YEDV
Sbjct: 181 VHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTA-ADGKCKS--GSNSAANIKGYEDV 237
Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P NDE +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG + +G
Sbjct: 238 PTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGT 297
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYWL+KNSWG WGE+GY R+++DI +G CG+AM S+P
Sbjct: 298 KYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYPT 339
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/338 (46%), Positives = 223/338 (65%), Gaps = 17/338 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ +L++ G A A RT ++ S+ E+ EQW AQ+G+ YK+ E R++IF+ N+ +E
Sbjct: 12 LALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIE 71
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
FNNA GN+S+ L +N+FADLT +EF A K+ + S + + F Y+ ++VP
Sbjct: 72 GFNNA--GNKSHKLGVNQFADLTEEEFKAIN---KLKGYMWSKISRTSTFKYEHVTKVPA 126
Query: 126 SVNWIEKGAVTPVKYQG-QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
+++W +KGAVTP+K QG +C AVAA EGI + L+SLSEQ+L+DC TN +N
Sbjct: 127 TLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDN 186
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC G + +AFK+I+QNKG+ +A Y Y+ + G C++ H A I YEDVP N+E
Sbjct: 187 GGCKWGIIQEAFKFIVQNKGLATEASYPYQAVD-GTCNAKVESKHVASIKGYEDVPANNE 245
Query: 238 ESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
+LL AVANQPVSV +D+S +FYS GV +G C T +H VT VGYG S++G KYWLI
Sbjct: 246 TALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLI 305
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KNSWG WGE GY R++RD+ +G CGIAM AS+P++
Sbjct: 306 KNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPIA 343
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 166/342 (48%), Positives = 220/342 (64%), Gaps = 22/342 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K + I + ++ Q R E S+ E+ EQW A+YG+ YK++AE KRF IFK N+
Sbjct: 7 KQYTIALFLLLALGIPQMMSRKLHETSMRERHEQWMAEYGKVYKDAAEKEKRFLIFKHNV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E FN AA N+ Y L +N ADLT +EF AS+ G K S+ TPF Y++ +
Sbjct: 67 EFIESFNAAA--NKPYKLGVNHLADLTVEEFKASRNGLKRPYELST-----TPFKYENVT 119
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA--------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W KGAVT +K QGQCA VAA EGI+ I +LVSLSEQ+LVDC T
Sbjct: 120 AIPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDT 179
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ GC GG+M+D F++II+N GIT++A Y Y+ + G C+ KA AQI YE VP
Sbjct: 180 KGVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVD-GKCN--KATSPVAQIKGYEKVP 236
Query: 234 PNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
PN E++L KAVANQPVSV+IDA+ FYS G++NG C T L+HGVTAVGYG + G
Sbjct: 237 PNSEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIA-NGTD 295
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YWL+KNSWG WGE GY R+QR + G CGIA+ +S+P +
Sbjct: 296 YWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 165/347 (47%), Positives = 219/347 (63%), Gaps = 20/347 (5%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+A IV L S A A R + ++A + E+W AQ+GR YK++AE ++R E+FK
Sbjct: 10 LAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLEVFK 69
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT---GFKMSDHSSSLKANGTPF 116
N+ +E FN A G Y L +N+FADLT +EF A+ T GF ++ + T F
Sbjct: 70 ANVAFIESFN--AGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVS---TGF 124
Query: 117 LYK---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQ 166
Y+ + +P SV+W KGAVT +K QGQC AVAA+EGI + +L+SLSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 184
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+LVDC + N+ GC GG +D AF++I+ N G+T +A Y Y G C + A D AA I
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTA-EDGRCKTTAAADVAASI 243
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
YEDVP NDE SL+KAVA QPVSVA+DAS QFY GGV G C T L+HGVT +GYG +
Sbjct: 244 RGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFYGGGVMAGECGTSLDHGVTVIGYGAA 303
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G KYWL+KNSWG WGE GY R+++DID +G CG+AM S+P +
Sbjct: 304 SDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 169/354 (47%), Positives = 224/354 (63%), Gaps = 22/354 (6%)
Query: 5 FLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
FL ++I+ +C + + E ++ +++W++ + + E KRF +F+ N++
Sbjct: 8 FLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHS-VPRSLNEREKRFNVFRHNVM 66
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS- 120
V N NRSY L+LNKFADLT EF + TG + H K F+Y
Sbjct: 67 HVHNTNKK---NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHE 123
Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
S++P SV+W +KGAVT +K QG+C VAAVEGIN IK N+LVSLSEQ+LVDC
Sbjct: 124 NLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDC 183
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
T N GC GG M+ AF++I +N GIT + Y YEG+ G CD+ K I +ED
Sbjct: 184 DTK-QNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGID-GKCDASKDNGVLVTIDGHED 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP NDE +LLKAVANQPVSVAIDA S QFYS GVF G C T LNHGV AVGYG SE G
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYG-SERG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSAD 343
KYW+++NSWG +WGE GY +++R+ID+P+G+CGIAM AS+P+ S+ P+ D
Sbjct: 301 KKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKLSSSNPTPKD 354
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 163/342 (47%), Positives = 222/342 (64%), Gaps = 25/342 (7%)
Query: 6 LIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
++ +L ++ C + R D+ ++ + EQW AQY R YK++ E ++RFE+FK N+
Sbjct: 8 ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQ 122
+E FN A GNR + L +N+FADLT EF A++T GFK S +K T F Y++
Sbjct: 68 IESFN--AGGNRKFWLGVNQFADLTNDEFRATKTNKGFK----PSPVKVP-TGFRYENVS 120
Query: 123 V---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
V P S++W KGAVTP+K QGQC AVAA EGI I ++L+SLSEQ+LVDC
Sbjct: 121 VDALPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCD 180
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ + GC GG MDDAFK+II+N G+T ++ Y Y + G C S + AA I +EDV
Sbjct: 181 VHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-TDGKCKS--GTNSAANIKGFEDV 237
Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P NDE +L+KAVANQPVSVA+D + Q YSGGV G C T L+HG+ A+GYG + +G
Sbjct: 238 PANDEAALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGT 297
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYWL+KNSWG WGE+GY R+++DI +G CG+AM S+P
Sbjct: 298 KYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 164/346 (47%), Positives = 217/346 (62%), Gaps = 20/346 (5%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+A IV L S A A R + ++A + E+W AQ+GR YK++AE ++R E+FK
Sbjct: 10 LAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLEVFK 69
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT---GFKMSDHSSSLKANGTPF 116
N+ +E FN A G Y L +N+FADLT +EF A+ T GF ++ + T F
Sbjct: 70 ANVAFIESFN--AGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVS---TGF 124
Query: 117 LYK---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQ 166
Y+ + +P SV+W KGAVT +K QGQC AVAA+EG + +L+SLSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISLSEQ 184
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+LVDC + N+ GC GG +D AF++I+ N G+T +A Y Y G C + A D AA I
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTA-EDGRCKTTAAADVAASI 243
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
YEDVP NDE SL+KAVA QPVSVA+DAS QFY GGV G C T L+HGVT +GYG +
Sbjct: 244 RGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFYGGGVMAGECGTSLDHGVTVIGYGAA 303
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYWL+KNSWG WGE GY R+++DID +G CG+AM S+P
Sbjct: 304 SDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPT 349
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 170/346 (49%), Positives = 224/346 (64%), Gaps = 26/346 (7%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+FL+ +L+ S + + F E S EK EQW +++ R Y + +E + RFEIF +NL
Sbjct: 6 FFLLAILLSSRTSGVTSRGGLF-EASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLK 64
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF----KMSDHSSSLKANGTPFLYK 119
VE N N++YTL +N+F+DLT +EF A TG M+ S++ F Y+
Sbjct: 65 FVESINMNT--NKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRYE 122
Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
+ + S++WI++GAVT VK+Q QC AVAAVEG+ I LVSLSEQQL+DC
Sbjct: 123 NVGETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH--AAQITNY 229
+T NNGC GG M AF YI +N+GIT + Y Y+G + C+S +H AA I+ Y
Sbjct: 183 STE--NNGCGGGIMWKAFDYIKENQGITTEDNYPYQG-AQQTCES----NHLAAATISGY 235
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSE 287
E VP NDEE+LLKAV+ QPVSVAI+ S +F YSGG+FNG C T L H VT VGYG SE
Sbjct: 236 ETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSE 295
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
EGIKYWL+KNSWG+ WGE+GY R+ RD+D PQG CG+A A +PV+
Sbjct: 296 EGIKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 167/345 (48%), Positives = 226/345 (65%), Gaps = 21/345 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K ++ + + SQ R + ++ E+ E W A+YG+ YK++AE KRF+IFKDN+
Sbjct: 7 KQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTPFLYKS 120
+E FN A GN+ Y L +N ADLT +EF S+ G K + +++ K NG F Y++
Sbjct: 67 EFIESFN--AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNG--FKYEN 122
Query: 121 -SQVPPSVNWIEKGAVTPVKYQG-QCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
+ +P +++W KGAVTP+K QG QC VAA EGI I L+SLSEQ+LVDC
Sbjct: 123 VTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ D+ GC GG M+D F++II+N GI+++A Y Y + G CD+ K AAQI YE
Sbjct: 183 DSVDH--GCDGGLMEDGFEFIIKNGGISSEANYPYTAVD-GTCDASKEASPAAQIKGYET 239
Query: 232 VPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N EE+L +AVANQPVSV+IDA S QFYS GVF G C T L+HGVT VGYGT+++G
Sbjct: 240 VPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDG 299
Query: 290 I-KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+YW++KNSWG WGE+GY R+QR ID +G CGIAM AS+P +
Sbjct: 300 THEYWIVKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 160/342 (46%), Positives = 220/342 (64%), Gaps = 21/342 (6%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
+K FL+ +L + C+S R + ++ E+ E W +YGR YK++AE ++RFE+FKDN
Sbjct: 4 SKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDN 63
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+ VE FN N + L +N+FADLT +EF A++ GFK S+ K T F Y++
Sbjct: 64 VAFVESFNTNK--NNKFWLGINQFADLTIEEFKANK-GFK---PISAEKVPTTGFKYENL 117
Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
S +P +V+W KGAVTP+K QGQC AVAA+EGI + L+SLSEQ+LVDC
Sbjct: 118 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDC 177
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
T+ + GC GG+MD AF+++I+N G+ + Y Y+ + G C AA I +ED
Sbjct: 178 DTHSMDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVD-GKCKG--GSKSAATIKGHED 234
Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP NDE +L+KAVANQPVSVA+DAS F YSGGV G C T L+HG+ A+GYG +G
Sbjct: 235 VPVNDEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDG 294
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
KYW++KNSWG WGE G+ R+++DI QG CG+AM S+P
Sbjct: 295 TKYWILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 159/342 (46%), Positives = 227/342 (66%), Gaps = 18/342 (5%)
Query: 6 LIVVLIISGSCASQ-ATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
L + I G SQ A+ R + E S+ + +QW A + + YK+ E RF+IFK+N+
Sbjct: 12 LALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVE 71
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG--TPFLYKS- 120
+E FN A ++ Y L +NKF+DLT ++F TG+K S H + ++ T F Y +
Sbjct: 72 RIEAFN--AGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRS-HPKVMSSSKPKTHFRYANV 128
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+ +PP+++W +KGAVTP+K Q +C AVAA EG++ +K +L+ LSEQ+LVDC
Sbjct: 129 TDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDV 188
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ GC GG +D AF +I++NKG+T +A Y Y+G G+C+ K+ AA+I YEDVP
Sbjct: 189 EGEDEGCSGGLLDTAFDFILKNKGLTTEANYPYKG-EDGVCNKKKSALSAAKIAGYEDVP 247
Query: 234 PNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N E++LL+AVANQPVSVAID S+ QFYS GVF+G C T+LNH VTAVGYG + +G K
Sbjct: 248 ANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTK 307
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YW+IKNSWG WG+ GY R++RD+ + +G CG+AM AS+P +
Sbjct: 308 YWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 168/354 (47%), Positives = 227/354 (64%), Gaps = 22/354 (6%)
Query: 5 FLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
FL ++I+ +C + + E +++ +++W++ + + E KRF +F+ N++
Sbjct: 8 FLFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHS-VPRSLHEREKRFNVFRHNVM 66
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS- 120
V +N+ NRSY L+LNKFADLT EF + TG K+ H K F+Y
Sbjct: 67 HV---HNSNKKNRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPKRGSKQFMYDHE 123
Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
S++P SV+W +KGAVT +K QG+C VAAVEGIN IK N+LVSLSEQ+LVDC
Sbjct: 124 NVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDC 183
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
TN N GC GG M+ AF++I +N GIT + Y YEG+ G CD+ K I +E+
Sbjct: 184 DTN-QNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGID-GKCDASKDNGVLVTIDGHEN 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP NDE +LLKAVANQPVSVAIDA S QFYS GVF G C T LNHGV VGYG S+ G
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYG-SQGG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSAD 343
KYW+++NSWG +WGE GY +++R ID+P+G+CGIAM AS+P+ S+ P+ D
Sbjct: 301 KKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPIKLSSSNPTPKD 354
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 169/339 (49%), Positives = 222/339 (65%), Gaps = 22/339 (6%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL++ + IS S+ + T E S+ E+ EQW A+Y + YK++AE KRF IFKDN+
Sbjct: 15 FLLLAVGIS-RVISRELHET--ETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEF 71
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+E FN A GN+ Y L +N ADLT +EF AS+ G K S + T F Y++ + +
Sbjct: 72 IESFN--AAGNKPYKLGVNHLADLTIEEFKASRNGLK---RSYDYEVGTTSFKYENVTAI 126
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P SV+W +KGAVTP+K QGQC VAA EGI+ I +LVSLSEQ+LVDC
Sbjct: 127 PASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGT 186
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG+M+D F++II+N GIT +A Y Y+ + G C + A AAQI YE VP N
Sbjct: 187 DQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVD-GSCKNATAP--AAQIKGYEKVPVNS 243
Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E++LLKAVANQPVSV+IDA+ + FYS G+F G C T L+HGVTAVGYG + G YW+
Sbjct: 244 EKALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRA-NGTDYWI 302
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+KNSWG WGE GY R+QR I +G CGIAM +S+P +
Sbjct: 303 VKNSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 158/347 (45%), Positives = 227/347 (65%), Gaps = 17/347 (4%)
Query: 1 MAKYFLIVVLIIS-GSCASQ-ATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
+++Y + + I G +SQ A R + E ++ + +QW + + YK+ E RF+I
Sbjct: 6 LSQYLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQI 65
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG-TPF 116
FK+N+ +E FN A ++ Y L NKF+DLT +EF TG+K S + G T F
Sbjct: 66 FKENVERIEAFN--AGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHF 123
Query: 117 LYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
Y + + +PP+++W +KGAVTP+K Q +C AVAA+EG++ +K L+ LSEQ+L
Sbjct: 124 RYTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQEL 183
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC + GC GG +D AF +I++NKG+T + Y Y+G G+C+ K+ AA+IT
Sbjct: 184 VDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKG-EDGVCNKKKSALSAAKITG 242
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTS 286
YEDVP N E++LL+AVANQPVSVAID S+ QFYS GVF+G C T+LNH VTAVGYG +
Sbjct: 243 YEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGAT 302
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G KYW+IKNSWG WG+ GY R++RD+ + +G CG+AM AS+P +
Sbjct: 303 TDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 154/314 (49%), Positives = 209/314 (66%), Gaps = 17/314 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++ E+W AQ+GR Y + E KR+ IFK+N+ +E FNN + +R Y L +NKFADLT
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGS--DRGYKLGVNKFADLT 58
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC---- 144
+EF A G+K S K + F +++ S +P S++W + GAVTPVK QG C
Sbjct: 59 NEEFRAMHHGYK----RQSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCW 114
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AVAA+EGI +K +L+SLSEQQLVDC + GC GG MD+AF++I++N G+T++
Sbjct: 115 AFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSE 174
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
A Y Y+G+ G C S K A+IT YEDVP N+E +LL+AVA QPVSVA++ Q
Sbjct: 175 ATYPYQGVD-GTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQ 233
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GVF G C T+L+H VTA+GYGT+ +G YWL+KNSWG WGE GY R+QR I +
Sbjct: 234 FYKSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGARE 293
Query: 320 GQCGIAMFASFPVS 333
G CG+AM AS+P +
Sbjct: 294 GLCGVAMDASYPTA 307
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 160/344 (46%), Positives = 221/344 (64%), Gaps = 17/344 (4%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ L VVL C++ + R + ++ E+ EQW AQ+GR YK+ AE ++RFE F++N+V
Sbjct: 7 FLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRNNVV 66
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGT-PFLYK- 119
+E FN AA R + L +N+F DLT EF A++T GF + ++ KA+ T F Y
Sbjct: 67 FIESFN-AAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRYSN 125
Query: 120 --SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
+ +P +V+W KGAVTP+K QGQC AVAA EGI + +LV LSEQ+LVD
Sbjct: 126 VSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQELVD 185
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C N ++GC GG MDDAF++II+N G+T++ Y Y G C + + A I YE
Sbjct: 186 CDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQD-GQCKAKNTINSVATIKGYE 244
Query: 231 DVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
DVP NDE SL+KAVA QPVSVA+D + Q Y+GGV +G C T L+HG+ AVGYG +++
Sbjct: 245 DVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAADD 304
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G K+WL+KNSWG WGEDGY R+++D+ G CG+AM S+P
Sbjct: 305 GTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYPT 348
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 167/339 (49%), Positives = 216/339 (63%), Gaps = 35/339 (10%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+ + L+I G ASQA RT E S++E+ E W YGRTYK+ AE +RF+IFK+N+
Sbjct: 7 IICITLLIMGVWASQALSRTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEY 66
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+E N +F AS+ G+ MS S + T F Y++ + V
Sbjct: 67 IESVN----------------------KFKASRNGYNMSSRPRSSEI--TSFRYENVAAV 102
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P S++W +KGAVTP+K QGQC AVAA+EG+ +K L+SLSEQ+LVDC T+
Sbjct: 103 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 162
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MD AF++II N G+T +A Y Y+G+ C+ KA AA+I NYEDVP N
Sbjct: 163 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDA-TCNKKKAASSAAKIKNYEDVPANS 221
Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +LLKAVA PVSVAIDA S QFYS GVF G C T L+HGVTAVGYG +++G KYWL
Sbjct: 222 EAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWL 281
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+KNSWG WGEDGY ++RDI +G CGIAM AS+P +
Sbjct: 282 VKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 165/342 (48%), Positives = 218/342 (63%), Gaps = 21/342 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGS-IAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
+ FL +LI++ + A++ R DE + ++ E+W AQ+GR Y + E KR+ IFK+N
Sbjct: 9 RIFLPFLLILA-AWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKEN 67
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+ +E FNN + +R Y L +NKFADLT +EF A G+K S K + F Y++
Sbjct: 68 IERIEAFNNGS--DRGYKLGVNKFADLTNEEFRAMYHGYK----RQSSKLMSSSFRYENL 121
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P S++W GAVTPVK QG C VAA+EGI ++ L+SLSEQQLVDC
Sbjct: 122 SDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTA 181
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GG MD AF+YII+N G+T++ Y Y+G+ G C S KA AQIT YEDVP
Sbjct: 182 G--NKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVD-GTCSSEKAASTEAQITGYEDVP 238
Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N+E +LL+AVA QPVSV +D QFY GVFNG C T NH VTA+GYGT +G
Sbjct: 239 QNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTD 298
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YWL+KNSWG WGE+GY R++R I +G CG+AM AS+P +
Sbjct: 299 YWLVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 160/330 (48%), Positives = 216/330 (65%), Gaps = 24/330 (7%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
A+ A D+ + + EQW AQY R YK+++E ++RFE+FK N+ +E FN A GN
Sbjct: 113 AAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIESFN--AGGNN 170
Query: 77 SYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIE 131
+ L +N+FADLT EF +++T G K SS++K T F Y+ + +P +++W
Sbjct: 171 KFWLGVNQFADLTNDEFRSTKTNKGLK----SSNMKIP-TGFRYENVSADALPTTIDWRT 225
Query: 132 KGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
KGAVTP+K QGQC AVAA EGI I +LVSL+EQ+LVDC + + GC GG
Sbjct: 226 KGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGL 285
Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV 244
MDDAFK+II+N G+T ++ Y Y + G C S + AA I YEDVP NDE +L+KAV
Sbjct: 286 MDDAFKFIIKNGGLTTESSYPYTA-ADGKCKS--GSNSAATIKGYEDVPANDEAALMKAV 342
Query: 245 ANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
ANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG + +G KYWL+KNSWG
Sbjct: 343 ANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTT 402
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE+GY R+++DI +G CG+AM S+P
Sbjct: 403 WGENGYLRMEKDISDKRGMCGLAMEPSYPT 432
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/314 (51%), Positives = 206/314 (65%), Gaps = 19/314 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++ E+W AQ+GR Y + E KR+ IFK+N+ +E FNN + +R Y L +NKFADLT
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGS--DRGYKLGVNKFADLT 58
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF A G+K SS L + + F Y++ S +P S++W GAVTPVK QG C
Sbjct: 59 NEEFRAMYHGYKR--QSSKLMS--SSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCW 114
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAA+EGI ++ L+SLSEQQLVDC N GC GG MD AF+YII+N G+T++
Sbjct: 115 AFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG--NKGCQGGLMDTAFQYIIRNGGLTSE 172
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y+G+ G C S KA AQIT YEDVP N+E +LL+AVA QPVSVA+D +
Sbjct: 173 DNYPYQGVD-GTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFR 231
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GVF G C T LNHGVTA+GYGT +G YWL+KNSWG WGE GY R+QR I +
Sbjct: 232 FYKSGVFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASE 291
Query: 320 GQCGIAMFASFPVS 333
G CG+AM AS+P S
Sbjct: 292 GLCGVAMDASYPTS 305
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 163/345 (47%), Positives = 220/345 (63%), Gaps = 25/345 (7%)
Query: 3 KYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
K ++ +L + C + R D+ ++ + EQW AQY R YK+++E ++RFE+FK N
Sbjct: 5 KASILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKAN 64
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEF--IASQTGFKMSDHSSSLKANGTPFLYK 119
+ +E FN A GN + L +N+FADLT EF I + GFK SS++K T F Y+
Sbjct: 65 VKFIESFN--AGGNNKFWLGVNQFADLTNDEFRSIKTNKGFK----SSNMKIP-TGFRYE 117
Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ V P +++W KGAVTP+K QGQC AVAA EGI I +LVSL+EQ+LV
Sbjct: 118 NVSVDALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELV 177
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + + GC GG MDDAFK+II N G+T ++ Y Y + G C S + AA I Y
Sbjct: 178 DCDVHGEDQGCEGGLMDDAFKFIINNGGLTTESSYPYTA-ADGKCKS--GSNSAATIKGY 234
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP NDE +L+KAVANQPVSVA+D + QFYS GV G C T L+HG+ A+GYG +
Sbjct: 235 EDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTS 294
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYWL+KNSWG WGE+GY R+++DI +G CG+AM S+P
Sbjct: 295 DGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 166/324 (51%), Positives = 209/324 (64%), Gaps = 20/324 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E +E+W++ + + + E KRF +FK N+ V FN ++ Y L+LNKFAD+T
Sbjct: 36 ELYERWRSHHTVS-RSLDEKDKRFNVFKANVHYVHNFNKK---DKPYKLKLNKFADMTNH 91
Query: 92 EFIASQTGFKMSDHSSSL---KANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-- 145
EF G K+ H S L +ANGT F+Y + + VPPSV+W +KGAVTPVK QG+C
Sbjct: 92 EFRHHYAGSKIKHHRSFLGASRANGT-FMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSC 150
Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
V AVEGIN IK N LVSLSEQ+LVDC T+ N GC GG MD AF++I + GI
Sbjct: 151 WAFSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
+ Y Y G CD K I YEDVPPNDE+SLLKAVANQPVSVAI AS
Sbjct: 210 EENYPYMA-EGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDF 268
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
QFYS GVF G C T L+HGV VGYGT+ +G KYW+++NSWG +WGE GY R+QR+ID
Sbjct: 269 QFYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAE 328
Query: 319 QGQCGIAMFASFPVSKESAQPSSA 342
+G CGIAM S+P+ S+ P+ +
Sbjct: 329 EGLCGIAMQPSYPIKTSSSNPTGS 352
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 168/360 (46%), Positives = 219/360 (60%), Gaps = 23/360 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFD-----EGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
K FL VVL +S ++ D E S+ + +E+W++ + + + KRF +
Sbjct: 4 KKFLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHH-TVSRSLGDKHKRFNV 62
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT 114
FK N++ V N ++ Y L+LNKFAD+T EF ++ G K++ H + NGT
Sbjct: 63 FKANMMHVHNTNKM---DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGT 119
Query: 115 PFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQ 167
K VP SV+W +KGAVT VK QG C V AVEGIN IK N+LVSLSEQ+
Sbjct: 120 FMYEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQE 179
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
LVDC T +N GC GG M+ AF++I Q GIT ++ Y Y G CD+ KA D A I
Sbjct: 180 LVDCDTEENA-GCNGGLMESAFQFIKQKGGITTESYYPYTAQD-GTCDASKANDLAVSID 237
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGT 285
+E+VP NDE +LLKAVANQPVSVAIDA S QFYS GVF G C T LNHGV VGYG
Sbjct: 238 GHENVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGA 297
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKS 345
+ +G YW+++NSWG +WGE GY R+QR+I + +G CGIAM AS+P+ S P+ S
Sbjct: 298 TVDGTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPIKNSSNNPTGPSSS 357
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 158/338 (46%), Positives = 217/338 (64%), Gaps = 23/338 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
K ++ +L ++ C + R D+ ++ + EQW QY R YK++ E ++RFE+FK N
Sbjct: 5 KASILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKAN 64
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYK 119
+ +E FN A GNR + L +N+FADLT EF A++T GFK S S T F Y+
Sbjct: 65 VKFIESFN--AGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVS-----TGFRYE 117
Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
+ V P +++W KGAVTP+K QGQC EGI I +L+SLSEQ+LVDC +
Sbjct: 118 NVSVDALPATIDWRTKGAVTPIKDQGQC-----EGIVKISTGKLISLSEQELVDCDVHGE 172
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MDDAFK+II+N G+T ++ Y Y + G C S + AA + +EDVP ND
Sbjct: 173 DQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-ADGKCKS--GSNSAATVKGFEDVPAND 229
Query: 237 EESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG + +G KYWL
Sbjct: 230 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWL 289
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+KNSWG WGE+GY R+++DI +G CG+AM S+P
Sbjct: 290 LKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 327
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 169/361 (46%), Positives = 221/361 (61%), Gaps = 29/361 (8%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFD-----EGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
K L VVL S ++ D E S+ + +E+W++ + + E KRF
Sbjct: 2 TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFN 60
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP- 115
+FK NL+ V N ++ Y L+LNKFAD+T EF ++ G K++ H GTP
Sbjct: 61 VFKANLMHVHNTNKM---DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHR---MFRGTPH 114
Query: 116 ----FLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSL 163
F+Y K VPPSV+W +KGAVT VK QGQC V AVEGIN IK N+LV+L
Sbjct: 115 ENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVAL 174
Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
SEQ+LVDC + N GC GG M+ AF++I Q GIT ++ Y Y+ G CD+ K D A
Sbjct: 175 SEQELVDC-DKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQE-GTCDASKVNDLA 232
Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAV 281
I +E+VP NDE++LLKAVANQPVSVAIDA S QFYS GVF G C T LNHGV V
Sbjct: 233 VSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIV 292
Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
GYGT+ +G YW+++NSWG +WGE GY R+QR+I + +G CGIAM S+P+ S P+
Sbjct: 293 GYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKNSSDNPTG 352
Query: 342 A 342
+
Sbjct: 353 S 353
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 161/348 (46%), Positives = 226/348 (64%), Gaps = 26/348 (7%)
Query: 2 AKYFLIVVLIISGSCAS-----QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
++ FL+++ I++G S A D+ ++AE+ E+W A YGR YK++AE ++RFE
Sbjct: 4 SRAFLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARRFE 63
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF 116
+FKDNL VE FN A + L +N+FADLT +EF A++ GFK S+ + T F
Sbjct: 64 VFKDNLAFVESFN--ADKKNKFWLGVNQFADLTTEEFKANK-GFK---PISAEEVPTTGF 117
Query: 117 LYKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQ 166
Y++ S +P +V+W KGAVTP+K QGQC AVAA+EGI + + LVSLSEQ
Sbjct: 118 KYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQ 177
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+LVDC T+ + GC GG+MD AF+++I+N G+ ++ Y Y+ + G C AA I
Sbjct: 178 ELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVD-GKCKG--GSKSAATI 234
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYG 284
+EDVPPN+E +L+KAVA+QPVSVA+DAS F YSGGV G C T L+HG+ A+GYG
Sbjct: 235 KGHEDVPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYG 294
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYW++KNSWG WGE + R+++DI QG CG+AM S+P
Sbjct: 295 VESDGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYPT 342
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 162/331 (48%), Positives = 207/331 (62%), Gaps = 18/331 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S + +E+W++ + + + KRF +FK N++ V N ++ Y L+LNKFA
Sbjct: 33 EESFWDLYERWRSHH-TVSRSLGDKHKRFNVFKANVMHVHNTNKM---DKPYKLKLNKFA 88
Query: 87 DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
D+T EF ++ G K++ H + + NGT K VPPSV+W + GAVT VK QGQ
Sbjct: 89 DMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVKDQGQ 148
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C V AVEGIN IK N+LVSLSEQ+LVDC T N GC GG M+ AF++I Q
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKG 207
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
GIT ++ Y Y G CD+ KA D A I +E+VP NDE +LLKAVANQPVSVAIDA
Sbjct: 208 GITTESNYPYTAQD-GTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAG 266
Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
S QFYS GVF G C T LNHGV VGYGT+ +G YW ++NSWG +WGE GY R+QR
Sbjct: 267 GSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRS 326
Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSADKS 345
I + +G CGIAM AS+P+ S P+ S
Sbjct: 327 ISKKEGLCGIAMMASYPIKNSSNNPTGPSSS 357
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 163/342 (47%), Positives = 213/342 (62%), Gaps = 18/342 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFE-QWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ V I S C S R D I +K +W ++GR Y + E + R+ +FK+N+
Sbjct: 8 IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLYK--- 119
+E N+ G R++ L +N+FADLT EF + TGFK +S SS + +PF Y+
Sbjct: 68 RIEHLNSIPAG-RTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPFRYQNVS 126
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
S +P SV+W +KGAVTP+K QG C AVAA+EG IK +L+SLSEQQLVDC
Sbjct: 127 SGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD 186
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
TND GC GG MD AF++I G+T ++ Y Y+G C+S K A IT YEDV
Sbjct: 187 TNDF--GCEGGLMDTAFEHIKATGGLTTESDYPYKG-EDATCNSKKTNPKATSITGYEDV 243
Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P NDE++L+KAVA+QPVSV I+ QFYS GVF G C T+L+H VTA+GYG S G
Sbjct: 244 PVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGS 303
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYW+IKNSWG WGE GY R+Q+D+ QG CG+AM AS+P
Sbjct: 304 KYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 163/342 (47%), Positives = 213/342 (62%), Gaps = 18/342 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFE-QWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ V I S C S R D I +K +W ++GR Y + E + R+ +FK+N+
Sbjct: 8 IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLYK--- 119
+E N+ G R++ L +N+FADLT EF + TGFK +S SS + +PF Y+
Sbjct: 68 RIEHLNSIPAG-RTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVS 126
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
S +P SV+W +KGAVTP+K QG C AVAA+EG IK +L+SLSEQQLVDC
Sbjct: 127 SGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD 186
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
TND GC GG MD AF++I G+T ++ Y Y+G C+S K A IT YEDV
Sbjct: 187 TNDF--GCEGGLMDTAFEHIKATGGLTTESNYPYKG-EDATCNSKKTNPKATSITGYEDV 243
Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P NDE++L+KAVA+QPVSV I+ QFYS GVF G C T+L+H VTA+GYG S G
Sbjct: 244 PVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGS 303
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYW+IKNSWG WGE GY R+Q+D+ QG CG+AM AS+P
Sbjct: 304 KYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 303 bits (777), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 154/336 (45%), Positives = 219/336 (65%), Gaps = 17/336 (5%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
+K FL+ +L + C+S R + ++ E+ E W +YGR YK++AE ++RF++FKDN
Sbjct: 4 SKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDN 63
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+ VE FN N + L +N+FADLT +EF A++ GFK ++ K T F Y++
Sbjct: 64 VAFVESFNTNK--NNKFWLGVNQFADLTTEEFKANK-GFK----PTAEKVPTTGFKYENL 116
Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
S +P +V+W KGAVTP+K QGQCA A+EGI + L+SLSEQ+LVDC T+ +
Sbjct: 117 SVSALPTAVDWRTKGAVTPIKNQGQCA--AMEGIVKLSTGNLISLSEQELVDCDTHSMDE 174
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG+MD AF+++I+N G+ ++ Y Y+ + G C AA I +EDVP N+E
Sbjct: 175 GCEGGWMDSAFEFVIKNGGLATESNYPYKAVD-GKCKG--GSKSAATIKGHEDVPVNNEA 231
Query: 239 SLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L+KAVANQPVSVA+DAS F YSGGV G C T L+HG+ A+GYG +G KYW++K
Sbjct: 232 ALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILK 291
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
NSWG WGE G+ R+++DI +G CG+AM S+P
Sbjct: 292 NSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 327
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 303 bits (777), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 155/305 (50%), Positives = 208/305 (68%), Gaps = 19/305 (6%)
Query: 39 AQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT 98
A+YGR YK++ E KRF+IFKDN+ +E FN A +++Y L +N+FADLT +EF + +
Sbjct: 2 ARYGRMYKDANEKEKRFKIFKDNVARIESFNKAM--DKTYKLSINEFADLTNEEFRSLRN 59
Query: 99 GFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVE 150
FK + + + T F Y++ + VP +++W +KGAVTP+K Q QC AVAA E
Sbjct: 60 RFK-----AHICSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114
Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
GI I +L+SLSEQ+LVDC T N GC GG MDDAF++I + G+ ++A Y YEG
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATYPYEG-D 172
Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNG 268
G C+S K AA+I YEDVP N+E++L KAVA+QPV+VAIDA QFY+ GVF G
Sbjct: 173 DGTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTG 232
Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
C T L+HGV AVGYG ++G+ YWL+KNSWG WGE+GY R+QRD+ +G CGIAM A
Sbjct: 233 QCGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQA 292
Query: 329 SFPVS 333
S+P +
Sbjct: 293 SYPTA 297
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 175/340 (51%), Positives = 220/340 (64%), Gaps = 17/340 (5%)
Query: 6 LIVVLIISGSCASQATYRT-FDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+I + + +CA A RT +DE S +A+ +QW QYGR+Y AE KRF+IF +NL
Sbjct: 7 IIALCTMLWACAYTAMSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMENL 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM-SDHSSSLKANGTPFLYKSS 121
+E+FNNA GN+SY L LN+F+DLT +EFIAS TG + SS +P S
Sbjct: 67 EYIEKFNNAP-GNKSYKLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSKRASPASLDLS 125
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
P S++W E+GAVT VK QG C AVAAVEGI IK L+SLSEQQLVDCA+N
Sbjct: 126 DTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASN 185
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ N GC GGFMD+AF YI +N GI ++ Y Y G G C + + AA+I+ YEDVP
Sbjct: 186 EQNQGCGGGFMDNAFSYITEN-GIASENDYQYRG-GAGTCQNNEMITPAARISGYEDVPA 243
Query: 235 NDEESLLKAVANQPVSVAID-ASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE-GIKY 292
E+ LL AV+ QPVSVAI + Y G+++G C + LNHGVT VGYGTSEE G KY
Sbjct: 244 G-EDQLLLAVSQQPVSVAIAVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WLIKNSWG+ WGE+GY RL R+ Q +G CGIA+ AS P
Sbjct: 303 WLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHPT 342
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 165/341 (48%), Positives = 211/341 (61%), Gaps = 18/341 (5%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFE-QWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+ V I S S + R D I +K +W ++GR Y + E S R+ +FK N+
Sbjct: 9 FLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSNVER 68
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLYK---S 120
+E NN G R++ L +N+FADLT EF + TGFK +S SS + T F Y+ S
Sbjct: 69 IEHLNNIPAG-RTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSS 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P SV+W KGAVTP+K QG C AVAA+EG IK +L+SLSEQQLVDC T
Sbjct: 128 GALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
ND GC GG MD AF++I+ G+T ++ Y Y+G C+S K A IT YEDVP
Sbjct: 188 NDF--GCEGGLMDTAFEHIMATGGLTTESNYPYKG-EDATCNSKKTNPKATSITGYEDVP 244
Query: 234 PNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
NDE++L+KAVA+QPVSV I+ QFYS GVF G C T+L+H VTA+GYG S G K
Sbjct: 245 VNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGQSTNGSK 304
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+IKNSWG WGE GY R+Q+DI QG CG+AM AS+P
Sbjct: 305 YWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYPT 345
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 155/346 (44%), Positives = 220/346 (63%), Gaps = 22/346 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
MAK L +L C++ R D+ ++A + E+W AQYGR Y++ AE ++RFE+FK
Sbjct: 3 MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ +E FN GN ++ L +N+FADLT EF ++T ++ + T F Y+
Sbjct: 63 ANVAFIESFN---AGNHNFWLGVNQFADLTNDEFRWTKTNKGFIPSTTRVP---TGFRYE 116
Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ + P +V+W KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LV
Sbjct: 117 NVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + + GC GG MDDAFK+II+N G+T ++ Y Y + C S+ + A I Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAA-ADDKCKSV--SNSVASIKGY 233
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP N+E +L+KAVANQPVSVA+D + QFY GGV G C T L+HG+ A+GYG +
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKAS 293
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G KYWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 294 DGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 169/361 (46%), Positives = 222/361 (61%), Gaps = 29/361 (8%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFD-----EGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
K L VVL S ++ D E S+ + +E+W++ + + E KRF
Sbjct: 3 TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFN 61
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP- 115
+FK NL+ V N ++ Y L+LNKFAD+T EF ++ G K+ +H + GTP
Sbjct: 62 VFKANLMHVHNTNKM---DKPYKLKLNKFADMTNHEFRSTYAGSKV-NHPRMFR--GTPH 115
Query: 116 ----FLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSL 163
F+Y K VPPSV+W +KGAVT VK QGQC V AVEGIN IK N+LV+L
Sbjct: 116 ENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVAL 175
Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
SEQ+LVDC + N GC GG M+ AF++I Q GIT ++ Y Y+ G CD+ K D A
Sbjct: 176 SEQELVDC-DKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQE-GTCDASKVNDLA 233
Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAV 281
I +E+VP NDE++LLKAVANQPVSVAIDA S QFYS GVF G C T LNHGV V
Sbjct: 234 VSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIV 293
Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
GYGT+ +G YW+++NSWG +WGE GY R+QR+I + +G CGIAM S+P+ S P+
Sbjct: 294 GYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKNSSDNPTG 353
Query: 342 A 342
+
Sbjct: 354 S 354
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 157/338 (46%), Positives = 219/338 (64%), Gaps = 23/338 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
K ++ +L ++ C + R D+ ++ + EQW QY R YK++ E ++RFE+FK N
Sbjct: 5 KASILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKAN 64
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYK 119
+ +E FN A GNR + L +N+FADLT EF A++T GFK S +K T F Y+
Sbjct: 65 VKFIESFN--AGGNRKFWLGVNQFADLTNDEFRATKTNKGFK----PSPVKVP-TGFRYE 117
Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
+ V P +++W KGAVTP+K QGQC EGI I +L+SLSEQ+LVDC +
Sbjct: 118 NVSVDALPATIDWRTKGAVTPIKDQGQC-----EGIVKISTGKLISLSEQELVDCDVHGE 172
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MDDAF++II+N G+T ++ Y Y + G C S + AA + +EDVP ND
Sbjct: 173 DQGCEGGLMDDAFQFIIKNGGLTTESSYPYTA-ADGKCKS--GSNSAATVKGFEDVPAND 229
Query: 237 EESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG + +G KYWL
Sbjct: 230 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWL 289
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+KNSWG WGE+GY R+++DI +G CG+AM S+P+
Sbjct: 290 LKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 167/358 (46%), Positives = 215/358 (60%), Gaps = 23/358 (6%)
Query: 1 MAKYFLIV----VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
M K FL++ +++ G E E +E+W++ + + + E KRF
Sbjct: 1 MKKLFLVLFTLALVLRLGESFDFHEKELETEEKFWELYERWRSHHTVS-RSLDEKHKRFN 59
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL---KANG 113
+FK N+ V FN ++ Y L+LNKFAD+T EF G K+ H + L +ANG
Sbjct: 60 VFKANVHYVHNFNKK---DKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGASRANG 116
Query: 114 TPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
T VPPS++W +KGAVTPVK QGQC V AVEGIN IK +LVSLSEQ
Sbjct: 117 TFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQ 176
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+LVDC T +N GC GG MD AF +I + GIT + Y Y+ CD K I
Sbjct: 177 ELVDCDTTENQ-GCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDK-CDIQKRNTPVVSI 234
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYG 284
+EDVPPNDE++LLKAVANQP+SVAIDAS QFYS GVF G C T L+HGV VGYG
Sbjct: 235 DGHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYG 294
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
T+ +G KYW++KNSWG WGE GY R+QR +D +G CGIAM S+P+ K S+ P+ +
Sbjct: 295 TTVDGTKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPI-KTSSNPTGS 351
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 219/343 (63%), Gaps = 23/343 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
++ L +VL++ C SQ R E S ++E+ EQW +YG+ YK++AE KR IFKD
Sbjct: 8 QHILALVLLLP-ICISQVMSRNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKD 66
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
N+ +E FN A GN+ Y L +N D T +EF+AS G+K S TPF Y++
Sbjct: 67 NVEFIESFN--AAGNKPYKLSINHLTDQTNEEFVASHNGYKHKGSHSQ-----TPFKYEN 119
Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
+ VP +V+W E GAV +K QGQC VA EGI I + L+SLSEQ+LVDC
Sbjct: 120 ITGVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCD 179
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ D+ GC GG+M+ F++I +N GI+++A Y Y + G D+ K AAQI YE V
Sbjct: 180 SVDH--GCDGGYMEGGFEFIXKNGGISSEANYPYTAVD-GTYDANKEASPAAQIKGYETV 236
Query: 233 PPNDEESLLKAVANQPVSVAID--ASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P N E++L KAVANQPVSV ID SA QF S GVF G C T L+HGVTAVGYG++++G
Sbjct: 237 PANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDDGT 296
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+YW++KNSWG WGE+GY R+QR D +G CGIAM AS+P +
Sbjct: 297 QYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 159/328 (48%), Positives = 208/328 (63%), Gaps = 18/328 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ + +E+W++ + + E KRF +FK N++ V N ++ Y L+LNKFA
Sbjct: 33 EESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKM---DKPYKLKLNKFA 88
Query: 87 DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
D+T EF ++ G K++ H S +GT K VP SV+W +KGAVT VK QGQ
Sbjct: 89 DMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQ 148
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + AVEGIN IK N+LVSLSEQ+LVDC + N GC GG M+ AF++I Q
Sbjct: 149 CGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDC-DKEENQGCNGGLMESAFEFIKQKG 207
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
GIT ++ Y Y+ G CD K D A I +E+VP NDE +LLKAVANQPVSVAIDA
Sbjct: 208 GITTESNYPYKAQE-GTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
S QFYS GVF G C T LNHGV VGYGT+ +G YW+++NSWG +WGE GY R+QR+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSA 342
I + +G CGIAM AS+P+ S P+ +
Sbjct: 327 ISKKEGLCGIAMMASYPIKNSSDNPTGS 354
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/346 (44%), Positives = 219/346 (63%), Gaps = 22/346 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
MAK L +L C++ R D+ ++A + E+W AQYGR YK+ AE ++RFE+FK
Sbjct: 3 MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ +E FN GN + L +N+FADLT EF +++T ++ + T F Y+
Sbjct: 63 ANVAFIESFN---AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVP---TGFRYE 116
Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ + P +++W KG VTP+K QGQC AVAA+EGI + +L+SLSEQ+LV
Sbjct: 117 NVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + + GC GG MDDAFK+II+N G+T ++ Y Y + C S+ + A I Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAA-ADDKCKSV--SNSVASIKGY 233
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP N+E +L+KAVANQPVSVA+D + QFY GGV G C T L+HG+ A+GYG +
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKAS 293
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G KYWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 294 DGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 168/350 (48%), Positives = 221/350 (63%), Gaps = 20/350 (5%)
Query: 1 MAKYFLIVVLIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ + ++ I S AT R E S EK EQW A++ R Y + +E RF IFK
Sbjct: 1 MSSTIIFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFK 60
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT-P 115
NL V+ FN N +Y L +N+F+DLT +EF A+ TG + + S+L ++ T P
Sbjct: 61 KNLEFVQSFNMNK--NITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVP 118
Query: 116 FLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
F Y + S S++W ++GAVTPVKYQG+C AVAAVEGI I LVSLSEQQ
Sbjct: 119 FRYGNVSDTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQ 178
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED--HAAQ 225
L+DC T D N GC+GG M AF+YII+N+GIT + Y Y+ S AA
Sbjct: 179 LLDCDT-DYNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAAT 237
Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGY 283
I+ YE VP N+EE+LL+AV+ QPVSV I+ + F YSGG+FNG C T L+H VT VGY
Sbjct: 238 ISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGY 297
Query: 284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
G SEEG KYW++KNSWG+ WGEDG+ R++RD+D PQG CG+AM A +P++
Sbjct: 298 GMSEEGTKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYPLA 347
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 157/343 (45%), Positives = 217/343 (63%), Gaps = 21/343 (6%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
+K FL+ +L + C+S R + ++ E+ E W +YGR YK++AE ++RFE FK N
Sbjct: 4 SKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHN 63
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+ VE FN + L +N+FADLT +EF A++ GFK S+ T F Y++
Sbjct: 64 VAFVESFNTNK--KNKFWLGVNQFADLTTEEFKANK-GFK---PISAEMVPTTGFKYENL 117
Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
S +P +V+W KGAVTP+K QGQC AVAA+EGI + L+SLSEQ+LVDC
Sbjct: 118 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDC 177
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
T+ + GC GG+MD AF+++I+N G+ ++ Y Y+ + G C AA I +ED
Sbjct: 178 DTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVD-GKCKG--GSKSAATIKGHED 234
Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP NDE +L+KAVANQPVSVA+DAS F YSGGV G C T L+HG+ A+GYG +G
Sbjct: 235 VPVNDEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDG 294
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYW++KNSWG WGE G+ R+++DI QG CG+AM S+P
Sbjct: 295 TKYWILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYPT 337
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/341 (45%), Positives = 213/341 (62%), Gaps = 19/341 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FLI +L + + ++ A D+ S+ + EQW A+YGR Y + AE ++R E+FK N+
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---S 121
+E N GN ++L N+FAD+T EF A+ TG+K + K T F Y +
Sbjct: 142 IELVN---AGNDKFSLEANQFADMTVDEFRAAHTGYKPVPAN---KGRTTQFKYANVSLD 195
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P S++W KGAVTP+K QGQC VA+VEGI + +L+SLSEQ+LVDC +
Sbjct: 196 ALPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVD 255
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MD+AF++II N G+T + Y Y G C+S K + A I YEDVP
Sbjct: 256 GMDQGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDS-CNSNKESNDVASIKGYEDVPS 314
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
NDE SLLKAVA QPVS+A+D + +FY GGV +G C T L+HG+ AVGYG + +G K+
Sbjct: 315 NDETSLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKF 374
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE G+ R++RDI +G CG+AM S+P +
Sbjct: 375 WLMKNSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 163/329 (49%), Positives = 214/329 (65%), Gaps = 16/329 (4%)
Query: 16 CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
C SQ R + S+ E+ EQW +YG+ YK+SAE KRF IF++N+ +E FN A GN
Sbjct: 20 CTSQVKSRKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFN--AAGN 77
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGA 134
+ Y L +N AD T +EF+AS G+K S TPF Y++ + +P +V+W +KG
Sbjct: 78 KPYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGD 137
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
VT +K Q QC AVAA EGI I LVSLSE++LVDC + D+ GC GG M+
Sbjct: 138 VTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSVDH--GCDGGLMEH 195
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
F++II+N GI+++A Y Y ++ G CD+ K AQIT YE VP N EE L KAVANQ
Sbjct: 196 GFEFIIKNGGISSEANYPYTAVN-GTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQ 254
Query: 248 -PVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
+SV+IDA SA QFY GVF G C T L+HGVTAVGYG+++ G +YW++KNSWG WG
Sbjct: 255 LTMSVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWG 314
Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
E+GY R+ R ID +G CGIAM AS+P +
Sbjct: 315 EEGYIRMLRGIDAQEGLCGIAMDASYPTA 343
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 155/346 (44%), Positives = 219/346 (63%), Gaps = 22/346 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
MAK L +L C++ R D+ ++A + E+W AQYGR Y++ AE ++RFE+FK
Sbjct: 3 MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ +E FN GN ++ L +N+FADLT EF +T ++ + T F Y+
Sbjct: 63 ANVAFIESFN---AGNHNFWLGVNQFADLTNDEFRWMKTNKGFIPSTTRVP---TGFRYE 116
Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ + P +V+W KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+LV
Sbjct: 117 NVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + + GC GG MDDAFK+II+N G+T ++ Y Y + C S+ + A I Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAA-ADDKCKSV--SNSVASIKGY 233
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP N+E +L+KAVANQPVSVA+D + QFY GGV G C T L+HG+ A+GYG +
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKAS 293
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G KYWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 294 DGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 159/328 (48%), Positives = 207/328 (63%), Gaps = 18/328 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ + +E+W++ + + E KRF +FK N++ V N ++ Y L+LNKFA
Sbjct: 33 EESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKM---DKPYKLKLNKFA 88
Query: 87 DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
D+T EF ++ G K++ H S +GT K VP SV+W +KGAVT VK QGQ
Sbjct: 89 DMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQ 148
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + AVEGIN IK N+LVSLSEQ+LVDC + N GC GG M+ AF++I Q
Sbjct: 149 CGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDC-DKEENQGCNGGLMESAFEFIKQKG 207
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
GIT ++ Y Y G CD K D A I +E+VP NDE +LLKAVANQPVSVAIDA
Sbjct: 208 GITTESNYPYTAQE-GTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
S QFYS GVF G C T LNHGV VGYGT+ +G YW+++NSWG +WGE GY R+QR+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSA 342
I + +G CGIAM AS+P+ S P+ +
Sbjct: 327 ISKKEGLCGIAMMASYPIKNSSDNPTGS 354
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 301 bits (770), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 158/328 (48%), Positives = 208/328 (63%), Gaps = 18/328 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ + +E+W++ + + E KRF +FK+N++ V N ++ Y L+LNKFA
Sbjct: 33 EESLWDLYERWRSHH-TVSRSLTEKHKRFNVFKENVMHVHNTNKM---DKPYKLKLNKFA 88
Query: 87 DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
D+T EF ++ G K++ H + NGT K VP SV+W +KGAVT VK QGQ
Sbjct: 89 DMTNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQ 148
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C V AVEGIN IK ++LVSLSEQ+LVDC + N GC GG M+ AF++I Q
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDC-DKEENQGCNGGLMESAFEFIKQKG 207
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
GIT ++ Y Y G CD+ K D A I +E+VP NDE +LLKAVANQPVSVAIDA
Sbjct: 208 GITTESNYPYTAQE-GTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
S QFYS GV G C T LNHGV VGYGT+ +G YW+++NSWG +WGE GY R+QR+
Sbjct: 267 GSDFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSA 342
I + +G CGIAM AS+P+ S P+ +
Sbjct: 327 ISKKEGLCGIAMMASYPIKNSSDNPTGS 354
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 165/324 (50%), Positives = 203/324 (62%), Gaps = 18/324 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E+W++ + + E KRF +FK N + V +NA ++ Y L+LNKFAD+T EF
Sbjct: 38 YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHV---HNANKMDKPYKLKLNKFADMTNHEF 93
Query: 94 IASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
+ +G K+ H + NGT K VP SV+W +KGAVT VK QGQC
Sbjct: 94 RNTYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAF 153
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
+ AVEGIN IK N+LVSLSEQ+LVDC T D N GC GG MD AF++I Q GIT +A
Sbjct: 154 STIVAVEGINQIKTNKLVSLSEQELVDCDT-DQNQGCNGGLMDYAFEFIKQRGGITTEAN 212
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFY 261
Y YE G CD K A I +E+VP NDE +LLKAVANQPVSVAIDA S QFY
Sbjct: 213 YPYEAYD-GTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFY 271
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
S GVF G C T L+HGV VGYGT+ +G KYW +KNSWG +WGE GY R++R I +G
Sbjct: 272 SEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGL 331
Query: 322 CGIAMFASFPVSKESAQPSSADKS 345
CGIAM AS+P+ K S PS S
Sbjct: 332 CGIAMEASYPIKKSSNNPSGIKSS 355
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 223/345 (64%), Gaps = 23/345 (6%)
Query: 6 LIVVLII--SGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
L+ VLII +G SQAT RT F E S+ +K EQW A++ R Y++ E + R ++FK N
Sbjct: 7 LVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKN 66
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK----MSDHSSSLKANGTPFL 117
L +E FN GN+SY L +N+FAD T +EF+A TG K +S K +
Sbjct: 67 LKFIENFNKK--GNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTW 124
Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
S V S +W +GAVTPVKYQGQC AVAAVEG+ I LVSLSEQQL+D
Sbjct: 125 NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLD 184
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C + + GC GG M DAF Y++QN+GI ++ YSY+G S G C S AA+I+ ++
Sbjct: 185 C-DREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQG-SDGGCRS--NARPAARISGFQ 240
Query: 231 DVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEE 288
VP N+E +LL+AV+ QPVSV++DA+ F YSGGV++G C T NH VT VGYGTS++
Sbjct: 241 TVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQD 300
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
G KYWL KNSWG+ WGE GY R++RD+ PQG CG+A +A +PV+
Sbjct: 301 GTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 159/321 (49%), Positives = 210/321 (65%), Gaps = 19/321 (5%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
+ ++ S+ E+ EQW ++YG+ YK++ E KRF IFKDN+ +E FN A N+ Y L +
Sbjct: 29 KLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFN--AADNKPYKLSV 86
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
N ADLT EF AS+ G+K D + T F Y++ + +P +V+W KGAVTP+K Q
Sbjct: 87 NHLADLTLDEFKASRNGYKKIDREFAT----TSFKYENVTAIPEAVDWRVKGAVTPIKDQ 142
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC VAA+EGIN I +L+SLSEQ+LVDC T + GC GG M+D F++II+
Sbjct: 143 GQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIK 202
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N GIT++ Y Y+ + G C S A+IT YE VP N E SLLKAVANQP+SV+ID
Sbjct: 203 NGGITSETNYPYKA-ADGSC-SAATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSID 260
Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A S+ FYS G++ G C T L+HGVTAVGYG S G YW++KNSWG WGE GY R+Q
Sbjct: 261 ASDSSFMFYSSGIYTGECGTELDHGVTAVGYG-SANGTDYWIVKNSWGTVWGEKGYIRMQ 319
Query: 313 RDIDQPQGQCGIAMFASFPVS 333
R I +G CGIAM +S+P +
Sbjct: 320 RGIADKEGLCGIAMDSSYPTA 340
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 156/320 (48%), Positives = 200/320 (62%), Gaps = 18/320 (5%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
D+ IA + EQW A+YGR Y + AE ++R E+FK N+ +E N GN + L N+F
Sbjct: 25 DDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVGFIESVN---AGNHKFWLEANQF 81
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQG 142
AD+T EF A G+KM S KA T F Y + + P SV+W GAVTPVK QG
Sbjct: 82 ADITKDEFRAMHKGYKMQVIGS--KARATGFRYANVSIDDLPASVDWRANGAVTPVKDQG 139
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC VA++EGI + +L+SLSEQ+LVDC N GC GG MD+AF++I+ N
Sbjct: 140 QCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNN 199
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+ +A Y Y G + G C+S K + AA I YEDVP NDE SL KAVA QPVS+A+D
Sbjct: 200 GGLDTEADYPYTG-ADGTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQPVSIAVDG 258
Query: 256 S--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
+FY GGV G C T L+HGV AVGYG + +G KYWL+KNSWG WGEDG+ RL+R
Sbjct: 259 GDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGEDGFIRLER 318
Query: 314 DIDQPQGQCGIAMFASFPVS 333
D+ G CG+AM S+P +
Sbjct: 319 DVADEAGMCGLAMKPSYPTA 338
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 158/321 (49%), Positives = 211/321 (65%), Gaps = 19/321 (5%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
+ ++ S+ E+ EQW ++YG+ YK++ E KRF IFKDN+ +E FN A N+ Y L +
Sbjct: 29 KLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFN--AADNKPYKLSV 86
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
N ADLT EF AS+ G+K D + T F Y++ + +P +V+W KGAVTP+K Q
Sbjct: 87 NHLADLTLDEFKASRNGYKKIDREFAT----TSFKYENVTAIPEAVDWRVKGAVTPIKDQ 142
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC VAA+EGIN I +L+SLSEQ+LVDC T + GC GG M+D F++II+
Sbjct: 143 GQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIK 202
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N GIT++ Y Y+ + G C++ A+IT YE VP N E SLLKAVANQP+SV+ID
Sbjct: 203 NGGITSETNYPYKA-ADGSCNTATTAP-VAKITGYEKVPVNSEISLLKAVANQPISVSID 260
Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A S+ FYS G++ G C T L+HGVTAVGYG S G YW++KNSWG WGE GY R+Q
Sbjct: 261 ASDSSFMFYSSGIYTGECGTELDHGVTAVGYG-SANGTDYWIVKNSWGTVWGEKGYIRMQ 319
Query: 313 RDIDQPQGQCGIAMFASFPVS 333
R I +G CGIAM +S+P +
Sbjct: 320 RGIADKEGLCGIAMDSSYPTA 340
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 161/333 (48%), Positives = 209/333 (62%), Gaps = 21/333 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E ++ + +E+W+ + + E +RF +FK N++ V N ++ Y L+LNKFA
Sbjct: 33 EDNLWDMYERWRHKVATNH---GEKLRRFNVFKSNVLHVHETNKM---DKPYKLKLNKFA 86
Query: 87 DLTPQEFIASQTGFKMSDHSSSL---KANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQG 142
D+T EF + G K+ H SL ++ F+Y + + VP SV+W +KGAV PVK QG
Sbjct: 87 DMTNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPVKDQG 146
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC VAAVEGIN IK N LVSLSEQ+LVDC T +N GC GG MD AF +I +
Sbjct: 147 QCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQ-GCNGGLMDLAFDFIKKT 205
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+T + Y Y G CDS K I +EDVP NDE+SL+KAVANQPV+VAIDA
Sbjct: 206 GGLTREDAYPY-AAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDA 264
Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S QFYS GVF G C T L+HGV AVGYGT+ +G KYW+++NSWG +WGE GY R++R
Sbjct: 265 GSSDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRMER 324
Query: 314 DIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
I +G CGIAM AS+P+ S P S+ SS
Sbjct: 325 GISDKRGLCGIAMEASYPIKNSSNNPKSSPTSS 357
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 210/332 (63%), Gaps = 20/332 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESA-ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
E S+ + +E+W++ + T S E KRF +FK+N++ V + N + Y L+LNKF
Sbjct: 33 EESLWDLYERWRSHH--TVSTSLDEKHKRFNVFKENVMHVHKTNKMG---KPYKLKLNKF 87
Query: 86 ADLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
AD+T EF + G K+ H + + NG+ K +VP SV+W +KGAVT VK QG
Sbjct: 88 ADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYGKVEKVPTSVDWRKKGAVTAVKDQG 147
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC + AVEGIN IK N LVSLSEQ+LVDC T +N GC GG M+ AF++I +
Sbjct: 148 QCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDCDTTENQ-GCNGGLMEYAFEFIKKK 206
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
+GIT ++ Y Y+ G CD+ K + A I YE VP NDE++LLKA ANQPVSVAIDA
Sbjct: 207 RGITTESTYPYKA-EDGHCDAAKENNPAVSIDGYEKVPENDEDALLKAAANQPVSVAIDA 265
Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S QFYS GVF G C T L+HGV VGYGT+ +G KYW+++NSWG +WGE GY R+QR
Sbjct: 266 GGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQR 325
Query: 314 DIDQPQGQCGIAMFASFPVSKESAQPSSADKS 345
I +G CGIAM AS+P+ S PS S
Sbjct: 326 GISDKEGLCGIAMEASYPIKNSSTNPSGTKSS 357
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 218/343 (63%), Gaps = 22/343 (6%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
+K FL+ +L + C+S R + ++ E+ E W +YGR YK++AE ++RFE FK N
Sbjct: 4 SKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHN 63
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+ VE FN + L +N+FADLT +EF A++ GFK ++ K T F Y++
Sbjct: 64 VAFVESFNTNK--KNKFWLGVNQFADLTTEEFKANK-GFK----PTAEKVPTTGFKYENL 116
Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
S +P +V+W KGAVTP+K QGQC AVAA+EGI + L+SLSEQ+LVDC
Sbjct: 117 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDC 176
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
T+ + GC GG+MD AF+++I+N G+ ++ Y Y+ + G C AA I +ED
Sbjct: 177 DTHSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVD-GKCKG--GSKSAATIKGHED 233
Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L+KAVANQPVSVA+DAS F YSGGV G C T L+HG+ A+GYG +G
Sbjct: 234 VPVNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDG 293
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYW++KNSWG WGE G+ R+++DI +G CG+AM S+P
Sbjct: 294 TKYWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 336
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/274 (55%), Positives = 193/274 (70%), Gaps = 13/274 (4%)
Query: 70 NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVN 128
N+ + N+ Y L +NKFADLT +EF AS+ FK SS ++ T F Y+ +S +P +V+
Sbjct: 2 NSNVNNKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRT--TTFKYENASAIPSTVD 59
Query: 129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
W +KGAVTPVK QGQC AVAA EGI+ + +LVSLSEQ+L+DC T + GC
Sbjct: 60 WRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCE 119
Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
GG MDDAFK+IIQN G++ + Y YEG+ G C++ +A HA IT YEDVP N+E +L
Sbjct: 120 GGLMDDAFKFIIQNHGLSTEVQYPYEGVD-GTCNTNEASIHAVTITGYEDVPANNELALQ 178
Query: 242 KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSW 299
KAVANQP+SVAIDAS QFY+ GVF G C T L+HGVTAVGYG +G KYWL+KNSW
Sbjct: 179 KAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSW 238
Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
G DWGE+GY R+QR ID +G CGIAM AS+P +
Sbjct: 239 GADWGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 213/343 (62%), Gaps = 20/343 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIA--EKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
FLIV LI S C S R D+ + ++ ++W A++GR Y + E + R+ +FK N+
Sbjct: 9 FLIVSLI-SSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP-FLYK-- 119
+ER NN G R++ L +N+FADLT EF + TG+K SS T F Y+
Sbjct: 68 ERIERLNNVPAG-RTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNV 126
Query: 120 -SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
S +P SV+W +KGAVTP+K QG C AVAA+EG IK +L+SLSEQQLVDC
Sbjct: 127 SSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDC 186
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
TND GC GG MD AF++I+ G+T ++ Y Y+G C + A IT YED
Sbjct: 187 DTNDF--GCSGGLMDTAFEHIMATGGLTTESNYPYKG-KDATCKIKNTKPTATSITGYED 243
Query: 232 VPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP NDE++L+KAVA+QPVS+ I+ QFY GVF G C T+L+H VTAVGYG S G
Sbjct: 244 VPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYGQSSNG 303
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYW+IKNSWG WGE GY R+++D+ +G CG+AM AS+P
Sbjct: 304 SKYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYPT 346
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 161/331 (48%), Positives = 207/331 (62%), Gaps = 18/331 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ +E+W++ + + E KRF +FK+N+ V FN + Y L+LNKFA
Sbjct: 31 EESLWNLYERWRSHH-TVSRSLDEKHKRFNVFKENVNFVHEFNKK---DEPYKLKLNKFA 86
Query: 87 DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
D+T EF ++ G K++ H S A G+ K VPPSV+W +KGAVTP+K QGQ
Sbjct: 87 DMTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQ 146
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C V AVEGIN IK N+LVSLSEQ+LVDC T++N GC GG M AF++I +
Sbjct: 147 CGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQ-GCNGGLMGYAFEFIKEKG 205
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
GIT + Y Y G CD K I +E VPPN+E++LLKA ANQP+SVAIDA
Sbjct: 206 GITTEQSYPYTA-EDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAG 264
Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
SA QFYS GVF G C T L+HGV VGYGT+ +G KYW++KNSWG DWGE+GY R++R
Sbjct: 265 GSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRG 324
Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSADKS 345
I +G CGIA+ AS+P+ S P A S
Sbjct: 325 ISAKEGLCGIAVEASYPIKNSSTNPVGAPSS 355
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 163/325 (50%), Positives = 203/325 (62%), Gaps = 22/325 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E +E+W++ + + E KRF +FK N+ V FN ++ Y L+LNKFAD+T
Sbjct: 36 ELYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKK---DKPYKLKLNKFADMTNH 91
Query: 92 EFIASQTGFKMSDHSSSL---KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
EF G K+ H + L +ANGT VPP+V+W +KGAVTPVK QG+C
Sbjct: 92 EFRHHYAGSKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCW 151
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
V AVEGIN IK N LVSLSEQ+LVDC T+ N GC GG MD AF++I + GI +
Sbjct: 152 AFSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGGINTE 210
Query: 202 AVYSY--EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
Y Y EG G CD K I +EDVPPNDE SLLKAVANQPVSVAI AS
Sbjct: 211 ENYPYMAEG---GECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSD 267
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
QFYS GVF G C T L+HGV VGYGT+ + KYW++KNSWG +WGE GY R+QR+ID
Sbjct: 268 FQFYSEGVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDA 327
Query: 318 PQGQCGIAMFASFPVSKESAQPSSA 342
+G CGIAM S+P+ S+ P+ +
Sbjct: 328 EEGLCGIAMQPSYPIKTSSSNPTGS 352
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 157/347 (45%), Positives = 212/347 (61%), Gaps = 24/347 (6%)
Query: 1 MAKYFLIVVLIISGSCASQA-TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+ K L+ ++ C+S + R + ++ E+ EQW A++ R YK+ E ++RFE+FK
Sbjct: 3 IPKALLLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
N+ +E FN NR + L +N+F DLT EF A++T G KMS + T F
Sbjct: 63 ANVAFIESFNAE---NRKFWLGVNQFTDLTNDEFRATKTNKGLKMSGGRAP-----TGFK 114
Query: 118 YKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
Y + + P +V+W KG VTP+K QGQC AV A EGI + +L+SLSEQ+
Sbjct: 115 YSNVSIDALPTAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQE 174
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
LVDC + + GC GG MDDAFK+II+N G+T +A Y Y G C + A + A I
Sbjct: 175 LVDCDVHGVDQGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQD-GQCKTSIASNSVATIK 233
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
YEDVP NDE SL+KAVANQPVSVA+D + Q YSGGV G C T L+HG+ A+GYG
Sbjct: 234 GYEDVPANDESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGM 293
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ +G KYWL+KNSWG WGE GY R+++DI G CG+AM S+P
Sbjct: 294 TSDGTKYWLLKNSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYPT 340
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 165/338 (48%), Positives = 217/338 (64%), Gaps = 17/338 (5%)
Query: 7 IVVLIISGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
I ++ + SQAT RT F E S EK EQW A++ R Y++ E R ++FK NL
Sbjct: 10 IFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNLKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
+E FN GN+SY L +N+FAD T +EF+A TG K + + S V
Sbjct: 70 IENFNKK--GNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSWNISDMVG 127
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
S +W +GAVTPVKYQGQC AVAAVEG+ I LVSLSEQQL+DC + +
Sbjct: 128 VSKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDC-DREYD 186
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG M DAF YIIQN+GI ++ YSY+G S G C S + AA+I+ ++ VP N+E
Sbjct: 187 RGCDGGIMSDAFNYIIQNRGIASENDYSYQG-SDGRCRS--SARPAARISGFQTVPSNNE 243
Query: 238 ESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
++LL+AV+ QPVSV++DA+ F YSGGV++G C T NH VT VGYGTS++G KYWL
Sbjct: 244 QALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLA 303
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KNSWG+ WGE GY R++RD+ PQG CG+A +A +PV+
Sbjct: 304 KNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 161/344 (46%), Positives = 220/344 (63%), Gaps = 21/344 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K ++ + + SQ R + ++ E+ E W A+YG+ YK++AE KRF+IFKDN+
Sbjct: 7 KQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTPFLYKS 120
+E FN A GN+ Y L +N ADLT +EF S+ G K + +++ K NG F Y++
Sbjct: 67 EFIESFN--AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNG--FKYEN 122
Query: 121 -SQVPPSVNWIEKGAVTPVKYQG-QCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
+ +P +++W KGAVTP+K QG QC +AA EGI+ I LVSLSEQ+LVDC
Sbjct: 123 VTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ D+ GC GGFM+D F++II+N GIT++ Y Y+G+ G C++ A AQI YE
Sbjct: 183 DSVDD--GCEGGFMEDGFEFIIKNGGITSETNYPYKGVD-GTCNTTIAASPVAQIKGYEI 239
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP EE+L KAVANQPVSV+I A+ FYS G++NG C T L+HGVTAVGYGT E G
Sbjct: 240 VPSYSEEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT-ENG 298
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YW++KNSWG WGE GY R+ R I G CGIA+ +S+P +
Sbjct: 299 TDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 157/321 (48%), Positives = 210/321 (65%), Gaps = 19/321 (5%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
+ ++ S+ E+ EQW ++G+ Y+++ E KRF IFKDN+ +E FN A N+ Y L +
Sbjct: 29 KLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFN--AADNQPYKLSV 86
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
N ADLT EF AS+ G+K D + T F Y++ + +P +V+W KGAVTP+K Q
Sbjct: 87 NHLADLTLDEFKASRNGYKKIDREFTT----TSFKYENVTAIPAAVDWRVKGAVTPIKDQ 142
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC VAA EGIN I +LVSLSEQ+LVDC T + GC GG M+D F++II+
Sbjct: 143 GQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIK 202
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N GIT++ Y Y+ + G C++ A+IT YE VP N E+SLLKAVANQP+SV+ID
Sbjct: 203 NGGITSETNYPYKA-ADGSCNTATTTP-VAKITGYEKVPVNSEKSLLKAVANQPISVSID 260
Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A S+ FYS G++ G C T L+HGVTAVGYG S G YW++KNSWG WGE GY R+Q
Sbjct: 261 ASDSSFMFYSSGIYTGECGTELDHGVTAVGYG-SANGTDYWIVKNSWGTVWGEKGYIRMQ 319
Query: 313 RDIDQPQGQCGIAMFASFPVS 333
R I +G CGIAM +S+P +
Sbjct: 320 RGIAAKEGLCGIAMDSSYPTA 340
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 161/344 (46%), Positives = 220/344 (63%), Gaps = 21/344 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K ++ + + SQ R + ++ E+ E W A+YG+ YK++AE KRF+IFKDN+
Sbjct: 7 KQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNV 66
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTPFLYKS 120
+E FN A GN+ Y L +N ADLT +EF S+ G K + +++ K NG F Y++
Sbjct: 67 EFIESFN--AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNG--FKYEN 122
Query: 121 -SQVPPSVNWIEKGAVTPVKYQG-QCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
+ +P +++W KGAVTP+K QG QC +AA EGI+ I LVSLSEQ+LVDC
Sbjct: 123 VTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ D+ GC GGFM+D F++II+N GIT++ Y Y+G+ G C++ A AQI YE
Sbjct: 183 DSVDD--GCEGGFMEDGFEFIIKNGGITSETNYPYKGVD-GTCNTTIAASPVAQIKGYEI 239
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP EE+L KAVANQPVSV+I A+ FYS G++NG C T L+HGVTAVGYGT E G
Sbjct: 240 VPSYSEEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT-ENG 298
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YW++KNSWG WGE GY R+ R I G CGIA+ +S+P +
Sbjct: 299 TDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 158/348 (45%), Positives = 219/348 (62%), Gaps = 26/348 (7%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+ K L +L C A A D ++ + E+W QYGR YK++ E ++RFEIFK
Sbjct: 3 IPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
N+ +E FN GN + L +N+FADLT EF A++T GF S+++ T F
Sbjct: 63 ANVAFIESFN---AGNHKFWLSVNQFADLTNYEFRATKTNKGFI----PSTVRVP-TTFR 114
Query: 118 YKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
Y++ + P +V+W KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
LVDC + + GC GG MDDAFK+II+N G+T ++ Y Y + G C+ + AA I
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTA-ADGKCNG--GSNSAATIK 231
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
YEDVP N+E +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG
Sbjct: 232 GYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G +YWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 165/352 (46%), Positives = 219/352 (62%), Gaps = 23/352 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
MA + ++ I S AT R + E S EK EQW A++ R Y + E RF IFK
Sbjct: 1 MASTIIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFK 60
Query: 60 DNLVAVERFNNAAIGNR-SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA-----NG 113
NL V+ FN + N+ +Y + +N+F+DLT +EF A+ TG + + + + N
Sbjct: 61 KNLEFVQNFN---MNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNT 117
Query: 114 TPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
PF Y + S S++W ++GAVTPVKYQG+C AVAAVEGI I LVSLSE
Sbjct: 118 VPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSE 177
Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED--HA 223
QQL+DC D N GC GG M AF+YII+N+GIT + Y Y+ S A
Sbjct: 178 QQLLDC-DRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRA 236
Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAV 281
A I+ YE VP N+EE+LL+AV+ QPVSV I+ + A + YSGGVFNG C T L+H VT V
Sbjct: 237 ATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIV 296
Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
GYG SEEG KYW++KNSWG+ WGE+GY R++RD+D PQG CG+A+ A +P++
Sbjct: 297 GYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 158/348 (45%), Positives = 219/348 (62%), Gaps = 26/348 (7%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+ K L +L C A A D ++ + E+W QYGR YK++ E ++RFEIFK
Sbjct: 3 IPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
N+ +E FN GN + L +N+FADLT EF A++T GF S+++ T F
Sbjct: 63 ANVAFIESFN---AGNHKFWLGVNQFADLTNYEFRATKTNKGFI----PSTVRVP-TTFR 114
Query: 118 YKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
Y++ + P +V+W KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
LVDC + + GC GG MDDAFK+II+N G+T ++ Y Y + G C+ + AA I
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTA-ADGKCNG--GSNSAATIK 231
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
YEDVP N+E +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG
Sbjct: 232 GYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G +YWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 154/346 (44%), Positives = 220/346 (63%), Gaps = 22/346 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+ K L+ +L C S R D+ S+ + E W QYGR YK++AE +++FE+FK
Sbjct: 3 IPKASLLAILGCLCLCGSVLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N + FN GN + L +N+FAD+T +EF A++T + + T F+Y+
Sbjct: 63 ANAEFINSFN---AGNHKFWLGINQFADITNEEFKATKTNKGFISNKVRVP---TGFMYE 116
Query: 120 S---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ +P +++W KGAVTP+K QGQC AVAA+EGI + +LVSLSEQ+LV
Sbjct: 117 NMSFDALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELV 176
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + + GC GG MDDAFK+II+N G+T ++ Y Y+ + G C S AA I +Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDA-ADGKCKS--GSSSAATIKSY 233
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP N+E +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYGT+
Sbjct: 234 EDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTS 293
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G K+W++KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 294 DGTKFWIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 156/346 (45%), Positives = 219/346 (63%), Gaps = 22/346 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+ K ++ +L C+S R D+ S+A + E W AQYGR YK++AE +++FE+FK
Sbjct: 3 IPKASILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N ++ FN N + L +N+FADLT +EF A++T + + + T F Y+
Sbjct: 63 ANARFIDSFNAE---NHKFWLGINQFADLTNEEFKATKTNKGFISNKARVS---TGFKYE 116
Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ ++ P S++W KGAVTPVK QGQC AVAA EGI + +LVSLSEQ+LV
Sbjct: 117 NLKIEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELV 176
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + + GC GG MDDAFK+II N G+T ++ Y Y+ G C S A I +Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDA-EDGKCKS--GSKSAGTIKSY 233
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP N+E +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG +
Sbjct: 234 EDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTS 293
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G K+WL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 294 DGTKFWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 159/326 (48%), Positives = 203/326 (62%), Gaps = 18/326 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S + +E+W++ Y + + KRF +FK N++ V N ++ Y L+LNKFA
Sbjct: 33 EESFWDLYERWRS-YRTVSRSLGDKHKRFNVFKANVMHVHNTNKM---DKPYKLKLNKFA 88
Query: 87 DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
D+T EF ++ G K++ H + + NGT K VPPS +W + GAVT VK QGQ
Sbjct: 89 DMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGVKDQGQ 148
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C V AVEGIN IK N+LVSLSEQ+LVDC T N GC GG M+ AF++I Q
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKG 207
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
GIT ++ Y Y G CD+ KA D A I +E+VP NDE +LLKAVANQPVSVAIDA
Sbjct: 208 GITTESNYPYTAQD-GTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAG 266
Query: 257 AL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
QFY GVF G C T LNHGV VGYGT+ +G YW ++NSWG +WGE GY R+QR
Sbjct: 267 GFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRS 326
Query: 315 IDQPQGQCGIAMFASFPVSKESAQPS 340
I + +G CGIAM AS+P+ S P+
Sbjct: 327 IFKKEGLCGIAMMASYPIKNSSNNPT 352
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 165/345 (47%), Positives = 215/345 (62%), Gaps = 21/345 (6%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEK-FEQWKAQYGRTYKESAENSKRFEIFKDN 61
K FLIV L+ S C S R D+ I +K ++W A++GRTY + E + R+ +FK N
Sbjct: 7 KIFLIVSLV-SSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRN 65
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYK 119
+ +ER NN G R++ L +N+FADLT EF TG+K D S + T F Y+
Sbjct: 66 VERIERLNNVPAG-RTFKLAVNQFADLTNDEFRFMYTGYK-GDFVLFSQSQTKSTSFRYQ 123
Query: 120 S---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ +P +V+W +KGAVTP+K QG C AVAA+EG IK +L+SLSEQQLV
Sbjct: 124 NVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLV 183
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC TND GC GG MD AF++I+ G+T ++ Y Y+G C + AA IT Y
Sbjct: 184 DCDTNDF--GCSGGLMDTAFEHIMATGGLTTESNYPYKGEDAN-CKIKSTKPSAASITGY 240
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP NDE +L+KAVA+QPVSV I+ QFYS GVF G C T+L+H VTAVGY S
Sbjct: 241 EDVPVNDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSS 300
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYW+IKNSWG WGE GY R+++DI +G CG+AM AS+P
Sbjct: 301 AGSKYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYPT 345
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 162/339 (47%), Positives = 210/339 (61%), Gaps = 17/339 (5%)
Query: 5 FLIVVLIISGSCAS-QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ VL+IS S S AT T +E +E+W + + Y E +RFEIFKDNL
Sbjct: 13 LIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLK 72
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQ 122
VE +++I NR+Y + L +FADLT EF A KM + + G +LYK
Sbjct: 73 FVEE--HSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKM--ERTRVPVKGEKYLYKVGDS 128
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P +++W KGAV PVK QG C A+ AVEGIN IK L+SLSEQ+LVDC T+
Sbjct: 129 LPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS- 187
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N+GC GG MD AFK+II+N GI + Y Y +C+S K I YEDVP N
Sbjct: 188 YNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQN 247
Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE+SL KA+ANQP+SVAI+A A Q Y+ GVF G C T L+HGV AVGYG SE G YW
Sbjct: 248 DEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYG-SEGGQDYW 306
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+++NSWG +WGE GYF+L+R+I + G+CG+AM AS+P
Sbjct: 307 IVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 160/348 (45%), Positives = 220/348 (63%), Gaps = 26/348 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+ K L+ +L C+S R D+ S+ + E W QYGR YK++AE + +FE+FK
Sbjct: 3 IPKASLLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
N ++ FN GN + L +N+FAD+T +EF A++T GF S+ ++A T F
Sbjct: 63 ANAGFIDSFN---AGNHKFWLGINQFADITNKEFKATKTNKGF----ISNKVRAP-TGFS 114
Query: 118 YKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
Y++ +P S++W KGAVTPVK QGQC AVAA EGI + +LVSLSEQ+
Sbjct: 115 YENVSFDALPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQE 174
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
LVDC + + GC GG MDDAFK+II N G+T ++ Y Y+ G C S A I
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIISNGGLTQESSYPYDA-EDGKCKS--GSKSAGTIK 231
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
+YEDVP N+E +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG
Sbjct: 232 SYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGV 291
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+ +G KYWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 292 TSDGTKYWLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 157/348 (45%), Positives = 219/348 (62%), Gaps = 26/348 (7%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+ K L +L C A A D ++ + E+W QYGR YK++ E ++RFEIFK
Sbjct: 3 IPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
N+ +E FN GN + L +N+FADLT EF A++T GF S+++ T F
Sbjct: 63 ANVAFIESFN---AGNHKFWLGVNQFADLTNYEFRATKTNKGFI----PSTVRVP-TTFR 114
Query: 118 YKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
Y++ + P +V+W KGAVTP+K QGQC AVAA+EGI + +L+SLSEQ+
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
LVDC + + GC GG MDDAFK+II+N G+T ++ Y Y + G C+ + AA I
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTA-ADGKCNG--GSNSAATIK 231
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
YE+VP N+E +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+GYG
Sbjct: 232 GYEEVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G +YWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 157/347 (45%), Positives = 215/347 (61%), Gaps = 25/347 (7%)
Query: 6 LIVVLIISGSCASQATYRTF------DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
L++ ++ G C A DE ++ + EQW Q+GR YK+ + + RF +FK
Sbjct: 7 LLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRFLVFK 66
Query: 60 DNLVAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
N+ +E FN AA GNR + L +N+FADLT EF A++T + + + T F Y
Sbjct: 67 ANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKVP---TGFRY 123
Query: 119 KSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
++ + P +V+W KGAVTP+K QGQC AVAA EGI I +L SLSEQ+L
Sbjct: 124 QNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLTSLSEQEL 183
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC + + GC GG MDDAFK+II+N G+T ++ Y Y G C S + AA I
Sbjct: 184 VDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQD-GQCKS--GSNGAATIKG 240
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTS 286
YEDVP NDE +L+KAVA+QPVSVA+D + QFYSGGV G C T L+HG+ A+GYG +
Sbjct: 241 YEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKT 300
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
+G KYWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 301 SDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 151/320 (47%), Positives = 209/320 (65%), Gaps = 17/320 (5%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
DE ++ ++ +W ++GR Y ++ E + R+ +FK N+ +ER N+ G ++ L +N+
Sbjct: 29 LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSG-LTFKLAVNQ 87
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQ 141
FADLT +EF + TGFK + SS + T F Y+ S +P SV+W +KGAVTP+K Q
Sbjct: 88 FADLTNEEFRSMYTGFKGNSVLSS-RTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 146
Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
G C AVAA+EG+ IK +L+SLSEQ+LVDC TND GC GG MD AF Y I
Sbjct: 147 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDG--GCMGGLMDTAFNYTIT 204
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
G+T+++ Y Y+ + G C+ K + A I +EDVP NDE++L+KAVA+ PVS+ I
Sbjct: 205 IGGLTSESNYPYKS-TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIA 263
Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
QFYS GVF+G C T L+HGVTAVGYG S+ G+KYW++KNSWG WGE GY R++
Sbjct: 264 GGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIK 323
Query: 313 RDIDQPQGQCGIAMFASFPV 332
+DI GQCG+AM AS+P
Sbjct: 324 KDIKPKHGQCGLAMNASYPT 343
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 167/345 (48%), Positives = 221/345 (64%), Gaps = 23/345 (6%)
Query: 6 LIVVLII--SGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
L+ VLII +G SQAT RT F E S+ +K EQW A++ R Y++ E + R ++FK N
Sbjct: 7 LVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKN 66
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK----MSDHSSSLKANGTPFL 117
L +E FN GN+SY L +N+FAD T +EF+A TG K +S K +
Sbjct: 67 LKFIENFNKK--GNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTW 124
Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
S V S +W +GAVTPVKYQGQC AVAAVEG+ I LVSLSEQQL+D
Sbjct: 125 NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLD 184
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C + + C GG M DAF Y++QN+GI ++ YSY+G S G C S AA+I+ ++
Sbjct: 185 C-DREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQG-SDGGCRS--NARPAARISGFQ 240
Query: 231 DVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEE 288
VP N+E +LL+AV+ QPVSV++DA+ F YSGGV++G C T NH VT VGYGTS++
Sbjct: 241 TVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQD 300
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
G KYWL KNSWG+ W E GY R++RD+ PQG CG+A +A +PV+
Sbjct: 301 GTKYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 152/339 (44%), Positives = 213/339 (62%), Gaps = 22/339 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
MAK L +L C++ R D+ ++A + E+W AQYGR YK+ AE ++RFE+FK
Sbjct: 3 MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N +E FN GN + L +N+FADLT EF ++T ++ + T F Y+
Sbjct: 63 ANAAFIESFN---AGNHKFWLGVNQFADLTNDEFRLTKTNKGFIPSTTRVP---TGFRYE 116
Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ + P +++W KG VTP+K QGQC AVAA+EGI + +L+SLSEQ+LV
Sbjct: 117 NVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + + GC GG MDDAFK+II+N G+T ++ Y Y + C S+ + A I Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAA-ADDKCKSV--SNSVASIKGY 233
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP N+E +L+KAVANQPVSVA+D + QFY GGV G C T L+HG+ A+GYG +
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKAS 293
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAM 326
+G KYWL+KNSWG WGE+G+ R+++DI +G CG+AM
Sbjct: 294 DGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAM 332
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 206/328 (62%), Gaps = 22/328 (6%)
Query: 27 EGSIAEKFEQWKAQY--GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
E S+ E +E+W++ + R+ +E A KRF +FK N+ + N ++SY L+LNK
Sbjct: 31 ENSLWELYERWRSHHTVARSLEEKA---KRFNVFKHNVKHIHETNKK---DKSYKLKLNK 84
Query: 85 FADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
F D+T +EF + G + H K F+Y + + +P SV+W + GAVTPVK Q
Sbjct: 85 FGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQ 144
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC V AVEGIN I+ +L SLSEQ+LVDC TN N GC GG MD AF++I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTN-QNQGCNGGLMDLAFEFIKE 203
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
G+T++ VY Y+ S CD+ K I +EDVP N E+ L+KAVANQPVSVAID
Sbjct: 204 KGGLTSELVYPYKA-SDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAID 262
Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A S QFYS GVF G C T LNHGV VGYGT+ +G KYW++KNSWG++WGE GY R+Q
Sbjct: 263 AGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQ 322
Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPS 340
R I +G CGIAM AS+P+ + PS
Sbjct: 323 RGIRHKEGLCGIAMEASYPLKNSNTNPS 350
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 152/310 (49%), Positives = 203/310 (65%), Gaps = 22/310 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ + EQW QY R YK++ E ++RFE+FK N+ +E FN A GNR + L +N+FADLT
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFN--AGGNRKFWLGVNQFADLT 58
Query: 90 PQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQGQC 144
EF A++T GFK S +K T F Y++ V P +++W KGAVTP+K QGQC
Sbjct: 59 NDEFRATKTNKGFK----PSPVKVP-TGFRYENISVDALPATIDWRTKGAVTPIKDQGQC 113
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
EGI I +L+SLSEQ+LVDC + + GC GG MDDAFK+II+ G+T ++ Y
Sbjct: 114 -----EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTTESSY 168
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYS 262
Y + G C S + A + +EDVP NDE SL+KAVANQPVSVA+D + QFYS
Sbjct: 169 PYTA-ADGKCKS--GSNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQFYS 225
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GGV G C T L+HG+ A+GYG + +G KYWL+KNSWG WGE+GY R+++DI +G C
Sbjct: 226 GGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMC 285
Query: 323 GIAMFASFPV 332
G+AM S+P
Sbjct: 286 GLAMEPSYPT 295
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 156/314 (49%), Positives = 202/314 (64%), Gaps = 22/314 (7%)
Query: 34 FEQWKAQYG---RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP 90
+E+W++ Y R AE +RF +FK+N + N +R + L LNKFAD+T
Sbjct: 40 YERWRSHYTVSRRGLGADAE-ERRFNVFKENARYIHEGNKK---DRPFRLALNKFADMTT 95
Query: 91 QEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-- 145
EF + G ++ H S + +G+ + +PP+V+W +KGAVT +K QGQC
Sbjct: 96 DEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSC 155
Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
+ AVEGIN I+ +LVSLSEQ+L+DC N NN GC GG MD AF++I +N GIT
Sbjct: 156 WAFSTIVAVEGINKIRTGKLVSLSEQELMDC-DNVNNQGCDGGLMDYAFQFIHKN-GITT 213
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
++ Y Y+G G CD K + HA I YEDVP NDE +L KAVA QPVSVAIDAS
Sbjct: 214 ESNYPYQG-EQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGNDF 272
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
QFYS GVF G C T L+HGV AVGYGT+ +G KYW++KNSWG+DWGE GY R+QR + Q
Sbjct: 273 QFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQA 332
Query: 319 QGQCGIAMFASFPV 332
+GQCGIAM AS+P
Sbjct: 333 EGQCGIAMQASYPT 346
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 160/339 (47%), Positives = 201/339 (59%), Gaps = 18/339 (5%)
Query: 7 IVVLIISGSCA-SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
++ L + SCA +T + + + +E+W ++ + Y E KRF++FKDNL +
Sbjct: 12 LLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDNLGFI 71
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---Q 122
+ NN N +Y L LNKFAD+T +E+ G K +K T Y S Q
Sbjct: 72 QEHNNNQ--NNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDQ 129
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P V+W KGAV P+K QG C VA VE IN I + VSLSEQ+LVDC
Sbjct: 130 LPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC-DRA 188
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF++IIQN GI D Y Y G GICD K A I YEDVPP
Sbjct: 189 YNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFD-GICDPTKKNAKAVNIDGYEDVPPY 247
Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE +L KAVA QPVS+AI+AS ALQ Y GVF G C T L+HGV VGYG SE G+ YW
Sbjct: 248 DENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYG-SENGVDYW 306
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
L++NSWG WGEDGYF++QR++ P G+CGI M AS+PV
Sbjct: 307 LVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 167/355 (47%), Positives = 221/355 (62%), Gaps = 39/355 (10%)
Query: 4 YFLIVVLIISGSC-ASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
+ L+ + I+S + SQAT R TF E +AE +QW ++ R Y + E RF++FK N
Sbjct: 15 FMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKN 74
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
L +E+FN G+R+Y L +N+FAD T +EFIA+ TG K NG P
Sbjct: 75 LKFIEKFNKK--GDRTYKLGVNEFADWTREEFIATHTGLK--------GVNGIPSSEFVD 124
Query: 122 QVPPSVNW-------------IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLV 161
++ PS NW +GAVTPVKYQGQC +VAAVEG+ I N LV
Sbjct: 125 EMIPSWNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLV 184
Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
SLSEQQL+DC + +NGC GG M DAF YII+N+GI ++A Y Y+ + G C
Sbjct: 185 SLSEQQLLDC-DRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQA-AEGTCRY--NGK 240
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFN-GYCETFLNHGV 278
+A I ++ VP N+E +LL+AV+ QPVSV+IDA F YSGGV++ YC T +NH V
Sbjct: 241 PSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAV 300
Query: 279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
T VGYGTS EGIKYWL KNSWG+ WGE+GY R++RD+ PQG CG+A +A +PV+
Sbjct: 301 TFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 166/356 (46%), Positives = 222/356 (62%), Gaps = 26/356 (7%)
Query: 3 KYFLIV-----VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
K LIV VL++S S + DE S+ + +E+W++ + + + E KRF +
Sbjct: 5 KLLLIVLSIALVLVVSESFDFHDKDVSSDE-SLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT 114
FK N++ V N ++ Y L+LNKFAD+T EF + G K++ H + + +GT
Sbjct: 63 FKSNVMHVHNTNKM---DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGT 119
Query: 115 PFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
F+Y++ ++ P SV+W +KGAVT VK QGQC V AVEGIN IK NRLV LSEQ
Sbjct: 120 -FMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+L+DC N N GC GG M+ AF+YI Q GIT ++ Y Y + G CD+ K A I
Sbjct: 179 ELIDC-DNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTA-NDGSCDATKENVPAVSI 236
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYG 284
+E VP NDE++LLKAVANQPVSVAIDA S QFYS GVF G C LNHGV VGYG
Sbjct: 237 DGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYG 296
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
T+ +G YW+++NSWG +WGE GY R++R++ +G CGIAM AS+PV S P+
Sbjct: 297 TTVDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPVKNSSKNPA 352
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 211/322 (65%), Gaps = 25/322 (7%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
D+ S+ + E W +QYGR+YK++AE ++FE+FK N ++ FN N + L +N+F
Sbjct: 29 DDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAFIDSFNAK---NHKFWLGINQF 85
Query: 86 ADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKY 140
AD+T +EF ++T GF S+ ++A+ T F Y++ + P +++W KGAVTPVK
Sbjct: 86 ADITNEEFKVTKTNKGF----ISNKVRAS-TGFSYENVSIDALPATIDWRTKGAVTPVKD 140
Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC AVAA EGI + +LVSLSEQ+LVDC + + GC GG MDDAFK+II
Sbjct: 141 QGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFII 200
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
N G+T ++ Y Y+ G C S A I +YEDVP N+E +L+KAVANQPVSVA+
Sbjct: 201 TNGGLTQESSYPYDA-EDGKCKS--GSKSAGTIKSYEDVPANNEGALMKAVANQPVSVAV 257
Query: 254 DASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
D + QFYSGGV G C T L+HG+ A+GYG + +G KYWL+KNSWG WGE+G+ R+
Sbjct: 258 DGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFLRM 317
Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
++DI +G CG+AM S+P +
Sbjct: 318 EKDIADKKGMCGLAMEPSYPTA 339
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 159/315 (50%), Positives = 200/315 (63%), Gaps = 17/315 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ + FE W +++GR Y+ + E +RFEIFKDNL ++ N R+Y L LN+FADL+
Sbjct: 43 LIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKV---RNYWLGLNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
+EF G K D S + F YK +P SV+W +KGAVTPVK QG C
Sbjct: 100 HEEFKNKYLGLK-PDLSKRAQCP-EEFTYKDVAIPKSVDWRKKGAVTPVKNQGSCGSCWA 157
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC T NNGC GG MD AF YI+ N G+ +
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFAYIVANGGLHKEE 216
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G CD K E A I+ Y DVP N EESLLKA+ANQP+S+AI+AS QF
Sbjct: 217 DYPYI-MEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQF 275
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C T L+HGV AVGYGTS +G+ Y ++KNSWG WGE GY R++R +P+G
Sbjct: 276 YSGGVFDGHCGTELDHGVAAVGYGTS-KGLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEG 334
Query: 321 QCGIAMFASFPVSKE 335
CGI AS+P K+
Sbjct: 335 ICGIYKMASYPTKKK 349
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 172/363 (47%), Positives = 228/363 (62%), Gaps = 30/363 (8%)
Query: 1 MAK--YFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENS 52
MAK Y L+ V+++ GS A A FDE +A + +E+W+A + + ++ +
Sbjct: 1 MAKLSYALLSVVLVLGSVA-LAQSIPFDEKDLASEESLWSLYEKWRAHHAVS-RDLDDTD 58
Query: 53 KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA- 111
KRF +FK+N+ + FN + +Y L LNKF D+T QEF ++ G K+ DH +L+
Sbjct: 59 KRFNVFKENVKFIHEFNQKK--DATYKLALNKFGDMTNQEFRSTYAGSKI-DHHMTLRGV 115
Query: 112 -NGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVS 162
+ F Y K +P SV+W EKGAVT VK QGQC V AVEGIN IK N LVS
Sbjct: 116 KDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNELVS 175
Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
LSEQQLVDC T N+GC GG MD AF +I N G++++ Y Y C S +A
Sbjct: 176 LSEQQLVDCDTK--NSGCNGGLMDYAFDFIKNNGGLSSEDSYPYLAEQKS-CGS-EANSA 231
Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTA 280
I Y+DVP N+E +L+KAVANQPVSVAI+AS A QFYS GVF+G+C T L+HGV A
Sbjct: 232 VVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQFYSQGVFSGHCGTELDHGVAA 291
Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
VGYG ++G KYW++KNSWG+ WGE GY R++R I +G+CGIAM AS+P+ K S P
Sbjct: 292 VGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRGKCGIAMEASYPI-KSSPNPK 350
Query: 341 SAD 343
A+
Sbjct: 351 KAE 353
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 166/364 (45%), Positives = 211/364 (57%), Gaps = 23/364 (6%)
Query: 1 MAKYFLIVVLIISGSCASQA----TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
MA +I L+ S A T + + + +E+W ++ + Y E + KRF+
Sbjct: 1 MASMTMIYTLLFLSFTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQ 60
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF 116
+FKDNL ++ NN N +Y L LNKFAD+T +E+ A G K + +K T
Sbjct: 61 VFKDNLGFIQEHNNNL--NNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGH 118
Query: 117 LYKSS---QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
Y S ++P V+W KGAV P+K QG C VA VE IN I + VSLSEQ
Sbjct: 119 RYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQ 178
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+LVDC N GC GG MD AF++IIQN GI D Y Y G GICD K I
Sbjct: 179 ELVDC-DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFD-GICDPTKKNAKVVNI 236
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYG 284
YEDVPP DE +L KAVA+QPVSVAI+AS ALQ Y GVF G C T L+HGV VGYG
Sbjct: 237 DGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYG 296
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK--ESAQPSSA 342
SE G+ YWL++NSWG WGEDGYF++QR++ G+CGI M AS+PV SA P+S
Sbjct: 297 -SENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPVKNGLNSAVPNSV 355
Query: 343 DKSS 346
+S+
Sbjct: 356 YEST 359
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 213/340 (62%), Gaps = 38/340 (11%)
Query: 18 SQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
SQAT R TF E +AE +QW ++ R Y + E RF++FK NL +E+FN G+R
Sbjct: 6 SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKK--GDR 63
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNW------- 129
+Y L +N+FAD T +EFIA+ TG K NG P ++ PS NW
Sbjct: 64 TYKLGVNEFADWTREEFIATHTGLK--------GVNGIPSSEFVDEMIPSWNWNVSDVAG 115
Query: 130 ------IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
+GAVTPVKYQGQC +VAAVEG+ I N LVSLSEQQL+DC +
Sbjct: 116 RETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDC-DRER 174
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+NGC GG M DAF YII+N+GI ++A Y Y+ + G C +A I ++ VP N+
Sbjct: 175 DNGCNGGIMSDAFSYIIKNRGIASEASYPYQA-AEGTCRYNGKP--SAWIRGFQTVPSNN 231
Query: 237 EESLLKAVANQPVSVAIDASALQF--YSGGVFN-GYCETFLNHGVTAVGYGTSEEGIKYW 293
E +LL+AV+ QPVSV+IDA F YSGGV++ YC T +NH VT VGYGTS EGIKYW
Sbjct: 232 ERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYW 291
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
L KNSWG+ WGE+GY R++RD+ PQG CG+A +A +PV+
Sbjct: 292 LAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 331
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 214/337 (63%), Gaps = 14/337 (4%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
++LI+ LI++ R E +E+ E+W AQYG+ Y ++AE KRF+IFK+N+
Sbjct: 8 HYLILFLILT-VWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQ 66
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E FN A G++ + L +N+FADL +EF AS + + S A T F Y+S ++
Sbjct: 67 FIESFN--AAGDKPFNLSINQFADLHNEEFKASLINVQKKE-SGVETATETSFRYESITK 123
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P +++W ++GAVTP+K QG C VAA+EGI+ I +LVSLSEQ+LVDC
Sbjct: 124 IPVTMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDC-VKG 182
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
+ GC G+ ++AF+++ +N G+ ++ Y Y+ + C K AQI YE+VP N
Sbjct: 183 KSEGCNFGYKEEAFEFVAKNGGLASEISYPYKA-NNKTCMVKKETQGVAQIKGYENVPSN 241
Query: 236 DEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
E++LLKAVANQPVSV IDA ALQFYS G+F G C T NH VT +GYG + G KYWL+
Sbjct: 242 SEKALLKAVANQPVSVYIDAGALQFYSSGIFTGKCGTAPNHAVTVIGYGKARGGAKYWLV 301
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KNSWG WGE GY +++RDI +G CGIA AS+P
Sbjct: 302 KNSWGTKWGEKGYIKMKRDIRAKEGLCGIATNASYPT 338
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 162/344 (47%), Positives = 216/344 (62%), Gaps = 41/344 (11%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+ K I +L++ + ASQA R +E ++ EK EQW A++GRTY++S E +RF+IFK
Sbjct: 5 LEKKLAIALLVVFSTWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFK 64
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL ++ FN A+ N++Y L LN FADL+ +E++A+ T KM
Sbjct: 65 SNLEYIDNFNKAS--NQTYQLGLNNFADLSHEEYVATYTARKMP---------------- 106
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+VP S++W + GAVTP+K Q QC A AAVEGI + VSLS QQL+DC
Sbjct: 107 -VEVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI----VANGVSLSAQQLLDCV 161
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
++ N GC GG+M++AF YIIQN+GI + Y Y+ M +C S A AAQI+ +EDV
Sbjct: 162 SD--NQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQ-MCSSRMA---AAQISGFEDV 215
Query: 233 PPNDEESLLKAVANQPVSVAIDASA---LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEE 288
P DEE+L++AVA QPVSV IDA++ + Y GVF C +H VT VGYGTSE+
Sbjct: 216 TPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYGTSED 275
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL KNSWG+ WGE GY RLQRDI G CGIA++AS+P
Sbjct: 276 GTKYWLAKNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYPT 319
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 213/337 (63%), Gaps = 14/337 (4%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
++LI+ LI++ R E +E+ E+W AQYG+ Y ++AE KRF+IFK+N+
Sbjct: 8 HYLILFLILT-VWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQ 66
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E FN A G++ + L +N+FADL +EF AS + + S A T F Y+S ++
Sbjct: 67 FIESFN--AAGDKPFNLSINQFADLHNEEFKASLINVQKKE-SGVETATETSFRYESITK 123
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P +++W ++GAVTP+K QG C VAA+EGI+ I +LVSLSEQ+LVDC
Sbjct: 124 IPVTMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDC-VKG 182
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
+ GC G+ ++AF+++ +N G+ ++ Y Y+ + C K AQI YE+VP N
Sbjct: 183 KSEGCNFGYKEEAFEFVAKNGGLASEISYPYKA-NNKTCMVKKETQGVAQIKGYENVPSN 241
Query: 236 DEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
E++LLKAVANQPVSV IDA ALQFYS G+F G C T NH T +GYG + G KYWL+
Sbjct: 242 SEKALLKAVANQPVSVYIDAGALQFYSSGIFTGKCGTAPNHAATVIGYGKARGGAKYWLV 301
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KNSWG WGE GY R++RDI +G CGIA AS+P
Sbjct: 302 KNSWGTKWGEKGYIRMKRDIRAKEGLCGIATNASYPT 338
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 157/346 (45%), Positives = 209/346 (60%), Gaps = 27/346 (7%)
Query: 6 LIVVLIISGSC-ASQATYRTFDEG----SIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
L++V I+ C S A + G ++A + EQW AQ+GR YK+ AE + R E+FK
Sbjct: 8 LLLVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKA 67
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLY 118
N+ +E FN N + L N+FADLT EF AS+T G K ++ T F Y
Sbjct: 68 NVAFIESFNAE---NHEFWLGANQFADLTNDEFRASKTNKGIK----QGGVRDAPTGFKY 120
Query: 119 KSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
+ P SV+W KGAVTP+K QGQC AVAA EG+ + +LVSLSEQ+L
Sbjct: 121 SDVSIDALPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQEL 180
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC + + GC GG+MDDAFK+II+N G+T +A Y Y G C S + + AA I
Sbjct: 181 VDCDVHGVDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTG-EDDKCKSNETVNVAATIKG 239
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTS 286
YEDVP NDE +L+KAVA+QPVSV +D + Q Y+GGV G C ++HG+ A+GYG +
Sbjct: 240 YEDVPANDESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGAT 299
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSWG WGE G+ R+ +DI +G CG+AM S+P
Sbjct: 300 SNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYPT 345
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 155/327 (47%), Positives = 204/327 (62%), Gaps = 23/327 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ +E+W++ + + ++ + KRF +FK+N+ + FN + ++ L LNKF
Sbjct: 31 EDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFKENVKFIHEFNKNK--DVTFKLALNKFG 87
Query: 87 DLTPQEFIASQTGFKMSDH-----SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
D+T QEF A G K+ H S +G F+Y+++ PPS++W E+GAV VK Q
Sbjct: 88 DMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKFMYENAVAPPSIDWRERGAVAAVKNQ 147
Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC A+AAVEGIN I LV LSEQ+L+DC T D N GC GG MD AF++I
Sbjct: 148 GQCGSCWAFSAIAAVEGINQIVTKELVPLSEQELIDCDT-DQNQGCSGGLMDYAFEFIKN 206
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N GIT + VY Y+ + K A I YEDVP NDE++L+KAVANQPV+VAI+
Sbjct: 207 NGGITTEDVYPYQAEDA----TCKKNSPAVVIDGYEDVPTNDEDALMKAVANQPVAVAIE 262
Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
AS QFYS GVF G C T L+HGV VGYGT+++G KYW ++NSWG DWGE GY R+Q
Sbjct: 263 ASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDGTKYWTVRNSWGADWGESGYVRMQ 322
Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQP 339
R I G CGIAM AS+P+ K S P
Sbjct: 323 RGIKATHGLCGIAMQASYPI-KTSLNP 348
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 166/355 (46%), Positives = 219/355 (61%), Gaps = 39/355 (10%)
Query: 4 YFLIVVLIISGSC-ASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
+ + + I+S S SQAT R TF E +AE +QW ++ R Y + E RF++FK N
Sbjct: 6 FMFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKN 65
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
L +E+FN G+R+Y L +N+FAD T +EFIA+ TG K NG P
Sbjct: 66 LKFIEKFNKK--GDRTYKLGVNEFADWTKEEFIATHTGLK--------GFNGIPSSEFVD 115
Query: 122 QVPPSVNW-------------IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLV 161
++ PS NW +GAVTPVKYQGQC +VAAVEG+ I LV
Sbjct: 116 EMIPSWNWNVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLV 175
Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
SLSEQQL+DC + +NGC GG M DAF YII+N+GI ++A Y Y+ + G C
Sbjct: 176 SLSEQQLLDC-DRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQ-ETEGTCR--YNAK 231
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFN-GYCETFLNHGV 278
+A I ++ VP N+E +LL+AV+ QPVSV+IDA F YSGGV++ YC T +NH V
Sbjct: 232 PSAWIRGFQTVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAV 291
Query: 279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
T VGYGTS EGIKYWL KNSWG+ WGE+GY R++RD+ PQG CG+A +A +PV+
Sbjct: 292 TFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 160/327 (48%), Positives = 211/327 (64%), Gaps = 22/327 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESA-ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
E S+ + +E+W++ + T S E KRF +F+ N++ V N ++ Y L+LNKF
Sbjct: 31 EESLWDLYEKWRSHH--TVSTSLDEKRKRFNVFRANVLHVHNTNKM---DKPYKLKLNKF 85
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKA---NGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
AD+T EF + K+ H+ A NG+ F+Y + +VP S++W +KGAVTPVK Q
Sbjct: 86 ADMTNHEFRTAYASSKVKHHTMFRGAPLGNGS-FMYGNIDKVPASIDWRKKGAVTPVKDQ 144
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
G+C + AVEGIN IK N+L+SLSEQ+LVDC T +N+ GC GG MD AF++I +
Sbjct: 145 GKCGSCWAFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENH-GCNGGLMDYAFEFITK 203
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
KGIT +A Y Y G CD+ KA A I +EDV N+E +LLKAVANQPVSVAID
Sbjct: 204 QKGITTEANYPYRAQD-GHCDANKANQPAVSIDGHEDVLHNNENALLKAVANQPVSVAID 262
Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A S QFYS GVF G C L+HGV VGYGT+ +G KYW+++NSWG +WGE GY R+Q
Sbjct: 263 AGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRMQ 322
Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQP 339
R I +G CGIAM AS+P+ K S P
Sbjct: 323 RGISDRRGLCGIAMEASYPIKKSSTNP 349
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 160/355 (45%), Positives = 222/355 (62%), Gaps = 29/355 (8%)
Query: 1 MAKYFLIVVL------IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKR 54
+A LI+++ ++ + A D+ ++ E++E+W A +GRTYK+S E ++R
Sbjct: 10 LAAILLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSLEKARR 69
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
FE+F+ N + ++ FN AA G +S L NKFADLT +EF A G S + G+
Sbjct: 70 FEVFRTNALFIDSFN-AAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFS----TPVIGGS 123
Query: 115 PFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
F+Y ++S VP ++NW ++GAVT VK Q CA VAAVEGI+ I+ + LV+LS
Sbjct: 124 GFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLVALS 183
Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
QQL+DC+T NN+GC G MD+AF+YI N GI ++ Y YE + G C + + AA
Sbjct: 184 TQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTCRA-SGKPVAA 242
Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVF----NGYCETFLNHGV 278
I ++ VPPN+E +LL AVA+QPVSVA+D QF+S GVF N C T LNH +
Sbjct: 243 SIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLNHAM 302
Query: 279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
TAVGYGT E G KYWL+KNSWG DWGE GY ++ RD+ G CG+AM S+PV+
Sbjct: 303 TAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPVA 357
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 204/313 (65%), Gaps = 21/313 (6%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
S++E+FE WK +YG YK+ AE K F+IFK N+ ++ FN A GN+ Y L +N+F D
Sbjct: 37 SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFN--AAGNKPYKLAINRFVDK 94
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC--- 144
++ S GF+ + ++ F Y++ + +P +V+W ++GAVTP+K QG+C
Sbjct: 95 PIED---SDDGFERTTTTTPTTT----FKYENVTDIPATVDWRKRGAVTPIKNQGKCGSC 147
Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
AVAA+EGI I LVSLSEQQLVDC + GC G M +AFK+I++N GI
Sbjct: 148 WAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIAT 207
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL-Q 259
+A Y Y+ + G C + H QI +YE+VP N E+SLLKAVANQPVSV ID + +
Sbjct: 208 EANYPYKRVVKGTCKKV---SHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGMFK 264
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYS G+F G C T NH +T VGYGTS++GIKYWL+KNSW + WGE GY R++RDID +
Sbjct: 265 FYSSGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDIDAKE 324
Query: 320 GQCGIAMFASFPV 332
G CGIAM S+P+
Sbjct: 325 GLCGIAMKPSYPI 337
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 160/343 (46%), Positives = 218/343 (63%), Gaps = 26/343 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
+A + L+ + I SQ R E S+ E+ E W A+YG+ YK +AE + F+IFK+
Sbjct: 11 LALFLLLSIEI------SQVMSRKLHETSLREEHENWIARYGQVYKVAAEK-ETFQIFKE 63
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
N+ +E FN AA N+ Y L +N FADLT +EF + G K + H S+ TPF Y++
Sbjct: 64 NVEFIESFNAAA--NKPYKLGVNLFADLTLEEFKDFRFGLKKT-HEFSI----TPFKYEN 116
Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
+ +P +++W EKGAVTP+K QGQC VAA EGI+ I LVSL EQ+LV C
Sbjct: 117 VTDIPEALDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCD 176
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
T + GC GG+M+D F++II+N GIT A Y Y+G++ G C++ A AQI YE V
Sbjct: 177 TKGVDQGCEGGYMEDGFEFIIKNGGITTKANYPYKGVN-GTCNTTIAASTVAQIKGYETV 235
Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P EE+L KAVANQPVSV+IDA+ FY+GG++ G C T L+HGVTAVGYGT+ E
Sbjct: 236 PSYSEEALQKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNE-T 294
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YW++KNSWG W E G+ R+QR I G CG+A+ +S+P +
Sbjct: 295 DYWIVKNSWGTGWDEKGFIRMQRGITVKHGLCGVALDSSYPTT 337
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 154/324 (47%), Positives = 202/324 (62%), Gaps = 22/324 (6%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
D ++A++ E+W A++GR Y + AE ++R E+F+DN+ +E N AA + L N+F
Sbjct: 32 DAAAMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVN-AAASQHKFWLEENQF 90
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-----PPSVNWIEKGAVTPVKY 140
ADLT EF A++TG + SS + N P ++ + V P SV+W KGAV PVK
Sbjct: 91 ADLTNAEFRATRTGLR----PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKD 146
Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QG C AVAA+EG + +LVSLSEQQLV C + GC GG MDDAF +II
Sbjct: 147 QGDCGCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFII 206
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
+N G+ ++ Y Y S C + A AA I YEDVP NDE +LLKAVANQPVSVAI
Sbjct: 207 KNGGLAAESDYPYTA-SDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAI 265
Query: 254 DAS--ALQFYSGGVFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
D QFY GGV +G C T L+H +TAVGYG + +G KYWL+KNSWG WGEDGY
Sbjct: 266 DGGDRHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYV 325
Query: 310 RLQRDIDQPQGQCGIAMFASFPVS 333
R++R + +G CG+AM AS+P +
Sbjct: 326 RMERGVADKEGVCGLAMMASYPTA 349
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 152/338 (44%), Positives = 211/338 (62%), Gaps = 27/338 (7%)
Query: 1 MAKYF---LIVVLIISGSCASQATYRTFD----EGSIAEKFEQWKAQYGRTYKESAENSK 53
MA ++ +++ +++ +CA + D + ++ + E+W A+Y R Y ++AE ++
Sbjct: 1 MATHYSSAFVLLSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAAEKAR 60
Query: 54 RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG 113
RFE+FK N+ +E N GN + L N+FADLT EF A+ TG++ ++S K
Sbjct: 61 RFEVFKANMALIESVN---AGNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASSKGRS 117
Query: 114 ----TPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINR 159
T F Y + VP SV+W KGAVTP+K QG+C AVA++EG+ + +
Sbjct: 118 RTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLSTGK 177
Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
LVSLSEQ+LVDC N + GC GG MDDAF +I+ N G+T ++ Y Y S G C+S +A
Sbjct: 178 LVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTA-SDGTCNSNEA 236
Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHG 277
AA I YEDVP NDE SL KAVANQPVSVA+D S +FY GGV +G C T L+HG
Sbjct: 237 SGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELDHG 296
Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
+ AVGYG + +G KYW++KNSWG WGE GY R++RDI
Sbjct: 297 IAAVGYGVASDGTKYWVMKNSWGTSWGEAGYIRMERDI 334
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 156/328 (47%), Positives = 202/328 (61%), Gaps = 22/328 (6%)
Query: 27 EGSIAEKFEQWKAQY--GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
E S+ E +E+WK+ + R+ +E A KRF +FK N+ + N SY L+LNK
Sbjct: 31 EDSLWELYERWKSHHTIARSLEEKA---KRFNVFKHNVKHIHETNKK---ENSYKLKLNK 84
Query: 85 FADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
F D+T +EF + G + H + F+Y + +P SV+W + GAVTPVK Q
Sbjct: 85 FGDMTSEEFRRTYAGSNIKHHRMFQGERQTTKSFMYANVDTLPTSVDWRKNGAVTPVKNQ 144
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC V AVEGIN I+ +L SLSEQ+LVDC TN N GC GG MD AF++I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTN-KNQGCNGGLMDLAFEFIKE 203
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
G+T++ VY Y+ S CD+ K I +EDVP N E L+KAVA+QPVSVAID
Sbjct: 204 KGGLTSELVYPYKA-SDETCDTNKENAPVVSIDGHEDVPKNSEVDLMKAVAHQPVSVAID 262
Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A S QFYS GVF G C T LNHGV VGYGT+ +G KYW++KNSWG++WGE GY R+Q
Sbjct: 263 AGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQ 322
Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPS 340
R I +G CGIAM AS+P+ + PS
Sbjct: 323 RGIRHKEGLCGIAMEASYPLKNSNTNPS 350
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 149/316 (47%), Positives = 206/316 (65%), Gaps = 17/316 (5%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
DE ++ ++ +W ++GR Y ++ E + R+ +FK N+ +ER N+ G ++ L +N+
Sbjct: 23 LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSG-LTFKLAVNQ 81
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQ 141
FADLT +EF + TGFK + SS + T F Y+ S +P SV+W +KGAVTP+K Q
Sbjct: 82 FADLTNEEFRSMYTGFKGNSVLSS-RTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 140
Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
G C AVAA+EG+ IK +L+SLSEQ+LVDC TND GC GG MD AF Y I
Sbjct: 141 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDG--GCMGGLMDTAFNYTIT 198
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
G+T+++ Y Y+ + G C+ K + A I +EDVP NDE++L+KAVA+ PVS+ I
Sbjct: 199 IGGLTSESNYPYKS-TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIA 257
Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
QFYS GVF+G C T L+HGVTAVGYG S+ G+KYW++KNSWG WGE GY R++
Sbjct: 258 GGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIK 317
Query: 313 RDIDQPQGQCGIAMFA 328
+DI GQCG+AM A
Sbjct: 318 KDIKPKHGQCGLAMNA 333
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 156/317 (49%), Positives = 204/317 (64%), Gaps = 22/317 (6%)
Query: 34 FEQWKAQYGRTYK-ESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+++W Q+ T +S E+++RFEIFK+N+ ++ N + Y L LNKFADL+ +E
Sbjct: 45 YDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKK---DGPYKLGLNKFADLSNEE 101
Query: 93 FIASQTGFKMSDHSSSLKANGT---PFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
F A KM H S G F+Y++S ++P S++W +KGAVTPVK QGQC
Sbjct: 102 FKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCW 161
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
+A+VEGIN IK +LVSLSEQQLVDC+ N GC GG MD+AF+YII N GI +
Sbjct: 162 AFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE--NAGCNGGLMDNAFQYIIDNGGIVTE 219
Query: 202 AVYSYEGMSTGICDSIKAEDH--AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
Y Y G C + K E A I +EDVP N+E +L KAVA+QPVS+AI+AS
Sbjct: 220 DEYPYTA-EAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHD 278
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
QFYS GVF G C T L+HGV VGYG S EGI YW+++NSWG +WGE GY R+QR I+
Sbjct: 279 FQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRMQRGIEA 338
Query: 318 PQGQCGIAMFASFPVSK 334
+G+CGI+M AS+P K
Sbjct: 339 TEGKCGISMQASYPTKK 355
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 163/344 (47%), Positives = 215/344 (62%), Gaps = 28/344 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
+ +F++ + +C +A+ RT E SIA + E+W A + R Y +SAE +R +IFK+
Sbjct: 10 VGTFFMLFL-----TCICRASSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKE 64
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG--FKMSDHSSSLKANGTPFLY 118
NL +E+ NN G + Y L LN FADLT +EF+AS TG +K S K N + +
Sbjct: 65 NLEFIEKHNNE--GKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFH 122
Query: 119 KSS--QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
K S + S++W ++GAV +K QG+C AVAAVEGIN IK +LVSLSEQ LV
Sbjct: 123 KMSVGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLV 182
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DCA+ND GC+G +++ AF YI ++ G+ N+ Y Y + G C + A QI Y
Sbjct: 183 DCASND---GCHGQYVEKAFDYI-RDYGLANEEEYPYV-ETVGTCSG--NSNPAIQIRGY 235
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
+ V P +EE LL AVA+QPVSV ++A QFYSGGVF+G C T LNH VT VGYG
Sbjct: 236 QSVTPQNEEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEA 295
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
EG KYWLI+NSWG+ WGE GY +L RD PQG CGI M AS+P
Sbjct: 296 EG-KYWLIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYP 338
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 163/356 (45%), Positives = 220/356 (61%), Gaps = 26/356 (7%)
Query: 3 KYFLIV-----VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
K LIV VL++S S + DE S+ + +E+W++ + + + E KRF +
Sbjct: 5 KLLLIVLSIALVLVVSESFDFHDKDVSSDE-SLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT 114
FK N++ V N ++ Y L+LNKFAD+T EF + G K++ H + + +GT
Sbjct: 63 FKSNVMHVHNTNKM---DKPYKLKLNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSGT 119
Query: 115 PFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
F+Y++ ++ P SV+W +KGAVT VK QGQC V AVEGIN IK NRLV LSEQ
Sbjct: 120 -FMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+L+DC N N GC GG M+ AF+YI Q G+T ++ Y Y + G CD+ K I
Sbjct: 179 ELIDC-DNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTA-NDGSCDATKENVPTVSI 236
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYG 284
+E VP NDE++LLKAVANQPVSVAIDA S QFYS GVF G C LNHGV VGYG
Sbjct: 237 DGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYG 296
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
T+ +G YW+++NSWG +WGE G R++R++ +G CGIAM AS+PV S P+
Sbjct: 297 TTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPVKNSSKNPA 352
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 207/322 (64%), Gaps = 17/322 (5%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
R DE ++ ++ W ++GR Y ++ E + R+ +FK N+ ++ER N G ++ L +
Sbjct: 26 RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYG-LTFKLAV 84
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVK 139
N+FADLT +EF + TG+K + SS + T F Y+ S +P SV+W +KGAVTP+K
Sbjct: 85 NQFADLTNEEFRSMYTGYKGNSVLSS-RTKPTSFRYQHVSSDALPISVDWRKKGAVTPIK 143
Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QG C AVAA+EG+ IK +L+SLSEQ+LVDC TND+ GC GG+M+ AF Y
Sbjct: 144 DQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDD--GCMGGYMNSAFNYT 201
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
+ G+T+++ Y Y+ + G C+ K + A I +EDVP NDE++L+KAVA+ PVS+
Sbjct: 202 MTTGGLTSESNYPYKS-TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIG 260
Query: 253 I--DASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
I + QFYS GVF+G C T L+HGV VGYG S G KYW++KNSWG WGE GY R
Sbjct: 261 IAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMR 320
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+++D GQCG+AM AS+P
Sbjct: 321 IKKDTKAKHGQCGLAMNASYPT 342
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 201/315 (63%), Gaps = 15/315 (4%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFE+FKDNL ++ N +Y L LN+FADL+
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIV---SNYWLGLNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
QEF G K++ +N F Y+ +P SV+W +KGAVTPVK QGQC
Sbjct: 100 HQEFKNKYLGLKVNLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWA 159
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC T NNGC GG MD AF +I+QN G+ +
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVQNGGLHKED 218
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M C+ K E I Y DVP N+E+SLLKA+ANQP+SVAI+AS+ QF
Sbjct: 219 DYPYI-MEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQF 277
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV+AVGYGTS + + Y ++KNSWG WGE G+ R++R+I +P+G
Sbjct: 278 YSGGVFDGHCGSDLDHGVSAVGYGTS-KNLDYIIVKNSWGAKWGEKGFIRMKRNIGKPEG 336
Query: 321 QCGIAMFASFPVSKE 335
CG+ AS+P K+
Sbjct: 337 ICGLYKMASYPTKKK 351
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 163/356 (45%), Positives = 220/356 (61%), Gaps = 26/356 (7%)
Query: 3 KYFLIV-----VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
K LIV VL++S S + DE S+ + +E+W++ + + + E KRF +
Sbjct: 5 KLLLIVLSIALVLVVSESFDFHDKDVSSDE-SLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT 114
FK N++ V N ++ Y L+LNKFAD+T EF + G K++ H + + +GT
Sbjct: 63 FKSNVMHVHNTNKM---DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGT 119
Query: 115 PFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
F+Y++ ++ P SV+W +KGAVT VK QGQC V AVEGIN IK NRLV LSEQ
Sbjct: 120 -FMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178
Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
+L+DC N N GC GG M+ AF+YI Q G+T ++ Y Y + G CD+ K I
Sbjct: 179 ELIDC-DNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTA-NDGSCDATKENVPTVSI 236
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYG 284
+E VP NDE++LLKAVANQPVSVAIDA S QFYS GVF G C LNHGV VGYG
Sbjct: 237 DGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYG 296
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
T+ +G YW+++NSWG +WGE G R++R++ +G CGIAM AS+PV S P+
Sbjct: 297 TTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPVKNSSKNPA 352
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 221/342 (64%), Gaps = 19/342 (5%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
++LI+ L+++ S R E +E+ E+W AQYGR YK++AE KRF++FK+N+
Sbjct: 8 HYLILFLVLA-VWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVH 66
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E FN A G++ + L +N+FADL +EF A + S + T F Y+S ++
Sbjct: 67 FIESFN--AAGDKPFNLSINQFADLNDEEFKALLINVQ-KKASWVETSTETSFRYESVTK 123
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P +++W ++GAVTP+K QG+C AVAA EGI+ I +LV LSEQ+LVDC +
Sbjct: 124 IPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGE 183
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPP 234
+ GC GG++DDAF++I + GI ++ Y Y+G++ C +K E H A+I YE VP
Sbjct: 184 SE-GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNK-TC-KVKKETHGVAEIKGYEKVPS 240
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIK 291
N+E++LLKAVANQPVSV IDA A ++YS G+FN C T NH V VGYG + +G K
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDGSK 300
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YWL+KNSWG +WGE GY R++RDI +G CGIA + +P +
Sbjct: 301 YWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 154/342 (45%), Positives = 221/342 (64%), Gaps = 19/342 (5%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
++LI+ L++S S R E +E+ E+W AQYGR YK++AE KRF++FK+N+
Sbjct: 8 HYLILFLVLS-VWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVH 66
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E FN A G++ + L +N+FADL +EF A + S + T F Y+S ++
Sbjct: 67 FIESFN--AAGDKPFNLSINQFADLNDEEFKALLINVQ-KKASWVETSTQTSFRYESVTK 123
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P +++W ++GAVTP+K QG+C AVAA EGI+ I +LV LSEQ+LVDC +
Sbjct: 124 IPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGE 183
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPP 234
+ GC GG++DDAF++I + GI ++ Y Y+G++ C +K E H A+I YE VP
Sbjct: 184 SE-GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNK-TC-KVKKETHGVAEIKGYEKVPS 240
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIK 291
N+E++LLKAVANQPVSV IDA A ++YS G+FN C T NH V VGYG + +G K
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALDGSK 300
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YWL+KNSWG +WGE GY R++RDI +G CGIA + +P +
Sbjct: 301 YWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 161/329 (48%), Positives = 201/329 (61%), Gaps = 20/329 (6%)
Query: 19 QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
Q+T RT E + + +E W ++G+ Y E +RFEIFKDNL V+ N ++ R+Y
Sbjct: 39 QSTERT--EAHMMKMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQN--SVPGRTY 94
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAV 135
L L KFADLT +E+ A G KM +L+K+ +P V+W EKGAV
Sbjct: 95 KLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAV 154
Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
T VK QGQC V +VEGIN I L+SLSEQ+LVDC N GC GG MD A
Sbjct: 155 TEVKDQGQCGSCWAFSTVGSVEGINQIVTGDLISLSEQELVDC-DKAYNQGCNGGLMDYA 213
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
F++II+N GI ++A Y Y S +CDS + H I YEDVP NDEESL KAVANQP
Sbjct: 214 FEFIIKNGGIDSEADYPYRA-SDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQP 272
Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
VSVAI+A Q Y GVF G C T L+HGV AVGYGT E GI YW+++NSWG WGE
Sbjct: 273 VSVAIEAGGREFQLYQSGVFTGRCGTNLDHGVVAVGYGT-ENGIDYWIVRNSWGPKWGES 331
Query: 307 GYFRLQRDI-DQPQGQCGIAMFASFPVSK 334
GY R++R++ G+CGIAM AS+P K
Sbjct: 332 GYIRMERNVASTDTGKCGIAMEASYPTKK 360
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 160/355 (45%), Positives = 215/355 (60%), Gaps = 21/355 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+++L + S A+ + + E + + +E+W ++ + Y E KRF++FKDNL ++
Sbjct: 9 LLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQ 68
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY---KSSQV 123
N N +YTL LNKFAD+T +E+ A G + +K T Y Q+
Sbjct: 69 DHNAQ---NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQL 125
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P V+W KGAV P+K QG C VAAVEGIN I VSLSEQ+LVDC +
Sbjct: 126 PVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC-DREY 184
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MD AF++IIQN GI + Y Y+G+ G CD K + QI YEDVP N+
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGID-GTCDETKKKTKVVQIDGYEDVPSNN 243
Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +L KAV++QPVSVAI+AS ALQ Y GVF G C T L+HGV VGYGT E G+ YWL
Sbjct: 244 ENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT-ENGVDYWL 302
Query: 295 IKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVS--KESAQPSSADKSS 346
++NSWG WGEDGYF+++R++ +G+CGIAM S+PV SA PSS +S+
Sbjct: 303 VRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVPSSVYEST 357
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 164/344 (47%), Positives = 217/344 (63%), Gaps = 28/344 (8%)
Query: 5 FLIVVLIISGSCASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+I +L+I G+ SQA R + +IAEK EQW A++GRTY ++AE +RF+IFK+NL
Sbjct: 10 LVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLD 69
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YK 119
+E FN A N++Y L LNKF+DL+ +EF+ + G++M + P Y
Sbjct: 70 YIENFNKAF--NKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFSNYYN 127
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+VP S++W E G VT VK QG+C AVAAVEGI SLS QQL+DC
Sbjct: 128 QDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGIAG----NGASLSAQQLLDCV 183
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+N+GC GG M AF+YI+QN+GI +D Y YE + +C S + AA+IT YE V
Sbjct: 184 --GDNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYE-QTQEMCRS--GSNVAARITGYESV 238
Query: 233 PPNDEESLLKAVANQPVSVAIDASA---LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEE 288
EE+L +AVA QP+SVAIDAS+ + Y GVF+ C T L H VT VGYGT+E+
Sbjct: 239 I-QSEEALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTTED 297
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSWG++WGE GY RLQRD+ +G CGIAM AS+P
Sbjct: 298 GTKYWLVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYPT 341
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 160/355 (45%), Positives = 215/355 (60%), Gaps = 21/355 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+++L + S A+ + + E + + +E+W ++ + Y E KRF++FKDNL ++
Sbjct: 9 LLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQ 68
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY---KSSQV 123
N N +YTL LNKFAD+T +E+ A G + +K T Y Q+
Sbjct: 69 DHNAQ---NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQL 125
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P V+W KGAV P+K QG C VAAVEGIN I VSLSEQ+LVDC +
Sbjct: 126 PVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC-DREY 184
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MD AF++IIQN GI + Y Y+G+ G CD K + QI YEDVP N+
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGID-GTCDQTKKKTKVVQIDGYEDVPSNN 243
Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +L KAV++QPVSVAI+AS ALQ Y GVF G C T L+HGV VGYGT E G+ YWL
Sbjct: 244 ENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT-ENGVDYWL 302
Query: 295 IKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVS--KESAQPSSADKSS 346
++NSWG WGEDGYF+++R++ +G+CGIAM S+PV SA PSS +S+
Sbjct: 303 VRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVPSSVYEST 357
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 154/321 (47%), Positives = 202/321 (62%), Gaps = 20/321 (6%)
Query: 27 EGSIAEKFEQWKAQY--GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
E S+ +E W++ + R + ++RF +FK+N+ + N +R + L LNK
Sbjct: 33 EESLRGLYETWRSHHTVSRRGLGAEAEARRFNVFKENVRYIHEANKK---DRPFRLALNK 89
Query: 85 FADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKY 140
FAD+T EF + G ++ H S + G F+Y ++ +P +V+W +KGAVTP+K
Sbjct: 90 FADMTTDEFRRTYAGSRVRHHRSLSGGRRQGGGSFMYADAENLPAAVDWRQKGAVTPIKD 149
Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC + AVEGIN I+ RLVSLSEQ+L+DC +N+ GC GG MD AF++I
Sbjct: 150 QGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGEND-GCNGGLMDVAFQFIQ 208
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
QN GIT +A Y Y+G CD K H I YEDVP NDE +L KAVANQPVSVAI
Sbjct: 209 QNGGITTEASYPYQGEQNS-CDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAI 267
Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
DAS QFYS GVF T L+HGV AVGYGT+ +G KYW++KNSWG+DWGE GY R+
Sbjct: 268 DASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRM 327
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
QR + Q +G CGIAM AS+P
Sbjct: 328 QRGVKQAEGLCGIAMEASYPT 348
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 158/324 (48%), Positives = 207/324 (63%), Gaps = 20/324 (6%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLR 81
R+ DE + ++ WKAQ+ R+Y E+ +R EIF+DNL +++ N AA G S+ L
Sbjct: 38 RSDDE--VHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLG 95
Query: 82 LNKFADLTPQEFIASQTGFKMS----DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTP 137
L +FADLT +E+ ++ G + + +S++ +N F S +P S++W +KGAV
Sbjct: 96 LTRFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFR-SSDDLPDSIDWRDKGAVVD 154
Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG C +AAVEGIN I L+SLSEQ+LVDC T N GC GG MD AF+
Sbjct: 155 VKDQGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTY-YNQGCNGGLMDYAFE 213
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
+II N GI D Y Y G G CD + H I +YEDVP NDE+SL KAVANQPVS
Sbjct: 214 FIISNGGIDTDEDYPYTGRD-GSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVS 272
Query: 251 VAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
VAI+A A Q Y G+F GYC T L+HGVTA+GYG SE G YW++KNSWG DWGE GY
Sbjct: 273 VAIEAGGRAFQLYESGIFTGYCGTELDHGVTAIGYG-SENGKYYWIVKNSWGSDWGESGY 331
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
R++R+I+ G+CGIAM AS+P+
Sbjct: 332 IRMERNINSATGKCGIAMEASYPI 355
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 152/315 (48%), Positives = 199/315 (63%), Gaps = 15/315 (4%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFE+FKDNL ++ N +Y L LN+FADL+
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIV---SNYWLGLNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
QEF G K+ +N F Y+ +P SV+W +KGAVTPVK QGQC
Sbjct: 100 HQEFKNKYLGLKVDLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWA 159
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC T NNGC GG MD AF +I QN G+ +
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIGQNGGLHKEE 218
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M C+ K E I Y DVP N+E+SLLKA+ANQP+SVAI+AS+ QF
Sbjct: 219 DYPYI-MEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQF 277
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV+AVGYGTS + + Y ++KNSWG WGE G+ R++RDI +P+G
Sbjct: 278 YSGGVFDGHCGSDLDHGVSAVGYGTS-KNLDYIIVKNSWGAKWGEKGFIRMKRDIGKPEG 336
Query: 321 QCGIAMFASFPVSKE 335
CG+ AS+P K+
Sbjct: 337 ICGLYKMASYPTKKK 351
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 153/315 (48%), Positives = 202/315 (64%), Gaps = 17/315 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFEIFKDNL ++ N +Y L LN+FADL+
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVV---SNYWLGLNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
QEF G K+ D+S + + F YK ++P SV+W +KGAV PVK QG C
Sbjct: 100 HQEFKNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC NNGC GG MD AF +I++N G+ +
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENGGLHKEE 216
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G C+ K E I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS QF
Sbjct: 217 DYPYI-MEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 275
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV AVGYGT++ G+ Y ++KNSWG WGE GY R++R+I +P+G
Sbjct: 276 YSGGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334
Query: 321 QCGIAMFASFPVSKE 335
CGI AS+P K+
Sbjct: 335 ICGIYKMASYPTKKK 349
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 161/339 (47%), Positives = 207/339 (61%), Gaps = 16/339 (4%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+ + +I+ +CA RT E S+ E +QW +Y RTY S+E KR +IFK+NL
Sbjct: 2 KHLIGFCIILLWACAYPTMSRTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENL 61
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTPFLYKS 120
+E FNN +GN+SY L LN+++DLT +EFIAS TGFK+SD S +++ PF +
Sbjct: 62 EYIENFNN--VGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNL-N 118
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
VP + +W EKG VT VK Q QC AVAAVEGI IK L+SLSEQQLVDC
Sbjct: 119 DDVPTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDC-- 176
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ ++GC GG AF II+++GI + Y Y+ C + AAQI Y VP
Sbjct: 177 DRQSSGCGGGDFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQ-LGQIPGAAQINGYFKVP 235
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
NDE+ LL+AV QPVSVAI S Y GGV+ G C LNH VT +GYG SE G KY
Sbjct: 236 ANDEQQLLRAVLQQPVSVAISTSYDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKY 295
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WLIKNSWG+ WGE GY ++ R+ GQC IA+ A++P
Sbjct: 296 WLIKNSWGETWGEKGYMKVLRESSATGGQCSIAVHAAYP 334
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 162/361 (44%), Positives = 221/361 (61%), Gaps = 30/361 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFD--------EGSIAEKFEQWKAQYGRTYKESAENSKR 54
K F IV+ + C QA+ + FD E ++ + +E+W+ + T + S E KR
Sbjct: 2 KLFFIVLSFL---CLLQAS-KGFDFDEKELETEENVWKLYERWRDHHSVT-RASHEALKR 56
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKAN 112
F +F+ N++ V R N N+ Y L++N+FAD+T EF +S G + H K
Sbjct: 57 FNVFRHNVLHVHRTNKK---NKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLRGPKRG 113
Query: 113 GTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
F+Y++ ++VP SV+W EKGAVT VK Q C VAAVEGIN I+ N+LVSLS
Sbjct: 114 SGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLS 173
Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
EQ+LVDC T +N GC GG M+ AF++I N GI + Y Y+ C + +
Sbjct: 174 EQELVDCDTEENQ-GCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSIDGETV 232
Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVG 282
I +E VP NDEE+LLKAVA+QPVSVAIDA S Q YS GVF G C T LNHGV VG
Sbjct: 233 TIDGHEHVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVG 292
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
YG ++ G KYW+++NSWG +WGE GY R++R I + +G+CGIAM AS+P +K S+ PS+
Sbjct: 293 YGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP-TKVSSTPSTP 351
Query: 343 D 343
+
Sbjct: 352 E 352
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 153/320 (47%), Positives = 200/320 (62%), Gaps = 22/320 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+A++ E+W A++GR Y + AE ++R E+F+DN+ +E N AA + L N+FADLT
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVN-AAASQHKFWLEENQFADLT 59
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-----PPSVNWIEKGAVTPVKYQGQC 144
EF A++TG + SS + N P ++ + V P SV+W KGAV PVK QG C
Sbjct: 60 NAEFRATRTGLR----PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDC 115
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AVAA+EG + +LVSLSEQQLV C + GC GG MDDAF +II+N G
Sbjct: 116 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 175
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
+ ++ Y Y S C + A AA I YEDVP NDE +LLKAVANQPVSVAID
Sbjct: 176 LAAESDYPYTA-SDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGD 234
Query: 257 -ALQFYSGGVFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
QFY GGV +G C T L+H +TAVGYG + +G KYWL+KNSWG WGEDGY R++R
Sbjct: 235 RHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMER 294
Query: 314 DIDQPQGQCGIAMFASFPVS 333
+ +G CG+AM AS+P +
Sbjct: 295 GVADKEGVCGLAMMASYPTA 314
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 153/328 (46%), Positives = 202/328 (61%), Gaps = 27/328 (8%)
Query: 27 EGSIAEKFEQWKAQYG----------RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
E S+ +E+W+++Y R + ++RF +FK+N+ + N +R
Sbjct: 31 EESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKK---DR 87
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKG 133
+ L LNKFAD+T E S G ++ H S +A G + +PP+V+W EKG
Sbjct: 88 PFRLALNKFADMTTDELRHSYAGSRVRHHRALSGGRRAQGNFTYSDAENLPPAVDWREKG 147
Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AVT +K QGQC +AAVE IN I+ +LVSLSEQ+L+DC N N+ GC GG MD
Sbjct: 148 AVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDC-DNVNDQGCDGGLMD 206
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
AF++I +N G+T++A Y Y+G CD K H I YEDVP NDE +L KAVA
Sbjct: 207 YAFQFIQKNGGVTSEANYPYQGQQN-TCDQAKENTHDVAIDGYEDVPANDESALQKAVAY 265
Query: 247 QPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
QPVSVAI+AS QFYS GVF G C T L+HGV AVGYGT+ +G KYW++KNSWG DWG
Sbjct: 266 QPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWG 325
Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E GY R+QR + Q +G CGIAM AS+P+
Sbjct: 326 EKGYIRMQRGVSQAEGLCGIAMQASYPI 353
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 149/285 (52%), Positives = 192/285 (67%), Gaps = 15/285 (5%)
Query: 59 KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
K+N+ +E FNNAA N+ Y L +N+FADLT +EFI + F + H T F Y
Sbjct: 5 KENVNYIEAFNNAA--NKPYKLGINQFADLTSEEFIVPRNRF--NGHMRFSNTRTTTFKY 60
Query: 119 KSSQV-PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
++ V P S++W +KGAVTP+K QG C A+AA EGI+ I +LVSLSEQ++VD
Sbjct: 61 ENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVD 120
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C T ++GC GG+MD AFK+IIQN GI +A Y Y+G+ G C+ + HA IT YE
Sbjct: 121 CDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVD-GKCNIKEEAVHATTITGYE 179
Query: 231 DVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
DVP N+E++L KAVANQPVSVAIDA QFY G+F G C T L+HGVTAVGYG + E
Sbjct: 180 DVPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNE 239
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
G KYWL+KNSWG +WGE+GY +QR + +G CGIAM AS+P +
Sbjct: 240 GTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 203/317 (64%), Gaps = 14/317 (4%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E ++ K E+W Q+G++YK++AE KRF+IFK+N+ +E FN A+GN+ + L +N FA
Sbjct: 30 EPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFN--AVGNKPFNLSINHFA 87
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA 145
DLT +EF AS G K + T F Y + + VP S++W ++GAVTP+K QG C
Sbjct: 88 DLTNEEFKASLNGNKKLHDKFDILNETTSFRYHNVTSVPASMDWRKRGAVTPIKNQGSCG 147
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
VA++EGI+ I LVSLSEQ+L+DC N++GC GG+++DAFK+I + G+
Sbjct: 148 SCWAFSTVASIEGIHQITTGELVSLSEQELIDCV-RGNSSGCSGGYLEDAFKFIAKKGGM 206
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
++ Y Y+ C K H A+I YE VP N E LLKAVANQPVSV +DA
Sbjct: 207 ASETNYPYKETDEK-CKFKKESKHVAEIKGYEKVPSNSENDLLKAVANQPVSVYVDAGDY 265
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
QFYSGG+F G C T +H VT VGYG S + +YWL+KNSWG WGE GY +L+R++D
Sbjct: 266 VFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDYTEYWLVKNSWGTGWGEKGYMKLKRNVD 325
Query: 317 QPQGQCGIAMFASFPVS 333
+G CGIA S+PV+
Sbjct: 326 SKKGLCGIATNPSYPVA 342
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 200/339 (58%), Gaps = 18/339 (5%)
Query: 7 IVVLIISGSCA-SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
++ L + SCA +T + + + +E+W ++ + Y E KRF++FKDNL +
Sbjct: 12 LLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFI 71
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---Q 122
+ NN N +Y L LN+FAD+T +E+ G K +K T Y S +
Sbjct: 72 QEHNNNQ--NNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDR 129
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P V+W KGAV P+K QG C VA VE IN I + VSLSEQ+LVDC
Sbjct: 130 LPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC-DRA 188
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF++IIQN GI D Y Y G GICD K I +EDVPP
Sbjct: 189 YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFD-GICDPTKKNAKVVNIDGFEDVPPY 247
Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE +L KAVA+QPVS+AI+AS LQ Y GVF G C T L+HGV VGYG SE G+ YW
Sbjct: 248 DENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYG-SENGVDYW 306
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
L++NSWG WGEDGYF++QR++ P G+CGI M AS+PV
Sbjct: 307 LVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 158/339 (46%), Positives = 203/339 (59%), Gaps = 17/339 (5%)
Query: 5 FLIVVLIISGSCAS-QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ +L+IS S S A T +E +EQW + + Y E RFEIF DNL
Sbjct: 13 LIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFTDNLK 72
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQ 122
+E N ++ N+++ + L +FADLT EF A KM + + G +LYK
Sbjct: 73 YIEEHN--SVPNQTFEVGLTRFADLTNDEFRAIYLRSKM--ERTRVPVKGERYLYKVGDT 128
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P ++W KGAV PVK QG C A+ AVEGIN IK L+SLSEQ+LVDC T+
Sbjct: 129 LPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS- 187
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AFK+II+N GI + Y Y IC+S K I YEDVP N
Sbjct: 188 YNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVPQN 247
Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE+SL KA+ANQP+SVAI+A A Q Y GVF G C T L+HGV AVGYG SE G YW
Sbjct: 248 DEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYG-SEGGQDYW 306
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+++NSWG +WGE GYF+L+R+I + G+CG+AM AS+P
Sbjct: 307 IVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 158/320 (49%), Positives = 199/320 (62%), Gaps = 17/320 (5%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
++ +I E +E W A++ R Y E KRF +FKDN + + N GNRSY L LN+F
Sbjct: 34 EDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQ---GNRSYKLGLNQF 90
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC 144
ADL+ +EF A+ G K+ + + Y + +P S++W EKGAVT VK QG C
Sbjct: 91 ADLSHEEFKATYLGAKLDTKKRLSRPPSRRYQYSDGEDLPESIDWREKGAVTSVKDQGSC 150
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
VAAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II N G
Sbjct: 151 GSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGG 209
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+ ++ Y Y G CDS + H I +YEDVP NDE+SL KA ANQP+SVAI+AS
Sbjct: 210 LDSEEDYPYTAYD-GSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASG 268
Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
QFY GVF C T L+HGVT VGYG SE G YW +KNSWG+ WGE+G+ RLQR+I
Sbjct: 269 REFQFYDSGVFTSTCGTQLDHGVTLVGYG-SESGTDYWTVKNSWGKSWGEEGFIRLQRNI 327
Query: 316 D-QPQGQCGIAMFASFPVSK 334
+ G CGIAM AS+PV K
Sbjct: 328 EVASTGMCGIAMEASYPVKK 347
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 154/333 (46%), Positives = 209/333 (62%), Gaps = 23/333 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E + + +E+W++ + + AE +RF +FK+NL + + N+ +R Y L+LN FA
Sbjct: 33 EERLRDLYERWRSHH-TVSRSLAEKQERFNVFKENLKHIHKVNHK---DRPYKLKLNSFA 88
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQ 143
D+T EF+ G K+S H L+ GT +++ +S++P SV+W + GAVT +K QG+
Sbjct: 89 DMTNHEFLQHYGGSKVS-HYRVLRGQRQGTGSMHEDTSKLPSSVDWRKNGAVTGIKDQGK 147
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C VAAVEGIN IK L+SLSEQ+LVDC + +N+GC GG M+DAF +I Q
Sbjct: 148 CGSCWAFSTVAAVEGINKIKTGELISLSEQELVDC--DSDNHGCNGGLMEDAFNFIKQIG 205
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
G+T++ Y Y CDS K I YE VP NDE +L+KAVANQPV++A+DA
Sbjct: 206 GLTSENTYPYRAKEEP-CDSNKMNSPVVNIDGYEMVPENDENALMKAVANQPVAIAMDAG 264
Query: 257 A--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
LQFYS +F G C T LNHGV VGYGT+++G KYW++KNSWG DWGE GY R+QR
Sbjct: 265 GKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQDGTKYWIVKNSWGTDWGEKGYIRMQRG 324
Query: 315 IDQPQGQCGIAMFASFPV---SKESAQPSSADK 344
ID +G CGI M AS+PV S PS D+
Sbjct: 325 IDAEEGLCGITMEASYPVKLRSDNKKAPSRKDE 357
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 166/349 (47%), Positives = 209/349 (59%), Gaps = 25/349 (7%)
Query: 4 YFLIVVLIISGSC------ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
YFL V L I S Q RT E +E W +YG+ Y E +RFEI
Sbjct: 15 YFLSVCLAIDMSIIDYNLKHGQVPERT--EAETLRLYEMWLVKYGKAYNALGEKERRFEI 72
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA-NGTPF 116
FKDNL V++ N ++GN SY L LNKFADL+ +E+ A+ G +M L +
Sbjct: 73 FKDNLKFVDQHN--SVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPKSARY 130
Query: 117 LYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
L+K +P SV+W EKGAV PVK QGQC V AVEGIN I L SLSEQ+L
Sbjct: 131 LFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQEL 190
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC N GC GG MD AF++I++N GI + Y Y+ + + +CD + I
Sbjct: 191 VDC-DKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDS-MCDPNRKNARVVTIDG 248
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
YEDVP NDE+SL KAVANQPVSVAI+A A Q Y GVF G C T L+HGV AVGYGT
Sbjct: 249 YEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHGVVAVGYGT- 307
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSK 334
E G+ YW+++NSWG WGE+GY R++R++ + G+CGIAM AS+P K
Sbjct: 308 ENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYPTKK 356
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 157/342 (45%), Positives = 205/342 (59%), Gaps = 28/342 (8%)
Query: 27 EGSIAEKFEQWKAQY--------GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
E S+ +E+W+++Y G + E +RF +F +N + N G R +
Sbjct: 35 EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRR--GGRPF 92
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLY---KSSQVPPSVNWIEK 132
L LNKFAD+T EF + G + H S G F Y +PP+V+W E+
Sbjct: 93 RLALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRER 152
Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
GAVT +K QGQC AVAAVEG+N IK RLV+LSEQ+LVDC T DN GC GG M
Sbjct: 153 GAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQ-GCDGGLM 211
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
D AF++I +N GIT ++ Y Y G C+ KA H I YEDVP NDE +L KAVA
Sbjct: 212 DYAFQFIKRNGGITTESNYPYR-AEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270
Query: 246 NQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
NQPV+VA++AS QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DW
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330
Query: 304 GEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPSSADK 344
GE GY R+QR + G CGIAM AS+PV + +++++
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNAAASNR 372
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 153/320 (47%), Positives = 199/320 (62%), Gaps = 22/320 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+A++ E+W A++GR Y + AE +R E+F+DN+ +E N AA + L N+FADLT
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVN-AAASQHKFWLEENQFADLT 59
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-----PPSVNWIEKGAVTPVKYQGQC 144
EF A++TG + SS + N P ++ + V P SV+W KGAV PVK QG C
Sbjct: 60 NAEFRATRTGLR----PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDC 115
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AVAA+EG + +LVSLSEQQLV C + GC GG MDDAF +II+N G
Sbjct: 116 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 175
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
+ ++ Y Y S C + A AA I YEDVP NDE +LLKAVANQPVSVAID
Sbjct: 176 LAAESDYPYTA-SDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGD 234
Query: 257 -ALQFYSGGVFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
QFY GGV +G C T L+H +TAVGYG + +G KYWL+KNSWG WGEDGY R++R
Sbjct: 235 RHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMER 294
Query: 314 DIDQPQGQCGIAMFASFPVS 333
+ +G CG+AM AS+P +
Sbjct: 295 GVADKEGVCGLAMMASYPTA 314
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 154/327 (47%), Positives = 207/327 (63%), Gaps = 20/327 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E ++ E +E+W+ Q+ R ++ E ++RF +FKDN+ + FN + Y LRLN+F
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR---DEPYKLRLNRFG 96
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANG---TPFLYKSSQ-VPPSVNWIEKGAVTPVKYQG 142
D+T EF + ++S H + G + F+Y ++ +P +V+W EKGAV VK QG
Sbjct: 97 DMTADEFRRAYASSRVS-HHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQG 155
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC +AAVEGINAI+ + L +LSEQQLVDC T N GC GG MD+AF+YI ++
Sbjct: 156 QCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKH 215
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+ + Y Y + C S A A I YEDVP N E +L KAVANQPVSVAI+A
Sbjct: 216 GGVAASSAYPYRARQS-SCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEA 274
Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S QFYS GVF G C T L+HGV AVGYGT+ +G KYW+++NSWG DWGE GY R++R
Sbjct: 275 GGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKR 334
Query: 314 DIDQPQGQCGIAMFASFPVSKESAQPS 340
D+ +G CGIAM AS+P+ K S P+
Sbjct: 335 DVSAKEGLCGIAMEASYPI-KTSPNPA 360
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 219/361 (60%), Gaps = 26/361 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENSKR 54
M +F++++ +S AS+ FDE + + +E+W+ + + + S E KR
Sbjct: 1 MKLFFIVLISFLSLLQASKGF--DFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKR 57
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKAN 112
F +F+ N++ V R N N+ Y L++N+FAD+T EF +S G + H K
Sbjct: 58 FNVFRHNVLHVHRTNKK---NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRG 114
Query: 113 GTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
F+Y++ ++VP SV+W EKGAVT VK Q C VAAVEGIN I+ N+LVSLS
Sbjct: 115 SGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLS 174
Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
EQ+LVDC T +N GC GG M+ AF++I N GI + Y Y+ C +
Sbjct: 175 EQELVDCDTEENQ-GCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETV 233
Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVG 282
I +E VP NDEE LLKAVA+QPVSVAIDA S Q YS GVF G C T LNHGV VG
Sbjct: 234 TIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVG 293
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
YG ++ G KYW+++NSWG +WGE GY R++R I + +G+CGIAM AS+P +K S+ PS+
Sbjct: 294 YGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP-TKLSSTPSTH 352
Query: 343 D 343
+
Sbjct: 353 E 353
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 168/344 (48%), Positives = 224/344 (65%), Gaps = 30/344 (8%)
Query: 6 LIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
L +VL+I + SQA R DE ++AEK EQW A++GRTY++ E +RF IFK NL
Sbjct: 9 LAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKH 68
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM-----SDHSSSLKANGTPFLYK 119
+E FNNA NR+Y L LN FADLT +EF+A+ TG+KM + + ++ + LY+
Sbjct: 69 IENFNNAF--NRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYE 126
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
++ VP S++W +G VTPVK QG+C A AAVEGI I VSLS QQL+DC
Sbjct: 127 AN-VPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGI----IGNGVSLSAQQLLDCV 181
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
++NGC GGFMD+AF+YIIQN+G+ + Y Y+ M + + ++AA+I+ Y DV
Sbjct: 182 --PDSNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMR----EMCRPSNNAARISGYVDV 235
Query: 233 PPNDEESLLKAVANQPVSVAIDASA---LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEE 288
P DEE+L AVA QPVS A+DA++ ++Y GG+F C + L H +T VGYGTS E
Sbjct: 236 TPADEETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAE 295
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWLIKNSWG+ WGE GY RLQRD+ G CGIA+ AS+P
Sbjct: 296 GTKYWLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYPT 339
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 168/356 (47%), Positives = 217/356 (60%), Gaps = 28/356 (7%)
Query: 1 MAKYFLIVVLIISGSCASQA-------TYRTFD---EGSIAEKFEQWKAQYGRTYKESAE 50
M L VL +S S + +Y + D + +I E +E W AQ+ + Y E
Sbjct: 1 MGILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDE 60
Query: 51 NSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
K+F +FKDN + + + NN GN SY L LN+FADL+ +EF A+ G K+ D L
Sbjct: 61 KQKKFSVFKDNFLYIHQHNNQ--GNPSYKLGLNQFADLSHEEFKAAYLGTKL-DAKKRLS 117
Query: 111 ANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLV 161
+ +P + Y + +P S++W EKGAVT VK QG C VAAVEGIN I L
Sbjct: 118 RSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLT 177
Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
SLSEQ+LVDC T+ N GC GG MD AF++II N G+ ++ Y Y+ + G CD+ +
Sbjct: 178 SLSEQELVDCDTS-YNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKA-NNGSCDAYRKNA 235
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
H I +YEDVP NDE+SL KA ANQP+SVAI+AS A QFY GVF C T L+HGVT
Sbjct: 236 HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVT 295
Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID-QPQGQCGIAMFASFPVSK 334
VGYG SE GI YWL+KNSWG WGE G+ +LQR+++ G CGIAM AS+PV K
Sbjct: 296 LVGYG-SESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPVKK 350
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 156/342 (45%), Positives = 204/342 (59%), Gaps = 28/342 (8%)
Query: 27 EGSIAEKFEQWKAQY--------GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
E S+ +E+W+++Y G + E +RF +F +N + N G R +
Sbjct: 35 EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRR--GGRPF 92
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLY---KSSQVPPSVNWIEK 132
L LNKFAD+T EF + G + H S G F Y +PP+V+W E+
Sbjct: 93 RLALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRER 152
Query: 133 GAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
GAVT +K QGQC VAAVEG+N IK RLV+LSEQ+LVDC T DN GC GG M
Sbjct: 153 GAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQ-GCDGGLM 211
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
D AF++I +N GIT ++ Y Y G C+ KA H I YEDVP NDE +L KAVA
Sbjct: 212 DYAFQFIKRNGGITTESNYPYR-AEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270
Query: 246 NQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
NQPV+VA++AS QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DW
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330
Query: 304 GEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPSSADK 344
GE GY R+QR + G CGIAM AS+PV + +++++
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNAAASNR 372
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 153/315 (48%), Positives = 201/315 (63%), Gaps = 17/315 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFEIFKDNL ++ N +Y L LN+FADL+
Sbjct: 44 LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVV---SNYWLGLNEFADLS 100
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
QEF G K+ D+S + + F YK ++P SV+W +KGAVT VK QG C
Sbjct: 101 HQEFKNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVTQVKNQGSCGSCWA 158
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC NNGC GG MD AF +I++N G+ +
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENDGLHKEE 217
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G C+ K E I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS QF
Sbjct: 218 DYPYI-MEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV AVGYGT++ G+ Y +KNSWG WGE GY R++R+I +P+G
Sbjct: 277 YSGGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 321 QCGIAMFASFPVSKE 335
CGI AS+P K+
Sbjct: 336 ICGIYKMASYPTKKK 350
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 153/341 (44%), Positives = 201/341 (58%), Gaps = 60/341 (17%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
+Y + +L I + ASQAT R+ E S+ E+ E W A+YGR YK++ E KRF+IFKDN+
Sbjct: 8 QYVSMALLFILAAWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
A T F Y++ +
Sbjct: 68 ------------------------------------------------AQATTFKYENVT 79
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP +++W +KGAVTP+K Q QC AVAA EGI I +L+SLSEQ+LVDC T
Sbjct: 80 AVPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTG 139
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
N GC GG DDAF++I + G+ ++A Y YEG G C+S K AA+I YEDVP
Sbjct: 140 GENQGCSGGLXDDAFRFIXIH-GLASEATYPYEG-DDGTCNSKKEAHPAAKIKGYEDVPA 197
Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N+E++L KAVA+QPV+VAIDA QFY+ GVF G C T L+HGV AVGYG ++G+ Y
Sbjct: 198 NNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXY 257
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WL+KNSWG WGE+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 258 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 152/315 (48%), Positives = 201/315 (63%), Gaps = 17/315 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RF+IFKDNL ++ N +Y L LN+FADL+
Sbjct: 43 LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVV---SNYWLGLNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
QEF G K+ D+S + + F YK ++P SV+W +KGAVT VK QG C
Sbjct: 100 HQEFKNKYLGLKV-DYSRR-RESPEEFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWA 157
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC NNGC GG MD AF +I++N G+ +
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENGGLHKEE 216
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G C+ K E I+ Y DVP N+E+SLLKA+ NQP+SVAI+AS QF
Sbjct: 217 DYPYI-MEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQF 275
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV AVGYGTS+ G+ Y ++KNSWG WGE GY R++R+I +P+G
Sbjct: 276 YSGGVFDGHCGSDLDHGVAAVGYGTSK-GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334
Query: 321 QCGIAMFASFPVSKE 335
CGI AS+P K+
Sbjct: 335 ICGIYKMASYPTKKK 349
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 167/356 (46%), Positives = 216/356 (60%), Gaps = 28/356 (7%)
Query: 1 MAKYFLIVVLIISG--SCASQATYRTF--------DEGSIAEKFEQWKAQYGRTYKESAE 50
M L VL +S AS+A + ++ +I E +E W AQ+ + Y E
Sbjct: 1 MGILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGE 60
Query: 51 NSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
RF +FKDN + + + NN GN SY L LN+FADL+ +EF A+ G K+ D L
Sbjct: 61 KQNRFSVFKDNFLYIHQHNNQ--GNPSYKLGLNQFADLSHEEFKATYLGAKL-DTKKRLS 117
Query: 111 ANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLV 161
+ +P + Y + +P S++W EKGAVT VK QG C VAAVEGIN I L
Sbjct: 118 NSPSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLT 177
Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
SLSEQ+LVDC T+ N GC GG MD AF++II N G+ ++ Y Y+ + G CD+ +
Sbjct: 178 SLSEQELVDCDTS-YNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKA-NDGSCDAYRKNA 235
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
H I +YEDVP NDE+SL KA ANQP+SVAI+AS A QFY GVF C T L+HGVT
Sbjct: 236 HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVT 295
Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ-PQGQCGIAMFASFPVSK 334
VGYG SE G YW++KNSWG+ WGE G+ RLQR+I+ G CGIAM AS+P+ K
Sbjct: 296 LVGYG-SESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPLKK 350
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 152/315 (48%), Positives = 201/315 (63%), Gaps = 17/315 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFEIFKDNL ++ N +Y L LN+FADL+
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVV---SNYWLGLNEFADLS 100
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
+EF G K+ D+S + + F YK ++P SV+W +KGAV PVK QG C
Sbjct: 101 HREFNNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC NNGC GG MD AF +I++N G+ +
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENGGLHKEE 217
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G C+ K E I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS QF
Sbjct: 218 DYPYI-MEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV AVGYGT++ G+ Y +KNSWG WGE GY R++R+I +P+G
Sbjct: 277 YSGGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 321 QCGIAMFASFPVSKE 335
CGI AS+P K+
Sbjct: 336 ICGIYKMASYPTKKK 350
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 154/319 (48%), Positives = 209/319 (65%), Gaps = 20/319 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ + +E+W++ Y ++ E +KRF +FK+N V + N ++ Y L+LNKFA
Sbjct: 33 EESLWDLYERWRS-YHTVSRDLEEKNKRFNVFKENTKHVHKVNQM---DKPYKLKLNKFA 88
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKAN--GTP-FLY-KSSQVPPSVNWIEKGAVTPVKYQG 142
D+T EF +S G K+ H L+ + GT F++ K++ +PPSV+W +KGAVT +K QG
Sbjct: 89 DMTNHEFRSSYGGSKVK-HYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQG 147
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
+C V VEGIN IK L+SLSEQQL+DC +D++ GC GG M+ AF++I +N
Sbjct: 148 KCGSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDH-GCNGGLMESAFEFIKKN 206
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
GIT + Y Y+ CD +K I +E VP NDE +L+KAVA+QPVSVAIDA
Sbjct: 207 GGITTENNYPYKAKDER-CDMLKMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDA 265
Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S LQFYS GVF+G C T L+HGV VGYGT+ +G KYW++KNSWG +WGE GY R+ R
Sbjct: 266 GGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMAR 325
Query: 314 DIDQPQGQCGIAMFASFPV 332
I +GQCGIAM AS+PV
Sbjct: 326 GIQAAEGQCGIAMEASYPV 344
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 154/319 (48%), Positives = 209/319 (65%), Gaps = 20/319 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ + +E+W++ Y ++ E +KRF +FK+N V + N ++ Y L+LNKFA
Sbjct: 31 EESLWDLYERWRS-YHTVSRDLEEKNKRFNVFKENTKHVHKVNQM---DKPYKLKLNKFA 86
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKAN--GTP-FLY-KSSQVPPSVNWIEKGAVTPVKYQG 142
D+T EF +S G K+ H L+ + GT F++ K++ +PPSV+W +KGAVT +K QG
Sbjct: 87 DMTNHEFRSSYGGSKVK-HYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQG 145
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
+C V VEGIN IK L+SLSEQQL+DC +D++ GC GG M+ AF++I +N
Sbjct: 146 KCGSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDH-GCNGGLMESAFEFIKKN 204
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
GIT + Y Y+ CD +K I +E VP NDE +L+KAVA+QPVSVAIDA
Sbjct: 205 GGITTENNYPYKAKDE-RCDMLKMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDA 263
Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S LQFYS GVF+G C T L+HGV VGYGT+ +G KYW++KNSWG +WGE GY R+ R
Sbjct: 264 GGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMAR 323
Query: 314 DIDQPQGQCGIAMFASFPV 332
I +GQCGIAM AS+PV
Sbjct: 324 GIQAAEGQCGIAMEASYPV 342
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 156/321 (48%), Positives = 199/321 (61%), Gaps = 22/321 (6%)
Query: 27 EGSIAEKFEQWKAQYG---RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLN 83
E S+ +E+W++ Y R AE +RF +FK+N V N +R + L LN
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEE-RRFNVFKENARYVHEGNKR---DRPFRLALN 89
Query: 84 KFADLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKY 140
KFAD+T EF + G ++ H S + +G + +PP+V+W +KGAVT +K
Sbjct: 90 KFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKD 149
Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC + AVEGIN I+ +LVSLSEQ+L+DC N NN GC GG MD AF++I
Sbjct: 150 QGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDC-DNVNNQGCEGGLMDYAFQFI- 207
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
Q GIT ++ Y Y+G G CD K A I YEDVP NDE +L KAVA QPVSVAI
Sbjct: 208 QKNGITTESNYPYQG-EQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAI 266
Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
DAS QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DWGE GY R+
Sbjct: 267 DASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRM 326
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
QR + Q +G CGIAM AS+P
Sbjct: 327 QRGVSQTEGLCGIAMQASYPT 347
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 215/340 (63%), Gaps = 19/340 (5%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
++VV ++ SQ R E + K E+W AQYG+ YK++AE KRF+IFK+N+ +
Sbjct: 10 ILVVFLVLTVWTSQVMSRRLSEAYSSVKHEKWMAQYGKVYKDAAEKEKRFQIFKNNVHFI 69
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS-SSLKANGTPFLYKS-SQV 123
E F+ A G++ + L +N+FADL +F A + +H+ + A F Y S +++
Sbjct: 70 ESFH--AAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEASFKYDSVTRI 125
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P S++W ++GAVTP+K QG C VA +EG++ I LVSLSEQ+LVDC D+
Sbjct: 126 PSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQELVDCVKGDS 185
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA-QITNYEDVPPN 235
GCYGG+++DAF++I + G+ ++ Y Y+G++ C +K E H QI YE VP N
Sbjct: 186 E-GCYGGYVEDAFEFIAKKGGVASETHYPYKGVNK-TCK-VKKETHGVVQIKGYEQVPSN 242
Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
E++LLKAVA+QPVS ++A A QFYS G+F G C T ++H VT VGYG + G KYW
Sbjct: 243 SEKALLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKARGGNKYW 302
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
L+KNSWG +WGE GY R++RDI +G CGIA A +P +
Sbjct: 303 LVKNSWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 153/339 (45%), Positives = 203/339 (59%), Gaps = 30/339 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATY---RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
MA +++ II C +T R + ++ EK EQW A++ R YK+S E ++RF+
Sbjct: 1 MAIPKALLLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKA 60
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG---- 113
FK N+ +E FN GN + L +N+F DLT EF A++T + LK NG
Sbjct: 61 FKANVAFIESFNT---GNHKFWLGVNQFTDLTNDEFRATKT-------NKGLKRNGARAP 110
Query: 114 TPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSL 163
T F Y + +P +V+W KG VTP+K QGQC AVAA EGI + +LVSL
Sbjct: 111 TRFKYNNVSTDALPAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSL 170
Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
SEQ+LVDC + + GC GG MD+AFK+II+N G+T +A Y Y G C + +
Sbjct: 171 SEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQD-GQCKTSTTSNSV 229
Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAV 281
A I YEDVP NDE SL+KAVANQPVSVA+D Q YSGGV G C T L+HG+ A+
Sbjct: 230 ATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAI 289
Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
GYG + +G K+WL+KNSWG WGE GY R+++DI G
Sbjct: 290 GYGMTSDGTKFWLLKNSWGTTWGESGYLRMEKDISDKSG 328
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 153/344 (44%), Positives = 215/344 (62%), Gaps = 19/344 (5%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
++ F + ++ + A+++++RT DE + +E+W ++G+ Y E KRFEIFKD
Sbjct: 20 LSSAFDMSIISYHQTHATKSSWRTDDE--VMAMYEEWLVKHGKNYNALGEKEKRFEIFKD 77
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK- 119
NL+ +++ N+ NR+YT+ LN+FADLT +EF + G + + H L + +
Sbjct: 78 NLMFIDQHNSE---NRTYTVGLNRFADLTNEEFRSMYLGTR-TGHKKRLPKTSDRYAPRV 133
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
+P SV+W ++GAV VK QG C +AAVEGIN I L++LSEQ+LVDC
Sbjct: 134 GDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCD 193
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
T+ N GC GG MD AF++II N GI + Y Y G G CD+ + I +YEDV
Sbjct: 194 TS-YNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRD-GRCDTYRKNAKVVSIDSYEDV 251
Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P NDE +L KAVANQPVSVAI+ Q Y+ GVF G C T L+HGV AVGYGT E+G
Sbjct: 252 PENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGT-EKGK 310
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
YW+++NSWG+ WGE GY R++R+I P G+CGIA+ S+P+ K
Sbjct: 311 DYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPIKK 354
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 153/326 (46%), Positives = 211/326 (64%), Gaps = 21/326 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E + + +E+W++ + + + E RF +FK N++ V N ++ Y L+LN+FA
Sbjct: 33 EEGLWDLYERWRSHHTVS-RSLDEKHNRFNVFKGNVMHVHSSNKM---DKPYKLKLNRFA 88
Query: 87 DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQG 142
D+T EF + G K++ H + + NGT F+Y++ +VP SV+W +KGAVT VK QG
Sbjct: 89 DMTNHEFRSIYAGSKVNHHRMFRGTPRGNGT-FMYQNVDRVPSSVDWRKKGAVTDVKDQG 147
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC + AVEGIN IK ++LV LSEQ+LVDC T N GC GG M+ AF++I Q
Sbjct: 148 QCGSCWAFSTIVAVEGINQIKTHKLVPLSEQELVDCDTT-QNQGCNGGLMESAFEFIKQ- 205
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
GIT + Y YE G CD+ K + A I +E+VP N+E +LLKAVA+QPVSVAI+A
Sbjct: 206 YGITTASNYPYEA-KDGTCDASKVNEPAVSIDGHENVPVNNEAALLKAVAHQPVSVAIEA 264
Query: 256 SAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
+ QFYS GVF G C T L+HGV VGYGT+++G KYW +KNSWG +WGE GY R++R
Sbjct: 265 GGIDFQFYSEGVFTGNCGTALDHGVAIVGYGTTQDGTKYWTVKNSWGSEWGEKGYIRMKR 324
Query: 314 DIDQPQGQCGIAMFASFPVSKESAQP 339
I +G CGIAM AS+P+ K S++P
Sbjct: 325 SISVKKGLCGIAMEASYPIKKSSSKP 350
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 153/344 (44%), Positives = 215/344 (62%), Gaps = 19/344 (5%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
++ F + ++ + A+++++RT DE + +E+W ++G+ Y E KRFEIFKD
Sbjct: 11 LSSAFDMSIISYHQTHATKSSWRTDDE--VMAMYEEWLVKHGKNYNALGEKEKRFEIFKD 68
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK- 119
NL+ +++ N+ NR+YT+ LN+FADLT +EF + G + + H L + +
Sbjct: 69 NLMFIDQHNSE---NRTYTVGLNRFADLTNEEFRSMYLGTR-TGHKKRLPKTSDRYAPRV 124
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
+P SV+W ++GAV VK QG C +AAVEGIN I L++LSEQ+LVDC
Sbjct: 125 GDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCD 184
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
T+ N GC GG MD AF++II N GI + Y Y G G CD+ + I +YEDV
Sbjct: 185 TS-YNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRD-GRCDTYRKNAKVVSIDSYEDV 242
Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P NDE +L KAVANQPVSVAI+ Q Y+ GVF G C T L+HGV AVGYGT E+G
Sbjct: 243 PENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGT-EKGK 301
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
YW+++NSWG+ WGE GY R++R+I P G+CGIA+ S+P+ K
Sbjct: 302 DYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPIKK 345
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 207/315 (65%), Gaps = 21/315 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+AE+ E+W A+Y R YK++AE ++RFE+FKDN VE FN A + L +N+FADLT
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFN--ADKKNKFWLGVNQFADLT 58
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQC-- 144
+EF A++ GFK S+ + T F Y++ S +P +V+W KGAVTP+K QGQC
Sbjct: 59 TEEFKANK-GFK---PISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGC 114
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
A+AA+EGI + LVSLSEQ+ VDC T++ + GC GG+MD+AF+++I+N G+
Sbjct: 115 CWAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLA 174
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQ 259
++ Y Y+ + G C AA I +EDVPPN+E +L+K VA+QPVSVA+DAS
Sbjct: 175 TESSYPYK-VVDGKCKG--GSKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRT 231
Query: 260 F--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
F YSGGV G C T L+HG+ A+GYG + KYW++KNSWG WGE G+ R+++DI
Sbjct: 232 FMLYSGGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISD 291
Query: 318 PQGQCGIAMFASFPV 332
+G C +AM S+P
Sbjct: 292 KRGMCDLAMKPSYPT 306
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 204/318 (64%), Gaps = 17/318 (5%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
R DE ++ ++ W ++GR Y ++ E + R+ +FK N+ ++ER N G ++ L +
Sbjct: 20 RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYG-LTFKLAV 78
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVK 139
N+FADLT +EF + TG+K + SS + T F Y+ S +P SV+W +KGAVTP+K
Sbjct: 79 NQFADLTNEEFRSMYTGYKGNSVLSS-RTKPTSFRYQHVSSDALPISVDWRKKGAVTPIK 137
Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QG C AVAA+EG+ IK +L+SLSEQ+LVDC TND+ GC GG+M+ AF Y
Sbjct: 138 DQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDD--GCMGGYMNSAFNYT 195
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
+ G+T+++ Y Y+ + G C+ K + A I +EDVP NDE++L+KAVA+ PVS+
Sbjct: 196 MTTGGLTSESNYPYKS-TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIG 254
Query: 253 I--DASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
I + QFYS GVF+G C T L+HGV VGYG S G KYW++KNSWG WGE GY R
Sbjct: 255 IAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMR 314
Query: 311 LQRDIDQPQGQCGIAMFA 328
+++D GQCG+AM A
Sbjct: 315 IKKDTKAKHGQCGLAMNA 332
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 201/315 (63%), Gaps = 17/315 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFEIFKDNL ++ N +Y L L++FADL+
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVV---SNYWLGLSEFADLS 100
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
+EF G K+ D+S + + F YK ++P SV+W +KGAV PVK QG C
Sbjct: 101 HREFNNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC NNGC GG MD AF +I++N G+ +
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENGGLHKEE 217
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G C+ K E I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS QF
Sbjct: 218 DYPYI-MEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV AVGYGT++ G+ Y +KNSWG WGE GY R++R+I +P+G
Sbjct: 277 YSGGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 321 QCGIAMFASFPVSKE 335
CGI AS+P K+
Sbjct: 336 ICGIYKMASYPTKKK 350
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 150/315 (47%), Positives = 201/315 (63%), Gaps = 16/315 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFE+FKDNL ++ N +Y L LN+FADL+
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVV---SNYWLGLNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
QEF G K+ D S +++ F Y+ +P SV+W +KGAVTPVK QGQC
Sbjct: 100 HQEFKNKYLGLKV-DLSQRRESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWA 158
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC T NNGC GG MD AF +I++N G+ +
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVKNGGLHKEE 217
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M C+ K I Y DVP N+E+SLLKA+ANQP+SVAI+AS QF
Sbjct: 218 DYPYI-MEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV+AVGYGTS+ G+ Y ++KNSWG WGE G+ R++R+I + +G
Sbjct: 277 YSGGVFDGHCGSELDHGVSAVGYGTSK-GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEG 335
Query: 321 QCGIAMFASFPVSKE 335
CG+ AS+P K+
Sbjct: 336 ICGLYKMASYPTKKK 350
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 148/327 (45%), Positives = 213/327 (65%), Gaps = 14/327 (4%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
AS+AT R E S+ E+ EQW A+Y R YK+ AE +RF +FKDN+ ++ F+ A GN
Sbjct: 18 ASEATSRPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTA--GNM 75
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAV 135
L +N AD+T +EF AS FK+ + L++ T F +++ +++P +++W +K V
Sbjct: 76 PNKLGVNALADMTHEEFRASGNTFKIPP-NLGLRSETTSFRHQNVTRIPSTMDWRKKRTV 134
Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
T +K Q QC AVAA+EGI ++ ++ +SLSEQ+LVDC +N GC GG MDDA
Sbjct: 135 THIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDA 194
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
FK+IIQN+G+ ++A Y Y+G+ G C+ K AA+I +YE++P E++LLK VA+QP
Sbjct: 195 FKFIIQNRGLNSEARYLYKGVE-GHCNKKKESSRAARINDYENMPEFSEKALLKVVAHQP 253
Query: 249 VSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
+SVAIDA SA QFY G+ L++GVT GYG S +G K+WL+KNSWG DWGE+
Sbjct: 254 ISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGEN 313
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
GY R++R + G CG M AS+P +
Sbjct: 314 GYTRMERGVKATTGLCGFTMQASYPTA 340
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 220/342 (64%), Gaps = 19/342 (5%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
++LI+ L+++ S R E +E+ E+W AQYGR YK++AE KRF++FK+N+
Sbjct: 8 HYLILFLVLA-VWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVH 66
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E FN A G++ + L +N+FADL +EF A + S + T F Y+S ++
Sbjct: 67 FIESFN--AAGDKPFNLSINQFADLNDEEFKALLINVQ-KKASWVETSTETSFRYESVTK 123
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P +++ ++GAVTP+K QG+C AVAA EGI+ I +LV LSEQ+LVDC +
Sbjct: 124 IPATIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGE 183
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPP 234
+ GC GG++DDAF++I + GI ++ Y Y+G++ C +K E H A+I YE VP
Sbjct: 184 SE-GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNK-TC-KVKKETHGVAEIKGYEKVPS 240
Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIK 291
N+E++LLKAVANQPVSV IDA A ++YS G+FN C T NH V VGYG + + K
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDDSK 300
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YWL+KNSWG +WGE GY R++RDI +G CGIA + +P++
Sbjct: 301 YWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 195/312 (62%), Gaps = 14/312 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E W A++GR Y E +RF +F DNL V+ N A + L +N+FADLT EF
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERA-AEHGFRLGMNQFADLTNDEF 167
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQGQC------- 144
A+ G ++ A G + + + ++P SV+W EKGAV PVK QGQC
Sbjct: 168 RAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 227
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
AV++VE +N I +V+LSEQ+LV+C+T+ N+GC GG MD AF +II+N GI + Y
Sbjct: 228 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 287
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YS 262
Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A +F Y
Sbjct: 288 PYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYK 346
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GVF G C T L+HGV AVGYGT E G YW+++NSWG WGEDGY R++R+++ G+C
Sbjct: 347 AGVFTGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 405
Query: 323 GIAMFASFPVSK 334
GIAM AS+P K
Sbjct: 406 GIAMMASYPTKK 417
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 151/318 (47%), Positives = 195/318 (61%), Gaps = 15/318 (4%)
Query: 27 EGSIAEKFEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
E + +EQW A++G+ + E+ +RF F DNL V+ +NA G R Y L +N+F
Sbjct: 45 EAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVD-AHNARAGARGYRLGINRF 103
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC 144
ADLT EF A+ + + ++ A G + + + +P V+W +KGAV PVK QGQC
Sbjct: 104 ADLTNAEFRAAYLS-AGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQC 162
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AV AVEGIN I LV+LSEQ+LVDC+ N N GC GG MDDAF +I+ N G
Sbjct: 163 GSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGG 222
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
I D Y Y G CD K H I +E VP NDE+SL KAVA+QPV+VAI+A
Sbjct: 223 IDTDKDYPYTARD-GKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEAGG 281
Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGYFRLQRD 314
Q Y GVF G C T L+HGV AVGYGT +G + YWL++NSWG DWGE GY R++R+
Sbjct: 282 REFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRMERN 341
Query: 315 IDQPQGQCGIAMFASFPV 332
+ G+CGIAM AS+PV
Sbjct: 342 VGARAGKCGIAMEASYPV 359
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 160/360 (44%), Positives = 218/360 (60%), Gaps = 32/360 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENSKR 54
MA +++ L+++ + A F+E +A + +E+W++ + ++ +E +KR
Sbjct: 1 MATKSMLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHH-TVSRDLSEKNKR 59
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
F +FK+N + FN + Y L LNKFAD+T QEF ++ G K+ H + GT
Sbjct: 60 FNVFKENAKFIHEFNKK---DAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQ---RGT 113
Query: 115 P-----FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLV 161
P F+Y++ +P SV+W +GAV PVK QGQC +A+VEGIN IK N+LV
Sbjct: 114 PRATGSFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLV 173
Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
LS QQLVDC T D N GC GG MD AF++I N GIT+++ Y Y G C S ++
Sbjct: 174 PLSGQQLVDCDT-DQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTA-EQGSCAS-ESSA 230
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
I YEDVP N+E +L+KAVANQ VSVAI+AS A QFYS GVF G C L+HGV
Sbjct: 231 PVVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVA 290
Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
VGYG + +G KYW+++NSWG +WGE GY R+QR I G CGIAM S+P+ K S P
Sbjct: 291 VVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEPSYPL-KTSPNP 349
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 150/319 (47%), Positives = 199/319 (62%), Gaps = 22/319 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E+W+ ++ ++ + ++RF +FK+N+ + FN + Y LRLN+F D+T EF
Sbjct: 47 YERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQR---DEPYKLRLNRFGDMTADEF 102
Query: 94 IASQTGFKMSDHS---SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA---- 145
G +++ H + + + F+Y ++ +P SV+W +KGAVT VK QGQC
Sbjct: 103 RRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQCGSCWA 162
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
+AAVEGINAIK L SLSEQQLVDC T N GC GG MD AF+YI ++ G+ +
Sbjct: 163 FSTIAAVEGINAIKTKNLTSLSEQQLVDCDTK-GNAGCDGGLMDYAFQYIAKHGGVAAED 221
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y+ C A A I YEDVP NDE +L KAVA+QPVSVAI+AS QF
Sbjct: 222 AYPYKARQAS-CKKSPAP--AVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQF 278
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YS GVF G C T L+HGVTAVGYG + +G KYW++KNSWG +WGE GY R+ RD+ +G
Sbjct: 279 YSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVAAKEG 338
Query: 321 QCGIAMFASFPVSKESAQP 339
CGIAM AS+PV K S P
Sbjct: 339 HCGIAMEASYPV-KTSPNP 356
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 197/322 (61%), Gaps = 25/322 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E+W+ ++ ++ + ++RF +FK N+ + FN + Y LRLN+F D+T EF
Sbjct: 156 YERWRGRHA-LARDLGDKARRFNVFKANVRLIHEFNRR---DEPYKLRLNRFGDMTADEF 211
Query: 94 IASQTGFKMSDHS------SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA- 145
G +++ H A+ + F+Y ++ VP SV+W +KGAVT VK QGQC
Sbjct: 212 RRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCGS 271
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
+AAVEGINAIK L SLSEQQLVDC T N GC GG MD AF+YI ++ G+
Sbjct: 272 CWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTK-ANAGCNGGLMDYAFQYIAKHGGVA 330
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
+ Y Y C K+ I YEDVP NDE +L KAVA+QPVSVAI+AS
Sbjct: 331 AEDAYPYRARQAS-CK--KSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSH 387
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
QFYS GVF+G C T L+HGV AVGYG + +G KYWL+KNSWG +WGE GY R+ RD+
Sbjct: 388 FQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAA 447
Query: 318 PQGQCGIAMFASFPVSKESAQP 339
+G CGIAM AS+PV K S P
Sbjct: 448 KEGHCGIAMEASYPV-KTSPNP 468
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 148/312 (47%), Positives = 194/312 (62%), Gaps = 14/312 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+E W A++GR E +RFEIFKDN+ ++ N AA G+RS+ L LN+FAD+T +E
Sbjct: 50 YEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFADMTNEE 109
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA------ 145
+ G + + H + + Y + + +P SV+W +KGAVT VK QG C
Sbjct: 110 YRTVYLGTRPASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGSCGSCWAFS 169
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
+AAVEGIN I L+SLSEQ+LVDC N N GC GG MD AF++II N GI + Y
Sbjct: 170 TIAAVEGINKIVTGDLISLSEQELVDC-DNGQNQGCNGGLMDYAFEFIINNGGIDTEEDY 228
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y+ G CD + I YEDVP NDE++L KAVANQPVSVAI+A Q Y
Sbjct: 229 PYKARD-GKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQLYH 287
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F G C T L+HGV AVGYGT E G YW+++NSWG DWGE GY R++R+++ G+C
Sbjct: 288 SGIFTGRCGTDLDHGVVAVGYGT-ENGKDYWIVRNSWGGDWGESGYIRMERNVNASTGKC 346
Query: 323 GIAMFASFPVSK 334
GIAM +S+P K
Sbjct: 347 GIAMESSYPTKK 358
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 153/313 (48%), Positives = 198/313 (63%), Gaps = 22/313 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E+W +GR Y E +RF+IF+DN +E N N++Y L LN FAD+T EF
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQV--NQTYWLGLNNFADMTHDEF 91
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
A G K+ S+++K+ F YK ++ +P +W KGAV VK QG C
Sbjct: 92 KALYFGTKVP-LSNTIKSG---FRYKDATNLPLDTDWRSKGAVATVKNQGACGSCWAFST 147
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
VAAVEG+N I LVSLSEQ+LVDC N GC GG MD AF++IIQN G+ ++A Y
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQELVDC-DKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYP 206
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSG 263
Y+ +S G CD + H I +EDVP E LLKAVANQPVSVAI+AS Q YSG
Sbjct: 207 YKAVS-GSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSG 265
Query: 264 GVFNGYCETFLNHGVTAVGYGTSE--EGI--KYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
GV+ G+C L+HGV AVGYGTS+ +G+ YW+++NSWG WGE GY RLQR++ P+
Sbjct: 266 GVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPR 325
Query: 320 GQCGIAMFASFPV 332
G+CGIAM AS+PV
Sbjct: 326 GKCGIAMMASYPV 338
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 198/320 (61%), Gaps = 20/320 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENS--KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
E S+ +E+W++ Y + + ++ +RF +FK N V N + + L LNK
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKRDM---PFRLALNK 90
Query: 85 FADLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
FAD+T EF + G ++ H S + +G + +PP+V+W +KGAVT +K Q
Sbjct: 91 FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC + AVEGIN I+ +LVSLSEQ+L+DC N NN GC GG MD AF++I Q
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDC-DNVNNQGCDGGLMDYAFQFI-Q 208
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
GIT ++ Y Y+G G CD K A I YEDVP NDE +L KAVA QPVSVAID
Sbjct: 209 KNGITTESNYPYQG-EQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAID 267
Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
AS QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DWGE GY R+Q
Sbjct: 268 ASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQ 327
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R + Q +G CGIAM AS+P
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 197/317 (62%), Gaps = 25/317 (7%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+FEQW ++GR Y E +RFE++K+NL +E FN+ G YTL NKFADLT +E
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS---GGHGYTLTDNKFADLTNEE 174
Query: 93 FIASQTGFKMSD--------HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
F A G +D H+S+ A P S+ +P V+W +KGAV VK QG C
Sbjct: 175 FRAKMLGGLGADPDRRRRARHASN--ALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSC 232
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AVAA+EG+N IK +LVSLSEQ+LVDC + GC GGFM AF++++ N G
Sbjct: 233 GSCWAFSAVAAMEGLNQIKNGKLVSLSEQELVDC--DAEAVGCAGGFMSWAFEFVMANHG 290
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+T +A Y Y+G++ G C + K + + IT Y +V N E LLK A QPVSVA+DA
Sbjct: 291 LTTEASYPYKGIN-GACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGG 349
Query: 258 L--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
Q Y+GGVF+G C +NHGVT VGYG +++ KYW++KNSWG +WGE GY +QRD
Sbjct: 350 FLFQLYAGGVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDA 409
Query: 316 DQPQGQCGIAMFASFPV 332
P G CGIAM AS+PV
Sbjct: 410 GVPTGLCGIAMLASYPV 426
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 195/312 (62%), Gaps = 14/312 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E W A++GR Y E +RF +F DNL V+ N A + L +N+FADLT EF
Sbjct: 49 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERA-AEHGFRLGMNQFADLTNDEF 107
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQGQC------- 144
A+ G ++ A G + + + ++P SV+W EKGAV PVK QGQC
Sbjct: 108 RAAYLGARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 167
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
AV++VE +N I +V+LSEQ+LV+C+T+ N+GC GG MD AF +II+N GI + Y
Sbjct: 168 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 227
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A Q Y
Sbjct: 228 PYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYK 286
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GVF+G C T L+HGV AVGYGT E G YW+++NSWG WGEDGY R++R+++ G+C
Sbjct: 287 AGVFSGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 345
Query: 323 GIAMFASFPVSK 334
GIAM AS+P K
Sbjct: 346 GIAMMASYPTKK 357
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 155/321 (48%), Positives = 205/321 (63%), Gaps = 20/321 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYK-ESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
E S+ ++ W Q+ + +S E+++RFEIFK+N+ ++ N + Y L LNKF
Sbjct: 39 EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK---DSPYKLGLNKF 95
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC 144
ADL+ +EF A G KM D + F+Y++S+ +P S++W +KGAV VK QG C
Sbjct: 96 ADLSNEEFKAIYMGTKM-DLRGDREVQSGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHC 154
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
VA+VEGIN I LVSLSEQQLVDC+T N+GC GG MD AF+YII N G
Sbjct: 155 GSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE--NSGCNGGLMDTAFQYIINNGG 212
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQ--ITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
I + Y Y +T C S K + I +EDVP N+E++L +AVA+QPVSVAI+A
Sbjct: 213 IVTEDNYPYTAEATE-CSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEA 271
Query: 256 SA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S QFYS GVF G C T L+HGV AVGYGTS EGI YW+++NSWG WGE+GY R+Q+
Sbjct: 272 SGQDFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRNSWGPKWGEEGYIRMQQ 331
Query: 314 DIDQPQGQCGIAMFASFPVSK 334
I+ +G+CGIAM AS+P K
Sbjct: 332 GIEAAEGKCGIAMQASYPTKK 352
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 194/312 (62%), Gaps = 14/312 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E W A++GR Y E +RF +F DNL V+ N A + L +N+FADLT EF
Sbjct: 52 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERA-AEHGFRLGMNQFADLTNDEF 110
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQGQC------- 144
A+ G ++ A G + + + ++P SV+W EKGAV PVK QGQC
Sbjct: 111 RAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 170
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
AV++VE +N I +V+LSEQ+LV+C+T+ N+GC GG MD AF +II+N GI + Y
Sbjct: 171 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 230
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A Q Y
Sbjct: 231 PYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYK 289
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GVF G C T L+HGV AVGYGT E G YW+++NSWG WGEDGY R++R+++ G+C
Sbjct: 290 AGVFTGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 348
Query: 323 GIAMFASFPVSK 334
GIAM AS+P K
Sbjct: 349 GIAMMASYPTKK 360
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 148/329 (44%), Positives = 207/329 (62%), Gaps = 18/329 (5%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
S R E +E+ E W AQYG+ YK++AE KRF+IFK+N+ +E FN A G++
Sbjct: 22 SHIMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTA--GDKP 79
Query: 78 YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP---SVNWIEKGA 134
+ L +N+FADL +EF A T S A T +K ++V +++W ++GA
Sbjct: 80 FNLSINQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGA 139
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
VTP+K Q +C AVAA+EGI+ I ++LVSLSEQ+LVDC ++ GC GG+M+D
Sbjct: 140 VTPIKDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESE-GCNGGYMED 198
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPPNDEESLLKAVAN 246
AF+++ + GI +++ Y Y+G +K E H +QI YE VP N E++L KAVA+
Sbjct: 199 AFEFVAKKGGIASESYYPYKGKDKSC--KVKKETHGVSQIKGYEKVPSNSEKALQKAVAH 256
Query: 247 QPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
QPVSV ++A +A QFYS G+F G C T +H +T VGYG S G KYWL+KNSWG WG
Sbjct: 257 QPVSVYVEAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWG 316
Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
E GY R++RDI +G CGIAM A +P +
Sbjct: 317 EKGYIRMKRDIRAKEGLCGIAMNAFYPTA 345
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 155/321 (48%), Positives = 197/321 (61%), Gaps = 22/321 (6%)
Query: 27 EGSIAEKFEQWKAQYG---RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLN 83
E S+ +E+W++ Y R AE +RF +FK N V N + + L LN
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEE-RRFNVFKQNARYVHEGNKRDM---PFRLALN 89
Query: 84 KFADLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKY 140
KFAD+T EF + G ++ H S + +G + +PP+V+W +KGAVT +K
Sbjct: 90 KFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKD 149
Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC + AVEGIN I+ +LVSLSEQ+L+DC N NN GC GG MD AF++I
Sbjct: 150 QGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDC-DNVNNQGCDGGLMDYAFQFI- 207
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
Q GIT ++ Y Y+G G CD K A I YEDVP NDE +L KAVA QPVSVAI
Sbjct: 208 QKNGITTESNYPYQG-EQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAI 266
Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
DAS QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DWGE GY R+
Sbjct: 267 DASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRM 326
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
QR + Q +G CGIAM AS+P
Sbjct: 327 QRGVSQTEGLCGIAMQASYPT 347
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 148/312 (47%), Positives = 192/312 (61%), Gaps = 14/312 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+E W A++GR Y E +RFEIFKDN++ ++ N AA G+RS+ L LN+FAD+T +E
Sbjct: 50 YEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADMTNEE 109
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA------ 145
+ A G + + H + + Y + + +P SV+W KGAV VK QG C
Sbjct: 110 YRAVYLGTRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVKDQGSCGSCWAFS 169
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
VAAVEGIN I L+SLSEQ+LVDC N N GC GG MD F++II N GI + Y
Sbjct: 170 TVAAVEGINKIVTGDLISLSEQELVDC-DNGYNQGCNGGLMDYGFEFIINNGGIDTEEDY 228
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y G CD + I YEDVP NDE++L KAVANQPVSVAI+A Q Y
Sbjct: 229 PYTARD-GKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQLYH 287
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F G C T L+HGV AVGYGT E G YW+++NSWG DWGE GY R++R+++ G+C
Sbjct: 288 SGIFTGRCGTDLDHGVVAVGYGT-ENGKDYWIVRNSWGGDWGESGYIRMERNVNTSTGKC 346
Query: 323 GIAMFASFPVSK 334
GIA+ S+P K
Sbjct: 347 GIAIEPSYPTKK 358
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 156/340 (45%), Positives = 207/340 (60%), Gaps = 24/340 (7%)
Query: 10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
L IS S S+ + +G + E ++ W A++G+ Y E KRF+IFK+NL ++ N
Sbjct: 16 LSISASALSRRS-----DGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKFIDDHN 70
Query: 70 NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPS 126
+ NR+Y + LN FADLT +E+ A G + +KA Y + ++P S
Sbjct: 71 SE---NRTYKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLDRLPES 127
Query: 127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
++W +GAV PVK QG C +AAVEGIN I L+SLSEQ+LV C N+G
Sbjct: 128 MDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSC-DKKYNSG 186
Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
C GG MD AF++II N G+ + Y YE G CD + I YEDVP NDEES
Sbjct: 187 CNGGLMDYAFQFIIDNGGLDTEEDYPYEAFD-GQCDPTRKNAKVVSIDAYEDVPANDEES 245
Query: 240 LLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKN 297
L KAVA+QPVSVAI+AS ALQ Y GVF G C + L+HGV AVGYG E G+ YWL++N
Sbjct: 246 LKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYG-KENGVDYWLVRN 304
Query: 298 SWGQDWGEDGYFRLQRDIDQ-PQGQCGIAMFASFPVSKES 336
SWG WGEDGYF+L+R++ +G+CGIAM AS+PV ++
Sbjct: 305 SWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPVKNDN 344
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 162/345 (46%), Positives = 219/345 (63%), Gaps = 25/345 (7%)
Query: 19 QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
Q+++R+ DE + ++ W ++G+ Y E +KRFEIFK+NL ++ N+ NR+Y
Sbjct: 15 QSSWRSDDE--VMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ---NRTY 69
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP---FLYKS-SQVPPSVNWIEKGA 134
+ L KFADLT QE+ A G + SD L + P + YK+ ++P SV+W KGA
Sbjct: 70 KVGLTKFADLTNQEYRAMFLGTR-SDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGA 128
Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
V P+K QG C VAAVEGIN I L+SLSEQ+LVDC N GC GG MD
Sbjct: 129 VNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDC-DRFYNAGCNGGLMDY 187
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF++II N G+ + Y Y G + CD K + A I +EDV P DE++L KAVA+Q
Sbjct: 188 AFQFIINNGGLDTEKDYPYLG-NDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQ 246
Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
PVSVAI+AS ALQFY GVF G C T L+HGV VGYGT E+G+ YWL++NSWG +WGE
Sbjct: 247 PVSVAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYGT-EKGLDYWLVRNSWGTEWGE 305
Query: 306 DGYFRLQRDI-DQPQGQCGIAMFASFPVS--KESAQPSSADKSSA 347
GY ++QR++ D G+CGIAM +S+PV + +A+P AD+S+
Sbjct: 306 HGYIKMQRNVRDTYTGRCGIAMESSYPVKNGQNTAKPYLADESAG 350
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 216/335 (64%), Gaps = 23/335 (6%)
Query: 19 QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
Q+++R+ +E + + W A++ +TY + E KRFEIFK+NL ++ NN+ NR+Y
Sbjct: 35 QSSWRSDNE--VISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSK--NRTY 90
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP---FLYKSSQV-PPSVNWIEKGA 134
+ L +FADLT +E+ A G K SD L + P + +K+ V P S++W + GA
Sbjct: 91 KVGLTRFADLTNEEYRAKFLGTK-SDPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGA 149
Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
V+ +K QG C +AAVEG+N I L+SLSEQ+LVDC N GC GG MD+
Sbjct: 150 VSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDC-DRSYNAGCNGGLMDN 208
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF++II N GI D Y Y+ + G CD+ K ++ A I +EDV DE +L KAVA+Q
Sbjct: 209 AFQFIINNGGIDTDKDYPYQAVD-GKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQ 267
Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
PVSVAI+AS ALQFY GVF G C + L+HGV VGYGT E+GI YWL++NSWG+DWGE
Sbjct: 268 PVSVAIEASGMALQFYQSGVFTGECGSALDHGVVIVGYGT-EDGIDYWLVRNSWGRDWGE 326
Query: 306 DGYFRLQRD-IDQPQGQCGIAMFASFPVSKESAQP 339
+GY ++QR+ +D G+CGIAM +S+P+ K + P
Sbjct: 327 NGYIKMQRNVVDTFTGKCGIAMESSYPI-KNTQNP 360
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 155/346 (44%), Positives = 213/346 (61%), Gaps = 34/346 (9%)
Query: 23 RTFD--------EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIG 74
R+FD E S+ +E+W++ + + E ++RF +FK+NL + + N
Sbjct: 21 RSFDYKEEDLASEESLWNLYERWRSHH-TVSRSLTEKNQRFNVFKENLKHIHKVNQK--- 76
Query: 75 NRSYTLRLNKFADLTPQEFIASQTGFKMSD----HSSSLKANGTPFLYK-SSQVPPSVNW 129
+R Y LRLNKFAD+T EF+ G K+S H S + T F ++ +S +P S++W
Sbjct: 77 DRPYKLRLNKFADMTNHEFLQHYGGSKVSHYRMFHGSRRQ---TGFAHENTSNLPSSIDW 133
Query: 130 IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
++GAVT VK QG+C +VAAVEGIN IK L+SLSEQ+LVDC N N+GC G
Sbjct: 134 RKQGAVTGVKDQGKCGSCWAFSSVAAVEGINKIKTGELISLSEQELVDC--NSVNHGCDG 191
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
G M+ AF +I + G+T + Y Y G CDS K I YE VP NDE +L++
Sbjct: 192 GLMEQAFSFIEKTGGLTTENNYPYRA-KDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQ 250
Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
AVANQPVS+AIDA QFYS GV+ G C T LNHGV VGYG +++G KYW++KNSWG
Sbjct: 251 AVANQPVSIAIDAGGQDFQFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWG 310
Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKES--AQPSSADK 344
+WGE+G+ R+QR+ D +G CGI + AS+P+ + S QP S+ K
Sbjct: 311 SEWGENGFIRMQRENDVEEGLCGITLEASYPIKQRSDIKQPPSSGK 356
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/316 (45%), Positives = 207/316 (65%), Gaps = 16/316 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ EK EQW ++G+ YK++AE +RF+IFK+NL +E FN A G+ + L +N+F D T
Sbjct: 31 LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFN--AAGDNGFNLSINQFGDQT 88
Query: 90 PQEFIAS-QTGFKMSDHSSSLKA--NGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA 145
EF A+ G K + A + F Y++ ++VP +++W E+GAVTP+K+Q C
Sbjct: 89 NDEFKANYLNGKKKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVTPIKHQHLCG 148
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
VAA+EGI+ I RLVSLSEQ+LVDC + +GC GG+++DA +I++ GI
Sbjct: 149 SCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVKKGGI 208
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
T++ Y Y + G C+ K + A+I YE VP N+E++LLKAVANQP++V I A+
Sbjct: 209 TSETNYPYTRVD-GKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIAVYIAATKR 267
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
A QFYS G+ G C L+H VT VGYGTS++G+KYWL+KNSWG WGE GY +++RD+
Sbjct: 268 AFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDGVKYWLVKNSWGTKWGEKGYIKIKRDVH 327
Query: 317 QPQGQCGIAMFASFPV 332
+G CGIAM ++P+
Sbjct: 328 AKEGSCGIAMVPTYPI 343
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 153/324 (47%), Positives = 201/324 (62%), Gaps = 19/324 (5%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
A RT DE + +E W ++G+TY E +RF+IFKDNL ++ N+ G+ +Y
Sbjct: 40 APLRTDDE--VNALYESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNS---GDHTYK 94
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSL-KANGTPFLYKSSQ-VPPSVNWIEKGAVTP 137
L LNKFADLT +E+ + TG K D L K + Y+S +P V+W E+GAVT
Sbjct: 95 LGLNKFADLTNEEYRMTYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTD 154
Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG C +VEG+N I L+S+SEQ+LV+C T+ N GC GG MD AF+
Sbjct: 155 VKDQGSCGSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTS-YNQGCNGGLMDYAFE 213
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
+II+N GI + Y Y G G CD K I +YEDVP NDE SL KAV+NQPV+
Sbjct: 214 FIIKNGGIDTEEDYPYTGKD-GKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVA 272
Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
VAI+A QFY+ G+F G C T L+HGV A GYGT E+G YWL+KNSWG +WGE GY
Sbjct: 273 VAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAGYGT-EDGKDYWLVKNSWGAEWGEGGY 331
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
+++R+I G+CGIAM AS+P+
Sbjct: 332 LKMERNIADKSGKCGIAMEASYPI 355
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 154/327 (47%), Positives = 206/327 (62%), Gaps = 19/327 (5%)
Query: 19 QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
++++RT DE +A +E W A++G++Y E +RF+IFKDNL ++ N NR+Y
Sbjct: 40 KSSWRT-DEDVMA-VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE---NRTY 94
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTP 137
+ LN+FADLT +E+ + G + + S + ++ +P SV+W +KGAV
Sbjct: 95 KVGLNRFADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVE 154
Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG C +AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF+
Sbjct: 155 VKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFE 213
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
+II N GI ++ Y Y+ S G CD + I YEDVP NDE+SL KAVANQPVS
Sbjct: 214 FIINNGGIDSEEDYPYKA-SDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVS 272
Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
VAI+A Q Y G+F G C T L+HGVTAVGYGT E G+ YW++KNSWG WGE+GY
Sbjct: 273 VAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGY 331
Query: 309 FRLQRDI-DQPQGQCGIAMFASFPVSK 334
R++RD+ G+CGIAM AS+P+ K
Sbjct: 332 IRMERDLATSATGKCGIAMEASYPIKK 358
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 151/309 (48%), Positives = 195/309 (63%), Gaps = 17/309 (5%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
++++W QYGR Y E RF I+ N+ +E N+ N S+ L NKFADLT E
Sbjct: 45 RYDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ---NLSFKLTDNKFADLTNDE 101
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
F + G+++ + + N + S+ +P +V+W E GAVTP+K QGQC A
Sbjct: 102 FNSIYLGYQIRSYK---RRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSA 158
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
VAAVEGIN IK LVSLSEQ+LVDC N +N GC GGFM+ AF +I G+T + Y
Sbjct: 159 VAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYP 218
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
Y+G + G C+ K ++HA I YE VP N+E SL AV+ QPVSVAIDAS +F YS
Sbjct: 219 YKG-TDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSE 277
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
GVF+GYC LNHGVT VGYG + G KYWL+KNSWG+ WGE GY R++RD +G CG
Sbjct: 278 GVFSGYCGIQLNHGVTIVGYGDN-NGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCG 336
Query: 324 IAMFASFPV 332
IAM S+P+
Sbjct: 337 IAMEPSYPI 345
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 153/325 (47%), Positives = 203/325 (62%), Gaps = 24/325 (7%)
Query: 27 EGSIAEKFEQWKAQYGRT----YKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
E S+ +EQW++ Y + +E + ++ F +FK+N+ + N RS+ L L
Sbjct: 35 EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG---RSFRLAL 91
Query: 83 NKFADLTPQEFI-ASQTGFKMSDH---SSSLKANGT-PFLY-KSSQVPPSVNWIEKGAVT 136
NKFAD+T EF A G + H SS ++ +G F+Y ++ +P +V+W ++GAVT
Sbjct: 92 NKFADMTTDEFRRAYAAGSRTRHHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVT 151
Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
+K QGQC +AAVEGIN I+ +LVSLSEQ+LVDC DN GC GG MD AF
Sbjct: 152 GIKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQ-GCNGGLMDYAF 210
Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
+YI +N GIT ++ Y Y C+ K H I YEDVP N+E++L KAVANQPV
Sbjct: 211 QYIKRNGGITTESNYPYLAEQRS-CNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPV 269
Query: 250 SVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
S+AI+AS QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DWGE G
Sbjct: 270 SIAIEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERG 329
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
Y R+QR I QG CGIAM S+P
Sbjct: 330 YIRMQRGISDSQGLCGIAMEPSYPT 354
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 154/327 (47%), Positives = 206/327 (62%), Gaps = 19/327 (5%)
Query: 19 QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
++++RT DE +A +E W A++G++Y E +RF+IFKDNL ++ N NR+Y
Sbjct: 38 KSSWRT-DEDVMA-VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE---NRTY 92
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTP 137
+ LN+FADLT +E+ + G + + S + ++ +P SV+W +KGAV
Sbjct: 93 KVGLNRFADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVE 152
Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG C +AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF+
Sbjct: 153 VKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFE 211
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
+II N GI ++ Y Y+ S G CD + I YEDVP NDE+SL KAVANQPVS
Sbjct: 212 FIINNGGIDSEEDYPYKA-SDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVS 270
Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
VAI+A Q Y G+F G C T L+HGVTAVGYGT E G+ YW++KNSWG WGE+GY
Sbjct: 271 VAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGY 329
Query: 309 FRLQRDI-DQPQGQCGIAMFASFPVSK 334
R++RD+ G+CGIAM AS+P+ K
Sbjct: 330 IRMERDLATSATGKCGIAMEASYPIKK 356
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 196/316 (62%), Gaps = 18/316 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I + FE W +++ + Y+ E RFEIFKDNL ++ N + +Y L LN+FADL+
Sbjct: 29 IIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVV---NYWLGLNEFADLS 85
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G + S+ + F YK S +P SV+W +KGAVT VK QG C
Sbjct: 86 HEEFKNKYLGLNVD--LSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCW 143
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L SLSEQ+LVDC T NNGC GG MD AF YII N G+ +
Sbjct: 144 AFSTVAAVEGINQIVTGNLTSLSEQELVDCDTT-YNNGCNGGLMDYAFAYIISNGGLHKE 202
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G C+ KAE I+ Y DVP N EESLLKA+ANQP+SVAIDAS Q
Sbjct: 203 EDYPYI-MEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRDFQ 261
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G+C T L+HGV AVGYG S +G+ + ++KNSWG WGE G+ R++R+ +P
Sbjct: 262 FYSGGVFDGHCGTELDHGVAAVGYG-SAKGLDFIVVKNSWGSKWGEKGFIRMKRNTGKPA 320
Query: 320 GQCGIAMFASFPVSKE 335
G CGI AS+P K+
Sbjct: 321 GLCGINKMASYPTKKK 336
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 157/325 (48%), Positives = 198/325 (60%), Gaps = 20/325 (6%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
++A +R +E + FE+W + + Y E KRFEIF DNL V+ N ++ N+S
Sbjct: 24 AKADHRNPEE---VKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHN--SVPNQS 78
Query: 78 YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVT 136
Y L L +FADLT +EF A KM S+K+ +L+ ++P V+W KGAV
Sbjct: 79 YELGLTRFADLTNEEFRAIYLRSKMERTRDSVKSE--RYLHNVGDKLPDEVDWRAKGAVV 136
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
PVK QG C A+ AVEGIN IK LVSLSEQ+LVDC T+ NNGC GG MD AF
Sbjct: 137 PVKDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTS-YNNGCGGGLMDYAF 195
Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
++II N GI + Y Y IC++ K I YEDVP N E SL KA+ANQP+
Sbjct: 196 QFIISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPEN-ENSLKKALANQPI 254
Query: 250 SVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
SVAI+A Q Y GVF G C T L+HGV AVGYGTSE G YW+I+NSWG +WGE G
Sbjct: 255 SVAIEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSE-GQDYWIIRNSWGSNWGESG 313
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
Y +LQR+I G+CG+AM AS+P
Sbjct: 314 YIKLQRNIKDSSGKCGVAMMASYPT 338
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 154/316 (48%), Positives = 195/316 (61%), Gaps = 18/316 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I + FE W +++G+ Y+ E RFEIFKDNL ++ N + +Y L LN+F+DL+
Sbjct: 29 IIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKKVV---NYWLGLNEFSDLS 85
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G K+ S + F YK +P SV+W +KGAVT VK QG C
Sbjct: 86 HEEFKNKYLGLKVD--MSERRECSQEFNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCW 143
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L SLSEQ+LVDC T NN GC GG MD AF YII N G+ +
Sbjct: 144 AFSTVAAVEGINQIVTGNLTSLSEQELVDCDTT-NNYGCNGGLMDYAFSYIISNGGLHKE 202
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G C+ K E I+ Y DVP N EESLLKA+ANQP+SVAI+AS Q
Sbjct: 203 VDYPYI-MEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQ 261
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G+C T L+HGV AVGYG++ G+ Y ++KNSWG WGE GY R++R+ +P
Sbjct: 262 FYSGGVFDGHCGTQLDHGVAAVGYGSTN-GLDYIIVKNSWGSKWGEKGYIRMKRNTGKPA 320
Query: 320 GQCGIAMFASFPVSKE 335
G CGI AS+P K+
Sbjct: 321 GLCGINKMASYPTKKK 336
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 194/316 (61%), Gaps = 17/316 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ +E W ++G+ Y E KRFEIFKDNL ++ N+ +RSY + LN+FADLT
Sbjct: 47 VRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV---DRSYKVGLNRFADLT 103
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
+E+ A G KM + L +L+K +P +V+W EKGAV PVK QGQC
Sbjct: 104 NEEYKAMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCW 163
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
V AVEGIN I L+SLSEQ+LVDC + N GC GG MD AF++II N GI +
Sbjct: 164 AFSTVGAVEGINQIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFEFIINNGGIDTE 222
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
Y Y+ S ICD + I YEDVP NDE SL KAVA+QPVSVAI+A A Q
Sbjct: 223 EDYPYKA-SDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQ 281
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQP 318
Y GVF G C T L+HGV AVGYGT E G+ YW+++NSWG WGE GY R++R++ +
Sbjct: 282 LYKSGVFTGRCGTELDHGVVAVGYGT-ENGVNYWIVRNSWGSAWGESGYIRMERNVANTK 340
Query: 319 QGQCGIAMFASFPVSK 334
G+CGIA+ S+P K
Sbjct: 341 TGKCGIAIQPSYPTKK 356
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 199/322 (61%), Gaps = 19/322 (5%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
+DE +E W ++G+ Y E +RF+IFKDNL +E N A G++SY L LNK
Sbjct: 39 YDESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGA--GDKSYKLGLNK 96
Query: 85 FADLTPQEFIASQTGFKM---SDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKY 140
FADLT +E+ A G + + ++ + + Y++ + +P V+W EKGAVTP+K
Sbjct: 97 FADLTNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKD 156
Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC V AVEGIN I L SLSEQ+LVDC N GC GG MD AF++I+
Sbjct: 157 QGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDC-DRGYNMGCNGGLMDYAFEFIV 215
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
QN GI + Y Y CD + I YEDVP NDE+SL+KAVANQPVSVAI
Sbjct: 216 QNGGIDTEEDYPYHAKDN-TCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAI 274
Query: 254 DASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+A ++F Y GVF G C T L+HGV AVGYGT E G YWL++NSWG WGE+GY +L
Sbjct: 275 EAGGMEFQLYQSGVFTGRCGTNLDHGVVAVGYGT-ENGTDYWLVRNSWGSAWGENGYIKL 333
Query: 312 QRDIDQPQ-GQCGIAMFASFPV 332
+R++ + G+CGIA+ AS+P+
Sbjct: 334 ERNVQNTETGKCGIAIEASYPI 355
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 148/336 (44%), Positives = 206/336 (61%), Gaps = 20/336 (5%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
T T + + +E W A++G+TY E RF IF DNL ++ N + GNRSY
Sbjct: 22 VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLS--GNRSYK 79
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF-----LYKSSQVPPSVNWIEKGA 134
+ LN+FADLT +E+ + G K+ + K + ++ P V+W E+GA
Sbjct: 80 VGLNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGA 139
Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
V+PVK QG C VA+VEGIN I L+SLSEQ+LVDC N N+GC GG MD
Sbjct: 140 VSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDC-DNKYNSGCNGGSMDY 198
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF++I+ N GI +++ Y Y+G+ +CD ++ + I YEDVPP +E++L+KAVA+Q
Sbjct: 199 AFQFIVSNGGIDSESDYPYKGVGA-VCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQ 257
Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
PVSV I+AS A Q Y+ GV G C T L+HGV VGYG SE G YW+++NSWG +WGE
Sbjct: 258 PVSVGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYG-SENGKDYWIVRNSWGPEWGE 316
Query: 306 DGYFRLQRD-IDQPQGQCGIAMFASFPVSKESAQPS 340
DGY R++R+ +D P G CGI + AS+P+ + PS
Sbjct: 317 DGYIRMERNMVDTPVGMCGITLMASYPIKYGNKNPS 352
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 154/331 (46%), Positives = 214/331 (64%), Gaps = 21/331 (6%)
Query: 18 SQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
S+AT R E +I ++W + R Y + E R E+F +NL +E FNN +G++
Sbjct: 21 SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNN--MGSQ 78
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA--NGTP-FLYKSSQVPPSV-NWIEK 132
SY L +NKF D T +EF+A+ TG + +S + TP + + S V + +W +
Sbjct: 79 SYKLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNE 138
Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
GAVTPVKYQG+C A+AAVEG+ I L+SLSEQQL+DCA + NNGC GG M
Sbjct: 139 GAVTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCA-REQNNGCKGGTM 197
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
+AF YI++N G++++ Y Y+ + G C S + A I +E+VP N+E +LL+AV+
Sbjct: 198 IEAFNYIVKNGGVSSENAYPYQ-VKEGPCRS--NDIPAIVIRGFENVPSNNERALLEAVS 254
Query: 246 NQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
QPV+V IDAS F YSGGV+N C T +NH VT VGYGTS+EGIKYWL KNSWG+
Sbjct: 255 RQPVAVDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKT 314
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
WGE+GY R++RD++ PQG CG+A +AS+PV+
Sbjct: 315 WGENGYIRIRRDVEWPQGMCGVAQYASYPVA 345
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 154/324 (47%), Positives = 198/324 (61%), Gaps = 30/324 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E+W+ ++ ++ + ++RF +FK N+ + FN + Y LRLN+F D+T EF
Sbjct: 49 YERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR---DEPYKLRLNRFGDMTADEF 104
Query: 94 IASQTGFKMSDH--------SSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC 144
G +++ H SS A+ F+Y ++ VP SV+W +KGAVT VK QGQC
Sbjct: 105 RRHYAGSRVAHHRMFRGDRQGSSASAS---FMYADARDVPASVDWRQKGAVTDVKDQGQC 161
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
+AAVEGINAIK L SLSEQQLVDC T N GC GG MD AF+YI ++ G
Sbjct: 162 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTK-ANAGCNGGLMDYAFQYIAKHGG 220
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+ + Y Y C A I YEDVP NDE +L KAVA+QPVSVAI+AS
Sbjct: 221 VAAEDAYPYRARQAS-CKKSPAP--VVTIDGYEDVPANDESALKKAVAHQPVSVAIEASG 277
Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
QFYS GVF+G C T L+HGVTAVGYG + +G KYWL+KNSWG +WGE GY R+ RD+
Sbjct: 278 SHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDV 337
Query: 316 DQPQGQCGIAMFASFPVSKESAQP 339
+G CGIAM AS+PV K S P
Sbjct: 338 AAKEGHCGIAMEASYPV-KTSPNP 360
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 159/343 (46%), Positives = 211/343 (61%), Gaps = 22/343 (6%)
Query: 4 YFLIVVLI---ISGSCASQATYRTFDEGSIAEK-FEQWKAQYGRTYKESAENSKRFEIFK 59
YF ++++ +S S S+ E S EK +E+W Q+GR YK E + F I++
Sbjct: 11 YFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQ 70
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ + N N S+TL N+FAD+T +E+ A G S+ S + N + F +
Sbjct: 71 SNVRFINYINAQ---NFSFTLTDNQFADMTNEEYKALYMGLGTSETS---RKNQSSFKRE 124
Query: 120 SSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
S+V P SV+W + GAVTPV+ QG+C VAAVEGIN I+ +LVSLSEQ+L+DC
Sbjct: 125 RSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDC 184
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ N GC GG+M +AFK+I QN GIT Y Y G GIC+ KA +H +I+ YE
Sbjct: 185 DIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIG-EQGICNKDKAANHVVKISGYET 243
Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VPPN+E+ L AVA QPVSVAIDA +F YS G+FNG+C LNH VT +GYG + G
Sbjct: 244 VPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-EDNG 302
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYWL+KNSWG WGE GY R+ RD +G CGIAM AS+P+
Sbjct: 303 KKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 345
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 152/314 (48%), Positives = 200/314 (63%), Gaps = 24/314 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I +++++W +YGR YK E +RF I++ N+ ++ FN+ N S+TL N FADLT
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM---NHSHTLAENNFADLT 71
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQC---- 144
+EF A+ G+K ++ T F Y + +P +V+W ++GAVTP+K QGQC
Sbjct: 72 NEEFKATYLGYK------TVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AVAAVEGIN IK +L+SLSEQ+LVDC N GC GG+M AF++I + G+T +
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y+G + C+ K + I+ YE VP NDE+SL AVANQPVSVAIDA Q
Sbjct: 185 IEYPYQGAESA-CNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQ 243
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYG-TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
FYSGG+F+G C LNHGV VGYG TS + YWL+KNSWG DWGE GY R++RD
Sbjct: 244 FYSGGIFSGNCGNQLNHGVAIVGYGETSNQA--YWLVKNSWGTDWGESGYIRMKRDSTDR 301
Query: 319 QGQCGIAMFASFPV 332
QG CGIAM AS+P
Sbjct: 302 QGTCGIAMMASYPT 315
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 153/320 (47%), Positives = 200/320 (62%), Gaps = 22/320 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
+GS +E+W +GR Y E +RF+IF+DN +E N N++Y L LN FA
Sbjct: 27 DGSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQV--NQTYWLGLNNFA 84
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQCA 145
D+T EF A G K+ S+++K+ F Y+ ++ +P +W KGAV VK QG C
Sbjct: 85 DMTHDEFKALYFGTKVP-LSNTIKSG---FRYEDATNLPLDTDWRSKGAVATVKNQGACG 140
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
VAAVEG+N I LVSLSEQ+LVDC N GC GG MD AF++IIQN G+
Sbjct: 141 SCWAFSTVAAVEGVNQIVTGELVSLSEQELVDC-DKQKNQGCNGGLMDSAFEFIIQNGGL 199
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
++A Y Y+ +S G CD + H I +EDVP E LLKAVANQPVSVAI+AS
Sbjct: 200 DSEADYPYKAVS-GSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGR 258
Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSE--EGI--KYWLIKNSWGQDWGEDGYFRLQ 312
Q YSGGV+ G+C L+HGV AVGYGTS+ +G+ YW+++NSWG WGE GY RLQ
Sbjct: 259 NFQLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQ 318
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R++ +G+CGIAM AS+PV
Sbjct: 319 RNVASSRGKCGIAMMASYPV 338
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 193/310 (62%), Gaps = 18/310 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E+W +G+ Y E +RFEIFKDNL V+ N A SY + LN+FADLT +E+
Sbjct: 47 YEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVA---GSYRVGLNRFADLTNEEY 103
Query: 94 IASQTG--FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
+ G +M + S+S K++ F ++P SV+W EKGAV+PVK QGQC
Sbjct: 104 RSMFLGGNMEMKERSASTKSDRYAFR-AGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFS 162
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
++AVEGIN I L+SLSEQ+LVDC N GC GG MD F++II N GI + Y
Sbjct: 163 TISAVEGINQIVTGELISLSEQELVDC-DKSYNMGCNGGLMDYGFQFIINNGGIDTEEDY 221
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
Y + G CD + I YEDVP +DE SL KAVANQPVSVAI+A A Q Y
Sbjct: 222 PYRAVD-GTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYE 280
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GVF G+C T L+HGV AVGYGT E G+ YW ++NSWG WGE+GY +L+R+I+ G+C
Sbjct: 281 SGVFTGHCGTNLDHGVVAVGYGT-ENGVDYWTVRNSWGPKWGENGYIKLERNINATSGKC 339
Query: 323 GIAMFASFPV 332
GIA AS+P
Sbjct: 340 GIASMASYPT 349
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 158/333 (47%), Positives = 199/333 (59%), Gaps = 25/333 (7%)
Query: 19 QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
Q RT E +E W ++GR Y E +RFEIFKDNL ++ N ++GN SY
Sbjct: 12 QVPERT--EAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHN--SVGNPSY 67
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP----FLYKSSQ-VPPSVNWIEKG 133
L LNKFADL+ E+ + G +M L G P +L+K +P +V+W EKG
Sbjct: 68 KLGLNKFADLSNDEYRSVYLGTRMDGKGRLL---GGPKSERYLFKEGDDLPETVDWREKG 124
Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AV PVK QGQC V AVEGIN I L SLSEQ+LVDC N GC GG MD
Sbjct: 125 AVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKT-YNLGCNGGLMD 183
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
AF +II+N GI + Y Y+ + + +CD + I YEDVP NDE+SL KAVAN
Sbjct: 184 YAFDFIIENGGIDTEEDYPYKAIDS-MCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVAN 242
Query: 247 QPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
QPVSVAI+A Q Y GVF G C T L+HGV VGYGT E G+ YW+++NSWG WG
Sbjct: 243 QPVSVAIEAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGT-EHGVDYWIVRNSWGPAWG 301
Query: 305 EDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKES 336
E+GY R++RD+ + G+CGIAM AS+P K +
Sbjct: 302 ENGYIRMERDVASTETGKCGIAMEASYPTKKSA 334
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 200/313 (63%), Gaps = 24/313 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I +++++W +YGR YK E +RF I++ N+ ++ FN+ N S+TL N FADLT
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM---NHSHTLAENNFADLT 71
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQC---- 144
+EF A+ G+K ++ T F Y + +P +V+W ++GAVTP+K QGQC
Sbjct: 72 NEEFKATYLGYK------TVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AVAAVEGIN IK +L+SLSEQ+LVDC N GC GG+M AF++I + G+T +
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y+G + C+ K + I+ YE VP NDE+SL AVANQPVSVAIDA Q
Sbjct: 185 IEYPYQGAESA-CNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQ 243
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYG-TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
FYSGG+F+G C LNHGV VGYG TS + YWL+KNSWG DWGE GY R++RD
Sbjct: 244 FYSGGIFSGNCGNQLNHGVAIVGYGETSNQA--YWLVKNSWGTDWGESGYIRMKRDSTDK 301
Query: 319 QGQCGIAMFASFP 331
QG CGIAM AS+P
Sbjct: 302 QGTCGIAMMASYP 314
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 159/343 (46%), Positives = 211/343 (61%), Gaps = 22/343 (6%)
Query: 4 YFLIVVLI---ISGSCASQATYRTFDEGSIAEK-FEQWKAQYGRTYKESAENSKRFEIFK 59
YF ++++ +S S S+ E S EK +E+W Q+GR YK E + F I++
Sbjct: 7 YFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQ 66
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ + N N S+TL N+FAD+T +E+ A G S+ S + N + F +
Sbjct: 67 SNVRFINYINAQ---NFSFTLTDNQFADMTNEEYKALYMGLGTSETS---RKNQSSFKRE 120
Query: 120 SSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
S+V P SV+W + GAVTPV+ QG+C VAAVEGIN I+ +LVSLSEQ+L+DC
Sbjct: 121 RSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDC 180
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ N GC GG+M +AFK+I QN GIT Y Y G GIC+ KA +H +I+ YE
Sbjct: 181 DIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIG-EQGICNKDKAANHVVKISGYET 239
Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VPPN+E+ L AVA QPVSVAIDA +F YS G+FNG+C LNH VT +GYG + G
Sbjct: 240 VPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-EDNG 298
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYWL+KNSWG WGE GY R+ RD +G CGIAM AS+P+
Sbjct: 299 KKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 341
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 159/330 (48%), Positives = 208/330 (63%), Gaps = 22/330 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ +E+W+ Q+ ++ E ++RF +F++N+ + FN G+ Y LRLN+F
Sbjct: 40 EDSLWALYERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNR---GDAPYKLRLNRFG 95
Query: 87 DLTPQEFIASQTGFKMSDHSS-SLKANGTPFLYKSS----QVPPSVNWIEKGAVTPVKYQ 141
D+T EF + ++S H SLK G F++ S+ VPPSV+W +KGAVT VK Q
Sbjct: 96 DMTADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQ 155
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC +AAVEGINAI+ L SLSEQQLVDC T +N GC GG MD AF+YI +
Sbjct: 156 GQCGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTK-SNAGCNGGLMDYAFQYIAK 214
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
+ G+ + Y Y+ C+ K I YEDVP NDE +L KAVA QPV+VAI+
Sbjct: 215 HGGVAAEDAYPYKARQASSCN--KKPSAVVTIDGYEDVPANDETALKKAVAAQPVAVAIE 272
Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
AS QFYS GVF G C T L+HGV AVGYGT+ +G KYW++KNSWG +WGE GY R++
Sbjct: 273 ASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMK 332
Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
RD+ +G CGIAM AS+PV K SA P A
Sbjct: 333 RDVKDKEGLCGIAMEASYPV-KTSANPKHA 361
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 212/339 (62%), Gaps = 29/339 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LIV+ ++ S +Q ++ +++E+++ WK +Y YK+ AE K +IFK N+
Sbjct: 13 ILIVIWVMFPSNQNQENDQSL---TLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAY 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK---MSDHSSSLKANGTPFLYKS- 120
++ FN A GN+SY L +N+FADL P E S GFK + +SSL F YK+
Sbjct: 70 IDSFN--AAGNKSYKLTINRFADL-PTE--PSDDGFKKRKLEPTTSSL------FKYKNI 118
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+ +P +V+W ++GAVTPVK Q +C AV A+EGI I LVSLSEQ+LVD
Sbjct: 119 TDIPAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVR 178
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
++ NGC GG++ DAF+++++N GI +A Y Y G+ ++ K QI +YE VP
Sbjct: 179 SNWTNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKG---NNSKKVSRQVQIKSYEQVP 235
Query: 234 PNDEESLLKAVANQPVSVAIDASAL-QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N E+SLLK VANQPVSV ID S + +FYS G+F G C T NH V VGYGTS +G KY
Sbjct: 236 RNSEDSLLKVVANQPVSVGIDISGMIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKY 295
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE Y R++RDID +G CGI M AS+P
Sbjct: 296 WLVKNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 161/335 (48%), Positives = 206/335 (61%), Gaps = 25/335 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ +E+W+A++ ++ AE S+RF +F++N V FN + Y LRLN+FA
Sbjct: 42 EESLWALYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFN--LRRDAPYKLRLNRFA 98
Query: 87 DLTPQEFIASQTGFKMSDH----------SSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
DLT EF S ++S H + G+ F + + +P SV+W EKGAVT
Sbjct: 99 DLTSDEFRRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGA-LPTSVDWREKGAVT 157
Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
VK QGQC +AAVEGINAI+ N L SLSEQQLVDC T N GC GG MDDAF
Sbjct: 158 GVKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTK-TNAGCDGGLMDDAF 216
Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
YI ++ G+ + Y Y + C+S KA I YEDVP NDE +L KAVA QPV
Sbjct: 217 SYIAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPV 276
Query: 250 SVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
+VAI+A S QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG++WGE G
Sbjct: 277 AVAIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKG 336
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
Y R++RD+ +G CGIAM AS+PV K S P A
Sbjct: 337 YIRMKRDVADKEGLCGIAMEASYPV-KTSPNPKHA 370
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 205/328 (62%), Gaps = 18/328 (5%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
A+++++RT DE + +E+W + G+ Y E KRF++FKDNL ++ N+ NR
Sbjct: 37 ATKSSWRTDDE--VMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSE---NR 91
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAV 135
+Y L LN FADLT +E+ ++ G + + L+ + + + +P SV+W ++GAV
Sbjct: 92 TYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSDRYAPRVGESLPDSVDWRKEGAV 151
Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
VK QG C +AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD A
Sbjct: 152 AEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYA 210
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
F++II N GI + Y Y G CD+ + I +YEDVP N E +L KAVANQP
Sbjct: 211 FEFIINNGGIDTEEDYPYLARD-GRCDTYRKNAKVVTIDDYEDVPVNSETALQKAVANQP 269
Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
VSVAI+A QFY+ G+F+G C T L+HGV AVGYGT E G YW+++NSWG+ WGE+
Sbjct: 270 VSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGEN 328
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSK 334
GY R+ R I+ P G CGIAM AS+P+ K
Sbjct: 329 GYLRMARSINSPTGICGIAMEASYPIKK 356
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 211/334 (63%), Gaps = 24/334 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ +E+W++ + + ++ E KRF +FK+N + FN + Y LRLNKFA
Sbjct: 31 EDSLWNLYERWRSHHTVS-RDLDEKQKRFNVFKENPRYIHDFNKRK--DIPYKLRLNKFA 87
Query: 87 DLTPQEFIASQTGFKMSDHSS---SLKANGT-PFLYKS---SQVPPSVNWIEKGAVTPVK 139
DLT EF ++ G +++ H S S + T F+Y+S +P S++W +KGAVT VK
Sbjct: 88 DLTNHEFRSTYAGSRINHHRSLRGSRRGGATNSFMYQSLDSRSLPASIDWRQKGAVTAVK 147
Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QGQC VAAVEGIN IK +L+SLSEQ+L+DC T D NNGC GG MD AF +I
Sbjct: 148 DQGQCGSCWAFSTVAAVEGINQIKTKKLLSLSEQELIDCDT-DENNGCNGGLMDYAFDFI 206
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
+N GI+++A Y Y + C + K + H I +EDVP NDE+SLLKAVANQPVS+A
Sbjct: 207 KKNGGISSEAEYPYAAEDS-YCATEK-KSHVVSIDGHEDVPANDEDSLLKAVANQPVSIA 264
Query: 253 IDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
I+AS QFYS GVF G T L+HGV VGYG +++G KYW+++NSWG +WGE GY R
Sbjct: 265 IEASGYDFQFYSEGVFTGRSGTELDHGVAIVGYGKTQQGTKYWIVRNSWGAEWGEKGYIR 324
Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPSSADK 344
+ D + CG+AM AS+P+ K S PS +
Sbjct: 325 ISAASDSKR-LCGLAMEASYPI-KTSPNPSHKSR 356
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 196/309 (63%), Gaps = 15/309 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
++ W A+ GR+Y E +RF +F DNL V+ N A + + L +N+FADLT EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------A 145
++ G K+ + S +A G + + ++P SV+W EKGAV PVK QGQC A
Sbjct: 109 RSTFLGAKVVERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 165
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
V+ VE IN + +++LSEQ+LV+C+TN N+GC GG MDDAF +II+N GI + Y
Sbjct: 166 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 225
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSG 263
Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A Q Y
Sbjct: 226 YKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
GVF+G C T L+HGV AVGYGT + G YW+++NSWG WGE GY R++R+I+ G+CG
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCG 343
Query: 324 IAMFASFPV 332
IAM AS+P
Sbjct: 344 IAMMASYPT 352
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 150/314 (47%), Positives = 196/314 (62%), Gaps = 19/314 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E FE W +++ +TY+ E RFEIF DNL ++ N SY L LN+FADL+ +
Sbjct: 45 ELFESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDETNKKV---SSYWLGLNEFADLSHE 101
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA----- 145
EF + G ++ K + F Y + +P SV+W KGAVTPVK QG C
Sbjct: 102 EFKSKYLGLRVE---FPRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAF 158
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
VAAVEGIN I L SLSEQ+L+DC NNGCYGG MD AF+YI+ N G+ +
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDC-DRSFNNGCYGGLMDYAFQYIMSNSGLRKEED 217
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
Y Y M G C K + I+ YEDVP NDE+SLLKA+++QPVSVAI+AS+ QFY
Sbjct: 218 YPYL-MEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFY 276
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
GG+F G C T ++HGVTAVGYG+SE G Y ++KNSWG WGE+GY R++R+ +P+G
Sbjct: 277 KGGIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGL 335
Query: 322 CGIAMFASFPVSKE 335
CGI AS+P ++
Sbjct: 336 CGINQMASYPTKEK 349
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 199/312 (63%), Gaps = 22/312 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF---NNAAIGNRSYTLRLNKFADLTP 90
++ W A+ GR+Y E+ +RF +F DNL RF +NA + + L +N+FADLT
Sbjct: 54 YDLWLAENGRSYNALGEHERRFRVFWDNL----RFADAHNARADDHGFRLGMNRFADLTN 109
Query: 91 QEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC----- 144
+EF A+ G K+ + S +A G + + ++P SV+W EKGAV PVK QGQC
Sbjct: 110 EEFRATFLGAKVVERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
AV+ VE IN + +++LSEQ+LV+C+TN N+GC GG MDDAF +II+N GI +
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 226
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A Q
Sbjct: 227 DYPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQL 285
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
Y GVF+G C T L+HGV AVGYGT + G YW+++NSWG WGE GY R++R+I+ G
Sbjct: 286 YHSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTG 344
Query: 321 QCGIAMFASFPV 332
+CGIAM AS+P
Sbjct: 345 KCGIAMMASYPT 356
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 198/319 (62%), Gaps = 17/319 (5%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
S+ + +E+W +Q+ + E KRF +FK N+ + R N + Y L+LN+FAD+
Sbjct: 35 SLWDLYERWGSQH-MVSRAPDEKKKRFNVFKYNVNHINRVNQLG---KPYKLKLNEFADM 90
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-- 145
T EF A + K TPF + K++ PPS++W GAV P+K QG+C
Sbjct: 91 TNHEFKAGFDSKILHFRMLKGKRRQTPFTHAKTTDPPPSIDWRTNGAVNPIKNQGRCGSC 150
Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
+ VEGIN IK N+LVSLSEQ+LVDC T+ GC GG M++ +++I + G+T
Sbjct: 151 WAFSTIVGVEGINKIKTNQLVSLSEQELVDCETD--CEGCNGGLMENGYEFIKETGGVTT 208
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL-- 258
+ +Y Y + G CD K +I +E+VP NDE ++L+AVANQPVS+AIDA L
Sbjct: 209 EQIYPYFARN-GRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGGLNF 267
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
QFYS GVFNG C T LNHGV VGYGT+++G YW+++NSWG WGE GY R+QR ++ P
Sbjct: 268 QFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRGVNVP 327
Query: 319 QGQCGIAMFASFPVSKESA 337
+G CG+AM AS+P+ S
Sbjct: 328 EGLCGLAMDASYPIKASSV 346
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 149/333 (44%), Positives = 198/333 (59%), Gaps = 26/333 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAEN----SKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
E S+ +E+W++ Y R ++ ++RF +FK+N V N R + L L
Sbjct: 34 EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKD--GRPFRLAL 91
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--------SSQVPPSVNWIEKGA 134
NKFAD+T EF + G + H + L F + ++ +PP+V+W +GA
Sbjct: 92 NKFADMTTDEFRRTYAGSRTRHHRAQL-GEARSFAHAQHGRGGSGTTNLPPAVDWRLRGA 150
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
VT VK QGQC A+AAVEG+N I +LVSLSEQ+LVDC DN GC GG MD
Sbjct: 151 VTGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQ-GCDGGLMDY 209
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF+YI +N G+T ++ Y Y C+ K H I YEDVP N+E++L KAVA+Q
Sbjct: 210 AFQYIQRNGGVTTESNYPYLAEQRS-CNKAKERSHDVTIDGYEDVPANNEDALQKAVASQ 268
Query: 248 PVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
PV+VAI+AS QFYS GVF G C T L+HGV AVGYGT+ +G KYW +KNSWG+DWGE
Sbjct: 269 PVAVAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGE 328
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQ 338
GY R+QR + +G CGIAM S+P K +
Sbjct: 329 RGYIRMQRGVPDSRGLCGIAMEPSYPTKKPAGH 361
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 196/313 (62%), Gaps = 17/313 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++ + YK E RFE+F++NL+ +++ NN SY L LN+FADLT
Sbjct: 47 LLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEI---NSYWLGLNEFADLT 103
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G S + + F Y+ + +P SV+W +KGAV PVK QGQC
Sbjct: 104 HEEFKGRYLGLAKPQFSRKRQPSAN-FRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCW 162
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L SLSEQ+L+DC T N+GC GG MD AF+YII G+ +
Sbjct: 163 AFSTVAAVEGINQITTGNLSSLSEQELIDCDTT-FNSGCNGGLMDYAFQYIISTGGLHKE 221
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M GIC K + I+ YEDVP ND+ESL+KA+A+QPVSVAI+AS Q
Sbjct: 222 DDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQ 280
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GGVFNG C T L+HGV AVGYG+S +G Y ++KNSWG WGE G+ R++R+ +P+
Sbjct: 281 FYKGGVFNGQCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE 339
Query: 320 GQCGIAMFASFPV 332
G CGI AS+P
Sbjct: 340 GLCGINKMASYPT 352
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 153/330 (46%), Positives = 203/330 (61%), Gaps = 27/330 (8%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E ++ + +E+W+ + R + AE +RF FK N+ + N G+R Y LRLN+F
Sbjct: 39 EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKR--GDRPYRLRLNRFG 95
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTP-----FLYKS---SQVPPSVNWIEKGAVTPV 138
D++ EF A+ G ++SD A TP F+Y + S +P SV+W +KGAVT V
Sbjct: 96 DMSQAEFRATFAGSRVSDRRRDGPA--TPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGV 153
Query: 139 KYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QG+C V +VEGINAI+ +LVSLSEQ+L+DC T DN+ GC GG MD+AF+Y
Sbjct: 154 KNQGKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADND-GCEGGLMDNAFEY 212
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIK---AEDHAAQITNYEDVPPNDEESLLKAVANQP 248
I +N G+T +A Y Y + G C + K + I ++DVP N EE+L KAVANQP
Sbjct: 213 IKKNGGLTTEAAYPYRA-ANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQP 271
Query: 249 VSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
VSV IDAS A FYS GVF G C T L+HGV VGYG +E+G YW +KNSWG WGE
Sbjct: 272 VSVGIDASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEK 331
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSKES 336
GY R+++D G CGIAM AS+ V +S
Sbjct: 332 GYIRVEKDSGAEGGLCGIAMEASYAVKTDS 361
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 157/347 (45%), Positives = 219/347 (63%), Gaps = 24/347 (6%)
Query: 5 FLIVVLII--SGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
F+ VVL I S+AT R + SI + +QW Q+ R Y + E R ++ +
Sbjct: 6 FVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTE 65
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA-NGTPFLYK 119
NL +E FNN +GN+SY L +N+F D T +EF+A+ TG + + +S + N T +
Sbjct: 66 NLKFIESFNN--MGNQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWN 123
Query: 120 ---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
S + + +W +GAVTPVK QG+C A+AAVEG+ I L+SLSEQQL+
Sbjct: 124 WTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLL 183
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC T + NNGC GG +AF YII+++GI+++ Y Y+ + G C S A I +
Sbjct: 184 DC-TREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQ-VKEGPCRS--NARPAILIRGF 239
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTS 286
E+VP N+E +LL+AV+ QPV+VAIDAS F YSGGV+N C T +NH VT VGYGTS
Sbjct: 240 ENVPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTS 299
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
EG+KYWL KNSWG+ WGE+GY R++RD++ PQG CG+A +AS+PV+
Sbjct: 300 PEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 192/315 (60%), Gaps = 19/315 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I +E W ++G++Y E +RF+IFKDN + ++ N A +RS+ L LN+FADLT
Sbjct: 40 IMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAK--DRSFKLGLNRFADLT 97
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQCA- 145
+E+ + TG + D S K +G Y S +P SV+W E GAV VK QGQC
Sbjct: 98 NEEYRSKYTGIRTKD--SRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGS 155
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
++AVEGIN I +L++LSEQ+LVDC N GC GG MDDAF++II N GI
Sbjct: 156 CWAFSTISAVEGINQIATGKLITLSEQELVDC-DRSYNEGCNGGLMDDAFQFIINNGGID 214
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
+DA Y Y G G CD + I +YEDVP DE++L KA ANQP+SVAI+AS
Sbjct: 215 SDADYPYTGRD-GQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRD 273
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
QFY G+F G C T L+HGV VGYGT E G YW+++NSWG DWGE GY R++R I
Sbjct: 274 FQFYDSGIFTGKCGTDLDHGVVVVGYGT-ENGKDYWIVRNSWGADWGEKGYLRMERGISS 332
Query: 318 PQGQCGIAMFASFPV 332
G CGI S+PV
Sbjct: 333 KAGICGITSEPSYPV 347
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 149/351 (42%), Positives = 211/351 (60%), Gaps = 35/351 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
MAK L +L C++ R D+ ++A + E+W AQYGR YK+ AE ++RFE+FK
Sbjct: 3 MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
N+ +E FN GN + L +N+FADLT EF +++T GF S P
Sbjct: 63 ANVAFIESFN---AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTR-------VPTG 112
Query: 118 YKSSQV-----PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
+++ V P +++W KG VTP+K QGQC AVAA+EGI + +L+S S
Sbjct: 113 FRNENVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSL 172
Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA-EDHAA 224
+ + + GC GG MDDAFK+II+N G+T ++ Y Y + D K+ + A
Sbjct: 173 NKSLLTVMS---MGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD----DKFKSVSNSVA 225
Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVG 282
I YEDVP N+E +L+KAVANQPVSVA+D + QFY GGV G C T L+HG+ A+G
Sbjct: 226 SIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIG 285
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YG + +G KYWL+KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 286 YGKASDGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 150/334 (44%), Positives = 205/334 (61%), Gaps = 20/334 (5%)
Query: 9 VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
V+ C ++ D ++ ++F+ W ++GR YK + E RF I++ N+ ++
Sbjct: 21 VIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCK 80
Query: 69 NNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSV 127
N SY L NKFADLT +EF ++ G S+ L+++ T F Y + +P S
Sbjct: 81 NAQ---KNSYNLTDNKFADLTNEEFQSTYMGL-----STRLRSHNTGFRYDEHGDLPESK 132
Query: 128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
+W ++GAVT + QGQC AVAAVEGIN IK +L+SLSEQ+L+DC N GC
Sbjct: 133 DWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGC 192
Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
GG M+ A+ +II+N G+T + Y YEG+ G C KA +AA I+ YE+VP ++E L
Sbjct: 193 QGGLMETAYTFIIENGGLTTEQDYPYEGVD-GTCKMEKAAHYAASISGYEEVPADNEAKL 251
Query: 241 LKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNS 298
A A+QPVSVAIDA + QFYS GVF+G C LNHGVT VGYG E KYW++KNS
Sbjct: 252 KAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYG-KETINKYWIVKNS 310
Query: 299 WGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WG DWGE GY R++RD +G CGIAM AS+P+
Sbjct: 311 WGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 165/349 (47%), Positives = 220/349 (63%), Gaps = 36/349 (10%)
Query: 6 LIVVLIISG--SCASQA----TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
L+ VL I+ CA A + + E ++ + E+W ++GRTYK+ AE ++RF++FK
Sbjct: 18 LLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRTYKDEAEKARRFQVFK 77
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP---F 116
N V+ +NAA G + Y L +N+FAD+T EF+A TGFK L A G F
Sbjct: 78 ANAAFVDT-SNAAAGGKKYHLAINRFADMTHDEFMARYTGFK------PLPATGKKMPGF 130
Query: 117 LYK----SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
Y SS+ +V+W +KGAVT VK Q +C AVAA+EG++ I LVSLSE
Sbjct: 131 KYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSE 190
Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
QQLVDC+TN NNNGC GG M+DAF+Y+I N GI +A Y Y M G+C +++ A
Sbjct: 191 QQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQ-GMCQNVQP---AVA 246
Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNG-YCETFLNHGVTAVGYG 284
+ +Y+ VP +DE++L AVA QPVSVA+DA+ QFY GGV C T LNH VTAVGYG
Sbjct: 247 VRSYQQVPRDDEDALAAAVAGQPVSVAVDANNFQFYKGGVMTADSCGTNLNHAVTAVGYG 306
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
T+E+G YWL+KN WG WGE+GY RLQR + G CG+A AS+PV+
Sbjct: 307 TAEDGTPYWLLKNQWGSTWGEEGYLRLQRGV----GACGVAKDASYPVA 351
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 158/361 (43%), Positives = 217/361 (60%), Gaps = 26/361 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENSKR 54
MAK I + +++ S S A F E +A + +E+W+ + ++ E ++R
Sbjct: 1 MAKPKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRR 59
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS--SLKAN 112
F +FK+N+ + FN + Y L LNKF D+T QEF + G K+ H S ++ N
Sbjct: 60 FNVFKENVKFIHEFNQKK--DAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKN 117
Query: 113 GTPFLYKSSQVPP--SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSL 163
F+Y++ P S++W KGAVT VK QGQC +A+VEGIN IK LVSL
Sbjct: 118 TGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSL 177
Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
SEQ+LVDC T+ N GC GG MD AF++I Q GIT + Y Y G C S
Sbjct: 178 SEQELVDCDTS-YNEGCNGGLMDYAFEFI-QKNGITTEDSYPY-AEQDGTCASNLLNSPV 234
Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAV 281
I ++DVP N+E +L++AVANQP+SV+I+AS QFYS GVF G C T L+HGV V
Sbjct: 235 VSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIV 294
Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
GYG + +G KYW++KNSWG++WGE GY R+QR I +G+CGIAM AS+P+ K SA P +
Sbjct: 295 GYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI-KTSANPKN 353
Query: 342 A 342
+
Sbjct: 354 S 354
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 196/313 (62%), Gaps = 17/313 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++ + YK E RFE+F++NL+ +++ NN SY L LN+FADLT
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEI---NSYWLGLNEFADLT 103
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G S + + F Y+ + +P SV+W +KGAV PVK QGQC
Sbjct: 104 HEEFKGRYLGLAKPQFSRKRQPSAN-FRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCW 162
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L SLSEQ+L+DC T N+GC GG MD AF+YII G+ +
Sbjct: 163 AFSTVAAVEGINQITTGNLSSLSEQELIDCDTT-FNSGCNGGLMDYAFQYIISTGGLHKE 221
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M GIC K + I+ YEDVP ND+ESL+KA+A+QPVSVAI+AS Q
Sbjct: 222 DDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQ 280
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GGVFNG C T L+HGV AVGYG+S +G Y ++KNSWG WGE G+ R++R+ +P+
Sbjct: 281 FYKGGVFNGKCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE 339
Query: 320 GQCGIAMFASFPV 332
G CGI AS+P
Sbjct: 340 GLCGINKMASYPT 352
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 195/316 (61%), Gaps = 24/316 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE+W A+Y + Y E RFE+FKDNL ++ N +Y L LN FADLT EF
Sbjct: 66 FEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT---TYWLGLNAFADLTHDEF 122
Query: 94 IASQTGFKMSDHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
A+ G + + + K + F Y VP SV+W +KGAVT VK QGQC
Sbjct: 123 KATYLGLRQPE---TKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSCWAF 179
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
VAAVEGIN I L SLSEQ+LVDC+T D NNGC GG MD+AF YI + G+ +
Sbjct: 180 STVAAVEGINQIVTGNLTSLSEQELVDCST-DGNNGCNGGVMDNAFSYIASSGGLRTEEA 238
Query: 204 YSYEGMSTGICDSIKAED--HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G CD KA D I+ YEDVP NDE++L+KA+A+QP+SVAI+AS Q
Sbjct: 239 YPYL-MEEGDCDD-KARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGRHFQ 296
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVFNG C + L+HGV AVGYG+S +G Y ++KNSWG WGE GY R++R +P+
Sbjct: 297 FYSGGVFNGPCGSELDHGVAAVGYGSS-KGQDYIIVKNSWGSHWGEKGYIRMKRGTGKPE 355
Query: 320 GQCGIAMFASFPVSKE 335
G CGI AS+P +
Sbjct: 356 GLCGINKMASYPTKDQ 371
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 150/333 (45%), Positives = 205/333 (61%), Gaps = 36/333 (10%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E+FEQW ++GR Y ++ E +R E+++ N+ VE FN+ +GN Y L NKFADLT
Sbjct: 50 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNS--MGN-GYRLADNKFADLT 106
Query: 90 PQEFIASQTGFKM------SDHS---SSLKANGTPFLYKS--SQVPPSVNWIEKGAVTPV 138
+EF A GF + HS S++ G+ + + S +P SV+W EKGAV PV
Sbjct: 107 NEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPV 166
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QG C AVAA+EGIN IK +LVSLSEQ+LVDC T GC GG+M AF++
Sbjct: 167 KSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEF 224
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
+++N+G+T + Y Y+G++ G C + K ++ A I+ Y +V P+ E LL+A A QPVSV
Sbjct: 225 VMKNRGLTTERNYPYQGLN-GACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSV 283
Query: 252 AIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE----------EGIKYWLIKNSW 299
A+DA + Q Y GGVF G C LNHGVT VGYG ++ G KYW++KNSW
Sbjct: 284 AVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSW 343
Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G +WG+ GY +QR+ G CGIAM S+PV
Sbjct: 344 GPEWGDAGYILMQREASVASGLCGIAMLPSYPV 376
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 202/328 (61%), Gaps = 18/328 (5%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
+++++RT DE + +E W ++G+ Y E +RFE+FKDNL ++ N+ NR
Sbjct: 27 GTKSSWRTDDE--VMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSE---NR 81
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAV 135
+Y + LN+FADLT +E+ + G + L+ + + +P SV+W ++GAV
Sbjct: 82 TYRVGLNRFADLTNEEYRSMYLGALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAV 141
Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
VK QG C AVAAVEGIN I L+SLSEQ+LVDC N N GC GG MD
Sbjct: 142 VGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELVDC-DNSYNEGCNGGLMDYG 200
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
F++II N GI ++ Y Y G CD+ + I +YEDVP N+E +L KAVANQP
Sbjct: 201 FEFIINNGGIDSEEDYPYLARD-GRCDTYRKNARVVSIDSYEDVPVNNEAALQKAVANQP 259
Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
VSVAI+A Q YS GVF+G C T L+HGV AVGYGT E G YW+++NSWG+ WGE
Sbjct: 260 VSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYGT-ENGQDYWIVRNSWGKSWGES 318
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSK 334
GY R+ R+I +P G CGIAM AS+P+ K
Sbjct: 319 GYLRMARNIRKPTGICGIAMEASYPIKK 346
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 150/333 (45%), Positives = 205/333 (61%), Gaps = 36/333 (10%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E+FEQW ++GR Y ++ E +R E+++ N+ VE FN+ +GN Y L NKFADLT
Sbjct: 29 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNS--MGN-GYRLADNKFADLT 85
Query: 90 PQEFIASQTGFKM------SDHS---SSLKANGTPFLYKS--SQVPPSVNWIEKGAVTPV 138
+EF A GF + HS S++ G+ + + S +P SV+W EKGAV PV
Sbjct: 86 NEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPV 145
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QG C AVAA+EGIN IK +LVSLSEQ+LVDC T GC GG+M AF++
Sbjct: 146 KSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEF 203
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
+++N+G+T + Y Y+G++ G C + K ++ A I+ Y +V P+ E LL+A A QPVSV
Sbjct: 204 VMKNRGLTTERNYPYQGLN-GACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSV 262
Query: 252 AIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE----------EGIKYWLIKNSW 299
A+DA + Q Y GGVF G C LNHGVT VGYG ++ G KYW++KNSW
Sbjct: 263 AVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSW 322
Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G +WG+ GY +QR+ G CGIAM S+PV
Sbjct: 323 GPEWGDAGYILMQREASVASGLCGIAMLPSYPV 355
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 202/324 (62%), Gaps = 18/324 (5%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
AT RT +E + +EQW ++G+ Y E KRF+IFKDNL ++ N+A +R+Y
Sbjct: 47 ATLRTEEE--LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAE--DRTYK 102
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPV 138
L LN+FADLT +E+ A G K+ + K + + ++P SV+W ++GAV PV
Sbjct: 103 LGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPDSVDWRKEGAVPPV 162
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QG C A+ AVEGIN I L+SLSEQ+LVDC T N GC GG MD AF++
Sbjct: 163 KDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTG-YNQGCNGGLMDYAFEF 221
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
II N GI +D Y Y G+ G CD+ + I +YEDVP DE +L KAVANQPVSV
Sbjct: 222 IINNGGIDSDEDYPYRGVD-GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSV 280
Query: 252 AIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
AI+ Q Y GVF G C T L+HGV AVGYGT+ +G YW+++NSWG WGEDGY
Sbjct: 281 AIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA-KGHDYWIVRNSWGSSWGEDGYI 339
Query: 310 RLQRDI-DQPQGQCGIAMFASFPV 332
RL+R++ + G+CGIA+ S+P+
Sbjct: 340 RLERNLANSRSGKCGIAIEPSYPL 363
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 209/351 (59%), Gaps = 30/351 (8%)
Query: 6 LIVVLIISGSCASQAT------YRTFDEGSIA---EKFEQWKAQYGRTYKESAENSKRFE 56
L VL+++ SC + A +R F + ++ E F+ W R Y + E +RF+
Sbjct: 3 LSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERRFD 62
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS-SLKANGTP 115
++ DNL V +N G+ S+ L + +ADL+ E+ + G+ H L+A P
Sbjct: 63 VWLDNLRFVHEYN---AGHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRA--AP 117
Query: 116 FLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
FLY+ + P V+W+ KGAVTPVK Q C AVEG +AI +L SLSEQ L
Sbjct: 118 FLYEGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQML 177
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC + +NGC+GG MD AF++I++N GI + Y Y G+C K H I +
Sbjct: 178 VDC-DRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTA-EEGMCQDNKMRRHVVTIDD 235
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
Y+DVPPNDE +L+KAVANQPVSVAI+A A Q Y GGVF+ C T L+HGV VGYGT+
Sbjct: 236 YQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTA 295
Query: 287 EEG---IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
G + YWL+KNSWG +WG+ GY RL R++ + +GQCG+AM ASFP+ K
Sbjct: 296 SNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGE-EGQCGVAMQASFPIKK 345
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 149/314 (47%), Positives = 195/314 (62%), Gaps = 19/314 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E FE W +++ + Y+ E RFEIF DNL ++ N SY L LN+FADL+ +
Sbjct: 45 ELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNKKV---SSYWLGLNEFADLSHE 101
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA----- 145
EF + G ++ K + F Y + +P SV+W KGAVTPVK QG C
Sbjct: 102 EFKSKYLGLRVE---FPRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAF 158
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
VAAVEGIN I L SLSEQ+L+DC NNGCYGG MD AF+YI+ N G+ +
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDC-DRSFNNGCYGGLMDYAFQYIMSNSGLRKEED 217
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
Y Y M G C K + I+ YEDVP NDE+SLLKA+++QPVSVAI+AS+ QFY
Sbjct: 218 YPYL-MEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFY 276
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
GG+F G C T ++HGVTAVGYG+SE G Y ++KNSWG WGE+GY R++R+ +P+G
Sbjct: 277 KGGIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGL 335
Query: 322 CGIAMFASFPVSKE 335
CGI AS+P ++
Sbjct: 336 CGINQMASYPTKEK 349
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 157/348 (45%), Positives = 211/348 (60%), Gaps = 31/348 (8%)
Query: 8 VVLIISGSCASQAT------YRTFDEGS---IAEKFEQWKAQYGRTYKESAENSKRFEIF 58
++L+ G+C ++ + Y D S + E FE+W A++ + Y E RFE+F
Sbjct: 14 LLLLCVGACVARNSDFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEEKLHRFEVF 73
Query: 59 KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
KDNL +++ N SY L LN+FADLT EF A+ G D + + + + F Y
Sbjct: 74 KDNLKHIDKINREVT---SYWLGLNEFADLTHDEFKAAYLGL---DAAPARRGSSRSFRY 127
Query: 119 K---SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
+ +S +P SV+W +KGAVT VK QGQC VAAVEGINAI L +LSEQ+L
Sbjct: 128 EDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQEL 187
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGIC-DSIKAEDHAAQIT 227
+DC+ D N+GC GG MD AF YI + G+ + Y Y M G C D KAE A I+
Sbjct: 188 IDCSV-DGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYL-MEEGSCGDGKKAESEAVTIS 245
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGT 285
YEDVP NDE++L+KA+A+QPVSVAI+AS QFYSGGVF+G C L+HGV AVGYG+
Sbjct: 246 GYEDVPANDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYGS 305
Query: 286 SE-EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ +G Y +++NSWG WGE GY R++R +G CGI AS+P
Sbjct: 306 DKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSNGEGLCGINKMASYPT 353
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 151/316 (47%), Positives = 196/316 (62%), Gaps = 19/316 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W + +G+ Y E RFE+FK+NL +++ N SY L LN+FADL+
Sbjct: 43 LVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVT---SYWLGLNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF + G K + F Y+ +P S++W +KGAVTPVK QG C
Sbjct: 100 HEEFKSKFLGLYPE---FPRKKSSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCW 156
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L SLSEQQL+DC T+ NNGC GG MD AF++I+ N G+ +
Sbjct: 157 AFSTVAAVEGINQIVAGNLTSLSEQQLIDCDTS-FNNGCNGGLMDYAFEFIVNNGGLHKE 215
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G CD + E I+ Y DVP NDE+SLLKA+A+QP+SVAIDAS Q
Sbjct: 216 EDYPYL-MEEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQ 274
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G C T L+HGV AVGYG+S GI Y ++KNSWG WGE GY R++R+ +P+
Sbjct: 275 FYSGGVFSGPCGTDLDHGVAAVGYGSSS-GIDYIIVKNSWGPKWGERGYLRMKRNTGKPE 333
Query: 320 GQCGIAMFASFPVSKE 335
G CGI AS+P ++
Sbjct: 334 GLCGINKMASYPTKQK 349
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 193/311 (62%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +W A +GRTY E +R+++F+DNL ++ N AA G S+ L LN+FADLT E
Sbjct: 41 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 100
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
+ A+ G + K + +P SV+W KGAV VK QG C
Sbjct: 101 YRATYLGARTRPQRER-KLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFST 159
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II N GI + Y
Sbjct: 160 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDYP 218
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSG 263
Y+G + G CD + I +YEDVP NDE+SL KAVANQPVSVAI+A +A Q YS
Sbjct: 219 YKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 277
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGVTAVGYGT E G YW++KNSWG WGE GY R++R+I G+CG
Sbjct: 278 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 336
Query: 324 IAMFASFPVSK 334
IA+ S+P+ +
Sbjct: 337 IAVEPSYPLKE 347
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 158/358 (44%), Positives = 223/358 (62%), Gaps = 29/358 (8%)
Query: 1 MAKYFL---IVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
M K FL ++ +I+ + + + T R E S+ + +E+W++ + ++ +E KRF
Sbjct: 3 MGKAFLFAVVLAVILVAAMSMEITERDLASEESLWDLYERWRSHH-TVSRDLSEKRKRFN 61
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP---QEFIASQTGFKMSDHSSSLKANG 113
+FK N+ + + N ++ Y L+LN FAD+T +EF +S+ H S +AN
Sbjct: 62 VFKANVHHIHKVNQK---DKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGS--RAN- 115
Query: 114 TPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSE 165
T F++ K+ +P SV+W ++GAVT VK QG+C V VEGIN IK +LVSLSE
Sbjct: 116 TGFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSE 175
Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
Q+LVDC T+ N GC GG M++A+++I ++ GIT + +Y Y+ G CDS K A
Sbjct: 176 QELVDCETD--NEGCNGGLMENAYEFIKKSGGITTERLYPYKARD-GSCDSSKMNAPAVT 232
Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNG-YCETFLNHGVTAVG 282
I +E VP NDE +L+KAVANQPVSVAIDAS +QFYS GV+ G C L+HGV VG
Sbjct: 233 IDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVG 292
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ-CGIAMFASFPVSKESAQP 339
YGT+ +G KYW++KNSWG WGE GY R+QR +D +G CGIAM AS+P+ S P
Sbjct: 293 YGTALDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLKLSSHNP 350
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 193/311 (62%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +W A +GRTY E +R+++F+DNL ++ N AA G S+ L LN+FADLT E
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
+ A+ G + K + +P SV+W KGAV VK QG C
Sbjct: 106 YRATYLGARTRPQRER-KLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFST 164
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II N GI + Y
Sbjct: 165 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDYP 223
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSG 263
Y+G + G CD + I +YEDVP NDE+SL KAVANQPVSVAI+A +A Q YS
Sbjct: 224 YKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 282
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGVTAVGYGT E G YW++KNSWG WGE GY R++R+I G+CG
Sbjct: 283 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 341
Query: 324 IAMFASFPVSK 334
IA+ S+P+ +
Sbjct: 342 IAVEPSYPLKE 352
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 154/322 (47%), Positives = 198/322 (61%), Gaps = 26/322 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE++ A+Y + Y E +RFE+FKDNL ++ N G Y L LN+FADLT
Sbjct: 48 LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITG---YWLGLNEFADLT 104
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA- 145
EF A+ G ++ + +N F Y+ ++ +P V+W +KGAVT VK QGQC
Sbjct: 105 HDEFKAAYLGLTLT--PARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQCGS 162
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
VAAVEGINAI L LSEQ+L+DC T D NNGC GG MD AF YI N G+
Sbjct: 163 CWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDT-DGNNGCSGGLMDYAFSYIAANGGLH 221
Query: 200 NDAVYSYEGMSTGIC--DSIKAEDH-----AAQITNYEDVPPNDEESLLKAVANQPVSVA 252
+ Y Y M G C S + +D A I+ YEDVP N+E++LLKA+A+QPVSVA
Sbjct: 222 TEESYPYL-MEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSVA 280
Query: 253 IDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
I+AS QFYSGGVF+G C T L+HGVTAVGYGT+ +G Y ++KNSWG WGE GY R
Sbjct: 281 IEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVKNSWGSHWGEKGYIR 340
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
++R + G CGI AS+P
Sbjct: 341 MRRGTGKHDGLCGINKMASYPT 362
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 152/319 (47%), Positives = 197/319 (61%), Gaps = 20/319 (6%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
+E ++E+F W ++G+ Y E++ R+ ++KDNL ++R + NRSY L L KF
Sbjct: 38 NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEK---NRSYWLGLTKF 94
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
AD+T EF TG ++ S + G F Y S+ P SV+W +KGAVT VK QG C
Sbjct: 95 ADITNDEFRRQYTGTRIDRSKRSKRKTG--FRYADSEAPESVDWRKKGAVTTVKDQGSCG 152
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
A+ +VEGINAI+ VSLSEQ+LVDC + N GC GG MD AF +I++N GI
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDL-EYNQGCNGGLMDYAFDFILENGGI 211
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
+ Y Y+G+ G CD+ K H I YEDVP NDEE+L KAVA QPVSVAI+A
Sbjct: 212 DTENDYPYKGLD-GRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGR 270
Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
Q YSGGVF G C T L+HGV AVGYG SE + YW++KNSWG+ WGE GY R+QR+I
Sbjct: 271 DFQLYSGGVFTGECGTDLDHGVLAVGYG-SEGSLDYWIVKNSWGEYWGESGYLRMQRNIK 329
Query: 317 QPQ---GQCGIAMFASFPV 332
G CGI + S+ V
Sbjct: 330 DSNHQFGLCGINIEPSYAV 348
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 202/338 (59%), Gaps = 22/338 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
MAK L +L C++ R D+ ++A + E+W AQYGR YK+ AE ++RFE+FK
Sbjct: 3 MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ +E FN GN + L +N+FADLT EF +++T ++ +
Sbjct: 63 ANVAFIESFN---AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTGFRNENVN 119
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCAVA-AVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+P +++W KG VTP+K QGQC A + A+ ++LVDC + +
Sbjct: 120 IDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAM----------EELVDCDVHGEDQ 169
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDE 237
GC GG MDDAFK+II+N G+T ++ Y Y + D K+ + A I YEDVP N+E
Sbjct: 170 GCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD----DKFKSVSNSVASIKGYEDVPANNE 225
Query: 238 ESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
+L+KAVANQPVSVA+D + QFY GGV G C T L+HG+ A+GYG + +G KYWL+
Sbjct: 226 AALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLL 285
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KNSWG WGE+G+ R+++DI +G CG+AM S+P +
Sbjct: 286 KNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 155/352 (44%), Positives = 216/352 (61%), Gaps = 28/352 (7%)
Query: 5 FLIVVLIISGSCASQA------------TYRTFDEGSIAEKFEQWKAQYGRTYKESAENS 52
F++ VL+++ C + A ++ + E+W A++GRTY + AE +
Sbjct: 6 FVLTVLVVASVCTAAAPRALAVRELAGEEESAAVAAAMVSRHEKWMAEHGRTYTDEAEKA 65
Query: 53 KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
+R EIF+ N ++ FN+A G S+ L N+FADLT +EF A++TGF+ ++ +
Sbjct: 66 RRLEIFRANAEFIDSFNDA--GKHSHRLATNRFADLTDEEFRAARTGFRPRPAPAAAAGS 123
Query: 113 GTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVS 162
G F Y++ + SV+W GAVT VK QG+C AVAAVEG+N I+ RLVS
Sbjct: 124 GGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAVAAVEGLNKIRTGRLVS 183
Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
LSEQ+LVDC N + GC GG MDDAF++I + G+ +++ Y Y+G G C S A
Sbjct: 184 LSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASESGYPYQG-DDGSCRSSAAAAR 242
Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTA 280
AA I +EDVP N+E +L AVANQPVSVAI+ A +FY GV G C T LNH +TA
Sbjct: 243 AASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRFYDSGVLGGECGTDLNHAITA 302
Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
VGYGT+ +G KYWL+KNSWG WGE GY R++R + + +G CG+A S+PV
Sbjct: 303 VGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 353
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/345 (42%), Positives = 206/345 (59%), Gaps = 51/345 (14%)
Query: 3 KYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
K ++ +L + C + R D+ ++ + EQW AQY R YK+++E ++RF
Sbjct: 5 KASILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRF------ 58
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYK 119
KFADLT EF + +T GFK SS++K T F Y+
Sbjct: 59 ----------------------KFADLTNHEFRSVKTNKGFK----SSNMKIL-TGFRYE 91
Query: 120 ---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+ +P +++W KG VTP+K QGQC AVAA EGI I +LVSL++Q+LV
Sbjct: 92 NVSADALPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELV 151
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + + GC GG MDDAFK+II+N G+T ++ Y Y + G C+S + AA I Y
Sbjct: 152 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-ADGKCNS--GSNSAATIKGY 208
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
EDVP NDE +L+KA+ANQPVSVA+D + +FYSGGV G C T L+HG+ A+GYG +
Sbjct: 209 EDVPANDEAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTS 268
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYWL+KNSWG WGE+GY R+++DI +G CG+AM S+P
Sbjct: 269 DGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 313
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 167/356 (46%), Positives = 218/356 (61%), Gaps = 31/356 (8%)
Query: 9 VLIISGSCASQATY-RTFD--------EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
+L IS S A T TFD E S+ +E+W++ + T + E RF +FK
Sbjct: 6 LLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVT-RNLDEKHNRFNVFK 64
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGTPF 116
N++ V N ++ Y L+LNKF D+T EF K+S H NGT F
Sbjct: 65 ANVMHVHNTNKL---DKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENGT-F 120
Query: 117 LYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
+Y+++ VP S++W KGAVT VK QGQC +AAVEGIN IK +LVSLSEQQL
Sbjct: 121 MYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQL 180
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC T +N GC GG M+ AF++I QN GIT ++ Y Y G CD ++ ED A I
Sbjct: 181 VDCDTEENE-GCNGGLMEYAFEFIKQN-GITTESNYPY-AAKDGTCD-VEKEDKAVSIDG 236
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
+E+VP N+E +LLKA A QPVSVAIDA QFYS GVF G+C+T LNHGV VGYG +
Sbjct: 237 HENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGYGVT 296
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
++ KYW++KNSWG +WGE GY R+QR I +G CGIAM AS+P+ K S +P+ +
Sbjct: 297 QDRTKYWIMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPIKKSSTKPTES 352
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 153/328 (46%), Positives = 199/328 (60%), Gaps = 16/328 (4%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNR 76
S +Y E + +W A +GRTY E +RFE+F+DNL V+ N AA G
Sbjct: 30 SIVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVH 89
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAV 135
S+ L LN+FADLT E+ A+ G + + G +L ++ +P SV+W KGAV
Sbjct: 90 SFRLGLNRFADLTNDEYRATYLGVRSRPQRE--RRLGDRYLAGDNEDLPESVDWRAKGAV 147
Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
VK QG C +AAVEGIN I ++SLSEQ+LVDC T+ N GC GG MD A
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTS-YNQGCNGGLMDYA 206
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
F++II N GI + Y Y+G + G CD + I +YEDVP N E+SL KAVANQP
Sbjct: 207 FEFIINNGGIDTEEDYPYKG-TDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQP 265
Query: 249 VSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
+SVAI+A A Q Y+ G+F G C T L+HGVTAVGYGT E G YW++KNSWG WGE
Sbjct: 266 ISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGES 324
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSK 334
GY R++R+I G+CGIA+ S+P+ K
Sbjct: 325 GYVRMERNIKASSGKCGIAVEPSYPLKK 352
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 149/313 (47%), Positives = 192/313 (61%), Gaps = 16/313 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W + + + Y+ E RFE+FKDNL ++ N +SY L LN+FADL+
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG---KSYWLGLNEFADLS 103
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G K + + F Y+ + VP SV+W +KGAV VK QG C
Sbjct: 104 HEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L +LSEQ+L+DC T NNGC GG MD AF+YI++N G+ +
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G C+ K E I ++DVP NDE+SLLKA+A+QP+SVAIDAS Q
Sbjct: 223 EDYPY-SMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQ 281
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G C L+HGV AVGYG+S+ G Y ++KNSWG WGE GY RL+R+ +P+
Sbjct: 282 FYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPE 340
Query: 320 GQCGIAMFASFPV 332
G CGI ASFP
Sbjct: 341 GLCGINKMASFPT 353
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 149/312 (47%), Positives = 195/312 (62%), Gaps = 16/312 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +W A +GRTY E +RFE+F+DNL V+ N AA G S+ L LN+FADLT E
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA------ 145
+ A+ G + + G +L ++ +P SV+W KGAV +K QG C
Sbjct: 106 YRATYLGVRSRPQRE--RRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQGSCGSCWAFS 163
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
+AAVEGIN I ++SLSEQ+LVDC T+ N GC GG MD AF++II N GI + Y
Sbjct: 164 TIAAVEGINQIVTGDMISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEEDY 222
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
Y+G + G CD + I +YEDVP N E+SL KAVANQP+SVAI+A A Q Y+
Sbjct: 223 PYKG-TDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYN 281
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F G C T L+HGVTAVGYGT E G YW++KNSWG WGE GY R++R+I G+C
Sbjct: 282 SGIFTGTCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKC 340
Query: 323 GIAMFASFPVSK 334
GIA+ S+P+ K
Sbjct: 341 GIAVEPSYPLKK 352
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 149/312 (47%), Positives = 191/312 (61%), Gaps = 17/312 (5%)
Query: 37 WKAQYGRTYKES-AENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFI 94
W+A++G S E +RF F DNL V+ N AA G + L +N+FADLT EF
Sbjct: 55 WRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFR 114
Query: 95 ASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQC------- 144
A+ G K + S +A G Y+ ++P +V+W EKGAV PVK QGQC
Sbjct: 115 AAYLGVKGAGQRRSARA-GVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFS 173
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
AV+AVE IN + LV+LSEQ+LV+C N +NGC GG MDDAF +II N GI + Y
Sbjct: 174 AVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDY 233
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A Q Y
Sbjct: 234 PYKALD-GKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYH 292
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GVF G C T L+HGV AVGYGT E G YW+++NSWG WGE GY R++R+I+ G+C
Sbjct: 293 SGVFTGRCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKC 351
Query: 323 GIAMFASFPVSK 334
GIAM +S+P K
Sbjct: 352 GIAMMSSYPTKK 363
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 149/325 (45%), Positives = 201/325 (61%), Gaps = 20/325 (6%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
DE + ++E W A++GR Y E KRFEIFKDNL +E NN+ GNR+Y + LN+F
Sbjct: 42 DEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNS--GNRTYKVGLNQF 99
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQG 142
ADLT +E+ G K +K+ Y S +P SV+W ++GAV P+K QG
Sbjct: 100 ADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQG 159
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C VAAVEGIN I +++LSEQ+LVDC N+GC GG MD AF++II N
Sbjct: 160 SCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDC-DRVQNSGCNGGLMDYAFEFIISN 218
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+ + Y Y G+ G CD ++ I YEDVP N E +L KAVA+QPV VAI+A
Sbjct: 219 GGMDTEKHYPYRGVE-GRCDPVRKNYKVVSIDGYEDVPRN-ERALQKAVAHQPVCVAIEA 276
Query: 256 S--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S A Q YS GVF G C ++HGV VGYG SE+G+ YW+++NSWG WGE+GY +++R
Sbjct: 277 SGRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMER 335
Query: 314 DIDQPQ-GQCGIAMFASFPVSKESA 337
++ + G+CGI AS+P +K+SA
Sbjct: 336 NVKKSHLGKCGIMTEASYP-TKDSA 359
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 192/311 (61%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +W A +GRTY +R+++F+DNL ++ N AA G S+ L LN+FADLT E
Sbjct: 44 YAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
+ A+ G + K + +P SV+W KGAV VK QG C
Sbjct: 104 YPATYLGARTRPQRDR-KLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAFST 162
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II N GI + Y
Sbjct: 163 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDYP 221
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSG 263
Y+G + G CD + I +YEDVP NDE+SL KAVANQPVSVAI+A +A Q YS
Sbjct: 222 YKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 280
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGVTAVGYGT E G YW++KNSWG WGE GY R++R+I G+CG
Sbjct: 281 GIFTGSCGTRLDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 339
Query: 324 IAMFASFPVSK 334
IA+ S+P+ +
Sbjct: 340 IAVEPSYPLKE 350
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 205/338 (60%), Gaps = 22/338 (6%)
Query: 7 IVVL---IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
IV+L II+ +C T + + + +++E W +YGR Y++ E RF+I++ N+
Sbjct: 9 IVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIYQSNVQ 68
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
+E +N+ N SY L N+FAD+T +EF ++ G+ + +K ++
Sbjct: 69 YIEFYNSQ---NYSYKLIDNRFADITNEEFKSTYLGY-----LPRFRVQTEFRYHKHGEL 120
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P S++W +KGAVT VK QG+C AVAAVEGIN IK LVSLSEQQL+DC
Sbjct: 121 PKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSG 180
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG M AF YI ++ GI Y Y+G G C+ KA+++A I+ YE VP +
Sbjct: 181 NEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRD-GNCNKSKAKNNAVTISGYESVPARN 239
Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E+ L AVA+QPVS+A DA A QFYS G+F+G C LNHG+T VGYG E G KYW+
Sbjct: 240 EKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG-EENGDKYWI 298
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+KNSW DWGE GY R++RD G CGIAM A++PV
Sbjct: 299 VKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 150/319 (47%), Positives = 197/319 (61%), Gaps = 16/319 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E + E FE W ++G++Y E KRF+IF+DNL ++ N ++ NRSY L LN+FA
Sbjct: 43 EDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKN--SLENRSYKLGLNRFA 100
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQVPPSVNWIEKGAVTPVKYQGQCA 145
D+T +E+ G K + +K+ + +P S++W EKGAVT VK QG C
Sbjct: 101 DITNEEYRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCG 160
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
+AAVEG+N + L+SLSEQ+LVDC N GC GG M AF++II+N GI
Sbjct: 161 SCWAFSTIAAVEGVNQLATGNLISLSEQELVDC-DRKINQGCNGGDMGYAFQFIIKNGGI 219
Query: 199 TNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
++ Y Y G G CDS + + A I YE+VP N+E+SL KAVANQPVSVAI+A
Sbjct: 220 DSEEDYPYTG-KDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGG 278
Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
Q YS G+F G C T L+HGV AVGYGT E G+ YW++KNSWG WGE GY R+QR++
Sbjct: 279 YDFQLYSSGIFTGSCGTDLDHGVAAVGYGT-ENGVDYWIVKNSWGDYWGEKGYVRMQRNV 337
Query: 316 DQPQGQCGIAMFASFPVSK 334
G CGIAM AS+P K
Sbjct: 338 KAKTGLCGIAMEASYPTKK 356
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 147/321 (45%), Positives = 198/321 (61%), Gaps = 19/321 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
E + +E W ++GR E+ RF +F DNL V+ N A G + L +N+F
Sbjct: 49 EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERA-GEHGFRLGMNQF 107
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQG 142
ADLT EF A+ G ++ ++ N +Y+ + ++P SV+W EKGAV PVK QG
Sbjct: 108 ADLTNDEFRAAYLGARIP---AARSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQG 164
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC AV++VE IN I +V+LSEQ+LV+C+T+ N+GC GG MD AF +II+N
Sbjct: 165 QCGSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKN 224
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
GI + Y Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A
Sbjct: 225 GGIDTEDDYPYKAVD-GKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEA 283
Query: 256 SALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
QF Y GVF+G C T L+HGV AVGYGT E G YW+++NSWG WGE GY R++R
Sbjct: 284 GGRQFQLYKSGVFSGSCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGPKWGEAGYIRMER 342
Query: 314 DIDQPQGQCGIAMFASFPVSK 334
+I+ G+CGIAM AS+P K
Sbjct: 343 NINATTGKCGIAMMASYPTKK 363
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 206/325 (63%), Gaps = 20/325 (6%)
Query: 23 RTFDEGSI-AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLR 81
R G I +E+ E+W AQYG+ YK++AE KRF++FK+N+ +E FN A G++ + L
Sbjct: 23 RVMSRGLITSERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFN--AAGDKPFNLS 80
Query: 82 LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKY 140
+N+FADL +EF A + S A T F Y++ +++P +++W ++GAVTP+K
Sbjct: 81 INQFADLHDEEFKALLNNVQ-KKASRVETATETSFRYENVTKIPSTMDWRKRGAVTPIKD 139
Query: 141 QG----QC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QG C VA VE ++ I LVSLSEQ+LVDC D+ GC GG++++AF++I
Sbjct: 140 QGYTCGSCWAFATVATVESLHQITTGELVSLSEQELVDCVRGDSE-GCRGGYVENAFEFI 198
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPPNDEESLLKAVANQPVSV 251
GIT++A Y Y+G +K E H A+I YE VP N E++LLKAVANQPVSV
Sbjct: 199 ANKGGITSEAYYPYKGKDRSC--KVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSV 256
Query: 252 AIDASAL--QFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
IDA A+ +FYS G+F C T L+H V VGYG +G KYWL+KNSW WGE GY
Sbjct: 257 YIDAGAIAFKFYSSGIFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGY 316
Query: 309 FRLQRDIDQPQGQCGIAMFASFPVS 333
R++RDI +G CGIA AS+P++
Sbjct: 317 MRIKRDIRAKKGLCGIASNASYPIA 341
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 151/338 (44%), Positives = 207/338 (61%), Gaps = 19/338 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
I L ++ + S +++R+ DE + ++ W Q+G+ Y E KRFEIFKDNL ++
Sbjct: 20 ISTLTLNQNHPSSSSWRSDDE--VMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFID 77
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKS-SQV 123
N+ N +Y L LNKFADLT QE+ A G + +K+ + + +++ +
Sbjct: 78 EHNSN--NNTTYKLGLNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDNL 135
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P SV+W + GAV+PVK QG C +A VEGIN I LVSLSEQ+LVDC +
Sbjct: 136 PDSVDWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRS-Y 194
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
+ GC GG MD AF++I+ N GI + Y Y G + CD K I YEDVP N+
Sbjct: 195 DAGCNGGLMDYAFQFIMDNGGIDTEKDYPYLGFNNQ-CDPTKKNAKVVSIDGYEDVP-NN 252
Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E +L KAVA+QPVS+AI+A A Q Y GVFNG C L+HGV AVGYGT + G YW+
Sbjct: 253 ENALKKAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWI 312
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
++NSWG +WGE+GY R++R+I+ G+CGIAM AS+PV
Sbjct: 313 VRNSWGSNWGENGYIRMERNINANTGKCGIAMEASYPV 350
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 153/327 (46%), Positives = 200/327 (61%), Gaps = 14/327 (4%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNR 76
S +Y E + + +W A+ GRTY E +RFE+F+DNL V++ N AA G
Sbjct: 26 SIVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLH 85
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
S+ L LN+FADLT +E+ + G + + + +G + ++P SV+W EKGAV
Sbjct: 86 SFRLGLNRFADLTNEEYRDTYLGVR-TKPVRERRLSGRYQAADNEELPESVDWREKGAVA 144
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
VK QG C A+AAVEGIN I +++LSEQ+LVDC T+ N GC GG MD AF
Sbjct: 145 KVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTS-YNQGCNGGLMDYAF 203
Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
++II N GI ++ Y Y+ CD+ K I YEDVP N E SL KAVANQP+
Sbjct: 204 EFIINNGGIDSEEDYPYKERDN-RCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPI 262
Query: 250 SVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
SVAI+A A Q Y G+F G C T L+HGVTAVGYG SE G YW++KNSWG WGEDG
Sbjct: 263 SVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYG-SENGKDYWIVKNSWGTVWGEDG 321
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVSK 334
Y RL+R+I G+CGIA+ S+P+ K
Sbjct: 322 YVRLERNIKATSGKCGIAIEPSYPLKK 348
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 156/317 (49%), Positives = 197/317 (62%), Gaps = 21/317 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+A +F W ++G+ Y + E + RF ++KDNL ++R + N SY L L KFADLT
Sbjct: 41 LAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEK---NLSYWLGLTKFADLT 97
Query: 90 PQEFIASQTGFKMSDHSSSLKA--NGT-PFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
+EF TG ++ D S LK N T F Y +S+ P S++W EKGAVT VK QG C
Sbjct: 98 NEEFRRQYTGTRI-DRSRRLKKGRNATGSFRYANSEAPKSIDWREKGAVTSVKDQGSCGS 156
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
AV +VEGINAI+ +SLS Q+LVDC N GC GG MD AF ++IQN GI
Sbjct: 157 CWAFSAVGSVEGINAIRTGDAISLSVQELVDC-DKKYNQGCNGGLMDYAFDFVIQNGGID 215
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
+ Y Y+G G CD K I +YEDVP NDEE+L KAVA QPVSVAI+A
Sbjct: 216 TEKDYPYQGYD-GRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRD 274
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-- 315
Q YSGGVF G C T L+HGV AVGYG SE+G+ YW++KNSWG+ WGE GY R+QR++
Sbjct: 275 FQLYSGGVFTGRCGTDLDHGVLAVGYG-SEKGLDYWIVKNSWGEYWGESGYLRMQRNLKD 333
Query: 316 DQPQGQCGIAMFASFPV 332
D G CGI + S+ V
Sbjct: 334 DNGYGLCGINIEPSYAV 350
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 158/362 (43%), Positives = 207/362 (57%), Gaps = 36/362 (9%)
Query: 7 IVVLIISGSCASQAT------YRTFDEGSI---AEKFEQW----KAQYGRTYKESAE-NS 52
+ VL+++ SC + A +R F + +I E F+ W K R Y SAE
Sbjct: 10 LSVLLVACSCLAVAAGFRFENHRLFIQQAIESPREAFDFWVHTVKPPSNRAYASSAEVYE 69
Query: 53 KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS-LKA 111
+RF I+ DNL +N + S+ L + +ADL+ E+ + G+ H L+A
Sbjct: 70 RRFNIWLDNLRFAHEYNAR---HTSHWLSMGVYADLSQDEYRSKALGYNAHLHKKRPLRA 126
Query: 112 NGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
PFLYK + P V+W+ GAVTPVK Q C AVEG NAI +LVSLS
Sbjct: 127 --APFLYKGTVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSLS 184
Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
EQ LVDC + + GC GGFMD AF +I+ N GI + Y Y GIC + H
Sbjct: 185 EQMLVDC-DREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRA-EDGICQDNRTRRHVV 242
Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVG 282
I Y+DVPPNDE +L+KAVA+QPVSVAI+A A Q Y GGVF+ C T L+H V VG
Sbjct: 243 TIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVG 302
Query: 283 YGTSEEG---IKYWLIKNSWGQDWGEDGYFRLQRDI--DQPQGQCGIAMFASFPVSKESA 337
YGT+ G + YWL+KNSWG +WGE GY RL R++ D P+GQCG+AM+ASFP+ K +
Sbjct: 303 YGTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPIKKGAN 362
Query: 338 QP 339
P
Sbjct: 363 PP 364
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 198/317 (62%), Gaps = 18/317 (5%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
++ + +E+W+ + + E +RF FKDN+ + N G R Y LRLN+F D+
Sbjct: 41 ALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKR--GGRGYRLRLNRFGDM 97
Query: 89 TPQEFIASQTGFKMSD-HSSSLKANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA 145
+EF A+ G +D L A P F+Y+ + +P +V+W KGAVT VK QG+C
Sbjct: 98 GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 157
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
V +VEGINAI+ RLVSLSEQ+L+DC T DN+ GC GG M++AF+YI + GI
Sbjct: 158 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNS-GCQGGLMENAFEYIKHSGGI 216
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
T ++ Y Y + G CD+++A I +++VP N E +L KAVANQPVSVAIDA
Sbjct: 217 TTESAYPYR-AANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGD 275
Query: 257 -ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
+ QFYS GVF G C T L+HGV VGYG + +G +YW++KNSWG WGE GY R+QRD
Sbjct: 276 QSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDS 335
Query: 316 DQPQGQCGIAMFASFPV 332
G CGIAM AS+PV
Sbjct: 336 GYDGGLCGIAMEASYPV 352
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 153/323 (47%), Positives = 198/323 (61%), Gaps = 20/323 (6%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
RT D+ + +E W ++ + Y E RF IFKDN+ V+R N ++ N+SY L L
Sbjct: 51 RTHDQ--LLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHN--SMRNQSYKLGL 106
Query: 83 NKFADLTPQEFIASQTGFKM--SDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVK 139
NKFADLT E+ + KM + + F+++ +P SV+W ++GAV PVK
Sbjct: 107 NKFADLTNDEYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVK 166
Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QGQC V AVEGIN I L+SLSEQ+LVDC N N GC GG MD AF++I
Sbjct: 167 DQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDC-DNGYNQGCNGGLMDYAFEFI 225
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
++N GI + Y Y+G+ G+CD + I YEDVP NDE+SL KAVA+QPVSVA
Sbjct: 226 VKNGGIDTEDDYPYKGVD-GLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVA 284
Query: 253 IDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
I+A A Q Y GVF G C T L+HGV AVGYG SE G YW+++NSWG DWGE GY R
Sbjct: 285 IEAGGRAFQLYESGVFTGQCGTELDHGVVAVGYG-SENGKDYWIVRNSWGPDWGESGYIR 343
Query: 311 LQRDI-DQPQGQCGIAMFASFPV 332
L+R++ G+CGIAM AS+P
Sbjct: 344 LERNVASTSTGKCGIAMQASYPT 366
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 157/365 (43%), Positives = 215/365 (58%), Gaps = 33/365 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRT-FDEGSIA------EKFEQWKAQYGRTYKESAENSK 53
++K L+V L+ S A + FDE +A + +E+W+ + R ++ E +
Sbjct: 4 VSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGR 62
Query: 54 RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD----HSSSL 109
RF FK+N+ + N G+R Y LRLN+F D+ +EF ++ +++D S +
Sbjct: 63 RFGTFKENVRFIHAHNKR--GDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAA 120
Query: 110 KANGTP-FLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRL 160
+A P F+Y S+ PP SV+W ++GAVT VK QG C V AVEGINAI+ L
Sbjct: 121 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSL 180
Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
SLSEQ+L+DC T++N GC GG M++AF++I GIT +A Y Y S G CD +A
Sbjct: 181 ASLSEQELIDCDTDEN--GCQGGLMENAFEFIKSFGGITTEAAYPYR-ASNGTCDGDRAR 237
Query: 221 ---DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLN 275
I ++ VP E++L KAVA+QPVSVA+DA A QFYS GVF G C T L+
Sbjct: 238 RGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLD 297
Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKE 335
HGV AVGYG ++G YW++KNSWG WGE GY R+QR G CGIAM ASFP+ K
Sbjct: 298 HGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KT 355
Query: 336 SAQPS 340
S P+
Sbjct: 356 SPNPA 360
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 200/325 (61%), Gaps = 20/325 (6%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
DE + ++E W A++GR Y E KRFEIFKDNL +E NN+ GNR+Y + LN+F
Sbjct: 42 DEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNS--GNRTYKVGLNQF 99
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQG 142
ADLT +E+ G K +K+ Y S +P SV+W ++GAV P+K QG
Sbjct: 100 ADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQG 159
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C VAAV GIN I +++LSEQ+LVDC N+GC GG MD AF++II N
Sbjct: 160 SCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDC-DRVQNSGCNGGLMDYAFEFIISN 218
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+ + Y Y G+ G CD ++ I YEDVP N E +L KAVA+QPV VAI+A
Sbjct: 219 GGMDTEKHYPYRGVE-GRCDPVRKNYKVVSIDGYEDVPRN-ERALQKAVAHQPVCVAIEA 276
Query: 256 S--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S A Q YS GVF G C ++HGV VGYG SE+G+ YW+++NSWG WGE+GY +++R
Sbjct: 277 SGRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMER 335
Query: 314 DIDQPQ-GQCGIAMFASFPVSKESA 337
++ + G+CGI AS+P +K+SA
Sbjct: 336 NVKKSHLGKCGIMTEASYP-TKDSA 359
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 153/314 (48%), Positives = 195/314 (62%), Gaps = 18/314 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ FE+W A+Y + Y E +RFE+FKDNL ++ N + SY L LN FADLT
Sbjct: 68 LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEV--TSYWLGLNAFADLT 125
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
EF A+ G + +S + +VP SV+W +KGAVT VK QGQC
Sbjct: 126 HDEFKATYLGL-LPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCGSCWA 184
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQQLVDC+T D NNGC GG MD+AF +I G+ ++
Sbjct: 185 FSTVAAVEGINQIVTGNLTSLSEQQLVDCST-DGNNGCSGGVMDNAFSFIATGAGLRSEE 243
Query: 203 VYSYEGMSTGICDSIKAEDHAAQIT--NYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
Y Y M G CD +A D +T YEDVP NDE++L+KA+A+QPVSVAI+AS
Sbjct: 244 AYPYL-MEEGDCDD-RARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHF 301
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
QFYSGGVF+G C + L+HGV AVGYG+S+ G Y ++KNSWG WGE GY R++R +P
Sbjct: 302 QFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRGTGKP 360
Query: 319 QGQCGIAMFASFPV 332
+G CGI AS+P
Sbjct: 361 EGLCGINKMASYPT 374
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 153/341 (44%), Positives = 206/341 (60%), Gaps = 27/341 (7%)
Query: 25 FDEGSIA------EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
FDE +A + +E+W+ + R ++ E +RF FK+N + N G+R Y
Sbjct: 27 FDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKR--GDRPY 83
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTP-FLYK-SSQVPPSVNWIEKGAV 135
LRLN+F D+ +EF + +++D A P F+Y ++ +P SV+W +KGAV
Sbjct: 84 RLRLNRFGDMGREEFRSGFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAV 143
Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
T VK QG+C V AVEGINAI+ LVSLSEQ+L+DC T++N GC GG M++A
Sbjct: 144 TAVKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDEN--GCQGGLMENA 201
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQ 247
F++I + GIT ++ Y Y S G CD +A I ++ VP E++L KAVA+Q
Sbjct: 202 FEFIKSHGGITTESAYPYH-ASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQ 260
Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
PVSVAIDA ALQFYS GVF G C T L+HGV AVGYG S++G YW++KNSWG WGE
Sbjct: 261 PVSVAIDAGGQALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGE 320
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
GY R+QR G CGIAM ASFP+ K S PS + +
Sbjct: 321 GGYIRMQRGTGN-GGLCGIAMEASFPI-KTSPNPSRKPRRA 359
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 153/333 (45%), Positives = 204/333 (61%), Gaps = 18/333 (5%)
Query: 11 IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN 70
IIS A AT R+ +E + +EQW ++G+ Y E KRF+IFKDNL ++ N+
Sbjct: 58 IISYDNAHAATSRSDEE--LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNS 115
Query: 71 AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNW 129
+R+Y L LN+FADLT +E+ A G K+ + K + + ++P SV+W
Sbjct: 116 QE--DRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPESVDW 173
Query: 130 IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
++GAV PVK QG C A+ AVEGIN I L+SLSEQ+LVDC T N GC G
Sbjct: 174 RKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTG-YNEGCNG 232
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
G MD AF++II N GI ++ Y Y G+ G CD+ + I +YEDVP DE +L K
Sbjct: 233 GLMDYAFEFIINNGGIDSEEDYPYRGVD-GRCDTYRKNAKVVSIDDYEDVPAYDELALKK 291
Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
AVANQPVSVAI+ Q Y GVF G C T L+HGV AVGYGT+ G YW+++NSWG
Sbjct: 292 AVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA-NGHDYWIVRNSWG 350
Query: 301 QDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
WGEDGY RL+R++ + G+CGIA+ S+P+
Sbjct: 351 PSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 150/341 (43%), Positives = 202/341 (59%), Gaps = 24/341 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGS-IAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M + V ++I A + + E S A+ FE W QYG+TY E + R ++F+
Sbjct: 1 MGSWLWAVSILI------LAVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFE 54
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+N V + N+ A N SYTL LN FADLT EF AS+ GF + S+++ GTP +
Sbjct: 55 ENHAFVTQHNSMA--NASYTLALNAFADLTHHEFKASRLGFS-PGRAQSIRSVGTPV--Q 109
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
VPP+V+W + GAVT VK QG C A+EGIN I LVSLSEQ+LVDC
Sbjct: 110 ELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDC- 168
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N+GC GG MD A++++I+N+GI ++A Y Y GM C+ K + H I Y D+
Sbjct: 169 DRSYNSGCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKP-CNKEKLKKHIVTIDGYTDI 227
Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
PPNDE+ LL+ VA QPVSV I S Q YS GV+ G C + L+H V VGYGT E+G+
Sbjct: 228 PPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGT-EDGV 286
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+W++KNSWG+ WG GY + R+ +G CGI M AS+P
Sbjct: 287 DFWIVKNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYP 327
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QGQC AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFYSGG ++G C +NH VTA+GYGT EEG KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYSGGTYDGSCADRINHAVTAIGYGTDEEGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 154/319 (48%), Positives = 197/319 (61%), Gaps = 23/319 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E + E+F W ++G+ Y ++ + RF ++KDNL + NR+Y+L L KFA
Sbjct: 47 ENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSET----NRTYSLGLTKFA 102
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
DLT +EF TG ++ D S K T F Y S+ P SV+W + GAVT VK QG C
Sbjct: 103 DLTNEEFRRMYTGTRI-DRSRRAKRR-TGFRYADSEAPESVDWRKNGAVTSVKDQGSCGS 160
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
AV +VEGINAI+ VSLSEQ+LVDC + N GC GG MD AF +IIQN GI
Sbjct: 161 CWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDL-EYNQGCNGGLMDYAFDFIIQNGGID 219
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
+ Y Y+G G CD+ K H I YEDVP NDEE+L KAVA QPVSVAI+A
Sbjct: 220 TEKDYPYKGFD-GRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRD 278
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-- 315
Q Y+ GVF+G C T L+HGV AVGYGT E+G+ YW++KNSWG+ WGE GY R++R++
Sbjct: 279 FQLYAQGVFSGECGTDLDHGVLAVGYGT-EDGVDYWIVKNSWGEYWGESGYLRMKRNMKD 337
Query: 316 --DQPQGQCGIAMFASFPV 332
D P G CGI + S+ V
Sbjct: 338 SNDGP-GLCGINIEPSYAV 355
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 149/312 (47%), Positives = 197/312 (63%), Gaps = 18/312 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE+W + +G+ Y+ E RFE+FKDNL ++ N SY L +N+FADLT
Sbjct: 41 LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT---SYWLGVNEFADLT 97
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
QEF G K+ SS + + F YK +P SV+W +KGAVT VK QG C
Sbjct: 98 HQEFKNMYLGLKV--ESSRTRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCW 155
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L SLSEQ+L+DC NNGC+GG MD AF +I+ + G+ +
Sbjct: 156 AFSTVAAVEGINKIVGGNLTSLSEQELIDC-DRPYNNGCHGGLMDYAFSFIVSSGGLHKE 214
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y + + CD+ K E I+ Y+DVP N+E SL+KA+A+QP+SVAI+AS Q
Sbjct: 215 EDYPYLEVES-TCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQ 273
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G C T L+HGVTAVGYG+S+ G+ Y ++KNSWG WGE GY R++R+ +P
Sbjct: 274 FYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPA 332
Query: 320 GQCGIAMFASFP 331
G CGI AS+P
Sbjct: 333 GLCGINKMASYP 344
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 154/316 (48%), Positives = 193/316 (61%), Gaps = 20/316 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE+W A+Y + Y E +RFE+FKDNL ++ N SY L LN+FADLT
Sbjct: 47 LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVT---SYWLGLNEFADLT 103
Query: 90 PQEFIASQTGFKMS-DHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA 145
EF A+ G S+S + F Y + +VP ++W +K AVT VK QGQC
Sbjct: 104 HDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCG 163
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
VAAVEGINAI L SLSEQ+L+DC+T D NNGC GG MD AF YI G+
Sbjct: 164 SCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCST-DGNNGCNGGLMDYAFSYIASTGGL 222
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
+ Y Y M G CD K I+ YEDVP NDE++L+KA+A+QPVSVAI+AS
Sbjct: 223 RTEEAYPY-AMEEGDCDEGKGA-AVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGR 280
Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
QFYSGGVF+G C L+HGVTAVGYGTS+ G Y ++KNSWG WGE GY R++R
Sbjct: 281 HFQFYSGGVFDGPCGEQLDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRGTG 339
Query: 317 QPQGQCGIAMFASFPV 332
+ +G CGI AS+P
Sbjct: 340 KGEGLCGINKMASYPT 355
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 157/365 (43%), Positives = 215/365 (58%), Gaps = 33/365 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRT-FDEGSIA------EKFEQWKAQYGRTYKESAENSK 53
++K L+V L+ S A + FDE +A + +E+W+ + R ++ E +
Sbjct: 4 VSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGR 62
Query: 54 RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD----HSSSL 109
RF FK+N+ + N G+R Y LRLN+F D+ +EF ++ +++D S +
Sbjct: 63 RFGTFKENVRFIHAHNKR--GDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAA 120
Query: 110 KANGTP-FLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRL 160
+A P F+Y S+ PP SV+W ++GAVT VK QG C V AVEGINAI+ L
Sbjct: 121 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSL 180
Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
SLSEQ+L+DC T++N GC GG M++AF++I GIT +A Y Y S G CD +A
Sbjct: 181 ASLSEQELIDCDTDEN--GCQGGLMENAFEFIKSFGGITTEAAYPYR-ASNGTCDGDRAR 237
Query: 221 ---DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLN 275
I ++ VP E++L KAVA+QPVSVA+DA A QFYS GVF G C T L+
Sbjct: 238 RGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLD 297
Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKE 335
HGV AVGYG ++G YW++KNSWG WGE GY R+QR G CGIAM ASFP+ K
Sbjct: 298 HGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KT 355
Query: 336 SAQPS 340
S P+
Sbjct: 356 SPNPA 360
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 153/314 (48%), Positives = 195/314 (62%), Gaps = 18/314 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ FE+W A+Y + Y E +RFE+FKDNL ++ N + SY L LN FADLT
Sbjct: 82 LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEV--TSYWLGLNAFADLT 139
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
EF A+ G + +S + +VP SV+W +KGAVT VK QGQC
Sbjct: 140 HDEFKATYLGL-LPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCGSCWA 198
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQQLVDC+T D NNGC GG MD+AF +I G+ ++
Sbjct: 199 FSTVAAVEGINQIVTGNLTSLSEQQLVDCST-DGNNGCSGGVMDNAFSFIATGAGLRSEE 257
Query: 203 VYSYEGMSTGICDSIKAEDHAAQIT--NYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
Y Y M G CD +A D +T YEDVP NDE++L+KA+A+QPVSVAI+AS
Sbjct: 258 AYPYL-MEEGDCDD-RARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHF 315
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
QFYSGGVF+G C + L+HGV AVGYG+S+ G Y ++KNSWG WGE GY R++R +P
Sbjct: 316 QFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRGTGKP 374
Query: 319 QGQCGIAMFASFPV 332
+G CGI AS+P
Sbjct: 375 EGLCGINKMASYPT 388
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 157/365 (43%), Positives = 215/365 (58%), Gaps = 33/365 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRT-FDEGSIA------EKFEQWKAQYGRTYKESAENSK 53
++K L+V L+ S A + FDE +A + +E+W+ + R ++ E +
Sbjct: 48 VSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGR 106
Query: 54 RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD----HSSSL 109
RF FK+N+ + N G+R Y LRLN+F D+ +EF ++ +++D S +
Sbjct: 107 RFGTFKENVRFIHAHNKR--GDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAA 164
Query: 110 KANGTP-FLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRL 160
+A P F+Y S+ PP SV+W ++GAVT VK QG C V AVEGINAI+ L
Sbjct: 165 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSL 224
Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
SLSEQ+L+DC T++N GC GG M++AF++I GIT +A Y Y S G CD +A
Sbjct: 225 ASLSEQELIDCDTDEN--GCQGGLMENAFEFIKSFGGITTEAAYPYRA-SNGTCDGDRAR 281
Query: 221 ---DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLN 275
I ++ VP E++L KAVA+QPVSVA+DA A QFYS GVF G C T L+
Sbjct: 282 RGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLD 341
Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKE 335
HGV AVGYG ++G YW++KNSWG WGE GY R+QR G CGIAM ASFP+ K
Sbjct: 342 HGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KT 399
Query: 336 SAQPS 340
S P+
Sbjct: 400 SPNPA 404
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 159/330 (48%), Positives = 208/330 (63%), Gaps = 23/330 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ + +E+W++ + T + E RF +FK N++ V N ++ Y L+LNKFA
Sbjct: 33 EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL---DKPYKLKLNKFA 88
Query: 87 DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQG 142
D+T EF K+S H NGT F+Y++ + VP S++W +KGAVT VK QG
Sbjct: 89 DMTNYEFRRIYADSKVSHHRMFRGMSNENGT-FMYENVKNVPSSIDWRKKGAVTDVKDQG 147
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC + AVEGIN IK +LVSLSEQ+LVDC T N GC GG M+ AF++I QN
Sbjct: 148 QCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTG-GNEGCNGGLMEYAFEFIKQN 206
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPPNDEESLLKAVANQPVSVAID 254
GIT ++ Y Y G CD +K ED A I YE+VP N+E +LLKA A QPVSVAID
Sbjct: 207 -GITTESNYPY-AAKDGTCD-LKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAID 263
Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A QFYS GVF+G+C T LNHGV VGYG +++ KYW++KNSWG +WGE GY R+Q
Sbjct: 264 AGGYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRMQ 323
Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
R I +G CGIAM AS+P+ K S P+ +
Sbjct: 324 RGISHKEGLCGIAMEASYPIKKSSTNPTES 353
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 147/321 (45%), Positives = 191/321 (59%), Gaps = 17/321 (5%)
Query: 22 YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLR 81
++T DE + FE W +G++Y E KRF+IFK+NL ++ N + +R + L
Sbjct: 35 FKTDDEATTL--FESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQN--LVEDRGFKLG 90
Query: 82 LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKY 140
LNKFADLT +E+ + TG K D + A + S + +P SV+W E GAV VK
Sbjct: 91 LNKFADLTNEEYRSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKD 150
Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QG C ++AVEGIN I +L++LSEQ+LVDC N GC GG MD AF++II
Sbjct: 151 QGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDC-DRSYNEGCNGGLMDYAFEFII 209
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
N GI D Y Y G G CD + I +YEDVP DE +L KA ANQP+SVAI
Sbjct: 210 NNGGIDTDVDYPYTGRD-GKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAI 268
Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+AS QFY G+F G C L+HGV VGYGT E G YW+++NSWG DWGE+GY R+
Sbjct: 269 EASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGT-ENGKDYWIVRNSWGADWGENGYLRM 327
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
+R I G CGIA+ S+PV
Sbjct: 328 ERGISSKTGICGIAIEPSYPV 348
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/338 (43%), Positives = 206/338 (60%), Gaps = 18/338 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + SQ T R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL---YKSS 121
+E N A GN SY L +N+FAD+T +EF+ TG + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSSTEFKINDLSDD 127
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P +++W E GAVT VK QGQC AV ++EG I L+ SEQ+L+DC TN
Sbjct: 128 DMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN 187
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
N GC GGFM +AF +I +N GI++++ Y Y+G C S + + A QI++Y+ V P
Sbjct: 188 --NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQY-TCRS-QEKTAAVQISSYQ-VVP 242
Query: 235 NDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KYW
Sbjct: 243 EGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKYW 302
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
L+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 LLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 193/311 (62%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +W A +GRTY E +R+++F+DNL ++ N AA G S+ L LN+FADLT E
Sbjct: 44 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ---C----A 145
+ A+ G + K + +P SV+W KGAV VK QG C
Sbjct: 104 YRATYLGARTRPQRER-KLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWAFST 162
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II N GI + Y
Sbjct: 163 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDYP 221
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
Y+G + G CD + I +YEDVP NDE+SL KAVANQPVSVAI+A+ QF YS
Sbjct: 222 YKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLYSS 280
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGVTAVGYGT E G YW++KNSWG WGE GY R++R+I G+CG
Sbjct: 281 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 339
Query: 324 IAMFASFPVSK 334
IA+ S+P+ +
Sbjct: 340 IAVEPSYPLKE 350
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 151/327 (46%), Positives = 196/327 (59%), Gaps = 21/327 (6%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
T R E S+ E+ E W +GR YK+ E RF+ FK+N+ +E FN G + Y
Sbjct: 27 VTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFKTFKENVEFIESFNKN--GTQRYK 84
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSS-SLKANGTPFLYKS-SQVPPSVNWIEKGAVTP 137
L +NK+ADLT +EF S G S S A T F Y S ++VP S++W ++G+VT
Sbjct: 85 LAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFKYDSVTEVPNSMDWRKRGSVTG 144
Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG C A AA+EG I N L+SLSEQQL+DC+T N GC GG M A+
Sbjct: 145 VKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLLDCSTQ--NKGCEGGLMTVAYD 202
Query: 191 YIIQNKG--ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
+++QN G IT + Y YE + +C K E AA N +V P+DE SLLKAV NQP
Sbjct: 203 FLLQNNGGGITTETNYPYE-EAQNVC---KTEQPAAVTINGYEVVPSDESSLLKAVVNQP 258
Query: 249 VSVAIDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE-GIKYWLIKNSWGQDWGED 306
+SV I A+ Y G+++G C + LNH VT +GYGTSEE G KYW++KNSWG DWGE+
Sbjct: 259 ISVGIAANDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTSEEDGTKYWIVKNSWGSDWGEE 318
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
GY R+ RD+ G CGIA ASFP +
Sbjct: 319 GYMRIARDVGVDGGHCGIAKVASFPTA 345
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 149/313 (47%), Positives = 197/313 (62%), Gaps = 18/313 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE+W + +G+ Y+ E RFE+FKDNL ++ N SY L +N+FADLT
Sbjct: 44 LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT---SYWLGVNEFADLT 100
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
QEF G K+ SS + + F YK +P SV+W +KGAVT VK QG C
Sbjct: 101 HQEFKNMYLGLKV--ESSRTRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCW 158
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L SLSEQ+L+DC NNGC+GG MD AF +I+ + G+ +
Sbjct: 159 AFSTVAAVEGINKIVGGNLTSLSEQELIDC-DRPYNNGCHGGLMDYAFSFIVSSGGLHKE 217
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y + + CD+ K E I+ Y+DVP N+E SL+KA+A+QP+SVAI+AS Q
Sbjct: 218 EDYPYLEVES-TCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQ 276
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G C T L+HGVTAVGYG+S+ G+ Y ++KNSWG WGE GY R++R+ +P
Sbjct: 277 FYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPA 335
Query: 320 GQCGIAMFASFPV 332
G CGI AS+P
Sbjct: 336 GLCGINKMASYPT 348
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 146/316 (46%), Positives = 197/316 (62%), Gaps = 18/316 (5%)
Query: 34 FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
++ W A+ G +A E +RF F DNL V+ N AA G Y L +N+FADL
Sbjct: 53 YDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADL 112
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC--- 144
T EF A+ G K + + + G + + ++ +P +V+W EKGAV PVK QGQC
Sbjct: 113 TNDEFRAAYLGVK-AQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSC 171
Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
AV+ VE IN I +V+LSEQ+LV+C TN ++GC GG MDDAF++II+N GI
Sbjct: 172 WAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDT 231
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
+ Y Y+ + G CD ++ I +EDVP NDE+SL KAVA+QPVSVAI+A
Sbjct: 232 EDDYPYKAID-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREF 290
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
Q Y GVF+G C T L+HGV AVGYGT E G YW+++NSWG +WGE GY R++R+I+
Sbjct: 291 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGESGYLRMERNINVT 349
Query: 319 QGQCGIAMFASFPVSK 334
G+CGIAM +S+P K
Sbjct: 350 SGKCGIAMMSSYPTKK 365
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 208/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ E S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y+G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 151/319 (47%), Positives = 192/319 (60%), Gaps = 33/319 (10%)
Query: 34 FEQWKAQYGRTYKE-SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+E W ++G++Y E KRFEIFKDNL ++ N+ G+RSY L LN+FADLT +E
Sbjct: 49 YESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSR--GDRSYKLGLNRFADLTNEE 106
Query: 93 FIASQTGFKM----------SDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
+ ++ G K SD + KA G+ +P S++W EKGAV VK QG
Sbjct: 107 YRSTYLGAKTDARRRIAKTKSDRRYAPKAGGS--------LPDSIDWREKGAVAEVKDQG 158
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C +AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II+N
Sbjct: 159 SCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKN 217
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
GI +A Y Y G G CD + I YEDV P DE +L +AVA QPVSVAI+A
Sbjct: 218 GGIDTEADYPYTG-RYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEA 276
Query: 256 SA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
Q YS G+F G C T L+HGVTAVGYGT E G+ YW++KNSW WGE GY R+QR
Sbjct: 277 GGRDFQLYSSGIFTGSCGTDLDHGVTAVGYGT-ENGVDYWIVKNSWAASWGEKGYLRMQR 335
Query: 314 DIDQPQGQCGIAMFASFPV 332
++ G CGIA+ S+P
Sbjct: 336 NVKDKNGLCGIAIEPSYPT 354
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 145/313 (46%), Positives = 194/313 (61%), Gaps = 18/313 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ + FE W +++G++Y+ E RFE+F+DNL ++ N SY L LN+FADL+
Sbjct: 44 LTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKV---SSYWLGLNEFADLS 100
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G K+ + + F YK + +P SV+W +KGAV VK QG C
Sbjct: 101 HEEFKRKYLGLKI--ELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCW 158
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L +LSEQ+L+DC NNGC GG MD AF +II N G+ +
Sbjct: 159 AFSTVAAVEGINQIVTGNLTALSEQELIDC-DKPFNNGCNGGLMDYAFAFIISNGGLRKE 217
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G C K E I+ Y DVP ++E+S LKA+ANQP+SVAI+AS+ Q
Sbjct: 218 EDYPYV-MEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQ 276
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGG+FNG+C T L+HGV AVGYGTS+ G+ Y +KNSWG WGE GY R++R++ +P+
Sbjct: 277 FYSGGIFNGHCGTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGKPE 335
Query: 320 GQCGIAMFASFPV 332
G CGI AS+P
Sbjct: 336 GICGIYKMASYPT 348
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 156/335 (46%), Positives = 194/335 (57%), Gaps = 36/335 (10%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
S+AE FE+W +++ R Y E +RF++FKDNL ++ N SY L LN+FADL
Sbjct: 54 SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKV---SSYWLGLNEFADL 110
Query: 89 TPQEFIASQTGFKMS--------DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKY 140
T EF A+ G + S D + + +P SV+W KGAVT VK
Sbjct: 111 THDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTGVKN 170
Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC VAAVEGIN I L +LSEQ+L+DC T D NNGC GG MD AF YI
Sbjct: 171 QGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDT-DGNNGCNGGLMDYAFSYIA 229
Query: 194 QNKGITNDAVYSYEGMSTGIC------------DSIKAEDHAAQIT--NYEDVPPNDEES 239
N G+ + Y Y M G C S A D AA +T YEDVP N+E++
Sbjct: 230 HNGGLHTEEAYPYL-MEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQA 288
Query: 240 LLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKN 297
LLKA+A QPVSVAI+AS QFYSGGVF+G C T L+HGV AVGYGT+ +G Y ++KN
Sbjct: 289 LLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAKGHDYIIVKN 348
Query: 298 SWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
SWG WGE GY R++R + QG CGI AS+P
Sbjct: 349 SWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYPT 383
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 202/317 (63%), Gaps = 22/317 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I E FE+W A++ + Y E RFE+FKDNL +++ N SY L LN+FADLT
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT---SYWLGLNEFADLT 202
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA- 145
+EF A+ G ++ + + ++ G+ F Y+ + +P SV+W KGAVT VK QGQC
Sbjct: 203 HEEFKATYLG--LAPPAPARESRGS-FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGS 259
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
VAAVEGINAI L +LSEQ+L+DC+ D NNGC GG MD AF YI + G+
Sbjct: 260 CWAFSTVAAVEGINAIVTGNLTALSEQELIDCSV-DGNNGCNGGLMDYAFSYIASSGGLH 318
Query: 200 NDAVYSYEGMSTGIC-DSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
+ Y Y M G C D K+E A I+ YEDVP ++E++L+KA+A+QPVSVAI+AS
Sbjct: 319 TEEAYPYL-MEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGR 377
Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
QFYSGGVF+G C T L+HGV AVGYG+ + +G Y +++NSWG WGE GY R++R
Sbjct: 378 HFQFYSGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGEKGYIRMKRGT 437
Query: 316 DQPQGQCGIAMFASFPV 332
+ +G CGI AS+P
Sbjct: 438 GKGEGLCGINKMASYPT 454
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 151/329 (45%), Positives = 199/329 (60%), Gaps = 18/329 (5%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
+E + +EQW + + Y E +RF+IFKDNL V+ N ++ +R++ + L +F
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHN--SVPDRTFEVGLTRF 93
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTPVKYQGQC 144
ADLT +EF A KM + S+K +LYK V P V+W GAV VK QG C
Sbjct: 94 ADLTNEEFRAIYLRKKMERNKDSVKTE--RYLYKEGDVLPDEVDWRANGAVVSVKDQGNC 151
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AV AVEGIN I L+SLSEQ+LVDC N GC GG M+ AF++I++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211
Query: 198 ITNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
I D Y Y G+C++ K + I YEDVP +DE+SL KAVA+QPVSVAI+AS
Sbjct: 212 IETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEAS 271
Query: 257 --ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
A Q Y GV G C L+HGV VGYG++ G YW+I+NSWG +WG+ GY +LQR+
Sbjct: 272 SQAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYVKLQRN 330
Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSAD 343
ID P G+CGIAM S+P +S+ PSS D
Sbjct: 331 IDDPFGKCGIAMMPSYPT--KSSFPSSFD 357
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 200/335 (59%), Gaps = 26/335 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E ++ + +E+W++ + R + AE +RF FK N + N G+ Y L LN+F
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKR--GDHPYRLHLNRFG 95
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTP-FLYKS---SQVPPSVNWIEKGAVTPVKYQG 142
D+ EF A+ G D S K P F+Y + S +PPSV+W +KGAVT VK QG
Sbjct: 96 DMDQAEFRATFVGDLRRDTPS--KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQG 153
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
+C V +VEGINAI+ LVSLSEQ+L+DC T DN+ GC GG MD+AF+YI N
Sbjct: 154 KCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNN 212
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHA---AQITNYEDVPPNDEESLLKAVANQPVSVA 252
G+ +A Y Y + G C+ +A ++ I ++DVP N EE L +AVANQPVSVA
Sbjct: 213 GGLITEAAYPYRA-ARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVA 271
Query: 253 IDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
++AS A FYS GVF G C T L+HGV VGYG +E+G YW +KNSWG WGE GY R
Sbjct: 272 VEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIR 331
Query: 311 LQRDIDQPQGQCGIAMFASFPV---SKESAQPSSA 342
+++D G CGIAM AS+PV SK P A
Sbjct: 332 VEKDSGASGGLCGIAMEASYPVKTYSKPKPTPRRA 366
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 203/330 (61%), Gaps = 21/330 (6%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKES---AENSKRFEIFKDNLVAVERFNNAAIG 74
+++++RT DE + +E+W + G+ + + E +RF++FKDNL ++ N+
Sbjct: 37 TKSSWRTDDE--VMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSE--- 91
Query: 75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKG 133
NRSY + LN+FADLT +E+ + G + + L + +L + +P SV+W ++G
Sbjct: 92 NRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEG 151
Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AV VK QG C +AAVEGIN I L+SLSEQ+LVDC N GC GG MD
Sbjct: 152 AVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDC-DRSYNEGCNGGLMD 210
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
AF++II N GI ++ Y Y G CD+ + I NYEDVP NDE++L KAVAN
Sbjct: 211 YAFQFIINNGGIDSEEDYPYLARD-GTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVAN 269
Query: 247 QPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
QPVSVAI+A QFY G+F G C T L+HGV AVGYGT E G YW+++NSWG+ WG
Sbjct: 270 QPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWG 328
Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
E GY R++R+I G+CGIA+ S+P+ K
Sbjct: 329 ESGYIRMERNIATATGKCGIAIEPSYPIKK 358
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 148/340 (43%), Positives = 207/340 (60%), Gaps = 20/340 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL----YK 119
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLS 127
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC
Sbjct: 128 DDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 187
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
TN N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 TN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-V 242
Query: 233 PPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT EEG K
Sbjct: 243 VPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGNCADRINHAVTAIGYGTDEEGQK 302
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG WGE+GY ++ RD P G C IA +S+P
Sbjct: 303 YWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 151/338 (44%), Positives = 203/338 (60%), Gaps = 22/338 (6%)
Query: 9 VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
V + + + S +Y E + + +W A++G TY E +RFE F+DNL +++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 69 NNAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKSS---QV 123
N AA G S+ L LN+FADLT +E+ ++ G + D L A Y+++ ++
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSAR-----YQAADNDEL 132
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P SV+W +KGAV VK QG C A+AAVEGIN I ++ LSEQ+LVDC T+
Sbjct: 133 PESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-Y 191
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG MD AF++II N GI ++ Y Y+ CD+ K I YEDVP N
Sbjct: 192 NQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNR-CDANKKNAKVVTIDGYEDVPVNS 250
Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E+SL KAVANQP+SVAI+A A Q Y G+F G C T L+HGV AVGYGT E G YWL
Sbjct: 251 EKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWL 309
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
++NSWG WGEDGY R++R+I G+CGIA+ S+P
Sbjct: 310 VRNSWGSVWGEDGYIRMERNIKASSGKCGIAVEPSYPT 347
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 208/340 (61%), Gaps = 20/340 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL----YK 119
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLS 127
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+P +++W E GAVT VK+QGQC AV ++EG I +L+ SEQ+L+DC
Sbjct: 128 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT 187
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
TN N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 TN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-V 242
Query: 233 PPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G K
Sbjct: 243 VPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQK 302
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 197/316 (62%), Gaps = 18/316 (5%)
Query: 34 FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
++ W A++G +A E +RF F DNL V+ N AA G + L +N+FADL
Sbjct: 50 YDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 109
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC--- 144
T EF A+ G K + + G + + ++ +P +V+W EKGAV PVK QGQC
Sbjct: 110 TNDEFRAAYLGVK-GQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSC 168
Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
A++ VE IN I +V+LSEQ+LV+C TN ++GC GG MDDAF++II+N GI
Sbjct: 169 WAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDT 228
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
+ Y Y+ + G CD ++ I +EDVP NDE+SL KAVA+QPVSVAI+A
Sbjct: 229 EDDYPYKAID-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREF 287
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
Q Y GVF+G C T L+HGV AVGYGT E G YW+++NSWG +WGE GY R++R+I+
Sbjct: 288 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINVT 346
Query: 319 QGQCGIAMFASFPVSK 334
G+CGIAM +S+P K
Sbjct: 347 SGKCGIAMMSSYPTKK 362
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + SQ R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 144/312 (46%), Positives = 190/312 (60%), Gaps = 19/312 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+ +W A++G+ Y E +RFEIFKDNL V+ N+ NRSY + LN+FADLT +E+
Sbjct: 47 YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE---NRSYKVGLNRFADLTNEEY 103
Query: 94 IASQTGFKMSDHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
+ G K +K+ Y S +P SV+W E GAV P+K QG C
Sbjct: 104 RSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWAF 163
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
VAAVEG+N I ++ LSEQ+LVDC + GC GG MD AF++II N GI +
Sbjct: 164 STVAAVEGVNQIATGEMIQLSEQELVDCDRT-YDAGCNGGLMDYAFEFIINNGGIDTEED 222
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFY 261
Y Y G+ G CD + I +YEDVPP DE +L KAVA+QPVSVAI+AS A Q Y
Sbjct: 223 YPYRGVD-GTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAFQLY 281
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD-IDQPQG 320
GVF G C L+HGV VGYGT + G +W+++NSWG WGE+GY R++R+ +D G
Sbjct: 282 LSGVFTGECGRALDHGVVVVGYGT-DNGADHWIVRNSWGTSWGENGYIRMERNVVDNFGG 340
Query: 321 QCGIAMFASFPV 332
+CGIAM AS+P+
Sbjct: 341 KCGIAMQASYPI 352
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 205/343 (59%), Gaps = 27/343 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + SQ R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
+E N A GN SY L +N+FAD+T +EF+A TG + + S+ K N
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTEFKIND--- 124
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+P +++W E GAVT VK QGQC AV ++EG I L+ SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC TN N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI+NY
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QGKTAAVQISNY 239
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
+ V P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+
Sbjct: 240 Q-VVPEGETSLLQAVTKQPVSIGIAASHDLQFYAGGTYDGSCANRINHAVTAIGYGTDEK 298
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G KYWL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 197/323 (60%), Gaps = 19/323 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTY----KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
E + ++ W A++GR Y + E +RF +F DNL V+ N A G R + L +
Sbjct: 50 EPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERA-GARGFRLGM 108
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--QVPPSVNWIEKGAVTPVKY 140
N+FADLT EF A+ G M + G + + + ++P SV+W EKGAV PVK
Sbjct: 109 NQFADLTNDEFRAAYLG-AMVPAARRGAVVGERYRHDGAAEELPESVDWREKGAVAPVKN 167
Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC AV++VE +N I +V+LSEQ+LV+C+T+ N+GC GG MD AF +II
Sbjct: 168 QGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFII 227
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
+N GI + Y Y + G CD + I +EDVP NDE+SL KAVA+QPVSVAI
Sbjct: 228 KNGGIDTEDDYPYRAVD-GKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 286
Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+A Q Y GVF+G C T L+HGV AVGYG +E G YW+++NSWG WGE GY R+
Sbjct: 287 EAGGREFQLYKSGVFSGSCTTNLDHGVVAVGYG-AENGKDYWIVRNSWGPKWGEAGYIRM 345
Query: 312 QRDIDQPQGQCGIAMFASFPVSK 334
+R+++ G+CGIAM AS+P K
Sbjct: 346 ERNVNASTGKCGIAMMASYPTKK 368
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 150/303 (49%), Positives = 185/303 (61%), Gaps = 29/303 (9%)
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------S 106
F +FK N+ + FN + Y LRLN+F D+T EF G +++ H
Sbjct: 70 FNVFKANVRLIHEFNRR---DEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQG 126
Query: 107 SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKIN 158
SS A+ F+Y ++ VP SV+W +KGAVT VK QGQC +AAVEGINAIK
Sbjct: 127 SSASAS---FMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTK 183
Query: 159 RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK 218
L SLSEQQLVDC T N GC GG MD AF+YI ++ G+ + Y Y C K
Sbjct: 184 NLTSLSEQQLVDCDTK-ANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQAS-CK--K 239
Query: 219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNH 276
+ I YEDVP NDE +L KAVA+QPVSVAI+AS QFYS GVF+G C T L+H
Sbjct: 240 SPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDH 299
Query: 277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKES 336
GV AVGYG + +G KYWL+KNSWG +WGE GY R+ RD+ +G CGIAM AS+PV K S
Sbjct: 300 GVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV-KTS 358
Query: 337 AQP 339
P
Sbjct: 359 PNP 361
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 208/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F+
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 151/329 (45%), Positives = 198/329 (60%), Gaps = 18/329 (5%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
+E + +EQW + + Y E +RF+IFKDNL V+ N ++ +R++ + L +F
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHN--SVPDRTFEVGLTRF 93
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTPVKYQGQC 144
ADLT +EF A KM S+K +LYK V P V+W GAV VK QG C
Sbjct: 94 ADLTNEEFRAIYLRKKMERTKDSVKTE--RYLYKEGDVLPDEVDWRANGAVVSVKDQGNC 151
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AV AVEGIN I L+SLSEQ+LVDC N GC GG M+ AF++I++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211
Query: 198 ITNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
I D Y Y G+C++ K + I YEDVP +DE+SL KAVA+QPVSVAI+AS
Sbjct: 212 IETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEAS 271
Query: 257 --ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
A Q Y GV G C L+HGV VGYG++ G YW+I+NSWG +WG+ GY +LQR+
Sbjct: 272 SQAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYVKLQRN 330
Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSAD 343
ID P G+CGIAM S+P +S+ PSS D
Sbjct: 331 IDDPFGKCGIAMMPSYPT--KSSFPSSFD 357
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 201/322 (62%), Gaps = 25/322 (7%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
DE + ++++W AQY R YK+ AE + RF++FK N ++R N A G + Y L N+F
Sbjct: 51 DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSN--AGGKKKYVLGTNQF 108
Query: 86 ADLTPQEFIASQTGFK----MSDHSSSLKANGTPFL-YKSSQVPPSVNWIEKGAVTPVKY 140
ADLT +EF A TG + + + + A G+ + + V+W ++GAVTPVK
Sbjct: 109 ADLTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKN 168
Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC AV A+EG+ I LVSLSEQQ++DC +D N GC GG+MD+AF+Y+I
Sbjct: 169 QGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVI 228
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
N G+T + Y Y + G C +++ AA I+ ++D+P DE +L AVANQPVSV +
Sbjct: 229 NNGGVTTEDAYPYSAVQ-GTCQNVQP---AATISGFQDLPSGDENALANAVANQPVSVGV 284
Query: 254 D--ASALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
D +S QFY GG+++G C T +NH VTA+GYG ++G +YW++KNSWG WGE+G+ +
Sbjct: 285 DGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQ 344
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
LQ + G CGI+ AS+P
Sbjct: 345 LQMGV----GACGISTMASYPT 362
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/316 (46%), Positives = 192/316 (60%), Gaps = 18/316 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ FE W ++ + Y+ E RFEIF DNL ++ N +Y L LN+FADLT
Sbjct: 45 VIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKV---SNYWLGLNEFADLT 101
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF GFK + + + F Y+ +P SV+W +KGAV PVK QGQC
Sbjct: 102 HEEFKHKFLGFK-GELAERKDESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCW 160
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L LSEQ+L+DC T NNGC GG MD AF Y++++ G+ +
Sbjct: 161 AFSTVAAVEGINQIVTGNLTMLSEQELIDCDTT-FNNGCNGGLMDYAFAYVMRS-GLHKE 218
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y MS G CD K I+ Y DVP NDE S LKA+ANQP+SVAI+AS Q
Sbjct: 219 EEYPYI-MSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQ 277
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G+C T L+HGV AVGYGT++ G+ Y +++NSWG WGE GY R++R +P
Sbjct: 278 FYSGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH 336
Query: 320 GQCGIAMFASFPVSKE 335
G CG+ M AS+P ++
Sbjct: 337 GMCGLYMMASYPTKQK 352
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 208/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F+
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 190/312 (60%), Gaps = 20/312 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E W ++G++Y E +RFEIFKDNL +E N NR+Y + LN+FADLT +E+
Sbjct: 54 YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV---NRTYKVGLNRFADLTNEEY 110
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
+ G + + L+A+ Y +P SV+W EKGAV PVK QG C
Sbjct: 111 RSRYLG-RRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAF 169
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
+AAVEGIN I L+SLSEQ+LVDC N GC GG MD AF++II N GI ++
Sbjct: 170 STIAAVEGINQIATGDLISLSEQELVDC-DKSYNQGCNGGLMDYAFEFIINNGGIDSEED 228
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFY 261
Y Y T CD + I YEDVP NDE SL KAVANQPVSVAI+A A Q Y
Sbjct: 229 YPYRAADT-TCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLY 287
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-G 320
GVF G C T L+HGV AVGYGT E + YW+++NSWG +WGE GY +L+R++ + G
Sbjct: 288 QSGVFTGQCGTQLDHGVVAVGYGT-ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETG 346
Query: 321 QCGIAMFASFPV 332
+CGIA+ S+P+
Sbjct: 347 KCGIAIEPSYPI 358
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 208/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F+
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 196/311 (63%), Gaps = 17/311 (5%)
Query: 34 FEQWKAQYGRTYKES--AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
++ W A+ G + E+ +RF +F DNL V+ N A + L +N+FADLT +
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------ 144
EF A+ G K+++ S +A G + + ++P SV+W EKGAV PVK QGQC
Sbjct: 112 EFRATFLGAKVAERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAF 168
Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
AV+ VE IN + +++LSEQ+LV+C+TN N+GC GG MDDAF +II+N GI +
Sbjct: 169 SAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDD 228
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
Y Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A Q Y
Sbjct: 229 YPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
GVF+G C T L+HGV AVGYGT + G YW+++NSWG WGE GY R++R+I+ G+
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346
Query: 322 CGIAMFASFPV 332
CGIAM AS+P
Sbjct: 347 CGIAMMASYPT 357
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 158/296 (53%), Positives = 199/296 (67%), Gaps = 21/296 (7%)
Query: 49 AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--S 106
+E KR IFK+NL +E FNNA GN+SY L LN+++DLT EF+AS TG K+S S
Sbjct: 77 SELEKRKRIFKNNLEYIENFNNA--GNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSS 134
Query: 107 SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINR 159
S +++ PF + VP + +W ++GAVT VK QG C VAAVEG I
Sbjct: 135 SKMRSAAVPFNL-NDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGE 193
Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMST-GICDSI 217
L+SLSEQQLVDC ++ N+GC+GG MD AFKYIIQ KGI ++A Y Y EG T + D +
Sbjct: 194 LISLSEQQLVDC--DERNSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQM 250
Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID-ASALQFYSGGVFNGYCETFLNH 276
K E AQITN+ DVP NDE+ LL+AVA QPVSV I+ Q Y G V++G C +NH
Sbjct: 251 KFE---AQITNFIDVPANDEQQLLQAVAQQPVSVGIEVGDEFQHYMGDVYSGTCGQSMNH 307
Query: 277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
VTAVGYG SE+G KYWLIKNSWG+ WGE+GY +L R+ +P GQCGIA AS+P+
Sbjct: 308 AVTAVGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/360 (41%), Positives = 209/360 (58%), Gaps = 39/360 (10%)
Query: 1 MAKYFLIVVLIISGSCASQATY-----------RTF--DEGSIAEKFEQWKAQYGRTYKE 47
M L++ +I C QA RT DE + ++++W AQY R YK+
Sbjct: 13 MTTLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKD 72
Query: 48 SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS 107
AE + RF++FK N ++R N A G + Y L N+FADLT +EF A TG +
Sbjct: 73 DAEKAHRFQVFKANAEFIDRSN--AGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVP 130
Query: 108 SLKANGTPFLYKSSQVPP-----SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAI 155
S A P +K V+W ++GAVTPVK QGQC AV A+EG+ I
Sbjct: 131 S-GAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMI 189
Query: 156 KINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD 215
LVSLSEQQ++DC +D N GC GG+MD+AF+Y++ N G+T + Y Y + G C
Sbjct: 190 TTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQ-GTCQ 248
Query: 216 SIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID--ASALQFYSGGVFNG-YCET 272
+++ AA I+ ++D+P DE +L AVANQPVSV +D +S QFY GG+++G C T
Sbjct: 249 NVQP---AATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGT 305
Query: 273 FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+NH VTA+GYG ++G +YW++KNSWG WGE+G+ +LQ + G CGI+ AS+P
Sbjct: 306 DMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGV----GACGISTMASYPT 361
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/316 (46%), Positives = 192/316 (60%), Gaps = 18/316 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ FE W ++ + Y+ E RFEIF DNL ++ N +Y L LN+FADLT
Sbjct: 45 VIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKV---SNYWLGLNEFADLT 101
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF GFK + + + F Y+ +P SV+W +KGAV PVK QGQC
Sbjct: 102 HEEFKHKFLGFK-GELAERKDESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCW 160
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L LSEQ+L+DC T NNGC GG MD AF Y++++ G+ +
Sbjct: 161 AFSTVAAVEGINQIVTGNLTMLSEQELIDCDTT-FNNGCNGGLMDYAFAYVMRS-GLHKE 218
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y MS G CD K I+ Y DVP NDE S LKA+ANQP+SVAI+AS Q
Sbjct: 219 EEYPYI-MSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQ 277
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G+C T L+HGV AVGYGT++ G+ Y +++NSWG WGE GY R++R +P
Sbjct: 278 FYSGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH 336
Query: 320 GQCGIAMFASFPVSKE 335
G CG+ M AS+P ++
Sbjct: 337 GMCGLYMMASYPTKQK 352
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 191/314 (60%), Gaps = 17/314 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W + + + Y+ E RFE+FKDNL ++ N +SY L LN+FADL+
Sbjct: 47 LIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKV---KSYWLGLNEFADLS 103
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G K + + F Y+ + VP SV+W +KGAV VK QG C
Sbjct: 104 HEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L +LSEQ+L+DC T NNGC GG MD AF+YI++N G+ +
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G C+ K E I ++DVP NDE+SLLKA+A+QP+SVAIDAS Q
Sbjct: 223 EDYPY-SMEEGTCEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQ 281
Query: 260 FYSG-GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
FYSG VF+G C L+HGV AVGYG+S+ G Y ++KNSWG WGE GY RL+R+ +P
Sbjct: 282 FYSGVSVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKP 340
Query: 319 QGQCGIAMFASFPV 332
+G CGI ASFP
Sbjct: 341 EGLCGINKMASFPT 354
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 151/323 (46%), Positives = 198/323 (61%), Gaps = 24/323 (7%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
RT DE + +E W ++G++Y E KRF+IFKDNL ++ N + R+Y + L
Sbjct: 37 RTDDE--VMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAES---RTYKVGL 91
Query: 83 NKFADLTPQEF----IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPV 138
N+FADLT E+ + ++TG + + P +S +P SV+W EKGAV V
Sbjct: 92 NRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGES--LPDSVDWREKGAVVGV 149
Query: 139 KYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QG C +AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++
Sbjct: 150 KDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEF 208
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
II+N GI + Y Y G CD + I +YEDVP N+E++L KAVANQPVSV
Sbjct: 209 IIKNGGIDTEEDYPYNARD-GRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSV 267
Query: 252 AIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
AI+AS A QFY GVF G C T L+HGVTAVGYGT E + YW++KNSWG WGE GY
Sbjct: 268 AIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYGT-ENSVDYWIVKNSWGSSWGESGYI 326
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
R++R+ G+CGIA+ S+P+
Sbjct: 327 RMERNT-GATGKCGIAVEPSYPI 348
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 149/323 (46%), Positives = 199/323 (61%), Gaps = 19/323 (5%)
Query: 22 YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLR 81
+R+ DE + ++ W Q+G+ Y E KRFEIFKDNL ++ N+ N +Y L
Sbjct: 36 WRSDDE--VMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSN--NNTTYKLG 91
Query: 82 LNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKS-SQVPPSVNWIEKGAVTPV 138
LNKFADLT QE+ A G + +K+ + + +++ +P SVNW + GAV+ V
Sbjct: 92 LNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRV 151
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QG C A+AAVEGIN I L+SLSEQ+LVDC + GC GG MD AF++
Sbjct: 152 KDQGSCGSCWAFSAIAAVEGINKIVSGELISLSEQELVDC-DRSYDAGCNGGLMDYAFQF 210
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
II N GI + Y Y G + CD K I YEDVP N+E +L KAVA+QPVS+
Sbjct: 211 IIDNGGIDTEKDYPYLGFNNQ-CDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSI 268
Query: 252 AIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
AI+A A Q Y GVFNG C L+HGV AVGYG+ + G YW+++NSWG +WGE+GY
Sbjct: 269 AIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYI 328
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
R++R+I+ G+CGIAM AS+PV
Sbjct: 329 RMERNINANTGKCGIAMEASYPV 351
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/343 (42%), Positives = 207/343 (60%), Gaps = 27/343 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
+E N A GN SY L +N+FAD+T QEF+A TG + + S+ LK N
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKIND--- 124
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+P +++WIE GAVT VK+QG+C AV ++EG I L+ SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC TN N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSY 239
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
+ V P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+
Sbjct: 240 Q-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G KYWL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 200/322 (62%), Gaps = 22/322 (6%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
+ E ++ + +QW A++GRTY++ AE + RF++FK N V+ N A +SY L LN+
Sbjct: 42 YGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNE 101
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
FAD+T EF+A TG + + A G L + +V+W +KGAVT +K Q
Sbjct: 102 FADMTNDEFMAMYTGLRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQ 161
Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC AVAAVEGI+ I LVSLSEQQ++DC T D NNGC GG++D+AF+YI+
Sbjct: 162 GQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDT-DGNNGCNGGYIDNAFQYIVG 220
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N G+ + Y Y + +C S++ A I+ Y+DVP DE +L AVANQPVSVAID
Sbjct: 221 NGGLGTEDAYPYTA-AQAMCQSVQP---VAAISGYQDVPSGDEAALAAAVANQPVSVAID 276
Query: 255 ASALQFYSGGVFNGY-CET--FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
A Q Y GGV C T LNH VTAVGYGT+E+G YWL+KN WGQ+WGE GY RL
Sbjct: 277 AHNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRL 336
Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
+R + CG+A AS+PV+
Sbjct: 337 ERGAN----ACGVAQQASYPVA 354
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 149/316 (47%), Positives = 196/316 (62%), Gaps = 31/316 (9%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S EK EQW +++ R Y + +E + RFEIFK NL VE FN N +Y L +NKF+
Sbjct: 11 EASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNT--NNTYKLDVNKFS 68
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC- 144
DLT +EF A G + + + F Y++ S+ S++W +GAVTPVK QGQC
Sbjct: 69 DLTDEEFQARYMGL-VPEGMTGDSQKTVSFRYENVSETGESMDWRLEGAVTPVKDQGQCG 127
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
AVAAVEG+ I LVSLSEQQLVDC+T +NN GC GG A+ YI +N+GI
Sbjct: 128 CCWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGI 187
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL 258
T++ Y Y+ + C S + AA I+ YE VP +DEE+LLKAV+
Sbjct: 188 TSEENYPYQAVQQ-TCKS--TDPAAATISGYEAVPKDDEEALLKAVSQH----------- 233
Query: 259 QFYSGGVF-NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
G+F + YC T +H VT VGYGTSEEGIKYWL+KNSWG+ WGE+GY R++RD+D+
Sbjct: 234 -----GIFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDE 288
Query: 318 PQGQCGIAMFASFPVS 333
PQG CG+A A +PV+
Sbjct: 289 PQGMCGLAHRAYYPVA 304
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/343 (42%), Positives = 207/343 (60%), Gaps = 27/343 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
+E N A GN SY L +N+FAD+T QEF+A TG + + S+ LK N
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKIND--- 124
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC TN N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSY 239
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
+ V P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+
Sbjct: 240 K-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G KYWL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDENGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 196/322 (60%), Gaps = 23/322 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E ++ + +E+W++ + R + AE +RF FK N + N G+ Y L LN+F
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKR--GDHPYRLHLNRFG 95
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTP-FLYKS---SQVPPSVNWIEKGAVTPVKYQG 142
D+ EF A+ G D + K P F+Y + S +PPSV+W +KGAVT VK QG
Sbjct: 96 DMDQAEFRATFVGDLRRD--TPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQG 153
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
+C V +VEGINAI+ LVSLSEQ+L+DC T DN+ GC GG MD+AF+YI N
Sbjct: 154 KCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNN 212
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHA---AQITNYEDVPPNDEESLLKAVANQPVSVA 252
G+ +A Y Y + G C+ +A ++ I ++DVP N EE L +AVANQPVSVA
Sbjct: 213 GGLITEAAYPYRA-ARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVA 271
Query: 253 IDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
++AS A FYS GVF G C T L+HGV VGYG +E+G YW +KNSWG WGE GY R
Sbjct: 272 VEASGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIR 331
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+++D G CGIAM AS+PV
Sbjct: 332 VEKDSGASGGLCGIAMEASYPV 353
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 198/317 (62%), Gaps = 18/317 (5%)
Query: 34 FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
++ W A++G +A + +RF F DNL V+ N AA G + L +N+FADL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 89 TPQEFIASQTGFK-MSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC-- 144
T EF A+ G K ++ + + + G + + ++ +P +V+W EKGAV PVK QGQC
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
AV+ VE IN I +V+LSEQ+LV+C N ++GC GG MDDAF++II+N GI
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
+ Y Y+ + G CD ++ I +EDVP NDE+SL KAVA+ PVSVAI+A
Sbjct: 232 TEDDYPYKAVD-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGRE 290
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Q Y GVF+G C T L+HGV AVGYGT E G YW+++NSWG +WGE GY R++R+I+
Sbjct: 291 FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINV 349
Query: 318 PQGQCGIAMFASFPVSK 334
G+CGIAM +S+P K
Sbjct: 350 TSGKCGIAMMSSYPTKK 366
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 198/317 (62%), Gaps = 18/317 (5%)
Query: 34 FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
++ W A++G +A + +RF F DNL V+ N AA G + L +N+FADL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 89 TPQEFIASQTGFK-MSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC-- 144
T EF A+ G K ++ + + + G + + ++ +P +V+W EKGAV PVK QGQC
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
AV+ VE IN I +V+LSEQ+LV+C N ++GC GG MDDAF++II+N GI
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
+ Y Y+ + G CD ++ I +EDVP NDE+SL KAVA+ PVSVAI+A
Sbjct: 232 TEDDYPYKAVD-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGRE 290
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Q Y GVF+G C T L+HGV AVGYGT E G YW+++NSWG +WGE GY R++R+I+
Sbjct: 291 FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINV 349
Query: 318 PQGQCGIAMFASFPVSK 334
G+CGIAM +S+P K
Sbjct: 350 TSGKCGIAMMSSYPTKK 366
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 185/313 (59%), Gaps = 17/313 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
IA FE W Q+G+TY E R ++F+DN V N+ GN SYTL LN FADLT
Sbjct: 26 IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQ--GNSSYTLSLNAFADLT 83
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS--SQVPPSVNWIEKGAVTPVKYQGQC--- 144
EF AS+ G S S+SL + + + VP SV+W + GAVT VK QG C
Sbjct: 84 HHEFKASRLGLS-SAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGAC 142
Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
A A+EGIN I LVSLSEQ+LVDC NNGC GG MD AF+++I N GI
Sbjct: 143 WSFSATGAIEGINKIVTGSLVSLSEQELVDC-DKSYNNGCEGGIMDYAFQFVIDNHGIDT 201
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--AL 258
+ Y Y+G C+ K + H I Y DVP N+E+ LLKAVANQPVSV I S A
Sbjct: 202 EEDYPYQGRDRS-CNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAF 260
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
Q YS G+F G C T L+H V VGYG SE G+ YW++KNSWG WG DGY +QR+
Sbjct: 261 QLYSKGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSS 319
Query: 319 QGQCGIAMFASFP 331
+G CGI M AS+P
Sbjct: 320 RGLCGINMLASYP 332
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 205/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK QGQC AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGEDG+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 205/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI V + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG-TPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + S T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 146/300 (48%), Positives = 182/300 (60%), Gaps = 20/300 (6%)
Query: 49 AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
E +RF +F DNL V+ N A + + L +N+FADLT EF A+ G +
Sbjct: 85 GEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRH 144
Query: 109 LKANGTPFLYKSSQV---PPSVNWIEKGAV-TPVKYQGQC-------AVAAVEGINAIKI 157
+ +Y+ V P SV+W +KGAV +PVK QGQC AVAAVEGIN I
Sbjct: 145 VGE-----MYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVT 199
Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
LVSLSEQ+LV+CA N N+GC GG MDDAF +I +N G+ + Y Y M G CD
Sbjct: 200 GELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMD-GKCDLA 258
Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLN 275
K I +EDVP NDE SL KAVA+QPVSVAIDA Q Y GVF G C T L+
Sbjct: 259 KKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLD 318
Query: 276 HGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
HGV AVGYGT + G YW ++NSWG DWGE+GY R++R++ G+CGIAM AS+P+ K
Sbjct: 319 HGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 378
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 150/329 (45%), Positives = 199/329 (60%), Gaps = 22/329 (6%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNR 76
S +Y E + + +W A++G TY E +RFE F+DNL +++ N AA G
Sbjct: 27 SIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVH 86
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKSS---QVPPSVNWIEK 132
S+ L LN+FADLT +E+ ++ G + D L A Y+++ ++P SV+W +K
Sbjct: 87 SFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSAR-----YQAADNDELPESVDWRKK 141
Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
GAV VK QG C A+AAVEGIN I ++ LSEQ+LVDC T+ N GC GG M
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNQGCNGGLM 200
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
D AF++II N GI ++ Y Y+ CD+ K I YEDVP N E+SL KAVA
Sbjct: 201 DYAFEFIINNGGIDSEEDYPYKERDN-RCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVA 259
Query: 246 NQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
NQP+SVAI+A A Q Y G+F G C T L+HGV AVGYGT E G YWL++NSWG W
Sbjct: 260 NQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVW 318
Query: 304 GEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GEDGY R++R+I G+CGIA+ S+P
Sbjct: 319 GEDGYIRMERNIKASSGKCGIAVEPSYPT 347
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDENGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ E S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI++++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ E S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI++++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 207/349 (59%), Gaps = 22/349 (6%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
L + + S + +++R+ +E + ++ W A++G+ Y E KRFEIFKDNL
Sbjct: 19 LLFLFFVASSAADLSSSWRSEEE--VMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKF 76
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK-ANGTPF--LYKSS 121
++ N NR+Y + LN+FADLT +E+ A G + K N +P +
Sbjct: 77 IDEHNAQ---NRTYKVGLNRFADLTNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGE 133
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W E GAV PVK Q C VAAVEGIN I L+SLSEQ+LVDC T
Sbjct: 134 VLPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDT- 192
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ + GC GG MD AF +II+N G+ + Y Y G G C+ I YEDVPP
Sbjct: 193 EYDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFD-GECNLSGKSSKVVSIDGYEDVPP 251
Query: 235 NDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
DE++L KAVA+QPVSVA++A ALQ Y G+F G C T L+HG+ AVGYGT E G Y
Sbjct: 252 FDEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT-ENGTDY 310
Query: 293 WLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPS 340
W+++NSWG WGE+GY R++R++ D G+CGIAM AS+P+ K PS
Sbjct: 311 WIVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI-KNGENPS 358
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 195/341 (57%), Gaps = 47/341 (13%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
++ W A+ GR+Y E +RF +F DNL V+ N A + + L +N+FADLT EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------- 144
A+ G K + S +A G + + ++P SV+W EKGAV PVK QGQC
Sbjct: 109 RATFLGAKFVERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVWN 165
Query: 145 -------------------------------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
AV+ VE IN + +++LSEQ+LV+C+T
Sbjct: 166 SMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECST 225
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N+GC GG MDDAF +II+N GI + Y Y+ + G CD + I +EDVP
Sbjct: 226 NGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVD-GKCDINRENAKVVSIDGFEDVP 284
Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
NDE+SL KAVA+QPVSVAI+A Q Y GVF+G C T L+HGV AVGYGT + G
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT-DNGKD 343
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+++NSWG WGE GY R++R+I+ G+CGIAM AS+P
Sbjct: 344 YWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYPT 384
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 147/343 (42%), Positives = 206/343 (60%), Gaps = 27/343 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
+E N A GN SY L +N+FAD+T QEF+A TG + + S+ K N
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
Y +P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+
Sbjct: 128 DY----MPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC TN N GC GG M +AF +II+N GI+ ++ Y Y G C S + + A QI++Y
Sbjct: 184 DCTTN--NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-REKTAAVQISSY 239
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
+ V P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT EE
Sbjct: 240 K-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGNCADQINHAVTAIGYGTDEE 298
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G KYWL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 146/316 (46%), Positives = 192/316 (60%), Gaps = 18/316 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ FE W A++ + Y+ E RFEIF DNL ++ N +Y L LN+FADLT
Sbjct: 45 VIHLFESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDDTNKKV---SNYWLGLNEFADLT 101
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G K + + F Y+ +P SV+W +KGAV PVK QGQC
Sbjct: 102 HEEFKNKFLGLK-GELPERKDESIEEFSYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCW 160
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L LSEQ+L+DC T NNGC GG MD AF Y++++ G+ +
Sbjct: 161 AFSTVAAVEGINQIVTGNLTMLSEQELIDCDTT-FNNGCNGGLMDYAFAYVMRS-GLHKE 218
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y MS G CD K I+ Y DVP N+E+S LKA+ANQP+SVAI+AS Q
Sbjct: 219 EEYPYI-MSEGTCDEKKDVSETVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQ 277
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G+C T L+HGV AVGYGT++ G+ Y +++NSWG WGE GY R++R +P
Sbjct: 278 FYSGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPH 336
Query: 320 GQCGIAMFASFPVSKE 335
G CG+ M AS+P ++
Sbjct: 337 GMCGLYMMASYPTKQK 352
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 145/347 (41%), Positives = 209/347 (60%), Gaps = 24/347 (6%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEK--------FEQWKAQYGRTYKESAENSKRFE 56
L++++ + S AS + ++DE I + +E W ++G++Y E KRF+
Sbjct: 12 ILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNALGEKDKRFQ 71
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP- 115
IFKDNL ++ N ++ N+SY L L KFADLT +E+ + G K S L N +
Sbjct: 72 IFKDNLRYIDEQN--SVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSDR 129
Query: 116 FLYK-SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
+L K +P S++W EKG + VK QG C AVAA+E INAI L+SLSEQ+
Sbjct: 130 YLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQE 189
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
LVDC N GC GG MD AF+++I+N GI + Y Y+ G+CD + +I
Sbjct: 190 LVDC-DRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYK-ERNGVCDQYRKNAKVVKID 247
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGT 285
+YEDVP N+E++L KAVA+QPVS+A++A Q Y G+F G C T ++HGV GYGT
Sbjct: 248 SYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT 307
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G+ YW+++NSWG +WGE+GY R+QR++ G CG+A+ S+PV
Sbjct: 308 -ENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPV 353
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F+
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F+
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F+
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 181/308 (58%), Gaps = 14/308 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W ++G+TY + RF+IF++N V++ N+ GN SYTL LN FADLT EF
Sbjct: 32 FESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQ--GNSSYTLSLNAFADLTHHEF 89
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AV 146
AS+ G S L P VP S++W +KGAV+ VK QG C A
Sbjct: 90 KASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSAT 149
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
A+EGIN I LVSLSEQ+LVDC NNGC GG MD A++++I+N GI + Y Y
Sbjct: 150 GAIEGINKIVTGSLVSLSEQELVDC-DRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPY 208
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGG 264
+ C+ K + H I Y DVP N+E+ LLKAVA QPVSV I S A Q YS G
Sbjct: 209 QAREK-TCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKG 267
Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
+F G C T L+H V VGYG SE G+ YW++KNSWG WG +GY + R+ QG CGI
Sbjct: 268 IFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGI 326
Query: 325 AMFASFPV 332
M ASFPV
Sbjct: 327 NMLASFPV 334
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C I +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 145/298 (48%), Positives = 182/298 (61%), Gaps = 16/298 (5%)
Query: 49 AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
E +RF +F DNL V+ N A + + L +N+FADLT EF A+ G +
Sbjct: 84 GEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRH 143
Query: 109 LKANGTPFLYKSSQV-PPSVNWIEKGAVT-PVKYQGQC-------AVAAVEGINAIKINR 159
+ G + + +V P SV+W +KGAV PVK QGQC AVAAVEGIN I
Sbjct: 144 V---GEAYRHDGVEVLPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 200
Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
LVSLSEQ+LV+CA N N+GC GG MDDAF +I +N G+ + Y Y M G C+ K
Sbjct: 201 LVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMD-GKCNLAKK 259
Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
I +EDVP NDE SL KAVA+QPVSVAIDA Q Y GVF G C T L+HG
Sbjct: 260 SRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHG 319
Query: 278 VTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
V AVGYGT + G YW ++NSWG DWGE+GY R++R++ G+CGIAM AS+P+ K
Sbjct: 320 VVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 377
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 198/317 (62%), Gaps = 18/317 (5%)
Query: 34 FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
++ W A++G +A + +RF F DNL V+ N AA G + L +N+FADL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 89 TPQEFIASQTGFK-MSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC-- 144
T EF A+ G K ++ + + + G + + ++ +P +V+W EKGAV PVK QGQC
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
AV+ VE IN I +V+LSEQ+LV+C N ++GC GG MDDAF++II+N GI
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
+ Y Y+ + G CD ++ I +EDVP NDE+SL KAVA+ PVSVAI+A
Sbjct: 232 TEDDYPYKAVD-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGRE 290
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Q Y GVF+G C T L+HGV AVGYGT E G YW+++NSWG +WGE GY R++R+I+
Sbjct: 291 FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINV 349
Query: 318 PQGQCGIAMFASFPVSK 334
G+CGIAM +S+P K
Sbjct: 350 TSGKCGIAMMSSYPTKK 366
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKKNMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QGQC AV ++EG I +L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +II+N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+ G ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAEGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + SQ R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 189/316 (59%), Gaps = 19/316 (6%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
+I + F QW + R Y+ +E RF+IFK+N + + N +SY L LNKF+DL
Sbjct: 44 AILDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQ---QKSYWLGLNKFSDL 100
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
T QEF A G K + +AN F+Y+ + P V+W KGAVT VK QG C
Sbjct: 101 THQEFRAQYLGTKPVNRQRK-EAN---FMYEDVEAEPKVDWRLKGAVTDVKDQGACGSCW 156
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AV +VEG+NAIK LVSLSEQ+LVDC N GC GG MD AF++II+N GI +
Sbjct: 157 AFSAVGSVEGVNAIKTGELVSLSEQELVDC-DRKQNQGCNGGLMDYAFEFIIKNGGIDTE 215
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y+ G CD + I +Y+DVP E +L+KA+ PVSVAI+A Q
Sbjct: 216 KDYPYKARD-GRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQ 274
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR-DIDQP 318
Y GGVF G C + L+HGV AVGYGT ++G+ YW++KNSWG WGE GY R++R D
Sbjct: 275 HYQGGVFTGPCGSELDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDST 334
Query: 319 QGQCGIAMFASFPVSK 334
G+CGI + ASFP+ K
Sbjct: 335 DGKCGINIEASFPIKK 350
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 206/343 (60%), Gaps = 27/343 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
+E N A GN SY L +N+FAD+T QEF+A TG + + S+ LK N
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKIND--- 124
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC TN N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSY 239
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
+ V P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+
Sbjct: 240 Q-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G KYWL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 195/316 (61%), Gaps = 19/316 (6%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
++ + +E+W+ + + E +RF FKDN+ + N A G LN+F D+
Sbjct: 41 ALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYAP----LNRFGDM 95
Query: 89 TPQEFIASQTGFKMSD-HSSSLKANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA 145
+EF A+ G +D L A P F+Y+ + +P +V+W KGAVT VK QG+C
Sbjct: 96 GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
V +VEGINAI+ RLVSLSEQ+L+DC T DN+ GC GG M++AF+YI + GI
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNS-GCQGGLMENAFEYIKHSGGI 214
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
T ++ Y Y + G CD+++A I +++VP N E +L KAVANQPVSVAIDA
Sbjct: 215 TTESAYPYR-AANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQ 273
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
+ QFYS GVF G C T L+HGV VGYG + +G +YW++KNSWG WGE GY R+QRD
Sbjct: 274 SFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDSG 333
Query: 317 QPQGQCGIAMFASFPV 332
G CGIAM AS+PV
Sbjct: 334 YDGGLCGIAMEASYPV 349
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 151/319 (47%), Positives = 193/319 (60%), Gaps = 28/319 (8%)
Query: 34 FEQWKAQYGRTYKESA--------ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
F+ W Q+G++Y E+A E + R+ IFKDNL + N N+ Y L LN F
Sbjct: 57 FDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGENEK---NQGYFLGLNAF 113
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQG 142
ADLT +EF A + G + S + + F Y S Q+ P S++W EKGAV VK QG
Sbjct: 114 ADLTNEEFRAQRHGGRFD--RSRERTSYEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C AVAA+EG+N + LVSLSEQ+LVDC ++ GC GG MD AF ++I+N
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDE-GCNGGLMDYAFGFVIKN 230
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+ +A Y Y+G T CD K I YEDVP NDE +LLKAVA+QPVSVAIDA
Sbjct: 231 GGLDTEADYPYKGYGT-RCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDA 289
Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S++QFY G+F G C T L+HGVT VGYG E+G YW+IKNSWG +WGE GY ++ R
Sbjct: 290 GGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYIKMAR 348
Query: 314 DIDQPQGQCGIAMFASFPV 332
+ G CGI M AS+P
Sbjct: 349 NTGLAAGLCGINMEASYPT 367
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +WKA++G++Y E +R+ F+DNL ++ N AA G S+ L LN+FADLT +E
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
+ + G + K + + +P SV+W KGAV +K QG C A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 158
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF +II N GI + Y
Sbjct: 159 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
Y+G CD + I +YEDV PN E SL KAVANQPVSVAI+A A Q YS
Sbjct: 218 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 276
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY R++R+I G+CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335
Query: 324 IAMFASFPVSK 334
IA+ S+P+ K
Sbjct: 336 IAVEPSYPLKK 346
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 142/311 (45%), Positives = 195/311 (62%), Gaps = 17/311 (5%)
Query: 34 FEQWKAQYGRTYKES--AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
++ W A+ G + E+ +RF +F DNL V+ N A + L +N+FADLT +
Sbjct: 51 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------ 144
EF A+ G K+++ S +A G + + ++P SV+W EKGAV PVK QGQC
Sbjct: 111 EFRATFLGAKVAERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAF 167
Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
AV+ VE IN + +++LSEQ+LV+C+TN N+GC GG M DAF +II+N GI +
Sbjct: 168 SAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTEDD 227
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
Y Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A Q Y
Sbjct: 228 YPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
GVF+G C T L+HGV AVGYGT + G YW+++NSWG WGE GY R++R+I+ G+
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 345
Query: 322 CGIAMFASFPV 332
CGIAM AS+P
Sbjct: 346 CGIAMMASYPT 356
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 152/346 (43%), Positives = 212/346 (61%), Gaps = 22/346 (6%)
Query: 2 AKYFLIVVLIISGSC---ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIF 58
A FL+ VL++ + A+ ++A + E+W A++GR YK+ AE ++R E+F
Sbjct: 3 ASRFLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVF 62
Query: 59 KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
+ N ++ FN A G S+ L N+FADLT +EF A++TG + S A F Y
Sbjct: 63 RANAELIDSFN--AAGTHSHRLATNRFADLTVEEFRAARTGLRPRPAPS---AGAGRFRY 117
Query: 119 KS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
++ + SV+W GAVT VK QG C AVAAVEG+N I+ RLVSLSEQ+L
Sbjct: 118 ENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQEL 177
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC + + GC GG MD+AF+++ + G+ +++ Y Y+G G C S A AA I
Sbjct: 178 VDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRD-GPCRSSAAAARAASIRG 236
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
+EDVP N+E +L AVANQPVSVAI+ A +FY GV G C T LNH +TAVGYGT+
Sbjct: 237 HEDVPRNNEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTA 296
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G +YWL+KNSWG WGE GY R++R + + +G CG+A S+PV
Sbjct: 297 NDGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 341
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 150/355 (42%), Positives = 205/355 (57%), Gaps = 33/355 (9%)
Query: 5 FLIVVLIISGSCASQATYRTFDE------------GSIAEKFEQWKAQYGRTYKESA--- 49
L++ ++I S A+ + ++DE +A +E W ++G+ + +
Sbjct: 8 ILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQSNGLVG 67
Query: 50 -ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
E +RFEIFKDNL ++ NN N SY L L +FADLT +E+ + G K
Sbjct: 68 EEKDQRFEIFKDNLRFIDEHNNK---NLSYKLGLTRFADLTNEEYRSIYLGAKSKKRVLK 124
Query: 109 LKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLV 161
P + +P SV+W ++GAV VK QG C + AVEGIN I L+
Sbjct: 125 TSDRYQPRV--GDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLI 182
Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
SLSEQ+LVDC T+ N GC GG MD AF++II+N GI + Y Y+ + G CD +
Sbjct: 183 SLSEQELVDCDTS-YNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKA-ADGRCDQTRKNA 240
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
I YEDVP N+E +L K +ANQP+SVAI+A A Q YS GVF+G C T L+HGV
Sbjct: 241 KVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELDHGVV 300
Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
AVGYGT E G YW+++NSWG WGE GY ++ R+I +P G+CGIAM AS+P+ K
Sbjct: 301 AVGYGT-ENGKDYWIVRNSWGGSWGESGYIKMARNIAEPTGKCGIAMEASYPIKK 354
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 195/316 (61%), Gaps = 19/316 (6%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
++ + +E+W+ + + E +RF FKDN+ + N A G LN+F D+
Sbjct: 41 ALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYPP----LNRFGDM 95
Query: 89 TPQEFIASQTGFKMSD-HSSSLKANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA 145
+EF A+ G +D L A P F+Y+ + +P +V+W KGAVT VK QG+C
Sbjct: 96 GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
V +VEGINAI+ RLVSLSEQ+L+DC T DN+ GC GG M++AF+YI + GI
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNS-GCQGGLMENAFEYIKHSGGI 214
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
T ++ Y Y + G CD+++A I +++VP N E +L KAVANQPVSVAIDA
Sbjct: 215 TTESAYPYR-AANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQ 273
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
+ QFYS GVF G C T L+HGV VGYG + +G +YW++KNSWG WGE GY R+QRD
Sbjct: 274 SFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDSG 333
Query: 317 QPQGQCGIAMFASFPV 332
G CGIAM AS+PV
Sbjct: 334 YDGGLCGIAMEASYPV 349
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 210/354 (59%), Gaps = 31/354 (8%)
Query: 2 AKYFLIVVLIISGSCASQAT------YRTFDEGS---IAEKFEQWKAQYGRTYKESAENS 52
+K + V+L+ G+C ++ + Y D S + E FE+W A++ + Y E
Sbjct: 3 SKLSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKL 62
Query: 53 KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
RFE+FKDNL ++ N SY L LN+FADLT EF + G + +++
Sbjct: 63 HRFEVFKDNLKLIDEINREVT---SYWLGLNEFADLTHDEFKTTYLGLSPP---PARRSS 116
Query: 113 GTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVS 162
F Y+ + +P +V+W +KGAVT VK QGQC VAAVEGINAI L +
Sbjct: 117 SRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTA 176
Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGIC-DSIKAED 221
LSEQ+L+DC+ D N+GC GG MD AF YI + G+ + Y Y M G C D K+E
Sbjct: 177 LSEQELIDCSV-DGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYL-MEEGSCGDGKKSES 234
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVT 279
A I+ YEDVP DE++L+KA+A+QPVSVAI+AS QFYSGGVF+G C L+HGV
Sbjct: 235 EAVSISGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVA 294
Query: 280 AVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
AVGYG+ + +G Y ++KNSWG WGE GY R++R + +G CGI AS+P
Sbjct: 295 AVGYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGINKMASYPT 348
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + SQ R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENIKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI++++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 191/315 (60%), Gaps = 23/315 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
++ +E+W ++G+ E +RFEIFKDNL ++ N N SY L L KFADLT
Sbjct: 38 VSRLYEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK---NLSYRLGLTKFADLT 94
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQCA- 145
E+ + G ++ KA T Y++ +P SV+W ++GAV VK QG C
Sbjct: 95 NDEYRSMYLGSRLK-----RKATKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGS 149
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
+ AVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II+N GI
Sbjct: 150 CWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGID 208
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--A 257
+ Y Y+G+ G CD + I +YEDVP N EESL KA+++QP+SVAI+ A
Sbjct: 209 TEEDYPYKGVD-GRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISVAIEGGGRA 267
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Q Y G+F+G C T L+HGV AVGYGT E G YW++KNSWG WGE GY R++R+I
Sbjct: 268 FQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGESGYIRMERNIAS 326
Query: 318 PQGQCGIAMFASFPV 332
G+CGIA+ S+P+
Sbjct: 327 SAGKCGIAVEPSYPI 341
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 206/343 (60%), Gaps = 27/343 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
+E N A GN SY L +N+FAD+T QEF+A TG + + S+ LK N
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKIND--- 124
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC TN N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSY 239
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
+ V P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+
Sbjct: 240 Q-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G KYWL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/331 (43%), Positives = 204/331 (61%), Gaps = 23/331 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNR--SYTL 80
E ++ E + +W++ + + AE +RF FK N++ + R N+ + N SY L
Sbjct: 35 EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94
Query: 81 RLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP-FLYKSSQ-VPPSVNWIEKGAVTPV 138
RLN+F D+ EF ++ F H + A P F+Y + + +P +V+W +KGAVT V
Sbjct: 95 RLNRFGDMDQAEF---RSTFAGPLHRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGV 151
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QG+C AVA+VEG+NAI+ LVSLSEQ+L+DC T ++NGC GG M+ AF++
Sbjct: 152 KDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEF 211
Query: 192 IIQNKG-ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
I + G + +A Y Y S G C++ + + +I ++ VP +EE+L KAVA+QPVS
Sbjct: 212 IAHSAGGLATEAAYPYH-ASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVS 270
Query: 251 VAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE-GIKYWLIKNSWGQDWGEDG 307
VAIDA A QFYS GVF G C + L+HGV VGYG +EE G +YW++KNSWG WGE G
Sbjct: 271 VAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWGEHG 330
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVSKESAQ 338
Y R+QRD G CGIAM AS+PV E +
Sbjct: 331 YVRMQRDSGVDGGLCGIAMEASYPVKNEQTK 361
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 149/329 (45%), Positives = 191/329 (58%), Gaps = 15/329 (4%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
AT +EG + +EQW + G+ Y E +RF+IFKDNL +E N+ NRSY
Sbjct: 27 ATESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP--NRSYE 84
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTP- 137
LNKF+DLT EF AS G KM S S A + YK V P V+W E+GAV P
Sbjct: 85 RGLNKFSDLTADEFQASYLGGKMEKKSLSDVAE--RYQYKEGDVLPDEVDWRERGAVVPR 142
Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG+C A AVEGIN I LVSLSEQ+L+DC ++N GC GG AF+
Sbjct: 143 VKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFE 202
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPV 249
+I +N GI +D VY Y G T C +I+ + I +E VP NDE SL KAVA QP+
Sbjct: 203 FIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPI 262
Query: 250 SVAIDASALQFYSGGVFNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
SV I A+ + Y GV+ G C +H V VGYGTS + YWLI+NSWG +WGE GY
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGY 322
Query: 309 FRLQRDIDQPQGQCGIAMFASFPVSKESA 337
RLQR+ +P G+C +A+ +P+ S+
Sbjct: 323 LRLQRNFHEPTGKCAVAVAPVYPIKSNSS 351
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +WKA++G++Y E +R+ F+DNL ++ N AA G S+ L LN+FADLT +E
Sbjct: 41 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 100
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
+ + G + K + + +P SV+W KGAV +K QG C A
Sbjct: 101 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 159
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF +II N GI + Y
Sbjct: 160 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 218
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
Y+G CD + I +YEDV PN E SL KAVANQPVSVAI+A A Q YS
Sbjct: 219 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 277
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY R++R+I G+CG
Sbjct: 278 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 336
Query: 324 IAMFASFPVSK 334
IA+ S+P+ K
Sbjct: 337 IAVEPSYPLKK 347
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/329 (44%), Positives = 199/329 (60%), Gaps = 22/329 (6%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNR 76
S +Y E + + +W +++ RTY E +RFE+F+DNL +++ N AA G
Sbjct: 25 SIVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLH 84
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKS---SQVPPSVNWIEK 132
S+ L LN+FADLT +E+ ++ G + D L A Y++ ++P +V+W +K
Sbjct: 85 SFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSAR-----YQADDNEELPETVDWRKK 139
Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
GAV +K QG C A+AAVEGIN I ++ LSEQ+LVDC T+ N GC GG M
Sbjct: 140 GAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNEGCNGGLM 198
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
D AF++II N GI ++ Y Y+ CD+ K I YEDVP N E+SL KAVA
Sbjct: 199 DYAFEFIINNGGIDSEEDYPYKERDN-RCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVA 257
Query: 246 NQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
NQP+SVAI+A A Q Y G+F G C T L+HGV AVGYGT E G YWL++NSWG W
Sbjct: 258 NQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGTVW 316
Query: 304 GEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GEDGY R++R+I G+CGIA+ S+P
Sbjct: 317 GEDGYIRMERNIKASSGKCGIAVEPSYPT 345
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 204/335 (60%), Gaps = 18/335 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
+E N A GN SY L +N+FAD+T QEF+A TG + + S L + L +P
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPN--SYLSPSPINDL-SDDDMP 124
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
+++W E GAVT VK QGQC AV ++EG I L+ SEQ+L+DC TN N
Sbjct: 125 SNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN--N 182
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V P E
Sbjct: 183 YGCNGGFMTNAFDFIKENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VVPEGE 239
Query: 238 ESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KYWL+K
Sbjct: 240 TSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKYWLLK 299
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
NSWG WGEDG+ ++ RD P G C IA +S+P
Sbjct: 300 NSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 196/323 (60%), Gaps = 28/323 (8%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E ++ E +E+W+ Q+ R ++ E ++RF +FKDN+ + FN + Y LRLN+F
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR---DEPYKLRLNRFG 96
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
D+T E + ++S H + G K+ ++ GAV VK QGQC
Sbjct: 97 DMTADESAGAYASSRVS-HHRMFRGRGE----KAQRL--------HGAVGAVKDQGQCGS 143
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
+AAVEGINAI+ + L +LSEQQLVDC T N GC GG MD+AF+YI ++ G+
Sbjct: 144 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 203
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SA 257
+ Y Y + C S A A I YEDVP N E +L KAVANQPVSVAI+A S
Sbjct: 204 ASSAYPYRARQS-SCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 262
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
QFYS GVF G C T L+HGV AVGYGT+ +G KYW+++NSWG DWGE GY R++RD+
Sbjct: 263 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSA 322
Query: 318 PQGQCGIAMFASFPVSKESAQPS 340
+G CGIAM AS+P+ K S P+
Sbjct: 323 KEGLCGIAMEASYPI-KTSPNPA 344
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/304 (48%), Positives = 192/304 (63%), Gaps = 18/304 (5%)
Query: 40 QYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG 99
++ + Y KRFEIFKDNL ++ N N+S+ L LNKFADL+ +E+ + G
Sbjct: 13 KHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGV--NQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 100 FKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEG 151
+M +++ F Y ++P SV+W EKGAV PVK QGQC VAAVEG
Sbjct: 71 GRMVRDRKGFESD--RFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEG 128
Query: 152 INAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST 211
IN I L+SLSEQ+LVDC N GC GGFMD AF++I++N GI + Y Y+G+
Sbjct: 129 INQIATGDLISLSEQELVDCDKG-FNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVD- 186
Query: 212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGY 269
G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A A Q Y G+FNG
Sbjct: 187 GQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGL 246
Query: 270 CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFA 328
C T L+HGV AVGYGT E+G YW+++NSWG +WGE+GY RL+R++ G+CGIAM
Sbjct: 247 CGTDLDHGVVAVGYGT-EDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQP 305
Query: 329 SFPV 332
S+P
Sbjct: 306 SYPT 309
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 149/329 (45%), Positives = 191/329 (58%), Gaps = 15/329 (4%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
AT +EG + +EQW + G+ Y E +RF+IFKDNL +E N+ NRSY
Sbjct: 27 ATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP--NRSYE 84
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTP- 137
LNKF+DLT EF AS G KM S S A + YK V P V+W E+GAV P
Sbjct: 85 RGLNKFSDLTADEFQASYLGGKMEKKSLSDVAE--RYQYKEGDVLPDEVDWRERGAVVPR 142
Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG+C A AVEGIN I LVSLSEQ+L+DC ++N GC GG AF+
Sbjct: 143 VKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFE 202
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPV 249
+I +N GI +D VY Y G T C +I+ + I +E VP NDE SL KAVA QP+
Sbjct: 203 FIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPI 262
Query: 250 SVAIDASALQFYSGGVFNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
SV I A+ + Y GV+ G C +H V VGYGTS + YWLI+NSWG +WGE GY
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGY 322
Query: 309 FRLQRDIDQPQGQCGIAMFASFPVSKESA 337
RLQR+ +P G+C +A+ +P+ S+
Sbjct: 323 LRLQRNFHEPTGKCAVAVAPVYPIKSNSS 351
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 196/312 (62%), Gaps = 18/312 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ +FE W +++G+ YK E RFE+F++NL ++ N SY L LN+FADL+
Sbjct: 400 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEV---SSYWLGLNEFADLS 456
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF + G + ++ S +G F Y+ + +P SV+W +KGAVT VK QG C
Sbjct: 457 HEEFKSKYLGLR-AEFPRSRDYSG-EFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCW 514
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L +LSEQ+L+DC T N+GC GG MD AF +I N G+ +
Sbjct: 515 AFSTVAAVEGINQIVTGNLTTLSEQELIDCDTT-FNSGCNGGLMDYAFAFIASNGGLHKE 573
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G C+ K + I+ YEDVP DEESLLKA+A+QP+SVAI+AS Q
Sbjct: 574 DDYPYL-MEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQ 632
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVFNG C T L+HGV AVGYG+S+ G+ Y ++KNSWG WGE GY R++R+ + +
Sbjct: 633 FYSGGVFNGPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTE 691
Query: 320 GQCGIAMFASFP 331
G CGI AS+P
Sbjct: 692 GLCGINKMASYP 703
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 150/319 (47%), Positives = 193/319 (60%), Gaps = 28/319 (8%)
Query: 34 FEQWKAQYGRTYKESA--------ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
F+ W Q+G++Y ++A E + R+ IFKDNL + N N+ Y L LN F
Sbjct: 57 FDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGENEK---NQGYFLGLNAF 113
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQG 142
ADLT +EF A + G + S + + F Y S Q+ P S++W EKGAV VK QG
Sbjct: 114 ADLTNEEFRAQRHGGRFD--RSRERTSHEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C AVAA+EG+N + LVSLSEQ+LVDC ++ GC GG MD AF ++I+N
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDE-GCNGGLMDYAFGFVIKN 230
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+ +A Y Y+G T CD K I YEDVP NDE +LLKAVA+QPVSVAIDA
Sbjct: 231 GGLDTEADYPYKGYGT-RCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDA 289
Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
S++QFY G+F G C T L+HGVT VGYG E+G YW+IKNSWG +WGE GY ++ R
Sbjct: 290 GGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYVKMAR 348
Query: 314 DIDQPQGQCGIAMFASFPV 332
+ G CGI M AS+P
Sbjct: 349 NTGLAAGLCGINMEASYPT 367
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 204/335 (60%), Gaps = 18/335 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
+E N A GN SY L +N+FAD+T QEF+A TG + + S L + L +P
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPN--SYLSPSPINDL-SDDDMP 124
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
+++W E GAVT VK QGQC AV ++EG I L+ SEQ+L+DC TN N
Sbjct: 125 SNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN--N 182
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V P E
Sbjct: 183 YGCNGGFMTNAFDFIKENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VVPEGE 239
Query: 238 ESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KYWL+K
Sbjct: 240 TSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKYWLLK 299
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
NSWG WGEDG+ ++ RD P G C IA +S+P
Sbjct: 300 NSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 144/298 (48%), Positives = 182/298 (61%), Gaps = 16/298 (5%)
Query: 49 AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
E +RF +F DNL V+ N A + + L +N+FADLT EF A+ G +
Sbjct: 84 GEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRH 143
Query: 109 LKANGTPFLYKSSQ-VPPSVNWIEKGAVT-PVKYQGQC-------AVAAVEGINAIKINR 159
+ G + + + +P SV+W +KGAV PVK QGQC AVAAVEGIN I
Sbjct: 144 V---GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 200
Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
LVSLSEQ+LV+CA N N+GC GG MDDAF +I +N G+ + Y Y M G C+ K
Sbjct: 201 LVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMD-GKCNLAKK 259
Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
I +EDVP NDE SL KAVA+QPVSVAIDA Q Y GVF G C T L+HG
Sbjct: 260 SRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHG 319
Query: 278 VTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
V AVGYGT + G YW ++NSWG DWGE+GY R++R++ G+CGIAM AS+P+ K
Sbjct: 320 VVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 377
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 155/360 (43%), Positives = 215/360 (59%), Gaps = 29/360 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIA------EKFEQWKAQYGRTYKESAENSKR 54
+AK L+V L+ + S FDE +A + +E+W+ + ++ E +R
Sbjct: 4 LAKTLLLVALV-AMSAVELCRAIEFDERDLASDEALWDLYERWQTHH-HVHRHHGEKGRR 61
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD--HSSSLKAN 112
F FK+N+ + N G+R Y L LN+F D+ +EF ++ +++D + S A
Sbjct: 62 FGTFKENVRFIHAHNKR--GDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESPAAP 119
Query: 113 GTP-FLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSL 163
P F+Y + +PPSV+W ++GAVT VK QG C V +VEGINAI+ LVSL
Sbjct: 120 AVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSLVSL 179
Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE-DH 222
SEQ+L+DC T++N GC GG M++AF++I G+T ++ Y Y S G CDS+++
Sbjct: 180 SEQELIDCDTDEN--GCQGGLMENAFEFIKSYGGVTTESAYPYRA-SNGTCDSVRSRRGQ 236
Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTA 280
I ++ VP E++L KAVANQPVSVAIDA A QFYS GVF G C T L+HGV A
Sbjct: 237 IVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAA 296
Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
VGYG S++G YW++KNSWG WGE GY R+QR G CGIAM ASFP+ K S P+
Sbjct: 297 VGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KTSPNPA 354
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 144/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F+
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C I +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 146/325 (44%), Positives = 202/325 (62%), Gaps = 19/325 (5%)
Query: 19 QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
+AT+RT +E + +E+W ++G+ Y E KRF+IFKDNL +++ N NR+Y
Sbjct: 27 KATWRTDEE--VNSLYEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQQNAE---NRTY 81
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTP 137
L LN+FADLT +E+ A G K+ + + + + + +P SV+W ++GAV P
Sbjct: 82 KLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRYAPRVGETLPDSVDWRKEGAVVP 141
Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK Q C A+ AVEGIN I L+SLSEQ+LVDC T N GC GG MD AF+
Sbjct: 142 VKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELVDCDTG-YNMGCNGGLMDYAFE 200
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
+II+N GI ++ Y Y+G+ G CD + I YEDV DE +L KAVANQPVS
Sbjct: 201 FIIKNGGIDSEEDYPYKGVD-GRCDEYRKNAKVVSIDGYEDVNTYDELALKKAVANQPVS 259
Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
VA++ Q YS GVF G C T L+HGV AVGYGT + G +W+++NSWG DWGE+GY
Sbjct: 260 VAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGT-DNGHDFWIVRNSWGADWGEEGY 318
Query: 309 FRLQRDIDQPQ-GQCGIAMFASFPV 332
RL+R++ + G+CGIA+ S+P+
Sbjct: 319 IRLERNLGNSRSGKCGIAIEPSYPI 343
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 150/347 (43%), Positives = 208/347 (59%), Gaps = 25/347 (7%)
Query: 6 LIVVLIISG-SCASQATYRTFDEGSIAEK--------FEQWKAQYGRTYKESAENSKRFE 56
L+++LI S S AS + ++DE I + +E W ++G++Y E KRF+
Sbjct: 12 LLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEKDKRFQ 71
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP- 115
IFKDNL ++ N ++ N+SY L L KFADLT +E+ + G K S L N +
Sbjct: 72 IFKDNLKYIDEQN--SVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNKSDR 129
Query: 116 FLYK-SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
+L K +P SV+W +KG + VK QG C AVAA+E INAI L+SLSEQ+
Sbjct: 130 YLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQE 189
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
LVDC N GC GG MD AF+++I N GI + Y Y+ + +CD + +I
Sbjct: 190 LVDC-DKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERND-VCDQYRKNAKVVKID 247
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGT 285
+YEDVP N+E++L KAVA+QPVS+AI+A LQ Y G+F G C T ++HGV A GYG
Sbjct: 248 SYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAAGYG- 306
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
SE G+ YW+++NSWG WGE GY R+QR++ G CG+A S+PV
Sbjct: 307 SENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 188/311 (60%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +WKA++G+ Y E +R+ F+DNL ++ N AA G S+ L LN+FADLT +E
Sbjct: 40 YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
+ + G + K + + +P SV+W KGAV +K QG C A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 158
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF +II N GI + Y
Sbjct: 159 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
Y+G CD + I +YEDV PN E SL KAVANQPVSVAI+A A Q YS
Sbjct: 218 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 276
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY R++R+I G+CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335
Query: 324 IAMFASFPVSK 334
IA+ S+P+ K
Sbjct: 336 IAVEPSYPLKK 346
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/320 (45%), Positives = 192/320 (60%), Gaps = 18/320 (5%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
D+G + + F QW ++ R Y +E +RF+IFKDNL + N +SY L LNKF
Sbjct: 45 DDGML-DVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ---EKSYWLGLNKF 100
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
+DLT EF A G + + + L+ NG F+Y+ V+W +KGAV+ VK QG C
Sbjct: 101 SDLTHDEFRALYLGIRPAGRAHGLR-NGDRFIYEDVVAEEMVDWRKKGAVSDVKDQGSCG 159
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
A+ +VEG+NAI L+SLSEQ+LVDC N GC GG MD AF +II+N GI
Sbjct: 160 SCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRG-QNQGCNGGLMDYAFDFIIKNGGI 218
Query: 199 TNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+ Y Y+ + G CD + E I +Y+DVP E SLLKAV+ PVSVAI+A
Sbjct: 219 DTEEDYPYKA-TDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGG 277
Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR-D 314
Q Y GGVF G C T L+HGV AVGYGT ++G+ YW++KNSWG WGE GY R++R
Sbjct: 278 RDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERMG 337
Query: 315 IDQPQGQCGIAMFASFPVSK 334
+ G+CGI + SFP+ K
Sbjct: 338 SNSTSGKCGINIEPSFPIKK 357
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 199/322 (61%), Gaps = 22/322 (6%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
+ E ++ + +QW A++GRTY++ AE + RF++FK N V+ N A +SY + LN+
Sbjct: 42 YGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNE 101
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
FAD+T EF+A TG + + A G L + +V+W +KGAVT +K Q
Sbjct: 102 FADMTNDEFMAMYTGLRPVPAGAKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQ 161
Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
GQC AVAAVEGI+ I LVSLSEQQ++DC T + NNGC GG++D+AF+YI
Sbjct: 162 GQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDT-EGNNGCNGGYIDNAFQYIAG 220
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N G+ + Y Y + +C S++ A I+ Y+DVP DE +L AVANQPVSVAID
Sbjct: 221 NGGLATEDAYPYTA-AQAMCQSVQP---VAAISGYQDVPSGDEAALAAAVANQPVSVAID 276
Query: 255 ASALQFYSGGVFNGY-CET--FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
A Q Y GGV C T LNH VTAVGYGT+E+G YWL+KN WGQ+WGE GY RL
Sbjct: 277 AHNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRL 336
Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
+R + CG+A AS+PV+
Sbjct: 337 ERGAN----ACGVAQQASYPVA 354
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 149/327 (45%), Positives = 199/327 (60%), Gaps = 20/327 (6%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
AS++++RT DE + +E W ++G++Y E KRF+IFKDNL ++ N A N
Sbjct: 35 ASKSSWRTDDE--VMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHN--AEENL 90
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG-TPFLYKSSQVPPSVNWIEKGAV 135
SY + LN+FADLT +E+ ++ G K S +K++ P + S +P SV+W KGAV
Sbjct: 91 SYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVKSDRYAPRVGDS--LPESVDWRAKGAV 148
Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
P+K QG C V AVEGIN I L++LSEQ+LVDC N GC GG MD
Sbjct: 149 APIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDC-DKSYNEGCDGGLMDYG 207
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
F++II N GI D Y Y G CD + I +YEDVP N+EE+L KAVA+QP
Sbjct: 208 FEFIINNGGIDTDKDYPYLGRDA-RCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQP 266
Query: 249 VSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
VSV I+ A QFY G+F G C T L+HGV VGYGT E+G YW+++NSWG WGE
Sbjct: 267 VSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGT-EKGKDYWIVRNSWGSSWGEA 325
Query: 307 GYFRLQRDI-DQPQGQCGIAMFASFPV 332
GY R++R++ G+CGIAM S+P+
Sbjct: 326 GYIRMERNLAGTSVGKCGIAMEPSYPL 352
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 203/339 (59%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + SQ R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFLYKS--- 120
+E N A GN SY L +N+FAD+T +EF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTEFKINDISD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK QGQC AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDENGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 152/355 (42%), Positives = 205/355 (57%), Gaps = 33/355 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRT---------FDEGSIAEKFEQWKAQYGRTYKESA-- 49
MA FL +V + S S +Y E + +E W ++G+ +++
Sbjct: 8 MAILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLV 67
Query: 50 ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMS---DHS 106
E +RFEIFKDNL V+ N N SY L L +FADLT E+ + G KM +
Sbjct: 68 EKDRRFEIFKDNLRFVDEHNEK---NLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERR 124
Query: 107 SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINR 159
+SL+ ++P S++W +KGAV VK QG C + AVEGIN I
Sbjct: 125 TSLRYEARV----GDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGD 180
Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
L++LSEQ+LVDC T+ N GC GG MD AF++II+N GI D Y Y+G+ G CD I+
Sbjct: 181 LITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK 238
Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHG 277
I +YEDVP EESL KAVA+QP+S+AI+A A Q Y G+F+G C T L+HG
Sbjct: 239 NAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHG 298
Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
V AVGYGT E G YW+++NSWG+ WGE GY R+ R+I G+CGIA+ S+P+
Sbjct: 299 VVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPI 352
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 195/313 (62%), Gaps = 14/313 (4%)
Query: 31 AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR--SYTLRLNKFADL 88
A + W ++ + Y E KRF IF+DNL +++ NN G + L LNKFADL
Sbjct: 2 AYHLQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADL 61
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
T EF G K + + S+K++ + + ++P SV+W +KGAV+ VK QGQC
Sbjct: 62 TNDEFRRIYFGVKRPEKAESVKSDRYA-VKEGDELPESVDWRKKGAVSHVKDQGQCGSCW 120
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
A+ AVEGIN I L++LSEQ+LVDC T+ N+GC GG MD AF++II N GI D
Sbjct: 121 AFSAIGAVEGINKIVTGDLITLSEQELVDCDTS-YNSGCDGGLMDYAFRFIINNGGIDTD 179
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y+ + G CDS + I EDVP N+E++L KAVA+QPV +AI+A Q
Sbjct: 180 KDYPYKA-TDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQ 238
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
Y GVF G C T L+HGV AVGYGT+++G YW+++NSWG DWGEDGY R++R+ +
Sbjct: 239 LYKSGVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKS 298
Query: 320 GQCGIAMFASFPV 332
G+CGIA+ S+PV
Sbjct: 299 GKCGIAIEPSYPV 311
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 194/320 (60%), Gaps = 24/320 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESA--ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
E + +E W ++G+ +++ E +RFEIFKDNL V+ N N SY L L +
Sbjct: 43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK---NLSYRLGLTR 99
Query: 85 FADLTPQEFIASQTGFKMS---DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
FADLT E+ + G KM + +SL+ ++P S++W +KGAV VK Q
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRTSLRYEARV----GDELPESIDWRKKGAVAEVKDQ 155
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
G C + AVEGIN I L++LSEQ+LVDC T+ N GC GG MD AF++II+
Sbjct: 156 GGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIK 214
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N GI D Y Y+G+ G CD I+ I +YEDVP EESL KAVA+QP+S+AI+
Sbjct: 215 NGGIDTDKDYPYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273
Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A A Q Y G+F+G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY R+
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R+I G+CGIA+ S+P+
Sbjct: 333 RNIASSSGKCGIAIEPSYPI 352
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 194/320 (60%), Gaps = 24/320 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESA--ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
E + +E W ++G+ +++ E +RFEIFKDNL V+ N N SY L L +
Sbjct: 43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK---NLSYRLGLTR 99
Query: 85 FADLTPQEFIASQTGFKMS---DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
FADLT E+ + G KM + +SL+ ++P S++W +KGAV VK Q
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRTSLRYEARV----GDELPESIDWRKKGAVAEVKDQ 155
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
G C + AVEGIN I L++LSEQ+LVDC T+ N GC GG MD AF++II+
Sbjct: 156 GGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIK 214
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N GI D Y Y+G+ G CD I+ I +YEDVP EESL KAVA+QP+S+AI+
Sbjct: 215 NGGIDTDKDYPYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273
Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A A Q Y G+F+G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY R+
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R+I G+CGIA+ S+P+
Sbjct: 333 RNIASSSGKCGIAIEPSYPI 352
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 154/347 (44%), Positives = 207/347 (59%), Gaps = 33/347 (9%)
Query: 25 FDEGSIA------EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR-S 77
FDE +A + +E+W+ + R ++ E +RF FK+N+ + N G+R S
Sbjct: 31 FDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKR--GDRPS 87
Query: 78 YTLRLNKFADLTPQEFIASQTGFKMSD----HSSSLKANGTP-FLYK-SSQVPPSVNWIE 131
Y LRLN+F D+ P+EF ++ +++D SS A P F+Y ++ VP SV+W +
Sbjct: 88 YRLRLNRFGDMGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQ 147
Query: 132 KGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
GAVT VK QG+C V AVEGINAI+ LVSLSEQ+LVDC T +N GC GG
Sbjct: 148 HGAVTAVKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAEN--GCQGGL 205
Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT--NYEDVPPNDEESLLK 242
M++AF +I GIT ++ Y Y S G CD ++A ++ ++ VP E++L K
Sbjct: 206 MENAFDFIKSYGGITTESAYPYRA-SNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAK 264
Query: 243 AVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSE-EGIKYWLIKNSW 299
AVA QPVSVAIDA A QFYS GVF G C T L+HGV VGYG S+ +G YW++KNSW
Sbjct: 265 AVARQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSW 324
Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
G WGE GY R+QR G CGIAM ASFP+ K S P+ + +
Sbjct: 325 GPSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KTSHNPARKPRRA 369
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 139/270 (51%), Positives = 173/270 (64%), Gaps = 14/270 (5%)
Query: 88 LTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
+T EF ++ G K++ H S A G+ K VPPSV+W +KGAVTP+K QGQC
Sbjct: 1 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
V AVEGIN IK N+LVSLSEQ+LVDC T++N GC GG M AF++I + G
Sbjct: 61 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQ-GCNGGLMGYAFEFIKEKGG 119
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-- 255
IT + Y Y G CD K I +E VPPN+E++LLKA ANQP+SVAIDA
Sbjct: 120 ITTEQSYPYTA-EDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGG 178
Query: 256 SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
SA QFYS GVF G C T L+HGV VGYGT+ +G KYW++KNSWG DWGE+GY R++R I
Sbjct: 179 SAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGI 238
Query: 316 DQPQGQCGIAMFASFPVSKESAQPSSADKS 345
+G CGIA+ AS+P+ S P A S
Sbjct: 239 SAKEGLCGIAVEASYPIKNSSTNPVGAPSS 268
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++G YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QGQC AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI++++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 194/312 (62%), Gaps = 18/312 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFEIFKDNL ++ N + N Y L LN+FADL+
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDE-RNKVVSN--YWLGLNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
QEF G K+ D+S + + F YK ++P SV+W +KGAV PVK QG C
Sbjct: 100 HQEFKNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC +NGC GG MD AF +I++N G+ +
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYSNGCNGGLMDYAFSFIVENGGLHKEE 216
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G C+ K E I+ Y DVP N+E+SLLKA+ANQ +SVAI+AS QF
Sbjct: 217 DYPYI-MEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQF 275
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YSGGVF+G+C + L+HGV AVGYGT+ +G+ Y ++KNSWG WGE GY R+ R + +G
Sbjct: 276 YSGGVFDGHCGSDLDHGVAAVGYGTA-KGVDYIIVKNSWGSKWGEKGYIRM-RGTLETRG 333
Query: 321 QCGIAMFASFPV 332
AS+P+
Sbjct: 334 NLRYLQMASYPL 345
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 193/322 (59%), Gaps = 20/322 (6%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
RT DE + + W ++G++Y E RF+IFKDNL ++ N A +RSY L L
Sbjct: 40 RTDDE--VMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHN--ADPDRSYELGL 95
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVK 139
N+FADLT +E+ A G K + L + G Y + ++P S++W EKGAV VK
Sbjct: 96 NRFADLTNEEYRAKYLGTKSRESRPKL-SKGPSDRYAPVEGEELPDSIDWREKGAVAAVK 154
Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QG C A+ AVEGIN I L++LSEQ+LVDC N GC GG MD AF +I
Sbjct: 155 DQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDC-DRSYNEGCEGGLMDYAFNFI 213
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
I+N GI +D Y Y G G C+ K I +YEDVP DE++L KA ANQP+SVA
Sbjct: 214 IKNGGIDSDLDYPYTGRD-GTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVA 272
Query: 253 IDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
I+A + Q Y G+F G C T ++HGV VGYG SEEG+ YW+++NSWG WGE GY +
Sbjct: 273 IEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYG-SEEGMDYWIVRNSWGAAWGEAGYLK 331
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+QR++ + G CGI + S+PV
Sbjct: 332 MQRNVGKSSGLCGITIEPSYPV 353
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 144/339 (42%), Positives = 205/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C I +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 194/320 (60%), Gaps = 24/320 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E+FE+W A+YGR Y ++AE +RF+IFK+N+ +E FNN + GN SYTL +N+F D+T
Sbjct: 6 MMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRS-GN-SYTLGVNQFTDMT 63
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFL----YKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
EF+A TG + L P + S VP S++W + GAVT VK QG C
Sbjct: 64 NNEFLARYTGASLP-----LNIERDPVVSFDDVDISAVPQSIDWRDYGAVTSVKNQGSCG 118
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
A+A VEGI IK L+SLSEQ+++DCA + GC GG+++ A+ +II N G+
Sbjct: 119 SCWAFSAIATVEGIYKIKAGNLISLSEQEVLDCALS---YGCDGGWVNKAYDFIISNNGV 175
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
T+ A Y+G G C+ + A IT Y V N+E S++ AVANQP++ IDA
Sbjct: 176 TSFANLPYKGYK-GPCNHNDLPNKA-YITGYTYVQSNNERSMMIAVANQPIAALIDAGGD 233
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Q+Y GVF G C T LNH +T +GYG + G KYW++KNSWG WGE GY R+ RD+
Sbjct: 234 FQYYKSGVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSS 293
Query: 318 PQGQCGIAMFASFPVSKESA 337
P G CGIAM FP + A
Sbjct: 294 PYGLCGIAMAPLFPTLQSGA 313
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 143/295 (48%), Positives = 181/295 (61%), Gaps = 16/295 (5%)
Query: 50 ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
E+ +RF +F DNL V+ N A + L +N+FADLT EF A+ G + +
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGRRV 143
Query: 110 KANGTPFLYKSSQ-VPPSVNWIEKGAVT-PVKYQGQC-------AVAAVEGINAIKINRL 160
G + + + +P SV+W +KGAV PVK QGQC AVAAVEGIN I L
Sbjct: 144 ---GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200
Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
VSLSEQ+LV+CA N N+GC GG MDDAF +I +N G+ + Y Y M G C+ K
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMD-GKCNLAKRS 259
Query: 221 DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGV 278
I +EDVP NDE SL KAVA+QPVSVAIDA Q Y GVF G C T L+HGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 279 TAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
AVGYGT + G YW ++NSWG DWGE+GY R++R++ G+CGIAM AS+P+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 205/342 (59%), Gaps = 20/342 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFD--EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
F I+ ++ S + R F+ + IA +E W ++G+ Y E RF IFKDNL
Sbjct: 12 FSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEKQLRFNIFKDNL 71
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS 120
V+ N+ N S+ L LN+FADLT +E+ + G + + S ++ + +++
Sbjct: 72 RFVDERNSE---NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDRYAFRA 128
Query: 121 SQ-VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+P SV+W +KGAV +K QG C A+AAVEG+N I L+SLSEQ+LV+C
Sbjct: 129 GDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECD 188
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
T+ N+GC GG MD AF++II+N+GI +D Y Y G G CD+ + I +YED
Sbjct: 189 TS-YNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRD-GRCDTNRKNAKVVTIDDYEDS 246
Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P DE+SL KAVANQPVSVAI+ Q Y GVF G C T L+HGV VGYGT E+G+
Sbjct: 247 PVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGYGT-EDGL 305
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+++NSWG WGE GY R+QR+ P G CGIA+ S+P+
Sbjct: 306 DYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPI 347
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 148/367 (40%), Positives = 211/367 (57%), Gaps = 28/367 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEG-----------SIAEKFEQWKAQYGRTYKESA 49
MA +++ +++ S A + ++D + +E+W ++G+ Y
Sbjct: 8 MATILIVLFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVE 67
Query: 50 ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
E KRF+IFKDNL +E N NR+Y + LN+F+DL+ +E+ + G K+
Sbjct: 68 EKEKRFQIFKDNLNFIEEHNAV---NRTYKVGLNRFSDLSNEEYRSKYLGTKIDPSRMMA 124
Query: 110 KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVS 162
+ + + +P SV+W ++GAV VK Q +C A+AAVEGIN I L +
Sbjct: 125 RPSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTA 184
Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
LSEQ+L+DC N GC GG +D AF++II N GI + Y ++G + GICD K
Sbjct: 185 LSEQELLDC-DRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQG-ADGICDQYKINAR 242
Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTA 280
A I YE VP DE +L KAVANQPVSVAI+A Q Y G+F G C T ++HGVTA
Sbjct: 243 AVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTA 302
Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQP 339
VGYGT E GI YW++KNSWG++WGE GY ++R+I + G+CGIA+ +P+ K P
Sbjct: 303 VGYGT-ENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPI-KIGQNP 360
Query: 340 SSADKSS 346
S+ D SS
Sbjct: 361 SNPDNSS 367
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 143/295 (48%), Positives = 181/295 (61%), Gaps = 16/295 (5%)
Query: 50 ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
E+ +RF +F DNL V+ N A + L +N+FADLT EF A+ G + +
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGRRV 143
Query: 110 KANGTPFLYKSSQ-VPPSVNWIEKGAVT-PVKYQGQC-------AVAAVEGINAIKINRL 160
G + + + +P SV+W +KGAV PVK QGQC AVAAVEGIN I L
Sbjct: 144 ---GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200
Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
VSLSEQ+LV+CA N N+GC GG MDDAF +I +N G+ + Y Y M G C+ K
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMD-GKCNLAKRS 259
Query: 221 DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGV 278
I +EDVP NDE SL KAVA+QPVSVAIDA Q Y GVF G C T L+HGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 279 TAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
AVGYGT + G YW ++NSWG DWGE+GY R++R++ G+CGIAM AS+P+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 149/350 (42%), Positives = 209/350 (59%), Gaps = 27/350 (7%)
Query: 5 FLIVVLIISGSCA-------SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
F++ VL++SG+ A A + ++A + E+W A++G+TYK+ E ++R E+
Sbjct: 6 FVLAVLVMSGAAALGRELAGDGAAAAAAADVAMASRHEKWMAKHGKTYKDEEEKARRLEV 65
Query: 58 FKDNLVAVERFNNAA--IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP 115
F+ N ++ FN AA G + L N+FADLT EF A++TG++ + +
Sbjct: 66 FRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFRAARTGYQRPPAAVAGAG--GG 123
Query: 116 FLYKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
FLY++ + P S++W GAVT VK QG C AVAAVEG+ I+ +LVSLSE
Sbjct: 124 FLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIRTGQLVSLSE 183
Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
Q+LVDC + GC GG MD AF+YI + G+ ++ Y Y G+ A AA
Sbjct: 184 QELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSYPYRGVDG--ACRAAAGRAAAS 241
Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGY-CETFLNHGVTAVG 282
I ++DVP NDE +L+ AVA QPVSVAI+ + +FY GV G C T LNH VTAVG
Sbjct: 242 IRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTELNHAVTAVG 301
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YGT+ +G YWL+KNSWG WGE GY R++R + + +G CGIA AS+PV
Sbjct: 302 YGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGR-EGACGIAQMASYPV 350
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 140/337 (41%), Positives = 203/337 (60%), Gaps = 19/337 (5%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ + +L + + + T +E SI + +QW Q+ R YK+ +E R ++FK NL
Sbjct: 8 FVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLK 67
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP-----FLY 118
+E FNN +GN+SYTL +N+F D +EF+A+ TG +++ S S N T +
Sbjct: 68 FIENFNN--MGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMS 125
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
S +W ++GAVTPVKYQG C + + G N L++LSEQQL+DC + N
Sbjct: 126 DIDMEDESKDWRDEGAVTPVKYQGACRLTKISGKN------LLTLSEQQLIDCDI-EKNG 178
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG ++AFKYII+N G++ + Y Y+ + + H QI ++ VP ++E
Sbjct: 179 GCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHT-QIRGFQMVPSHNER 237
Query: 239 SLLKAVANQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLI 295
+LL+AV QPVSV IDA A F Y GGV+ G C T +NH VT VGYGT G+ YW++
Sbjct: 238 ALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMS-GLNYWVL 296
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KNSWG+ WGE+GY R++RD++ PQG CGIA A++PV
Sbjct: 297 KNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 210/353 (59%), Gaps = 25/353 (7%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F LI S + A + RT DE + +E W +YG++Y E R EIFK
Sbjct: 10 MSLLFFSTFLIFSFAIDAKISPLRTNDE--VMALYESWLVKYGKSYNSLGEREMRIEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN-GTPFLY 118
+NL ++ N A NRSYT+ LN+FADLT +E+ ++ GFK SSLK+ ++
Sbjct: 68 ENLRFIDEHN--ADPNRSYTVGLNQFADLTDEEYRSTYLGFK-----SSLKSKVSNRYMP 120
Query: 119 KSSQVPPS-VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVD 170
+ +V P V+W GAV VK QG C+ +A VE IN I L+SLSEQ+LVD
Sbjct: 121 QVGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVD 180
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C N GC GGFMDDA+++II N GI + Y Y G CD K + I +YE
Sbjct: 181 CNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQ-CDEPKKNQNYVTIDSYE 239
Query: 231 DVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFN-GYCETFLNHGVTAVGYGTSE 287
VPPNDE ++ +AVA QPVSVAIDA L +FY G+F G C T LNH VT +GYGT E
Sbjct: 240 QVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGYGT-E 298
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
GI YW++KNS+G WGE GY ++QR++ +G+CGIA + +PV +++P+
Sbjct: 299 NGIDYWIVKNSYGTQWGESGYGKVQRNVGG-EGRCGIASYPFYPVKNYTSKPA 350
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 189/304 (62%), Gaps = 18/304 (5%)
Query: 39 AQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT 98
+++G++Y+ E RFE+F+DNL ++ N SY L LN+FADL+ +EF
Sbjct: 2 SKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKV---SSYWLGLNEFADLSHEEFKRKYL 58
Query: 99 GFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVE 150
G K+ + + F YK + +P SV+W +KGAV VK QG C VAAVE
Sbjct: 59 GLKIE--LPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116
Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
GIN I L +LSEQ+L+DC NNGC GG MD AF +II N G+ + Y Y M
Sbjct: 117 GINQIVTGNLTALSEQELIDC-DKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV-ME 174
Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNG 268
G C K E I+ Y DVP ++E+S LKA+ANQP+SVAI+AS+ QFYSGG+FNG
Sbjct: 175 EGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNG 234
Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
+C T L+HGV AVGYGTS+ G+ Y +KNSWG WGE GY R++R++ +P+G CGI A
Sbjct: 235 HCGTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMA 293
Query: 329 SFPV 332
S+P
Sbjct: 294 SYPT 297
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 205/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + SQ R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++EG I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQF +GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFCAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 191/314 (60%), Gaps = 21/314 (6%)
Query: 34 FEQWKAQYGRTYKE----SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+E W ++G+ AE +RFEIFKDNL ++ N N SY L L +FADLT
Sbjct: 50 YEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK---NLSYKLGLTRFADLT 106
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
+E+ + G K + LK + +P SV+W ++GAV VK QG C
Sbjct: 107 NEEYRSMYLGAKPTKRV--LKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWA 164
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
+ AVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II+N GI +A
Sbjct: 165 FSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIIKNGGIDTEA 223
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
Y Y+ + G CD + I +YEDVP N E SL KA+A+QP+SVAI+A A Q
Sbjct: 224 DYPYKA-ADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQL 282
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YS GVF+G C T L+HGV AVGYGT E G YW+++NSWG WGE GY ++ R+I+ P G
Sbjct: 283 YSSGVFDGLCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTG 341
Query: 321 QCGIAMFASFPVSK 334
+CGIAM AS+P+ K
Sbjct: 342 KCGIAMEASYPIKK 355
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +WKA++G++Y E +R+ F+DNL ++ N AA G S+ L LN+FADLT +E
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ---GQC----A 145
+ + G + K + + +P SV+W KGAV +K Q G C A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQEVAGSCWAFSA 158
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF +II N GI + Y
Sbjct: 159 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
Y+G CD + I +YEDV PN E SL KAVANQPVSVAI+A A Q YS
Sbjct: 218 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 276
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY R++R+I G+CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335
Query: 324 IAMFASFPVSK 334
IA+ S+P+ K
Sbjct: 336 IAVEPSYPLKK 346
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 189/315 (60%), Gaps = 42/315 (13%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+TY+ E R E+FKDNL+ ++R N +Y L LN+FADL+
Sbjct: 43 LTELFESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNRDVT---TYWLALNEFADLS 99
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
+EF + K +Q+ +EKGAV PVK QG C
Sbjct: 100 HEEFKS-----------------------KLAQI----RRLEKGAVAPVKNQGSCGSCWA 132
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC T+ N+GC GG MD AF YI+ N G+ +
Sbjct: 133 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTS-FNSGCNGGLMDYAFDYIVNNGGLHKEE 191
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G CD + E I+ Y DVP N+EESLLKA+A+QP+S+AI+AS QF
Sbjct: 192 DYPYL-MEEGTCDEKREEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQF 250
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
Y GVFNG C T L+HGV AVGYG+S+ G+ Y ++KNSWG WGE GY R++R+ +P+G
Sbjct: 251 YGRGVFNGPCGTDLDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 309
Query: 321 QCGIAMFASFPVSKE 335
CGI AS+P K+
Sbjct: 310 LCGINKMASYPTKKK 324
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 153/345 (44%), Positives = 205/345 (59%), Gaps = 26/345 (7%)
Query: 5 FLIVVLIISGSCA---SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
F + +I+ S A T R+ DE + +E+W ++ + Y E +RF+IFKDN
Sbjct: 9 FFLFFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQIFKDN 66
Query: 62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLY 118
L ++ N N +Y + LNKFAD+T +E+ G + SD + N G + Y
Sbjct: 67 LNFIDEHNAQ---NYTYIVGLNKFADMTNEEYRDMYLGTR-SDIKRRIMKNKITGHRYAY 122
Query: 119 KSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVD 170
S ++P V+W KGA+T +K QG C +A VE IN I +LVSLSEQ+LVD
Sbjct: 123 NSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 182
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C N GC GG MD AF++II N GI D Y Y+G G CD + + I YE
Sbjct: 183 C-DRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFE-GRCDPTRKKAKIVSIDGYE 240
Query: 231 DVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
DVP N+E +L KAVA+QPVSVAI+AS ALQ Y GVF G C T L+H V VGYG SE
Sbjct: 241 DVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYG-SEN 299
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
G+ YWL++NSWG +WGEDGYF+++R++ G+CGIA+ AS+PV
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 186/314 (59%), Gaps = 17/314 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+A +E W +G+ Y E +RFEIFKDNL ++ N + R+Y + L +FADLT
Sbjct: 58 VAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRES---RTYKVGLTRFADLT 114
Query: 90 PQEFIASQTGFKMSDHSS-SLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
+E+ A G + S S +G +P V+W +KGAV VK QGQC
Sbjct: 115 NEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGSCW 174
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
+VAAVEGIN I L+ LSEQ+LVDC N GC GG MD AF++II N GI +
Sbjct: 175 AFSSVAAVEGINQIVTGELIPLSEQELVDC-DKSFNMGCNGGLMDYAFQFIIGNGGIDTE 233
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
Y Y+G CD + I YEDVP NDE SL KAVANQPVSVAI+A A Q
Sbjct: 234 EDYPYKGRDAA-CDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQ 292
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQP 318
Y GVF G C T L+HGV AVGYGT + G YW+++NSWG+DWGE GY RL+R++ +
Sbjct: 293 LYQSGVFTGRCGTDLDHGVVAVGYGT-DNGTDYWIVRNSWGKDWGESGYIRLERNVANIT 351
Query: 319 QGQCGIAMFASFPV 332
G+CGIA+ S+P
Sbjct: 352 TGKCGIAVQPSYPT 365
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 30/340 (8%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ +L+ A Q T RT + S+ E+ EQ +YG+ YK+ + FK+N+
Sbjct: 9 HIAFAMLLCMAFLAFQVTCRTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVN 63
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E NNAA N+ Y +N+FA P+ + H S T F +++ +
Sbjct: 64 YIEACNNAA--NKPYKRGINQFA---PRN--------RFKGHMCSSIIRITTFKFENVTA 110
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
P +V+ +KGAVTP+K QGQC AVAA EGI+A+ +L+SLSEQ+LVDC T
Sbjct: 111 TPSTVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKG 170
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDA-VYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ GC GG MDDAFK+IIQN G+ + + + Y G+ + A++ A IT YEDVP
Sbjct: 171 VDXGCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPA 230
Query: 235 NDEESLL-KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N+E++ L KAVAN PVS AIDAS QFY GVF G C T L+HGVTAVGYG S++G +
Sbjct: 231 NNEKAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTE 290
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG +WGE+GY R+QR +D + CGIA+ AS+P
Sbjct: 291 YWLVKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYP 330
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 205/324 (63%), Gaps = 27/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
+ E ++ + +QW A++GRTYK+ AE ++RF++FK N V+R N A G +SY L +N+
Sbjct: 40 YGEEAMKVRHQQWMAEHGRTYKDEAEKARRFQVFKANADFVDRSN--AAGGKSYELAINE 97
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP----PSVNWIEKGAVTPVKY 140
FAD+T EF+A TG K A F Y++ + +V+W +KGAVT +K
Sbjct: 98 FADMTNDEFVAMYTGLKPVPAGPKKMAG---FKYENLTLSDVDQQAVDWRQKGAVTGIKN 154
Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC AVAAVE I+ I LVSLSEQQ++DC T D NNGC GG++D+AF+YII
Sbjct: 155 QGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQVLDCDT-DGNNGCNGGYIDNAFQYII 213
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
N G+ + Y Y + G C S + A I++Y+DVP DE +L AVANQPV+VAI
Sbjct: 214 SNGGLATEDAYPY-AAAQGTCQS--SVQPAVTISSYQDVPSGDEAALAAAVANQPVAVAI 270
Query: 254 DA-SALQFYSGGVFNG-YCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
DA + QFYS GV C T LNH VTAVGY T+E+G YWL+KN WGQ+WGE GY R
Sbjct: 271 DAHNNFQFYSSGVLTADTCGTPSLNHAVTAVGYSTAEDGTPYWLLKNQWGQNWGEGGYLR 330
Query: 311 LQRDIDQPQGQCGIAMFASFPVSK 334
++R + CG+A AS+PV++
Sbjct: 331 VERGTN----ACGVAQQASYPVAR 350
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/339 (42%), Positives = 205/339 (60%), Gaps = 19/339 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
LI + + +Q R+ + S++E+ E W +++GR YK+ E +RF IFK+N+
Sbjct: 10 ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
+E N A GN SY L +N+FAD+T QEF+A TG + + + S + T F
Sbjct: 70 IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +++W E GAVT VK+QG+C AV ++E I L+ SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSEQELLDCTT 187
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N N GC GGFM +AF +I +N GI+ ++ Y Y G C S + + A QI++Y+ V
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
P E SLL+AV QPVS+ I AS LQFY+GG ++G C +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG WGE+G+ ++ RD P G C IA +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 194/328 (59%), Gaps = 21/328 (6%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKE----SAENSKRFEIFKDNLVAVERFNNAAIGN 75
+T + + + +E W ++G+ AE +RFEIFKDNL ++ N N
Sbjct: 36 STVSSRSDAEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTK---N 92
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
SY L L +FADLT E+ + G K LK + +P SV+W ++GAV
Sbjct: 93 LSYKLGLTRFADLTNDEYRSMYLGAKPVKRV--LKTSDRYEARVGDALPDSVDWRKEGAV 150
Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
VK QG C + AVEGIN I L+SLSEQ+LVDC T+ N GC GG MD A
Sbjct: 151 ADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYA 209
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
F++II+N GI +A Y Y+ + G CD + I +YEDVP N E SL KA+A+QP
Sbjct: 210 FEFIIKNGGIDTEADYPYKA-ADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQP 268
Query: 249 VSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
+SVAI+A A Q YS GVF+G C T L+HGV AVGYGT E G YW+++NSWG WGE
Sbjct: 269 ISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGES 327
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSK 334
GY ++ R+I +P G+CGIAM AS+P+ K
Sbjct: 328 GYIKMARNIAEPTGKCGIAMEASYPIKK 355
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 142/311 (45%), Positives = 187/311 (60%), Gaps = 14/311 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +WKA++G++Y E +R+ F+DNL ++ N AA G S+ L LN+FADLT +E
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
+ + G + K + + +P SV+W KGAV +K QG C A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 158
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVE IN I L+SLSEQ+LVDC T+ N GC GG MD AF +II N GI + Y
Sbjct: 159 IAAVEDINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
Y+G CD + I +YEDV PN E SL KAV NQPVSVAI+A A Q YS
Sbjct: 218 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSS 276
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY R++R+I G+CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335
Query: 324 IAMFASFPVSK 334
IA+ S+P+ K
Sbjct: 336 IAVEPSYPLKK 346
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 189/312 (60%), Gaps = 19/312 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+EQW ++G+ Y E KRF+IFKDNL ++ N NR+Y L LN+FADLT +E+
Sbjct: 4 YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHN---ADNRTYKLGLNRFADLTNEEY 60
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
A G ++ + +K Y +P SV+W + AV PVK QG C
Sbjct: 61 RARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAF 120
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
+ AVEGIN I L+SLSEQ+LVDC T+ N GC GG MD A+++II N GI ++
Sbjct: 121 STIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAYEFIINNGGIDSEED 179
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
Y Y + G CD + I +YEDVP NDE +L KAVANQPVSVAI+ Q Y
Sbjct: 180 YPYRAVD-GTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLY 238
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-G 320
GVF G C T L+HGV AVGYG S +G YW+++NSWG WGE+GY RL+R++ + + G
Sbjct: 239 VSGVFTGRCGTALDHGVVAVGYG-SVKGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSG 297
Query: 321 QCGIAMFASFPV 332
+CGIA+ S+P+
Sbjct: 298 KCGIAIEPSYPI 309
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 146/360 (40%), Positives = 196/360 (54%), Gaps = 61/360 (16%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E+FEQW ++GR Y ++ E +R E+++ N+ VE FN ++ N Y L NKFADLT
Sbjct: 28 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFN--SMSNGGYRLADNKFADLT 85
Query: 90 PQEFIASQTGF-KMSDHSSSLKANGTPFLYK----------SSQVPPSVNWIEKGAVTPV 138
+EF A GF + H + TP S ++P SV+W EKGAV PV
Sbjct: 86 NEEFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVAPV 145
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QG+C AVAA+EGIN IK +LVSLSEQ+LVDC T GC GG+M AF++
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEF 203
Query: 192 IIQNKGITNDAVYSYEGM---------------------------STGICDSIKAEDHAA 224
++ N G+T + Y Y+G G C + K ++ A
Sbjct: 204 VMNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAV 263
Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVG 282
I+ Y +V + E LL+A A QPVSVA+DA + Q Y GGVF G C LNHGVT VG
Sbjct: 264 SISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVG 323
Query: 283 YGTSEE----------GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YG ++ G KYW++KNSWG +WG+ GY +QR+ G CGIA+ S+PV
Sbjct: 324 YGETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYPV 383
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 192/327 (58%), Gaps = 23/327 (7%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
+ T + + ++ +E+W ++G+ E +RFEIFKDNL ++ N N S
Sbjct: 26 NHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK---NLS 82
Query: 78 YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGA 134
Y L L KFADLT E+ + G ++ KA + Y+ +P SV+W ++GA
Sbjct: 83 YRLGLTKFADLTNDEYRSMYLGSRLK-----RKATKSSLRYEVRVGDAIPESVDWRKEGA 137
Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
V VK QG C + AVEGIN I L++LSEQ+LVDC T+ N GC GG MD
Sbjct: 138 VAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDY 196
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF++II N GI + Y Y+G+ G CD + I YEDVP N EESL KA+++Q
Sbjct: 197 AFEFIINNGGIDTEEDYPYKGVD-GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQ 255
Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
P+SVAI+ A Q Y G+F+G C T L+HGV AVGYGT E G YW++KNSWG WGE
Sbjct: 256 PISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGE 314
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
GY R++R+I G+CGIA+ S+P+
Sbjct: 315 SGYIRMERNIASSAGKCGIAVEPSYPI 341
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 190/310 (61%), Gaps = 18/310 (5%)
Query: 34 FEQWKAQYGRTYKESA--ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
+E W ++G+ +++ E +RFEIFKDNL ++ N N SY L L +FADLT
Sbjct: 43 YEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKK---NLSYRLGLTRFADLTND 99
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
E+ + G KM + + + ++P S++W +KGAV VK QG C
Sbjct: 100 EYRSKYLGAKM-EKKGERRTSQRYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFS 158
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
+ AVEGIN I L++LSEQ+LVDC T+ N GC GG MD AF++II+N GI D Y
Sbjct: 159 TIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGIDTDKDY 217
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
Y+G+ G CD I+ I +YEDVP EESL KAVA+QPVSVAI+A A Q Y
Sbjct: 218 PYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYD 276
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F+G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY ++ R+I G+C
Sbjct: 277 SGIFDGTCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKC 335
Query: 323 GIAMFASFPV 332
GIA+ S+P+
Sbjct: 336 GIAIEPSYPI 345
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 154/327 (47%), Positives = 208/327 (63%), Gaps = 28/327 (8%)
Query: 25 FDEGSIAEKFEQWK--AQYGRTY---KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
FDE +A + W+ ++G+ + + E KRF +FK+N+ V N ++ Y
Sbjct: 26 FDEKELATEESLWQLYERWGKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM---DKPYK 82
Query: 80 LRLNKFADLTPQEFI----ASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGA 134
L+LNKFAD++ EF+ S H A G F+Y + + +P SV+W E+GA
Sbjct: 83 LKLNKFADMSNYEFVNFYARSNISHYRKLHERRRGAGG--FMYEQDTDLPSSVDWRERGA 140
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
V VK QG+C +VAAVEGIN IK N+L+SLSEQ+L+DC N N GC GGFM+
Sbjct: 141 VNAVKEQGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDC--NYRNKGCNGGFMEI 198
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF +I +N GI + Y Y G S G+C S + +I YE VP N E++L++AVANQ
Sbjct: 199 AFDFIKRNGGIATENSYPYHG-SRGLCRSSRISSPIVKIDGYESVPEN-EDALMQAVANQ 256
Query: 248 PVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
PVSVAIDA+ QFYS GVF+GYC T LNHGV A+GYGT+E+G YWL++NSWG WGE
Sbjct: 257 PVSVAIDAAGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGE 316
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
DGY R++R ++Q +G CGIAM AS+P+
Sbjct: 317 DGYVRMKRGVEQAEGLCGIAMEASYPI 343
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 206/342 (60%), Gaps = 24/342 (7%)
Query: 8 VVLIISGSCASQATYRTFDE-GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
++L+++G ++ A G++ + ++W A++GRTYK++AE ++RF +FK N+ ++
Sbjct: 15 LLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLID 74
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
R N A GN+ Y L N+F DLT EF A TG+ ++ + T + Q P
Sbjct: 75 RSN--AAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATTRLSSEDDQQPAE 132
Query: 127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
V+W ++GAVT VK Q C VAAVEGI+ I LVSLSEQQL+DCA +N G
Sbjct: 133 VDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA---DNGG 189
Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD---SIKAEDHAAQITNYEDVPPND 236
C GG +D+AF+Y+ + G+T +A Y+Y+G + G C S A AA I+ Y+ V PND
Sbjct: 190 CTGGSLDNAFQYMANSGGVTTEAAYAYQG-AQGACQFDASSSASGVAATISGYQRVNPND 248
Query: 237 EESLLKAVANQPVSVAIDASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGI--- 290
E SL AVA+QPVSVAI+ S F Y GVF C T L+H V VGYG +G
Sbjct: 249 EGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGG 308
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+IKNSWG WG+ GY +L++D+ QG CG+AM S+PV
Sbjct: 309 GYWIIKNSWGTTWGDGGYMKLEKDVG-SQGACGVAMAPSYPV 349
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 152/326 (46%), Positives = 206/326 (63%), Gaps = 23/326 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
++ W A++G+ Y E ++RFEIFK+NL ++ N+ N +Y + L KFADLT +E+
Sbjct: 4 YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ---NHTYKVGLTKFADLTNEEY 60
Query: 94 IASQTGFKMSDHSSSLKANGTP---FLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA---- 145
A G + SD L + +P + +K+ ++P SV+W KGAV P+K QG C
Sbjct: 61 RAMFLGTR-SDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWA 119
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L+SLSEQ+LVDC N GC GG MD AF++II N G+ +
Sbjct: 120 FSTVAAVEGINQIVTGELISLSEQELVDC-DRTYNAGCNGGLMDYAFQFIINNGGLDTEK 178
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
Y Y G CD K + A I +EDV P DE++L KAVA+QPVSVAI+AS ALQF
Sbjct: 179 DYPYVGDDD-KCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQF 237
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQ 319
Y GVF G C T L+HGV VGY SE G+ YWL++NSWG +WGE GY ++QR++ D
Sbjct: 238 YQSGVFTGECGTALDHGVVVVGY-ASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYT 296
Query: 320 GQCGIAMFASFPVS--KESAQPSSAD 343
G+CGIAM +S+PV + +A+P+ A+
Sbjct: 297 GRCGIAMESSYPVKNGENTAKPNLAE 322
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 147/350 (42%), Positives = 203/350 (58%), Gaps = 20/350 (5%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + + RT D+ + +E W + G++Y E RFEIFK
Sbjct: 12 MSLLFFSTLLILSSALDIKNSVQRTNDQ--VMAMYESWLVEQGKSYNSLDEKEMRFEIFK 69
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+NL ++ N A NRSY+L LN+FADLT +E+ ++ GFK S + + P +
Sbjct: 70 ENLRIIDDHN--ADANRSYSLGLNRFADLTDEEYRSTYLGFK-SGPKAKVSNRYVPKV-- 124
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+P V+W GAV VK QG C AVAAVEGIN I L+SLSEQ+LVDC
Sbjct: 125 GVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCG 184
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
GC G+M+DAF++II N GI + Y Y G CD + I NYE +
Sbjct: 185 RTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQD-GQCDWYRKNQRYVTIDNYEQL 243
Query: 233 PPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P N+E L AVA QP++V +++ +F Y+ G++ GYC T ++HGVT VGYGT E G+
Sbjct: 244 PANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYGT-ERGL 302
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
YW++KNSWG +WGE+GY R+QR+I G+CGIAM S+PV P+
Sbjct: 303 DYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIAMVPSYPVKYSYQNPN 351
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 198/312 (63%), Gaps = 22/312 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF---NNAAIGNRSYTLRLNKFADLTP 90
++ W A+ GR+Y E+ +RF +F DNL RF +NA + + L +N+FADLT
Sbjct: 53 YDLWLAENGRSYNALGEHERRFRVFWDNL----RFADAHNARADDHGFRLGMNRFADLTN 108
Query: 91 QEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC----- 144
+EF A+ G K+ + S +A G + + ++P SV+W EKGAV PVK QGQC
Sbjct: 109 EEFRATFLGAKVVERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 165
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
AV+ VE IN + +++LSEQ+LV+C+TN N GC GG MDDAF +II+N GI +
Sbjct: 166 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTED 225
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y+ + G CD + I +EDVP NDE+SL KAVA+QPVSVAI+A Q
Sbjct: 226 DYPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQL 284
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
Y GVF+G C T L+HGV AVGYGT + G YW+++NSWG WGE GY R++R+I+ G
Sbjct: 285 YHSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTG 343
Query: 321 QCGIAMFASFPV 332
+CGIAM AS+P
Sbjct: 344 KCGIAMMASYPT 355
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 199/321 (61%), Gaps = 24/321 (7%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
++ + +E+W++ Y + + E RF +FK+N+ + N ++ Y LRLN+F DL
Sbjct: 39 TLWDLYERWRSVY-TSARSFGEKQNRFHVFKENVKYINEVNKM---DKPYKLRLNQFGDL 94
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
TP EF + K+ + + + F+Y++ +VP S++W KGAVTPVK QG+C
Sbjct: 95 TPSEFARTYANSKIIEGTRNESGG---FMYENVEVPRSIDWRVKGAVTPVKNQGRCGGCW 151
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
A AAVEGIN I +L+SLSEQQL+DC T N+GC GG M AF+YI Q GIT++
Sbjct: 152 AFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ--NSGCRGGTMGRAFEYIKQRGGITSE 209
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--- 258
A Y Y+ + G+C + + I Y ++ E+++LK +A+QPVSVA+DA+
Sbjct: 210 ANYPYKAQA-GMCKNNLIQRPTVSIDGYYNIR-RSEDAVLKILAHQPVSVAVDATTWSSL 267
Query: 259 --QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
FY GVF G C T LNHGVTAVGYGT+ +G YW+IKNSWG+ WGE GY R+ R +
Sbjct: 268 DWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRGVS 327
Query: 317 QPQGQCGIAMFASFPVSKESA 337
P G CGIAM ASFP+ + SA
Sbjct: 328 -PYGLCGIAMQASFPIKRVSA 347
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 192/327 (58%), Gaps = 23/327 (7%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
+ T + + ++ +E+W ++G+ E +RFEIFKDNL ++ N N S
Sbjct: 32 NHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK---NLS 88
Query: 78 YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGA 134
Y L L KFADLT E+ + G ++ KA + Y+ +P SV+W ++GA
Sbjct: 89 YRLGLTKFADLTNDEYRSMYLGSRLKR-----KATKSSLRYEVRVGDAIPESVDWRKEGA 143
Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
V VK QG C + AVEGIN I L++LSEQ+LVDC T+ N GC GG MD
Sbjct: 144 VAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDY 202
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF++II N GI + Y Y+G+ G CD + I YEDVP N EESL KA+++Q
Sbjct: 203 AFEFIINNGGIDTEEDYPYKGVD-GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQ 261
Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
P+SVAI+ A Q Y G+F+G C T L+HGV AVGYGT E G YW++KNSWG WGE
Sbjct: 262 PISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGE 320
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
GY R++R+I G+CGIA+ S+P+
Sbjct: 321 SGYIRMERNIASSAGKCGIAVEPSYPI 347
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 206/342 (60%), Gaps = 24/342 (7%)
Query: 8 VVLIISGSCASQATYRTFDE-GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
++L+++G ++ A G++ + ++W A++GRTYK++AE ++RF +FK N+ ++
Sbjct: 5 LLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLID 64
Query: 67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
R N A GN+ Y L N+F DLT EF A TG+ ++ + T + Q P
Sbjct: 65 RSN--AAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATTRLSSEDDQQPAE 122
Query: 127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
V+W ++GAVT VK Q C VAAVEGI+ I LVSLSEQQL+DCA +N G
Sbjct: 123 VDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA---DNGG 179
Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD---SIKAEDHAAQITNYEDVPPND 236
C GG +D+AF+Y+ + G+T +A Y+Y+G + G C S A AA I+ Y+ V PND
Sbjct: 180 CTGGSLDNAFQYMANSGGVTTEAAYAYQG-AQGACQFDASSSASGVAATISGYQRVNPND 238
Query: 237 EESLLKAVANQPVSVAIDASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGI--- 290
E SL AVA+QPVSVAI+ S F Y GVF C T L+H V VGYG +G
Sbjct: 239 EGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGG 298
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+IKNSWG WG+ GY +L++D+ QG CG+AM S+PV
Sbjct: 299 GYWIIKNSWGTTWGDGGYMKLEKDVG-SQGACGVAMAPSYPV 339
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 149/305 (48%), Positives = 185/305 (60%), Gaps = 20/305 (6%)
Query: 41 YGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF 100
Y + Y E +RFE+FKDNL ++ N SY L LN+FADLT EF A+ G
Sbjct: 36 YRKAYASFEEKVRRFEVFKDNLNHIDDINKKVT---SYWLGLNEFADLTHDEFKATYLGL 92
Query: 101 KMS-DHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAV 149
S+S + F Y + +VP ++W +K AVT VK QGQC VAAV
Sbjct: 93 TPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAV 152
Query: 150 EGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGM 209
EGINAI L SLSEQ+L+DC+T D NNGC GG MD AF YI G+ + Y Y M
Sbjct: 153 EGINAIVTGNLTSLSEQELIDCST-DGNNGCNGGLMDYAFSYIASTGGLRTEEAYPY-AM 210
Query: 210 STGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFN 267
G CD K I+ YEDVP NDE++L+KA+A+QPVSVAI+AS QFYSGGVF+
Sbjct: 211 EEGDCDEGKGA-AVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFD 269
Query: 268 GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
G C L+HGVTAVGYGTS+ G Y ++KNSWG WGE GY R++R + +G CGI
Sbjct: 270 GPCGEQLDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKM 328
Query: 328 ASFPV 332
AS+P
Sbjct: 329 ASYPT 333
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 189/312 (60%), Gaps = 15/312 (4%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I+E F+ W ++G+TY E +R +IFKDN V + N I N +Y+L LN FADLT
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLT 85
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
EF AS+ G +S S + + G L S +VP SV+W +KGAVT VK QG C
Sbjct: 86 HHEFKASRLGLSVSAPSVIMASKGQS-LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A A+EGIN I L+SLSEQ+L+DC N GC GG MD AF+++I+N GI +
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
Y Y+ G C K + I +Y V NDE++L++AVA QPVSV I S A Q
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YS G+F+G C T L+H V VGYG S+ G+ YW++KNSWG+ WG DG+ +QR+ + G
Sbjct: 263 YSSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321
Query: 321 QCGIAMFASFPV 332
CGI M AS+P+
Sbjct: 322 VCGINMLASYPI 333
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 206/348 (59%), Gaps = 24/348 (6%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + + RT D+ + + +E W + G++Y E RFEIFK
Sbjct: 10 MSLLFFSTLLILSSALDIVNSAQRTNDQ--VRDMYESWLVEQGKSYNSLDEKEMRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN-GTPFLY 118
DNL ++ N A NRS++L LN+FADLT +E+ ++ GFK S KA ++
Sbjct: 68 DNLRIIDDHN--ADANRSFSLGLNRFADLTDEEYRSTYLGFK-----SGPKAKVSNRYVP 120
Query: 119 KSSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
K V P+ V+W GAV VK QG C AVAAVEGIN I L+SLSEQ+LVD
Sbjct: 121 KVGDVLPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVD 180
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C + GC G+M DAF++II N GI + Y Y G C+ I +YE
Sbjct: 181 CGRTQSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQD-GQCNRYLQNQKYVTIDDYE 239
Query: 231 DVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEE 288
+VP N+E +L AVA+QPVSV +++ +F Y+ G+F YC T ++HGVT VGYGT E
Sbjct: 240 NVPSNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYGT-ER 298
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKES 336
G+ YW++KNSWG +WGE+GY R+QR+I G+CGIA AS+PV S
Sbjct: 299 GLDYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIARMASYPVKYNS 345
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 189/312 (60%), Gaps = 15/312 (4%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I+E F+ W ++G+TY E +R +IFKDN V + N I N +Y+L LN FADLT
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLT 85
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
EF AS+ G +S S + + G L S +VP SV+W +KGAVT VK QG C
Sbjct: 86 HHEFKASRLGLSVSAPSVIMASKGQS-LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A A+EGIN I L+SLSEQ+L+DC N GC GG MD AF+++I+N GI +
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
Y Y+ G C K + I +Y V NDE++L++AVA QPVSV I S A Q
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YS G+F+G C T L+H V VGYG S+ G+ YW++KNSWG+ WG DG+ +QR+ + G
Sbjct: 263 YSRGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321
Query: 321 QCGIAMFASFPV 332
CGI M AS+P+
Sbjct: 322 VCGINMLASYPI 333
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/325 (44%), Positives = 196/325 (60%), Gaps = 22/325 (6%)
Query: 22 YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTL 80
Y E + + +W A++ TY E +RFE F++NL +++ N AA G S+ L
Sbjct: 30 YGERSEEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRL 89
Query: 81 RLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKSS---QVPPSVNWIEKGAVT 136
LN+FADLT +E+ ++ G + D L A Y+++ ++P SV+W +KGAV
Sbjct: 90 GLNRFADLTNEEYRSTYLGARTKPDRERKLSAR-----YQAADNDELPESVDWRKKGAVG 144
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
VK QG C A+AAVEGIN I ++ LSEQ+LVDC T+ N GC GG MD AF
Sbjct: 145 AVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNQGCNGGLMDYAF 203
Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
++II N GI ++ Y Y+ CD+ K I YEDVP N E+SL KAVANQP+
Sbjct: 204 EFIINNGGIDSEEDYPYKERDN-RCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPI 262
Query: 250 SVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
SVAI+A A Q Y G+F G C T L+HGV AVGYGT E G YWL++NSWG WGE+G
Sbjct: 263 SVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVWGENG 321
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
Y R++R+I G+CGIA+ S+P
Sbjct: 322 YIRMERNIKASSGKCGIAVEPSYPT 346
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 147/341 (43%), Positives = 200/341 (58%), Gaps = 22/341 (6%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ +I+ S A + R+ +E + +E+W ++ + Y E +RFEIFKDNL +
Sbjct: 9 LLFFSLITLSLAMDTSMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFI 66
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK---ANGTPFLYKSS- 121
+ N N +Y + LNKFAD T +E+ G K + +K G + + S
Sbjct: 67 DEHNAQ---NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAFNSGD 123
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P V+W KGAV +K QG C +A VE IN I +LVSLSEQ+LVDC
Sbjct: 124 RLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDC-DR 182
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
N GC GG MD AF++I++N GI + Y Y+G G CD + I YEDVP
Sbjct: 183 AFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFE-GRCDPTRKNAKVVSIDGYEDVPA 241
Query: 235 NDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
+E +L KAV +QPVSVAI+A ALQ Y GVF G C T L+HGV VGYG E G+ Y
Sbjct: 242 YNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYGF-ENGVDY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
WL++NSWG +WGEDGYF+L+R++ + G+CGIAM AS+PV
Sbjct: 301 WLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/338 (42%), Positives = 194/338 (57%), Gaps = 20/338 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ VLI S C+ ++ +D ++ ++FE+W + + Y E RF I++ N+
Sbjct: 15 LICFVLIASKLCSVNSS--VYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQ 72
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
++ N+ + + L N+FAD+T EF A G S S L P + V
Sbjct: 73 LIDYINSLHL---PFKLTDNRFADMTNSEFKAHFLGLNTS--SLRLHKKQRPVCDPAGNV 127
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +V+W +GAVTP++ QG+C AVAA+EGIN IK LVSLSEQQL+DC
Sbjct: 128 PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTY 187
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG M+ AF++I N G+T + Y Y G+ G CD KA++ I Y+ V N
Sbjct: 188 NKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIE-GTCDQEKAKNKVVTIQGYQKVAQN- 245
Query: 237 EESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E SL A A QPVSV IDA Q YS GVF YC T LNHGVT VGYG E KYW+
Sbjct: 246 EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGV-EGDQKYWI 304
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+KNSWG WGE+GY R++R I + G+CGIAM AS+P+
Sbjct: 305 VKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 204/345 (59%), Gaps = 24/345 (6%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL + L + + S A+ R + ++FE+W A+YGR YK++ E +RF+IFK+N+
Sbjct: 9 FLFLFLCVMWASPSAAS-RDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+E FN+ GN SYTL +N+F D+T EF+A TG S L P +
Sbjct: 68 IETFNSHN-GN-SYTLGINQFTDMTKSEFVAQYTG----GISRPLNIEREPVVSFDDVNI 121
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S VP S++W + GAV VK Q C A+A VEGI IK LVSLSEQ+++DCA
Sbjct: 122 SAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV 181
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ GC GG+++ A+ +II N G+T + Y Y+ G C++ + ++A IT Y V
Sbjct: 182 S---YGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQ-GTCNA-NSFPNSAYITGYSYVR 236
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
NDE S++ AV+NQP++ IDAS Q+Y+GGVF+G C T LNH +T +GYG G KY
Sbjct: 237 RNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKY 296
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
W+++NSWG WGE GY R+ R + G CGIAM FP + A
Sbjct: 297 WIVRNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFPTLQSGA 341
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 146/327 (44%), Positives = 201/327 (61%), Gaps = 24/327 (7%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
R+ DE + +E W Q+ + Y E KRF IFKDNL +++ N+ ++++ + L
Sbjct: 44 RSDDE--VMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSD--DSQTFKVGL 99
Query: 83 NKFADLTPQEFIASQTG------FKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAV 135
NKFADLT +EF + G S+ K +L+K ++P +V+W + GAV
Sbjct: 100 NKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAV 159
Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
VK QGQC +AAVEGIN I L+SLSEQ+LVDC T+ N+GC GG MD A
Sbjct: 160 AKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTS-YNSGCDGGLMDYA 218
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
+++II N GI DA Y Y G CD + I ++EDVP NDE++L KAVA+QP
Sbjct: 219 YEFIINNGGIDTDADYPYTAKD-GKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQP 277
Query: 249 VSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
VSVAI+A S QFY GVF G C L+HGV AVGYG S++G YW+++NSWG DWGE
Sbjct: 278 VSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYG-SDDGKDYWIVRNSWGADWGES 336
Query: 307 GYFRLQRDIDQPQ-GQCGIAMFASFPV 332
GY R++R+++ + G+CGIA+ S+P+
Sbjct: 337 GYIRMERNLETVKTGKCGIAIEPSYPI 363
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/331 (44%), Positives = 198/331 (59%), Gaps = 20/331 (6%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
RT DE + FE W +YG++Y E +RFEIFKDNL V+ N A NRSY + L
Sbjct: 39 RTNDE--VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHN--ADVNRSYKVGL 94
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
N+F+DLT E+ + G K + +++ P + Q+P SV+W +KGAV VK QG
Sbjct: 95 NQFSDLTDAEYSSIYLGTKFNIRMTNVSDRYEPRV--GDQLPDSVDWRKKGAVLGVKNQG 152
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C ++AAVEGIN I L+SLSEQ++VDC NNGC GG + A+++II N
Sbjct: 153 NCGSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINN 212
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI-- 253
GI +A Y Y G G+CD K I YE+VP N+E++L KAVA QPVSV I
Sbjct: 213 GGINTEANYPYTGRD-GVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIAS 271
Query: 254 DASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
+++A + Y G+FNG C ++HGVT VGYGT E G YW+++NSWG +WGE GY R+QR
Sbjct: 272 NSTAFKSYKSGIFNGPCGPRIDHGVTIVGYGT-EGGKDYWIVRNSWGPNWGESGYVRMQR 330
Query: 314 DIDQPQGQCGIAMFASFPV--SKESAQPSSA 342
++ G+C IA +PV +P SA
Sbjct: 331 NVGG-SGKCFIARAPVYPVKYGPNPTKPRSA 360
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 141/309 (45%), Positives = 185/309 (59%), Gaps = 37/309 (11%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+FE W +++G+ YK E RFE+F++NL ++ N SY L LN+FADL+ +E
Sbjct: 48 RFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEV---SSYWLGLNEFADLSHEE 104
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
F + + +P SV+W +KGAVT VK QG C
Sbjct: 105 FKSKDV----------------------ADLPESVDWRKKGAVTHVKNQGACGSCWAFST 142
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
VAAVEGIN I L +LSEQ+L+DC T N+GC GG MD AF +I N G+ + Y
Sbjct: 143 VAAVEGINQIVTGNLTTLSEQELIDCDTT-FNSGCNGGLMDYAFAFIASNGGLHKEDDYP 201
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSG 263
Y M G C+ K + I+ YEDVP DEESLLKA+A+QP+SVAI+AS QFYSG
Sbjct: 202 YL-MEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSG 260
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
GVFNG C T L+HGV AVGYG+S+ G+ Y ++KNSWG WGE GY R++R+ + +G CG
Sbjct: 261 GVFNGPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCG 319
Query: 324 IAMFASFPV 332
I AS+P
Sbjct: 320 INKMASYPT 328
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 141/317 (44%), Positives = 185/317 (58%), Gaps = 17/317 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKR-FEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
+G+ F W + YK++ E +R F ++ DNL V N + ++ L L F
Sbjct: 41 KGNPRAAFSDWVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHNEK---DSTFKLGLTNF 97
Query: 86 ADLTPQEFIASQTGFKMSDHSSSL-KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
ADLT E+ G++ + L T F Y + PPS++W +KGAVT VK Q QC
Sbjct: 98 ADLTHDEYRQHALGYRPELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQC 157
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
+VEG NAI LVSLSEQ+LVDC ++GC+GG MD AF +II+N G
Sbjct: 158 GSCWAFSTTGSVEGANAIYSGELVSLSEQELVDCDVT-QDHGCHGGLMDFAFSFIIRNGG 216
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
I + Y Y+ G+C+ K + H I +YEDVPPNDE +L KA ANQP+SVAI+A
Sbjct: 217 IDTEKDYKYKAQD-GVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQ 275
Query: 257 -ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
Q Y+GGVF+ C T L+HGV VGYG S+ G YW++KNSWG WG+ GY RL R I
Sbjct: 276 REFQLYAGGVFDAPCGTALDHGVLVVGYG-SDNGTDYWIVKNSWGDFWGDSGYIRLARGI 334
Query: 316 DQPQGQCGIAMFASFPV 332
GQCGIAM AS+P+
Sbjct: 335 SNSAGQCGIAMQASYPI 351
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 204/347 (58%), Gaps = 28/347 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
FL + L + + S A+ DE S + ++FE+W +YGR YK++ E +RF+IFK+N+
Sbjct: 9 FLFLFLCVMWASPSAASA---DEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----Y 118
+E FN+ SYTL +N+F D+T EFIA TG S L P +
Sbjct: 66 NHIETFNSR--NENSYTLGINQFTDMTNNEFIAQYTG----GISRPLNIEREPVVSFDDV 119
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
S VP S++W + GAVT VK Q C A+A VE I IK L LSEQQ++DC
Sbjct: 120 DISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC 179
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
A GC GG+ AF++II NKG+ + A+Y Y+ + G C + ++A IT Y
Sbjct: 180 A---KGYGCKGGWEFRAFEFIISNKGVASGAIYPYKA-AKGTCKT-NGVPNSAYITGYAR 234
Query: 232 VPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
VP N+E S++ AV+ QP++VA+DA+A Q+Y GVFNG C T LNH VTA+GYG G
Sbjct: 235 VPRNNESSMMYAVSKQPITVAVDANANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGK 294
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
KYW++KNSWG WGE GY R+ RD+ G CGIA+ + +P + A
Sbjct: 295 KYWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYPTLESRA 341
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 145/329 (44%), Positives = 196/329 (59%), Gaps = 21/329 (6%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
A ++++RT DE + +E W ++G+ Y E KRF IFKDNL ++ N+ N
Sbjct: 34 ADKSSWRTDDE--VMAMYEAWLVKHGKAYNALGEKEKRFGIFKDNLRFIDEHNSQ---NL 88
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKG 133
+Y L LN+FADLT +E+ + G K + K + + + +P ++W ++G
Sbjct: 89 TYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRKSDRFAARVGDALPDFIDWRKEG 148
Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AV VK QG C +AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD
Sbjct: 149 AVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMD 207
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
AF++II N GI ++ Y Y CD + + I YEDVP NDE +L KAVA
Sbjct: 208 YAFEFIINNGGIDSEEDYPYRAADQK-CDQYRKNANVVSIDGYEDVPENDEAALKKAVAK 266
Query: 247 QPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
QPVSVAI+A A Q Y GVF G C T L+HGV AVGYGT E G YW++ NSWG++WG
Sbjct: 267 QPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYGT-ENGQDYWIVGNSWGKNWG 325
Query: 305 EDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
EDGY R++R++ G+CGIA+ S+P+
Sbjct: 326 EDGYIRMERNLAGSSSGKCGIAIGPSYPI 354
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 189/312 (60%), Gaps = 16/312 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +W+ + K N R E+FK+NL V+ N AA G ++ L +N+FADLT +E
Sbjct: 53 YLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEE 112
Query: 93 FIAS-QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
+ F S+S K + L + +P S++W E GAV PVK QG C
Sbjct: 113 YRTRFLRDFSRLRRSASGKISSRYRLREGDDLPDSIDWRENGAVVPVKNQGGCGSCWAFS 172
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
VAAVEGIN I L+SLSEQQLVDC T N+GC GG+M+ AF++I+ N GI ++ Y
Sbjct: 173 TVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGGINSEETY 230
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y G + GIC+S I +YE+VP ++E+SL KAVANQPVSV +DA+ Q Y
Sbjct: 231 PYRGQN-GICNST-VNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYR 288
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F G C NH +T VGYGT E +W++KNSWG++WGE GY R +R+I+ P G+C
Sbjct: 289 SGIFTGSCNISANHALTVVGYGT-ENDKDFWIVKNSWGKNWGESGYIRAERNIENPNGKC 347
Query: 323 GIAMFASFPVSK 334
GI FAS+PV K
Sbjct: 348 GITRFASYPVKK 359
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 190/312 (60%), Gaps = 16/312 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +W+A+ K N R E+FK+NL V++ N AA G ++ L +N+FADLT +E
Sbjct: 51 YLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNAAADRGEHTFRLGMNRFADLTNEE 110
Query: 93 FIAS-QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
+ F S+S K + L + +P S++W EKGAV PVK QG C
Sbjct: 111 YRTRFLRDFSRLRRSASGKISSRYRLREGDDLPDSIDWREKGAVVPVKNQGGCGSCWAFS 170
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
VAAVEGIN I L+SLSEQQLVDC T N+GC GG+M+ AF++I+ N GI ++ Y
Sbjct: 171 TVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGGINSEETY 228
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y G + GIC+S I +YE+VP ++E+SL KAVANQPVSV +DA+ Q Y
Sbjct: 229 PYRGQN-GICNST-VNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYR 286
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F G C NH +T VGYGT E Y +KNSWG++WGE GY R++R+I P G+C
Sbjct: 287 SGIFTGSCNISANHALTVVGYGT-ENDKDYRTVKNSWGKNWGESGYIRVERNIGNPNGKC 345
Query: 323 GIAMFASFPVSK 334
GI FAS+PV K
Sbjct: 346 GITRFASYPVKK 357
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 188/323 (58%), Gaps = 26/323 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +WKA++G+ Y E +R+ F+DNL ++ N AA G S+ L LN+FADLT +E
Sbjct: 40 YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
+ + G + K + + +P SV+W KGAV +K QG C A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 158
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF +II N GI + Y
Sbjct: 159 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217
Query: 206 YEGMSTGICDS------------IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
Y+G CD + I +YEDV PN E SL KAVANQPVSVAI
Sbjct: 218 YKGKDE-RCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAI 276
Query: 254 DAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+A A Q YS G+F G C T L+HGV AVGYGT E G YW+++NSWG+ WGE GY R+
Sbjct: 277 EAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRM 335
Query: 312 QRDIDQPQGQCGIAMFASFPVSK 334
+R+I G+CGIA+ S+P+ K
Sbjct: 336 ERNIKASSGKCGIAVEPSYPLKK 358
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 192/317 (60%), Gaps = 22/317 (6%)
Query: 34 FEQWKAQYGRTYK--ESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
+E+W+ ++G+ + +E KRFEIFKDNL ++ N NR+Y + LN+FADL+ +
Sbjct: 53 YEEWRVKHGKLNNNIDGSEKDKRFEIFKDNLKFIDEHNAE---NRTYKVGLNRFADLSNE 109
Query: 92 EFIASQTGFKMSDHSSSLKANGTPF-LYKSS---QVPPSVNWIEKGAVTPVKYQGQCA-- 145
E+ + G K+ + T Y S ++P SV+W +GAV VK QG C
Sbjct: 110 EYRSRYLGTKIDPIGMMMARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSC 169
Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
+AAVEGIN I LVSLSEQ+LVDC N GC GG M+ AF++II N GI +
Sbjct: 170 WAFSTIAAVEGINKIVTGELVSLSEQELVDC-DRTVNAGCDGGLMEYAFEFIINNGGIDS 228
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
D Y Y G+ G CD K I +YE VP DE +L KAVANQP+SVAI+A
Sbjct: 229 DEDYPYRGVD-GKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREF 287
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
Q Y G+F G C T L+HGVTAVGYGT E G+ YW+++NSWG+ WGE GY R++R++
Sbjct: 288 QLYVSGIFTGKCGTALDHGVTAVGYGT-ENGVDYWIVRNSWGKSWGESGYVRMERNLAAS 346
Query: 319 -QGQCGIAMFASFPVSK 334
G+CGI M +S+P+ K
Sbjct: 347 VAGKCGIVMQSSYPIKK 363
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 161/356 (45%), Positives = 206/356 (57%), Gaps = 34/356 (9%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAE-----KFEQWKAQYGRTYKE-SAENSKRF 55
AK+ + + + G + A + D ++A+ F W Q+ RTY E S E ++R
Sbjct: 3 AKFLALALAGLVGLSCAHALLSSADMLALAQVEPERAFGLWATQHARTYSEGSPEYTRRL 62
Query: 56 EIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN--- 112
+F DN+ A+ N N TL LN++AD T +EF A + G K+S LKA
Sbjct: 63 GVFADNVRAIAEQNRR---NTGITLALNEYADETWEEFAAKRLGLKISQEQ--LKAREAR 117
Query: 113 -----GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRL 160
+ + Y Q P +V+W K AVT VK QGQC AV ++EG NA+ +L
Sbjct: 118 SSSSSSSSWRYAQVQTPAAVDWRAKNAVTQVKNQGQCGSCWAFSAVGSIEGANALATGQL 177
Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMSTGI-CDSIK 218
V+LSEQQLVDC T +N GC GG MDDAFKY++ N GI + YSY G G C+ K
Sbjct: 178 VALSEQQLVDCDTA-SNMGCSGGLMDDAFKYVLDNGGIDTEEDYSYWSGYGFGFWCNKRK 236
Query: 219 AEDH-AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNH 276
D A I YEDVP E +LLKAVA QPV+VAI ASA +QFYS GV N CE LNH
Sbjct: 237 QTDRPAVSIDGYEDVP-TSEPALLKAVAGQPVAVAICASANMQFYSSGVINSCCEG-LNH 294
Query: 277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GV AVGY TS++ YW++KNSWG WGE GYFRL+ + P+G CGIA AS+ V
Sbjct: 295 GVLAVGYDTSDKAQPYWIVKNSWGGSWGEQGYFRLKMG-EGPKGLCGIASAASYAV 349
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 138/320 (43%), Positives = 194/320 (60%), Gaps = 24/320 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++FE+W A+YGR YK++ E +RF+IFK+N+ +E FN+ GN SYTL +N+F D+T
Sbjct: 6 MMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRN-GN-SYTLGINQFTDMT 63
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFL----YKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
EF+A TG + L P + S VP S++W + GAV VK Q C
Sbjct: 64 KSEFVAQYTGVSLP-----LNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCG 118
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
A+A VEGI IK LVSLSEQ+++DCA + GC GG+++ A+ +II N G+
Sbjct: 119 SCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS---YGCKGGWVNKAYDFIISNNGV 175
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
T + Y Y+ G C++ + ++A IT Y V NDE S++ AV+NQP++ IDAS
Sbjct: 176 TTEENYPYQAYQ-GTCNA-NSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASEN 233
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Q+Y+GGVF+G C T LNH +T +GYG G KYW+++NSWG WGE GY R+ R +
Sbjct: 234 FQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSS 293
Query: 318 PQGQCGIAMFASFPVSKESA 337
G CGIAM FP + A
Sbjct: 294 SSGACGIAMSPLFPTLQSGA 313
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 199/343 (58%), Gaps = 22/343 (6%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + + RT D+ + +E W + G++Y E RFEIFK
Sbjct: 10 MSLLFFSTLLILSLALDIENSVQRTNDQ--VMAMYESWLVEQGKSYNSLDEKEMRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+NL ++ N A NRSY+L LN+FADLT +E+ ++ G KM + ++ K
Sbjct: 68 ENLRIIDDHN--ADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDV----SNEYMPK 121
Query: 120 SSQ-VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
+ +P V+W GAV VK QG C AV AVEGIN I L+SLSEQ+LVDC
Sbjct: 122 VGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDC 181
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
GC G M DAF++II N GI + Y Y G C+ I NY++
Sbjct: 182 GRTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTA-KDGQCNLSLKNQKYVTIDNYKN 240
Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L KAVA QPVSV +++ +F Y+ G+F G+C T ++HGVT VGYGT E G
Sbjct: 241 VPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYGT-ERG 299
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ YW++KNSWG +WGE+GY R+QR+I G+CGIA S+PV
Sbjct: 300 MDYWIVKNSWGTNWGENGYIRIQRNIGG-AGKCGIARMPSYPV 341
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 153/327 (46%), Positives = 207/327 (63%), Gaps = 28/327 (8%)
Query: 25 FDEGSIAEKFEQWK--AQYGRTY---KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
FDE +A + W+ ++G+ + + E KRF +FK+N+ V N ++ Y
Sbjct: 26 FDEKELATEESLWQLYERWGKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM---DKPYK 82
Query: 80 LRLNKFADLTPQEFI----ASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGA 134
L+LNKFAD++ EF+ S H A G F+Y + + +P SV+ E+GA
Sbjct: 83 LKLNKFADMSNYEFVNFYARSNISHYRKLHERRRGAGG--FMYEQDTDLPSSVDGRERGA 140
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
V VK QG+C +VAAVEGIN IK N+L+SLSEQ+L+DC N N GC GGFM+
Sbjct: 141 VNAVKEQGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDC--NYRNKGCNGGFMEI 198
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF +I +N GI + Y Y G S G+C S + +I YE VP N E++L++AVANQ
Sbjct: 199 AFDFIKRNGGIATENSYPYHG-SRGLCRSSRISSPIVKIDGYESVPEN-EDALMQAVANQ 256
Query: 248 PVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
PVSVAIDA+ QFYS GVF+GYC T LNHGV A+GYGT+E+G YWL++NSWG WGE
Sbjct: 257 PVSVAIDAAGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGE 316
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
DGY R++R ++Q +G CGIAM AS+P+
Sbjct: 317 DGYVRMKRGVEQAEGLCGIAMEASYPI 343
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 134/227 (59%), Positives = 155/227 (68%), Gaps = 11/227 (4%)
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP SV+W +KGAVT VK QGQC + AVEGIN IK N+LVSLSEQ+LVDC T D
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDT-D 60
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF++I Q GIT +A Y YE G CD K A I +E+VP N
Sbjct: 61 QNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYD-GTCDVSKENAPAVSIDGHENVPEN 119
Query: 236 DEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE +LLKAVANQPVSVAIDA S QFYS GVF G C T L+HGV VGYGT+ +G KYW
Sbjct: 120 DENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYW 179
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
+KNSWG +WGE GY R++R I +G CGIAM AS+P+ K S PS
Sbjct: 180 TVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPIKKSSNNPS 226
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 185/310 (59%), Gaps = 17/310 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F W ++ + Y E KR+EIFK NL + N N SY L LN FAD+ +EF
Sbjct: 55 FTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRR---NGSYWLGLNHFADIAHEEF 111
Query: 94 IASQTGFKMSDHSSSLKANG-TPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA------ 145
AS G K + +G T F Y ++ +P +V+W +KGAVTPVK QG+C
Sbjct: 112 KASYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFS 171
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
VAAVEGIN I +LVSLSEQ+L+DC N N+GC GG MD AF YI+ N+GI + Y
Sbjct: 172 TVAAVEGINQIVTGKLVSLSEQELMDC-DNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDY 230
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y M G C + IT YEDVP N E SLLKA+A+QPVSV I A + QFY
Sbjct: 231 PYL-MEEGYCREKQPHSKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYK 289
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GG+F+G C +H +TAVGYG S G Y ++KNSWG++WGE GYFR++R +P+G C
Sbjct: 290 GGIFDGECGIQPDHALTAVGYG-SYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVC 348
Query: 323 GIAMFASFPV 332
I AS+P
Sbjct: 349 DIYKIASYPT 358
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 188/313 (60%), Gaps = 18/313 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ + F W ++ + Y E KR+E+FK NL + N N SY L LN+FAD+
Sbjct: 44 LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR---NGSYWLGLNQFADVA 100
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF ++ G K + T F Y++S +P SV+W +KGAVTPVK QG+C
Sbjct: 101 HEEFKSTYLGLKTGMDGPARAP--TAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCW 158
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I +L SLSEQ+L+DC T ++GC GGFMD AF YI+ N GI D
Sbjct: 159 AFSTVAAVEGINQIATGKLESLSEQELMDCDTT-FDHGCGGGFMDFAFAYIMGNLGIHTD 217
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G C + + I+ YEDVP N E SLLKA+A+QP+SV I A + Q
Sbjct: 218 DDYPYL-MEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQ 276
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GVF G C T L+H +TAVGYG+S+ G Y ++KNSWG+ WGE GYFR++R +P+
Sbjct: 277 FYKRGVFEGSCGTELDHALTAVGYGSSD-GQDYIIMKNSWGKSWGEQGYFRIKRGTGKPE 335
Query: 320 GQCGIAMFASFPV 332
G C I AS+P
Sbjct: 336 GVCSIYSMASYPT 348
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 199/345 (57%), Gaps = 25/345 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL + L + S A+ R + ++FE+W A+YGR YK+ E +RF+IFK+N+
Sbjct: 9 FLFLFLCAMWASPSAAS-RDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+E FN+ SYTL +N+F D+T EF+A TG + L P +
Sbjct: 68 IETFNSR--NENSYTLGINQFTDMTKSEFVAQYTGVSLP-----LNIEREPVVSFDDVNI 120
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S VP S++W + GAV VK Q C A+A VEGI IK LVSLSEQ+++DCA
Sbjct: 121 SAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV 180
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ GC GG+++ A+ +II N G+T + Y Y G C++ + ++A IT Y V
Sbjct: 181 S---YGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQ-GTCNA-NSFPNSAYITGYSYVR 235
Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
NDE S++ AV+NQP++ IDAS Q+Y+GGVF+G C T LNH +T +GYG G KY
Sbjct: 236 RNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKY 295
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
W+++NSWG WGE GY R+ R + G CGIAM FP + A
Sbjct: 296 WIVRNSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFPTLQSGA 340
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 185/310 (59%), Gaps = 17/310 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F W ++ + Y E KR+EIFK NL + N N SY L LN FAD+ +EF
Sbjct: 46 FTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRR---NGSYWLGLNHFADIAHEEF 102
Query: 94 IASQTGFKMSDHSSSLKANG-TPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA------ 145
AS G K + +G T F Y ++ +P +V+W +KGAVTPVK QG+C
Sbjct: 103 KASYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFS 162
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
VAAVEGIN I +LVSLSEQ+L+DC N N+GC GG MD AF YI+ N+GI + Y
Sbjct: 163 TVAAVEGINQIVTGKLVSLSEQELMDC-DNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDY 221
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y M G C + IT YEDVP N E SLLKA+A+QPVSV I A + QFY
Sbjct: 222 PYL-MEEGYCREKQPHSKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYK 280
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GG+F+G C +H +TAVGYG S G Y ++KNSWG++WGE GYFR++R +P+G C
Sbjct: 281 GGIFDGECGIQPDHALTAVGYG-SYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVC 339
Query: 323 GIAMFASFPV 332
I AS+P
Sbjct: 340 DIYKIASYPT 349
>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 330
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/299 (47%), Positives = 185/299 (61%), Gaps = 25/299 (8%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
ASQ T RT + S+ E+ E+W ++YG+ YK+ E KRF IFK+N+ +E NN AI +
Sbjct: 5 ASQVTCRTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAI--K 62
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT---PFLYKSSQVPPSVNWIEKG 133
L +N+FADL +EFIA + FK L T P+++ + KG
Sbjct: 63 PXKLVINQFADLNNEEFIAPRNIFKGMILCRFLSRKHTFPFPYVFLGHK---------KG 113
Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AVTPVK QG C VA+ EGI A+ +L+SLSEQ+LVDC T + GC G MD
Sbjct: 114 AVTPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLMD 173
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
DAFK+IIQN G+ DA Y Y+G+ G C++ + + AA IT EDVP N+E++L K VAN
Sbjct: 174 DAFKFIIQNHGVX-DANYPYKGVD-GKCNANEEANPAATITGXEDVPANNEKALQKVVAN 231
Query: 247 QPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
QPV VAIDA S QFY GVF G CET LNHGVT +GYG S +G +YWL+KNS +W
Sbjct: 232 QPVFVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHDGTQYWLVKNSXETEW 290
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 150/346 (43%), Positives = 210/346 (60%), Gaps = 23/346 (6%)
Query: 2 AKYFLIVVLIISGSC---ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIF 58
A FL+ VL++ + A+ ++A + E+W A++GR YK+ AE ++R E+F
Sbjct: 3 ASRFLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVF 62
Query: 59 KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
+ N ++ FN A G S+ L N+FADLT QEF A++TG + S A F Y
Sbjct: 63 RANAELIDSFN--AAGTHSHRLATNRFADLTVQEFRAARTGLRPRPAPS---AGAGRFRY 117
Query: 119 KS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
++ + SV+W GAVT VK QG AVAAVEG+N I+ RLVSLSEQ+L
Sbjct: 118 ENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQEL 177
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC + + GC GG MD+AF+++ + G+ +++ Y Y+ G C S A AA I
Sbjct: 178 VDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQ-CRDGPCRSSAAAA-AASIRG 235
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
+EDVP N+E +L AVA+QPVSVAI+ A +FY GV G C T LNH +TAVGYGT+
Sbjct: 236 HEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTA 295
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G +YWL+KNSWG WGE GY R++R + + +G CG+A S+PV
Sbjct: 296 ADGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 340
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/338 (42%), Positives = 193/338 (57%), Gaps = 20/338 (5%)
Query: 5 FLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ VLI S C+ ++ +D ++ ++FE+W + + Y E RF I++ N+
Sbjct: 15 LICFVLIASKLCSVDSS--VYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQ 72
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
++ N+ + + L N+FAD+T EF A G S S L P + V
Sbjct: 73 LIDYINSLHL---PFKLTDNRFADMTNSEFKAHFLGLNTS--SLRLHKKQRPVCDPAGNV 127
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +V+W +GAVTP++ QG+C AVAA+EGIN IK LVSLSEQQL+DC
Sbjct: 128 PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTY 187
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG M+ AF++I N G+ + Y Y G+ G CD K+++ I Y+ V N
Sbjct: 188 NKGCSGGLMETAFEFIKTNGGLATETDYPYTGIE-GTCDQEKSKNKVVTIQGYQKVAQN- 245
Query: 237 EESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
E SL A A QPVSV IDA Q YS GVF YC T LNHGVT VGYG E KYW+
Sbjct: 246 EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGV-EGDQKYWI 304
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+KNSWG WGE+GY R++R + + G+CGIAM AS+P+
Sbjct: 305 VKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 147/347 (42%), Positives = 205/347 (59%), Gaps = 28/347 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
FL + L + + S A+ DE S + ++FE+W +YGR YK++ E +RF+IFK+N+
Sbjct: 9 FLFLFLCVMWASPSAASA---DEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----Y 118
+E FN+ SYTL +N+F D+T EF+A TG S L P +
Sbjct: 66 NHIETFNSR--NKDSYTLGINQFTDMTNNEFVAQYTG----GISRPLNIEREPVVSFDDV 119
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
S VP S++W + GAVT VK Q C A+A VE I IK L LSEQQ++DC
Sbjct: 120 DISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC 179
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
A GC GG+ AF++II NKG+ + A+Y Y+ + G C + ++A IT Y
Sbjct: 180 A---KGYGCKGGWEFRAFEFIISNKGVASVAIYPYKA-AKGTCKT-NGVPNSAYITGYAR 234
Query: 232 VPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
VP N+E S++ AV+ QP++VA+DA+A Q+Y+ GVFNG C T LNH VTA+GYG G
Sbjct: 235 VPRNNESSMMYAVSKQPITVAVDANANSQYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGK 294
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
KYW++KNSWG WGE GY R+ RD+ G CGIA+ + +P + A
Sbjct: 295 KYWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYPTLESRA 341
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 203/341 (59%), Gaps = 29/341 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
FL + L + + S A+ DE S + ++FE+W A+YGR YK++ E RF+IFK+N+
Sbjct: 9 FLFLFLCVMWASPSAAS---CDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----Y 118
+E FNN GN SYTL +N+F D+T EF+A TG + L P +
Sbjct: 66 NHIETFNNRN-GN-SYTLGINQFTDMTNNEFVAQYTGLSLP-----LNIKREPVVSFDDV 118
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
S VP S++W + GAVT VK QG+C ++A VE I IK LVSLSEQQ++DC
Sbjct: 119 DISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDC 178
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
A + GC GG+++ A+ +II NKG+ + A+Y Y+ + G C + ++A IT Y
Sbjct: 179 AVS---YGCKGGWINKAYSFIISNKGVASAAIYPYKA-AKGTCKT-NGVPNSAYITRYTY 233
Query: 232 VPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
V N+E +++ AV+NQP++ A+DAS Q Y GVF G C T LNH + +GYG G
Sbjct: 234 VQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGK 293
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
K+W+++NSWG WGE GY RL RD+ G CGIAM +P
Sbjct: 294 KFWIVRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 199/330 (60%), Gaps = 21/330 (6%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLR 81
R DE + +E WK+++G + +++ R E+F+DNL ++ N A G ++ L
Sbjct: 43 RADDE--VRRMYEAWKSEHG--HGHGSDDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLG 98
Query: 82 LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVK 139
L FADLT +E+ GF+ +S +G+ + + +P +++W E GAVT VK
Sbjct: 99 LTPFADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVK 158
Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
Q QC AVAA+EGIN I LVSLSEQ+++DC T D GC GG M +AF+++
Sbjct: 159 NQEQCGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQDG--GCNGGEMQNAFQFV 216
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
I N GI +A Y Y G CD+ + + I + V +E +L +AVANQPVSVA
Sbjct: 217 INNGGIDTEADYPYLGTDAA-CDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVA 275
Query: 253 IDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
IDAS +F Y+ G+FNG C T L+HGVTAVGYG SE G YW++KNSW WGE GY R
Sbjct: 276 IDASGRKFQHYTSGIFNGPCGTQLDHGVTAVGYG-SENGKDYWIVKNSWSSSWGEAGYIR 334
Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
++R++ G+CGIAM AS+PV K S+ P+
Sbjct: 335 IRRNVAAATGKCGIAMDASYPV-KSSSNPA 363
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 209/356 (58%), Gaps = 35/356 (9%)
Query: 6 LIVVLIISGSCASQA------TY-RTFDEGSIAEK--------FEQWKAQYGRTYKESAE 50
L++VLIIS S A +Y +T + S +++ +E+W ++G++Y E
Sbjct: 12 LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71
Query: 51 NSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
KRFEIFKDNL ++ N N +Y L L +FADLT +E+ + G K+ + K
Sbjct: 72 KDKRFEIFKDNLKFIDEHNGL---NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKK 128
Query: 111 ANGTPFLYKSSQV----PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINR 159
G+ + +V P SV+W ++GAV VK Q C A+AAVEGIN I
Sbjct: 129 LGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGD 188
Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
L+SLSEQ+LVDC T+ N GC GG MD AF++II N GI ++ Y Y+ + G CD +
Sbjct: 189 LISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVD-GRCDQNRK 246
Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
I +YEDVP DE +L KAVANQP++VA++ Q Y GVF G C T L+HG
Sbjct: 247 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 306
Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
V AVGYGT E G YW+++NSWG WGE GY RL+R++ + G+CGIA+ S+P+
Sbjct: 307 VAAVGYGT-ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 144/342 (42%), Positives = 195/342 (57%), Gaps = 18/342 (5%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKE-SAENSKRFEIFK 59
MA F + + + + S +S RT DE + ++QW+A++G+ + AE RF IFK
Sbjct: 10 MALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
DNL ++ N N Y L LN FADLT +E+ + G K + S + +
Sbjct: 68 DNLKFIDEINAQ---NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSNRYLPRL 124
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
+P S++W KGAV PVK QG C VA+VE IN I L++LSEQ+LVDC
Sbjct: 125 GDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDC- 183
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N GC GG MD AF++II+N G+ + Y Y G + C K I +YEDV
Sbjct: 184 DRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSS-CIQYKKNAKVVAIDSYEDV 242
Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P N+E++L KAV+ Q VSVAI+ + Q Y G+F G C T L+HGV VGYG SE G+
Sbjct: 243 PVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SEGGV 301
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+++NSWG WGE GY ++QR+I P G CGIAM S+P
Sbjct: 302 DYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 343
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 189/319 (59%), Gaps = 22/319 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I+E F+ W ++G+TY E +R +IFKDN V + N I N +Y+L LN FADLT
Sbjct: 26 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLT 83
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
EF AS+ G +S S + + G L S +VP SV+W +KGAVT VK QG C
Sbjct: 84 HHEFKASRLGLSVSAPSVIMASKGQS-LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 142
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A A+EGIN I L+SLSEQ+L+DC N GC GG MD AF+++I+N GI +
Sbjct: 143 FSATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 201
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
Y Y+ G C K + I +Y V NDE++L++AVA QPVSV I S A Q
Sbjct: 202 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 260
Query: 261 YSG-------GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
YS G+F+G C T L+H V VGYG S+ G+ YW++KNSWG+ WG DG+ +QR
Sbjct: 261 YSSKFYLLMQGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQR 319
Query: 314 DIDQPQGQCGIAMFASFPV 332
+ + G CGI M AS+P+
Sbjct: 320 NTENSDGVCGINMLASYPI 338
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 149/346 (43%), Positives = 204/346 (58%), Gaps = 26/346 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL + L + + S A+ R + ++FE+W A+YGR YK++ E +RF+IFK+N+
Sbjct: 9 FLFLFLCVMWASPSAAS-RDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+E FNN GN SYTL +NKF D+T EF+A TG S L P +
Sbjct: 68 IETFNNRN-GN-SYTLGINKFTDMTNNEFVAQYTG----GISRPLNIEKEPVVSFDDVNI 121
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S V S++W + GAVT VK Q C A+A VEGI I LVSLSEQ+++DCA
Sbjct: 122 SAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAV 181
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NGC GGF+D+A+ +II N G+ ++A Y Y+ G C + + ++A IT Y V
Sbjct: 182 S---NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQ-GDC-AANSWPNSAYITGYSYVR 236
Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
NDE S+ AV NQP++ AIDAS Q+Y+GGVF+G C T LNH +T +GYG G +
Sbjct: 237 SNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQ 296
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
YW++KNSWG WGE GY R+ R + G CGIAM +P + A
Sbjct: 297 YWIVKNSWGSSWGERGYIRMARGVSS-SGLCGIAMDPLYPTLQSGA 341
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 141/328 (42%), Positives = 190/328 (57%), Gaps = 31/328 (9%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++FEQW ++GR Y +S E +RFE+++ N+ VE FN+ + G Y L NKFADLT
Sbjct: 28 MLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNG---YKLADNKFADLT 84
Query: 90 PQEFIASQTGFK----MSDHSSSLKAN-GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
+EF A GF+ + S++ A+ P +P SV+W +KGAV VK QG C
Sbjct: 85 NEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDC 144
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AVAA+EGIN IK LVSLSEQ+LVDC +D GC GG+M AF++++ N G
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDC--DDEAVGCGGGYMSWAFEFVVGNHG 202
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+T +A Y Y + G C + K A I Y +V P+ E L +A A QPVSVA+D +
Sbjct: 203 LTTEASYPYHA-ANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGS 261
Query: 258 LQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIK----------YWLIKNSWGQDWGE 305
F Y GV+ G C +NHGVT VGYG SE YW++KNSWG +WG+
Sbjct: 262 FMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGD 321
Query: 306 DGYFRLQRDI-DQPQGQCGIAMFASFPV 332
GY +QRD+ G CGIA+ S+PV
Sbjct: 322 AGYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 209/356 (58%), Gaps = 35/356 (9%)
Query: 6 LIVVLIISGSCASQA------TY-RTFDEGSIAEK--------FEQWKAQYGRTYKESAE 50
L++VLIIS S A +Y +T + S +++ +E+W ++G++Y E
Sbjct: 12 LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71
Query: 51 NSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
KRFEIFKDNL ++ N N +Y L L +FADLT +E+ + G K+ + K
Sbjct: 72 KDKRFEIFKDNLKFIDEHNGL---NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKK 128
Query: 111 ANGTPFLYKSSQV----PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINR 159
G+ + +V P SV+W ++GAV VK Q C A+AAVEGIN I
Sbjct: 129 LGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGD 188
Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
L+SLSEQ+LVDC T+ N GC GG MD AF++II N GI ++ Y Y+ + G CD +
Sbjct: 189 LISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVD-GRCDQNRK 246
Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
I +YEDVP DE +L KAVANQP++VA++ Q Y GVF G C T L+HG
Sbjct: 247 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 306
Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
V AVGYGT E G YW+++NSWG WGE GY RL+R++ + G+CGIA+ S+P+
Sbjct: 307 VAAVGYGT-ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 142/311 (45%), Positives = 183/311 (58%), Gaps = 47/311 (15%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E W A++G++Y E +RF+IFKDNL ++ N NR+Y
Sbjct: 4 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE---NRTYK-------------- 46
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
I+ + F++ D +P SV+W +KGAV VK QG C +
Sbjct: 47 ISDRYAFRVGD-----------------SLPESVDWRKKGAVVEVKDQGSCGSCWAFSTI 89
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++II N GI ++ Y Y
Sbjct: 90 AAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDSEEDYPY 148
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGG 264
+ S G CD + I YEDVP NDE+SL KAVANQPVSVAI+A Q Y G
Sbjct: 149 KA-SDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSG 207
Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCG 323
+F G C T L+HGVTAVGYGT E G+ YW++KNSWG WGE+GY R++RD+ G+CG
Sbjct: 208 IFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCG 266
Query: 324 IAMFASFPVSK 334
IAM AS+P+ K
Sbjct: 267 IAMEASYPIKK 277
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 150/342 (43%), Positives = 211/342 (61%), Gaps = 33/342 (9%)
Query: 4 YFLIVVLIISGS-CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
Y + +++S + A Q T RT + S+ E Q +Y + K+ + +FK+N+
Sbjct: 8 YHIAFAMLLSMAFLAFQVTCRTLQDASMYESHGQRMTRYSKVDKDPPD-----XVFKENV 62
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E NNAA ++ Y +N+FA P++ + H S T F +++ +
Sbjct: 63 NYIEACNNAA--DKPYKRDINQFA---PKK--------RFKGHMCSSIIRITTFKFENVT 109
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLS-EQQLVDCAT 173
P +V+ +K AVTP+K QGQC AVAA EGI+A+ +L+ LS EQ+LVDC T
Sbjct: 110 ATPSTVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDT 169
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI-TNYEDV 232
+ C GG MDDAFK+IIQN G+ +A Y Y+G+ G C++ +A+ +AA I T YEDV
Sbjct: 170 KGVDQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVD-GKCNAYEADKNAATIITGYEDV 228
Query: 233 PPNDEESLL-KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
P N+E++ L KAVAN PVSVAIDAS QFY GVF G C T L+HGVTAVGYG S++G
Sbjct: 229 PANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDG 288
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNS G +WGE+GY R+QR +D + CGIA+ AS+P
Sbjct: 289 TEYWLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYP 330
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 200/327 (61%), Gaps = 27/327 (8%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN---NAAIGNR--SYTLRLN 83
++A + E W A++GRTY ++ E ++R EIF+ N ++ FN +AA G S+ L N
Sbjct: 38 AMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATN 97
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS----SQVPPSVNWIEKGAVTPVK 139
+FADLT +EF A++TG + + F Y++ + S++W GAVT VK
Sbjct: 98 RFADLTDEEFRAARTGLRRPAAVAGAVG--GGFRYENFSLQADAAGSMDWRAMGAVTGVK 155
Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QG C AVAA+EG+ I+ RLVSLSEQQLVDC ++ GC GG MD+AF+YI
Sbjct: 156 DQGSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYI 215
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
+ G+ +++ Y Y G G C S +A+ AA I +EDVP N+E +L+ AVA+QPVSVA
Sbjct: 216 SRQGGLASESAYPYSGEDGGSCRSGRAQP-AASIRGHEDVPANNEGALMAAVAHQPVSVA 274
Query: 253 IDAS--ALQFY----SGGVFNGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
I+ +FY G NG CE T L+H +TAVGYG + +G YWL+KNSWG WGE
Sbjct: 275 INGGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGE 334
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
GY R++R + +G CG+A AS+PV
Sbjct: 335 SGYVRIRRG-SRGEGVCGLAKLASYPV 360
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 198/343 (57%), Gaps = 21/343 (6%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + A T RT DE + +E W +YG++Y E +RFEIFK
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+ L ++ N A NRSY + LN+FADLT +EF ++ GF + + + P +
Sbjct: 68 ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122
Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
QV PS V+W GAV +K QG+C A+A VEGIN I L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
N GC GG++ D F++II N GI + Y Y G C+ + I YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNLDLQNEKYVTIDTYEN 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L AV QPVSVA+DA+ A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
I YW++KNSW WGE+GY R+ R++ G CGIA S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342
>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 365
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 141/362 (38%), Positives = 203/362 (56%), Gaps = 38/362 (10%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ + +L + + + T +E SI + +QW Q+ R YK+ +E R ++FK NL
Sbjct: 8 FVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLK 67
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP-----FLY 118
+E FNN +GN+SYTL +N+F D +EF+A+ TG +++ S S N T +
Sbjct: 68 FIENFNN--MGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMS 125
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQCAVAAV-------------------------EGIN 153
S +W ++GAVTPVKYQG C EG+
Sbjct: 126 DIDMEDESKDWRDEGAVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLLGVLSDWGDEGLT 185
Query: 154 AIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGI 213
I L++LSEQQL+DC + N GC GG ++AFKYII+N G++ + Y Y+
Sbjct: 186 KISGKNLLTLSEQQLIDCDI-EKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESC 244
Query: 214 CDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGY-C 270
+ + H QI ++ VP ++E +LL+AV QPVSV IDA A F Y GGV+ G C
Sbjct: 245 RANARRAPHT-QIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDC 303
Query: 271 ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASF 330
T +NH VT VGYGT G+ YW++KNSWG+ WGE+GY R++RD++ PQG CGIA A++
Sbjct: 304 GTDVNHAVTIVGYGTMS-GLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAY 362
Query: 331 PV 332
PV
Sbjct: 363 PV 364
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 249 bits (637), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 198/343 (57%), Gaps = 21/343 (6%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + A T RT DE + +E W +YG++Y E +RFEIFK
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+ L ++ N A NRSY + LN+FADLT +EF ++ GF + + + P +
Sbjct: 68 ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122
Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
QV PS V+W GAV +K QG+C A+A VEGIN I L+SLSEQ+L+DC
Sbjct: 123 FGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
N GC GG++ D F++II N GI + Y Y G C+ + I YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNLDLQNEKYVTIDTYEN 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L AV QPVSVA+DA+ A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
I YW++KNSW WGE+GY R+ R++ G CGIA S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 140/328 (42%), Positives = 190/328 (57%), Gaps = 31/328 (9%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++FEQW ++GR Y ++ E +RFE+++ N+ VE FN+ + G Y L NKFADLT
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNG---YKLADNKFADLT 83
Query: 90 PQEFIASQTGFK----MSDHSSSLKAN-GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
+EF A GF+ + S++ A+ P +P SV+W +KGAV VK QG C
Sbjct: 84 NEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDC 143
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AVAA+EGIN IK LVSLSEQ+LVDC +D GC GG+M AF++++ N G
Sbjct: 144 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDC--DDEAVGCGGGYMSWAFEFVVGNHG 201
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+T +A Y Y + G C + K A I Y +V P+ E L +A A QPVSVA+D +
Sbjct: 202 LTTEASYPYHA-ANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGS 260
Query: 258 LQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIK----------YWLIKNSWGQDWGE 305
F Y GV+ G C +NHGVT VGYG SE YW++KNSWG +WG+
Sbjct: 261 FMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGD 320
Query: 306 DGYFRLQRDI-DQPQGQCGIAMFASFPV 332
GY +QRD+ G CGIA+ S+PV
Sbjct: 321 AGYILMQRDVAGLASGLCGIALLPSYPV 348
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/300 (49%), Positives = 184/300 (61%), Gaps = 20/300 (6%)
Query: 49 AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
E +RF +F DNL V+ N A G+ + L +N+FADLT EF A+ G + +
Sbjct: 85 GEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLTNDEFRAAYLGT-----TPA 139
Query: 109 LKANGTPFLYKSSQV---PPSVNWIEKGAV-TPVKYQGQC-------AVAAVEGINAIKI 157
+ +Y+ V P SV+W +KGAV +PVK QGQC AVAAVEGIN I
Sbjct: 140 GRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVT 199
Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
LVSLSEQ+LV+CA N N+GC GG MDDAF +I +N G+ + Y Y M G CD
Sbjct: 200 GELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMD-GKCDLA 258
Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLN 275
K I +EDVP NDE SL KAVA+QPVSVAIDA Q Y GVF G C T L+
Sbjct: 259 KKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLD 318
Query: 276 HGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
HGV AVGYGT + G YW ++NSWG DWGE+GY R++R++ G+CGIAM AS+P+ K
Sbjct: 319 HGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 378
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/346 (42%), Positives = 203/346 (58%), Gaps = 27/346 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL + L + + S A+ R + ++FE+W A+YGR YK++ E +RF+IFK+N+
Sbjct: 9 FLFLFLCVMWASPSAAS-RDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+E FNN GN SYTL +NKF D+T EF+ TG + L P +
Sbjct: 68 IETFNNRN-GN-SYTLGINKFTDMTNNEFVTQYTGVSLP-----LNFKREPVVSFDDVNI 120
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S V S++W + GAVT VK Q C A+A VEGI I LVSLSEQ+++DCA
Sbjct: 121 SAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAV 180
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NGC GGF+D+A+ +II N G+ ++A Y Y+ G C + + ++A IT Y V
Sbjct: 181 S---NGCDGGFVDNAYDFIISNNGVASEADYPYQAYE-GDC-TANSWPNSAYITGYSYVR 235
Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
NDE S+ AV NQP++ AIDAS Q+Y+GGVF+G C T LNH +T +GYG G +
Sbjct: 236 SNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQ 295
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
YW++KNSWG WGE GY R+ R + G CGIAM +P + A
Sbjct: 296 YWIVKNSWGSSWGERGYVRMARGVSS-SGLCGIAMDPLYPTLQSGA 340
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 198/343 (57%), Gaps = 21/343 (6%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + A T RT DE + +E W +YG++Y E +RFEIFK
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+ L ++ N A NRSY + LN+FADLT +EF ++ GF + + + P +
Sbjct: 68 ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122
Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
QV PS V+W GAV +K QG+C A+A VEGIN I L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
N GC GG++ D F++II N GI + Y Y G C+ + I YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNVELQNEKYVTIDTYEN 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L AV QPVSVA+DA+ A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
I YW++KNSW WGE+GY R+ R++ G CGIA S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 143/310 (46%), Positives = 189/310 (60%), Gaps = 17/310 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F+ W ++ + Y E KR+ IFK NL+ + N N SY L LN+FAD+T +EF
Sbjct: 45 FKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRK---NGSYWLGLNQFADITHEEF 101
Query: 94 IASQTGFK--MSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
A+ G K +S + + T ++ +P SV+W KGAVTPVK QG+C
Sbjct: 102 KANHLGLKQGLSRMGAQTRTPTTFRYAAAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFS 161
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
+VAAVEGIN I +LVSLSEQ+L+DC T ++GC GG MD AF YI+ ++GI + Y
Sbjct: 162 SVAAVEGINQIVTGKLVSLSEQELMDCDTM-LDHGCEGGLMDFAFAYIMGSQGIHAEDDY 220
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y M G C + + IT YEDVP N E SLLKA+A+QPVSV I A + QFY
Sbjct: 221 PYL-MEEGYCKEKQPYANVVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYK 279
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GGVF+G C L+H +TAVGYG+S G Y +KNSWG++WGE GY R++ +P+G C
Sbjct: 280 GGVFDGSCSDELDHALTAVGYGSSY-GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVC 338
Query: 323 GIAMFASFPV 332
GI AS+PV
Sbjct: 339 GIYTMASYPV 348
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 140/324 (43%), Positives = 194/324 (59%), Gaps = 36/324 (11%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT---P 90
+E+W + + Y E +R +IFK+NL ++ N ++ N+++ + L +FADLT P
Sbjct: 2 YERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHN--SLPNQTFEVGLTRFADLTNDEP 59
Query: 91 QEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTPVKYQGQC----- 144
++F+ + +LYK + P ++W KGAV PVK QG C
Sbjct: 60 KDFMKADR-----------------YLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWA 102
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
AV AVEGIN IK L+SLS+Q+L+DC N GC GG M+ AF++II N GI +D
Sbjct: 103 FSAVGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQ 162
Query: 203 VYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
Y Y G+C++ K + +I YE V NDE+SL KAVA+QPV VAI+AS A +
Sbjct: 163 DYPYTATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFK 222
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
Y GVF G C +L+HGV VGYGTS G YW+I+NSWG +WGE+GY +LQR+ID
Sbjct: 223 LYKSGVFTGTCGIYLDHGVVVVGYGTS-SGEDYWIIRNSWGLNWGENGYVKLQRNIDDSF 281
Query: 320 GQCGIAMFASFPVSKESAQPSSAD 343
G+CG+AM S+P +S+ PSS D
Sbjct: 282 GKCGVAMMPSYPT--KSSFPSSFD 303
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 199/342 (58%), Gaps = 23/342 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M+ F +LI+S + ++ RT DE + +E W ++G++Y E +RFEIFK+
Sbjct: 10 MSLLFFSTLLILSLALDAK---RTNDE--VKAMYESWLIKHGKSYNSLGERERRFEIFKE 64
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
L ++ N A +RSY + LN+FADLT +EF ++ GF + + + P +
Sbjct: 65 TLRFIDEHN--ADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGSNKTKVSNRYEP---RV 119
Query: 121 SQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
QV P V+W +GAV +K QGQC A+AAVEGIN I L+SLSEQ+LVDC
Sbjct: 120 GQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCG 179
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ GC GG+M D F++II N GI + Y Y G CD + I NYE+V
Sbjct: 180 RTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQE-GQCDLNLQNEKYVTIDNYENV 238
Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P +E +L AVA QPVSVA++++ A Q YS G+F G C T +H VT VGYGT E GI
Sbjct: 239 PYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYGT-EGGI 297
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW++KNSW WGE+GY R+ R++ G CGIA S+PV
Sbjct: 298 DYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 338
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/355 (40%), Positives = 208/355 (58%), Gaps = 28/355 (7%)
Query: 1 MAKYFLIVVLIISGSCA-------SQATYRTFDEGSIAEKFEQWKAQYGRTYKES-AENS 52
M FL++V ++S + S R+ +E + F+ W +++G+TY + E
Sbjct: 9 MTILFLLIVFVLSAPSSAMDLPATSGGHNRSNEE--VEFIFQMWMSKHGKTYTNALGEKE 66
Query: 53 KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
+RF+ FKDNL +++ N N SY L L +FADLT QE+ G +LK +
Sbjct: 67 RRFQNFKDNLRFIDQHNAK---NLSYQLGLTRFADLTVQEYRDLFPG-SPKPKQRNLKTS 122
Query: 113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
Q+P SV+W ++GAV+ +K QG C VAAVEG+N I L+SLSE
Sbjct: 123 RRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSE 182
Query: 166 QQLVDCATNDNNNGCYG-GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
Q+LVDC N NNGCYG G MD AF+++I N G+ ++ Y Y+G + G C+ +
Sbjct: 183 QELVDC--NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQG-TQGSCNRKQVHLLVI 239
Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVG 282
I +YEDVP NDE SL KAVA+QPVSV +D + +F Y ++NG C T L+H + VG
Sbjct: 240 TIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVG 299
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
YG SE G YW+++NSWG WG+ GY ++ R+ + P+G CGIAM AS+P+ ++
Sbjct: 300 YG-SENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIKNSAS 353
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 199/343 (58%), Gaps = 21/343 (6%)
Query: 1 MAKYFLIVVLIISGSCASQ-ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + ++ T RT DE + +E W +YG++Y E +RFEIFK
Sbjct: 10 MSLLFFSTLLILSLAFNTKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+ L ++ N A NRSY + LN+FADLT +EF ++ GF + + + P +
Sbjct: 68 ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122
Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
QV PS V+W GAV +K QG+C A+A VEGIN I L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
N GC GG++ D F++II N GI + Y Y G C+ + I YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNVDLQNEKYVTIDTYEN 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L AV QPVSVA+DA+ A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
I YW++KNSW WGE+GY R+ R++ G CGIA S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/334 (43%), Positives = 193/334 (57%), Gaps = 17/334 (5%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
+I +F++W A +G+ Y E +KR IF DN V N A A G +S+ LRLN AD
Sbjct: 65 TIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLAD 124
Query: 88 LTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
LT +EF G+ S SSS + + Y P +++W+ +GAVTPVK QGQC
Sbjct: 125 LTREEF-KHMLGYDASKKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKNQGQC 183
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
V AVEG+ A+K L+SLSEQ+LV CA NNGC GG MD+ F++I++N+G
Sbjct: 184 GSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVENRG 243
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
+ ++ + Y K AA I ++DVP NDE++L KAV+ QPV+VAI+A
Sbjct: 244 VDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIEADH 303
Query: 257 -ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI---KYWLIKNSWGQDWGEDGYFRLQ 312
Q YSGGVF+G C T L+HGV VGYG E YW +KNSWG WGE+GY R+
Sbjct: 304 REFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYIRIA 363
Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
R P GQCG+AM AS+P SA D+ +
Sbjct: 364 RGGMGPAGQCGVAMQASYPTKSSSAPLEDGDEPT 397
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 140/327 (42%), Positives = 196/327 (59%), Gaps = 21/327 (6%)
Query: 19 QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
++T RT D+ + +E+W ++G+ Y E KRFEIFKDNL ++ N+ N S+
Sbjct: 34 KSTPRTNDQ--VLTMYEEWLVKHGKNYNALGEKEKRFEIFKDNLGFIDEHNSK---NLSF 88
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAV 135
L LN+FADLT +E+ G +++ + + K N Y + ++P SV+W ++GAV
Sbjct: 89 RLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQTNRYATRVGDKLPESVDWRKEGAV 148
Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
VK QG C A+AAVEG+N + L+SLSEQ+LVDC T+ N GC GG MD A
Sbjct: 149 VGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLISLSEQELVDCDTS-YNEGCNGGLMDYA 207
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
F++II +T + Y Y + G CD + I YEDVP DE +L KAVANQ
Sbjct: 208 FEFIINMVALTPEEDYPYRAID-GRCDQNRKNAKVVSIDQYEDVPAYDEGALKKAVANQV 266
Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
++VA++ Q Y GVF G C T L+HGV AVGYGT E G YW+++NSWG WGE
Sbjct: 267 IAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGGSWGEA 325
Query: 307 GYFRLQRDIDQPQ-GQCGIAMFASFPV 332
GY RL+R++ + G+CGIA+ S+P+
Sbjct: 326 GYIRLERNLATSKSGKCGIAIEPSYPI 352
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 187/329 (56%), Gaps = 15/329 (4%)
Query: 20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
AT +E + +E+W ++G+ Y E +RF+IFKDNL +E N+ NRSY
Sbjct: 27 ATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDP--NRSYD 84
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTP- 137
LN+F+DLT EF AS G K+ S S A + YK + P V+W E+GAV P
Sbjct: 85 RGLNQFSDLTVDEFQASYLGGKIEKKSLSDVAE--RYQYKEGDILPDEVDWRERGAVVPR 142
Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QG C A AVEGIN I L+SLSEQ+L+DC +N GC GG AF+
Sbjct: 143 VKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFE 202
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPV 249
+I +N GI D Y Y G T C +I+ + I +E VP NDE SL KAV+ QP+
Sbjct: 203 FIKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQPI 262
Query: 250 SVAIDASALQFYSGGVFNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
SV I A+ + Y GV+ G C +H V VGYGTS + YWLI+NSWG WGE GY
Sbjct: 263 SVMISAANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGGY 322
Query: 309 FRLQRDIDQPQGQCGIAMFASFPVSKESA 337
RLQR+ ++P G+C +A+ +P+ SA
Sbjct: 323 LRLQRNFNEPTGKCAVAVAPVYPIKTNSA 351
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 149/314 (47%), Positives = 184/314 (58%), Gaps = 27/314 (8%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+FE+W Q R YK+ E RF I++ NL +E N+ SY L NKFADLT +E
Sbjct: 4 RFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ---EXSYNLTDNKFADLTNEE 60
Query: 93 FIASQTGF--KMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC----- 144
F++ GF + H T F+Y + +P S +W ++GAV+ +K QG C
Sbjct: 61 FVSPYLGFGTRFLPH--------TGFMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWA 112
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
AVAAVEGIN IK +LVSLSEQ+ DC D N GC GG MD AF +I +N G+T
Sbjct: 113 FSAVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSK 172
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL--LKAVANQPVSVAIDAS--AL 258
Y YEG+ G C+ KA HAA I+ + VP NDE L A ANQ SVAIDA A
Sbjct: 173 DYPYEGVD-GTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAF 231
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
Q Y GVF+G C LNHGVT VGYG KYW++KNSWG DWGE GY R++RD
Sbjct: 232 QLYLKGVFSGICGKQLNHGVTIVGYGKGTSD-KYWIVKNSWGADWGESGYIRMKRDAFDK 290
Query: 319 QGQCGIAMFASFPV 332
G CGIAM AS+P+
Sbjct: 291 AGTCGIAMQASYPL 304
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 152/339 (44%), Positives = 197/339 (58%), Gaps = 42/339 (12%)
Query: 29 SIAEKFEQWKAQYGR-TYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFAD 87
S+AE FE+W +++ + Y E +RFE+FKDNL ++ N SY L LN+FAD
Sbjct: 43 SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKV---SSYWLGLNEFAD 99
Query: 88 LTPQEFIASQTGFKMSDHSSSL----------------KANGTPFLYK-----SSQVPPS 126
LT EF A+ G S + ++ + F ++ ++++P S
Sbjct: 100 LTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARLPKS 159
Query: 127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
V+W KGAVT VK QGQC VAAVEGIN I L +LSEQ+LVDC T D NNG
Sbjct: 160 VDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDT-DGNNG 218
Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
C GG MD AF YI N G+ + Y Y M G C S + I+ YEDVP N+E++
Sbjct: 219 CNGGLMDYAFSYIAHNGGLHTEEAYPYL-MEEGTC-SRGSSAAVVTISGYEDVPRNNEQA 276
Query: 240 LLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG-----IKY 292
LLKA+A+QPVSVAI+AS LQFYSGGVF+G C T L+HGV AVGYGT+ + Y
Sbjct: 277 LLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVADY 336
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
++KNSWG WGE GY R++R + QG CGI S+P
Sbjct: 337 IIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 191/316 (60%), Gaps = 29/316 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQE 92
FE W ++G+ Y+ AE +R IF+DNL RF N N SY L LN+FADL+ E
Sbjct: 56 FESWMVKHGKVYESVAEKERRLTIFEDNL----RFITNRNAENLSYRLGLNRFADLSLHE 111
Query: 93 FIASQTGFKMS---DHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-- 144
+ G +H +N YK+S +P SV+W +GAVT VK QGQC
Sbjct: 112 YAQICHGADPRPPRNHVFMTSSN----RYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRS 167
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
V AVEG+N I LV+LSEQ L++C N NNGC GG ++ A+++I+ N G+
Sbjct: 168 CWAFSTVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLG 225
Query: 200 NDAVYSYEGMSTGIC-DSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
D Y Y+ ++ G+C D +K + I YE++P NDE +L+KAVA+QPV+ +D+S+
Sbjct: 226 TDNDYPYKALN-GVCNDRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSR 284
Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
Q Y+ GVF+G C T LNHGV VGYGT E G YW+++NS G WGE GY ++ R+I
Sbjct: 285 EFQLYASGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVRNSRGNTWGEAGYMKMARNIA 343
Query: 317 QPQGQCGIAMFASFPV 332
P+G CGIAM AS+P+
Sbjct: 344 NPRGLCGIAMRASYPL 359
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 187/312 (59%), Gaps = 17/312 (5%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E F+ W ++G+TY E +R +IFKDN V + N I N +Y+L LN FADLT
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLTHH 87
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
EF AS+ G +S S + + G L +++VP SV+W +KGAVT VK QG C
Sbjct: 88 EFKASRLGLSVSASSLIMASKGQS-LGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A A+EGIN I L+SLSEQ+L+DC N GC GG MD AF+++I+N GI + Y
Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 205
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
Y+ G C K + I +Y V NDE++L +AVA QPVSV I S A Q YS
Sbjct: 206 PYQ-ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYS 264
Query: 263 --GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
G+F+G C T L+H V VGYG S+ G+ YW++KNSWG+ WG DG+ +QR+ +G
Sbjct: 265 RVSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEG 323
Query: 321 QCGIAMFASFPV 332
CGI M AS+P+
Sbjct: 324 ICGINMLASYPI 335
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 199/346 (57%), Gaps = 23/346 (6%)
Query: 5 FLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
F +LI+S + + RT D+ + +E W ++G++Y E RFEIFK+NL
Sbjct: 14 FFSTLLILSSAIDIENSVQRTNDQ--VMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLR 71
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQ 122
++ N A NRSY+L LN+FADLT +E+ ++ G K + ++ K
Sbjct: 72 IIDDHN--ADANRSYSLGLNRFADLTDEEYRSTYLGLKRGPKTDV----SNQYMPKVGDA 125
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P V+W GAV VK QG C AVAAVEGIN I L+SLSEQ+LVDC
Sbjct: 126 LPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQ 185
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
GC G M DAFK+II N GI + Y Y G C+ I +Y++VP N
Sbjct: 186 ITKGCNRGLMTDAFKFIINNGGINTENNYPYTA-KDGQCNLSLKNQKYVTIDSYKNVPSN 244
Query: 236 DEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
+E +L KAVA QPVSV +++ +F Y+ G+F G C T ++HGVT VGYGT E G+ YW
Sbjct: 245 NEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGT-ERGMDYW 303
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
++KNSWG +WGE GY R+QR+I G+CGIA S+PV K ++ P
Sbjct: 304 IVKNSWGTNWGESGYIRIQRNIGG-AGKCGIAKMPSYPV-KYTSNP 347
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 185/308 (60%), Gaps = 18/308 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+++W ++G+ Y + E KRF+IFK+N+ + N A N S++L LNKFADLT EF
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHN--ARRNNSHSLGLNKFADLTNSEF 95
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AV 146
G ++ + + + + SV+W +KG VT +K QG C AV
Sbjct: 96 RGLYVG-RLQRPAPFHEVGDIALV---ADTATSVDWRKKGGVTEIKDQGDCGSCWAFSAV 151
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
AAVEG+ + LVSLSEQ+LVDC T N GC GG MD AF+Y+I+N GIT+ + Y Y
Sbjct: 152 AAVEGLTFLSTGTLVSLSEQELVDCDTT-VNQGCDGGIMDYAFQYMIRNGGITSQSNYPY 210
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGG 264
+ G CD K + HAA I ++ +PP EE LL+AVANQPVSVAI+A Q YS G
Sbjct: 211 RALR-GACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSSG 269
Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
VF G C + L+HGV VGYGT G +YWL+KNSWG WGE GY R++R G CGI
Sbjct: 270 VFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQ-GPGAGVCGI 328
Query: 325 AMFASFPV 332
+ AS+P
Sbjct: 329 NLDASYPT 336
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 155/372 (41%), Positives = 210/372 (56%), Gaps = 43/372 (11%)
Query: 2 AKYFLIVVLIISGSCAS------------QATYRTFD-EGSIAEKFEQWKAQYGRTYKES 48
A L+V ++I+ SCA+ + FD E S+ FE W ++G+ Y
Sbjct: 7 AMLILLVAMVIA-SCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSV 63
Query: 49 AENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS 107
AE +R IF+DNL RF NN N SY L L FADL+ E+ G +
Sbjct: 64 AEKERRLTIFEDNL----RFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRN 119
Query: 108 SLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKI 157
+ + YK+S +P SV+W +GAVT VK QG C V AVEG+N I
Sbjct: 120 HVFMTSSD-RYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVT 178
Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS- 216
LV+LSEQ L++C N NNGC GG ++ A+++I++N G+ D Y Y+ ++ G+CD
Sbjct: 179 GELVTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVN-GVCDGR 235
Query: 217 IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFL 274
+K + I YE++P NDE +L+KAVA+QPV+ ID+S+ Q Y GVF+G C T L
Sbjct: 236 LKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNL 295
Query: 275 NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
NHGV VGYGT E G YWL+KNS G WGE GY ++ R+I P+G CGIAM AS+P+
Sbjct: 296 NHGVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLK- 353
Query: 335 ESAQPSSADKSS 346
S DKSS
Sbjct: 354 ---NSFSTDKSS 362
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 176/308 (57%), Gaps = 16/308 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W ++G++Y E S R ++F+DN V + N+ GN SY+L LN FADLT EF
Sbjct: 29 FETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSK--GNSSYSLALNAFADLTHHEF 86
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AV 146
S+ G +S +L +P S++W KG VT VK QG C A
Sbjct: 87 KTSRLG--LSAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSAT 144
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
A+EGIN I LVSLSEQ+L++C N+GC GG MD AF+++I N GI + Y Y
Sbjct: 145 GAIEGINKIVTGSLVSLSEQELIEC-DKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPY 203
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGG 264
G C+ + + I Y DVP N+E+ LL+AVA QPVSV I S A Q YS G
Sbjct: 204 RARD-GTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKG 262
Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
+F G C T L+H V VGYG SE G+ YW++KNSWG WG GY +QR+ QG CGI
Sbjct: 263 IFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGI 321
Query: 325 AMFASFPV 332
M AS+PV
Sbjct: 322 NMLASYPV 329
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 182/315 (57%), Gaps = 18/315 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E W G+ Y E +RFEIF DNL ++ N A N SYTL L +FADLT +E+
Sbjct: 38 YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAE-NNHSYTLGLTRFADLTNEEY 96
Query: 94 IASQTGFK----MSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
++ G K ++ G +P V+W EKGAV P+K QG C
Sbjct: 97 RSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQGGCGSCWA 156
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L+ LSEQ+LVDC T N GC GG MD AF++II N GI +
Sbjct: 157 FSTVAAVEGINQIVTGDLIVLSEQELVDCDTA-YNEGCNGGLMDYAFQFIISNGGIDTEE 215
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
Y Y+ G+CD + I +YEDV NDE +L AVA+QPVSVAI+ + Q
Sbjct: 216 DYPYK-ERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQL 274
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQ 319
Y G+F+G C L+HGV AVGYGT E G YW+++NSWG+ WGE GY R++R++
Sbjct: 275 YKSGIFDGRCGIDLDHGVVAVGYGT-ESGKDYWIVRNSWGKSWGEAGYIRMERNLPSSSS 333
Query: 320 GQCGIAMFASFPVSK 334
G+CGIA+ S+P+ K
Sbjct: 334 GKCGIAIEPSYPIKK 348
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 143/335 (42%), Positives = 201/335 (60%), Gaps = 31/335 (9%)
Query: 30 IAEKFEQWKAQYGR--------------TYKESAENSKRFEIFKDNLVAVERFN-NAAIG 74
+ +E WK+++GR +E + R E+F+DNL ++ N A G
Sbjct: 50 VRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEADAG 109
Query: 75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA 134
++ L L FADLT +E+ GF+ S G+ + + +P +++W + GA
Sbjct: 110 LHTFRLGLTPFADLTLEEYRGRVLGFRAR-GRRSGARYGSGYSVRGGDLPDAIDWRQLGA 168
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
VT VK Q QC AVAA+EG+NAI LVSLSEQ+++DC D+ GC GG M++
Sbjct: 169 VTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQDS--GCDGGQMEN 226
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH-AAQITNYEDVPPNDEESLLKAVAN 246
AF+++I N GI +A Y + G + G CD+ K ++ A I +V N+E +L +AVA
Sbjct: 227 AFRFVIGNGGIDTEADYPFIG-TDGTCDASKEKNEKVATIDGLVEVASNNETALQEAVAI 285
Query: 247 QPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
QPVSVAIDAS A Q YS G+FNG C T L+HGVTAVGYG SE G YW++KNSW WG
Sbjct: 286 QPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIVKNSWSASWG 344
Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
E GY R++R++ +P G+CGIAM AS+PV K++ P
Sbjct: 345 EAGYIRMRRNVPRPTGKCGIAMDASYPV-KDTYHP 378
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 134/269 (49%), Positives = 175/269 (65%), Gaps = 34/269 (12%)
Query: 75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKG 133
++SY L +N+FADLT +EF S+ FK H S +A T F Y++ + VP + +W +KG
Sbjct: 2 DKSYKLSINEFADLTNEEFGTSRNRFKA--HICSTEA--TSFKYENVTAVPSTXDWRKKG 57
Query: 134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AVTP+K QGQC AVAA+EGI + +L+SLSEQ+LVDC T+ + GC G
Sbjct: 58 AVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXG---- 113
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
A Y Y G + G C+ KA AA+I YEDVP N+E++L KAVA+
Sbjct: 114 ---------------ANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAH 157
Query: 247 QPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
QP++VAIDA QFYS GVF G C T L+HGV AVGYGTS++G+KYWL+KNSWG WG
Sbjct: 158 QPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGWG 217
Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
E+GY R+QRD+ +G CGIAM AS+P +
Sbjct: 218 EEGYIRMQRDVTAKEGLCGIAMQASYPTA 246
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 197/343 (57%), Gaps = 21/343 (6%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +L++S + A T RT DE + +E W +YG++Y E +RFEIFK
Sbjct: 10 MSLLFFSTLLVLSLAFNAKNLTKRTNDE--LKAMYESWLTKYGKSYNSLGEWERRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+ L ++ N A NRSY + LN+FAD T +EF ++ GF + + P +
Sbjct: 68 ETLRFIDEHN--ADTNRSYRVGLNQFADQTNEEFQSTYLGFTSGSNKMKVSNRYEP---R 122
Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
QV P V+W GAV +K QGQC A+A VEGIN I L+SLSEQ+LVDC
Sbjct: 123 VGQVLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
N GC GG + D F++II N GI +A Y Y G C+ + A I YE+
Sbjct: 183 GRTQNTRGCDGGSITDGFQFIINNGGINTEANYPYTA-EDGQCNLDLQNEKYASIDTYEN 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L AVA QPVSVA++A+ A Q YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGT-EGG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
I YW++KNSW WGE+GY R+ R++ G CGIA S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYIRILRNVGGA-GTCGIATKPSYPV 342
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 144/356 (40%), Positives = 210/356 (58%), Gaps = 29/356 (8%)
Query: 1 MAKYFLIVVLIISGSCA-------SQATYRTFDEGSIAEKFEQWKAQYGRTYKES-AENS 52
M FL++V ++S + S R+ +E + F+ W +++G+TY + E
Sbjct: 9 MTILFLLIVFVLSAPSSAMDLPATSGGHNRSNEE--VEFIFQMWMSKHGKTYTNALGEKE 66
Query: 53 KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
+RF+ FKDNL +++ N N SY L L +FADLT QE+ G +LK +
Sbjct: 67 RRFQNFKDNLRFIDQHNAK---NLSYQLGLTRFADLTVQEYRDLFPG-SPKPKQRNLKTS 122
Query: 113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
Q+P SV+W ++GAV+ +K QG C VAAVEG+N I L+SLSE
Sbjct: 123 RRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSE 182
Query: 166 QQLVDCATNDNNNGCYG-GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA-EDHA 223
Q+LVDC N NNGCYG G MD AF+++I N G+ ++ Y Y+G + G C+ ++ +
Sbjct: 183 QELVDC--NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQG-TQGSCNRKQSTSNKV 239
Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAV 281
I +YEDVP NDE SL KAVA+QPVSV +D + +F Y ++NG C T L+H + V
Sbjct: 240 ITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIV 299
Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
GYG SE G YW+++NSWG WG+ GY ++ R+ + P+G CGIAM AS+P+ ++
Sbjct: 300 GYG-SENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIKNSAS 354
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/346 (40%), Positives = 201/346 (58%), Gaps = 25/346 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL + L + + S A+ R + ++FE+W A+YGR YK++ E +RF+IFK+N+
Sbjct: 9 FLFLFLCVMWASPSAAS-RDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+E FN+ GN SYTL +N+F D+T EF+A TG + L P +
Sbjct: 68 IETFNSRN-GN-SYTLGINQFTDMTNNEFVAQYTGVSLP-----LNIEREPVVSFDDVDI 120
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S VP S++W GAVT VK C A+A VE I IK L+SLSEQQ++DCA
Sbjct: 121 SAVPQSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV 180
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG-MSTGICDSIKAEDHAAQITNYEDV 232
+ GC GG+++ A+ +II NKG+ + A+Y Y+ G C I ++A IT Y V
Sbjct: 181 S---YGCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTC-RINGVPNSAYITGYTRV 236
Query: 233 PPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
N+E S++ AV+NQP++ +I+AS Q Y GVF+G C T LNH +T +GYG G K
Sbjct: 237 QSNNERSMMYAVSNQPIAASIEASGDFQHYKRGVFSGPCGTSLNHAITIIGYGQDSSGKK 296
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
+W+++NSWG WGE GY R+ RD+ G CGIA+ +P + A
Sbjct: 297 FWIVRNSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYPTLQSGA 342
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 152/369 (41%), Positives = 208/369 (56%), Gaps = 42/369 (11%)
Query: 5 FLIVVLIISGSCAS------------QATYRTFD-EGSIAEKFEQWKAQYGRTYKESAEN 51
+++V ++ SCA+ + FD E S+ FE W ++G+ Y AE
Sbjct: 2 LILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEK 59
Query: 52 SKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
+R IF+DNL RF NN N SY L L FADL+ E+ G + +
Sbjct: 60 ERRLTIFEDNL----RFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVF 115
Query: 111 ANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRL 160
+ YK+S +P SV+W +GAVT VK QG C V AVEG+N I L
Sbjct: 116 MTSSD-RYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGEL 174
Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS-IKA 219
V+LSEQ L++C N NNGC GG ++ A+++I++N G+ D Y Y+ ++ G+CD +K
Sbjct: 175 VTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVN-GVCDGRLKE 231
Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
+ I YE++P NDE +L+KAVA+QPV+ ID+S+ Q Y GVF+G C T LNHG
Sbjct: 232 NNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHG 291
Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
V VGYGT E G YWL+KNS G WGE GY ++ R+I P+G CGIAM AS+P+
Sbjct: 292 VVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLK---- 346
Query: 338 QPSSADKSS 346
S DKSS
Sbjct: 347 NSFSTDKSS 355
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 196/343 (57%), Gaps = 21/343 (6%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + A T RT DE + +E W +YG++Y E +RFEIFK
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+ L ++ N A NRSY + LN+FADLT +EF ++ GF + + + P +
Sbjct: 68 ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122
Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
QV PS V+W GAV +K QG+C A+A VEGIN I L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
N GC G ++ D F +II N GI + Y Y G C+ + I YE+
Sbjct: 183 GRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQD-GECNVDLQNEKYVTIDTYEN 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L AV QPVSVA+DA+ A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
I YW++KNSW WGE+GY R+ R++ G CGIA S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 121/231 (52%), Positives = 157/231 (67%), Gaps = 15/231 (6%)
Query: 114 TPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSL 163
T F Y+ + +P +++W KGAVTP+K QGQC AVAA EGI I +LVSL
Sbjct: 5 TGFRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSL 64
Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
+EQ+LVDC +D + GC GG MDDAFK+II+N G+T ++ Y Y + G C S + A
Sbjct: 65 AEQELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-ADGKCKS--GSNSA 121
Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAV 281
A I YEDVP NDE +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+
Sbjct: 122 ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 181
Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GYG + +G KYWL+KNSWG WGE+GY R+++DI +G CG+AM S+P
Sbjct: 182 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 232
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 150/343 (43%), Positives = 207/343 (60%), Gaps = 35/343 (10%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ +L+ A Q T RT + S+ E+ EQ +Y + YK+ E+ F N+
Sbjct: 9 HIAFAMLLCMAFLAFQVTCRTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVN 62
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E NNAA ++ Y +N+F P+ + H S T F +++ +
Sbjct: 63 YIEACNNAA--DKPYKXGINQFP---PRN--------RFKGHMCSSIIRITTFKFENVTA 109
Query: 123 VPPSVNWIEKGAVTP--VKYQGQC-------AVAAVEGINAIKINRLVSLS-EQQLVDCA 172
P +V+ +KGAVTP VK QGQC AVAA EGI+A+ +L+ LS E +LVDC
Sbjct: 110 TPSTVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCD 169
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI-TNYED 231
T + GC GG DDAFK+IIQN G+ +A Y Y+G+ G C++ +A+ +AA I T Y+D
Sbjct: 170 TKGVDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVD-GKCNANEADKNAATIITGYDD 228
Query: 232 VPPNDEESLL-KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
VP N+E++ L KAVAN PVSVAIDAS QFY GVF G C T L+HGVTAVGYG S++
Sbjct: 229 VPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD 288
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G +YWL+KNS G +WGE+GY R+QR +D + CGIA+ AS+P
Sbjct: 289 GTEYWLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYP 331
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 197/343 (57%), Gaps = 21/343 (6%)
Query: 1 MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
M+ F +LI+S + A T RT DE + +E W +YG++Y E +RFEIFK
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+ L ++ N A NRSY + LN+FADLT +EF ++ F + + + P +
Sbjct: 68 ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNRYEP---R 122
Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
QV PS V+W GAV +K QG+C A+A VEGIN I L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
N GC GG++ D F++II N GI + Y Y G C+ + I YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNVDLQNEKYVTIDTYEN 241
Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
VP N+E +L AV QPVSVA+DA+ A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGT-EGG 300
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
I YW++KNSW WGE+GY R+ R++ G CGIA S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 140/314 (44%), Positives = 183/314 (58%), Gaps = 24/314 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
++E FE W ++G++Y + E R +F DN V NN + N SYTL LN +ADLT
Sbjct: 25 VSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNN--LDNSSYTLSLNSYADLT 82
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS----SQVPPSVNWIEKGAVTPVKYQGQC- 144
EF S+ GF S +L+ N P L + VP S++W +KGAVT VK QG C
Sbjct: 83 HHEFKVSRLGF-----SPALR-NFRPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQGSCG 136
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
A A+EGIN I L+SLSEQ+L+DC N+GC GG MD A++++I N GI
Sbjct: 137 ACWSFSATGAMEGINQIMTGSLISLSEQELIDC-DRSYNSGCGGGLMDYAYQFVISNHGI 195
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
+ Y Y+ G C K + + I Y D+P NDE LL+AVA QPVSV I S
Sbjct: 196 DTENDYPYQARD-GSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSER 254
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
A Q YS G+F+G C T L+H V VGYG SE G+ YW++KNSWG+ WG DGY +QR+
Sbjct: 255 AFQLYSKGIFSGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGKSWGMDGYMHMQRNSG 313
Query: 317 QPQGQCGIAMFASF 330
+G CGI AS+
Sbjct: 314 NSEGVCGINKLASY 327
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 193/311 (62%), Gaps = 18/311 (5%)
Query: 35 EQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFI 94
E+W AQ+G+ YK++AE + +IF++N+ +E F+ G++S+ L N+FADL +EF
Sbjct: 33 EKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFD--VCGDKSFNLSTNQFADLHDEEFK 90
Query: 95 ASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC--------A 145
A T +HS T F Y + +++P S++W ++G VTP+K QG+C
Sbjct: 91 ALLTNGHKKEHSL-WTTTETLFRYDNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLC 149
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
VA +EG++ I + LV LSEQ+LVD ++ GCYG +++DAFK+I + I ++ Y
Sbjct: 150 VATIEGLHQIITSELVPLSEQELVDFVKGESE-GCYGDYVEDAFKFITKKGRIESETHYP 208
Query: 206 YEGMSTGICDSIKAEDH-AAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYS 262
Y+G++ +K E H AQI Y+ VP E +LLKAVANQ VSV+++A SA QFYS
Sbjct: 209 YKGVNNTC--KVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYS 266
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F G C T +H V YG S +G KYWL KNSWG +WGE GY R++ DI +G C
Sbjct: 267 SGIFTGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLC 326
Query: 323 GIAMFASFPVS 333
GIA + +P++
Sbjct: 327 GIAKYPYYPIA 337
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 140/301 (46%), Positives = 183/301 (60%), Gaps = 18/301 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ FE ++ + Y+ E RFEIF DNL ++ N +Y L LN+FADLT
Sbjct: 45 VIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKV---SNYWLGLNEFADLT 101
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF GFK + + + F Y+ +P SV+W +KGAV+PVK QGQC
Sbjct: 102 HEEFKNKFLGFK-GELAERKDESIEQFRYRDFVDLPKSVDWRKKGAVSPVKNQGQCGSCW 160
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L LSEQ+L+DC T NNGC GG MD AF Y+ +N G+ +
Sbjct: 161 AFSTVAAVEGINQIVTGNLTVLSEQELIDCDTT-FNNGCNGGLMDYAFAYVTRN-GLHKE 218
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y MS G CD + I+ Y DVP N+E+S LKA+ANQP+SVAI+AS Q
Sbjct: 219 EEYPYI-MSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQ 277
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FYSGGVF+G+C T L+HGV AVGYGTS +G+ Y +++NSWG WGE GY R++R+ +P
Sbjct: 278 FYSGGVFDGHCGTELDHGVAAVGYGTS-KGLDYVIVRNSWGPKWGEKGYIRMKRNTGKPM 336
Query: 320 G 320
G
Sbjct: 337 G 337
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 190/320 (59%), Gaps = 31/320 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F W ++G+ Y E +R+EIFK NL+ + N N SY L LN+FAD+ +EF
Sbjct: 44 FRSWSVKHGKLYASPTEKLERYEIFKQNLMHIAETNRK---NGSYWLGLNQFADVAHEEF 100
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYK-----SSQVPPSVNWIEKGAVTPVKYQGQC---- 144
AS G K + + TP ++ + +P SV+W KGAVTPVK QG+C
Sbjct: 101 KASYLGLKRALPRAGAPQTRTPTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCW 160
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
+VAAVEGIN I +LVSLSEQ+LVDC T ++GC GG MD AF Y++ ++GI +
Sbjct: 161 AFSSVAAVEGINQIVTGKLVSLSEQELVDCDTT-LDHGCEGGTMDLAFAYMMGSQGIHAE 219
Query: 202 AVYSYEGMSTGICD-------SIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
Y Y M G C I +D +T +EDVP N E SLLKA+A+QPVSV I
Sbjct: 220 DDYPYL-MEEGYCKEKQPCVLGITEQD----LTGFEDVPENSEISLLKALAHQPVSVGIA 274
Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A + QFY GGVF+G C L+H +TAVGYG+S G Y +KNSWG++WGE GY R++
Sbjct: 275 AGSRDFQFYRGGVFDGACSVELDHALTAVGYGSSY-GQNYITMKNSWGKNWGEQGYVRIK 333
Query: 313 RDIDQPQGQCGIAMFASFPV 332
+P+G CGI AS+PV
Sbjct: 334 MGTGKPEGVCGIYTMASYPV 353
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 201/320 (62%), Gaps = 21/320 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
+ ++ + E+W A++GRTY E ++R E+F+ N ++ FN+A + ++ L N+FA
Sbjct: 37 DSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAE--DSTHRLATNRFA 94
Query: 87 DLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQG 142
DLT +EF A++TG + ++ + F Y++ + S++W GAVT VK QG
Sbjct: 95 DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C AVAAVEG+ I+ RLVSLSEQQLVDC ++ GC GG MD+AF+Y+I
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+T ++ Y Y G + G C + AA I YEDVP N+E +L+ AVA+QPVSVAI+
Sbjct: 215 GGLTTESSYPYRG-TDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAING 270
Query: 256 --SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
S +FY GV G C T LNH +TAVGYGT+ +G KYW++KNSWG WGE GY R++
Sbjct: 271 GDSVFRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIR 330
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R + + +G CG+A AS+PV
Sbjct: 331 RGV-RGEGVCGLAQLASYPV 349
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 193/316 (61%), Gaps = 20/316 (6%)
Query: 34 FEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
F+ W +++G+TY + E +RF+ FKDNL +++ N N SY L L +FADLT QE
Sbjct: 48 FQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAK---NLSYQLGLTRFADLTVQE 104
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
+ G +L+ + Q+P SV+W +GAV+ +K QG C
Sbjct: 105 YRDLFPG-SPKPKQRNLRISRRYVPLDGDQLPESVDWRNEGAVSAIKDQGTCNSCWAFST 163
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG-GFMDDAFKYIIQNKGITNDAVY 204
VAAVEGIN I LVSLSEQ+LVDC N NNGCYG G MD AF+++I N G+ +D Y
Sbjct: 164 VAAVEGINKIVTGELVSLSEQELVDC--NLVNNGCYGSGTMDAAFQFLINNGGLDSDTDY 221
Query: 205 SYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--Y 261
Y+G S G C+ ++ + I +YEDVP NDE SL KAVA+QPVSV +D + +F Y
Sbjct: 222 PYQG-SQGYCNRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLY 280
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
G++NG C T L+H + VGYG SE G YW+++NSWG WG+ GY ++ R+ + P G
Sbjct: 281 RSGIYNGPCGTDLDHALVIVGYG-SENGQDYWIVRNSWGTTWGDAGYAKMARNFEYPSGV 339
Query: 322 CGIAMFASFPVSKESA 337
CGIAM AS+PV ++
Sbjct: 340 CGIAMLASYPVKNSAS 355
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 144/327 (44%), Positives = 198/327 (60%), Gaps = 23/327 (7%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI--GNRSYTLRLN 83
D+ ++ E++E+W A+ GRTYK+S E ++RFE+FK N ++ N A G L N
Sbjct: 12 DDKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTN 71
Query: 84 KFADLTPQEFI-ASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVK 139
KFADLT EF TG +++ +SL + T F + + S VPPS++W +GAVT VK
Sbjct: 72 KFADLTEDEFRNIYVTGHRVNYRPTSLVTD-TVFKFGAVSLSDVPPSIDWRARGAVTSVK 130
Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
Q CA AAVEGI+ I VSLS QQLVDC +N N C G +D A++YI
Sbjct: 131 DQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDC-SNAANEKCKAGEIDKAYEYI 189
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
++ G+ D Y YEG S G C + + A+I+ ++ VP +E +LL AVA+QPVSVA
Sbjct: 190 ARSGGLVADQDYPYEGHS-GTC-RVYGKQAVARISGFQYVPARNETALLLAVAHQPVSVA 247
Query: 253 ID--ASALQFYSGGVFNGY---CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
+D + ALQ G+F C T LNH +T VGYGT E G +YWL+KNSWG DWG+ G
Sbjct: 248 LDGLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKG 307
Query: 308 YFRLQRDI-DQPQGQCGIAMFASFPVS 333
Y + RD+ + G CG+A+ AS+PV+
Sbjct: 308 YVKFARDVASEINGVCGLALEASYPVA 334
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 186/312 (59%), Gaps = 21/312 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W ++G+ Y AE +R IFKDNL + N+ +G Y L LN+FADL+ E+
Sbjct: 64 FESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLG---YRLGLNRFADLSLHEY 120
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC------ 144
G + + + + YK+S +P SV+W +GAVT VK QG C
Sbjct: 121 KEICHGADPKPPRNHVFMSSSD-RYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAF 179
Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
V AVEG+N I LV+LSEQ L++C N NNGC GG ++ A+++I+ N G+ D
Sbjct: 180 STVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIVSNGGLGTDND 237
Query: 204 YSYEGMSTGICDS-IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y+ ++ G CD +K I YE++P NDE +L+KAVA+QPV+ ID+S+ Q
Sbjct: 238 YPYKAVN-GACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQL 296
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
Y GVF+G C T LNHGV VGYGT E G YW+++NSWG WGE GY ++ R+I P+G
Sbjct: 297 YESGVFDGRCGTNLNHGVVVVGYGT-ENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRG 355
Query: 321 QCGIAMFASFPV 332
CGIAM S+P+
Sbjct: 356 LCGIAMRVSYPL 367
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 194/340 (57%), Gaps = 25/340 (7%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAI 73
S +RT +E + + QW A++G+T + + KRF IFKDNL ++ +N
Sbjct: 35 SDGKWRTDEE--VRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFID-LHNENN 91
Query: 74 GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNW 129
N +Y L L KF DLT E+ G + KA Y ++ +VP +V+W
Sbjct: 92 KNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDW 151
Query: 130 IEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
+KGAV P+K QG C AAVEGIN I L+SLSEQ+LVDC N GC G
Sbjct: 152 RQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDC-DKSYNQGCNG 210
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
G MD AF++I++N G+ + Y Y G G C+S I YEDVP DE +L K
Sbjct: 211 GLMDYAFQFIMKNGGLNTEKDYPYRGFG-GKCNSFLKNSRVVSIDGYEDVPTKDETALKK 269
Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
A++ QPVSVAI+A Q Y G+F G C T L+H V AVGYG SE G+ YW+++NSWG
Sbjct: 270 AISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWG 328
Query: 301 QDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQP 339
WGE+GY R++R++ + G+CGIA+ AS+PV K S P
Sbjct: 329 PRWGEEGYIRMERNLAASKSGKCGIAVEASYPV-KYSPNP 367
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 192/342 (56%), Gaps = 31/342 (9%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K F V L+ +CA+ A F +WKA + R Y + E + R EI+ NL
Sbjct: 2 KAFTAVALLALVACAT------------AMPFAEWKALHNRQYASAQEEALRQEIYLSNL 49
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ 122
+ N A G SYTL +N+F DL EF A G + + +++ + +L +
Sbjct: 50 ELINEHN--AAGRHSYTLGMNEFGDLAHHEFAAKYLGVRFNGVNATKSFASSTYLPRMVS 107
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P SV+W G VTPVK QGQC +VEG +A K LVSLSEQ LVDC++ +
Sbjct: 108 LPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGSVEGQHARKTGTLVSLSEQNLVDCSSQE 167
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MDDAF+YII+N GI +A Y Y +TG C A + A + +Y+D+
Sbjct: 168 GNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA-TTGTCK-FNAANIGATVASYQDIITG 225
Query: 236 DEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEGI 290
E L AVA PVSVAIDAS + QFY GV+N C T L+HGV AVGYGTS EG
Sbjct: 226 SESDLQNAVATVGPVSVAIDASHINFQFYFTGVYNEKKCSTTQLDHGVLAVGYGTSTEGK 285
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG WG+ GY + R+ D QCGIA AS+P+
Sbjct: 286 DYWLVKNSWGATWGKAGYIWMSRNADN---QCGIATSASYPL 324
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 195/340 (57%), Gaps = 25/340 (7%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAI 73
S +RT +E + + QW A++G+T + + KRF IFKDNL ++ +N
Sbjct: 35 SDGKWRTDEE--VRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFID-LHNEDN 91
Query: 74 GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNW 129
N +Y L L KF DLT E+ G + KA Y ++ +VP +V+W
Sbjct: 92 KNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDW 151
Query: 130 IEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
+KGAV P+K QG C AAVEGIN I L+SLSEQ+LVDC + N GC G
Sbjct: 152 RQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKS-YNQGCNG 210
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
G MD AF++I++N G+ + Y Y G G C+S I YEDVP DE +L K
Sbjct: 211 GLMDYAFQFIMKNGGLNTEKDYPYRGFG-GKCNSFLKNSRVVSIDGYEDVPTKDETALKK 269
Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
A++ QPVSVAI+A Q Y G+F G C T L+H V AVGYG SE G+ YW+++NSWG
Sbjct: 270 AISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWG 328
Query: 301 QDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQP 339
WGE+GY R++R++ + G+CGIA+ AS+PV K S P
Sbjct: 329 PRWGEEGYIRMERNLAASKSGKCGIAVEASYPV-KYSPNP 367
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 192/315 (60%), Gaps = 16/315 (5%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+++W+A++ + R E+FK+NL V+ N AA G +Y L +N+FADLT +E
Sbjct: 43 YQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 102
Query: 93 FIAS-QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
+ A S+S + + L + +P S++W EKGAV VK QG+C
Sbjct: 103 YRARFLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQGRCGSCWAFA 162
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A+A VEGIN I L+SLSEQQLVDC+T N+GC GG+ AF+YII N G+ ++ Y
Sbjct: 163 AIATVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGGVNSEEHY 220
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y G + ++ K H I +Y +VP NDE+SL KAVANQP+SV I+AS Q Y
Sbjct: 221 PYTGTNGTC-NTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASGRNFQLYH 279
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F G C T LNHGVT VGYGT G YW++KNSWG+ WG+ GY ++R+I + G+C
Sbjct: 280 SGIFTGSCNTSLNHGVTVVGYGTVN-GNDYWIVKNSWGESWGDSGYILMERNIAESSGKC 338
Query: 323 GIAMFASFPVSKESA 337
GIA+ S+P+ KE A
Sbjct: 339 GIAISPSYPI-KEGA 352
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 188/310 (60%), Gaps = 15/310 (4%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+++W+ ++ + R E+FK+NL V+ N AA G +Y L +N+FADLT +E
Sbjct: 52 YQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 111
Query: 93 FIAS-QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
+ A S+S + + L + +P S++W EKGAV VK QG+C
Sbjct: 112 YRARFLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFA 171
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A+AAVEGIN I L+SLSEQQLVDC+T N GC GG+ AF+YII N G+ ++ Y
Sbjct: 172 AIAAVEGINQIVTGDLISLSEQQLVDCSTR--NYGCEGGWPYRAFQYIINNGGVNSEEHY 229
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
Y G + ++ K H I +Y +VP NDE+SL KA ANQP+SV IDAS Q Y
Sbjct: 230 PYTGTNGTC-NTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDASGRNFQLYH 288
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
G+F G C T LNHGVT VGYGT E G YW++KNSWG++WG GY ++R+I + G+C
Sbjct: 289 SGIFTGSCNTSLNHGVTVVGYGT-ENGNDYWIVKNSWGENWGNSGYILMERNIAESSGKC 347
Query: 323 GIAMFASFPV 332
GIA+ S+P+
Sbjct: 348 GIAISPSYPI 357
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 143/333 (42%), Positives = 198/333 (59%), Gaps = 20/333 (6%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
RT DE + FE W +YG++Y E +RFEIFKDNL V+ N A NRSY + L
Sbjct: 39 RTNDE--VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHN--ADVNRSYKVGL 94
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
N+F+DLT +E+ + G K +++ P + Q+P S++W +KGAV VK QG
Sbjct: 95 NQFSDLTLEEYSSIYLGTKFDMRMTNVSDRYEPRV--GDQLPNSIDWRKKGAVLGVKNQG 152
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C +AAVE IN I L+SLSEQQ+VDC NNGC GG A+++II N
Sbjct: 153 NCGSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDN 212
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
GI +A Y Y+ G CD K + + I YE+VP +E++L KAV+NQ VSV I +
Sbjct: 213 GGINTEANYPYKAQD-GECDEQKNQKYVT-IDRYENVPRKNEKALQKAVSNQLVSVGIAS 270
Query: 256 SALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
++ +F Y G+F G C ++H VT VGYGT E G+ YW+++NSWG +WGE+GY R+QR
Sbjct: 271 NSSEFKAYKSGIFTGPCGAKIDHAVTIVGYGT-EGGMDYWIVRNSWGSNWGENGYVRMQR 329
Query: 314 DIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
++ G C IA ++PV K P++A SS
Sbjct: 330 NVGNA-GTCFIATSPNYPV-KYGPNPTNAHLSS 360
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 140/320 (43%), Positives = 200/320 (62%), Gaps = 21/320 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
+ ++ + E+W A++GRTY E ++R E+F+ N ++ FN+A + ++ L N+FA
Sbjct: 37 DAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAE--DSTHRLATNRFA 94
Query: 87 DLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQG 142
DLT +EF A++TG + ++ + F Y++ + S++W GAVT VK QG
Sbjct: 95 DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C AVAAVEG+ I+ RLVSLSEQQLVDC ++ GC GG MD+AF+Y+I
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+T ++ Y Y G + G C + AA I YEDVP N+E +L+ AVA+QPVSVAI+
Sbjct: 215 GGLTTESSYPYRG-TDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAING 270
Query: 256 --SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
S +FY GV G C T LNH +TA GYGT+ +G KYW++KNSWG WGE GY R++
Sbjct: 271 GDSVFRFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIR 330
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R + + +G CG+A AS+PV
Sbjct: 331 RGV-RGEGVCGLAQLASYPV 349
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 204/342 (59%), Gaps = 35/342 (10%)
Query: 30 IAEKFEQWKAQYGRTYKE----SAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNK 84
+ +E WK+++GR E+ R E+F+DNL ++ N A G ++ L L
Sbjct: 50 VRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTP 109
Query: 85 FADLTPQEFIASQTGFKMSDH--------SSSLKANGTPFLYKS-------SQVPPSVNW 129
FADLT +E+ GF+ +S + + GT ++ +P +++W
Sbjct: 110 FADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDAIDW 169
Query: 130 IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
+ GAVT VK Q QC AVAA+EGINAI LVSLSEQ+++DC T D+ GC G
Sbjct: 170 RQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQDS--GCNG 227
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH-AAQITNYEDVPPNDEESLL 241
G M++AF+++I N GI ++A Y + + G CD+ KA D A I + +V N+E +L
Sbjct: 228 GQMENAFQFVIDNGGIDSEADYPFIA-TDGTCDANKANDEKVAAIDGFVEVASNNETALQ 286
Query: 242 KAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSW 299
+AVA QPVSVAIDA A Q YS G+FNG C T L+HGVT VGYG SE G YW++KNSW
Sbjct: 287 EAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYG-SENGKAYWIVKNSW 345
Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
WGE GY R++R++ P G+CGIAM AS+PV K++ P++
Sbjct: 346 SDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPV-KDTYGPAA 386
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 146/347 (42%), Positives = 198/347 (57%), Gaps = 25/347 (7%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAI 73
S + +RT +E + + QW A +G+T + + KRF IFKDNL ++ +N
Sbjct: 35 SDSWWRTDEE--VRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFID-LHNEKN 91
Query: 74 GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNW 129
N +Y L L KF DLT +E+ + G + KA Y ++ +VP +V+W
Sbjct: 92 KNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAKAKNVNQKYSAAVDGKEVPETVDW 151
Query: 130 IEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
KGAV P+K QG C AAVEGIN I L+SLSEQ+LVDC N N GC G
Sbjct: 152 RLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDC-DNSYNQGCNG 210
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
G MD AF++I++N G+ + Y Y G G C+S I YEDVP DE +L +
Sbjct: 211 GLMDYAFQFIMKNGGLKTEKDYPYRGFG-GKCNSFLKNAKVVSIDGYEDVPTKDETALKR 269
Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
A++ QPVSVAI+A Q Y G+F G C T L+H V AVGYG SE G+ YW+++NSWG
Sbjct: 270 AISLQPVSVAIEAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYG-SENGVDYWIVRNSWG 328
Query: 301 QDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQPSSADKSS 346
WGE+GY R++R++ + G+CGIA+ AS+PV K S P SS
Sbjct: 329 PRWGEEGYIRMERNLASSKSGKCGIAVEASYPV-KYSPNPVRGSISS 374
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 195/349 (55%), Gaps = 31/349 (8%)
Query: 1 MAKYFLIVV--LIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
MA FL+VV L+ + A+ A Y D+G + FE+W A++G+TYK E RF I
Sbjct: 1 MASAFLLVVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGI 60
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
F+DN+ + + + + + +N+FADLT EF+A+ TG K + P
Sbjct: 61 FRDNVHFIRGYKPQVTYDSA--VGINQFADLTNDEFVATYTGAKPPHPKEA------PRP 112
Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
P ++W +GAVT VK QG C AVAA+EG+ I+ +L LSEQ+LVD
Sbjct: 113 VDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVD 172
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED----HAAQI 226
C TN +NGC GG D AF+ + GIT ++ Y YEG G C + +D HAA I
Sbjct: 173 CDTN--SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQ-GKC---RVDDMLFNHAASI 226
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYG 284
Y VPPNDE L AVA QPV+V IDAS A QFY GVF G C NH VT VGY
Sbjct: 227 GGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYC 286
Query: 285 T-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYW+ KNSWG+ WG+ GY L++D+ QP G CG+A+ +P
Sbjct: 287 QDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 335
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 188/323 (58%), Gaps = 22/323 (6%)
Query: 34 FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ +W ++G++ S + +RF IFKDNL ++ +N N +Y L L FA+LT
Sbjct: 4 YLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFID-LHNENNKNATYKLGLTIFANLT 62
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNWIEKGAVTPVKYQGQCA 145
E+ + G + KA Y ++ +VP +V+W +KGAV +K QG C
Sbjct: 63 NDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCG 122
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
AAVEGIN I LVSLSEQ+LVDC N GC GG MD AF++I++N G+
Sbjct: 123 SCWAFSTAAAVEGINKIVTGELVSLSEQELVDC-DKSYNQGCNGGLMDYAFQFIMKNGGL 181
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
+ Y Y G + G C+S+ I YEDVP DE +L +AV+ QPVSVAIDA
Sbjct: 182 NTEKDYPYHG-TNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
A Q Y G+F G C T ++H V AVGYG SE G+ YW+++NSWG WGEDGY R++R++
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299
Query: 317 QPQGQCGIAMFASFPVSKESAQP 339
G+CGIA+ AS+PV K S P
Sbjct: 300 SKSGKCGIAIEASYPV-KYSPNP 321
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 188/323 (58%), Gaps = 22/323 (6%)
Query: 34 FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ +W ++G++ S + +RF IFKDNL ++ +N N +Y L L FA+LT
Sbjct: 4 YLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFID-LHNENNKNATYKLGLTIFANLT 62
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNWIEKGAVTPVKYQGQCA 145
E+ + G + KA Y ++ +VP +V+W +KGAV +K QG C
Sbjct: 63 NDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCG 122
Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
AAVEGIN I LVSLSEQ+LVDC N GC GG MD AF++I++N G+
Sbjct: 123 SCWAFSTAAAVEGINKIVTGELVSLSEQELVDC-DKSYNQGCNGGLMDYAFQFIMKNGGL 181
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
+ Y Y G + G C+S+ I YEDVP DE +L +AV+ QPVSVAIDA
Sbjct: 182 NTEKDYPYHG-TNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
A Q Y G+F G C T ++H V AVGYG SE G+ YW+++NSWG WGEDGY R++R++
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299
Query: 317 QPQGQCGIAMFASFPVSKESAQP 339
G+CGIA+ AS+PV K S P
Sbjct: 300 SKSGKCGIAIEASYPV-KYSPNP 321
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 188/336 (55%), Gaps = 22/336 (6%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L + I + +C ++ + D + ++E W +YG+ Y+ E RFEI++ N+ +
Sbjct: 16 LCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEFRFEIYRANVQFI 75
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVP 124
E +N+ N SY L NKF DLT +EF ++ H T F+Y K +P
Sbjct: 76 EVYNSQ---NYSYKLMDNKFVDLTNEEFRRMYLVYQPRSHLQ------TRFMYQKHGDLP 126
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
++W +GAVT +K QG C AVA VE IN IK +LVSLSEQQL+DC + N
Sbjct: 127 KRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDCDNRNGN 186
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG M+ F +I + G+T D Y Y+G S G + K +HA I YE++P ++E
Sbjct: 187 EGCNGGHME-TFTFITKRGGLTTDKNYPYQG-SDGDXNKAKVRNHAVAICGYENLPAHNE 244
Query: 238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
L AVA+QP SVA DA A Q YS G F+G C LNH +T VGYG E G KYWL+
Sbjct: 245 NMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG-EENGEKYWLV 303
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
KNSW D G GY R++RD G CG AM AS+P
Sbjct: 304 KNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 204/346 (58%), Gaps = 23/346 (6%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEG---SIAEKFEQWKAQYGRTYKESAENSKRFEI 57
M LI+++++ + + A ++G I FE W A++G++Y E ++R I
Sbjct: 1 MIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMI 60
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG-FKMSDHSSSLKANGTPF 116
F D L +E+ N A N ++TL LNKF+DLT EF A G FK + L A
Sbjct: 61 FSDTLAYIEKHN--AQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAEDEDV 118
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
S +P S++W +KGAVTP+K QG C A+A++E + + LVSLSEQQL+
Sbjct: 119 --DVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLM 176
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC T D GC GG M+ AFK++++N G+T +A Y Y G S G C++ KA++ A+IT +
Sbjct: 177 DCDTVDA--GCDGGLMETAFKFVVKNGGVTTEAAYPYTG-SVGSCNANKAKNKVAEITGF 233
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSE 287
+ V + ++L+KAV+ PV+V+I S F Y G+ +G C+ L+HGV +GYGT E
Sbjct: 234 KVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYGT-E 292
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
G+ YW+IKNSWG WGEDG+ +++R G CG+ +S+P +
Sbjct: 293 GGMPYWIIKNSWGTSWGEDGFMKIER--KDGDGMCGMNGDSSYPTT 336
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 193/340 (56%), Gaps = 25/340 (7%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAI 73
S +RT +E + + QW A++G+T + + KRF IFKDNL ++ +N
Sbjct: 35 SDGKWRTDEE--VRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFID-LHNENN 91
Query: 74 GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNW 129
N +Y L L KF DLT E+ G + KA Y ++ +VP +V+W
Sbjct: 92 KNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDW 151
Query: 130 IEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
+KGAV P+K QG C AAVEGIN I L+SLSEQ+LVDC N GC G
Sbjct: 152 RQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDC-DKSYNQGCNG 210
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
G MD AF++I++N G+ + Y Y G G C+S I YEDVP DE +L K
Sbjct: 211 GLMDYAFQFIMKNGGLNTEKDYPYRGFG-GKCNSFLKNSRVVSIDGYEDVPTKDETALKK 269
Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
A++ QPV VAI+A Q Y G+F G C T L+H V AVGYG SE G+ YW+++NSWG
Sbjct: 270 AISYQPVRVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWG 328
Query: 301 QDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQP 339
WGE+GY R++R++ + G+CGIA+ AS+PV K S P
Sbjct: 329 PRWGEEGYIRMERNLAASKSGKCGIAVEASYPV-KYSPNP 367
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 134/268 (50%), Positives = 165/268 (61%), Gaps = 21/268 (7%)
Query: 88 LTPQEFIASQTGFKMSDHS------SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKY 140
+T EF G +++ H A+ + F+Y ++ VP SV+W +KGAVT VK
Sbjct: 1 MTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 60
Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC +AAVEGINAIK L SLSEQQLVDC T N GC GG MD AF+YI
Sbjct: 61 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTK-ANAGCNGGLMDYAFQYIA 119
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
++ G+ + Y Y K+ I YEDVP NDE +L KAVA+QPVSVAI
Sbjct: 120 KHGGVAAEDAYPYRARQASC---KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAI 176
Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+AS QFYS GVF+G C T L+HGV AVGYG + +G KYWL+KNSWG +WGE GY R+
Sbjct: 177 EASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRM 236
Query: 312 QRDIDQPQGQCGIAMFASFPVSKESAQP 339
RD+ +G CGIAM AS+PV K S P
Sbjct: 237 ARDVAAKEGHCGIAMEASYPV-KTSPNP 263
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 144/357 (40%), Positives = 198/357 (55%), Gaps = 31/357 (8%)
Query: 1 MAKYFLIVVLIISGS----CASQATYRTFDEGSIAEK-------FEQWKAQYGRTY-KES 48
MA FLI L+++ S A + R E + + F+QW QY + Y +
Sbjct: 1 MAVRFLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDI 60
Query: 49 AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
E RF ++ +NL + +N S+ L LN FADLT EF ++ G+ +S
Sbjct: 61 KELETRFSVWLENLNYILAYNARTT---SHWLHLNAFADLTTDEF-RNRLGYDFKARQAS 116
Query: 109 LKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKIN 158
+ +PF+Y ++Q+P ++W +KGAVT VK QGQC +VEGINAI
Sbjct: 117 NRLQSSPFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTG 176
Query: 159 RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK 218
L SLSEQ+LVDC T D + GC GG MD A+++II+N G+ + Y Y G+C + K
Sbjct: 177 ELASLSEQELVDCDT-DEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTA-EDGVCVAAK 234
Query: 219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASALQFYSGGVFNG-YCETFLN 275
I Y D+P NDE +L KA A+QP++VAI DA + Q Y GGV++ C T LN
Sbjct: 235 KNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLN 294
Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
HGV VGYG YW++KNSWG +WG++GY RL+ + QG CGIAM SFP
Sbjct: 295 HGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFPT 351
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 198/338 (58%), Gaps = 24/338 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L++V + G+ ++ T ++G + F+ +K ++ + Y+ + E ++RF +F N+ +
Sbjct: 5 LVLVCALVGAAMAEPLSLTVNKGRL---FDAFKTKFNKVYESAEEEARRFSVFSQNIDFI 61
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
R N AA G ++T+ +N+FADLT +E+ + + + L ++
Sbjct: 62 NRHNAEAARGVHTHTVDVNQFADLTNEEY----RQLYLRPYPTELLGRERQEVWLDGPNA 117
Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
SV+W +KGAVTP+K QGQC +VEG +AI LVSLSEQQLVDC+ + N
Sbjct: 118 GSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGN 177
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MD+AFKYII N G+ + Y Y G+CD K HA I+ Y+DVP N+E
Sbjct: 178 QGCNGGLMDNAFKYIISNGGLDTEQDYPYTARD-GVCDKSKESKHAVSISGYKDVPQNNE 236
Query: 238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
+ L AV PVSVAI+A + Q YS GVF+G C T L+HGV VGY TS+ YW++
Sbjct: 237 DQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGY-TSD----YWIV 291
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KNSWG WG+ GY ++R + G CGIAM S+P++
Sbjct: 292 KNSWGASWGDQGYIMMKRGVSS-AGICGIAMQPSYPIA 328
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 190/343 (55%), Gaps = 29/343 (8%)
Query: 5 FLIVVLIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
L+ L+ + A+ A Y D+G + FE+W A++G+TYK E RF IF+DN+
Sbjct: 6 LLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVH 65
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
+ + + + + +N+FADLT EF+A+ TG K + P
Sbjct: 66 FIRGYKPQVTYDSA--VGINQFADLTNDEFVATYTGAKPPHPKEA------PRPVDPIWT 117
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P ++W +GAVT VK QG C AVAA+EG+ I+ +L LSEQ+LVDC TN
Sbjct: 118 PCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-- 175
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED----HAAQITNYEDV 232
+NGC GG D AF+ + GIT ++ Y YEG G C + +D HAA I Y V
Sbjct: 176 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQ-GKC---RVDDMLFNHAASIGGYRAV 231
Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEG 289
PPNDE L AVA QPV+V IDAS A QFY GVF G C NH VT VGY G
Sbjct: 232 PPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASG 291
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYWL KNSWG+ WG+ GY L++DI QP G CG+A+ +P
Sbjct: 292 KKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYPT 334
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 189/316 (59%), Gaps = 29/316 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQE 92
FE W ++G+ Y AE +R IF+DNL RF N N SY L LN+FADL+ E
Sbjct: 56 FESWMVKHGKVYDSVAEKERRLTIFEDNL----RFITNRNAENLSYRLGLNRFADLSLHE 111
Query: 93 FIASQTGFKMS---DHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-- 144
+ G +H +N YK+S +P SV+W +GAVT VK QG C
Sbjct: 112 YGEICHGADPRPPRNHVFMTSSN----RYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRS 167
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
V AVEG+N I LV+LSEQ L++C N NNGC GG ++ A+++I+ N G+
Sbjct: 168 CWAFSTVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLG 225
Query: 200 NDAVYSYEGMSTGICDS-IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
D Y Y+ ++ G+C+ +K ++ I YE++P NDE +L+KAVA+QPV+ +D+S+
Sbjct: 226 TDNDYPYKALN-GVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSR 284
Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
Q Y GVF+G C T LNHGV VGYGT E G YW++KNS G WGE GY ++ R+I
Sbjct: 285 EFQLYESGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVKNSRGDTWGEAGYMKMARNIA 343
Query: 317 QPQGQCGIAMFASFPV 332
P+G CGIAM AS+P+
Sbjct: 344 NPRGLCGIAMRASYPL 359
>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
Length = 514
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 197/349 (56%), Gaps = 49/349 (14%)
Query: 34 FEQWKAQYGRTYKE-SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
F W QYGRTY E S E ++R IF DN+ A++ + G TL LN++ADLT +E
Sbjct: 38 FTLWSRQYGRTYVEQSPEYTRRLSIFSDNVRAIQESHEKDPG---VTLALNEYADLTWEE 94
Query: 93 FIASQTGFK-----MSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA- 145
F +++ G + + S + + Y ++ P +++W EKGAV VK QGQC
Sbjct: 95 FSSTRLGLRIDQDQLDRRSRRSASRRNAWRYAAAVDNPKAIDWREKGAVAEVKNQGQCGS 154
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCAT-------------------------N 174
A+EGINAI +L SLSEQQLVDC T N
Sbjct: 155 CWAFSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRSKRSCTVILPSYSSNSCRN 214
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMSTGI-CDSIKAEDH-AAQITNYED 231
++N GC GG MDDAFKY+IQN G+ + Y+Y G G C+ K D A I YED
Sbjct: 215 ESNMGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWCNKRKQTDRPAVSIDGYED 274
Query: 232 VPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
VP E++LLKAVA+QPV+VAI A A +QFYS GV + CE LNHGV VGY S++G
Sbjct: 275 VP-QGEDNLLKAVAHQPVAVAICAGASMQFYSRGVISTCCEG-LNHGVLTVGYNVSQDGE 332
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
KYW++KNSWG WGE GYFRL+ + + G CGIA AS+P +P
Sbjct: 333 KYWIVKNSWGAGWGEQGYFRLKMGVGE-TGLCGIASAASYPTKTSPNKP 380
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 145/344 (42%), Positives = 196/344 (56%), Gaps = 44/344 (12%)
Query: 30 IAEKFEQWKAQYGRTYK------------ESAENSK-RFEIFKDNLVAVERFN-NAAIGN 75
+ +E WK+++GR E E+ + R E+F+DNL +++ N A G
Sbjct: 80 VRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAGL 139
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSD----------HSSSLKANGTPFLYKSSQVPP 125
++ L L FADLT E+ GF+ H + G L P
Sbjct: 140 HTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLL------PD 193
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+++W + GAVT VK Q QC AVAA+EGINAI LVSLSEQ+++DC D+
Sbjct: 194 AIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQDS-- 251
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH-AAQITNYEDVPPNDE 237
GC GG M++AF+++I N GI +A Y + G + G CD+ K + A I +V N+E
Sbjct: 252 GCDGGQMENAFRFVIGNGGIDTEADYPFIG-TDGTCDASKENNEKVATIDGLVEVASNNE 310
Query: 238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
+L +AVA QPVSVAIDAS A Q YS G+FNG C T L+HGVTAVGYG SE G YW++
Sbjct: 311 TALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIV 369
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
KNSW WGE GY R++R++ +P G+CGIAM AS+PV P
Sbjct: 370 KNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASYPVKDTYHDP 413
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 134/333 (40%), Positives = 187/333 (56%), Gaps = 26/333 (7%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLR 81
R E I + F+ W +Y + + E KR +IF +N + V N + G S+ +
Sbjct: 61 RVLRESKIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVE 120
Query: 82 LNKFADLTPQEFIASQTGFKMS----DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTP 137
+NKFA T +E+ GFK S S + + + Y+ + P S++W+++G +T
Sbjct: 121 MNKFAAHTREEY-RKMLGFKKSLRRKKDSGEAAKDVSLWEYEGVEAPESIDWVDEGVITT 179
Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
K QG C A+ AVEGINAI+ +LVSLSEQ+LV CA N GC GG MD+AF+
Sbjct: 180 PKNQGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFE 239
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
+I++N G+ ++ Y Y+ S C + K H A I + DVP NDE +L KAV+ QPVS
Sbjct: 240 WIVENGGVDSEKQYQYKA-SFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVS 298
Query: 251 VAIDAS--ALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEG---------IKYWLIKNS 298
VAI+A + Q Y GGV++ C T L+HGV VGYG KYW IKNS
Sbjct: 299 VAIEADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNS 358
Query: 299 WGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
W + WGE GY R+ RD++ P G CG+A AS+P
Sbjct: 359 WSEQWGEGGYIRIARDVESPSGMCGVAEMASYP 391
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 186/323 (57%), Gaps = 17/323 (5%)
Query: 24 TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRL 82
T + GS+++ F +W ++G+TY E R +IF DN V++ N G ++ + L
Sbjct: 58 TKEVGSLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGL 117
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
N ADLT EF G+ + +S + + + Y P ++W+ GAVTPVK Q
Sbjct: 118 NHLADLTKDEF-KKMLGYNAALRASRAPVDASTWEYADVTPPEEIDWVASGAVTPVKNQK 176
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC AVEG+NAIK +L+SLSE++L+ C+TN N GC GG MD+ F++I+ N
Sbjct: 177 QCGSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTN-GNMGCNGGLMDNGFEWIVNN 235
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
+GI + + Y C + A I ++DVP NDE+SL+KAV+ QPVSVAI+A
Sbjct: 236 RGIDTEDGWEYVAKEEK-CGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEA 294
Query: 256 S--ALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIK---YWLIKNSWGQDWGEDGYF 309
+ Q Y+GGV++ C T L+HGV VGYG + K +W IKNSWG WGEDGY
Sbjct: 295 DHQSFQLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYI 354
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
R+ + +GQCG+AM S+P
Sbjct: 355 RIAKGGSGVEGQCGVAMQPSYPT 377
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 138/325 (42%), Positives = 195/325 (60%), Gaps = 33/325 (10%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKE-SAENSKRFEIFKDNLVAVERFN-NAAIGN 75
S A DE + + ++ WK+++GR S + R ++F+DNL ++ N A G
Sbjct: 36 SAAPLERADE-EVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGL 94
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGA 134
++ L L F DLT +EF A GF +S+ + +L ++ +P +V+W ++GA
Sbjct: 95 HTFRLGLTPFTDLTLEEFRAHALGFL---NSTLPRVASDRYLPRAGDDLPDAVDWRQQGA 151
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
VT VK Q C AVAA+EGIN I N L+SLSEQ+L+DC T D GC GG M
Sbjct: 152 VTGVKNQLDCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTEDY--GCQGGEMQK 209
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF+++I N GI +A Y + G + G CD+I+ + I +YE+VP NDEE+L KAVANQ
Sbjct: 210 AFQFVIDNGGIDTEADYPFIG-TNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQ 268
Query: 248 PVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
P G+FNG C L+HGVTAVGYG S+ G +W++KNSWG +WGE G
Sbjct: 269 P---------------GIFNGPCGFILDHGVTAVGYG-SDNGEDFWIVKNSWGAEWGESG 312
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
Y R++R++ P G+CGIAM+AS+PV
Sbjct: 313 YIRMKRNVLLPMGKCGIAMYASYPV 337
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 187/312 (59%), Gaps = 19/312 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
+ +W AQ+G + E R+E F+DNL ++ N AA G S+ L LN+FA LT +E
Sbjct: 43 YAEWTAQHGSPI--TNEEEGRYEAFRDNLRYIDEHNAAADAGIHSFRLGLNRFAGLTNEE 100
Query: 93 FIASQTGFKM-SDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC------ 144
+ A+ G ++ S L+ + + +P SV+W EKGAV VK QG+
Sbjct: 101 YRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKGAVGKVKDQGRSCGSAWA 160
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A+AAVE IN I L+SLSEQ+L+DC T+ N GC GG MDDAF++II N GI D
Sbjct: 161 FSAIAAVESINQIVTGELISLSEQELMDCDTS-YNAGCDGGLMDDAFEFIISNGGIDTDE 219
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y+ + CD+ K A I +YED+ N E+SL KAV+NQPVSVAI+A Q
Sbjct: 220 DYPYKARNDS-CDANKRNRKAVTIDDYEDLRMN-EKSLQKAVSNQPVSVAIEAGGRDFQL 277
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
Y G+F G C T L+H T VGYG SE G YW++K S+G WGE GY R++R+I + G
Sbjct: 278 YKSGIFTGTCGTDLDHATTIVGYG-SENGTDYWIVKESYGTSWGESGYARMERNIKETSG 336
Query: 321 QCGIAMFASFPV 332
+CGIAM S+PV
Sbjct: 337 KCGIAMLPSYPV 348
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 148/354 (41%), Positives = 196/354 (55%), Gaps = 41/354 (11%)
Query: 1 MAKYFLIVV--LIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
MA L+VV L+ + + A Y D+G + FE+W A++G+TYK E RF I
Sbjct: 7 MASAVLLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGI 66
Query: 58 FKDNLVAVERFN-----NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
F+DN+ + + ++A+G +N+FADLT EF+A+ TG K +
Sbjct: 67 FRDNVHFIRGYKPQVTYDSAVG-------INQFADLTNDEFVATYTGAKPPHPKEA---- 115
Query: 113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
P P ++W +GAVT VK QG C AVAA+EG+ I+ +L LSE
Sbjct: 116 --PRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSE 173
Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED---- 221
Q+LVDC TN +NGC GG D AF+ + GIT ++ Y YEG G C + +D
Sbjct: 174 QELVDCDTN--SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQ-GKC---RVDDMLFN 227
Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
HAA+I Y VPPNDE L AVA QPV+V IDAS A QFY GVF G C NH VT
Sbjct: 228 HAARIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVT 287
Query: 280 AVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
VGY G KYW+ KNSWG+ WG+ GY L++D+ QP G CG+A+ +P
Sbjct: 288 LVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 341
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 193/327 (59%), Gaps = 27/327 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQE 92
F+ W ++G+ Y AE +R IF+DNL RF +N N SY L L +FADL+ E
Sbjct: 56 FDSWMVKHGKVYGSVAEKERRLTIFEDNL----RFISNRNAENLSYRLGLTQFADLSLHE 111
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC----- 144
+ G + + + YK+S +P SV+W +GAVT VK QG C
Sbjct: 112 YGEVCHGADPRPPRNHVFMTSSD-RYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWA 170
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
V AVEG+N I LV+LSEQ L++C N NNGC GG ++ A+++I++N G+ D
Sbjct: 171 FSTVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMKNGGLGTDN 228
Query: 203 VYSYEGMSTGICDS-IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y+ ++ G+CD +K + I +E++P NDE +L+KAVA+QPV+ ID+S+ Q
Sbjct: 229 DYPYKAVN-GVCDGRLKENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQ 287
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
Y GVF+G C T LNHGV VGYGT E G YWL+KNS G WGE GY ++ R+I P+
Sbjct: 288 LYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGNTWGEAGYMKMARNIANPR 346
Query: 320 GQCGIAMFASFPVSKESAQPSSADKSS 346
G CGIAM AS+P+ S DKSS
Sbjct: 347 GLCGIAMRASYPLK----NSFSTDKSS 369
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 181/321 (56%), Gaps = 28/321 (8%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
D+G + FE+W A++G+TYK E RF IF+DN+ + + + + + +N+F
Sbjct: 12 DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSA--VGINQF 69
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
ADLT EF+A+ TG K + P P ++W +GAVT VK QG C
Sbjct: 70 ADLTNDEFVATYTGAKPPHPKEA------PRPVDPIWTPCCIDWRFRGAVTGVKDQGACG 123
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
AVAA+EG+ I+ +L LSEQ+LVDC TN +NGC GG D AF+ + GI
Sbjct: 124 SCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGGI 181
Query: 199 TNDAVYSYEGMSTGICDSIKAED----HAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
T ++ Y YEG G C + +D HAA I Y VPPNDE L AVA QPV+V ID
Sbjct: 182 TAESDYRYEGFQ-GKC---RVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYID 237
Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRL 311
AS A QFY GVF G C NH VT VGY G KYWL KNSWG+ WG+ GY L
Sbjct: 238 ASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILL 297
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
++DI QP G CG+A+ +P
Sbjct: 298 EKDIVQPHGTCGLAVSPFYPT 318
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 197/341 (57%), Gaps = 28/341 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
L+ V II+ S A + +FD E++ +KA +G+TYK E R +IF DN
Sbjct: 4 LLVAVAIIALSYA----HPSFD--IYPEEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKK 57
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
+E N G SY + +N F DL EF A GFKMS + K NG + +S +
Sbjct: 58 IEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGFKMSPDT---KRNGELYFPSNSNL 114
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +V+W +KGAVTPVK QGQC A ++EG +K +LVSLSEQ LVDC+T+
Sbjct: 115 PKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYG 174
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
NNGC GG MD AF+Y+ NKGI +A Y YE C K + D+P D
Sbjct: 175 NNGCEGGLMDQAFQYVSDNKGIDTEASYPYEAREN-TC-RFKKNKVGGTDKGHVDIPAGD 232
Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEGIK 291
E++L A+A P+SVAIDA+ + QFYS GV+N C ++ L+HGV AVGYGT E G
Sbjct: 233 EKALQNALATVGPISVAIDANHGSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGT-ENGQD 291
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG WGE+GY ++ R+ CGIA AS+P+
Sbjct: 292 YWLVKNSWGPSWGENGYIKIARNHSN---HCGIASMASYPL 329
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 26/318 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN---NAAIGN---RSYTLRLNKFAD 87
F+ W A++G+ Y E + R +F DN V N NAA G SYTL LN FAD
Sbjct: 41 FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFAD 100
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-----SQVPPSVNWIEKGAVTPVKYQG 142
LT +EF A++ G +++ +++L++ P +Y+ VP +++W E GAVT VK QG
Sbjct: 101 LTHEEFRAARLG-RIAAGAAALRSPAAP-VYRGLDGGLGAVPDALDWRENGAVTKVKDQG 158
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C A A+EGIN IK LVSLSEQ+L+DC N+GC GG MD A+K++++N
Sbjct: 159 SCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDC-DRSYNSGCGGGLMDYAYKFVVKN 217
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI-- 253
GI + Y Y + G C+ K + I Y DVP N E+ LL+AVA QPVSV I
Sbjct: 218 GGIDTEEDYPYR-EADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICG 276
Query: 254 DASALQFYSG-GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
A A Q YS G+F+G C T L+H V VGYG SE G YW++KNSWG+ WG GY +
Sbjct: 277 SARAFQLYSQQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGESWGMKGYMHMH 335
Query: 313 RDIDQPQGQCGIAMFASF 330
R+ +G CGI M ASF
Sbjct: 336 RNTGDSKGVCGINMMASF 353
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 175/311 (56%), Gaps = 19/311 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E+W ++ + Y E RF+IFKDNL ++ N N SY + LNKFAD+ +E+
Sbjct: 4 YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ---NYSYKVGLNKFADINNEEY 60
Query: 94 IASQTGFKMSDHSSSLKA--NGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
G K +K G Y S V V+W KGAVT +K QG C
Sbjct: 61 RDMYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFS 120
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
+A VE IN I + VSLSEQ+LVDC N GC GG MD AF++II+N GI D Y
Sbjct: 121 TIATVEAINKIVTGKFVSLSEQELVDC-DRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDY 179
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYS 262
Y G CD K I YEDVP +L KAVA+QPVSVAI ALQ Y
Sbjct: 180 PYNGFERK-CDPTKKNAKVVSIDGYEDVPSY-MNALKKAVAHQPVSVAIAGLGRALQLYQ 237
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL-QRDIDQPQGQ 321
GVF G C T L+HGV VGYG SE G+ YWL++NSWG +WGEDGYF++ R++ +
Sbjct: 238 SGVFTGKCGTDLDHGVVVVGYG-SENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRK 296
Query: 322 CGIAMFASFPV 332
CGIAM AS+PV
Sbjct: 297 CGIAMEASYPV 307
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 24/319 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F W+ + R+Y + E +RF++++ N ++ N G+ +Y L N+FADLT
Sbjct: 43 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVN--LRGDLTYQLAENEFADLT 100
Query: 90 PQEFIASQTGFKMSDHS--SSLKANGT-----PFLYKSSQVPPSVNWIEKGAVTPVKYQ- 141
+EF+A+ TG+ D S+ G F Y+ VP SV+W +GAV P K Q
Sbjct: 101 EEEFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRV-DVPASVDWRAQGAVVPPKSQT 159
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
C+ A +E +N IK +LVSLSEQQLVDC + D GC G A+K++++
Sbjct: 160 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVE 217
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N G+T +A Y Y G C+ K+ HAA+IT + VPP +E +L AVA QPV+VAI+
Sbjct: 218 NGGLTTEADYPYTARR-GPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIE 276
Query: 255 -ASALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
S +QFY GGV+ G C T L H VT VGYGT + G KYW IKNSWGQ WGE GY R+
Sbjct: 277 VGSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 336
Query: 313 RDIDQPQGQCGIAMFASFP 331
RD+ P G CG+ + ++P
Sbjct: 337 RDVGGP-GLCGVTLDIAYP 354
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 24/319 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F W+ + R+Y + E +RF++++ N ++ N G+ +Y L N+FADLT
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVN--LRGDLTYRLAENEFADLT 104
Query: 90 PQEFIASQTGFKMSDHS--SSLKANGT-----PFLYKSSQVPPSVNWIEKGAVTPVKYQ- 141
+EF+A+ TG+ D S+ G F Y+ VP SV+W +GAV P K Q
Sbjct: 105 EEEFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRV-DVPASVDWRAQGAVVPPKSQT 163
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
C+ A +E +N IK +LVSLSEQQLVDC + D GC G A+K++++
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVE 221
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N G+T +A Y Y G C+ K+ HAA+IT + VPP +E +L AVA QPV+VAI+
Sbjct: 222 NGGLTTEADYPYTARR-GPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIE 280
Query: 255 -ASALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
S +QFY GGV+ G C T L H VT VGYGT + G KYW IKNSWGQ WGE GY R+
Sbjct: 281 VGSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 340
Query: 313 RDIDQPQGQCGIAMFASFP 331
RD+ P G CG+ + ++P
Sbjct: 341 RDVGGP-GLCGVTLDIAYP 358
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 119/231 (51%), Positives = 154/231 (66%), Gaps = 15/231 (6%)
Query: 114 TPFLYKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSL 163
T F Y++ V P +++W GAVTP+K QGQC AVAA EGI I +L+SL
Sbjct: 4 TGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISL 63
Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
SEQ+LVDC + GC GG MDDAFK+II+N G+T ++ Y Y + G C S + A
Sbjct: 64 SEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTA-ADGKCKS--GSNSA 120
Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAV 281
A I YEDVP NDE +L+KAVANQPVSVA+D + QFYSGGV G C T L+HG+ A+
Sbjct: 121 ANIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 180
Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GYG + +G KYWL+KNSWG WGE+GY R+++DI +G CG+A+ S+P
Sbjct: 181 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPT 231
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 24/319 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F W+ + R+Y + E +RF++++ N ++ N G+ +Y L N+FADLT
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVN--LRGDLTYQLAENEFADLT 104
Query: 90 PQEFIASQTGFKMSDHS--SSLKANGT-----PFLYKSSQVPPSVNWIEKGAVTPVKYQ- 141
+EF+A+ TG+ D S+ G F Y+ VP SV+W +GAV P K Q
Sbjct: 105 EEEFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRV-DVPASVDWRAQGAVVPPKSQT 163
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
C+ A +E +N IK +LVSLSEQQLVDC + D GC G A+K++++
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVE 221
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N G+T +A Y Y G C+ K+ HAA+IT + VPP +E +L AVA QPV+VAI+
Sbjct: 222 NGGLTTEADYPYTARR-GPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIE 280
Query: 255 -ASALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
S +QFY GGV+ G C T L H VT VGYGT + G KYW IKNSWGQ WGE GY R+
Sbjct: 281 VGSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 340
Query: 313 RDIDQPQGQCGIAMFASFP 331
RD+ P G CG+ + ++P
Sbjct: 341 RDVGGP-GLCGVTLDIAYP 358
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 181/321 (56%), Gaps = 28/321 (8%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
D+G + FE+W A++G+TYK E RF IF+DN+ + + + + + +N+F
Sbjct: 12 DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSA--VGINQF 69
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
ADLT EF+A+ TG K + P P ++W +GAVT VK QG C
Sbjct: 70 ADLTNDEFVATYTGAKPPHPKEA------PRPVDPIWTPCCIDWRFRGAVTGVKDQGACG 123
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
AVAA+EG+ I+ +L LSEQ+LVDC TN +NGC GG D AF+ + GI
Sbjct: 124 SCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGGI 181
Query: 199 TNDAVYSYEGMSTGICDSIKAED----HAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
T ++ Y YEG G C + +D HAA I Y VPPNDE L AVA QPV+V ID
Sbjct: 182 TAESDYRYEGFQ-GKC---RVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYID 237
Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRL 311
AS A QFY GVF G C NH VT VGY G KYW+ KNSWG+ WG+ GY L
Sbjct: 238 ASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILL 297
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
++D+ QP G CG+A+ +P
Sbjct: 298 EKDVLQPHGTCGLAVSPFYPT 318
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 132/261 (50%), Positives = 168/261 (64%), Gaps = 16/261 (6%)
Query: 84 KFADLTPQEFIASQTGFKM-SDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVK 139
+FA++T EF + TG+K S SS + T F Y+ S +P +V+W +KGAVTP+K
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QG C AVAA+EG IK +L+SLSEQQLVDC TND GC GG +D AF++I
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDF--GCSGGLIDTAFEHI 118
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
+ G+T ++ Y Y+G C AA IT YEDVP NDE +L+KAVA+QPVSV
Sbjct: 119 MATGGLTTESNYPYKG-EDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVG 177
Query: 253 IDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
I+ QFYS GVF G C T+L+H VTAVGY S G KYW+IKNSWG WGE GY R
Sbjct: 178 IEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMR 237
Query: 311 LQRDIDQPQGQCGIAMFASFP 331
+++DI +G CG+AM AS+P
Sbjct: 238 IKKDIKDKEGLCGLAMKASYP 258
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 187/312 (59%), Gaps = 25/312 (8%)
Query: 39 AQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT 98
A+YGR YK++ E +RF+IFK+N+ +E FNN GN SYTL +NKF D+T EF+A T
Sbjct: 2 AEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRN-GN-SYTLGINKFTDMTNNEFVAQYT 59
Query: 99 GFKMSDHSSSLKANGTPFL----YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVA 147
G S L P + S V S++W + GAVT VK Q C A+A
Sbjct: 60 G----GISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIA 115
Query: 148 AVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYE 207
VEGI I LVSLSEQ+++DCA + NGC GGF+D+A+ +II N G+ ++A Y Y+
Sbjct: 116 TVEGIYKIVTGYLVSLSEQEVLDCAVS---NGCDGGFVDNAYDFIISNNGVASEADYPYQ 172
Query: 208 GMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGV 265
G C + + ++A IT Y V NDE S+ AV NQP++ AIDAS Q+Y+GGV
Sbjct: 173 AYQ-GDC-AANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGV 230
Query: 266 FNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
F+G C T LNH +T +GYG G +YW++KNSWG WGE GY R+ R + G CGIA
Sbjct: 231 FSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSS-SGLCGIA 289
Query: 326 MFASFPVSKESA 337
M +P + A
Sbjct: 290 MDPLYPTLQSGA 301
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 173/309 (55%), Gaps = 47/309 (15%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E W ++G++Y E +RFEIFKDNL +E N NR+Y
Sbjct: 4 YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV---NRTY--------------- 45
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
K+ D S +P SV+W EKGAV PVK QG C +
Sbjct: 46 -------KVGDRYS---------FRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTI 89
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
AAVEGIN I L+SLSEQ+LVDC N GC GG MD AF++II N GI ++ Y Y
Sbjct: 90 AAVEGINQIATGDLISLSEQELVDC-DKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPY 148
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGG 264
T CD + I YEDVP NDE SL KAVANQPVSVAI+A A Q Y G
Sbjct: 149 RAADT-TCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSG 207
Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCG 323
VF G C T L+HGV AVGYGT E + YW+++NSWG +WGE GY +L+R++ + G+CG
Sbjct: 208 VFTGQCGTQLDHGVVAVGYGT-ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCG 266
Query: 324 IAMFASFPV 332
IA+ S+P+
Sbjct: 267 IAIEPSYPI 275
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 137/330 (41%), Positives = 189/330 (57%), Gaps = 34/330 (10%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++FEQW ++GR Y ++ E +RFE+++ N+ VE FN+ + G Y L NKFADLT
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNG---YKLADNKFADLT 83
Query: 90 PQEFIASQTGFK----MSDHSSSLKAN-GTPFLYKSSQVPPSVNWIEKGAVTPVKYQ--- 141
+EF A GF+ + S++ A+ P +P SV+W KGAV +++
Sbjct: 84 NEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVIN-RWKICV 142
Query: 142 --GQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
G C AVAA+EGIN IK LVSLSEQ+LVDC +D GC GG+M AF++++ N
Sbjct: 143 DAGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDC--DDEAVGCGGGYMSWAFEFVVGN 200
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
G+T +A Y Y + G C + K A I Y +V P+ E L +A A QPVSVA+D
Sbjct: 201 HGLTTEASYPYHA-ANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDG 259
Query: 256 SALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIK----------YWLIKNSWGQDW 303
+ F Y GV+ G C +NHGVT VGYG SE YW++KNSWG +W
Sbjct: 260 GSFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEW 319
Query: 304 GEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
G+ GY +QRD+ G CGIA+ S+PV
Sbjct: 320 GDAGYILMQRDVAGLASGLCGIALLPSYPV 349
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 138/306 (45%), Positives = 182/306 (59%), Gaps = 23/306 (7%)
Query: 37 WKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIA 95
+K+ Y ++Y+ A +KR F+ NL + + N A G SYT+ +N+FADLT EF+A
Sbjct: 1 FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60
Query: 96 SQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAA 148
+ S + ++ N T +L +S+ SV+W KGAVTP+K QGQC +
Sbjct: 61 L---YVPSKFNRTMPYN-TVYLPATSE--DSVDWRTKGAVTPIKNQGQCGSCWSFSTTGS 114
Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
EG +AI LVSLSEQQLVDC+ + N GC GG MDDAFKYII NKG+ + Y Y
Sbjct: 115 TEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTA 174
Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVF 266
G C+ K HAA I++Y DVP N+E+ L AVA PVSVAI+A S Q Y GVF
Sbjct: 175 QD-GTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVF 233
Query: 267 NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAM 326
+G C T L+HGV VGY YW++KNSWG WG +GY ++R + G CGIAM
Sbjct: 234 DGNCGTNLDHGVLVVGYTDD-----YWIVKNSWGTTWGVEGYINMKRGV-SASGICGIAM 287
Query: 327 FASFPV 332
S+P+
Sbjct: 288 QPSYPI 293
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 135/326 (41%), Positives = 191/326 (58%), Gaps = 30/326 (9%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F W+A + ++Y+ + E +RF++++DN+ +E N G+ +Y L N+FADLT
Sbjct: 38 MMDRFLMWQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRR--GDLTYQLGENQFADLT 95
Query: 90 PQEFIASQTGFKM---------SDHSSSLKANGTPFLYKS-----SQVPPSVNWIEKGAV 135
+EFIA T + S +++ G P L+ S S PPSV+W KGAV
Sbjct: 96 REEFIARFTSYNGDDDRTGDDDSVITTAAVGGGDPDLWSSGGDDVSLDPPSVDWRAKGAV 155
Query: 136 TPVKYQGQ--------CAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
P K Q AVA +E ++AIK +LV+LSEQQLVDC D GC G
Sbjct: 156 VPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCDQYDG--GCNRGTFRR 213
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF ++IQN G+T +A Y Y + G C+S K++ H A I+ + VP ++E ++ AVA Q
Sbjct: 214 AFHWVIQNGGLTTEAEYPYTA-AQGTCNSAKSDHHVAAISGHASVPGSNELAMKHAVATQ 272
Query: 248 PVSVAID-ASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE-GIKYWLIKNSWGQDWGE 305
PV+ AI+ S +QFY GV++G C L H VT VGYG E G KYW++KNSWGQ WGE
Sbjct: 273 PVAAAIELGSDMQFYKSGVYSGPCGARLEHAVTVVGYGADESTGDKYWIVKNSWGQTWGE 332
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFP 331
GY R+QR I P G CGI + ++P
Sbjct: 333 RGYIRMQRKILGP-GLCGIMLDVAYP 357
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 172/315 (54%), Gaps = 19/315 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR------SYTLRLNKFAD 87
FE W A++G+ Y E + R F DN V N G SYTL LN FAD
Sbjct: 42 FEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFAD 101
Query: 88 LTPQEFIASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
LT EF A++ G + + G VP +++W + GAVT VK QG C
Sbjct: 102 LTHAEFRAARLGRLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSCGA 161
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
A A+EGIN IK L+SLSEQ+L+DC N GC GG MD A++++I+N GI
Sbjct: 162 CWSFSATGAIEGINKIKTGSLISLSEQELIDC-DRSYNAGCGGGLMDYAYRFVIKNGGID 220
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASA 257
+ Y Y + G C+ K + H I Y DVP N E+SLL+AVA QP+SV I A A
Sbjct: 221 TEDDYPYR-EADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSARA 279
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Q YS G+F+G C T L+H V VGYG SE G YW++KNSWG+ WG GY + R+
Sbjct: 280 FQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGS 338
Query: 318 PQGQCGIAMFASFPV 332
G CGI M ASFP
Sbjct: 339 SSGICGINMMASFPT 353
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 189/335 (56%), Gaps = 31/335 (9%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIG-NRSYTLRLNK 84
D+ S+ E+F++WKA Y ++Y AE +RF ++ N+ +E N A +Y L
Sbjct: 42 DDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTYELGETA 101
Query: 85 FADLTPQEFIASQTG---FKMSDHSSSLKANGTP-------------FLYKSSQVPPSVN 128
+ DLT QEF+A T ++ S + P ++ S+ P SV+
Sbjct: 102 YTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSASAPASVD 161
Query: 129 WIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
W GAVTPVK QG+C VA VEGI I+ +LVSLSEQ+LVDC T D+ GC
Sbjct: 162 WRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD--GCD 219
Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
GG A ++I N GIT +A Y Y G +T C+ K +A I V E SL
Sbjct: 220 GGISYRALRWIASNGGITTEADYPYTG-TTDACNRAKLSHNAVSIAGLRRVATRSEASLA 278
Query: 242 KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNS 298
AVA QPV+V+I+A Q Y GV+NG C T LNHGVT VGYG + G +YW++KNS
Sbjct: 279 NAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNS 338
Query: 299 WGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
WGQ WG+DGY R+++D+ +P+G CGIA+ S+P+
Sbjct: 339 WGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 183/323 (56%), Gaps = 36/323 (11%)
Query: 31 AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR---SYTLRLNKFAD 87
+E FE+W ++ +TY E R ++F+DN V + N A N SYTL LN FAD
Sbjct: 30 SELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFAD 89
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ---------VPPSVNWIEKGAVTPV 138
LT EF ++ G + T +K Q +P ++W + GAVTPV
Sbjct: 90 LTHHEFKTTRLGLPL-----------TLLRFKRPQNQQSRDLLHIPSQIDWRQSGAVTPV 138
Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K Q C A A+EGIN I LVSLSEQ+L+DC T+ N+GC GG MD A+++
Sbjct: 139 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTS-YNSGCGGGLMDFAYQF 197
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
+I NKGI + Y Y+ C K + A I +Y DVPP++EE +LKAVA+QPVSV
Sbjct: 198 VIDNKGIDTEDDYPYQARQRS-CSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSV 255
Query: 252 AIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
I S Q YS G+F G C TFL+H V VGYG SE G+ YW++KNSWG+ WG +GY
Sbjct: 256 GICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYG-SENGVDYWIVKNSWGKYWGMNGYI 314
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
+ R+ +G CGI AS+PV
Sbjct: 315 HMIRNSGNSKGICGINTLASYPV 337
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 183/330 (55%), Gaps = 28/330 (8%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR--------- 76
D +I +F+ W A++G+ Y E + R +F DN V N A N
Sbjct: 28 DPPAIEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAA 87
Query: 77 --SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF---LYKSSQVPPSVNWIE 131
SYTL LN FADLT +EF A++ G ++L++ P L + VP +++W +
Sbjct: 88 PPSYTLALNAFADLTHEEFRAARLGRIAP--GAALRSRAAPVYWGLGGGAAVPDALDWRK 145
Query: 132 KGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
GAVT VK QG C A A+EGIN IK LVSLSEQ+L+DC N+GC GG
Sbjct: 146 SGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDC-DRSYNSGCGGGL 204
Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV 244
MD A+K++I+N GI + Y Y + G C+ K + I Y DVP N E+ LL+AV
Sbjct: 205 MDYAYKFVIKNGGIDTEEDYPYR-EADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAV 263
Query: 245 ANQPVSVAI--DASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
A QPVSV I A A Q Y G+F+G C T L+H V VGYG SE G YW++KNSWG+
Sbjct: 264 AQQPVSVGICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGES 322
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WG GY + R+ +G CGI M ASFP
Sbjct: 323 WGMKGYMHMHRNTGDSKGVCGINMMASFPT 352
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 143/345 (41%), Positives = 204/345 (59%), Gaps = 29/345 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
KY ++++ + + ++ FDE +++WK ++G+ Y E + R I++ NL
Sbjct: 2 KYLSVLLVAVCVVSSLSMSFTDFDE-----DWKEWKNEHGKRYLSDEEEASRRLIWQKNL 56
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
V R N +G+ +Y L +N+FADL +EF+A TGF++ + +S A G+ FL ++
Sbjct: 57 DIVIRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFRV--NGTSKAAKGSTFLPPNN 114
Query: 122 --QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
++P +V+W KG VTPVK QGQC A ++EG + K +LVSLSEQ LVDC+
Sbjct: 115 VGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCS 174
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
D N GC GG MD AF+YII GI + Y Y M G C K + A +T Y DV
Sbjct: 175 --DKNYGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMD-GNC-HFKTANVGATVTGYTDV 230
Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTSE 287
E++L KAVA+ P+SVAIDAS + Q Y GV+N G T L+HGV AVGYGT+
Sbjct: 231 TSGSEKALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTI 290
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G YW++KNSW + WG +GY + R+ D QCGIA AS+P+
Sbjct: 291 DGTDYWIVKNSWAETWGMNGYIWMSRNKDN---QCGIATQASYPL 332
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 194/334 (58%), Gaps = 27/334 (8%)
Query: 23 RTFDEGSIAEKFEQWKAQYGRTYKESA----------ENSKRFEIFKDNLVAVERFN-NA 71
RT +E + +E+W++++ + A ++++R E+F+ NL ++ N A
Sbjct: 44 RTDEE--VRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEA 101
Query: 72 AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP--FLYKSSQVPPSVNW 129
G + L L +FADLT +E+ A + +++ G+ Q+P +V+W
Sbjct: 102 DAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVVGSRRYLPLAGEQLPDAVDW 161
Query: 130 IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
E+GAV VK QGQC AVAAVEGIN I L+SLSEQ+L+DC + GC G
Sbjct: 162 RERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDC-DKFQDQGCDG 220
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
G MD+AF ++I+N GI +A Y + G G CD I ++E VP N E +L K
Sbjct: 221 GLMDNAFVFMIKNGGIDTEADYPFTGHD-GTCDLKLKNTRVVSIDSFERVPINYERALQK 279
Query: 243 AVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
AVA+QPVS +I+AS A Q YS G+F+G C T+L+HGVT VGYG SE G YW++KNSWG
Sbjct: 280 AVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYG-SEGGKDYWIVKNSWG 338
Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
WGE GY R+ R++ G+CGIAM +PV +
Sbjct: 339 TQWGEAGYVRMARNVRVRAGKCGIAMEPLYPVKE 372
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 141/365 (38%), Positives = 198/365 (54%), Gaps = 40/365 (10%)
Query: 5 FLIVVLIISGSCASQATYR---------TFDEGSIAEKFEQWKAQYGRTYKESAENSKRF 55
L+++ + C+S +R + D+ S+ E+F++WKA Y ++Y AE +RF
Sbjct: 12 VLLLLAVFHHGCSSARAHRRAGDMERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRF 71
Query: 56 EIFKDNLVAVERFNNAAIG-NRSYTLRLNKFADLTPQEFIASQTG---FKMSDHSSSLKA 111
+ N+ +E N A +Y L + DLT QEF+A T ++ S +
Sbjct: 72 RVCARNMAYIEATNAEAEAAGLTYELGETAYTDLTNQEFMAMYTAPAPAQLPADESVITT 131
Query: 112 NGTP-------------FLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEG 151
P ++ S+ P SV+W GAVTPVK QG+C VA VEG
Sbjct: 132 RAGPVDAVGGAPGQLPVYVNLSTSAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEG 191
Query: 152 INAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST 211
I I+ +LVSLSEQ+LVDC T D+ GC GG A ++I N GIT + Y Y G +T
Sbjct: 192 IYQIRTGKLVSLSEQELVDCDTLDD--GCDGGISYRALRWIASNGGITTETDYPYTG-TT 248
Query: 212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGY 269
C+ K +A I V E SL AVA QPV+V+I+A Q Y GV+NG
Sbjct: 249 DACNRAKLSHNAVSIAGLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGP 308
Query: 270 CETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMF 327
C T LNHGVT VGYG + G +YW++KNSWGQ WG+DGY R+++D+ +P+G CGIA+
Sbjct: 309 CGTNLNHGVTVVGYGQEAAGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIR 368
Query: 328 ASFPV 332
S+P+
Sbjct: 369 PSYPL 373
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 141/354 (39%), Positives = 190/354 (53%), Gaps = 34/354 (9%)
Query: 9 VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
V + GS A+ T D +A++F +WKA++ RTY E R ++ N+ +E
Sbjct: 18 VFFLHGSSATSRP-ATEDADPMAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEAT 76
Query: 69 NNAAIGNRSYTLRLNKFADLTPQEFIASQTGF--KMSDHSSSLKANGTP----------- 115
N A +Y L + DLT EF A T +SD L
Sbjct: 77 NGDAGAGLTYELGETAYTDLTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGG 136
Query: 116 ------FLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVS 162
++ +S+ P SV+W E+GAVT VK QGQC VA +EGI+ IK +L S
Sbjct: 137 GGWLQVYVNESAGAPASVDWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLAS 196
Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
LSEQ+LVDC D +GC GG A ++I N GIT+ Y Y CD+ K H
Sbjct: 197 LSEQELVDCDKLD--HGCNGGVSYRALQWITSNGGITSQDDYPYTAKDD-TCDTKKLSHH 253
Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTA 280
AA I+ ++ V E SL AVA QPV+V+I+A Q Y GV+NG C T LNHGVT
Sbjct: 254 AASISGFQRVATRSELSLTNAVAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTV 313
Query: 281 VGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQRD-IDQPQGQCGIAMFASFPV 332
VGYG E G YW++KNSWG+ WG++GY R+++ ID+P+G CGIA+ SFP+
Sbjct: 314 VGYGEDEVTGESYWIVKNSWGEKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 201/348 (57%), Gaps = 25/348 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEG---SIAEKFEQWKAQYGRTYKESAENSKRFEI 57
M LI+++++ + + A ++G I FE W A++G++Y E ++R I
Sbjct: 5 MIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLMI 64
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG-FKMSDHSSSLKANGTPF 116
F D L +E+ N A N ++TL LNKF+DLT EF A G FK + L A
Sbjct: 65 FSDTLAYIEKHN--AQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAEDEDV 122
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
S +P S++W +KGAVTP+K QG C A+A++E + + LVSLSEQQL+
Sbjct: 123 --DVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLM 180
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE--DHAAQIT 227
DC T D GC GG M+ AFK++++N G+T +A Y Y G S G C++ K + A+IT
Sbjct: 181 DCDTVDA--GCDGGLMETAFKFVVKNGGVTTEASYPYTG-SVGSCNANKVAIINKVAEIT 237
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGT 285
++ V + ++L+KAV+ PV+V+I S F Y G+ +G C L+HGV +GYGT
Sbjct: 238 GFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYGT 297
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
E G+ YW+IKNSWG WGEDG+ +++R G CG+ +S+P +
Sbjct: 298 -EGGMPYWIIKNSWGTSWGEDGFMKIER--KDGDGICGMNGDSSYPTT 342
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 143/345 (41%), Positives = 203/345 (58%), Gaps = 29/345 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
KY ++++ + + ++ FDE + QWK ++G+ Y E + R I++ NL
Sbjct: 2 KYLSVLLVAVCVVSSLSMSFTDFDE-----DWNQWKNEHGKRYLSDEEEASRKLIWEKNL 56
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
V + N +G+ +Y L +N+FADL +EF+A TGF++ + +S A G+ FL ++
Sbjct: 57 DIVIKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFRV--NGTSKAAKGSTFLPSNN 114
Query: 122 --QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
++P +V+W KG VTPVK QGQC A ++EG K +LVSLSEQ LVDC+
Sbjct: 115 VDKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCS 174
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N GC+GGFMD AF+YII GI +A YSY + G C KA + A +T Y DV
Sbjct: 175 YR--NYGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVD-GNCHFKKA-NVGATVTGYTDV 230
Query: 233 PPNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVFN--GYCETFLNHGVTAVGYGTSE 287
E++L KAVA+ P+SVAIDAS +FY GV+N G T L H V VGYGT+
Sbjct: 231 TSGSEKALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTS 290
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G YW++KNSW + WG +GY + R+ D QCGIA AS+P+
Sbjct: 291 DGTDYWIVKNSWAKTWGMNGYLWMSRNKDN---QCGIASEASYPM 332
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 233 bits (594), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 177/316 (56%), Gaps = 20/316 (6%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI----GNRSYTLRLNKFADL 88
+FE W A++G+ Y E + R F +N V N+A G SYTL LN FADL
Sbjct: 38 QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTP---FLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
T EF A++ G +++ L A F + VP +++W + GAVT VK QG C
Sbjct: 98 THDEFRAARLG-RLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
A A+EGIN I L+SLSEQ+L+DC N GC GG M A+K++I+N GI
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDC-DRSYNTGCGGGLMTYAYKFVIKNGGI 215
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DAS 256
+ Y + + G C+ K + H I Y++VP + E+ LL+AVA QP+SV I A
Sbjct: 216 DTEDDYPFR-EADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSAR 274
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
A Q YS G+F+G C T L+H V VGYG SE G YW++KNSWG+ WG GY + R+
Sbjct: 275 AFQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333
Query: 317 QPQGQCGIAMFASFPV 332
G CGI M ASFP
Sbjct: 334 SSSGICGINMMASFPT 349
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 233 bits (594), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 195/343 (56%), Gaps = 28/343 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K L+ V +I+ SCA+ R ++ E++E +K +G+ YK E R +IF +N
Sbjct: 2 KVLLVAVAVIAVSCAN----RFYNIN--PEEWETFKVVHGKNYKNQFEEMFRRKIFMNNK 55
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
+E N G SY +++N F DL E A GFKM+ ++ K G + +
Sbjct: 56 KRIEAHNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKMTPNT---KREGKIYFPSND 112
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P SV+W +KGAVTPVK QGQC A ++EG +K +LVSLSEQ L+DC+
Sbjct: 113 KLPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSKE 172
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
NNGC GG MD AF+Y+ NKGI ++ Y YE K + Y D+P
Sbjct: 173 YGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYAC--RFKKDKVGGTDKGYVDIPE 230
Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEG 289
DE++L A+A P+SVAIDAS + FYS GV+N YC ++ L+HGV AVGYGT E G
Sbjct: 231 GDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYGT-ENG 289
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG WGE GY ++ R+ CGIA AS+P+
Sbjct: 290 QDYWLVKNSWGPSWGESGYIKIARN---HSNHCGIASMASYPI 329
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 132/307 (42%), Positives = 180/307 (58%), Gaps = 21/307 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W ++ R Y E RFEIFKDNL+ ++ N N SY L LN+F DLT EF
Sbjct: 48 FESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDETNKK---NNSYWLGLNEFVDLTHDEF 104
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQ--GQC----AV 146
G D + ++N F YK P S++W +KGAVTPVK G C V
Sbjct: 105 KEKYVGSIGEDFVTIEQSNDEEFPYKHVVDYPESIDWRDKGAVTPVKPNPCGSCWAFSTV 164
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
A VEGIN I +L+SLSEQ+L+DC + ++GC GG+ + +Y++ N G+ + Y Y
Sbjct: 165 ATVEGINKIVTGKLISLSEQELLDC--DRRSHGCKGGYQTTSLQYVVDN-GVHTEKEYPY 221
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGG 264
E G C + + + QIT Y+ VP NDE SL++A+ANQPVSV +++ A Q Y GG
Sbjct: 222 E-KKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGG 280
Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
+FNG C T L+H VTA+GYG + Y LIKNSWG +WGE GY +++R + +G CG+
Sbjct: 281 IFNGPCGTKLDHAVTAIGYGKT-----YILIKNSWGPNWGEKGYLKIKRASGKSEGTCGV 335
Query: 325 AMFASFP 331
+ FP
Sbjct: 336 YKSSYFP 342
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 203/343 (59%), Gaps = 26/343 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+L V+L+ +C + +F + E + QWK ++G+ Y E + R I++ NL
Sbjct: 3 YLSVLLV--AACVVSSLSMSFTD--FDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDI 58
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-- 121
V + N +G+ +Y L +N+FADL +EF+A TGF++ + +S A G+ FL ++
Sbjct: 59 VIKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFRV--NGTSKAAKGSTFLPSNNIG 116
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P +V+W KG VTPVK QGQC ++EG + +LVSLSEQ LVDC+
Sbjct: 117 ELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGK 176
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ N GC GG MD AF+YII+ GI + Y Y+ + G C KA + A +T Y DV
Sbjct: 177 EGNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVD-GECHFKKA-NIGATVTGYTDVTS 234
Query: 235 NDEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN--GYCETFLNHGVTAVGYGTSEEG 289
+ E +L KAVA+ P+SVAIDAS + Q Y GV+N T L+HGV AVGYGT+ +G
Sbjct: 235 DSETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDG 294
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW++KNSW + WG +GY + R+ D QCGIA AS+P+
Sbjct: 295 TDYWIVKNSWAETWGMNGYLWMSRNKDN---QCGIATQASYPL 334
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 130/331 (39%), Positives = 189/331 (57%), Gaps = 34/331 (10%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
D+ + ++F +W+A + RTY ++ E +RF++++ N+ +E N G +Y L N+
Sbjct: 50 LDDMLMLDRFVRWQAAHNRTYGDAEERLRRFQVYRANIEYIEATNRR--GGLTYELGENQ 107
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--------------SQVPPSVNWI 130
FADLT +EF++ S + + +A+ L + + PPS +W
Sbjct: 108 FADLTSEEFLS----MYASSYDAGDRADDEAALITTDVAGDGAWSDGDLEALPPPSWDWR 163
Query: 131 EKGAVTPVKYQGQ--------CAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
KGAVTP K QG VA +EG+ IK +L+SLSEQQLVDC D GC
Sbjct: 164 AKGAVTPPKNQGPTCSSCWAFVTVATIEGLTFIKTGKLISLSEQQLVDCDMYDG--GCNT 221
Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
G F+++++N G+T +A Y Y + G C+ K+ HAA+IT +PP +E + K
Sbjct: 222 GSYSRGFRWVLENGGLTTEAEYPYTA-ARGPCNRAKSAHHAAKITGQGRIPPQNELVMQK 280
Query: 243 AVANQPVSVAID-ASALQFYSGGVFNGYCETFLNHGVTAVGYGTSE-EGIKYWLIKNSWG 300
AVA QPV VAI+ S +QFY GV++G C T L H VT VGYG G KYW++KNSWG
Sbjct: 281 AVAGQPVGVAIEVGSGMQFYKTGVYSGPCGTNLAHAVTVVGYGVDPASGAKYWIVKNSWG 340
Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
Q WGE G+ R++RD+ P G CGIA+ ++P
Sbjct: 341 QAWGERGFIRMRRDVGGP-GLCGIALDVAYP 370
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 138/317 (43%), Positives = 186/317 (58%), Gaps = 26/317 (8%)
Query: 34 FEQWKAQYGRTY-KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
F++W + R+Y + AE RF+++ +NL V +N S+ L LN ADL+ E
Sbjct: 13 FKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTT---SHWLTLNHLADLSTPE 69
Query: 93 FIASQTGFKMSDHSSSLKANG--TPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA-- 145
+ + GF D+ + + N T F Y+ + +PP+++W +K AV VK QGQC
Sbjct: 70 YKSKLLGF---DNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSC 126
Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
+VEGINAI LVSLSEQ+LVDC T + + GC GG MD A+ +II+NKGI
Sbjct: 127 WAFATTGSVEGINAIVTGSLVSLSEQELVDCDT-EQDKGCSGGLMDYAYAWIIKNKGINT 185
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASAL 258
+ Y Y M G CD K + I +YEDVP NDE +L KA A+QPV+VAI DA +
Sbjct: 186 EEDYPYTAMD-GQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSF 244
Query: 259 QFYSGGVFNG-YCETFLNHGVTAVGYG--TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
Q Y GGV++ C T LNHGV VGYG + G YW++KNSWG +WG+ GY RL+
Sbjct: 245 QLYGGGVYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGS 304
Query: 316 DQPQGQCGIAMFASFPV 332
+G CGIAM S+PV
Sbjct: 305 TDAEGLCGIAMAPSYPV 321
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 175/310 (56%), Gaps = 19/310 (6%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+FE W A++GR+Y E + R F DN V N A SY L LN FADLT E
Sbjct: 37 QFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPA---SYALALNAFADLTHDE 93
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPSVNWIEKGAVTPVKYQGQC----- 144
F A++ G + + G P+L VP +V+W + GAVT VK QG C
Sbjct: 94 FRAARLGRLAAAGGPG-RDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWS 152
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A A+EGIN IK L+SLSEQ+L+DC N+GC GG MD A+K++++N GI +A
Sbjct: 153 FSATGAMEGINKIKTGSLISLSEQELIDC-DRSYNSGCGGGLMDYAYKFVVKNGGIDTEA 211
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASALQF 260
Y Y + G C+ K + I Y+DVP N+E+ LL+AVA QPVSV I A A Q
Sbjct: 212 DYPYR-ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQL 270
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YS G+F+G C T L+H + VGYG SE G YW++KNSWG+ WG GY + R+ G
Sbjct: 271 YSKGIFDGPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNG 329
Query: 321 QCGIAMFASF 330
CGI SF
Sbjct: 330 VCGINQMPSF 339
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 200/342 (58%), Gaps = 26/342 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
L+ VL+I AS A +F + +++ +E WK +G+TY S E R +I+ +N +
Sbjct: 6 LLLSVLVI----ASTANAVSFFDVVLSD-WESWKLMHGKTYSSSIEEKLRLKIYMENSLK 60
Query: 65 VERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
+ R N+ A+ G Y +++N + DL EF+A G++ ++ ++SL GT K+ Q+
Sbjct: 61 ISRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQYANKTASL--GGTYIPNKNIQL 118
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P V+W E+GAVTPVK QGQC A A+EG + K +L+SLSEQ LVDC+
Sbjct: 119 PTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKFG 178
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
NNGC GG MD AF YI NKGI +A Y YEG+ G C ++ + D+
Sbjct: 179 NNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGID-GHCH-YNPKNKGGSDIGFVDIKKGS 236
Query: 237 EESLLKAVAN-QPVSVAIDASAL--QFYSGGVF-NGYCET-FLNHGVTAVGYGT-SEEGI 290
E+ L KAVA P+SVAIDAS + QFYS GV+ C + L+HGV VG+GT S G
Sbjct: 237 EKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSVSGE 296
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSW + WG+ GY ++ R+ + CGIA AS+PV
Sbjct: 297 DYWLVKNSWSEKWGDQGYIKMARN---KENMCGIASSASYPV 335
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 135/310 (43%), Positives = 175/310 (56%), Gaps = 20/310 (6%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+FE W A++GR+Y E + R F DN V N A SY L LN FADLT E
Sbjct: 37 QFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPA---SYALALNAFADLTHDE 93
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPSVNWIEKGAVTPVKYQGQC----- 144
F A++ G + + G P+L VP +V+W + GAVT VK QG C
Sbjct: 94 FRAARLGRLAAAGPG--RDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWS 151
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A A+EGIN IK L+SLSEQ+L+DC N+GC GG MD A+K++++N GI +A
Sbjct: 152 FSATGAMEGINKIKTGSLISLSEQELIDC-DRSYNSGCGGGLMDYAYKFVVKNGGIDTEA 210
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASALQF 260
Y Y + G C+ K + I Y+DVP N+E+ LL+AVA QPVSV I A A Q
Sbjct: 211 DYPYR-ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQL 269
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
YS G+F+G C T L+H + VGYG SE G YW++KNSWG+ WG GY + R+ G
Sbjct: 270 YSKGIFDGPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNG 328
Query: 321 QCGIAMFASF 330
CGI SF
Sbjct: 329 VCGINQMPSF 338
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 126/254 (49%), Positives = 160/254 (62%), Gaps = 29/254 (11%)
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +PPSV+W +KGAVT VK QG+C V +VEGINAI+ LVSLSEQ+L+DC T
Sbjct: 2 SDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT 61
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA---AQITNYE 230
DN+ GC GG MD+AF+YI N G+ +A Y Y + G C+ +A ++ I ++
Sbjct: 62 ADND-GCQGGLMDNAFEYIKNNGGLITEAAYPYR-AARGTCNVARAAQNSPVVVHIDGHQ 119
Query: 231 DVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
DVP N EE L +AVANQPVSVA++AS A FYS GVF G C T L+HGV VGYG +E+
Sbjct: 120 DVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED 179
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV---------------S 333
G YW +KNSWG WGE GY R+++D G CGIAM AS+PV +
Sbjct: 180 GKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKPKPTPRRALGA 239
Query: 334 KESAQPSSADKSSA 347
+ES SS DK +A
Sbjct: 240 RESLNSSSVDKLAA 253
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 126/310 (40%), Positives = 188/310 (60%), Gaps = 22/310 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W A++G++Y E ++R IF D L +E+ N A+ N ++TL LNKF+DLT EF
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHN--ALPNTTFTLGLNKFSDLTNAEF 59
Query: 94 IASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
A+ G FK + A S +P S++W ++GAVTP+K QGQC A
Sbjct: 60 RANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+A++E + + LVSLSEQQL+DC T D GC GGF +DAFK++++N G+T + Y
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQ--GCQGGFPEDAFKFVVENGGVTTEEAYP 175
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
Y G + G C++ K + +IT Y+DV + ++L+KAV+ PV+V I S F Y
Sbjct: 176 YTGFA-GSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+ +G+C +H V +GYGT E G+ YW+IKNSWG WGEDG+ R+++ + +G CG
Sbjct: 233 GILSGHCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMRIKK--EDGEGMCG 289
Query: 324 IAMFASFPVS 333
+ +S+P +
Sbjct: 290 MNGQSSYPTT 299
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 143/345 (41%), Positives = 192/345 (55%), Gaps = 25/345 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKE-SAENSKRFEIFK 59
MA F + + + + S +S RT DE + ++QW+A++G+ + AE RF IFK
Sbjct: 10 MALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFK 67
Query: 60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
DNL ++ N N Y L LN FADLT +E+ + G K + S + +
Sbjct: 68 DNLKFIDEINAQ---NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSNRYLPRL 124
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
+P S++W KGAV PVK QG C VA+VE IN I L++LSEQ+LVDC
Sbjct: 125 GDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDC- 183
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N GC GG MD AF++II+N G+ + Y Y G DS + I YEDV
Sbjct: 184 DRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGF-----DSSCIQYKKNAIDGYEDV 238
Query: 233 PPNDEESLLKAVANQPVSVAIDA-----SALQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
P N+E++L KAV+ Q VSV A + Q Y G+F G C T L+HGV VGYG SE
Sbjct: 239 PVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SE 297
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G+ YW+++NSWG WGE GY ++QR+I P G CGIAM S+P
Sbjct: 298 GGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 342
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 203/342 (59%), Gaps = 25/342 (7%)
Query: 4 YFLIVVLIISGSCA-SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
Y LI+ +I+ S + ++ R+ E + +E+W ++ + Y E ++RF+IFKDNL
Sbjct: 6 YSLILFGLITLSLSLDMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNL 63
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-- 120
+ ++ N N SY + LN+F+D+T +E+ + + S+++ K + YK+
Sbjct: 64 IFIDEHNAP---NHSYRVGLNEFSDITNKEYRDTYLS-RWSNNNIKNKITSVRYAYKAGH 119
Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+++P SV+W +GA+TP+K QG C AVAAVE IN I LVSLSEQ+LVDC
Sbjct: 120 NNKLPVSVDW--RGALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDC- 176
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N GC GG +A+++I++N G+ + Y Y G + C+ K I Y++V
Sbjct: 177 DRTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQS-TCNQAKKNTKVVSINGYKNV 235
Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
N E +L++AVANQPVSV I+A Q Y GVF G C T L+H V VGYG SE G
Sbjct: 236 QRNSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYG-SENGK 294
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFP 331
YWL+KNSWG +WGE GY +++R++ G+CGIAM A++P
Sbjct: 295 DYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYP 336
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 181/307 (58%), Gaps = 23/307 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F W+ + R+Y + E +RF++++ N ++ N G+ +Y L N+FADLT
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVN--LRGDLTYQLAENEFADLT 104
Query: 90 PQEFIASQTGFKMSDHS--SSLKANGT-----PFLYKSSQVPPSVNWIEKGAVTPVKYQ- 141
+EF+A+ TG+ D S+ G F Y+ VP SV+W +GAV P K Q
Sbjct: 105 EEEFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRV-DVPASVDWRAQGAVVPPKSQT 163
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
C+ A +E +N IK +LVSLSEQQLVDC + D GC G A+K++++
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVE 221
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
N G+T +A Y Y G C+ K+ HAA+IT + VPP +E +L AVA QPV+VAI+
Sbjct: 222 NGGLTTEADYPYTARR-GPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIE 280
Query: 255 -ASALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
S +QFY GGV+ G C T L H VT VGYGT + G KYW IKNSWGQ WGE GY R+
Sbjct: 281 VGSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 340
Query: 313 RDIDQPQ 319
RD+ P+
Sbjct: 341 RDVGGPR 347
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 187/321 (58%), Gaps = 23/321 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K Q+ + Y E R +I+ N + + N +G Y LR+NK+ADL
Sbjct: 23 VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82
Query: 89 TPQEFIASQTGFKMSDHSSSLKA----NGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
+EF+ + GF +D SLK F+ ++ +VP +V+W +KGAVTPVK QG
Sbjct: 83 LHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGH 142
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A A+EG + K +LVSLSEQ LVDC+ NNGC GG MD AF+YI N
Sbjct: 143 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNG 202
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
GI + Y YE + + KA A Y D+P DEE+L KA+A PVS+AIDA
Sbjct: 203 GIDTEKSYPYEAIDDTCHFNPKAV--GATDKGYVDIPQGDEEALKKALATVGPVSIAIDA 260
Query: 256 S--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
S + QFYS GV+ C++ L+HGV AVGYGTSEEG YWL+KNSWG WG+ GY ++
Sbjct: 261 SHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKM 320
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
R+ D CG+A AS+P+
Sbjct: 321 ARNHDN---HCGVATCASYPL 338
>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
Length = 322
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 138/337 (40%), Positives = 201/337 (59%), Gaps = 33/337 (9%)
Query: 6 LIVVLIISGSCA-SQAT-YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
L+ + I+S SQA + T +E SI + +QW Q+ R Y++ +E R ++FK NL
Sbjct: 8 LVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLK 67
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT--PFLYKSS 121
+E FNN +GN+SYT+ +N+F D T +EF+A+ TG +++ + S N T + S
Sbjct: 68 FIENFNN--MGNQSYTVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNIS 125
Query: 122 QVP---PSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+ S +W ++GAV PVK QG C + + G N L++LSEQQL+DC T + N
Sbjct: 126 DIDIDDESKDWRDEGAVIPVKVQGACGLTKISGKN------LLTLSEQQLIDCDT-EKNT 178
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG +++AFKYII+N G++ + Y Y+ + G C + QI +E VP ++E
Sbjct: 179 GCDGGGIEEAFKYIIKNGGVSLETEYPYQ-VKKGSCRANARSATQTQIRGFEMVPSHNER 237
Query: 239 SLLKAVANQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLI 295
+LL+AV QPVSV IDA A F Y GGV+ G C T +NH VT VGYGT +I
Sbjct: 238 ALLEAVRRQPVSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGT--------MI 289
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
Q WGE+GY R++RD++ PQG CGIA A++P+
Sbjct: 290 -----QSWGENGYMRIRRDVEWPQGMCGIAQVAAYPI 321
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 187/321 (58%), Gaps = 23/321 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K Q+ + Y E R +I+ N + + N +G Y LR+NK+ADL
Sbjct: 23 VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82
Query: 89 TPQEFIASQTGFKMSDHSSSLKA----NGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
+EF+ + GF +D SLK F+ ++ +VP +V+W +KGAVTPVK QG
Sbjct: 83 LHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGH 142
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A A+EG + K +LVSLSEQ LVDC+ NNGC GG MD AF+YI N
Sbjct: 143 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNG 202
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
GI + Y YE + + KA A Y D+P DEE+L KA+A PVS+AIDA
Sbjct: 203 GIDTEKSYPYEAIDDTCHFNPKAV--GATDKGYVDIPQGDEEALKKALATVGPVSIAIDA 260
Query: 256 S--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
S + QFYS GV+ C++ L+HGV AVGYGTSEEG YWL+KNSWG WG+ GY ++
Sbjct: 261 SHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKM 320
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
R+ D CG+A AS+P+
Sbjct: 321 ARNRDN---HCGVATCASYPL 338
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 126/310 (40%), Positives = 187/310 (60%), Gaps = 22/310 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W A++G++Y E ++R IF D L +E+ N A+ N ++TL LNKF+DLT EF
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHN--ALPNTTFTLGLNKFSDLTNAEF 59
Query: 94 IASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
A+ G FK + A S +P S++W ++GAVTP+K QGQC A
Sbjct: 60 RANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+A++E + + LVSLSEQQL+DC T D GC GGF +DAFK++++N G+T + Y
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQ--GCQGGFPEDAFKFVVENGGVTTEEAYP 175
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
Y G + G C++ K + +IT Y+DV + ++L+KAV+ PV+V I S F Y
Sbjct: 176 YTGFA-GSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+ +G+C +H V +GYGT E G+ YW+IKNSWG WGEDG+ R+++ +G CG
Sbjct: 233 GILSGHCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMRIKK--KDGEGMCG 289
Query: 324 IAMFASFPVS 333
+ +S+P +
Sbjct: 290 MNGQSSYPTT 299
>gi|356517306|ref|XP_003527329.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 333
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 202/338 (59%), Gaps = 21/338 (6%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSI-AEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
Y L++ LI++ + R G I +E+ E+W AQYG+ YK++ E KRF++FK+N+
Sbjct: 9 YVLVLFLILTVWIS-----RVMSRGLIRSERHEKWIAQYGKVYKDAVE-EKRFQVFKNNV 62
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL--YKS 120
+E FN A G++ + L +N+F DL +EF A + +S ++ P + K
Sbjct: 63 QFIESFN--AAGDKPFNLSINQFVDLHDEEFKA--LLINVQKKASGVETVKEPAMDIQKL 118
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
++ N +K P+ G +A +E ++ I I LV LSEQ+LVDC D+ C
Sbjct: 119 TEEACRENXKKKNEKKPMWDLGFFLIATIESLHQITIGELVFLSEQELVDCVRGDSE-AC 177
Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH-AAQITNYEDVPPND-EE 238
+GGF+++AF++I GIT++A Y Y+G C +K E H A+ YE VP N+ E+
Sbjct: 178 HGGFVENAFEFIANKGGITSEAYYPYKGKDRS-C-KVKKETHGVARNIGYEKVPSNNSEK 235
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLI 295
+LLKAVANQPVSV IDA A +FYS G+FN C T L+H T VGYG +G KYWL+
Sbjct: 236 ALLKAVANQPVSVYIDAGAPAYKFYSSGIFNARNCGTHLDHAATVVGYGKLHDGTKYWLV 295
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
KNSW WGE GY R++RDI +G CGIA AS+P++
Sbjct: 296 KNSWSTAWGEKGYIRMKRDIHSKKGLCGIASNASYPIA 333
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 203/343 (59%), Gaps = 28/343 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+L V+L+ +C + +F + E + +WK ++G+ Y E + R I++ NL
Sbjct: 3 YLSVLLV--AACVVSSLSMSFTD--FDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDI 58
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-- 121
V + N +G+ +Y L +N+F DL +EF+A TGF++S +S A G+ FL ++
Sbjct: 59 VIKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFRVS--GTSKAAKGSTFLPPNNVG 116
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P +V+W KG VTPVK QGQC +VEG + +LVSLSEQ LVDC+
Sbjct: 117 ELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGR 176
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
D GC GGFMD AF+YII GI +A Y Y+ + G C KA + A +T Y DV
Sbjct: 177 DA--GCDGGFMDRAFQYIIDAGGIDTEASYPYKAVD-GKCHFKKA-NVGATVTGYTDVTS 232
Query: 235 NDEESLLKAVAN-QPVSVAIDASALQF--YSGGVFN--GYCETFLNHGVTAVGYGTSEEG 289
E++L KAVA+ P+SVAIDAS + F Y GV+N G T L+HGV AVGYGTS +G
Sbjct: 233 GSEKALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSDG 292
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW++KNSW + WG +GY + R+ D QCGIA AS+P+
Sbjct: 293 TDYWIVKNSWAETWGMNGYVWMSRNKDN---QCGIATNASYPL 332
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 138/319 (43%), Positives = 186/319 (58%), Gaps = 24/319 (7%)
Query: 31 AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP 90
+ F QW+ +GR+YK ++E KR +F +N V N N L LN+FADLT
Sbjct: 43 GQAFSQWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNAR---NSGLVLALNQFADLTL 99
Query: 91 QEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
+EF A+ G+ S + T F Y ++ +P +V+W +K AVTPVK Q C
Sbjct: 100 EEFAATHLGYNPSLREGK-EHTTTSFQYADANDLPSTVDWRKKNAVTPVKNQAMCGSCWA 158
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A AVEGINAI+ +LVSLSEQQLVDC ++ + GC GG MD AF YI +N GI ++
Sbjct: 159 FSATGAVEGINAIRTGKLVSLSEQQLVDC-DSEKDLGCGGGLMDFAFDYITKNGGIDSED 217
Query: 203 VYSYEGMSTGICDSIK-AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFY 261
YSY G IC K A+ H I +EDVP ND E+L KA+A+QPVS+ ++
Sbjct: 218 DYSYWGYGL-ICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSL--------YH 268
Query: 262 SGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
SG V + C LNHGV AVGY S+ G +++IKNSWG+ WGE G+FRL + G
Sbjct: 269 SGVVGDDACCQDLNHGVLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEASG 328
Query: 321 QCGIAMFASFPVSKESAQP 339
CG+ AS+P+ K++ P
Sbjct: 329 ACGVYKAASYPLKKDATNP 347
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 139/350 (39%), Positives = 204/350 (58%), Gaps = 36/350 (10%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
++ FLI+ +++ + A+ + FD +++ +K + + Y+ S + R +IF
Sbjct: 4 LSMKFLILAVLVGAASAALTLEQLFDA-----EWQNFKVHHNKKYEGSTVEAFRKKIFLQ 58
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY- 118
N + R N A G +Y L++N+F D+ EF+++ G L++N T F
Sbjct: 59 NTHLIARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGL--------LRSNRTYFGST 110
Query: 119 ----KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQ 167
+S +P SV+W EKGAVTPVK QG C A+EG K LVSLSEQ
Sbjct: 111 WIEPESVSLPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQN 170
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
L+DC+T+ NNGC GG MD+AF YI +N GI + Y YEG G C K ED A + T
Sbjct: 171 LIDCSTSYGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEG-KQGKCRYHK-EDSAGRDT 228
Query: 228 NYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY-CETF-LNHGVTAVG 282
+ D+P +E +L KA+A PVSVAIDAS + QFY GV+N C++ L+HGV AVG
Sbjct: 229 GFVDIPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVG 288
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YGT+++G Y++IKNSWG+ WG++GY + R+ + +CG+A AS+P+
Sbjct: 289 YGTTDDGQDYYIIKNSWGERWGQEGYVLMARN---SKNECGVATQASYPL 335
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 176/314 (56%), Gaps = 20/314 (6%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI----GNRSYTLRLNKFADL 88
+FE W A++G+ Y E + R F +N V N+A G SYTL LN FADL
Sbjct: 38 QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTP---FLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
T EF A++ G +++ L A F + VP +++W + GAVT VK QG C
Sbjct: 98 THDEFRAARLG-RLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156
Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
A A+EGIN I L+SLSEQ+L+DC N GC GG M A+K++I+N GI
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDC-DRSYNTGCGGGLMTYAYKFVIKNGGI 215
Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DAS 256
+ Y + + G C+ K + H I Y++VP + E+ LL+AVA QP+SV I A
Sbjct: 216 DTEDDYPFR-EADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSAR 274
Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
A Q YS G+F+G C T L+H V VGYG SE G YW++KNSWG+ WG GY + R+
Sbjct: 275 AFQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333
Query: 317 QPQGQCGIAMFASF 330
G CGI M ASF
Sbjct: 334 SSSGICGINMMASF 347
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 142/358 (39%), Positives = 193/358 (53%), Gaps = 44/358 (12%)
Query: 16 CASQATYRTF---------DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
C+S +R + D + E+F++WKA Y ++Y AE+ +RF ++ N+ +E
Sbjct: 25 CSSATAHRPYAGDMGSSTDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIE 84
Query: 67 RFNNAAIG-NRSYTLRLNKFADLTPQEFIASQTGFK------------------MSDHSS 107
N A +Y L + DLT QEF+A T ++ +
Sbjct: 85 ATNAEAEAAGLTYELGETAYTDLTNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAG 144
Query: 108 SLKANGTPFLY--KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKIN 158
+ A G +Y S+ P SV+W GAVTPVK QG+C VA VEGI I+
Sbjct: 145 PVDAVGQLPVYVNLSTAAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTG 204
Query: 159 RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK 218
+LVSLSEQ+LVDC T D GC GG A ++I N G+T + Y Y G +T C+ K
Sbjct: 205 KLVSLSEQELVDCDTLDA--GCDGGISYRALRWITSNGGLTTEEDYPYTG-TTDACNRAK 261
Query: 219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNH 276
+AA I V E SL AVA QPV+V+I+A Q Y GV+NG C T LNH
Sbjct: 262 LAHNAASIAGLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNH 321
Query: 277 GVTAVGYGTSEE-GIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
GVT VGYG EE G KYW+IKNSWG WG+ GY ++++D+ +P+G CGIA+ SFP+
Sbjct: 322 GVTVVGYGQEEEDGDKYWIIKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 132/309 (42%), Positives = 179/309 (57%), Gaps = 22/309 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W + + YK E RFEIFKDNL+ ++ N N SY L LN+FADLT EF
Sbjct: 22 FESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK---NSSYWLGLNEFADLTHDEF 78
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA------- 145
A G D + +++ F YK P S++W +KGAVTPVK Q C
Sbjct: 79 KAKYVGSLGEDSTIIEQSDDEEFPYKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFST 138
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
VA VEGIN I +L+SLSEQ+L+DC + ++GC GG+ + +Y+ N G+ + Y
Sbjct: 139 VATVEGINKIVTGKLISLSEQELLDC--DRRSHGCKGGYQTTSLQYVADN-GVHTEKEYP 195
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
YE G C + + +IT Y+ VP N+E SL++A+ANQPVSV +++ A QFY G
Sbjct: 196 YE-KKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQFYKG 254
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+F G C T ++H VTAVGYG + Y LIKNSWG WGE GY R++R + +G CG
Sbjct: 255 GIFEGPCGTKVDHAVTAVGYGKN-----YILIKNSWGPKWGEKGYIRIKRASGKSKGTCG 309
Query: 324 IAMFASFPV 332
+ + FP
Sbjct: 310 VYSSSYFPT 318
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 139/346 (40%), Positives = 202/346 (58%), Gaps = 36/346 (10%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FLI+ +++ + A+ + FD +++ +K + + Y+ S + R +IF N
Sbjct: 3 FLILAVLVGAASAALTLEQLFDA-----EWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHL 57
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY----- 118
+ R N A G +Y L++N+F D+ EF+++ G L++N T F
Sbjct: 58 IARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGL--------LRSNRTYFGSTWIEP 109
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
+S +P SV+W EKGAVTPVK QG C A+EG K LVSLSEQ L+DC
Sbjct: 110 ESVSLPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDC 169
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+T+ NNGC GG MD+AF YI +N GI + Y YEG G C K ED A + T + D
Sbjct: 170 STSYGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEG-KQGKCRYHK-EDSAGRDTGFVD 227
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY-CETF-LNHGVTAVGYGTS 286
+P +E +L KA+A PVSVAIDAS + QFY GV+N C++ L+HGV AVGYGT+
Sbjct: 228 IPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTT 287
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
++G Y++IKNSWG+ WG++GY + R+ + +CG+A AS+P+
Sbjct: 288 DDGQDYYIIKNSWGERWGQEGYVLMARN---SKNECGVATQASYPL 330
>gi|449469176|ref|XP_004152297.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 340
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 204/345 (59%), Gaps = 22/345 (6%)
Query: 1 MAKYFLIVVLII---SGSCASQATYRT-FD-EGSIAEKFEQWKAQYGRTYKESAENSKRF 55
M K+ ++ V++I S C R F+ E S+ + +++W + + R + + E KRF
Sbjct: 3 MMKFLIVFVVLIAFASHLCEGFDLERKDFESEKSLMQLYKRWSSHH-RISRNAHEMHKRF 61
Query: 56 EIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS-SLKANGT 114
+IF+DN V + N+ +S LRLN+FADL+ EF + G ++ +++ KA G
Sbjct: 62 KIFQDNAKRVFKVNHMG---KSLKLRLNQFADLSDDEF-SMMYGSNITHYNNLHAKAGGR 117
Query: 115 P--FLY-KSSQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDC 171
F+Y ++ +P S++W EKGAV +K QG CAVAAVE I+ IK N LVSLSEQ++VDC
Sbjct: 118 VGGFMYERAMNIPFSIDWREKGAVNAIKNQGLCAVAAVESIHQIKTNELVSLSEQEVVDC 177
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ GC GG D AF++I+QN GIT + Y Y G C I YE
Sbjct: 178 --DYKVGGCRGGNYDSAFEFIMQNGGITIEENYPYFA-GNGYCRRRGPNSERVTIDGYEC 234
Query: 232 VPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFN--GYCETFLNHGVTAVGYGTSE 287
VP N+E +L+KAVA+QPV+V++ +S +FY G+ +C ++H V VGYG+ E
Sbjct: 235 VPQNNEYALMKAVAHQPVAVSVASSGSDFRFYGEGMLREGSFCGYRIDHTVVVVGYGSDE 294
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
EG YW+I+N +G WG +GY ++QR PQG CG+AM SFPV
Sbjct: 295 EG-DYWIIRNQYGTQWGMNGYMKMQRGTRNPQGVCGMAMQPSFPV 338
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/315 (39%), Positives = 183/315 (58%), Gaps = 22/315 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F +W+A Y R+Y + E +RF++++ N+ +E N A GN +YTL N+FADLT
Sbjct: 53 MMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRA--GNLTYTLGENQFADLT 110
Query: 90 PQEFIASQTGFKM----SDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG-QC 144
+EF+ T M D +AN + + P SV+W +GAVTP+K QG C
Sbjct: 111 EEEFLDLYTMKGMPPVRRDAGKKQQANFSSVV----DAPTSVDWRSRGAVTPIKNQGPSC 166
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
+ A +E I I+ +LVSLSEQ+L+DC D GC G+ + +K++IQN G
Sbjct: 167 SSCWAFVTAATIESITQIRTGKLVSLSEQELIDCDPYDG--GCNLGYFVNGYKWVIQNGG 224
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+T +A Y Y+ C+ KA AA+I+NY +P + + + +
Sbjct: 225 LTTEANYPYQARRYQ-CNRSKAGQRAARISNYRQLPQGEAQLQQAVAQQPVAAAIEMGGS 283
Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
LQFYSGGV++G C T +NH +T VGYG G+KYWL+KNSWGQ WGE GY R+++D+ Q
Sbjct: 284 LQFYSGGVWSGQCGTRMNHAITVVGYGADSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQ 343
Query: 318 PQGQCGIAMFASFPV 332
G CGIA+ ++P+
Sbjct: 344 G-GLCGIALDLAYPI 357
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 141/347 (40%), Positives = 200/347 (57%), Gaps = 28/347 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M Y ++ L ++ A+ T++ + ++ +KA +G+ Y E R +I+ +
Sbjct: 1 MRGYIVLCCLFVT---AAAITHQEL----VGAEWSAFKALHGKDYASDTEEYYRLKIYME 53
Query: 61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFL 117
N + + R N A SY L +N+F DL EF++++ GFK + S + + P
Sbjct: 54 NRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEG 113
Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVD 170
++ Q+P +V+W +KGAVTPVK QGQC ++EG + K +LVSLSEQ LVD
Sbjct: 114 FEDLQLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVD 173
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C+ + NNGC GG MD+AFKYI NKGI + Y Y + G+C D A T +
Sbjct: 174 CSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSYPYNA-TDGVC-HFNRSDVGATDTGFV 231
Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCETF-LNHGVTAVGYGT 285
D+P DE L KAVA PVSVAIDAS + QFYS GV++ C + L+HGV VGYGT
Sbjct: 232 DIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYGT 291
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
++G YWL+KNSWG WG++GY + R+ D QCGIA AS+P+
Sbjct: 292 -KDGQDYWLVKNSWGTTWGDEGYIYMTRNKDN---QCGIASSASYPL 334
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 190/318 (59%), Gaps = 21/318 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR-SYTLRLNKFADL 88
+ ++ +KA +G+ Y E R +I+ +N + + R N N+ SY L +N+F DL
Sbjct: 46 VGAEWSAFKALHGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDL 105
Query: 89 TPQEFIASQTGFKMSDHSSSLKANG--TPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
EF++++ GFK + S+ + + P + +P +V+W +KGAVTPVK QGQC
Sbjct: 106 LHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGS 165
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
++EG + K R+VSLSEQ LVDC+ NNGC GG MD+AFKYI N GI
Sbjct: 166 CWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGID 225
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
+ Y Y G + GIC K+ D A T + D+P +E+ L KAVA PVSVAIDAS
Sbjct: 226 TELSYPYNG-TDGICHFEKS-DVGATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHE 283
Query: 257 ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
+ QFYS GV++ C + L+HGV VGYGT ++G YWL+KNSWG WG+DGY + R+
Sbjct: 284 SFQFYSQGVYDEPECSSESLDHGVLVVGYGT-KDGQDYWLVKNSWGTTWGDDGYIYMTRN 342
Query: 315 IDQPQGQCGIAMFASFPV 332
+ QCGIA AS+P+
Sbjct: 343 ---KENQCGIASSASYPL 357
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 124/314 (39%), Positives = 182/314 (57%), Gaps = 18/314 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F W+A Y R+Y + E +RF++++ N+ +E N A GN +YTL N+FADLT
Sbjct: 45 MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRA--GNLTYTLGENQFADLT 102
Query: 90 PQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG-QCA- 145
+EF+ T G + + +AN + + P SV+W KGAVTP+K QG C+
Sbjct: 103 EEEFLDLYTMKGMPVRRDAGKKRANVSSSA-AAVDAPTSVDWRSKGAVTPIKNQGPSCSS 161
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
A +E I I +LVSLSEQ+L+DC D GC G+ + ++++IQN G+T
Sbjct: 162 CWAFVTAATIESITKITTGKLVSLSEQELIDCDPYDG--GCNLGYFVNGYRWVIQNGGLT 219
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQ 259
+A Y Y+ C +A HAA I++Y +P + + + +LQ
Sbjct: 220 TEANYPYQARRYA-CSRSRAAQHAATISDYVQLPAGEGQLQQAVAQQPVAAAIEMGGSLQ 278
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
FYSGGVF+G C T +NH +T VGYG S G+KYWL+KNSWGQ WGE GY R++RD+ +
Sbjct: 279 FYSGGVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRG 338
Query: 319 QGQCGIAMFASFPV 332
G CGIA+ ++PV
Sbjct: 339 -GLCGIALDLAYPV 351
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 142/347 (40%), Positives = 198/347 (57%), Gaps = 37/347 (10%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M K+FLI+ L G+ S A + + ++ +WKA +G+ Y + E S RF+IF++
Sbjct: 1 MYKFFLILSL---GAFVSGAEFSS--------EWLKWKATHGKVYNSADEESLRFKIFQE 49
Query: 61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N + + + N G +Y L +N F DL EF+ GF+ + G F +
Sbjct: 50 NSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFLERSNGFQGG------VSGGDVFTFD 103
Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
+ + VP NW KGAVTPVK QG+C A +VEG +K +L+SLSEQQLVDC
Sbjct: 104 TNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDC 163
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ ++ N GC GG MD+AFKY I NKGI N+ Y Y K A I++++D
Sbjct: 164 SGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYPYTAKDNDC--KYKKSMSVATISSFKD 221
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGV-FNGYCET-FLNHGVTAVGYGTS 286
V DE+ L AVAN PVSVAIDAS+ QFY GV ++ C + L+HGV AVGYGT
Sbjct: 222 VKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYESGVYYDENCSSEVLDHGVLAVGYGTD 281
Query: 287 EE-GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
++ G+ +WL+KNSW WG +GY ++ R+ D CGIA AS+P+
Sbjct: 282 KKSGMDFWLVKNSWAASWGLNGYIKMARNKDN---NCGIATMASYPI 325
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 185/319 (57%), Gaps = 22/319 (6%)
Query: 28 GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLRLNKFA 86
G + +E WK +G++Y+ S E R +I +N + + R N AI G SY +++N +
Sbjct: 21 GVVLSDWESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYG 80
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
DL EF+A G++ + +S G+ K+ ++P V+W E GAVTPVK QGQC
Sbjct: 81 DLLHHEFVAMVNGYEYVNKTS---LGGSFIPSKNVKLPTHVDWREDGAVTPVKNQGQCGS 137
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
+ ++EG K +L+ LSEQ LVDC+ NNGC GG MD AF YI NKGI
Sbjct: 138 CWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGID 197
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL 258
+ Y YEG+ G C ++ ++ I + DV EE LLKAVA+ PVSVAIDAS +
Sbjct: 198 TEGSYPYEGVG-GRCHYDPSKKGSSDI-GFVDVKKGSEEELLKAVASVGPVSVAIDASHM 255
Query: 259 --QFYSGGV-FNGYCE-TFLNHGVTAVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQR 313
QFYS GV F C L+HGV VGYGT E G YWL+KNSW ++WG+ GY ++ R
Sbjct: 256 SFQFYSHGVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKMAR 315
Query: 314 DIDQPQGQCGIAMFASFPV 332
+ + CGIA AS+PV
Sbjct: 316 N---KKNMCGIASSASYPV 331
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 208/348 (59%), Gaps = 30/348 (8%)
Query: 6 LIVVLIISGSCAS-QATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
L+++ + C S + + F+ E S+ + +++W + + R + + E RF++FK+N
Sbjct: 11 LVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMHNRFKVFKNNAK 69
Query: 64 AVERFNNAAIGNRSYTLRLNKFADLTPQEF---IASQTGFKMSDHSSSLKANGTP---FL 117
V + N + +S L+LN+FAD++ EF +S + H+ ++A G F+
Sbjct: 70 HVFKVN---LMGKSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEATGGRIGGFM 126
Query: 118 YK-SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
Y+ ++ +P S++W +KGAV +K QG+C AVAAVE I+ IK N LVSLSE++++
Sbjct: 127 YEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNELVSLSEEEVL 186
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMSTGICDSIKAEDHAAQITN 228
DC D GC GGF + AF++++ N G+T + Y Y EG G C + +I
Sbjct: 187 DCDYRDG--GCRGGFYNSAFEFMMDNDGVTIEDNYPYYEG--NGYCRRRGGRNKRVRIDG 242
Query: 229 YEDVPPNDEESLLKAVANQPVSVAI--DASALQFYSGGVF--NGYCETFLNHGVTAVGYG 284
YE+VP N+E +L+KAVA+QPV+VAI S +FY GG+F N +C ++H V VGYG
Sbjct: 243 YENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVVVVGYG 302
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
T E+G YW+I+N +G WG +GY ++QR PQG CG+AM ++PV
Sbjct: 303 TDEDG-DYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPV 349
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 193/322 (59%), Gaps = 27/322 (8%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
EG + +FEQ+K+ +GR Y R IF+ NL + R N + G+ ++++ +N F
Sbjct: 26 EGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNF 85
Query: 86 ADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
DL+ +EF A+ G++ S S+ A+ +P +V+W KG VTP+K Q
Sbjct: 86 TDLSNEEFRATFNGYRRLAAVSLADSVHADN-----DVEALPATVDWTTKGVVTPIKNQQ 140
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC AVA++EG +A+K +LVSLSEQ LVDC+ + + GC GG+MD AFKY+IQN
Sbjct: 141 QCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQN 200
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAID 254
+GI +A Y Y+ + C+ K A I ++ DV DE +L AVA+ P+SVAID
Sbjct: 201 RGIDTEASYPYKAIDES-CE-FKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAID 258
Query: 255 AS--ALQFYSGGVFNGY-CET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
AS + QFYS GV+N C T L+HGVTAVGYGT G+ YW +KNSWG WG+ GY
Sbjct: 259 ASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL-NGVPYWKVKNSWGTSWGQKGYIF 317
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+ R+ Q QCGIA AS+PV
Sbjct: 318 MSRN---KQNQCGIATKASYPV 336
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 141/326 (43%), Positives = 187/326 (57%), Gaps = 29/326 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E+++ +KA++ + Y E R +IF DN + + N G Y L LNK++D+
Sbjct: 23 VMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLNKYSDM 82
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN-GTPFLYKSSQVPPS-------VNWIEKGAVTPVKY 140
EFI + GF S L++N G L S +PP+ V+W++ GAVTPVK
Sbjct: 83 LHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANVKLPKHVDWVKLGAVTPVKD 142
Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QG C A A+EG++ K LVSLSEQ L+DC+T + NNGC GG MD AF+Y+
Sbjct: 143 QGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVR 202
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVA 252
N GI + Y YEG + +C + E+ A T Y DVP DE++L AVA PVSVA
Sbjct: 203 INGGIDTERSYPYEG-NNDVC-RYEPENSGAIDTGYTDVPLGDEDALKSAVATVGPVSVA 260
Query: 253 IDAS--ALQFYSGGV-FNGYCET---FLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGE 305
IDAS + Q YS GV F C+ L+HGV VGYGT EE + YWL+KNSWG WGE
Sbjct: 261 IDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSWGDSWGE 320
Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFP 331
+GY ++ R+ D QCGIA SFP
Sbjct: 321 NGYIKMARNADN---QCGIATQPSFP 343
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 184/322 (57%), Gaps = 24/322 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K Q+ + Y E R +I+ N + + N G + LR+NK+ DL
Sbjct: 23 VKEEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDL 82
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN---GTPFLY---KSSQVPPSVNWIEKGAVTPVKYQG 142
+EF+ + GF ++ + P Y + +VP +V+W EKGAVTPVK QG
Sbjct: 83 LHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQG 142
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C A A+EG + K +LVSLSEQ LVDC+T NNGC GG MD AF+YI N
Sbjct: 143 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDN 202
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAID 254
GI + Y YE + + KA A + D+P DE++L+KA+A PVSVAID
Sbjct: 203 GGIDTEKAYPYEAIDDTCHYNPKAV--GATDKGFVDIPQGDEKALMKAIATAGPVSVAID 260
Query: 255 AS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
AS + QFYS GV + C++ L+HGV AVGYGTSEEG YWL+KNSWG WG+ GY +
Sbjct: 261 ASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVK 320
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+ R+ D CGIA AS+P+
Sbjct: 321 MARNRDN---HCGIATAASYPL 339
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 194/340 (57%), Gaps = 26/340 (7%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
I++L + S AS ++ FD + +E WK + + Y S E R +IF +N + +
Sbjct: 6 ILLLSVIISTASAVSF--FD--VVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRIS 61
Query: 67 RFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
R N AI G +Y +++N + DL EF+A G+ ++ ++ GT K+ +P
Sbjct: 62 RHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGYIYNNKTT---LGGTFIPSKNINLPE 118
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
V+W E+GAVTPVK QGQC A ++EG + K +L+SLSEQ LVDC+ NN
Sbjct: 119 HVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGNN 178
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MD AFKYI N GI +A Y YEG+ G C + I + D+ E+
Sbjct: 179 GCEGGLMDYAFKYIQDNNGIDTEASYPYEGID-GHCHYDPKNKGGSDI-GFVDIKKGSEK 236
Query: 239 SLLKAVAN-QPVSVAIDASAL--QFYSGGVFN-GYCE-TFLNHGVTAVGYGTSE-EGIKY 292
L KA+A P+SVAIDAS + QFYS GV++ C L+HGV AVGYGT E G Y
Sbjct: 237 DLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGEDY 296
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WL+KNSW + WGEDGY ++ R+ D CGIA AS+PV
Sbjct: 297 WLVKNSWSEKWGEDGYIKMARNKDN---MCGIASSASYPV 333
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 124/310 (40%), Positives = 186/310 (60%), Gaps = 22/310 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W A++G++Y +E ++R IF D L +E+ N A N ++TL LNKF+DLT EF
Sbjct: 2 FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHN--AQPNTTFTLGLNKFSDLTNAEF 59
Query: 94 IASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
A+ G FK + A S +P S++W ++GAVTP+K QGQC A
Sbjct: 60 RANYVGKFKSPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+A++E + + LVSLSEQQL+DC T D GC GGF +DAFK++++N G+T + Y
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQ--GCQGGFPEDAFKFVVENGGVTTEEAYP 175
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
Y G + G C++ K + +IT Y+DV + ++L+KAV+ PV+V I S F Y
Sbjct: 176 YTGFA-GSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+ +G C +H V +GYGT E G+ YW+IKNSWG WGE+G+ ++++ +G CG
Sbjct: 233 GILSGQCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGENGFMKIKK--KDGEGMCG 289
Query: 324 IAMFASFPVS 333
+ +S+P +
Sbjct: 290 MNGQSSYPTT 299
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 187/322 (58%), Gaps = 19/322 (5%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR-SYTLRLNKF 85
E + E F+QWK ++ + Y+ + E KRFE FK NL + N N+ + + LNKF
Sbjct: 42 EERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKF 101
Query: 86 ADLTPQEFI-ASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
AD++ +EF A + K + + +S P S++W G VT VK QG C
Sbjct: 102 ADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGSC 161
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
+ A+EGINA+ L+SLSEQ+LV+C T+ N GC GG+MD AF+++I N G
Sbjct: 162 GSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS--NYGCEGGYMDYAFEWVINNGG 219
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
I +++ Y Y G+ G C++ K E I Y+DV +D +LL AVA QPVSV ID SA
Sbjct: 220 IDSESDYPYTGVD-GTCNTTKEETKVVSIDGYQDVEQSDS-ALLCAVAQQPVSVGIDGSA 277
Query: 258 L--QFYSGGVFNGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
+ Q Y+GG+++G C ++H V VGYG SE+ +YW++KNSWG WG DGYF L+
Sbjct: 278 IDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYG-SEDSEEYWIVKNSWGTSWGIDGYFYLK 336
Query: 313 RDIDQPQGQCGIAMFASFPVSK 334
RD D P G C + AS+P +
Sbjct: 337 RDTDLPYGVCAVNAMASYPTKQ 358
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/294 (43%), Positives = 173/294 (58%), Gaps = 15/294 (5%)
Query: 53 KRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA 111
+R E+F+DNL ++ N A G + L L +FADLT +E+ A + +++
Sbjct: 91 RRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGV 150
Query: 112 NGTP--FLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVS 162
G Q+P +V+W E+GAV VK QGQC AVAAVEGIN I L+S
Sbjct: 151 VGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLIS 210
Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
LSEQ+L+DC + GC GG MD+AF ++I+N GI +A Y + G G CD
Sbjct: 211 LSEQELIDC-DKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHD-GTCDLKLKNTR 268
Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTA 280
I ++E VP N E +L KAVA+QPVS +I+AS A Q YS G+F+G C T+L+HGVT
Sbjct: 269 VVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTV 328
Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
VGYG SE G YW++KNSWG WGE GY R+ R++ GIAM +PV +
Sbjct: 329 VGYG-SEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPVKE 381
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/332 (40%), Positives = 183/332 (55%), Gaps = 35/332 (10%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
+ +F W R+Y S+E + RF++++ N+ +E N A +Y L F DL
Sbjct: 56 MMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFTDL 115
Query: 89 TPQEFIASQTGFKMSD----------------HSSSLKANGTPFLYK--SSQVPPSVNWI 130
T +EFI+ TG K+ D H+ S+ +Y S+ P ++W
Sbjct: 116 TDEEFISLYTG-KIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANFSAGAPIRMDWR 174
Query: 131 EKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGG 183
++GAVTPVK QG+C VA +EGI+ IK RLVSLSEQQLVDC D GC GG
Sbjct: 175 KRGAVTPVKDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDFLDG--GCNGG 232
Query: 184 FMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA 243
+ +AF++IIQN GIT + Y+Y+ + G C + AA+IT Y V N E S++
Sbjct: 233 WPRNAFQWIIQNGGITTTSSYTYKA-AEGQCKGNRKP--AAKITGYRKVKSNSEVSMVNI 289
Query: 244 VANQPV--SVAIDASALQFYSGGVFNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWG 300
VANQP+ S+ + Q Y GG++NG C T LNH +T VGYG G KYW++KNSWG
Sbjct: 290 VANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWG 349
Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WG GY ++R P GQCGIA+ FP+
Sbjct: 350 AAWGNKGYMLMKRGTKNPLGQCGIAVRPIFPL 381
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 147/366 (40%), Positives = 202/366 (55%), Gaps = 42/366 (11%)
Query: 1 MAKYFLIVVLIIS-GSCASQATYRTFD---EGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
MA ++V + +S AS Y D E S+ +E+W A Y ++ E ++RF+
Sbjct: 11 MAATLVVVGMALSIAPVASAIDYTERDLASEESLWALYERWCAHYNMA-RDHGEKTRRFD 69
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQ-----TGFKMSD------- 104
+FK+N + N+ GN +YTL LN+F+D+T +EF S T +MSD
Sbjct: 70 LFKENARRIYEHNHQ--GNATYTLGLNRFSDMTDEEFNRSPYGGCLTAPRMSDDEIEELH 127
Query: 105 -HSSSLKANGTPFLYKSSQ-----VPPSVNWIEKGAVTPVKYQGQC--------AVAAVE 150
H + +G+ L S PP+V+W + AVT VK QG A+AAVE
Sbjct: 128 HHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWRGR-AVTRVKDQGPTCGSCWAFSAIAAVE 186
Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
GINAI+ LV LSEQQLVDC + N+GC GG M AF ++++N+G+ + Y Y G
Sbjct: 187 GINAIRTRNLVPLSEQQLVDC--DKLNHGCNGGLMTTAFSFVVRNRGVVPEGAYPYMGRE 244
Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNG 268
G C + A I Y+ VP D +L+ AVA QPVSVAI+AS+ +F Y GGVFNG
Sbjct: 245 -GRCKHVMAP--PVTIYGYQRVPRFDANALMNAVAAQPVSVAIEASSFEFRHYQGGVFNG 301
Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
C L H TAVGYG ++ G +W++KNSWG WGE GY R+ R+ QG CGI
Sbjct: 302 NCGGRLGHAATAVGYG-ADAGGPFWIVKNSWGPGWGEGGYVRISRNTPVRQGVCGILTEN 360
Query: 329 SFPVSK 334
S+PV +
Sbjct: 361 SYPVKR 366
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 183/313 (58%), Gaps = 35/313 (11%)
Query: 37 WKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIAS 96
WK + + Y +E + R+ I+KDN+ + +N+ + ++ LR+N F D+T EF A
Sbjct: 30 WKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKS---KNVILRMNHFGDMTNTEFRAK 86
Query: 97 QTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAA 148
G + H NG+ FL S + P +V+W +G VTPVK QGQC + A
Sbjct: 87 MNGLLLHKHQ-----NGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGA 141
Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
+EG + K RLVSLSEQ LVDC+T+ NNGC GG MD+AF YI N GI + Y YEG
Sbjct: 142 LEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEG 201
Query: 209 MSTGIC----DSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL--QFY 261
G C SI A+D T + D+P DE++L +AVA PVSVAIDAS + QFY
Sbjct: 202 QD-GTCRYSKSSIGADD-----TGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFY 255
Query: 262 SGGVFN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
GV++ C + L+HGV VGYGT + G YWL+KNSWG WG +GY + R+ Q
Sbjct: 256 HSGVYDEPQCSPSALDHGVLVVGYGT-DNGKDYWLVKNSWGTGWGTEGYIYMSRN---NQ 311
Query: 320 GQCGIAMFASFPV 332
QCGIA AS+P+
Sbjct: 312 NQCGIASKASYPL 324
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 139/344 (40%), Positives = 186/344 (54%), Gaps = 25/344 (7%)
Query: 5 FLIVVLIISGSCAS----QATYRTFDEGSIA---EKFEQWKAQYGRTYKESAENSKRFEI 57
FL LII S +S Y D SI + F+ W ++ + Y+ E RFEI
Sbjct: 12 FLATCLIIHMSLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEI 71
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
F+DNL+ ++ N N SY L LN FADL+ EF G D + + F
Sbjct: 72 FRDNLMYIDETNKK---NNSYWLGLNGFADLSNDEFKKKYVGSVAEDFTGLEHFDNEDFT 128
Query: 118 YKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
YK + P S++W KGAVTPVK QG C +A VEG+N I L+ LSEQ+LV
Sbjct: 129 YKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELV 188
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + N++GC GG+ + +Y+ N G+ VY Y+ + C + +IT Y
Sbjct: 189 DC--DKNSHGCKGGYQTTSLQYVADN-GVHTSKVYPYQAKAMQ-CRATDKPGPKVKITGY 244
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
+ VP N E S L A+ANQP+SV ++A Q Y GVF+G C T L+H VTAVGYGTS+
Sbjct: 245 KRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD 304
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G Y +IKNSWG +WGE GY RL+R QG CG+ + +P
Sbjct: 305 -GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 135/339 (39%), Positives = 182/339 (53%), Gaps = 39/339 (11%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
++ E F++WKA+Y R+Y E +R ++ N+ +E N AA +Y L + DL
Sbjct: 47 TMMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAA--GLAYELGETAYTDL 104
Query: 89 TPQEFIASQTGFKMSDHSSS----------------LKANGTPFLY--KSSQVPPSVNWI 130
T EF+A T + + + + P +Y +S+ P SV+W
Sbjct: 105 TNDEFMAMYTAPPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWR 164
Query: 131 EKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGG 183
GAVT VK QG+C VA VEGI IK +LVSLSEQ+LVDC T D+ GC GG
Sbjct: 165 ASGAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTLDS--GCDGG 222
Query: 184 FMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA 243
A ++I N GIT Y Y G + CD K HAA I V E SL A
Sbjct: 223 VSYRALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNA 282
Query: 244 VANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE-------EGIKYWL 294
A QPV+V+I+A Q Y GV++G C T LNHGVT VGYG E G KYW+
Sbjct: 283 AAAQPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWI 342
Query: 295 IKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
IKNSWG++WG+ GY ++++D+ +P+G CGIA+ SFP+
Sbjct: 343 IKNSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPL 381
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 199/339 (58%), Gaps = 28/339 (8%)
Query: 7 IVVLIISGSCAS--QATYRTFDEGSI--AEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
I+VL+ G A+ A+ D G + ++F QW+A + R+Y + E +RFE+++ N+
Sbjct: 14 ILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNV 73
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM-SDHSSSLKANGTPFLYKSS 121
++ N G +Y L N+FADLT +EF+A G S +++ +A+G+ +
Sbjct: 74 EYIDATNRR--GGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGS----LEA 127
Query: 122 QVPPSVNWIEKGAVTPVKYQG-QC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
P SV+W KGAVTPVK QG QC AVA +E + IK +LV+LSEQQLVDC
Sbjct: 128 DPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDC-- 185
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ + GC G+ AF++I++N GIT A Y Y+ + G C + K A IT + V
Sbjct: 186 DKYDGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVR-GACSAAKP---AVTITGHLAVA 241
Query: 234 PNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
N E +L AVA QP+ VAI+ ++QFY GVF+ C ++H V VGYG G+KY
Sbjct: 242 KN-ELALQSAVARQPIGVAIEVPISMQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWGQ WGE GY R++RD+ G CGIA+ ++P
Sbjct: 301 WLVKNSWGQTWGEAGYIRMRRDVGG-GGLCGIALDTAYP 338
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 181/322 (56%), Gaps = 26/322 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
I E++ +K Q+ + Y E R +IF +N + + N A G SY L LNK+AD+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 89 TPQEFIASQTGFK------MSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
EF + G+ M + + + A P + + VP SV+W E GAVT VK QG
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVT--VPKSVDWREHGAVTGVKDQG 141
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C + A+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 142 HCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 201
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAID 254
GI + Y YEG+ C KA A T + D+P DEE + KAVA PVSVAID
Sbjct: 202 GGIDTEKSYPYEGIDDS-CHFNKATI-GATDTGFVDIPEGDEEKMKKAVATMGPVSVAID 259
Query: 255 AS--ALQFYSGGVFN-GYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
AS + Q YS GV+N C E L+HGV VGYGT E G+ YWL+KNSWG WGE GY +
Sbjct: 260 ASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIK 319
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+ R+ + QCGIA +S+P
Sbjct: 320 MARNQNN---QCGIATASSYPT 338
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 192/321 (59%), Gaps = 23/321 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F+ W+A+Y RTY E +RF ++ +N+ +E N SY L N+FADLT
Sbjct: 33 LLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPG---SSYELGENQFADLT 89
Query: 90 PQEFIASQTGFKMSDHSSSLKA----------NGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
+EF + K+ + +SS +A GT +++ P SV+W KGAVTPVK
Sbjct: 90 EEEFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVK 148
Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
Q C AVA++EG++ IK RLVSLSEQ++VDC NN+GC+GG A +++
Sbjct: 149 SQQHCGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWV 208
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
+N G+T ++ Y Y G G C S K HAA+I + V +E +L AVA +PV+V+
Sbjct: 209 TRNGGLTTESDYPYVGRQ-GQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVS 267
Query: 253 IDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
I+AS A QFY G+F+G C T NH VT VGYG + G KYW++KNSWG+ WGE GY R+
Sbjct: 268 INASRAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRM 327
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
QR + +G CGIA+ + V
Sbjct: 328 QRGVRAREGVCGIAIAPFYAV 348
>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
Length = 260
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 130/274 (47%), Positives = 166/274 (60%), Gaps = 33/274 (12%)
Query: 81 RLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTP 137
+LNKFAD+T EF + K++ H + + PF+Y++ + VP S++W + GAVT
Sbjct: 1 KLNKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAVTG 60
Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
VK QGQC + AVEGIN IK +LVSLSEQ+LVDC T + N GC GG M+ AF+
Sbjct: 61 VKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDT-EVNQGCNGGLMEYAFE 119
Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
+I QN GIT + Y Y G C+ K A I +E+VP N+E++LLKA ANQP+S
Sbjct: 120 FIKQN-GITTETNYPY-AAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPIS 177
Query: 251 VAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
VAIDA S QFYS GVF G+C T LNHGV NSWG +WGE GY
Sbjct: 178 VAIDAGGSDFQFYSEGVFTGHCGTELNHGV------------------NSWGSEWGEQGY 219
Query: 309 FRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
R+QR I QG CGIAM AS+P+ K S P+ +
Sbjct: 220 IRMQRAISHKQGLCGIAMEASYPIKKSSKNPTKS 253
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 141/345 (40%), Positives = 196/345 (56%), Gaps = 35/345 (10%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
L ++++ + +S RT ++E +KA + ++Y+ + E RF+IF +N +
Sbjct: 6 LLCAFVVVTTAASSHEILRT--------QWEAFKATHKKSYQSNMEELLRFKIFSENSLL 57
Query: 65 VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--- 120
V R N A G SY L +N+F DL P EF G++ + G+ FL +
Sbjct: 58 VARHNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYR----GARTAGRGSTFLPPANVN 113
Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
S +P S++W EKGAVTPVK QGQC ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 YSSLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCS 173
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N+GC GG MD+AF+YI N GI + Y YE G C K ++ A T + D+
Sbjct: 174 ETFGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEA-EDGEC-RFKKQNVGATDTGFVDI 231
Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSE 287
E+ L KAVA PVSVAIDA S+ Q YS GV++ C + L+HGV VGYG E
Sbjct: 232 EQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYGV-E 290
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYWL+KNSW + WG++GY ++ RD D QCGIA AS+P+
Sbjct: 291 DGKKYWLVKNSWAESWGDNGYIKMSRDKDN---QCGIASAASYPL 332
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 177/311 (56%), Gaps = 21/311 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F+ WKA +G +Y E + R I++ NL +E+ N+ SY L +NKFADLT EF
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEG---HSYKLAVNKFADLTYPEF 78
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
A G + +++ + +L + +P SV+W G VTP+K QGQC
Sbjct: 79 AAKYLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
+VEG +A K +LVSLSEQ LVDC++ N GC GG MD AF+YII N GI ++ Y Y
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPY 198
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSG 263
G C + + A + +Y+D+ E L AVA P+SVAIDAS + QFYS
Sbjct: 199 TAQD-GTCQ-FNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSS 256
Query: 264 GVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
GV+N + L+HGV AVGYGTS YWL+KNSWG WG+ GY + R+ + Q
Sbjct: 257 GVYNEPACSSSQLDHGVLAVGYGTSGSS-DYWLVKNSWGTSWGQSGYIWMTRNSNN---Q 312
Query: 322 CGIAMFASFPV 332
CGIA AS+P+
Sbjct: 313 CGIATAASYPL 323
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 138/344 (40%), Positives = 194/344 (56%), Gaps = 26/344 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+V L+ C A + + + +E WK + ++Y ++ E +R ++++NL +
Sbjct: 53 LLVCLL--SLCWGLAVSAPLGDSELDKHWELWKNWHQKSYHKAEEGWRRM-VWEENLKVI 109
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQV 123
E N ++G +Y L +N+F DLT +EF Q S + NG+ FL QV
Sbjct: 110 ELHNLEQSLGLHTYQLGMNQFGDLTNEEF--QQMLISERHFSEGNRINGSAFLEVNYVQV 167
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P SV+W + G VTPVK QG C A+EG K RLVSLSEQ LVDC+
Sbjct: 168 PTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVSLSEQNLVDCSWQQG 227
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG +D AF+YI++N+GI ++ Y Y T C + K E A++T + D+PP+
Sbjct: 228 NQGCNGGIVDFAFQYILENRGIDSEDCYPYTAKDTAQC-AFKPECATARVTGFVDIPPHS 286
Query: 237 EESLLKAVAN-QPVSVAIDA--SALQFYSGGVF-NGYCET-FLNHGVTAVGY---GTSEE 288
EE+L+KAVA PVSVAIDA ++ +FY G+F C + LNH V VGY G E
Sbjct: 287 EEALMKAVATVGPVSVAIDAHPTSFRFYQSGIFYEPKCSSERLNHAVLVVGYGYEGEDEA 346
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYW++KNSWG+ WG+ GYF L +D CGIA AS+P+
Sbjct: 347 GKKYWIVKNSWGKQWGDHGYFYLSKDRGN---HCGIATTASYPL 387
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 126/262 (48%), Positives = 171/262 (65%), Gaps = 18/262 (6%)
Query: 14 GSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI 73
G+ SQ RT E S+ E+ EQW A Y R YK++ E R++IFK+N+ ++ FN+ +
Sbjct: 19 GAWTSQCMARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKENVQRIDSFNSES- 77
Query: 74 GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEK 132
++SY L +N+FADLT +EF + + GFK H S +A F Y++ + VP S++W +K
Sbjct: 78 -DKSYKLAVNQFADLTNEEFKSLRNGFK--GHMCSAQAG--HFRYENVTAVPASIDWRKK 132
Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
GAVT +K QGQC AVAAVEGI IK +L+SLSEQ+LVDC TN + GC GG M
Sbjct: 133 GAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGGLM 192
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
DDAFK+I Q+ G+ ++A Y Y+ + C + + +A+IT YEDVP NDE +L AVA
Sbjct: 193 DDAFKFIEQH-GLASEATYPYDAADS-TCKTKEEAKPSAKITGYEDVPANDEAALKNAVA 250
Query: 246 NQPVSVAIDASA--LQFYSGGV 265
NQPVSVAIDA QFYS G+
Sbjct: 251 NQPVSVAIDAGGFEFQFYSSGI 272
>gi|357139514|ref|XP_003571326.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 363
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 131/335 (39%), Positives = 188/335 (56%), Gaps = 33/335 (9%)
Query: 26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN------NAAIGN---- 75
D+ + +++ +W+A+Y + Y E KRF +F+DN ++ F+ +A +G+
Sbjct: 35 DDSELRQRWSKWQAKYSKRYPSHEEQEKRFGVFRDNSNSIGAFSAPQTTTSAVVGSFGAP 94
Query: 76 ---RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEK 132
+ + +N+F DL P+E + TGF ++ ++ LK L S+ P V+W
Sbjct: 95 QTVTTVRVGMNRFGDLQPREVLDQFTGF--NNTAAVLKTPPPTRLPHHSRKPCCVDWRSS 152
Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
GAVT VK+QG C AVAA+EG+N I+ LVSLSEQQLVDC ++ ++GC GG
Sbjct: 153 GAVTGVKFQGSCQSCWAFAAVAAIEGMNKIRTGTLVSLSEQQLVDC--DNGSSGCAGGRT 210
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAV 244
D A + + GIT+ Y+Y G + G C K DH A + ++ VPPNDE L AV
Sbjct: 211 DTALDLVARRGGITSGERYAYGGFN-GRCKVDKLLFDHGAAVGGFKAVPPNDEHQLAMAV 269
Query: 245 ANQPVSVAIDASA--LQFYSGGVFNGYCE---TFLNHGVTAVGYGTSEEGIKYWLIKNSW 299
A QPV+ +DAS QFYSGG+F G C +NH VT VGY E G K+W+ KNSW
Sbjct: 270 ARQPVTAYVDASTWEFQFYSGGIFRGPCSGDPARVNHAVTIVGY-CEEFGDKFWIAKNSW 328
Query: 300 GQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVS 333
DWG+ GY L +D+ P G CG+A +P +
Sbjct: 329 SDDWGDQGYILLAKDVLSSPNGTCGLATSPFYPTA 363
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 178/321 (55%), Gaps = 31/321 (9%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
+ FE+W A++G+ Y E RF +F+DN+ + + A N + LR+N+FADLT
Sbjct: 39 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSA--LRVNQFADLTND 96
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
EF+++ TG K + + +L P ++W KGAVT VK QG C
Sbjct: 97 EFVSTHTGAKPPCPKDAPRGVDPIWL------PCCIDWRYKGAVTDVKDQGACGSCWAFA 150
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
AVAA+EG+ I+ +L LSEQ+LVDC T ++GC GG D AF+ + GIT ++ Y
Sbjct: 151 AVAAIEGLTQIRTGKLTPLSEQELVDCDTG--SSGCAGGHTDRAFELVAAKGGITAESGY 208
Query: 205 SYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFY 261
YEG G C + A +HAA+I + VPP DE L AVA QPV+ IDAS A QFY
Sbjct: 209 RYEGYR-GKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 267
Query: 262 SGGVFNGYCETF---------LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRL 311
GVF G C + NH VT VGY G KYW+ KNSWG+ WGE GY L
Sbjct: 268 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILL 327
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
++D+ P G CG+A+ +P
Sbjct: 328 EKDVASPHGTCGVAVSPFYPT 348
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 178/321 (55%), Gaps = 31/321 (9%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
+ FE+W A++G+ Y E RF +F+DN+ + + A N + LR+N+FADLT
Sbjct: 17 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSA--LRVNQFADLTND 74
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
EF+++ TG K + + +L P ++W KGAVT VK QG C
Sbjct: 75 EFVSTHTGAKPPCPKDAPRGVDPIWL------PCCIDWRYKGAVTDVKDQGACGSCWAFA 128
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
AVAA+EG+ I+ +L LSEQ+LVDC T ++GC GG D AF+ + GIT ++ Y
Sbjct: 129 AVAAIEGLTQIRTGKLTPLSEQELVDCDTG--SSGCAGGHTDRAFELVAAKGGITAESGY 186
Query: 205 SYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFY 261
YEG G C + A +HAA+I + VPP DE L AVA QPV+ IDAS A QFY
Sbjct: 187 RYEGYR-GKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 245
Query: 262 SGGVFNGYCETF---------LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRL 311
GVF G C + NH VT VGY G KYW+ KNSWG+ WGE GY L
Sbjct: 246 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILL 305
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
++D+ P G CG+A+ +P
Sbjct: 306 EKDVASPHGTCGVAVSPFYPT 326
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 140/353 (39%), Positives = 200/353 (56%), Gaps = 34/353 (9%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K FL++V ++ + A F+ + E++ +K Q+ + Y +E R +I+ N
Sbjct: 2 KLFLLLVSFLAAANAVS----IFN--LVKEEWNAFKLQHRKKYDSESEERIRMKIYVQNK 55
Query: 63 VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS-------LKANGT 114
+ + N +G + LR+NK+ADL +EF+ + GF S + S L
Sbjct: 56 HKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSKLLGREQLMTIEE 115
Query: 115 PFLY---KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLS 164
P + + VP +++W EKGAVTPVK QG C A A+EG + K +LVSLS
Sbjct: 116 PITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLS 175
Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
EQ LVDC+T NNGC GG MD+AF+Y+ NKGI + Y YE + + KA A
Sbjct: 176 EQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYPYEAIDDECHYNPKAI--GA 233
Query: 225 QITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVT 279
+ D+P DE++L KA+A PVSVAIDAS + QFYS GV + C++ L+HGV
Sbjct: 234 TDKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVL 293
Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
AVGYGT+E+G YWL+KNSWG WG+ GY ++ R+ + CGIA AS+P+
Sbjct: 294 AVGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARN---RENHCGIATTASYPL 343
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 138/344 (40%), Positives = 186/344 (54%), Gaps = 25/344 (7%)
Query: 5 FLIVVLIISGSCASQATYR---TFDEGSIAEK----FEQWKAQYGRTYKESAENSKRFEI 57
FL LII +S Y + D+ + E+ F+ W ++ + Y+ E RFEI
Sbjct: 12 FLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEI 71
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
F+DNL+ ++ N N SY L LN FADL+ EF GF D + + F
Sbjct: 72 FRDNLMYIDETNKK---NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFT 128
Query: 118 YKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
YK + P S++W KGAVTPVK QG C +A VEGIN I L+ LSEQ+LV
Sbjct: 129 YKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELV 188
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + ++ GC GG+ + +Y+ N G+ VY Y+ C + +IT Y
Sbjct: 189 DC--DKHSYGCKGGYQTTSLQYVANN-GVHTSKVYPYQAKQYK-CRATDKPGPKVKITGY 244
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
+ VP N E S L A+ANQP+SV ++A Q Y GVF+G C T L+H VTAVGYGTS+
Sbjct: 245 KRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD 304
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G Y +IKNSWG +WGE GY RL+R QG CG+ + +P
Sbjct: 305 -GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
C-169]
Length = 387
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 177/336 (52%), Gaps = 41/336 (12%)
Query: 41 YGRTYKESAENSKRFEIFKDNLVAVERFNNAA----------------------IGNRSY 78
+ + Y E + R IFK N+ + N+A + ++
Sbjct: 7 FNKKYSNEEEAALRLNIFKTNVDYITSVNSAQQSYQASKHFSENTQQTALSSLFLSQLAH 66
Query: 79 T-----LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKG 133
T L LN+FAD T +EF ++ G + S + T F + S+NW+E G
Sbjct: 67 TDLLPQLGLNEFADQTWEEFSSTHLGLNAGEDGSFRSSANTGFRHADVTPANSINWVEAG 126
Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AVTPVK Q C +VEG N + LVSLSEQQLVDC T + GC GG MD
Sbjct: 127 AVTPVKNQAFCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTK-KDQGCGGGLMD 185
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
AF YII+N G+ + YSY + G C+ ++ E I YEDVP NDE +L KAV+
Sbjct: 186 YAFDYIIKNGGLDTEEDYSYWSVG-GFCNKLREERTVVSIDGYEDVPVNDEVALAKAVSK 244
Query: 247 QPVSVAIDAS-ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
QPVSVAI AS A+QFYS GV G C LNHGV A GY E G YWL+KNSWG W
Sbjct: 245 QPVSVAICASEAMQFYSSGVIAAKGSC-IGLNHGVLAAGYDVDESGKPYWLVKNSWGGTW 303
Query: 304 GEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
G GY +L++D +G CGIAM AS+PV K S P
Sbjct: 304 GMQGYMKLEKDSSVKEGACGIAMAASYPV-KSSPNP 338
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 124/310 (40%), Positives = 184/310 (59%), Gaps = 22/310 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
FE W A++ ++Y E ++R +F D L +E+ N A N ++TL LNKF+DLT EF
Sbjct: 2 FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHN--AQPNTTFTLGLNKFSDLTNAEF 59
Query: 94 IASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
A+ G FK + A S +P S++W ++GAVTP+K QGQC A
Sbjct: 60 RANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
+A++E + + LVSLSEQQL+DC T D GC GGF DDAFK++++N G+T + Y
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQ--GCQGGFPDDAFKFVVENGGVTTEEAYP 175
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
Y G + G C++ K + +IT Y+DV + ++L+KAV+ PV+V I S F Y
Sbjct: 176 YTGFA-GSCNTNK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232
Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
G+ +G C +H V +GYGT E G+ YW+IKNSWG WGEDG+ ++++ +G CG
Sbjct: 233 GILSGQCCNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMKIKK--KDGEGMCG 289
Query: 324 IAMFASFPVS 333
+ +S+P +
Sbjct: 290 MNGQSSYPTT 299
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 129/313 (41%), Positives = 174/313 (55%), Gaps = 18/313 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
I ++E++KA++G +Y E ++R +F N+ + N+ +YTL +N+FADLT
Sbjct: 15 IDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKG---HTYTLGVNQFADLT 71
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
+EF + GFK A +Y +P SV+W +GAVTPVK QGQC
Sbjct: 72 VEEFSKTYMGFKKPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWS 131
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
++EG N I +LVSLSEQQ VDCA N GC GG MD AFKY N + +
Sbjct: 132 FSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEANA-LCTEQ 190
Query: 203 VYSYEGMSTGICDSIKAEDHAAQ--ITNYEDVPPNDEESLLKAVANQPVSVAIDA--SAL 258
Y Y+G + G C + A+ ++ Y+DV + E+ ++ AVA QPVS+AI+A S
Sbjct: 191 SYPYKG-TDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVF 249
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
Q YSGGV G C L+HGV AVGYGT G YW +KNSWG WG GY LQR
Sbjct: 250 QLYSGGVLTGACGASLDHGVLAVGYGTL-SGTDYWKVKNSWGSTWGMSGYVLLQRG-KGG 307
Query: 319 QGQCGIAMFASFP 331
G+CG+ S+P
Sbjct: 308 SGECGLLSEPSYP 320
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 138/322 (42%), Positives = 191/322 (59%), Gaps = 27/322 (8%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
EG + +FEQ+K+ +GR Y R IF+ NL + R N + G+ ++++ +N F
Sbjct: 26 EGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNF 85
Query: 86 ADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
DL+ +EF A+ G++ S S+ A+ +P +V+W KG VTP+K Q
Sbjct: 86 TDLSNEEFRATFNGYRRLAAVSLADSVHADN-----DVEALPATVDWTTKGVVTPIKNQQ 140
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC AVA++EG +A+K +LVSLSEQ LVDC+ + + GC GG+MD AFKY+IQN
Sbjct: 141 QCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQN 200
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAID 254
+GI +A Y Y+ + C+ K A I ++ DV DE +L AVA+ P+SVAID
Sbjct: 201 RGIDTEASYPYKAIDES-CE-FKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAID 258
Query: 255 AS--ALQFYSGGVFNGY-CET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
A+ + QFYS GV+N C T L+HGVTAVGYGT G YW +KNSWG WG GY
Sbjct: 259 AAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL-NGAPYWKVKNSWGTSWGRKGYIF 317
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+ R+ Q QCGIA AS+PV
Sbjct: 318 MSRN---KQNQCGIATKASYPV 336
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 115/221 (52%), Positives = 145/221 (65%), Gaps = 12/221 (5%)
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P SV+W EKGAV P+K QG C +A+VEGIN I L+SLSEQ+LVDC
Sbjct: 41 LPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASVEGINKIVTGDLISLSEQELVDCDKT- 99
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N+GC GG MD AF++II N GI + Y Y G CDS + I +YEDVP N
Sbjct: 100 YNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYT-EQDGRCDSYRKNAKVVSINSYEDVPVN 158
Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE++L KA A+QP++VAID + Q Y+ G+F G C T L+HGVT VGYG SE G YW
Sbjct: 159 DEQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTGKCGTSLDHGVTVVGYG-SESGKDYW 217
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
+++NSWG+ WGE GY R+ R+ID P G CGIAM AS+P+ K
Sbjct: 218 IVRNSWGESWGEKGYIRMARNIDSPSGICGIAMEASYPIKK 258
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 196/344 (56%), Gaps = 26/344 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+V L+ C A + + ++ WK + ++Y E+ E +R ++++NL A+
Sbjct: 3 LLVCLV--SLCWGLAVSAPLGDSELDRHWKLWKNWHQKSYHEAEEGWRR-TVWEENLKAI 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
+ N ++G +Y L +N+F DLT +EF TG + S + NG+ FL + QV
Sbjct: 60 QLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTGER--HFSKGNRINGSAFLEANFVQV 117
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P SV+W + G VTPVK QG C A+EG K RL+SLSEQ LVDC+
Sbjct: 118 PTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQQG 177
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC+GG +D AF+YI+QN+GI ++ Y Y T C + K E A +T + D+PP+
Sbjct: 178 NQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDTAQC-TFKPECATAPVTGFVDIPPHS 236
Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYGTSEE--- 288
EE+L+KAVA PVSV IDAS + +FY G+F + C + L+H V VGYG E
Sbjct: 237 EEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYEREDEA 296
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYW++KNSWG+ WG+ GY + +D CGIA AS+P+
Sbjct: 297 GKKYWIVKNSWGKHWGDRGYVYMSKDRGN---HCGIATVASYPL 337
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 143/346 (41%), Positives = 196/346 (56%), Gaps = 37/346 (10%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
L ++ + + SQ RT ++E +K+ + +TYK + E RF+IF +N +
Sbjct: 6 LLCAIVAAATAATSQEILRT--------EWEAFKSTHKKTYKSNVEELLRFKIFTENSLF 57
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YK 119
+ + N A G SY L +N+FADL P EF+ G++ L G+ +L
Sbjct: 58 IAKHNVKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQ----GKRLAGRGSTYLPPANLN 113
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
S +P +V+W +KGAVTPVK QGQC + ++EG + +K +LVSLSEQ LVDC+
Sbjct: 114 DSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCS 173
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ N GC GG MD++F YI N GI + Y YE G C K ED A T + D+
Sbjct: 174 SAYGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEA-EDGDC-RYKKEDVGATDTGFVDI 231
Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF---NGYCETFLNHGVTAVGYGTS 286
E+ L KAVA PVSVAIDAS + Q YS GV+ N E+ L+HGV AVGYG
Sbjct: 232 KEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNCSSES-LDHGVLAVGYGV- 289
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ G KYWL+KNSW + WG+DGY + RD + QCGIA AS+P+
Sbjct: 290 KNGKKYWLVKNSWAETWGQDGYILMSRDKNN---QCGIASSASYPL 332
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 131/327 (40%), Positives = 182/327 (55%), Gaps = 29/327 (8%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E SI E F+QW+ ++ + Y+ +AE+ KR+ FK NL + +++ LNKFA
Sbjct: 43 EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFA 102
Query: 87 DLTPQEF-------IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
DL+ +EF + K S + N ++ P S++W +KG VT VK
Sbjct: 103 DLSNEEFKELYLSKVKKPINIKRSTARDWRQRN-----LQTCDAPSSLDWRKKGVVTAVK 157
Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
QG C A+EGINAI L+SLSEQ+LVDC T N GC GG+MD AF+++
Sbjct: 158 DQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWV 215
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
I N GI +A Y Y G+ G C++ K E I Y DV D +LL A QP+SV
Sbjct: 216 INNGGIDTEANYPYTGVD-GTCNTTKEEIKVVSIDGYTDVDETDS-ALLCATVQQPISVG 273
Query: 253 IDASAL--QFYSGGVFNGYCE---TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
+D SAL Q Y+GG+++G C ++H V VGYG SE G YW++KNSWG +WG +G
Sbjct: 274 MDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYG-SENGEDYWIVKNSWGTEWGMEG 332
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVSK 334
YF ++R+ D P G C I AS+P +
Sbjct: 333 YFYIKRNTDLPYGVCAINAEASYPTKE 359
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ + F + QY + Y AE S RF FK N+ + N A N SYT+ LN+FADL+
Sbjct: 38 LQDMFTAFMKQYSKAYSH-AEFSSRFNQFKANVETIRLHNTLA--NASYTMGLNEFADLS 94
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
+EF G+K H A + P S++W AVTP+K QGQC
Sbjct: 95 FEEFKGKYFGYK---HVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWA 151
Query: 145 --AVAAVEGINAIK-INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
A ++EG ++ + L SLSEQQLVDC+T+ N GC GG MD AF+YII NKGI +
Sbjct: 152 FSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAE 211
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SAL 258
+ Y Y+G+ G+C K+ I+ Y+DV DE SLL AV PVSVAI+A +
Sbjct: 212 SAYPYKGVG-GLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGF 268
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
QFYS GVF+G C L+HGV AVGYGT+ YW++KNSWG WGE GY R+ R+
Sbjct: 269 QFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQ-DYWIVKNSWGTSWGESGYIRMIRN---- 323
Query: 319 QGQCGIAMFASFPV 332
+ QCGIA+ S+P
Sbjct: 324 KNQCGIAIQPSYPT 337
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 138/345 (40%), Positives = 195/345 (56%), Gaps = 29/345 (8%)
Query: 7 IVVLIISGSCAS--QATYRTFDEGSI--AEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
I+VL+ G A+ A+ D G + ++F QW+A + R+Y + E +RFE+++ N+
Sbjct: 14 ILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNV 73
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG------FKMSDHSSSLKANGTPF 116
++ N G +Y L N+FADLT +EF+A G + + L ++G
Sbjct: 74 EYIDATNRR--GGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWSSGGSD 131
Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQG-QC-------AVAAVEGINAIKINRLVSLSEQQL 168
+ P SV+W KGAVTPVK QG QC AVA +E + IK +LV+LSEQQL
Sbjct: 132 GSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQL 191
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC D GC G+ AF++I++N GIT A Y Y+ + G C + K A IT
Sbjct: 192 VDCDKYDG--GCNKGYYHRAFQWIMENGGITTAAQYPYKAVR-GACSAAKP---AVTITG 245
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
+ V N E +L AVA QP+ VAI+ ++QFY GVF+ C ++H V VGYG
Sbjct: 246 HLAVAKN-ELALQSAVARQPIGVAIEVPISMQFYKSGVFSAACGIQMSHAVVTVGYGADA 304
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G+KYWL+KNSWGQ WGE GY R++RD+ G CGIA+ ++P
Sbjct: 305 SGLKYWLVKNSWGQTWGEAGYIRMRRDVGG-GGLCGIALDTAYPT 348
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 140/348 (40%), Positives = 197/348 (56%), Gaps = 29/348 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+F++ ++ I G+ A FD + E++ +K Q+ + YK E R +IF +N
Sbjct: 2 KFFVLALVFIVGAQAVSF----FD--LVQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENS 55
Query: 63 VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN----GTPFL 117
V + N +G SY L++NK+AD+ EF+ + GF + ++ L + G F+
Sbjct: 56 HKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFI 115
Query: 118 YKSS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
++ + P +V+W E GAVT VK QG C A A+EG + K N+LVSLSEQ LV
Sbjct: 116 APANVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLV 175
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC+T N+GC GG MD+AFKY+ N GI +A Y Y + K A +
Sbjct: 176 DCSTKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKCHYNPKTS--GATDRGF 233
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYG 284
D+P DEE L+ AVA PVSVAIDAS + Q YS GV ++ C + L+HGV VGYG
Sbjct: 234 VDIPTGDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYG 293
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
T E G YW++KNSWG+ WGE GY ++ R+ D CGIA AS+P+
Sbjct: 294 TDENGQDYWIVKNSWGESWGEQGYIKMARNRDN---NCGIATQASYPL 338
>gi|357127811|ref|XP_003565571.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 364
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 133/361 (36%), Positives = 193/361 (53%), Gaps = 39/361 (10%)
Query: 3 KYFLIVVLIISGSCASQATYRT------FDEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
+ +L+++L ++G E + +++ W+A+Y +TY E KRF
Sbjct: 9 RPYLVLLLCLTGVLEQALQAAAAPPSWELPESELRQRWTNWQAKYSKTYPSHEEQEKRFG 68
Query: 57 IFKDNLVAVERFN------NAAIGN-------RSYTLRLNKFADLTPQEFIASQTGFKMS 103
+F+ N+ + F+ A +G+ + + +N+F DL P E + TGF +
Sbjct: 69 VFRGNINNIGAFSAAQTTTTAVVGSFGAPQTVTTVRVGMNRFGDLQPSEVLEQFTGFNST 128
Query: 104 DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIK 156
+ K P+ S+ P V+W GAVT VK+QG C AVAA+EG+N I+
Sbjct: 129 VVLKTPKPTRLPY---HSRKPCCVDWRSSGAVTGVKFQGSCLSCWAFAAVAAIEGMNKIR 185
Query: 157 INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS 216
LVSLSEQQLVDC + ++GC GG D A + + GIT++ Y Y G + G C+
Sbjct: 186 TGTLVSLSEQQLVDC--DKGSSGCAGGRTDTALDLVAKRGGITSEEKYPYGGFN-GKCNV 242
Query: 217 IKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCET- 272
K +HAA + ++ VPPNDE L AVA QPV+V +DAS QFYSGG+F G C T
Sbjct: 243 DKLLFEHAAIVKGFKAVPPNDEHQLALAVAQQPVTVYVDASTWEFQFYSGGIFRGPCSTD 302
Query: 273 --FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASF 330
+NH VT VGY + G K+W+ KNSW DWG+ GY L +D+ P G C +A +
Sbjct: 303 PARVNHAVTIVGY-CEDFGEKFWIAKNSWSNDWGDQGYIYLAKDVAWPTGTCSLASSPFY 361
Query: 331 P 331
P
Sbjct: 362 P 362
>gi|115436422|ref|NP_001042969.1| Os01g0347500 [Oryza sativa Japonica Group]
gi|115436426|ref|NP_001042971.1| Os01g0348000 [Oryza sativa Japonica Group]
gi|15290194|dbj|BAB63883.1| putative SAG12 protein [Oryza sativa Japonica Group]
gi|15290200|dbj|BAB63889.1| putative SAG12 protein [Oryza sativa Japonica Group]
gi|21104809|dbj|BAB93394.1| putative SAG12 protein [Oryza sativa Japonica Group]
gi|113532500|dbj|BAF04883.1| Os01g0347500 [Oryza sativa Japonica Group]
gi|113532502|dbj|BAF04885.1| Os01g0348000 [Oryza sativa Japonica Group]
gi|125570283|gb|EAZ11798.1| hypothetical protein OsJ_01672 [Oryza sativa Japonica Group]
Length = 361
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 133/321 (41%), Positives = 174/321 (54%), Gaps = 29/321 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
F QW A+Y + Y E KR++++K N + F + A ++ T + +
Sbjct: 47 FSQWMAKYAKHYSCPEEQEKRYQVWKGNTNFIGAFRSQTQLSSGVGAFAPQTITDSVVGM 106
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
N+F DLT EF+ TGF S S +P ++ P V+W GAVT VK+QG
Sbjct: 107 NRFGDLTSTEFVQQFTGFNASGFHSPPPTPISPHSWQ----PCCVDWRSSGAVTGVKFQG 162
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
CA AA+EG++ IK LVSLSEQ +VDC T + GC GG D A +
Sbjct: 163 NCASCWAFASAAAIEGLHKIKTGELVSLSEQVMVDCDTG--SFGCSGGHSDTALNLVASR 220
Query: 196 KGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
GIT++ Y Y G+ G CD K DH+A ++ + VPPNDE L AVA QPV+V ID
Sbjct: 221 GGITSEEKYPYTGVQ-GSCDVGKLLFDHSASVSGFAAVPPNDERQLALAVARQPVTVYID 279
Query: 255 ASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
ASA QFY GGV+ G C +NH VT VGY + G KYW+ KNSW DWGE GY L
Sbjct: 280 ASAQEFQFYKGGVYKGPCNPGSVNHAVTIVGYCENFGGEKYWIAKNSWSNDWGEQGYVYL 339
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
+D+ PQG CG+A +P
Sbjct: 340 AKDVWWPQGTCGLATSPFYPT 360
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 191/321 (59%), Gaps = 23/321 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F+ W+A+Y RTY E +RF ++ +N+ +E N SY L N+FADLT
Sbjct: 33 LLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPG---SSYELGENRFADLT 89
Query: 90 PQEFIASQTGFKMSDHSSSLKA----------NGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
+EF + K+ + +SS +A GT +++ P SV+W KGAVTPVK
Sbjct: 90 EEEFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVK 148
Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
Q C AVA++EG++ IK LVSLSEQ++VDC NN+GC+GG A +++
Sbjct: 149 SQQHCGSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWV 208
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
+N G+T ++ Y Y G G C S K HAA+I + V +E +L AVA +PV+V+
Sbjct: 209 TRNGGLTTESDYPYVGRQ-GQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVS 267
Query: 253 IDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
I+AS A QFY G+F+G C T NH VT VGYG + G KYW++KNSWG+ WGE GY R+
Sbjct: 268 INASRAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRM 327
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
QR + +G CGIA+ + V
Sbjct: 328 QRGVRAREGVCGIAIAPFYAV 348
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 138/350 (39%), Positives = 200/350 (57%), Gaps = 34/350 (9%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F ++ L+I+ +QA ++ E + E++ +K ++ + Y +S E + R +IF +N
Sbjct: 3 FALITLLIALVAMTQAV--SYSE-LVREEWNTFKLEHRKNYADSTEETFRMKIFNENKHH 59
Query: 65 VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH----SSSLKANGTPFLY- 118
+ + N A G SY L LNK+AD+ EF + GF + H S+ G F+
Sbjct: 60 IAKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISP 119
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
+ ++P +V+W KGAVT VK QG C + A+EG + K LVSLSEQ LVDC
Sbjct: 120 EHVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDC 179
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGIC---DSIKAEDHAAQITN 228
+T NNGC GG MD+AF+Y+ N GI + Y+YEG+ +SI A D
Sbjct: 180 STKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRG----- 234
Query: 229 YEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF---NGYCETFLNHGVTAVG 282
+ D+P +E+ L +AVA PVSVAIDAS + QFYS GV+ N E L+HGV VG
Sbjct: 235 FADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAEN-LDHGVLVVG 293
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YGT ++G YWL+KNSWG WG+ G+ ++ R+ + QCGIA +S+P+
Sbjct: 294 YGTEKDGSDYWLVKNSWGTTWGDKGFIKMSRN---KENQCGIASASSYPL 340
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 181/315 (57%), Gaps = 25/315 (7%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+FE WK +G++Y ++ E R +++ N + V+ N A I SYTL +N FADLT +E
Sbjct: 29 EFEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGI--HSYTLGMNIFADLTHEE 86
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQGQCA---- 145
F G K+ + ++N + ++ V P SV+W G VTPVK QGQC
Sbjct: 87 FKRFYLGTKVDLNRP--RSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWS 144
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
+VEG +A K +LVSLSEQ LVDC+ N GC GG MDDAF+YII NKGI +A
Sbjct: 145 FSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEA 204
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQ 259
Y Y G C A + A +++++D+ E L AVA PVSVAIDAS + Q
Sbjct: 205 SYPYTAKD-GTC-KFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQ 262
Query: 260 FYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Y+ GV+N T L+HGV A GYGTS G YWL+KNSWG WG+ GY + R+ +
Sbjct: 263 LYTSGVYNEKKCSSTSLDHGVLAAGYGTS-NGTPYWLVKNSWGSSWGQAGYIWMSRNANN 321
Query: 318 PQGQCGIAMFASFPV 332
QCGIA AS+P+
Sbjct: 322 ---QCGIATSASYPI 333
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 137/344 (39%), Positives = 185/344 (53%), Gaps = 25/344 (7%)
Query: 5 FLIVVLIISGSCASQATYR---TFDEGSIAEK----FEQWKAQYGRTYKESAENSKRFEI 57
FL LII +S Y + D+ + E+ F+ W ++ + Y+ E RFEI
Sbjct: 12 FLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEI 71
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
F+DNL+ ++ N N SY L LN FADL+ EF GF D + + F
Sbjct: 72 FRDNLMYIDETNKK---NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFT 128
Query: 118 YKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
YK + P S++W KGAVTPVK QG C +A VEGIN I L+ LSEQ+LV
Sbjct: 129 YKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELV 188
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + ++ GC GG+ + +Y+ N G+ VY Y+ C + +IT Y
Sbjct: 189 DC--DKHSYGCKGGYQTTSLQYVANN-GVHTSKVYPYQAKQYK-CRATDKPGPKVKITGY 244
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
+ VP N E S L A+ANQP+S ++A Q Y GVF+G C T L+H VTAVGYGTS+
Sbjct: 245 KRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD 304
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G Y +IKNSWG +WGE GY RL+R QG CG+ + +P
Sbjct: 305 -GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 193/349 (55%), Gaps = 30/349 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M Y +++VL + A+ FDE ++ WK+ + + Y+ E R +++
Sbjct: 1 MTLYLVVLVLCTGAALAAPRFDAQFDE-----HWDLWKSWHSKNYQHEKEEGWRRMVWEK 55
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL +E N ++G SY+L +N F D+T +EF G+K+ K G+ FL
Sbjct: 56 NLKKIEMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNGYKLQQR----KFKGSLFLEP 111
Query: 120 SS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
++ + P V+W E+G VTPVK QGQC A+EG K +LVSLSEQ LVDC
Sbjct: 112 NNMEAPKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDC 171
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ + N GC GG MD AF+YI N G+ ++ Y Y G C+ KAE AA T + D
Sbjct: 172 SRPEGNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQPCN-YKAEFSAANDTGFMD 230
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTS 286
+P E +L+KA+A+ PVSVAIDA + QFY G+ + C + L+HGV AVGYG
Sbjct: 231 IPSGKEHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFE 290
Query: 287 EE---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G KYW++KNSW + WG+ GY + +D + CGIA AS+P+
Sbjct: 291 GEDVDGKKYWIVKNSWSEKWGDKGYILMAKD---RKNHCGIATAASYPL 336
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 126/229 (55%), Positives = 153/229 (66%), Gaps = 13/229 (5%)
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP SV+W +KGAVT VK QGQC +AAVEGINAI+ L SLSEQQLVDC T
Sbjct: 61 VPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTK- 119
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
+N GC GG MD AF+YI ++ G+ + Y Y+ C+ K I YEDVP N
Sbjct: 120 SNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCN--KKPSAVVTIDGYEDVPAN 177
Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE +L KAVA QPV+VAI+AS QFYS GVF G C T L+HGV AVGYGT+ +G KYW
Sbjct: 178 DETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYW 237
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
++KNSWG +WGE GY R++RD++ +G CGIAM AS+PV K S P A
Sbjct: 238 IVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPV-KTSTNPKHA 285
>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
Length = 338
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/341 (39%), Positives = 191/341 (56%), Gaps = 24/341 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
++ L++ CA A FD + +E WK +G+TY+ E+ R E+++ NLV
Sbjct: 8 LVLGSLLLFSLCAGAAA--MFDS-KLDGHWELWKKMHGKTYRNYVEDESRRELWEKNLVL 64
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+ N A++G +Y L +N DLTP+E + S F + ++ +PF S +
Sbjct: 65 ITMHNLEASMGLHTYKLSMNHMGDLTPEEIMQS---FATLTPPTDIQRAPSPFAGTSGAA 121
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP +++W EKG VT VK QG C A A+EG A +LV LS Q LVDC+T
Sbjct: 122 VPDTMDWREKGCVTSVKMQGACGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKY 181
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N+GC GGFM AF+Y+I N GI +DA Y Y G + C + AA + Y +P
Sbjct: 182 GNHGCNGGFMHKAFQYVIDNHGIDSDAAYPYTGRQSQEC-HYSPKFRAANCSQYSFLPEG 240
Query: 236 DEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIK 291
DE +L +A+A P+SVAIDA FYS GV++ C +NHGV AVGYGT G
Sbjct: 241 DEGALKQALATIGPISVAIDARRPRFAFYSSGVYDDPSCSQDVNHGVLAVGYGTL-NGQD 299
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWGQ +G++GY R+ R+ + QCGIA + +P+
Sbjct: 300 YWLVKNSWGQTFGDNGYIRMARNKND---QCGIARYGCYPI 337
>gi|356545071|ref|XP_003540969.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 317
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 132/300 (44%), Positives = 171/300 (57%), Gaps = 40/300 (13%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
ASQ T RT + S+ E+ E+W ++YG+ YK+ E KRF IFK+N+ +E NAAI +
Sbjct: 5 ASQVTCRTLQDASMYERHEEWMSRYGKVYKDPWEREKRFRIFKENMNYIETSKNAAI--K 62
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
Y L +N+FADL +EFIA Q FK G S+ AVT
Sbjct: 63 PYKLVINQFADLNNEEFIAPQNIFK-----------GMIICRLLSR-----------AVT 100
Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
PVK QG C VA+ EGI A+ +L+SLSEQ+LVDC T + GC G MDDAF
Sbjct: 101 PVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCEGDLMDDAF 160
Query: 190 KYIIQNKGITNDAVYSYEGMST----GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
+ ++N + E G C++ + + A IT EDVP N+E++L K VA
Sbjct: 161 FMAVT---LSNSSFKILESRCQLGVDGKCNANEEVNPATTITGXEDVPANNEKALQKVVA 217
Query: 246 NQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
NQPVS+AIDA S QFY GVF G C T L+HGVT VGYG S +G +YWL+KNSW +W
Sbjct: 218 NQPVSIAIDACDSDFQFYKRGVFTGSCGTELDHGVTIVGYGVSHDGTQYWLVKNSWETEW 277
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 134/344 (38%), Positives = 196/344 (56%), Gaps = 27/344 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F+++ + + A+ T++ + ++ +KA +G+ Y+ E R +I+ +N +
Sbjct: 4 FVVLCFLCAAMTAAAITHQEL----VGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMM 59
Query: 65 VERFNNAAIGNR-SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG---TPFLYKS 120
+ R N N+ SY L +N++ D+ EF++++ GF+ D+ S + P +
Sbjct: 60 IARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFR-RDYRSKPRQGSFYIEPEGIED 118
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P +V+W +KGAVTPVK QGQC ++EG + K +VSLSEQ LVDC+T
Sbjct: 119 KHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCST 178
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
NNGC GG MD+AFKYI N GI + Y Y G + G C K D A T + D+P
Sbjct: 179 AFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNG-TDGTC-HFKKSDVGATDTGFVDIP 236
Query: 234 PNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEE 288
+E L KAVA P+SVAIDAS + QFYS GV++ C + L+HGV VGYGT ++
Sbjct: 237 EGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTKDD 296
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG WG+ GY + R+ D QCGIA AS+P+
Sbjct: 297 -QDYWLVKNSWGTTWGDGGYIYMTRNKDN---QCGIASSASYPL 336
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 187/320 (58%), Gaps = 23/320 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
I E+++ +K ++ + + E R +IF +N + + N A G S+ L LNK++D+
Sbjct: 23 IKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDM 82
Query: 89 TPQEFIASQTGFKMSDHSSSLKANG-TPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQC 144
EF + G+ + L+A G + +Y + Q+P SV+W + GAVT VK QG C
Sbjct: 83 LYHEFKETMNGYNHT-MRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHC 141
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
+ AA+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N G
Sbjct: 142 GSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 201
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS 256
I + Y YEG+ C K+ A T + D+P DEE+L+KAVA PVSVAIDAS
Sbjct: 202 IDTEKSYPYEGIDDS-CHFTKSGVGATD-TGFVDIPQGDEEALMKAVATMGPVSVAIDAS 259
Query: 257 --ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
+ Q YS GV+N C+ L+HGV VGYGT + G+ YWL+KNSWG WG+ GY ++
Sbjct: 260 HESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMA 319
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R+ D QCGIA +S+P
Sbjct: 320 RNQDN---QCGIATASSYPT 336
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 139/357 (38%), Positives = 196/357 (54%), Gaps = 40/357 (11%)
Query: 6 LIVVLIISGSCASQATYRTF--------DEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
L++ + S +C S + F E + E F WK ++ R YK + E +KRFEI
Sbjct: 10 LVLFIWASLACLSSSLPTEFYITGEEFASEERVRELFHLWKERHKRVYKHAEETAKRFEI 69
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD--------HSSSL 109
FK+NL V N+ G+R +TL +NKFAD++ +EF S
Sbjct: 70 FKENLKYVIERNSK--GHR-HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQ 126
Query: 110 KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVS 162
+ GT S + P S++W +KG VT +K QG C + A+EGINAI L+S
Sbjct: 127 QKKGTA----SCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLIS 182
Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
LSEQ+LVDC T N GC GG+MD AF+++I N GI +++ Y Y G + G C++ K +
Sbjct: 183 LSEQELVDCDTT--NYGCEGGYMDYAFEWVISNGGIDSESDYPYTG-TDGTCNTTKEDTK 239
Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLN---HG 277
I Y+DV +D +LL A NQP+SV +D SAL Q Y+ G++ G C + H
Sbjct: 240 VVSIDGYKDVDESDS-ALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHA 298
Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
V VGYG SE+ YW+ KNSWG WG +GYF ++R+ D P G+C I AS+P +
Sbjct: 299 VLIVGYG-SEDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKE 354
>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
Length = 379
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 129/332 (38%), Positives = 180/332 (54%), Gaps = 24/332 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
++ F+ WK+++GR Y E +KR EIFK+NL + N S+ L LNKFAD+T
Sbjct: 40 VSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNANRKSPHSHRLGLNKFADIT 99
Query: 90 PQEFIAS--QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
PQEF Q +S Y P S +W +KG +T VKYQG C
Sbjct: 100 PQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGSG 159
Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
A A+E +AI LVSLSEQ+LVDC + + GCY G+ +F++++++ GI
Sbjct: 160 WAFSATGAIEAAHAIATGDLVSLSEQELVDCV--EESEGCYNGWHYQSFEWVLEHGGIAT 217
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE-------ESLLKAVANQPVSVAI 253
D Y Y G C + K +D I YE + +DE ++ L A+ QP+SV+I
Sbjct: 218 DDDYPYRA-KEGRCKANKIQDKVT-IDGYETLIMSDESTESETEQAFLSAILEQPISVSI 275
Query: 254 DASALQFYSGGVFNGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
DA Y+GG+++G T +NH V VGYG S +G+ YW+ KNSWG+DWGEDGY
Sbjct: 276 DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYG-SADGVDYWIAKNSWGEDWGEDGYIW 334
Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
+QR+ G CG+ FAS+P +ES SA
Sbjct: 335 IQRNTGNLLGVCGMNYFASYPTKEESETLVSA 366
>gi|326493706|dbj|BAJ85314.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 133/334 (39%), Positives = 181/334 (54%), Gaps = 32/334 (9%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN----NAAIG------ 74
E + E+F +W +Y + Y E RF++FK+N ++ + + N +G
Sbjct: 39 LPESEVRERFSKWMIKYSKHYSCKQEEEMRFQVFKNNTNSIGQLDRQNPNPGVGGALGPS 98
Query: 75 -NRSYTLR---LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWI 130
++ +T + +N+F DL+P+E I TG +++S + +L S P V+W
Sbjct: 99 GSQVHTFQKVSMNRFGDLSPREVIQQYTGL----NTTSFRTASPTYLPYHSFKPCCVDWR 154
Query: 131 EKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGG 183
GAVT VK+QG C AVAA+EG+N I+ LVSLSEQ LVDC T + GC GG
Sbjct: 155 SSGAVTGVKHQGTCGSCWAFAAVAAIEGMNKIRTGELVSLSEQVLVDCDTV--STGCGGG 212
Query: 184 FMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLK 242
D A + GIT++ Y Y G G CD K DH A I ++ VP N+E L
Sbjct: 213 HSDSAMALVAARGGITSEERYPYAGFQ-GKCDVDKLMFDHQASIKGFKAVPSNNEAQLAI 271
Query: 243 AVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSE-EGIKYWLIKNSW 299
AVA QPV+V IDAS A QFYSGG++ G C +NH VT VGY EG KYW+ KNSW
Sbjct: 272 AVAMQPVTVYIDASGSAFQFYSGGIYRGPCSANVNHAVTIVGYCEGPGEGNKYWIAKNSW 331
Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
DWGE GY L +D+ G CG+A +P +
Sbjct: 332 SNDWGEQGYVYLAKDVAWSTGTCGLATSPFYPTA 365
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 183/322 (56%), Gaps = 25/322 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++E +K ++ + Y E S R +IF +N + N A G+ +Y L +NK+ D+
Sbjct: 25 VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS--QVPPSVNWIEKGAVTPVKYQG 142
EF+++ GF+ +H+ K N G F+ Q+P +V+W KGAVTP+K QG
Sbjct: 85 LHHEFVSTMNGFR-GNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQG 143
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
QC A A+EG K +LVSLSEQ LVDC+ NNGC GG MD+AF+Y+ +N
Sbjct: 144 QCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKEN 203
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAID 254
GI + Y Y+ + +A A+ + DV E +L KAVA PVSVAID
Sbjct: 204 GGIDTEESYPYDAEDEKCHYNPRAA--GAEDKGFVDVREGSEHALKKAVATVGPVSVAID 261
Query: 255 AS--ALQFYSGGVF-NGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
AS + QFYS GV+ C L+HGV VGYG ++G YWL+KNSWG WG+ GY +
Sbjct: 262 ASHESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVK 321
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+ R+ D QCGIA ASFP+
Sbjct: 322 MARNRDN---QCGIASSASFPL 340
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 134/340 (39%), Positives = 195/340 (57%), Gaps = 26/340 (7%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K FL++ ++++ +S+ F + S ++ WK+ +G++Y + E R I++ NL
Sbjct: 2 KVFLVLCVLVA---SSRGWSVRFGQDS---EWVAWKSYHGKSYSDVHEERTRMAIWQQNL 55
Query: 63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ 122
++R N + SY + +N DLT EF G + + H+S+ + T + +
Sbjct: 56 EKIKRHNAE---DHSYKMAMNHLGDLTEDEFRYFYLGVR-AHHNSTKRGWATYMPPSNVK 111
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P SV+W +KG VT VK QGQC +VEG + K LVSLSEQ L+DC+ +
Sbjct: 112 IPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSY 171
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
NNGC GG MD+AF+YI N GI ++ Y Y G G C + A++T Y+D+P
Sbjct: 172 GNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQQ-GSCH-FSSSHVGARVTGYQDIPQG 229
Query: 236 DEESLLKAVAN-QPVSVAIDASALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKY 292
E++L AVA PVSVA+DAS QFYS GV+ N YC T L+HGV +GYG + G Y
Sbjct: 230 SEQALQSAVATVGPVSVAVDASQWQFYSSGVYDNPYCSSTQLDHGVLVIGYG-NYNGQDY 288
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WL+KNSWG WG +GY + R+ + QCGIA AS+P+
Sbjct: 289 WLVKNSWGYSWGVEGYIMMSRNKNN---QCGIASSASYPL 325
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ + F + QY + Y AE S RF FK N+ + N A N SYT+ LN+FADL+
Sbjct: 38 LQDMFTAFMKQYSKAYSH-AEFSSRFNQFKANVETIRLHNTLA--NASYTMGLNEFADLS 94
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
+EF G+K H A + P S++W AVTP+K QGQC
Sbjct: 95 FEEFKGKYFGYK---HVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWA 151
Query: 145 --AVAAVEGINAIK-INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
A ++EG ++ + L SLSEQQLVDC+T+ + GC GG MD AF+YII NKGI +
Sbjct: 152 FSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAE 211
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SAL 258
+ Y Y+G+ G+C K+ I+ Y+DV DE SLL AV PVSVAI+A +
Sbjct: 212 SAYPYKGVG-GLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGF 268
Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
QFYS GVF+G C L+HGV AVGYGT+ YW++KNSWG WGE GY R+ R+
Sbjct: 269 QFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQ-DYWIVKNSWGTSWGESGYIRMIRN---- 323
Query: 319 QGQCGIAMFASFPV 332
+ QCGIA+ S+P
Sbjct: 324 KNQCGIAIQPSYPT 337
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 139/347 (40%), Positives = 197/347 (56%), Gaps = 32/347 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M Y + L + + A R D ++ QWKAQ+G++Y E+ E+S R ++
Sbjct: 1 MNFYLCLASLCLGLAAAIPPFDRALDS-----QWHQWKAQHGKSY-EANEDSLRRATWEK 54
Query: 61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL +ER N + G S+ LR+NKF D++ +EF G+K + S + G+ LY+
Sbjct: 55 NLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQVMNGYK--SNGSQRRTKGS--LYR 110
Query: 120 SS---QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
S Q+P SV+W EKG VTPVK QG C AV A+EG K +LVSLS Q L+
Sbjct: 111 ESLLAQLPESVDWREKGYVTPVKEQGDCGACWSFSAVGAIEGQWFRKTGKLVSLSIQNLI 170
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + NNGC GGFMD+AF+Y+ N GI + Y Y T K E A IT +
Sbjct: 171 DCTIPEGNNGCDGGFMDNAFQYVQDNGGIDTEECYPYVAQDTEC--KYKPECSGANITGF 228
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG 284
D+P DE +L++AVA P+SV ID++ + +FY GV + C + L+HGV VGYG
Sbjct: 229 VDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQSGVYYEPDCSSSQLDHGVLVVGYG 288
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+ + +YW++KNSWG+ WG++GY + +D D CGIA AS+P
Sbjct: 289 SIGKD-EYWIVKNSWGEAWGDNGYILMAKDKDN---HCGIATEASYP 331
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 141/361 (39%), Positives = 202/361 (55%), Gaps = 34/361 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIA----EKFEQWKAQYGRTYKESAENSKRFE 56
MA + L++ +C+ F + +IA E+F+ W+A+Y RTY E +RF
Sbjct: 3 MATASASLALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFM 62
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF 116
++ +NL ++ N + G+ SY L N+F DLT +EF + + M A P
Sbjct: 63 VYSENLRFIKTMNQLSTGS-SYELGENQFTDLTEEEF---KDTYLMKLDEQPPAAEAMPP 118
Query: 117 LY------------KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKI 157
+ + + P SV+W KGAVTPVK Q QC VA++EG++ IK
Sbjct: 119 IVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKT 178
Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
RLVSLSEQ++VDC N++GC GG+ A +++ +N G+T ++ Y Y G S C S
Sbjct: 179 GRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYVG-SQRQCMSG 237
Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFNGYCE-TFLN 275
K HAA+I Y+ V +E L +AVA +PV+V IDAS A QFY GVF+G C T +N
Sbjct: 238 KLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVN 297
Query: 276 HGVTAVGYGTSEEGI----KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
H VT VGYG++ KYW++KNSWGQ WGE+GY R+ R + +G C IA+ +P
Sbjct: 298 HAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGMCAIAIEPYYP 357
Query: 332 V 332
V
Sbjct: 358 V 358
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 125/268 (46%), Positives = 162/268 (60%), Gaps = 15/268 (5%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W + + + Y+ E RFE+FKDNL ++ N +SY L LN+FADL+
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG---KSYWLGLNEFADLS 103
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF G K + + F Y+ + VP SV+W +KGAV VK QG C
Sbjct: 104 HEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
VAAVEGIN I L +LSEQ+L+DC T NNGC GG MD AF+YI++N G+ +
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
Y Y M G C+ K E I ++DVP NDE+SLLKA+A+QP+SVAIDAS Q
Sbjct: 223 EDYPY-SMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQ 281
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSE 287
FYSGGVF+G C L+HGV AVGYG+S+
Sbjct: 282 FYSGGVFDGRCGVDLDHGVAAVGYGSSK 309
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 187/321 (58%), Gaps = 24/321 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
I E+++ +K ++ + Y E R +IF +N + + N A G S+ L LNK+AD+
Sbjct: 23 IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADM 82
Query: 89 TPQEFIASQTGFKMSDHSSSLKA----NGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
EF + G+ + L+A NG ++ ++ QVP +V+W + GAVT VK QG
Sbjct: 83 LHHEFKETMNGYNHT-MRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGH 141
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + ++EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 142 CGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 201
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
G+ + Y YEG+ C KA A T + D+P DEE+++KAVA PV+VAIDA
Sbjct: 202 GVDTEKSYPYEGIDDS-CHFNKATVGATD-TGFVDIPQGDEEAMMKAVATMGPVAVAIDA 259
Query: 256 S--ALQFYSGGVFNG-YCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
S + Q YS GV+N C + L+HGV VGYGT ++G YWL+KNSWG WG+ GY ++
Sbjct: 260 SNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKM 319
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
R+ D QCGIA +SFP
Sbjct: 320 ARNQDN---QCGIATASSFPT 337
>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 374
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 141/347 (40%), Positives = 189/347 (54%), Gaps = 34/347 (9%)
Query: 27 EGSIAEKFEQWKAQYG---RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLN 83
E S+ +++W+ YG + ++ A+ RFE+FK N + FN SY L LN
Sbjct: 36 EESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKK--GMSYKLGLN 93
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQG 142
KFADLT +EF A TG + G+P L + PP+ +W E GAVT VK QG
Sbjct: 94 KFADLTLEEFTAKYTGANPGPITGLKNGTGSPPLAAVAGDAPPAWDWREHGAVTRVKDQG 153
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C V AVEGINAI L++LSEQQ++DC+ + C GG+ AF Y + N
Sbjct: 154 PCGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCSGAGD---CSGGYTSYAFDYAVSN 210
Query: 196 KGITNDAVYS-------------YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
GIT D +S YE + C + +I +Y V PNDEE+L +
Sbjct: 211 -GITLDQCFSPPTTGENYFYYPAYEAVQE-PCRFDPNKAPIVKIDSYSFVDPNDEEALKQ 268
Query: 243 AVANQ-PVSVAIDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
AV +Q PVSV I+AS Y GGVF+G C T LNH V VGY +E+G YW++KNSWG
Sbjct: 269 AVYSQGPVSVLIEASYEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKNSWG 328
Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKSSA 347
WGE GY R+ R+I P+G CGIAM+ +P+ K P +A ++A
Sbjct: 329 AGWGESGYIRMIRNIPAPEGICGIAMYPIYPI-KSCPCPITAASAAA 374
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 183/317 (57%), Gaps = 19/317 (5%)
Query: 30 IAEKFEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
++ ++ W A++G+ S + +RFE FK+N +E N A G SY L LN+F+DL
Sbjct: 9 LSGEYASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRA--GKHSYRLGLNQFSDL 66
Query: 89 TPQEFIASQTGFKMSDHSSSL----KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
T +EF G + S + + + +++ +P SV+W + GAVT K QG C
Sbjct: 67 TSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPKDQGSC 126
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A+EGIN I +L+SLSEQ+L+DC + GC GG M++A+++I++N G
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLMSLSEQELIDC-DKKADKGCDGGLMENAYQFIVENGG 185
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+ + Y Y S C+ K I YE +P DE++LL+AVA QPVSVAI+ ++
Sbjct: 186 LDTETDYPYHA-SESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGAS 244
Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
Q Y+ GVF G+C +NHGV VGYGT E+G+ YW++KNSW WG+ G+ ++QR+
Sbjct: 245 KDFQHYASGVFTGHCGEEINHGVLIVGYGT-EDGLDYWIVKNSWAATWGDGGFVKMQRNT 303
Query: 316 DQPQGQCGIAMFASFPV 332
+ G C I AS+PV
Sbjct: 304 GKRGGLCSINTLASYPV 320
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 185/320 (57%), Gaps = 28/320 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
+ ++E +K + ++Y+ E R++IF +N + + + N A G SY L +N+F DL
Sbjct: 3 LRTQWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDL 62
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFL----YKSSQVPPSVNWIEKGAVTPVKYQGQC 144
P EF G+ K G+ FL S +P +V+W +KGAVTPVK QGQC
Sbjct: 63 LPHEFAKMFNGYH-----GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQC 117
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A ++EG + +K +LVSLSEQ L+DC+ + N GC GG MD+AFKYI N G
Sbjct: 118 GSCWAFSATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDG 177
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA- 255
I + Y YE M G C K ED A T + D+ E+ L KAVA P+SVAIDA
Sbjct: 178 IDTEESYPYEAMD-GDC-RFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDAS 235
Query: 256 -SALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
S+ Q YS GV++ C + L+HGV AVGYG + G KYWL+KNSW + WG++GY +
Sbjct: 236 HSSFQLYSEGVYDEPNCSSEELDHGVLAVGYGV-KNGKKYWLVKNSWAETWGDNGYILMS 294
Query: 313 RDIDQPQGQCGIAMFASFPV 332
RD D QCGIA AS+P+
Sbjct: 295 RDKDN---QCGIASSASYPL 311
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 142/358 (39%), Positives = 196/358 (54%), Gaps = 45/358 (12%)
Query: 1 MAKYFLIVVLIISGSCA-------------SQATYRTFDEGSIAEKFEQWKAQYGRTYKE 47
M+ F+I +L+ S + + +RT +E + E +E W A++ + Y
Sbjct: 1 MSTLFIISILLFLASFSYAMDISTIEYKYDKSSAWRTDEE--VKEIYELWLAKHDKVYSG 58
Query: 48 SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS 107
E KRFEIFKDNL ++ N+ N +Y + L + DLT +EF A G + SD
Sbjct: 59 LVEYEKRFEIFKDNLKFIDEHNSE---NHTYKMGLTPYTDLTNEEFQAIYLGTR-SDTIH 114
Query: 108 SLKAN---GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIK 156
LK + Y++ +P ++W +KGAVTPVK QG+C V+ VE IN I+
Sbjct: 115 RLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIR 174
Query: 157 INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS 216
L+SLSEQQLVDC N N+GC GG A++YII N GI +A Y Y+ + G C
Sbjct: 175 TGNLISLSEQQLVDC--NKKNHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQ-GPC-- 229
Query: 217 IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFL 274
+A +I Y+ VP +E +L KAVA+QP VAIDAS+ QF Y G+F+G C T L
Sbjct: 230 -RAAKKVVRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKL 288
Query: 275 NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
NHGV VGY YW+++NSWG+ WGE GY R++R G CGIA +P
Sbjct: 289 NHGVVIVGYWKD-----YWIVRNSWGRYWGEQGYIRMKR--VGGCGLCGIARLPYYPT 339
>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 406
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 132/347 (38%), Positives = 188/347 (54%), Gaps = 47/347 (13%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
+ ++F W + R+Y + E ++RFE+++ N+ +E N AA +Y L F DL
Sbjct: 59 MMDRFHVWMTVHNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGPFTDL 118
Query: 89 TPQEFIASQTGFKM---------------SDHSSSLKANGTP-----FLYKSSQVPPSVN 128
T +EF+ TG + + H+ S+ GT + S+ P S++
Sbjct: 119 TNEEFMELYTGQILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFSASAPTSID 178
Query: 129 WIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
W ++G VTPVK Q QC VA +EGI+ IK LVSLSEQQL+DC DN GC
Sbjct: 179 WRKRGVVTPVKNQKQCGSCWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCDYLDN--GCK 236
Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
GG + AF++I +N GIT+ + Y Y+ + G C ++ AA+I + V N E SL+
Sbjct: 237 GGLVTRAFQWIKKNGGITSTSSYKYKAVR-GRC--LRNRKPAAKIVGFRKVKSNSEVSLM 293
Query: 242 KAVANQPVSVAIDASALQF--YSGGVFNGYCETF-LNHGVTAVGYGTSEE---------- 288
AVANQPV+V+I + + F Y GG++NG C T LNH VT VGYG ++
Sbjct: 294 NAVANQPVAVSISSHSSHFHHYKGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHASA 353
Query: 289 -GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
G KYW++KNSWG WG+ GY ++R GQCGIA FP+ K
Sbjct: 354 PGAKYWIVKNSWGTTWGDKGYILMKRGTKHSSGQCGIATRPVFPLMK 400
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 193/344 (56%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L ++ ++ + +SQ RT ++E +K + +TY+ E RF+IF +N + +
Sbjct: 7 LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+ N A G SY L +N+F DL EF G H + K G+ FL
Sbjct: 59 AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSTFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI +N GI + Y YE + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E+ L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 189/318 (59%), Gaps = 21/318 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ ++ +KA +G+ Y+ E R +I+ +N + + R N A SY L +N+F D+
Sbjct: 19 VGAEWSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDM 78
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
EF++++ GFK + + + + P + +P +V+W +KGAVTPVK QGQC
Sbjct: 79 LHHEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQGQCGS 138
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
++EG + K+++LVSLSEQ L+DC+ + NNGC GG MD AFKYI NKGI
Sbjct: 139 CWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKANKGID 198
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
+ Y Y + G+C K+ A T + D+P DE L KAVA PVSVAIDAS
Sbjct: 199 TEQSYPYNA-TDGVCHFNKSAV-GATDTGFVDIPEGDENKLKKAVATVGPVSVAIDASHE 256
Query: 257 ALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
+ QFYS GV++ C++ L+HGV VGYGT ++G YWL+KNSWG WG+ GY + R+
Sbjct: 257 SFQFYSEGVYDEPECDSEQLDHGVLVVGYGT-KDGQDYWLVKNSWGTTWGDGGYIYMSRN 315
Query: 315 IDQPQGQCGIAMFASFPV 332
D QCGIA AS+P+
Sbjct: 316 KDN---QCGIASAASYPL 330
>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
purpuratus]
gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 334
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 135/344 (39%), Positives = 200/344 (58%), Gaps = 26/344 (7%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K F+IV+L ++G+ A++ R FDE ++++W +G+ Y E +R I++DNL
Sbjct: 2 KTFIIVLLSVAGALATRLPSRDFDE-----EWKEWVDYHGKEYSAMGEEMERRMIWEDNL 56
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
+ + N + G +Y L +N+F D+T EF+A++T KMS G+ FL
Sbjct: 57 RIITKHNLEHSQGKTTYRLGMNEFGDMTNAEFVATRTMKKMS--GVPKVGQGSTFLPSEF 114
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
Q+P SV+W +G VTPVK QGQC V A+EG + +K LVSLSEQ LVDC+
Sbjct: 115 LQLPDSVDWRTEGYVTPVKDQGQCGSCWAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQ 174
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ N+GC GG+ A +YI N GI + Y YEG+ + D A IT + +V
Sbjct: 175 AEGNDGCNGGWPAWADEYIKSNGGIDTEVGYPYEGVDDSC--HYRTSDVGATITGFAEVE 232
Query: 234 PNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEE 288
+ E++L KA+A P+SV IDA+ + Q Y GV++ T L+H VTAVGY ++ +
Sbjct: 233 ADSEKALEKALAQVGPISVCIDATQPSFQLYESGVYDEPDCSSTALDHCVTAVGYDSTAD 292
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KY+++KNSWG WG++GY + RD Q QCGIA A++P+
Sbjct: 293 GDKYYIVKNSWGTTWGQEGYIWMSRD---KQKQCGIATNATYPL 333
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 178/340 (52%), Gaps = 52/340 (15%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+F++W G Y++ E RF I++ N VE SY L NKFADLT +E
Sbjct: 4 RFDRWLKXNGXNYEDKEEWEIRFVIYQAN---VEYIGCKKSQKNSYNLTDNKFADLTNEE 60
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
F+++ GF ++ L + ++ +P S +W ++GAVT +K QG C
Sbjct: 61 FVSTYLGF-----ATRLIPHTRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWFS 115
Query: 146 -----------------------------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
VAAVE IN IK +LVSLSEQ+LVD +
Sbjct: 116 PEISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVANK 175
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG MD F +I +N G+T Y YEG+ G C+ KA HA I+ YE P D
Sbjct: 176 NQGCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVD-GSCNKEKALHHAVNISGYERAPSKD 234
Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGY--GTSEEGIKY 292
E L A ANQP+SVAIDA A Q YS GVF+G C LNHGVT VGY GT + KY
Sbjct: 235 EAMLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFD---KY 291
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+KNS G DWGE GY R++RD G CGIAM AS+P+
Sbjct: 292 RTVKNSXGADWGESGYIRMKRDAFDKAGTCGIAMKASYPL 331
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 182/332 (54%), Gaps = 31/332 (9%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
++A +F++WKA++GR Y E +R ++ N+ +E N +Y L + DL
Sbjct: 48 TMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTDL 107
Query: 89 TPQEFIASQT---------------GFKMSDHSSSLKANGTPFLYKSSQV--PPSVNWIE 131
T EF A T ++ + ++ A G + S P SV+W
Sbjct: 108 TADEFTAMYTSPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASVDWRA 167
Query: 132 KGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
KGAVT VK QG+C VA VEGI+ I+ L+SLSEQ+LVDC T D GC GG
Sbjct: 168 KGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTLDY--GCDGGV 225
Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV 244
A ++I N GI +A Y Y G G C + K HAA I+ + V E SL AV
Sbjct: 226 SYHALEWIASNGGIATEADYPYTGKD-GACVANKLPLHAAAISGFARVATRSEPSLANAV 284
Query: 245 ANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI-KYWLIKNSWGQ 301
A QPV+V+I+A Q Y GV+NG C T LNHGVT VGYG E KYW++KNSWG+
Sbjct: 285 AAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSWGK 344
Query: 302 DWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
WG+ GYFR+++D+ +P+G CGIA+ SFP+
Sbjct: 345 KWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 175/312 (56%), Gaps = 21/312 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFE--IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+F W + ++ ++ E +KR E I D + NA G + L N+F+ ++
Sbjct: 27 HEFSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVK---LDHNEFSSMS 83
Query: 90 PQEFIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF TG+ M + + A+ L+ QVP SV+W +KG VTPVK QG C
Sbjct: 84 FEEFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCW 143
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AVEG + +LVSLSEQ+LVDC N + GC GG MD AF +I N GI ++
Sbjct: 144 AFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHN-GDMGCNGGLMDHAFAWIEDNGGICSE 202
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQ 259
Y Y+ + D K +I+ ++DV P DE +L AVA QPVSVAI+A A Q
Sbjct: 203 DDYEYKAKAQVCRDCEKV----VKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 258
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GVFN C T L+HGV AVGYG SE G K+W +KNSWG WGE GY RL R+ + P
Sbjct: 259 FYKSGVFNLTCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPA 317
Query: 320 GQCGIAMFASFP 331
GQCGIA S+P
Sbjct: 318 GQCGIASVPSYP 329
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 131/324 (40%), Positives = 186/324 (57%), Gaps = 29/324 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
I E++ +K ++ +TY++ E R +IF +N + + N A G ++ + +NK+AD+
Sbjct: 23 IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADM 82
Query: 89 TPQEFIASQTGFKMSDH----SSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
EF + GF + H +S G F+ + ++P SV+W EKGAVT VK QG
Sbjct: 83 LHHEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQGH 142
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG + K LVSLSEQ LVDC+ NNGC GG MD+AF+YI N
Sbjct: 143 CGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKDNG 202
Query: 197 GITNDAVYSYEGMSTGIC---DSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVA 252
GI + Y YEG+ DS+ A D + D+P +E+ + +AVA PVSVA
Sbjct: 203 GIDTEKSYPYEGIDDSCHFNKDSVGATDRG-----FADIPQGNEKKMAEAVATIGPVSVA 257
Query: 253 IDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
IDAS + QFYS G++N C + L+HGV VGYGT E G YWL+KNSWG WG+ G+
Sbjct: 258 IDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGF 317
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
++ R+ D QCGIA +S+P+
Sbjct: 318 IKMARNEDN---QCGIASASSYPL 338
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 175/312 (56%), Gaps = 21/312 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFE--IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+F W + ++ ++ E +KR E I D + NA G + L N+F+ ++
Sbjct: 27 HEFSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVK---LDHNEFSSMS 83
Query: 90 PQEFIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
+EF TG+ M + + A+ L+ QVP SV+W +KG VTPVK QG C
Sbjct: 84 FEEFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCW 143
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AVEG + +LVSLSEQ+LVDC N + GC GG MD AF +I N GI ++
Sbjct: 144 AFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHN-GDMGCNGGLMDHAFAWIEDNGGICSE 202
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQ 259
Y Y+ + D K +I+ ++DV P DE +L AVA QPVSVAI+A A Q
Sbjct: 203 DDYEYKAKAQVCRDCEKV----VKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 258
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GVFN C T L+HGV AVGYG SE G K+W +KNSWG WGE GY RL R+ + P
Sbjct: 259 FYKSGVFNLTCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPA 317
Query: 320 GQCGIAMFASFP 331
GQCGIA S+P
Sbjct: 318 GQCGIASVPSYP 329
>gi|195379496|ref|XP_002048514.1| GJ14012 [Drosophila virilis]
gi|194155672|gb|EDW70856.1| GJ14012 [Drosophila virilis]
Length = 327
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 194/337 (57%), Gaps = 27/337 (8%)
Query: 8 VVLIISG-SCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
++LI++G C +Y E +A +FE +K +Y ++Y++ E R +IFKDN ++
Sbjct: 6 LLLIVAGVGCNRALSY----EDVLASEFESFKVEYEKSYEDDGEEQLRMQIFKDNKQLID 61
Query: 67 RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQ-TGFKMSDHSSSLKANGTPFLYKSSQVP 124
R N A G +Y + +N+F D+ EF +SD +SS++ +P ++++P
Sbjct: 62 RHNERYAAGEETYEMGVNQFTDMLATEFRKIMLVNLNISDFTSSIEYIYSP---ANAEIP 118
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
V+W EKGAVTPVK QG+C A A+EG + I+ +L+ LSEQ L+DC++ NN
Sbjct: 119 SQVDWREKGAVTPVKNQGRCGSCWAFSAAGALEGQHFIQTKQLIPLSEQNLLDCSSRYNN 178
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
+GC GG+ A Y+ N+G+ ND Y YEG G C + +A +T V DE
Sbjct: 179 HGCGGGWPAAALMYVRDNRGMDNDRAYPYEG-HVGRC-RFRRYSVSATVTQVMQVR-RDE 235
Query: 238 ESLLKAVANQ-PVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
+L AVA + PVSVA+DA+ Q Y GGV++ C NH + VGYG+ + G +WLIK
Sbjct: 236 VALANAVATKGPVSVAVDATYFQHYRGGVYSHRCRQQANHAMLVVGYGSDQRGGDFWLIK 295
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQ-CGIAMFASFPV 332
NSWG WGE GY RL R+ QG C +A +A FP+
Sbjct: 296 NSWG-GWGEQGYMRLARN----QGNLCHVASYAVFPI 327
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 191/342 (55%), Gaps = 24/342 (7%)
Query: 8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
V L I C A + + ++ WK+ + + Y E E+ +R +++ NL +E
Sbjct: 18 VCLTILSLCLGLAFAAPRVDPDLDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIEL 76
Query: 68 FN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
N + ++G SY L +N+F D+T +EF G+K S K G+ FL S + P
Sbjct: 77 HNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHK--KSERKYRGSQFLEPSFLEAPR 134
Query: 126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
SV+W EKG VTPVK QGQC A+EG + K +LVSLSEQ LVDC+ + N
Sbjct: 135 SVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQ 194
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MD AF+Y+ N GI ++ Y Y C KAE +AA T + D+P E
Sbjct: 195 GCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDC-RYKAEYNAANDTGFVDIPQGHER 253
Query: 239 SLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEE---GI 290
+L+KAVA+ PVSVAIDA S+ QFY G+ + C + L+HGV VGYG E G
Sbjct: 254 ALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGK 313
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYW++KNSWG+ WG+ GY + +D + CGIA AS+P+
Sbjct: 314 KYWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYPL 352
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 192/344 (55%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L + ++ + +SQ RT ++E +K + +TY+ E RF+IF +N + +
Sbjct: 7 LCAIAAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+ N A G SY L +N+F DL EF G H + K G+ FL
Sbjct: 59 AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSTFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P +V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI N GI + Y YE + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E+ L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 133/335 (39%), Positives = 185/335 (55%), Gaps = 45/335 (13%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F ++A Y RTY E +RFE+++ N+ +E N G+ +Y L N+FADLT
Sbjct: 36 MMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRR--GDLTYELGENQFADLT 93
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV------------------------PP 125
QEF A ++ + + P ++ Q+ P
Sbjct: 94 VQEFRAM--------YTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPT 145
Query: 126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
SV+W KGAVTPVK QG C VA +EG++ IK +LVSLSEQ+LVDC D+
Sbjct: 146 SVDWRSKGAVTPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGC 205
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
G + A +++ N G+T +A Y Y G + G CD KA +HAA+I + V N E
Sbjct: 206 GGG--LPEIAMEWVAHNGGLTTEANYPYTGKA-GKCDRGKASNHAAKIAAAQMVRANSEA 262
Query: 239 SLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKN 297
L +AVA QPV+VAI+A +L FY GV++G C +H VT VGYG +G KYW+IKN
Sbjct: 263 ELERAVARQPVAVAINAPDSLMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKN 322
Query: 298 SWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
SW + WGE GY R+QR + +G CGIA AS+PV
Sbjct: 323 SWAETWGEKGYGRMQRGVAAKEGLCGIATHASYPV 357
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 187/311 (60%), Gaps = 17/311 (5%)
Query: 11 IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN 70
++ +C Q ++ E +E+ E+W AQYG+ Y+++AE KRF+IFK+N+ +E FN
Sbjct: 92 LVGVTCGRQCRSKSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNV 151
Query: 71 AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--SQVPPSVN 128
A G++ + +R+N+F DL +EF A + T F Y S + +P +++
Sbjct: 152 A--GDKPFNIRINQFPDLHDEEFKALLINGQRKVSGVETATEETSFRYGSVVTNIPATMD 209
Query: 129 WIEKGAVTPVKYQ---GQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
+KG VTP+K Q G C AVAA+EGI+ I ++L+ LS+Q+LVD + + GC
Sbjct: 210 GRKKGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGE-SEGCI 268
Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
GG+++DAF++I++ GI ++ Y Y+G++ C K A I YE VP N++++LL
Sbjct: 269 GGYVEDAFEFIVKKGGILSETHYPYKGVNX--CKVEKETHSVAHIKGYEKVPSNNKKALL 326
Query: 242 KAVANQPVSVAID--ASALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNS 298
K VANQPVSV ID A A ++YS +FN C + NH V VGYG + +G KYW +KNS
Sbjct: 327 KVVANQPVSVYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKALDGAKYWPVKNS 386
Query: 299 WGQDWGEDGYF 309
WG +WG Y
Sbjct: 387 WGTEWGGKWYM 397
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 192/346 (55%), Gaps = 40/346 (11%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FLI+VL ++ + A + F K +G+ YK E + R IF+DN
Sbjct: 3 FLILVLSVTMATAMDVEWEAF------------KLTHGKQYKSPDEENVRRAIFRDNNQM 50
Query: 65 VERFNN-AAIGNRSYTLRLNKFADLTPQEFIASQTG-----FKMSDHSSSLKANGTPFLY 118
++ N AA+G RSY + +N+F DL E++ G +S S ++ TP L
Sbjct: 51 IKEHNQEAAMGRRSYFMGMNQFGDLAHSEYLELVVGPGLLPLNLSTPSENV-FESTPGL- 108
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
QV +V+W +KGAVTP+K QG C ++EG + +K +LVSLSEQ L+DC
Sbjct: 109 ---QVDDTVDWRQKGAVTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDC 165
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ N GC GG MD AF+YI N GI + Y Y +CD K A +++Y D
Sbjct: 166 SRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKVCD-YKTSCSGATLSSYTD 224
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTS 286
+ DE +L++AV PVSVAIDAS +L+FY G+++ T L+HGV AVGYG S
Sbjct: 225 IKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPECSRTKLDHGVLAVGYG-S 283
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G+ YWL+KNSWG WG+ GY ++ R+ + QCGIA AS+PV
Sbjct: 284 MDGMDYWLVKNSWGSAWGDMGYVKMTRNKNN---QCGIATKASYPV 326
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 192/344 (55%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L ++ ++ + +SQ RT ++E +K + +TY+ E RF+IF +N + +
Sbjct: 7 LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+ N A G SY L +N+F DL EF G H + K G+ FL
Sbjct: 59 AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HRGTRKTGGSTFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P +V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI N GI + Y YE + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 140/360 (38%), Positives = 200/360 (55%), Gaps = 34/360 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIA----EKFEQWKAQYGRTYKESAENSKRFE 56
MA + L++ +C+ F + +IA E+F+ W+A+Y RTY E +RF
Sbjct: 3 MATASASLALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFM 62
Query: 57 IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF 116
++ +NL ++ N + G+ SY L N+F DLT +EF + + M A P
Sbjct: 63 VYSENLRFIKTMNQLSTGS-SYELGENQFTDLTEEEF---KDTYLMKLDEQPPAAEAMPP 118
Query: 117 LY------------KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKI 157
+ + + P SV+W KGAVTPVK Q QC VA++EG++ IK
Sbjct: 119 IVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKT 178
Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
RLVSLSEQ++VDC N++GC GG+ A +++ +N G+T ++ Y Y G S C S
Sbjct: 179 GRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYVG-SQRQCMSG 237
Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFNGYCE-TFLN 275
K HAA+I Y+ V +E L +AVA +PV+V IDAS A QFY GVF+G C T +N
Sbjct: 238 KLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVN 297
Query: 276 HGVTAVGYGTSEEGI----KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
H VT VGYG++ KYW++KNSWGQ WGE+GY R+ R + +G C IA+ P
Sbjct: 298 HAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGMCAIAIEPLLP 357
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 181/316 (57%), Gaps = 21/316 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTP 90
F WK ++GR+Y S+E KR +I+ N V N A G+ +Y L + +ADL
Sbjct: 24 HDFHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEH 83
Query: 91 QEFIASQTGFKMSDHSSSLKANGTPFL--YKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
+EF + G + ++S G+ FL ++ +P +++W + G VTPVK QG C
Sbjct: 84 EEFKQTVFGVCLGSFNASKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTPVKNQGSCGSCW 143
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
+ A+EG N K RLVSLSEQ+LVDC+ N N GC GG+MD+AF+YI+ GI +
Sbjct: 144 SFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTE 203
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--AL 258
Y YEG G C + E A T Y D+P +E +L +AVA PVSVAI AS +
Sbjct: 204 DSYPYEGQ-VGQCRANYGEI-GATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSF 261
Query: 259 QFYSGGVFNG-YCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
Q Y GV+N YC T L+H V VGYGT E G YWL+KNSWG WG+ GY ++ R+
Sbjct: 262 QLYHSGVYNNPYCSGTALDHAVLIVGYGT-EYGQDYWLVKNSWGPAWGDQGYIKMSRN-- 318
Query: 317 QPQGQCGIAMFASFPV 332
QCGIA ASFP+
Sbjct: 319 -RYNQCGIASAASFPL 333
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 192/344 (55%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L + ++ + +SQ RT ++E +K + +TY+ E RF+IF ++ + +
Sbjct: 7 LCAIAAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTESSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
R N A G SY L +N+F DL EF G H + K G+ FL
Sbjct: 59 ARHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSTFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P +V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI N GI + Y YE + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E+ L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 196/330 (59%), Gaps = 28/330 (8%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGN 75
A+ A+ FDE ++ E + +K + +TY AE+ +RF I++ +L + + N A +G
Sbjct: 8 ATLASPLVFDE-ALDEMWTLFKTTHSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGK 65
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGA 134
+++L +N++ DLT E+ A+ +G+KM+ S G+ FL + QVP +V+W EKG
Sbjct: 66 HTFSLGMNEYGDLTQHEY-AAMSGYKMAKSSV-----GSSFLEPENLQVPKTVDWREKGY 119
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
VTPVK QGQC + ++EG K RL S+SEQ LVDC+ ++ N GC GG MD+
Sbjct: 120 VTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDN 179
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN- 246
AF YI +N GI ++ Y YE + G C K D + + D+P DE +L AVA+
Sbjct: 180 AFTYIKKNMGIDSEKSYPYEAVD-GEC-RYKKSDSVTTDSGFVDIPHGDETALRTAVASV 237
Query: 247 QPVSVAIDAS--ALQFYSGGVFN-GYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PVSVAIDAS + QFY GV+ C T L+HGV VGYG E G YWL+KNSWG
Sbjct: 238 GPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYGV-ENGQDYWLVKNSWGAS 296
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GY +L R+ QCGIA AS+P+
Sbjct: 297 WGEAGYIKLARNHGN---QCGIASQASYPL 323
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 135/360 (37%), Positives = 197/360 (54%), Gaps = 34/360 (9%)
Query: 8 VVLIISGSCASQATYRTFDEGS---IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
++ ++S + A Q +Y D S + F++W ++G+ Y E ++R +IF+ NL
Sbjct: 14 IICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQY 73
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG-----FKMSDHSSSLKANGTPFLYK 119
+ N + N S+ L LNKFADLT +EF G ++ + A P L +
Sbjct: 74 IHAHNKNS--NSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQ 131
Query: 120 S-------SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSE 165
+ + S++W +KGAVT VK Q QC A+EG+N I +LVSLSE
Sbjct: 132 TVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSE 191
Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
Q+LV C + N GC GG MD AF ++IQN GI + YSY G+ + C++ K
Sbjct: 192 QELVAC--DATNYGCEGGDMDYAFTWVIQNGGIDTEKDYSYTGVDS-TCNTNKEAKKIVS 248
Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCE---TFLNHGVTA 280
I Y DV P D+ +LL A +QPVSV ID SA+ Q Y+GG+++G C ++H V
Sbjct: 249 IDGYTDVSP-DDSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDHAVLV 307
Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
VGY +++ G YW++KNSWG DWG +GYF + R+ + P G C I AS+P ES+ S
Sbjct: 308 VGY-SAKNGKDYWIVKNSWGTDWGLEGYFYILRNTELPYGVCAINAMASYPTKTESSVQS 366
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 191/342 (55%), Gaps = 25/342 (7%)
Query: 8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
+VL+++ + A QA +F E + E++ +K Q+ + Y+ E R +IF DN V +
Sbjct: 4 LVLLVTIAVACQAV--SFSE-LVQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAK 60
Query: 68 FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQV 123
N G Y L +NK+ DL EF+ GF + L+ + T +
Sbjct: 61 HNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDI 120
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +V+W ++GAVTPVK QG C A A+EG + + +LVSLSEQ LVDC++
Sbjct: 121 PDTVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFG 180
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
NNGC GG MD+AF+YI N GI +A Y Y G A++ A + D+P D
Sbjct: 181 NNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKF--RYSAKNRGATDKGFVDIPSGD 238
Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEE-GI 290
E+ L AVA P+S+AIDAS + Q YS GV+ + C T L+HGV VGYGT E+ G+
Sbjct: 239 EDKLKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGM 298
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG WG DGY ++ R+ D QCG+A AS+P+
Sbjct: 299 DYWLVKNSWGDTWGLDGYIKMARNQDN---QCGVATQASYPL 337
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 113/221 (51%), Positives = 145/221 (65%), Gaps = 14/221 (6%)
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P S++W E GAV PVK QG C VAAVEGIN I L+SLSEQQLVDC T
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA- 61
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N+GC GG+M+ AF++I+ N GI ++ Y Y G GIC+S I +YE+VP +
Sbjct: 62 -NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQD-GICNS-TVNAPVVSIDSYENVPSH 118
Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
+E+SL KAVANQPVSV +DA+ Q Y G+F G C NH +T VGYGT E +W
Sbjct: 119 NEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT-ENDKDFW 177
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
++KNSWG++WGE GY R +R+I+ P G+CGI FAS+PV K
Sbjct: 178 IVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVKK 218
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 184/344 (53%), Gaps = 25/344 (7%)
Query: 5 FLIVVLIISGSCASQATYR---TFDEGSIAEK----FEQWKAQYGRTYKESAENSKRFEI 57
FL LII +S Y + D+ + E+ F+ W ++ + Y+ E RFEI
Sbjct: 12 FLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEI 71
Query: 58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
F+DNL+ ++ N N SY L LN FADL+ EF GF D + + F
Sbjct: 72 FRDNLMYIDETNKK---NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFT 128
Query: 118 YKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
YK + P S++W KGAVTPVK QG C +A VEGIN I L+ LSEQ+LV
Sbjct: 129 YKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELV 188
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC + ++ GC GG+ + +Y+ N G+ VY + C + +IT Y
Sbjct: 189 DC--DKHSYGCKGGYQTTSLQYVANN-GVHTSKVYPCQAKQYK-CRATDKPGPKVKITGY 244
Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
+ VP N E S L A+ANQP+S ++A Q Y GVF+G C T L+H VTAVGYGTS+
Sbjct: 245 KRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD 304
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G Y +IKNSWG +WGE GY RL+R QG CG+ + +P
Sbjct: 305 -GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 186/325 (57%), Gaps = 31/325 (9%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
I E+++ +K ++ + Y++ E R +IF +N + + N A G S+ + LNK+AD+
Sbjct: 24 IKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADM 83
Query: 89 TPQEFIASQTGFKMSDH----SSSLKANGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQ 143
EF + GF + H +S G F+ + ++P SV+W KGAVT VK QG
Sbjct: 84 LHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGH 143
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG + K L+SLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 144 CGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 203
Query: 197 GITNDAVYSYEGMSTGICD----SIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSV 251
GI + Y YEG+ C +I A D + D+P DE+ L +AVA PVSV
Sbjct: 204 GIDTEKSYPYEGIDDS-CHFNKGTIGATDRG-----FTDIPQGDEKKLAQAVATIGPVSV 257
Query: 252 AIDAS--ALQFYSGGVFN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
AIDAS + QFYS GV++ C+ L+HGV VGYGT E G YWL+KNSWG WG+ G
Sbjct: 258 AIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKG 317
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
+ ++ R+ D QCGIA +S+P+
Sbjct: 318 FIKMARNDDN---QCGIATASSYPL 339
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 181/317 (57%), Gaps = 19/317 (5%)
Query: 30 IAEKFEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
++ ++ W A++G+ S + RFE FK+N +E N A G SY L LN+F+DL
Sbjct: 9 LSGEYASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRA--GKHSYRLGLNQFSDL 66
Query: 89 TPQEFIASQTGFKMSDHSSSL----KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
T +EF G + S + + + +++ +P SV+W + GAVT K QG C
Sbjct: 67 TSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPKDQGSC 126
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A+EGIN I +LVSLSEQ+L+DC + GC GG M++A+++I++N G
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLVSLSEQELIDC-DKKADKGCDGGLMENAYQFIVENGG 185
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
+ + Y Y S C+ K I Y+ +P DE++LL AVA QPVSVAI+ ++
Sbjct: 186 LDTETDYPYHA-SESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGAS 244
Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
Q Y+ GVF G+C +NHGV VGYGT E+G+ YW++KNSW WG+ G+ ++QR+
Sbjct: 245 KDFQHYASGVFTGHCGEEINHGVLIVGYGT-EDGLDYWIVKNSWAATWGDGGFVKMQRNT 303
Query: 316 DQPQGQCGIAMFASFPV 332
+ G C I AS+PV
Sbjct: 304 GKRGGLCSINTLASYPV 320
>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
Length = 337
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 126/311 (40%), Positives = 177/311 (56%), Gaps = 21/311 (6%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
+E WK +G+TY E+ +R E+++ NL+ + + N A++G ++Y L +N DLT +E
Sbjct: 35 WELWKKSHGKTYPNEVEDVRRRELWERNLMLITKHNLEASMGLQTYDLSMNHMGDLTTEE 94
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
+ S + + ++ PF+ + VP SV+W +G VT VK QG C A
Sbjct: 95 IMQS---YATLTPPADIQRAPAPFVGSGADVPVSVDWRLQGCVTSVKMQGSCGSCWAFSA 151
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
A+EG A +LV LS Q LVDC+ N GC GGFMD AF+Y+I NKGI ++A Y
Sbjct: 152 AGALEGQLAKTTGKLVDLSPQNLVDCSLKYGNKGCNGGFMDRAFQYVIDNKGIDSEASYP 211
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYS 262
Y G S AA + Y +P DE +L A+A P+SVAIDA+ FY
Sbjct: 212 YRGQLQQC--SYNPSYRAANCSRYSFLPEGDEGALKNALATIGPISVAIDATRPTFAFYR 269
Query: 263 GGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
GV+N C +NHGV AVGYGT E G YWL+KNSWG +G+ GY R+ R+ + Q
Sbjct: 270 SGVYNDPTCTQRVNHGVLAVGYGT-ESGQDYWLVKNSWGTSFGDKGYIRMSRNKND---Q 325
Query: 322 CGIAMFASFPV 332
CGIA++ S+P+
Sbjct: 326 CGIALYCSYPI 336
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 191/339 (56%), Gaps = 35/339 (10%)
Query: 1 MAKYFLIVVLIISGSCA-----SQATYRTFDEGSIAEKFEQWKAQYGRTYKES-AENSKR 54
M L+++ ++ S A + R+ +E + F+ W +++G+TY + + +R
Sbjct: 9 MITLSLLIIFLLPPSSAMDLSVTSGGLRSNEE--VGFIFQTWMSKHGKTYTNALGDKEQR 66
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
F+ FKDNL +++ N N SY L L +FADLT QE+ +G + + L+
Sbjct: 67 FQNFKDNLRFIDQHNAK---NLSYRLGLTQFADLTVQEYQDLFSGRPIQKQKA-LRVTHR 122
Query: 115 PFLYKSSQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATN 174
Q+P SV+W +KGAV+ +K QG+C V E IN I L+SLSEQ+LVDC+ +
Sbjct: 123 YVPLAEDQLPQSVDWRQKGAVSEIKDQGRCTV---ESINKIVTGELISLSEQELVDCSID 179
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK-AEDHAAQITNYEDVP 233
N+GC GG MD AF+++I N G+ + Y Y+ + G C+ + +I YEDVP
Sbjct: 180 --NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQ-GYCNHNQNTSKKVIKIDGYEDVP 236
Query: 234 PNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
N+E SL KAVA+QP G++ G C T L+H V VGYGT E G YW
Sbjct: 237 ANNENSLQKAVAHQP---------------GIYTGPCGTDLDHAVVIVGYGT-ENGQDYW 280
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+++NSWG WGE GY ++ R+ + P G CGIAM AS+P+
Sbjct: 281 IVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPI 319
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 137/345 (39%), Positives = 185/345 (53%), Gaps = 27/345 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL VLI + + ++ IAE++E +K Q+ + Y E R ++F DN
Sbjct: 6 FLCCVLIYHSNSVTAVSFNDL----IAEEWELFKTQFSKAYNTEIEEKFRMKVFMDNKHK 61
Query: 65 VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFL-YKSS 121
+ R N G SY L +N F DL EF+ + G++ S + + + F+ +
Sbjct: 62 IARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAYNV 121
Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
VP SV+W +GAVT VK QGQC ++EG + +L SLSEQ L+DC+
Sbjct: 122 TVPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGK 181
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
NNGC GG MD+AF YI NKGI + Y YEG+ K ++ A + D+P
Sbjct: 182 YGNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKC--RYKPQESGATDKGFVDIPQ 239
Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN----GYCETFLNHGVTAVGYGTSE 287
DEE L AVA P+SVAIDAS + QFY GV+ G E L+HGV AVGYGT E
Sbjct: 240 GDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGT-E 298
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G YWL+KNSWG+ WG DGY ++ R+ CGIA AS+P+
Sbjct: 299 NGKDYWLVKNSWGKRWGLDGYIKMARN---KHNHCGIATSASYPL 340
>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 333
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 189/340 (55%), Gaps = 25/340 (7%)
Query: 8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
+ L ++ C A+ + S+ ++ QW++ Y + Y + E+ +R +++ N+ +ER
Sbjct: 3 LSLFLAALCLGIASAAPKFDQSLDAQWNQWRSTYKKVYAVNEEDWRR-AVWEKNMKMIER 61
Query: 68 FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
N + G +T+ +N F D T +EF GF+ H K P +P S
Sbjct: 62 HNQEYSQGKHGFTMAMNAFGDKTNEEFRQLMNGFQSQKHKKG-KLFYEPVF---GHIPTS 117
Query: 127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
V+W +KG VTPVK QGQC A A+EG K +LVSLSEQ LVDC+ + N G
Sbjct: 118 VDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWREGNEG 177
Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
C GG MD+AF+Y+ N G+ ++ Y Y T C + AA T + D+PP E++
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDSEESYPYTATDTQDC-RYNPKYSAANDTGFVDIPPQ-EKA 235
Query: 240 LLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETFLNHGVTAVGY---GTSEEGIKY 292
L+KAVA P+SVAIDA + QFYS G+ F+ C +NHGV AVGY GT + KY
Sbjct: 236 LMKAVATVGPISVAIDAGQVSFQFYSSGIYFDPACRLTVNHGVLAVGYGFEGTDPDKNKY 295
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WL+KNSWG+ WG DGY ++ +D + CGIA AS+P
Sbjct: 296 WLVKNSWGKSWGADGYIKIAKDRNN---HCGIARAASYPT 332
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 126/325 (38%), Positives = 182/325 (56%), Gaps = 34/325 (10%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+ E+W A++GR Y ++ E ++R E+F N V+ N A GNR+YTL LNKF+DLT E
Sbjct: 38 RHEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRA--GNRTYTLGLNKFSDLTDDE 95
Query: 93 FIASQTGFK---------MSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
F+ + G++ ++ S + A G Y + +P SV+W +GAVT VK QG
Sbjct: 96 FVQTHLGYRGHQQGGLRPEEENVSKVAALG----YGQADMPESVDWRAQGAVTGVKNQGS 151
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATND----NNNGCYGGFMDDAFKYI 192
C AVAA EG+ I L+S+SEQQ++DC N N C GG +DDA +Y+
Sbjct: 152 CGCCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYV 211
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA-VANQPVSV 251
++G+ +A Y+Y G+ G C S + AA + V +E L+ VA QP++V
Sbjct: 212 AASRGLQPEAAYAYTGLQ-GACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAV 270
Query: 252 AIDASA-LQFYSGGVFNG---YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
+++AS + Y GVF C LNH VT VGYG+++ G +YWL+KN WG WGE G
Sbjct: 271 SVEASDDFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGG 330
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
Y R+ R P CGI+ +A +P
Sbjct: 331 YMRIARGNGAP--NCGISAYAYYPT 353
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 176/312 (56%), Gaps = 21/312 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFE--IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+F W +G T+ ++ E ++R E I D + NA G TL N F+ ++
Sbjct: 26 HEFSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTG---VTLGHNAFSHMS 82
Query: 90 PQEFIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
EF TG + + + A+ L+ +VP +V+W++KG VTPVK QG C
Sbjct: 83 FDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCW 142
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AVEG + +L SLSEQ+LVDC N + GC GG MD AF++I + GI ++
Sbjct: 143 AFSTTGAVEGATFVSSGKLPSLSEQELVDCDHN-GDMGCNGGLMDHAFQWIEDHGGICSE 201
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
Y Y+ + +C + D ++T ++DV P DE +L AVA QPVSVAI+A A Q
Sbjct: 202 DDYEYKAKAQ-VC---RECDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 257
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GVFN C T L+HGV AVGYG ++ G K+W +KNSWG WGE GY RL R+ + P
Sbjct: 258 FYKSGVFNLTCGTRLDHGVLAVGYG-NDNGHKFWKVKNSWGASWGEQGYIRLAREENGPA 316
Query: 320 GQCGIAMFASFP 331
GQCGIA S+P
Sbjct: 317 GQCGIASVPSYP 328
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 191/344 (55%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L ++ ++ + +SQ RT ++E +K + +TY+ E RF+IF +N + +
Sbjct: 7 LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+ N A G SY L +N+F DL EF G H + K G+ FL
Sbjct: 59 AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSSFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI N GI + Y YE + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 133/339 (39%), Positives = 184/339 (54%), Gaps = 39/339 (11%)
Query: 29 SIAEKFEQWKAQYG--RTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKF 85
++A FE+W +++G R +++ E +KR F +N V N AIG S+ + LN
Sbjct: 93 ALARHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSL 152
Query: 86 ADLTPQEFIASQTGFKMSDHSSS----LKANGT--------PFLYKSSQVPPSVNWIEKG 133
A T +E+ A G+K SS L+A T + Y S P +++W+E G
Sbjct: 153 AATTREEYRA-LLGYKPELRSSGDAEMLEATSTDKVEQYKASWEYASVDPPEAIDWVELG 211
Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AVTP K QGQC AVEGI I+ RLVSLSEQ++V C+ N GC GG MD
Sbjct: 212 AVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCS--KQNMGCNGGLMD 269
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
AF++I++N GI ++ Y Y + C+ K + H A I ++DVPP DE+ L KAV+
Sbjct: 270 YAFRWIVKNGGIDSEFQYPYSAEALA-CNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQ 328
Query: 247 QPVSVAI--DASALQFYSGGVFNGY-CETFLNHGVTAVGYG---TSEEGIK-------YW 293
QPVS+AI D + Q Y GGV++ C + ++HGV VGYG T K +W
Sbjct: 329 QPVSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFW 388
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+KNSWG WGE G+ R+ R I GQCGI S+P
Sbjct: 389 KVKNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYPT 427
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 137/332 (41%), Positives = 187/332 (56%), Gaps = 30/332 (9%)
Query: 16 CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
C A FD +++ WKA++G++Y+ E R ++ N ++ N A G
Sbjct: 7 CTLIAAVAAFD---FSKELRAWKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHA-GV 62
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL--YKSSQVPPSVNWIEKG 133
YTL++N+F DL EF + G++MS+ G PF+ + +P SV+W +KG
Sbjct: 63 FGYTLKMNQFGDLENSEFKSLYNGYRMSN----APRKGKPFVPAARVQDLPASVDWSKKG 118
Query: 134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
VTPVK QGQC A ++EG + L+SLSEQ LVDC+ + N+GC GG MD
Sbjct: 119 WVTPVKNQGQCGSCWSFSATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMD 178
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
DAF+Y+I+N GI +A Y Y + + C D A I+ Y DV + E L AVA
Sbjct: 179 DAFEYVIKNNGIDTEASYPYRAVDS-TC-KFNTADVGATISGYVDVTKDSESDLQVAVAT 236
Query: 247 -QPVSVAIDAS--ALQFYSGGVFNGYC--ETFLNHGVTAVGYGTSEEGIK-YWLIKNSWG 300
PVSVAIDAS + QFYS GV++ T L+HGV AVGYGT +G K YWL+KNSWG
Sbjct: 237 IGPVSVAIDASHISFQFYSSGVYDPLICSSTNLDHGVLAVGYGT--DGSKDYWLVKNSWG 294
Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WG GY + R+ + +CGIA AS+PV
Sbjct: 295 ASWGMSGYIEMVRNHNN---KCGIATSASYPV 323
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 187/318 (58%), Gaps = 24/318 (7%)
Query: 37 WKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIA 95
+K ++ ++YK E RF++F N +E+ N G S+ L LNKFAD+T EF
Sbjct: 46 FKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQ 105
Query: 96 SQTGFKMSDH-----SSSLKANGTPF-LYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
GFK+ S LK +G F + + +P SV+W ++G VT VK QG C
Sbjct: 106 RMNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWA 165
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A ++EG + + +LVSLSEQ LVDC N ++ GC GG+MD AF+Y+ NKGI +A
Sbjct: 166 FSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEA 225
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASA--LQ 259
Y Y+G G C K+ED A T + D+P +E L A+A PVSVAIDA++ Q
Sbjct: 226 SYPYKGRD-GRC-RFKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQ 283
Query: 260 FYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
FYS GV ++ C +L+HGV AVGY ++++G +Y+++KNSW +DWG+DGY + R +
Sbjct: 284 FYSHGVYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSR---R 340
Query: 318 PQGQCGIAMFASFPVSKE 335
CGIA AS+P ++
Sbjct: 341 KNNNCGIATMASYPFVQQ 358
>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
Length = 276
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 126/286 (44%), Positives = 170/286 (59%), Gaps = 41/286 (14%)
Query: 59 KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
+DN+ VE FN A N + L +N+FADLT +EF A++ GFK +S+ K T F Y
Sbjct: 19 RDNVAFVESFN--ANKNNKFWLGVNQFADLTTEEFKANK-GFK---PTSAEKVPTTGFKY 72
Query: 119 KS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
++ S +P +V+W KGAVTP+K QGQC AVAA+EGI + L+SLS+Q+L
Sbjct: 73 ENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQEL 132
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC T+ + GC + Y+ + G C AA I
Sbjct: 133 VDCDTHSMDEGC--------------------EVQLPYKAVD-GKCKG--GSKSAATIKG 169
Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTS 286
+EDVP N+E +L+KAVANQPVSVA+DAS F YSGGV G C T L+HG+ A+GYG
Sbjct: 170 HEDVPVNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGME 229
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYW++KNSWG WGE G+ R+++DI +G CG+AM S+P
Sbjct: 230 SDGTKYWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 275
>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 136/349 (38%), Positives = 193/349 (55%), Gaps = 32/349 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M Y + L + + A R D ++ QWKAQ+G++Y + E+S R ++
Sbjct: 1 MNFYLCLASLCLGLAAAIPPFDRALDS-----QWHQWKAQHGKSYAAN-EDSWRRATWEK 54
Query: 61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL +ER N + G S+ LR+NKF D++ +EF G+K + K + LY+
Sbjct: 55 NLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQVMNGYKSNGSQKRTKGS----LYR 110
Query: 120 SS---QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
S Q+P SV+W EKG VTPVK Q C A A+EG K +LVSLS Q LV
Sbjct: 111 ESLLAQLPESVDWREKGYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTGKLVSLSVQNLV 170
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC+ + NNGC GG M +AF+Y+ N GI + Y Y + E A +T +
Sbjct: 171 DCSIPEGNNGCDGGLMGNAFQYVQDNGGIDTEECYPYVAQDNEC--KYQPECSGANVTGF 228
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG 284
+P DE +L+KAVAN P+SVAIDA + +FY GV ++ C + LNHGV VGYG
Sbjct: 229 VKIPSTDERALMKAVANVGPISVAIDAGNPSFKFYQSGVYYDPQCSSSQLNHGVLVVGYG 288
Query: 285 T-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ + G KYW++KNSWG++WG++GY + +D D CGI AS+P+
Sbjct: 289 SEGKNGRKYWIVKNSWGENWGDNGYVLMAKDEDN---HCGIITDASYPI 334
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 179/319 (56%), Gaps = 20/319 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
+ S KF W ++ E RFE+F N +E N A + S+T+ N+++
Sbjct: 21 DASYEAKFLSWMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDA--SSSFTMGHNEYS 77
Query: 87 DLTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
LT EF +TG ++S S + A P + + VP ++W+E+G VTPVK QG
Sbjct: 78 HLTFDEFKKLRTGLRVSPSYIQSRAKYALMAPAV-NMTDVPNEMDWVEQGGVTPVKNQGM 136
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A+EG + +LVS+SEQ+LVDC N + GC GG MD+AFK++ +K
Sbjct: 137 CGSCWAFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHN-GDMGCNGGLMDNAFKWVKTHK 195
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
G+ + Y Y G C ++K ++T + DVP NDE++L AVA QPVSVAI+A
Sbjct: 196 GLCKEEDYPYHA-KEGTC-ALKKCKPVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEAD 253
Query: 257 --ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
QFY GVF+ C T L+HGV VGYG E G KYW +KNSWG DWG+ GY +L R+
Sbjct: 254 QPEFQFYKSGVFDKSCGTKLDHGVLVVGYG-EEGGKKYWKVKNSWGADWGDKGYIKLARE 312
Query: 315 IDQPQGQCGIAMFASFPVS 333
GQCG+AM S+P +
Sbjct: 313 FGPETGQCGVAMVPSYPTA 331
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 128/267 (47%), Positives = 165/267 (61%), Gaps = 22/267 (8%)
Query: 80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLYKSSQVPPSVNWIEKGAVT 136
+ LN+FAD+T EF+A TG + + A G L + +V+W +KGAVT
Sbjct: 1 MELNEFADMTNDEFMAMYTGLRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVT 60
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
+K Q QC AVAAVEGI+ I LVSLSEQQ++DC T D NNGC GG++D+AF
Sbjct: 61 GIKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDT-DGNNGCNGGYIDNAF 119
Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
+YI+ N G+ + Y Y + +C S++ A I+ Y+DVP DE +L AVANQPV
Sbjct: 120 QYIVGNGGLATEDAYPYTA-AQAMCQSVQP---VAAISGYQDVPSGDEAALAAAVANQPV 175
Query: 250 SVAIDASALQFYSGGVFNGY-CET--FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
SVAIDA Q Y GGV C T LNH VTAVGYGT+E+G YWL+KN WGQ+WGE
Sbjct: 176 SVAIDAHNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEG 235
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
GY RL+R + CG+A AS+PV+
Sbjct: 236 GYLRLERGAN----ACGVAQQASYPVA 258
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 138/345 (40%), Positives = 192/345 (55%), Gaps = 36/345 (10%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
L ++ ++ + S RT ++E +K + ++Y+ E RF+IF +N +
Sbjct: 6 LLCAIVAVTVAANSHEILRT--------QWEAFKTTHKKSYESHMEELLRFKIFTENSLI 57
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YK 119
+ + N A G SY L +N+F DL EF G++ + G+ F+
Sbjct: 58 IAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR-----GQRTSRGSTFMPPANVN 112
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
S +P +V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 113 DSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCS 172
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ NNGC GG MD+AFKYI N GI + Y YE M K ED A T + D+
Sbjct: 173 QSFGNNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKC--RFKKEDVGATDTGFVDI 230
Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSE 287
E+ L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV AVGYG +
Sbjct: 231 EGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYGV-K 289
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYWL+KNSWG WG++GY + RD + QCGIA AS+P+
Sbjct: 290 DGKKYWLVKNSWGGSWGDNGYILMSRDKNN---QCGIASAASYPL 331
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 181/321 (56%), Gaps = 25/321 (7%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
FDE ++++ WK + + Y E R I++DNL +++ N S+TL +N
Sbjct: 21 FDEDE--QQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEG---HSFTLAMNH 75
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
DLT EF TG + S +S+ K G+ FL S QVP +V+W ++G VTPVK QGQ
Sbjct: 76 LGDLTQDEFRYFYTGMR-SHYSNYTKKQGSAFLAPSHVQVPDTVDWRKEGYVTPVKNQGQ 134
Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C ++EG N K +LVSLSEQ LVDC+T NNGC GG MD AFKYI +N
Sbjct: 135 CGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDYAFKYIKENG 194
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
GI + Y YE + + + A T + DV DEE+L A P+SVAIDA
Sbjct: 195 GIDTEESYPYEARNDRC--RFQKSNIGAVDTGFVDVTHGDEEALKTAAGTVGPISVAIDA 252
Query: 256 SAL--QFYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+ QFY GV+N G T L+HGV VGYGT +G YWL+KNSWG+ WG +GY +
Sbjct: 253 GHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTY-QGSDYWLVKNSWGERWGMEGYIMM 311
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
R+ + QCG+A AS+P+
Sbjct: 312 SRNKNN---QCGVATQASYPL 329
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 137/341 (40%), Positives = 184/341 (53%), Gaps = 27/341 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL + + + +SQ RT ++E +K+Q+ + Y E RF+IF +N +
Sbjct: 6 FLCGCVAAAIAASSQEILRT--------EWEAFKSQHNKAYSSHVEELLRFKIFTENTLL 57
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
V + N A G SY L +NKF DL P EF G++ + P S +
Sbjct: 58 VAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYRGKQNKEQRPTFIPPANLNDSSL 117
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +V+W +KGAVTPVK QGQC ++EG + K +LVSLSEQ LVDC+ +
Sbjct: 118 PTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFG 177
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG MD+ F+YI N GI + + Y G C K D A + D+
Sbjct: 178 NQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQD-GDC-KFKKADVGATDAGFVDIQQGS 235
Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIK 291
E+ L KAVA PVSVAIDAS + Q YS GV++ + L+HGV VGYG + G K
Sbjct: 236 EDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGV-KNGKK 294
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG DWG++GY + RD D QCGIA AS+P+
Sbjct: 295 YWLVKNSWGGDWGDNGYILMSRDKDN---QCGIASSASYPL 332
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 190/344 (55%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L + ++ + +SQ RT ++E +K + +TY+ E RF+IF +N + +
Sbjct: 7 LCAIAAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+ N A G SY L +N+F DL EF G H + K G+ FL
Sbjct: 59 AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSSFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI N GI + Y YE + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 181/314 (57%), Gaps = 28/314 (8%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E + QWK + + Y E + R+ I+KDN + N + + L++N+F D+T
Sbjct: 25 ESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHN---LKGGDFILKMNQFGDMTNS 81
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA----- 145
EF A + + S NG+ FL ++ V P +V+W +G VTPVK QGQC
Sbjct: 82 EFKA------FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAF 135
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
++EG + K +LVSLSEQ LVDC+T NNGC GG MD+AF YI +NKGI ++A
Sbjct: 136 STTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEAS 195
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQF 260
Y Y G C K AA T + D+P +E L +AVA+ P+SVAIDAS + QF
Sbjct: 196 YPYTA-EDGKC-VFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQF 253
Query: 261 YSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
YS GV+N T L+HGV VGYGT E G YWL+KNSW WG+ GY +++R+
Sbjct: 254 YSSGVYNEPSCSSTELDHGVLVVGYGT-ESGKDYWLVKNSWNTSWGDKGYIKMRRN---A 309
Query: 319 QGQCGIAMFASFPV 332
+ QCGIA AS+P+
Sbjct: 310 KNQCGIATKASYPL 323
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 177/320 (55%), Gaps = 27/320 (8%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNRSYTLRLNK 84
S+ +++ +KA++GR Y E R +F+ N ++ RF N + ++TL++N+
Sbjct: 19 SLRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEV---TFTLQMNQ 75
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
F D+T +EF A+ GF + S + +P V+W KGAVTPVK Q QC
Sbjct: 76 FGDMTSEEFTATMNGFL---NVPSRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQKQC 132
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
++EG + +K +LVSLSEQ LVDC+ N GC GG MD AF+YI NKG
Sbjct: 133 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKG 192
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
I + Y YE G C A + A T Y DV E +L KAVA P+SVAIDAS
Sbjct: 193 IDTEDSYPYEAQD-GKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDAS 250
Query: 257 --ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
+ QFY GV+ G T L+HGV AVGYG +E+G YWL+KNSW WG GY ++
Sbjct: 251 QPSFQFYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMS 310
Query: 313 RDIDQPQGQCGIAMFASFPV 332
RD + CGIA AS+P+
Sbjct: 311 RD---KKNNCGIASQASYPL 327
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 132/350 (37%), Positives = 187/350 (53%), Gaps = 29/350 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M L+ V++ +G C++ + F+ WK + + Y+ E ++ + +
Sbjct: 1 MKVTVLLAVVLFAGCCSAMQLNQQH-----VSLFQTWKNLWKKVYQTVEEEEQKMATWFN 55
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N + N ++ +SY L +N++ DLT +EF + G++ G+ +L
Sbjct: 56 NWNKISEHNMQYSLKQKSYRLEMNEYGDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNL 115
Query: 120 SS-----QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
S Q+P V+W + G VTPVK QGQC A ++EG + K +LVSLSEQ
Sbjct: 116 LSFGSQIQLPTLVDWRKHGLVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQN 175
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
L+DC+T + N+GC GG MD AFKYI GI +A Y YE C D A T
Sbjct: 176 LIDCSTPEGNDGCNGGLMDQAFKYIKIQGGIDTEAYYPYEAKDD-TC-RFNITDSGATDT 233
Query: 228 NYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVG 282
+ D+ DEE L +A A P+SVAIDAS + QFYS GV++ T L+HGV VG
Sbjct: 234 GFVDIKSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYSETACSSTMLDHGVLVVG 293
Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YGT E G YWL+KNSWG+ WGE GY ++ R+ D QCGIA AS+P+
Sbjct: 294 YGT-ENGKDYWLVKNSWGEGWGEAGYIKMSRNADN---QCGIATQASYPL 339
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 184/318 (57%), Gaps = 21/318 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ ++ +KA++G++Y E R +I+ +N + + N A G Y++ +N+F D+
Sbjct: 23 LGAEWSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDM 82
Query: 89 TPQEFIASQTGFKMS--DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
EF++++ GFK + D P + +P +V+W KGAVTPVK QGQC
Sbjct: 83 LHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGS 142
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
A ++EG + K +VSLSEQ LVDC+T+ NNGC GG MD+AFKYI NKGI
Sbjct: 143 CWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGID 202
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
+ Y Y G + G C K A + + D+ E L KAVA P+SVAIDAS
Sbjct: 203 TEKSYPYNG-TDGTC-HFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHE 260
Query: 257 ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
+ QFYS GV++ C++ L+HGV VGYGT G YWL+KNSWG WG++GY R+ R+
Sbjct: 261 SFQFYSDGVYDEPECDSESLDHGVLVVGYGTL-NGTDYWLVKNSWGTTWGDEGYIRMSRN 319
Query: 315 IDQPQGQCGIAMFASFPV 332
+ QCGIA AS+P+
Sbjct: 320 ---KKNQCGIASSASYPL 334
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 186/340 (54%), Gaps = 33/340 (9%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FLIV I + SQ Y+T F+ W ++ ++Y E R+ IF+DN+
Sbjct: 11 FLIVNCISAARVFSQKQYQT--------AFQNWMVKHQKSYTND-EFGSRYTIFQDNMDF 61
Query: 65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
V ++N L LN ADLT QE+ G K + +L T S+ P
Sbjct: 62 VTKWNQKG---SDTILGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIGVTDV----SKAP 114
Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
SV+W GAVT VK QGQC +VEGI+ I +LVSLSEQQ++DC+ ++ N
Sbjct: 115 ASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSGSEGN 174
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
NGC GG M ++F+YII G+ +A Y YEG+ G C KA + A IT Y++V E
Sbjct: 175 NGCDGGLMTNSFEYIIAVGGLDTEASYPYEGV-VGKCKFNKA-NIGATITGYKNVKSGSE 232
Query: 238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYW 293
L AVA QPVSVAIDAS + Q YS GV+ T L+HGV AVGYG S+ G YW
Sbjct: 233 SDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYG-SQSGQDYW 291
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
++KNSWG DWGE G+ + R+ CGIA AS+P +
Sbjct: 292 IVKNSWGADWGEKGFILMARN---KHNNCGIATMASYPTA 328
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 180/312 (57%), Gaps = 21/312 (6%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN--NAAIGNRSYTLRLNKFADLT 89
+F W + +G T+ ++ E ++R E + N + + N NA G + L N F+ ++
Sbjct: 26 HEFSAWMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVK---LGHNAFSHMS 82
Query: 90 PQEFIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
EF TG + + + A+ L+ +VP +V+W++KG VTPVK QG C
Sbjct: 83 FDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCW 142
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AVEG + +L+SLSEQ+LVDC N + GC GG MD AF++I + GI ++
Sbjct: 143 AFSTTGAVEGATFVSSGKLLSLSEQELVDCDHN-GDMGCNGGLMDHAFQWIEDHGGICSE 201
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
Y Y+ + +C + D ++T ++DV P DE +L AVA QPVSVAI+A A Q
Sbjct: 202 DDYEYKAKAQ-VC---RKCDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 257
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
FY GVFN C T L+HGV AVGYG ++ G K+W +KNSWG WGE GY RL R+ + P
Sbjct: 258 FYKSGVFNLTCGTRLDHGVLAVGYG-NDNGQKFWKVKNSWGASWGEQGYIRLAREENGPA 316
Query: 320 GQCGIAMFASFP 331
GQCGIA S+P
Sbjct: 317 GQCGIASVPSYP 328
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 141/344 (40%), Positives = 193/344 (56%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L ++ ++ + +SQ RT ++E +K + +TY+ E RF+IF +N + +
Sbjct: 7 LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+ N A G SY L +N+F DL EF G+ S K+ G+ FL
Sbjct: 59 AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYH-----GSRKSGGSTFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P +V+W +KGAVTPVK QGQC ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI N GI + Y YE + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E+ L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGCEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 133/321 (41%), Positives = 183/321 (57%), Gaps = 25/321 (7%)
Query: 30 IAEKFEQW---KAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKF 85
I E F +W K +G++Y+ EN E F N++ +E N +G +++ + LN+
Sbjct: 40 IDEAFNKWDDYKETFGKSYEPDEEND-YMEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEI 98
Query: 86 ADLTPQEFIASQTGFKMSDH-SSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQ 143
ADL ++ G++M SL++NGT FL + Q+P SV+W E+G VTPVK QG
Sbjct: 99 ADLPFSQY-RKLNGYRMRRQFGDSLQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGM 157
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG +A +LVSLSEQ LVDC+T N+GC GG MD AF+YI +N
Sbjct: 158 CGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENH 217
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
G+ + Y Y G T K A + D+P DEE+L KAVA Q P+S+AIDA
Sbjct: 218 GVDTEDSYPYVGRETKC--HFKRNAVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDA 275
Query: 256 S--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+ Q Y GV F+ C + L+HGV VGYGT E YWL+KNSWG WGE GY R+
Sbjct: 276 GHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEKGYIRI 335
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
R+ + CG+A AS+P+
Sbjct: 336 ARNRNN---HCGVATKASYPL 353
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 141/344 (40%), Positives = 191/344 (55%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L ++ ++ + +SQ RT ++E +K + ++Y+ E RF+IF +N + +
Sbjct: 7 LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKSYQSHMEELLRFKIFTENSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+ N A G SY L +N+F DL EF G H + K G+ FL
Sbjct: 59 AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSTFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI N GI + Y YE + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/330 (38%), Positives = 177/330 (53%), Gaps = 18/330 (5%)
Query: 15 SCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIG 74
S S + E I E F+ WK ++ + YK + E +R FK NL + N
Sbjct: 31 SAVSNDLHEGLTEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKS 90
Query: 75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA 134
+ + LNKFADL+ +EF + +++ ++ P S++W KG
Sbjct: 91 GLEHKVGLNKFADLSNEEF--REMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGV 148
Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
VT VK QG C A+E INAI L+SLSEQ+LVDC T NN GC GG MD
Sbjct: 149 VTAVKDQGDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTT-NNYGCEGGDMDS 207
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF+++I N GI +A Y Y G+ G C++ K E I Y DV P+D +LL A Q
Sbjct: 208 AFQWVIGNGGIDTEADYPYTGVD-GTCNTAKEEKKVVSIEGYVDVDPSDS-ALLCATVQQ 265
Query: 248 PVSVAIDASAL--QFYSGGVFNGYCE---TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
P+SV +D SAL Q Y+GG+++G C ++H + VGYG SE YW++KNSWG +
Sbjct: 266 PISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYG-SENDEDYWIVKNSWGTE 324
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WG +GYF ++R+ +P G C I AS+P
Sbjct: 325 WGMEGYFYIRRNTSKPYGVCAINADASYPT 354
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 140/353 (39%), Positives = 195/353 (55%), Gaps = 32/353 (9%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
A FLI++L G A+ F+ + E++ +K Q+ + Y E R +I+ N
Sbjct: 1 AMKFLILIL---GFVAAANAISIFE--LVKEEWTAFKLQHRKKYDSETEERIRMKIYVQN 55
Query: 62 LVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS------SLKANGT 114
+ + N +G + LR+NK+ADL +EF+ + GF S LK
Sbjct: 56 KHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEE 115
Query: 115 PFLY---KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLS 164
P + + VP +++W KGAVT VK QG C A A+EG + K +LVSLS
Sbjct: 116 PVTWIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLS 175
Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
EQ LVDC+ NNGC GG MD AF+YI NKGI + Y YE + + KA A
Sbjct: 176 EQNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDECHYNPKAV--GA 233
Query: 225 QITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVT 279
+ D+P +E++L+KA+A PVSVAIDAS + QFYS GV + C++ L+HGV
Sbjct: 234 TDKGFVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVL 293
Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
AVGYGT+E+G YWL+KNSWG WG+ GY ++ R+ D CGIA AS+P+
Sbjct: 294 AVGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRDN---HCGIATTASYPL 343
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 185/322 (57%), Gaps = 25/322 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
I E+++ +K ++ + Y + E R +IF +N + + N A G S+ + +NK+AD+
Sbjct: 23 IKEEWQTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQRYASGEVSFKMAVNKYADM 82
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFL------YKSSQVPPSVNWIEKGAVTPVKYQG 142
EF + GF + H L+A+ F+ + ++P SV+W KGAVT VK QG
Sbjct: 83 LHHEFHTTMNGFNYTLHKQ-LRASDPSFVGVTFISPEHVKIPKSVDWRSKGAVTEVKDQG 141
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C + A+EG + K L+SLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 142 HCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 201
Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAID 254
GI + Y YEG+ C KA A + D+P DE+ + +AVA PVSVAID
Sbjct: 202 GGIDTEKSYPYEGIDDS-CHFNKATIGATDRGSV-DIPQGDEKKMAEAVATIGPVSVAID 259
Query: 255 AS--ALQFYSGGVFN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
AS + QFYS G++N C+ L+HGV VGYGT E G YWL+KNSWG WG+ G+ +
Sbjct: 260 ASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTDESGQDYWLVKNSWGTTWGDKGFIK 319
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+ R+ D QCGIA +S+P+
Sbjct: 320 MARNADN---QCGIASASSYPL 338
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 181/314 (57%), Gaps = 28/314 (8%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
E + QWK + + Y E + R+ I+KDN + N + + L++N+F D+T
Sbjct: 25 ESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHN---LKGGDFLLKMNQFGDMTNS 81
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA----- 145
EF A + + S NG+ FL ++ V P +V+W +G VTPVK QGQC
Sbjct: 82 EFKA------FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAF 135
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
++EG + K +LVSLSEQ LVDC+T NNGC GG MD+AF YI +NKGI ++A
Sbjct: 136 STTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEAS 195
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQF 260
Y Y G C K AA T + D+P +E L +AVA+ P+SVAIDAS + QF
Sbjct: 196 YPYTA-EDGKC-VFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQF 253
Query: 261 YSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
YS GV+N T L+HGV VGYGT E G YWL+KNSW WG+ GY +++R+
Sbjct: 254 YSSGVYNEPSCSSTELDHGVLVVGYGT-ESGKDYWLVKNSWNTSWGDKGYIKMRRN---A 309
Query: 319 QGQCGIAMFASFPV 332
+ QCGIA AS+P+
Sbjct: 310 KNQCGIATKASYPL 323
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 134/349 (38%), Positives = 191/349 (54%), Gaps = 30/349 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M Y + L +SG A+ + + D+ +EQWK +G+ Y E E +R I++
Sbjct: 1 MWTYLALFTLCLSGVFAAPSLDKQLDD-----HWEQWKTWHGKNYHEKEEGWRRM-IWEK 54
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL ++ N ++G +Y L +N F D+ +EF G+K H + K G+ F+
Sbjct: 55 NLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYK---HKTERKFKGSLFMEP 111
Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
+ +VP ++W EKG VTPVK QG+C A+EG K +LVSLSEQ LVDC
Sbjct: 112 NFLEVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDC 171
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ + N GC GG MD AF+YI N G+ ++ Y Y G C + +AA T + D
Sbjct: 172 SRPEGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPC-HYDPKYNAANDTGFVD 230
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTS 286
+P E +L+KAVA+ PVSVAIDA + QFY G+ F C + L+HGV VGYG
Sbjct: 231 IPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFE 290
Query: 287 EE---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G KYW++KNSW + WG+ GY + +D + CGIA AS+P+
Sbjct: 291 GEDVDGKKYWIVKNSWSESWGDKGYIYMAKD---RKNHCGIATAASYPL 336
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/323 (41%), Positives = 192/323 (59%), Gaps = 28/323 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLN 83
FD+ ++ ++ QWKAQ+ RTY + E+ R ++ NL +E N + G S+ L +N
Sbjct: 21 FDQ-TLDSQWHQWKAQHRRTYAAN-EDGWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMN 78
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKY 140
KF D+T +EF G+ + + S + G+ LY+ +Q+P SV+W EKG VTPVK
Sbjct: 79 KFGDMTTEEFKQVMNGY--NSNGSQKRTKGS--LYREPLLAQLPKSVDWREKGYVTPVKN 134
Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
QGQC A ++EG K +LVSLSEQ LVDC+T++ NNGC GG MD+AF+Y+
Sbjct: 135 QGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNGCSGGLMDNAFEYVK 194
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVA 252
N GI + Y Y G +AE A +T + D+P +E +L+KAVAN P+SVA
Sbjct: 195 NNGGIDTEQAYPYLGQDNEC--KYRAECSGANVTGFVDIPSMNERALMKAVANVGPISVA 252
Query: 253 IDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
IDA + QFY GV + C + L+HGV VGYG+ + +YW++KNSWG++WG+ GY
Sbjct: 253 IDAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKD-EYWIVKNSWGEEWGKKGY 311
Query: 309 FRLQRDIDQPQGQCGIAMFASFP 331
+ + + CGIA AS+P
Sbjct: 312 VLMAKFRNN---HCGIATAASYP 331
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 183/321 (57%), Gaps = 25/321 (7%)
Query: 30 IAEKFEQW---KAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKF 85
I E F +W K +G++Y+ EN E F N++ +E N +G +++ + LN+
Sbjct: 41 IDEAFNKWDDYKETFGKSYEPEEEND-YMEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEI 99
Query: 86 ADLTPQEFIASQTGFKMSDH-SSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQ 143
ADL ++ G++M S+++NGT FL + Q+P SV+W E+G VTPVK QG
Sbjct: 100 ADLPFSQY-RKLNGYRMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGM 158
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG +A +LVSLSEQ LVDC+T N+GC GG MD AF+YI +N
Sbjct: 159 CGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENH 218
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
G+ + Y Y G T K A + D+P DEE+L KAVA Q P+S+AIDA
Sbjct: 219 GVDTEDSYPYVGRETKC--HFKRNTVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDA 276
Query: 256 S--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+ Q Y GV F+ C + L+HGV VGYGT E YWL+KNSWG WGE GY R+
Sbjct: 277 GHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEKGYIRI 336
Query: 312 QRDIDQPQGQCGIAMFASFPV 332
R+ + CG+A AS+P+
Sbjct: 337 ARNRNN---HCGVATKASYPL 354
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 132/347 (38%), Positives = 192/347 (55%), Gaps = 27/347 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
+ ++L++ A+ FD + E++ +K ++ + Y E R +I+ +N V
Sbjct: 1 MKILLVLCAVVAAGTAVSFFD--LVREEWNTFKLEHKKQYDSETEEKFRMKIYAENKHKV 58
Query: 66 ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGF-KMSDHSSSLKANGT-----PFLY 118
+ N G SY L+ NK++D+ EF+ + GF K H+ L A G F+
Sbjct: 59 AKHNQRYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVS 118
Query: 119 KSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVD 170
++ PP+V+W + GAVTPVK QG+C A+EG + K LVSLSEQ L+D
Sbjct: 119 PANVAAPPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLID 178
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C++ NNGC GG MD+AFKYI N GI + Y YE + ++ A+ +
Sbjct: 179 CSSAYGNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEAVDDKC--RYNPKNSGAEDVGFV 236
Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGT 285
D+P DE L+ A+A PVSVAIDAS + Q YS GV ++ C + L+HGV VGYGT
Sbjct: 237 DIPAGDEHKLMLALATVGPVSVAIDASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGT 296
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E+G YWL+KNSWG WG++GY ++ R+ D CGIA AS+P+
Sbjct: 297 DEDGGDYWLVKNSWGPSWGDEGYIKMARNRDN---HCGIASSASYPL 340
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 186/332 (56%), Gaps = 23/332 (6%)
Query: 15 SCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AI 73
+C + A T + + +E WK + + Y + E+++R +I++DNL V + N ++
Sbjct: 9 ACVAGALCFTIIDKGFDDTWEAWKQTHSKQYTKEEEDNRR-KIWEDNLQKVSKHNTEHSL 67
Query: 74 GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQVPPSVNWIEK 132
G SYTL +NK+ADL +EF+ G K +S + G FL Y Q P SV+W ++
Sbjct: 68 GLHSYTLGMNKYADLRGEEFVQMMNGLKFD---ASRERQGIKFLSYAKFQAPDSVDWRDE 124
Query: 133 GAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
G VTPVK QGQC ++EG + L SLSEQ LVDC+ + NNGC GG M
Sbjct: 125 GYVTPVKDQGQCGSCWAFSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLM 184
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA-V 244
D AF+YI N GI + Y YE C ++ A + Y DV DE++L +A
Sbjct: 185 DYAFQYIKDNLGIDTEDKYPYEA-EDDTC-RFSPDNVGATDSGYVDVDSGDEDALKEACA 242
Query: 245 ANQPVSVAIDAS--ALQFYSGGVFNG-YCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWG 300
AN P+SVAIDAS + Q Y GV++ C + L+HGV VGYGT G YW++KNSWG
Sbjct: 243 ANGPISVAIDASHESFQLYESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWG 302
Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WG++GY + R+ D QCGIA AS+P
Sbjct: 303 LSWGQEGYIWMSRNKDN---QCGIATSASYPT 331
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/346 (38%), Positives = 194/346 (56%), Gaps = 29/346 (8%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+V+L+ + AS ++ FD + E++ +K ++ + Y E+ R +I+ +N +
Sbjct: 4 LVILLCVVAAASAVSF--FD--LVKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHNIA 59
Query: 67 RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS------SLKANGTPFLYK 119
+ N A G S+ L+ NK+ D+ EF+ + GF + +S S G F+
Sbjct: 60 KHNQKYARGEVSFRLKQNKYGDMLHHEFVHTMNGFNKTTKNSKGLFGKSAGERGATFITP 119
Query: 120 SS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
++ +P V+W + GAVT VK QG+C + A+EG + + N LVSLSEQ L+DC
Sbjct: 120 ANVHLPDHVDWRKHGAVTEVKDQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDC 179
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ NNGC GG MD+AFKYI N+GI + Y YEG+ ++ A + D
Sbjct: 180 SAAYGNNGCNGGLMDNAFKYIKDNRGIDTEKSYPYEGIDDKC--RYNPKNTGADDNGFVD 237
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTS 286
+P DE L+ AVA PVSVAIDA S+ QFYS GV F+ C + L+HGV VGYGT
Sbjct: 238 IPSGDEGKLMAAVATVGPVSVAIDASQSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGTD 297
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G YWL+KNSWG+ WG+ GY ++ R+ D CGIA AS+P+
Sbjct: 298 ENGGDYWLVKNSWGRSWGDLGYIKMARNRDN---HCGIATAASYPL 340
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 182/316 (57%), Gaps = 20/316 (6%)
Query: 31 AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLT 89
A +F ++K+QY + Y + R +++K N V N G +Y + LN AD+
Sbjct: 20 ASEFTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNHLADMH 79
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
P+EF+A+ GF S +++ G PF + K + + V+W +KGA++PVK QG C
Sbjct: 80 PREFMATFLGFNRSLRATNKVPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGHCGSCW 139
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
+ A+E +K R VSLSEQ L+DC+ N NNGC GG M+ AF+Y+ N GI +
Sbjct: 140 AFSSTGALEAHTFLKKGRRVSLSEQNLIDCSLNYGNNGCEGGLMEQAFQYVRDNDGIDTE 199
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS--AL 258
Y YEG + K + A + +P DE++L++AVA Q P+S+AIDAS +
Sbjct: 200 EAYPYEGEDSEC--RFKKNNVGATDAGFVTIPSGDEQALMEAVATQGPLSIAIDASNPSF 257
Query: 259 QFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
QFYS GV + C + L+HGV VGYG E+ KYWL+KNSW + WGE+GY ++ R+ D
Sbjct: 258 QFYSEGVYYEPECSSAQLDHGVLLVGYGV-EKDQKYWLVKNSWSEQWGENGYIKMARNKD 316
Query: 317 QPQGQCGIAMFASFPV 332
CGIA ASFP+
Sbjct: 317 N---NCGIATQASFPI 329
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 189/348 (54%), Gaps = 30/348 (8%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
A Y ++VL +S CA+ FD + + + WK + ++Y ES E +R +++ N
Sbjct: 3 ALYLAVLVLCVSAVCAAP----RFDS-QLEDHWHLWKNWHSKSYHESEEGWRRM-VWEKN 56
Query: 62 LVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
L +E N +G SY L +N F D+T +EF + G+K ++ K G+ F+ +
Sbjct: 57 LKKIEMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK---QTTERKFKGSLFMEPN 113
Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
Q P +V+W EKG VTPVK QG C A+EG K +LVSLSEQ LVDC+
Sbjct: 114 YLQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCS 173
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ N GC GG MD AF+YI N G+ + Y Y G C K E A T + D+
Sbjct: 174 RPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPC-HYKPEFSGANETGFVDI 232
Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSE 287
P E +++KAVA PVSVAIDA + QFY G+ + C + L+HGV VGYG
Sbjct: 233 PSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG 292
Query: 288 E---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G KYW++KNSW + WG+ GY + +D + CGIA +S+P+
Sbjct: 293 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYPL 337
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/309 (41%), Positives = 177/309 (57%), Gaps = 25/309 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F W ++ ++Y E R+ ++++N + +E N+ N+S+ L +NKF DLT EF
Sbjct: 30 FADWMQEHQKSYANE-EFVYRWNVWRENYLYIEAHNHQ---NKSFHLAMNKFGDLTNAEF 85
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
G ++ + +++ P + +P +W +KGAVT VK QGQC
Sbjct: 86 NKLFKGLSITADQAKQESDIAP----APGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTT 141
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
+ EG N +K RL SLSEQ LVDC+T+ N+GC GG MD AF+YII+NKGI + Y Y
Sbjct: 142 GSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCNGGLMDYAFEYIIRNKGIDTEESYPY 201
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGG 264
S G C K + ++ +Y +VP +E +LL AVA QP SVAIDA S+ QFY GG
Sbjct: 202 HA-SQGTCRYNK-QHSGGELVSYTNVPSGNEGALLNAVATQPTSVAIDASHSSFQFYKGG 259
Query: 265 VFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
V++ + L+HGV AVG+G +G YWL+KNSWG DWG GY + R+ QC
Sbjct: 260 VYDEPACSSSRLDHGVLAVGWGV-RDGKDYWLVKNSWGADWGLSGYIEMSRN---KHNQC 315
Query: 323 GIAMFASFP 331
GIA AS P
Sbjct: 316 GIATAASHP 324
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 195/348 (56%), Gaps = 29/348 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K FL++++ I + + + + + +++ +K ++ + YK E R +IF DN
Sbjct: 2 KLFLLLIVAILATAQAISFFEL-----VNQEWTTFKMEHNKVYKNDIEERFRMKIFMDNK 56
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP----FL 117
+ + N N + SY L++NK+ D+ EF+ + GF S ++ L++ P F+
Sbjct: 57 HKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSI-NTQLRSERLPIGASFI 115
Query: 118 YKSSQV-PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
++ V P +V+W E GAVTPVK QG C A A+EG + + L+ LSEQ L+
Sbjct: 116 EPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLI 175
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC+ NNGC GG MD AF+YI NKG+ + Y YE + A + A+ Y
Sbjct: 176 DCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKC--RYNAANSGARDVGY 233
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYG 284
D+P +E+ L AVA PVSVAIDAS + QFYS GV+ C + L+HGV AVGYG
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
T E G YWL+KNSWG+ WG++GY ++ R+ CGIA AS+P+
Sbjct: 294 TDENGQDYWLVKNSWGETWGDNGYIKMARN---KLNHCGIASTASYPL 338
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 134/353 (37%), Positives = 190/353 (53%), Gaps = 37/353 (10%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K +++ ++ +CA + E++ +K ++ + Y E+ R +I+ +N
Sbjct: 2 KSIAVLLCVVGAACAVSLL------DLVREEWSAFKLEHSKRYDSEVEDKFRMKIYLENK 55
Query: 63 VAV----ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF-KMSDHSSSLKANG---- 113
+ +RF A+ SY LR NK+AD+ EF+ GF K H ++ G
Sbjct: 56 HRIAKHNQRFEQGAV---SYKLRPNKYADMLSHEFVHVMNGFNKTLKHPKAVHGKGRESR 112
Query: 114 -TPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
F+ + P V+W +KGAVT VK QG+C A+EG + K LVSLS
Sbjct: 113 PATFIAPAHVTYPDHVDWRKKGAVTEVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLS 172
Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
EQ L+DC+ NNGC GG MD+AFKYI N GI + Y YEG+ A++ A
Sbjct: 173 EQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTEKAYPYEGVDDKC--RYNAKNSGA 230
Query: 225 QITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVT 279
+ D+P DEE L++AVA PVSVAIDAS + QFYS GV+ T L+HGV
Sbjct: 231 DDVGFVDIPQGDEEKLMQAVATVGPVSVAIDASQESFQFYSDGVYYDENCSSTDLDHGVM 290
Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
VGYGT E+G YWL+KNSWG+ WG+ GY ++ R+ + CGIA AS+P+
Sbjct: 291 VVGYGTDEQGGDYWLVKNSWGRTWGDLGYIKMARNKNN---HCGIASSASYPL 340
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 179/320 (55%), Gaps = 22/320 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
I E+++ +K Q+ + Y++ E + R +++ DN + + R N G +Y L +N F DL
Sbjct: 26 IEEEWDLFKVQFKKIYEDVKEEAFRKKVYLDNKLKIARHNKLYETGEETYALEMNHFGDL 85
Query: 89 TPQEFIASQTGFK--MSDHSSSLKANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQGQC 144
E+ GFK ++ + + KS V P S++W +KG VTPVK QGQC
Sbjct: 86 MQHEYTKMMNGFKPSLAGGDKNFTDDDAVTFLKSENVVIPKSIDWRKKGYVTPVKNQGQC 145
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A ++EG + K LVSLSEQ L+DC+ NNGC GG MD AFKYI NKG
Sbjct: 146 GSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKG 205
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
+ + Y YE E+ A + D+P DE++L+ A+A PVS+AIDAS
Sbjct: 206 LDTEKSYPYEAEDDKC--RYNPENSGATDKGFVDIPEGDEDALVHALATVGPVSIAIDAS 263
Query: 257 A--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
+ QFY GVF N C T L+HGV AVGYGT +G YW++KNSWG+ WG+ GY +
Sbjct: 264 SEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMA 323
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R+ + CG+A AS+P+
Sbjct: 324 RN---KKNNCGVASSASYPL 340
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 189/348 (54%), Gaps = 30/348 (8%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
A Y ++VL +S CA+ FD + + + WK + + Y ES E +R +++ N
Sbjct: 3 ALYLAVLVLCVSAVCAAP----RFD-SQLEDHWHLWKNWHSKHYHESEEGWRRM-VWEKN 56
Query: 62 LVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
L +E N +G SY L +N F D+T +EF + G+K + + K G+ F+ +
Sbjct: 57 LKKIEIHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQT---TERKFKGSLFMEPN 113
Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
Q P +V+W EKG VTPVK QG C A+EG K +LVSLSEQ LVDC+
Sbjct: 114 YLQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCS 173
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ N GC GG MD AF+YI N G+ + Y Y G C K E AA T + D+
Sbjct: 174 RPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPC-HYKPEFSAANETGFVDI 232
Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSE 287
P E +++KAVA PVSVAIDA + QFY G+ + C + L+HGV VGYG
Sbjct: 233 PSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG 292
Query: 288 E---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G KYW++KNSW + WG+ GY + +D + CGIA +S+P+
Sbjct: 293 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYPL 337
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 135/345 (39%), Positives = 195/345 (56%), Gaps = 27/345 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L + LI++ +QA +F E + +++ +K ++ + YK E R +IF DN +
Sbjct: 3 LFLFLIVAVLATAQAI--SFFE-LVNQEWTTFKMEHNKVYKNDVEERFRMKIFMDNKHKI 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP----FLYKS 120
+ N N + SY L++NK+ D+ EF+ + GF S ++ L++ P F+ +
Sbjct: 60 AKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSI-NTQLRSERLPIAASFIEPA 118
Query: 121 SQV-PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+ V P +V+W E GAVTPVK QG C A A+EG + + L+ LSEQ L+DC+
Sbjct: 119 NVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCS 178
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
NNGC GG MD AF+YI NKG+ + Y YE + A + A+ Y D+
Sbjct: 179 GKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKC--RYNAANSGARDVGYVDI 236
Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYGTSE 287
P +E+ L AVA PVSVAIDAS + QFYS GV+ C + L+HGV AVGYGT E
Sbjct: 237 PQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYGTDE 296
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G YWL+KNSWG+ WG++GY ++ R+ CGIA AS+P+
Sbjct: 297 NGQDYWLVKNSWGETWGDNGYIKMARN---KLNHCGIASTASYPL 338
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/283 (43%), Positives = 165/283 (58%), Gaps = 20/283 (7%)
Query: 31 AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLT 89
A F+ +K + + Y+ E ++RF IF DNL + R N AA G ++T+ +N+FADLT
Sbjct: 17 AMSFDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLT 76
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
+E+ + + + L ++ SV+W +KGAVTP+K QGQC
Sbjct: 77 NEEYRQ----LYLRPYPTELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSCWS 132
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
+VEG +AI LVSLSEQQLVDC+ + N GC GG MD+AFKYII N G+ +
Sbjct: 133 FSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQ 192
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
Y Y G+CD K HA I+ Y+DVP N+E+ L AV PVSVAI+A + Q
Sbjct: 193 DYPYTARD-GVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQM 251
Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
YS GVF+G C T L+HGV VGY TS+ YW++KNSWG W
Sbjct: 252 YSSGVFSGPCGTNLDHGVLVVGY-TSD----YWIVKNSWGASW 289
>gi|297729067|ref|NP_001176897.1| Os12g0273900 [Oryza sativa Japonica Group]
gi|255670225|dbj|BAH95625.1| Os12g0273900 [Oryza sativa Japonica Group]
Length = 184
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 100/184 (54%), Positives = 127/184 (69%), Gaps = 1/184 (0%)
Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
+EG + +L+SLSEQ+LVDC + N+ GC GG +D AF++I+ N G+T +A Y Y
Sbjct: 1 MEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYT- 59
Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNG 268
G C + A D AA I YEDVP NDE SL+KAVA QPVSVA+DAS QFY GGV G
Sbjct: 60 AEDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFYGGGVMAG 119
Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
C T L+HGVT +GYG + +G KYWL+KNSWG WGE GY R+++DID +G CG+AM
Sbjct: 120 ECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQP 179
Query: 329 SFPV 332
S+P
Sbjct: 180 SYPT 183
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 177/316 (56%), Gaps = 22/316 (6%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
S ++ +E WK ++ + Y + E R++I++ N +E +NA +TL +NKF DL
Sbjct: 17 SFSQDWEDWKNEHNKKYSDDLEELTRYKIWQGNQKIIE-VHNANSDKFGFTLGMNKFGDL 75
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
EF G+ M S+S K YK+ P+V+W KGAVT VK QGQC
Sbjct: 76 ESHEFAEMFNGYMMQARSNSTKVFVADPNYKAD---PTVDWRTKGAVTGVKNQGQCGSCW 132
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
++EG + +K +LVSLSEQ LVDC+ + N GC GG MD AF+YI +N GI +
Sbjct: 133 AFSTTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTE 192
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SAL 258
A Y Y+ KA D A T Y D+ DE +L++AV PVSVAIDA S+
Sbjct: 193 ASYPYQAHDERC--RFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSF 250
Query: 259 QFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
Q Y GV+ +T L+HGV A+GYGT E G YWL+KNSWG DWG +GY + R+ +
Sbjct: 251 QLYRSGVYYERECSQTALDHGVLAIGYGT-EGGSDYWLVKNSWGTDWGMEGYIMMSRNRN 309
Query: 317 QPQGQCGIAMFASFPV 332
CGIA AS+P
Sbjct: 310 N---NCGIATEASYPT 322
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/345 (37%), Positives = 196/345 (56%), Gaps = 28/345 (8%)
Query: 8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
+VL++ A A + FD + E++ +K Q+ Y+ E++ R +I+ ++ + +
Sbjct: 4 LVLLLCAVAAVSAV-QFFD--LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAK 60
Query: 68 FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGF-KMSDHSSSL-----KANGTPFLYKS 120
N +G SY L +NK+ D+ EF+ + GF K + H+ +L G F+ +
Sbjct: 61 HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 120
Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
+ ++P V+W + GAVT +K QG+C A+EG + + LVSLSEQ L+DC+
Sbjct: 121 NVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS 180
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
NNGC GG MD+AFKYI N GI + Y YEG+ ++ A+ + D+
Sbjct: 181 EQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKC--RYNPKNTGAEDVGFVDI 238
Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY--CETFLNHGVTAVGYGTSE 287
P DE+ L++AVA PVSVAIDAS + Q YS GV+N T L+HGV VGYGT E
Sbjct: 239 PEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDE 298
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G+ YWL+KNSWG+ WGE GY ++ R+ +CGIA AS+P+
Sbjct: 299 QGVDYWLVKNSWGRSWGELGYIKMIRN---KNNRCGIASSASYPL 340
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 177/324 (54%), Gaps = 26/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
+ E + F ++A Y ++Y E +R+ IFK+NLV + N SY+L++N
Sbjct: 108 WKEAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY---SYSLKMNH 164
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQG 142
F DL+ EF GFK S + S L S++P V+W +G VTPVK Q
Sbjct: 165 FGDLSRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQR 224
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C A+EG + K +LVSLSEQ+L+DC+ + N C GG M+DAF+Y++ +
Sbjct: 225 DCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDS 284
Query: 196 KGITNDAVYSY----EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
GI ++ Y Y E C+ + +I ++DVP E ++ A+A PVS+
Sbjct: 285 GGICSEDAYPYLARDEECRAQSCEKV------VKILGFKDVPRRSEAAMKAALAKSPVSI 338
Query: 252 AIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGY 308
AI+A + QFY GVF+ C T L+HGV VGYGT +E K +W++KNSWG WG DGY
Sbjct: 339 AIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGY 398
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
+ + +GQCG+ + ASFPV
Sbjct: 399 MYMAMHKGE-EGQCGLLLDASFPV 421
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 138/328 (42%), Positives = 193/328 (58%), Gaps = 31/328 (9%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E+F+ W+A+Y RTY E +RF I+ +N+ ++ N + G+ SY L N+F DLT
Sbjct: 34 LLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGS-SYELGENQFTDLT 92
Query: 90 PQEF-------------IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
+EF A G + S++ +NG + + P SV+W KGAVT
Sbjct: 93 EEEFKDTYLMKLDEQPPAAEAMGPTVGTMSTAGMSNGN----NTGEAPNSVDWRTKGAVT 148
Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
VK Q QC VA++EG++ IK RLVSLSEQ++VDC N+NGC GG A
Sbjct: 149 RVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAM 208
Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
+++ +N G+T ++ Y Y G S C S K HAA+I Y+ V N+E L +AVA +PV
Sbjct: 209 EWVTRNGGLTTESDYPYVG-SQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAERPV 267
Query: 250 SVAIDAS-ALQFYSGGVFNGYCE-TFLNHGVTAVGYGTS---EEGIKYWLIKNSWGQDWG 304
+V IDAS A QFY GVF+G C+ T +NH VT VGYG++ G KYW++KNSWGQ WG
Sbjct: 268 AVFIDASRAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWG 327
Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E+GY R+ R + +G C IA+ +PV
Sbjct: 328 ENGYVRMARRVRAREGMCAIAIEPYYPV 355
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 182/316 (57%), Gaps = 24/316 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
++ WK+ + + Y E E+ +R +++ NL +E N + +G SY L +N+F D+T +E
Sbjct: 10 WQLWKSWHNKDYHEREESWRRV-VWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTEE 68
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA------ 145
F G+ + S K G+ FL S + P SV+W EKG VTPVK QGQC
Sbjct: 69 FRQLMNGY--AHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFS 126
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A+EG + K +LVSLSEQ LVDC+ + N GC GG MD AF+Y+ N GI ++ Y
Sbjct: 127 TTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESY 186
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
Y C KAE +AA T + D+P E +L+KAVA PVSVAIDA S+ QFY
Sbjct: 187 PYTAKDDEDC-RYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFY 245
Query: 262 SGGV-FNGYCETF-LNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
G+ + C + L+HGV VGYG E G KYW++KNSWG+ WG+ GY + +D
Sbjct: 246 QSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKD-- 303
Query: 317 QPQGQCGIAMFASFPV 332
+ CGIA AS+P+
Sbjct: 304 -RKNHCGIATAASYPL 318
>gi|115436338|ref|NP_001042927.1| Os01g0330300 [Oryza sativa Japonica Group]
gi|13365805|dbj|BAB39243.1| putative SAG12 protein [Oryza sativa Japonica Group]
gi|14164528|dbj|BAB55777.1| putative SAG12 protein [Oryza sativa Japonica Group]
gi|113532458|dbj|BAF04841.1| Os01g0330300 [Oryza sativa Japonica Group]
gi|125570199|gb|EAZ11714.1| hypothetical protein OsJ_01576 [Oryza sativa Japonica Group]
Length = 367
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/322 (40%), Positives = 174/322 (54%), Gaps = 27/322 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
F QW ++Y + Y E KR++++K N + F + A ++ T + +
Sbjct: 51 FSQWMSKYSKRYSCPEEQEKRYQVWKANTDFIGAFRSQTEISSGVGAFAPQTVTDSFVGM 110
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
N F DL EF+ TGF + + + + S +P V+W GAVT VK QG
Sbjct: 111 NLFGDLASGEFVRQFTGFNATGFVAPPPSPSP--IPPRSWLPCCVDWRSSGAVTGVKLQG 168
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
CA VAA+EG++ IK LVSLSEQ +VDC T +NGC GG D A +
Sbjct: 169 SCASCWAFAAVAAIEGLHRIKTGELVSLSEQVMVDCDTG--SNGCGGGRSDTALGLVASR 226
Query: 196 KGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
G+T++ Y Y G G CD K DH+A ++ + VPPNDE L AVA QPV+V ID
Sbjct: 227 GGVTSEERYPYAGARGG-CDVGKLLSDHSASVSGFAAVPPNDERQLALAVARQPVTVYID 285
Query: 255 ASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
ASA QFY GGV+ G C+ +NH VT VGY + G KYW+ KNSW DWGE GY L
Sbjct: 286 ASAPEFQFYKGGVYRGPCDPGRMNHAVTIVGYCENIGGDKYWIAKNSWSSDWGEQGYVYL 345
Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
+D+ PQG CG+A +P +
Sbjct: 346 AKDVWWPQGTCGLATSPFYPTA 367
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 140/349 (40%), Positives = 196/349 (56%), Gaps = 48/349 (13%)
Query: 10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
L++ +C + AT + S ++E +K ++ + Y E E ++R IF+DNL +E N
Sbjct: 3 LLVLLACVAMATAASL---SFESQWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTIESHN 58
Query: 70 NAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPP 125
A G SY L +N+FAD+T E++ G + +S+L G+ Y+ + QV
Sbjct: 59 QEADTGKHSYWLGVNQFADMTHAEYLNQVIGGCLI--TSNLTKTGSRATYRYMPNMQVND 116
Query: 126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
+V+W +KG VT +K QGQC ++EG +A LVSLSEQ LVDC+ + N
Sbjct: 117 TVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQEGNK 176
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH---------AAQITNY 229
GC GG MD F+YIIQNKGI + Y Y KA++H A ++++
Sbjct: 177 GCEGGDMDQGFQYIIQNKGIDTEQCYPY-----------KAKNHRCKFDNSCIGATMSSF 225
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY--CETFLNHGVTAVGYG 284
DV DE++L +A AN P+SV IDAS + QFYS GV+N + T L+HGV VGYG
Sbjct: 226 TDVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFECSSTKLDHGVLVVGYG 285
Query: 285 TSEEGIK-YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
T G K YWL+KNSWG WG +GY + R+ D QCG+A ASFPV
Sbjct: 286 TY--GSKDYWLVKNSWGTVWGNEGYIMMSRNKDN---QCGVATDASFPV 329
>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
Length = 341
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 191/346 (55%), Gaps = 29/346 (8%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+VVL+ + AS ++ FD + E++ +K ++ + Y E+ R +I+ +N +
Sbjct: 4 LVVLMCVVAAASAVSF--FD--LVKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHKIA 59
Query: 67 RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS------SSLKANGTPFLYK 119
+ N A G + ++ NK+ D+ EF+ + GF + + S G F+
Sbjct: 60 KHNQKFARGQVPFRVKQNKYGDMLHHEFVHTMNGFNKTTKNGKGLFGKSAGERGATFIPP 119
Query: 120 SS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
++ +VP V+W + GAVT VK QG+C A A+EG + + N LVSLSEQ L+DC
Sbjct: 120 ANVRVPDHVDWRKHGAVTEVKDQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNLIDC 179
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+T NNGC GG MD+AFKYI NKGI + Y YE + + A + D
Sbjct: 180 STAYGNNGCNGGLMDNAFKYIKDNKGIDTEKSYPYEAVDDKC--RYNPRNSGADDVGFID 237
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTS 286
+P DE L+ AVA PVSVAIDAS QFYS GV F+ C T L+HGV VGYGT
Sbjct: 238 IPSGDEGKLMAAVATVGPVSVAIDASQETFQFYSDGVYFDENCSSTSLDHGVLVVGYGTD 297
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G YWL+KNSWG+ WG+ GY ++ R+ D CGIA ASFP+
Sbjct: 298 ENGGDYWLVKNSWGRSWGDLGYIKMARNRDN---HCGIATAASFPL 340
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 177/324 (54%), Gaps = 26/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
+ E + F ++A Y ++Y E +R+ IFK+NLV + N SY+L++N
Sbjct: 107 WKEAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY---SYSLKMNH 163
Query: 85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQG 142
F DL+ EF GFK S + S L S++P V+W +G VTPVK Q
Sbjct: 164 FGDLSRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQR 223
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C A+EG + K +LVSLSEQ+L+DC+ + N C GG M+DAF+Y++ +
Sbjct: 224 DCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDS 283
Query: 196 KGITNDAVYSY----EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
GI ++ Y Y E C+ + +I ++DVP E ++ A+A PVS+
Sbjct: 284 GGICSEDAYPYLARDEECRAQSCEKV------VKILGFKDVPRRSEAAMKAALAKSPVSI 337
Query: 252 AIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGY 308
AI+A + QFY GVF+ C T L+HGV VGYGT +E K +W++KNSWG WG DGY
Sbjct: 338 AIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGY 397
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
+ + +GQCG+ + ASFPV
Sbjct: 398 MYMAMHKGE-EGQCGLLLDASFPV 420
>gi|125525718|gb|EAY73832.1| hypothetical protein OsI_01708 [Oryza sativa Indica Group]
Length = 366
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/322 (40%), Positives = 174/322 (54%), Gaps = 27/322 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
F QW ++Y + Y E KR++++K N + F + A ++ T + +
Sbjct: 50 FSQWMSKYSKRYSCPEEQEKRYQVWKANTDFIGAFRSQTEISSGVGAFAPQTVTDSFVGM 109
Query: 83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
N F DL EF+ TGF + + + + S +P V+W GAVT VK QG
Sbjct: 110 NLFGDLASGEFVRQFTGFNATGFVAPPPSPSP--IPPRSWLPCCVDWRSSGAVTGVKLQG 167
Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
CA VAA+EG++ IK LVSLSEQ +VDC T +NGC GG D A +
Sbjct: 168 SCASCWAFAAVAAIEGLHRIKTGELVSLSEQVMVDCDTG--SNGCGGGRSDTALGLVASR 225
Query: 196 KGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
G+T++ Y Y G G CD K DH+A ++ + VPPNDE L AVA QPV+V ID
Sbjct: 226 GGVTSEERYPYAGARGG-CDVGKLLSDHSASVSGFAAVPPNDERQLALAVARQPVTVYID 284
Query: 255 ASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
ASA QFY GGV+ G C+ +NH VT VGY + G KYW+ KNSW DWGE GY L
Sbjct: 285 ASAPEFQFYKGGVYRGPCDPGRMNHAVTIVGYCENIGGDKYWIAKNSWSSDWGEQGYVYL 344
Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
+D+ PQG CG+A +P +
Sbjct: 345 AKDVWWPQGTCGLATSPFYPTA 366
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 141/344 (40%), Positives = 190/344 (55%), Gaps = 36/344 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L + ++ + +SQ RT ++E +K + +TY+ E RF+IF +N + +
Sbjct: 7 LCAIAAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
+ N A G SY L +N+F DL EF G H + K G+ FL
Sbjct: 59 AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSSFLPPANVND 113
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
S +P V+W +KGAVTPVK QGQC A ++EG + +K LVSLSEQ LVDC+
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+ NNGC GG M+DAFKYI N GI + Y Y+ + G C K ED A T Y ++
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVD-GEC-RFKKEDVGATDTGYVEIK 231
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
E L KAVA P+SVAIDA S+ Q YS GV++ C + L+HGV VGYG +
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G KYWL+KNSW + WG+ GY + RD + QCGIA AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 179/320 (55%), Gaps = 22/320 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
I E++ +KAQ+ + Y++ E + R +++ DN + + R N G +Y L +N F DL
Sbjct: 26 IEEEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKLKIARHNKLYETGEETYALEMNHFGDL 85
Query: 89 TPQEFIASQTGFK--MSDHSSSLKANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQGQC 144
E+ GFK ++ + + KS V P +++W +KG VTPVK QGQC
Sbjct: 86 MQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKSENVVVPKAIDWRKKGYVTPVKNQGQC 145
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A ++EG + K LVSLSEQ L+DC+ NNGC GG MD AFKYI NKG
Sbjct: 146 GSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKG 205
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
+ + Y YE E+ A + D+P DE++L+ A+A PVS+AIDAS
Sbjct: 206 LDTEKSYPYEAEDDKC--RYNPENSGATDKGFVDIPEGDEDALMHALATVGPVSIAIDAS 263
Query: 257 A--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
+ QFY GVF N C T L+HGV AVGYGT +G YW++KNSWG+ WG+ GY +
Sbjct: 264 SEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMA 323
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R+ + CG+A AS+P+
Sbjct: 324 RN---KKNNCGVASSASYPL 340
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/345 (37%), Positives = 197/345 (57%), Gaps = 29/345 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F+++ L ++G A+ + D G + +EQWK+ +G++Y++ E +R +++ +L
Sbjct: 5 FVVLSLCLAGGLAAP----SLDPG-LDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRV 58
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ- 122
+E N ++G S+ L +N F D+ +EF G+K + K G+ FL + Q
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYK--QTHKKLQGSHFLEPNFQE 116
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP V+W ++G VTPVK QGQC A+EG + + +LVSLSEQ LV+C+ +
Sbjct: 117 VPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPE 176
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF+Y+ N GI ++ Y Y G C + +AA T + D+P
Sbjct: 177 GNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPC-HYNPQYNAANDTGFVDIPSG 235
Query: 236 DEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSE--- 287
E +L+KA+A PVSVAIDA ++ QFY G+ F C T L+HGV VGYG +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYW++KNSW + WG++GY + +D D CGIA AS+P+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDN---HCGIATAASYPL 337
>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
Length = 308
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/307 (42%), Positives = 180/307 (58%), Gaps = 23/307 (7%)
Query: 42 GRTYKESAENSKRFEIFKDNLVAVERFNN-AAIGNRSYTLRLNKFADLTPQEF--IASQT 98
G+ Y +E + R IF++N V++ N AA+G ++ +++NKF DLT +EF I +
Sbjct: 8 GKQYNSLSEENARHSIFEENSKIVKQHNEEAAMGKHTFFMKMNKFGDLTTEEFRMIVIGS 67
Query: 99 GFKMSDHSSSLKANGTPF-LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVE 150
GF S+ + +A G F +V +V+W +KGAVT VK Q QC A ++E
Sbjct: 68 GFMQSNKTQ--QAEGGVFESLPGLKVDDTVDWRQKGAVTKVKNQEQCGSCWAFSATGSLE 125
Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
G + +K N LVSLSEQ LVDC+ + N GC GG MD AFKYI N GI + YSY G
Sbjct: 126 GQHFLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYIKMNGGIDTEECYSYRGRD 185
Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN 267
+C K+ A +++Y D+ DE +L++AV+ P+SVAIDA + Q Y GV++
Sbjct: 186 ESMC-RYKSSCSGATLSSYTDIKTGDEMALMQAVSTVGPISVAIDAGHKSFQLYHHGVYD 244
Query: 268 --GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
T L+HGV AVGYG+S G YWL+KNSWG +WG +GY + R+ QCGIA
Sbjct: 245 EPKCSSTHLDHGVLAVGYGSS-NGSDYWLVKNSWGTEWGMEGYIMMSRN---KHNQCGIA 300
Query: 326 MFASFPV 332
A +PV
Sbjct: 301 TRAIYPV 307
>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
Length = 325
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 185/319 (57%), Gaps = 23/319 (7%)
Query: 28 GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFA 86
G ++ ++ W +G+TY R +IF++N + +++ N A G +Y+L +N++
Sbjct: 15 GELSGEWTLWTKLHGKTYTSFEIEELRVKIFEENRIKIQKHNAEAQNGLHTYSLEMNQYG 74
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
DL EF+ TG +S + T L S+ VP VNW + GAVT VK Q C
Sbjct: 75 DLLQSEFLQGYTGLAKGSYS----GDNTVILDNSAPVPSYVNWTKNGAVTAVKDQKDCGS 130
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
+VEG IK +L+S SEQQLVDC+++ N GC GG+MD+AFKY+I NKGI
Sbjct: 131 CWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAFKYLIANKGIA 190
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASA- 257
+ Y Y + G+C K A +I++++DV E+ L AVA P+SVAIDAS+
Sbjct: 191 TEDTYPYTA-TDGVCVYNKTM-AAGRISSFKDVKHGSEDQLKLAVAQIGPISVAIDASSG 248
Query: 258 -LQFYSGGVF-NGYCET-FLNHGVTAVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQR 313
QFY GV+ + C + +L+HGV AVGYGT + G+ YWL+KNSW WG+ GY ++ R
Sbjct: 249 DFQFYKKGVYVDEECSSKYLDHGVLAVGYGTDKGTGLDYWLVKNSWSASWGDQGYIKMAR 308
Query: 314 DIDQPQGQCGIAMFASFPV 332
+ + CGIA AS+PV
Sbjct: 309 N---HKNMCGIASLASYPV 324
>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
Length = 331
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 188/342 (54%), Gaps = 27/342 (7%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+ + +L+ S + A T D + WK YG+ YKE E + R I++ NL
Sbjct: 2 KWLVWALLVCSSTVAQLHRDPTLDH-----HWHLWKKAYGKQYKEKNEEAARRLIWEKNL 56
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
V N ++G SY + +N AD+T +E ++ + ++ N T L +
Sbjct: 57 KFVTLHNLEHSMGMHSYDVGMNHLADMTSEEVVSLMSSLRIPH---QWPRNVTYKLNPNQ 113
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P SV+W E+G VT VKYQG C AV A+E +K LVSLS Q LVDC+T
Sbjct: 114 KLPDSVDWRERGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTT 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM +AF+YII N GI ++A Y Y+ M ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDQKC--HYDSKHRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
EE+L +AVAN+ PVSVAIDAS F+ SG + C +NHGV AVGYG + +G
Sbjct: 232 FGSEEALKEAVANKGPVSVAIDASHSSFFLYRSGVYYEPSCTQNVNHGVLAVGYG-NLKG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG +GE GY R+ R+ + CGIA + S+P
Sbjct: 291 KDYWLVKNSWGIHFGEQGYIRMARN---SKNHCGIANYPSYP 329
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/323 (38%), Positives = 179/323 (55%), Gaps = 26/323 (8%)
Query: 32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT----LRLNKFAD 87
E FE+W ++ + Y E ++R+ F NL V + N A G R+ + + +N FAD
Sbjct: 49 ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRN--AEGRRAPSSGQGVGMNVFAD 106
Query: 88 LTPQEFIASQTGF----KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
L+ +EF + K ++ + + G + P S++W ++GAVT VK QG
Sbjct: 107 LSNEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGD 166
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EGINAI L+SLSEQ+LVDC T N GC GG+MD AF+++I N
Sbjct: 167 CGSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTT--NEGCDGGYMDYAFEWVINNG 224
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
GI ++A Y Y G + +C++ K E I YEDV E +LL A QPVSV ID S
Sbjct: 225 GIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDV-ATSESALLCAAVQQPVSVGIDGS 283
Query: 257 AL--QFYSGGVFNGYCE---TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
+L Q Y+GG+++G C ++H V VGYG + G YW++KNSWG DWG GY +
Sbjct: 284 SLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYG-QQGGTDYWIVKNSWGTDWGMQGYIYI 342
Query: 312 QRDIDQPQGQCGIAMFASFPVSK 334
+R+ P G C I AS+P +
Sbjct: 343 RRNTGLPYGVCAIDAMASYPTKQ 365
>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 330
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 132/342 (38%), Positives = 189/342 (55%), Gaps = 31/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL+++ S + FDE S+ ++E+WK+ + R Y E R I++ N+ +
Sbjct: 4 LVCVLLLATSALGR-----FDESSLDAQWEEWKSTHRREYNGLGEEGIRRAIWEKNMRMI 58
Query: 66 ERFNN-AAIGNRSYTLRLNKFADLTPQEFIASQTGFKM---SDHSSSLKANGTPFLYKSS 121
E N AA+G S+ + +N D+T +E + TG ++ + S +L + P S
Sbjct: 59 EAHNEEAALGIHSFEMGMNHLGDMTSEEVVEKMTGLQIPMNQERSFTLAMDDMP-----S 113
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P SV++ +KG VT VK QG C A A+EG A +LV LS Q LVDC+
Sbjct: 114 KIPKSVDYRKKGMVTSVKNQGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGK 173
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
N+GC GGFM AF+Y+I N GI +DA Y Y G AA ++Y+ +P
Sbjct: 174 YGNHGCNGGFMTRAFQYVIDNHGIDSDASYPYTGRDEQC--RYNPATRAANCSSYQFLPE 231
Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGI 290
DE +L +A+A P+SVAIDA FY GV+N C +NHGV AVGYG S G
Sbjct: 232 GDENALKQALATIGPISVAIDARRPRFSFYRSGVYNDPSCTQEVNHGVLAVGYG-SLNGQ 290
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG +G+ GY R+ R+ QCGIA++A +PV
Sbjct: 291 DYWLVKNSWGSTFGDQGYIRMARNTGN---QCGIALYACYPV 329
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/349 (37%), Positives = 199/349 (57%), Gaps = 29/349 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M F+++ L ++G A+ + D G + +EQWK+ +G++Y++ E +R ++++
Sbjct: 1 MRLPFVVLSLCLAGGLAAP----SLDPG-LDTHWEQWKSWHGKSYEQKEETWRRM-VWEE 54
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+L +E N ++G S+ L +N F D+ +EF G+K + K G+ FL
Sbjct: 55 HLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYK--QTHKKLQGSHFLEP 112
Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
+ +VP V+W ++G VTPVK QGQC A+EG + + +LVSLSEQ LV+C
Sbjct: 113 NFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVEC 172
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ + N GC GG MD AF+Y+ N GI ++ Y Y G C + +AA T + D
Sbjct: 173 SKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPC-HYNPQYNAANDTGFVD 231
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTS 286
+P E +L+KA+A PVSVAIDA ++ QFY G+ F C T L+HGV VGYG
Sbjct: 232 IPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVE 291
Query: 287 E---EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ +G KYW++KNSW + WG++GY + +D D CGIA AS+P+
Sbjct: 292 KRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDN---HCGIATAASYPL 337
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 189/348 (54%), Gaps = 30/348 (8%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
A Y ++VL +S CA+ FD + + + WK + ++Y ES E +R +++ N
Sbjct: 3 ALYLAVLVLCVSAVCAAP----RFDS-QLEDHWHLWKNWHSKSYHESEEGWRRM-VWEKN 56
Query: 62 LVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
L +E N +G SY L +N F D+T +EF + G+K ++ K G+ F+ +
Sbjct: 57 LKKIEMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK---QTTERKFKGSLFMEPN 113
Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
Q P +V+W EKG VTPVK QG C A+EG K +LVSLSEQ LVDC+
Sbjct: 114 YLQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCS 173
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ N GC GG MD AF+YI N G+ + Y Y G C K E A T + D+
Sbjct: 174 RPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPC-HYKPEFSGANETGFVDI 232
Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSE 287
P E +++KAVA PVSVAIDA + QFY G+ + C + L+HGV VGYG
Sbjct: 233 PSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEG 292
Query: 288 E---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G KYW++KNSW + WG+ GY + +D + CGIA +S+P+
Sbjct: 293 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYPL 337
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 182/323 (56%), Gaps = 25/323 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K Q+ + Y E+ R +I+ +N + + N G SY L NK+ D+
Sbjct: 24 VKEEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDM 83
Query: 89 TPQEFIASQTGF-KMSDHSSSL-----KANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQ 141
EFI + G+ + + H+ L G F+ + + P V+W +KGAVT VK Q
Sbjct: 84 LHHEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQ 143
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
G+C A+EG + K LVSLSEQ L+DC++ NNGC GG MD+AFKYI
Sbjct: 144 GKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKD 203
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAI 253
N GI + Y YEG+ ++ A+ + D+P DEE L++AVA PVSVAI
Sbjct: 204 NGGIDTEKTYPYEGVDDKC--RYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVSVAI 261
Query: 254 DAS--ALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
DAS + QFYSGGV ++ C T L+HGV VGYGT E G YWL+KNSW + WGE GY
Sbjct: 262 DASQNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGGDYWLVKNSWSRTWGELGYI 321
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
++ R+ D CGIA AS+P+
Sbjct: 322 KMARNRDN---HCGIATDASYPL 341
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 190/340 (55%), Gaps = 36/340 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+V ++I+ + F E S ++ WK +G+TY E+ +R I+ DNL V
Sbjct: 8 LLVAVLIA---------QCFSELSQDRQWHAWKDFHGKTYTGEEEDLRR-AIWNDNLEIV 57
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVP 124
++ N N SY L +N FADLT EF G++ + +S+ G+ FL S+ Q+P
Sbjct: 58 KKHNAE---NHSYKLDMNHFADLTVTEFKQRFMGYRAASNSTG----GSTFLPLSNVQLP 110
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
V+W +KG VT VK QGQC + ++EG + K +LVSLSEQ LVDC+ N
Sbjct: 111 AEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGN 170
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
NGC GG MD AFKYI N GI + Y Y G C K A +T Y DV E
Sbjct: 171 NGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARD-GQC-HFKPGSVGATVTGYTDVQRGSE 228
Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKY 292
L AVA P+SVAIDA S+ Q Y GV++ T L+HGV AVGYG +E+G Y
Sbjct: 229 GDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYG-AEDGKDY 287
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WL+KNSWG+ WG +GY ++ R+ D QCGIA AS+P+
Sbjct: 288 WLVKNSWGEGWGMNGYIKMSRNKDN---QCGIATQASYPL 324
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 181/313 (57%), Gaps = 29/313 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
F+ +K ++G+TYK AE +KRF IF++NL +E N G SYT +NKFAD+T E
Sbjct: 26 FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85
Query: 93 F---IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
F +A+Q K S+ A T L VP S++W + VTP+K Q QC
Sbjct: 86 FKAMLATQVKTK-----PSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWS 140
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
V + EG A+ +L SEQQLVDC T D N GC GG++DD F YI Q G+ ++
Sbjct: 141 FAVVGSTEGAYALSTGKLTRFSEQQLVDC-TTDLNYGCDGGYLDDTFPYI-QTNGLELES 198
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQFY 261
Y Y G G C S + ++++Y VP N E++LL+AV PV++AI+A LQFY
Sbjct: 199 DYPYTGYD-GSC-SYDSSKVVTKVSSYVSVPAN-EQALLEAVGTAGPVAIAINADDLQFY 255
Query: 262 SGGVFNG-YCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
G+ + YC+ +L+HGV AVGY SE G+ YWLIKNSWG DWGE GYFR R Q
Sbjct: 256 FSGIIDDKYCDPEWLDHGVLAVGY-NSENGLDYWLIKNSWGADWGESGYFRFLRG----Q 310
Query: 320 GQCGIAMFASFPV 332
CG+ A +P+
Sbjct: 311 NICGVKEDAVYPL 323
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 114/221 (51%), Positives = 146/221 (66%), Gaps = 12/221 (5%)
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P SV+W ++GAV VK QG C + AVEGIN I L+SLSEQ+LVDC T+
Sbjct: 3 IPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS- 61
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF++II+N GI + Y Y+ + G CD + I YEDVP N
Sbjct: 62 YNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYK-AADGRCDQNRKNAKVVTIDAYEDVPEN 120
Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
+E +L KA+ANQP+SVAI+A A Q YS GVF+G C T L+HGV AVGYGT E G YW
Sbjct: 121 NEAALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGT-ENGKDYW 179
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
+++NSWG WGE GY ++ R+I + G+CGIAM AS+P+ K
Sbjct: 180 IVRNSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPIKK 220
>gi|351721011|ref|NP_001238219.1| P34 probable thiol protease precursor [Glycine max]
gi|1199563|gb|AAB09252.1| 34 kDa maturing seed vacuolar thiol protease precursor [Glycine
max]
Length = 379
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/332 (38%), Positives = 178/332 (53%), Gaps = 24/332 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
++ F+ WK+++GR Y E +KR EIFK+N + N S+ L LNKFAD+T
Sbjct: 40 VSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKSPHSHRLGLNKFADIT 99
Query: 90 PQEFIAS--QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
PQEF Q +S Y P S +W +KG +T VKYQG C
Sbjct: 100 PQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGRG 159
Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
A A+E +AI LVSLSEQ+LVDC + + G Y G+ +F++++++ GI
Sbjct: 160 WAFSATGAIEAAHAIATGDLVSLSEQELVDCV--EESEGSYNGWQYQSFEWVLEHGGIAT 217
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE-------ESLLKAVANQPVSVAI 253
D Y Y G C + K +D I YE + +DE ++ L A+ QP+SV+I
Sbjct: 218 DDDYPYRA-KEGRCKANKIQDKVT-IDGYETLIMSDESTESETEQAFLSAILEQPISVSI 275
Query: 254 DASALQFYSGGVFNGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
DA Y+GG+++G T +NH V VGYG S +G+ YW+ KNSWG+DWGEDGY
Sbjct: 276 DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYG-SADGVDYWIAKNSWGEDWGEDGYIW 334
Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
+QR+ G CG+ FAS+P +ES SA
Sbjct: 335 IQRNTGNLLGVCGMNYFASYPTKEESETLVSA 366
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 192/343 (55%), Gaps = 32/343 (9%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL
Sbjct: 3 WLVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKF 58
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ- 122
V N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPN 112
Query: 123 --VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 174 ND-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N GC GGFM AF+YII NKGI +DA Y Y+ M ++ AA + Y ++
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTEL 230
Query: 233 PPNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEE 288
P + E+ L +AVAN+ PVSV +DAS F+ SG + C +NHGV VGYG
Sbjct: 231 PYSREDVLKEAVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLN 289
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G +YWL+KNSWG+++GE+GY R+ R+ CGIA F S+P
Sbjct: 290 GKEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 176/308 (57%), Gaps = 21/308 (6%)
Query: 40 QYGRTYKESAENSK---RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIAS 96
++ RT+++S + RFEI+K N + +N S+T+ +N+F DLT EF
Sbjct: 97 EWMRTHRKSYHHDHFLPRFEIWKTNNRWITHWNKKHANASSFTVAINQFGDLTSDEFNRL 156
Query: 97 QTGFKM-SDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAA 148
G + S +S K ++ +P S +W +KG V+ VK QG C +
Sbjct: 157 YNGLHVFSAPKASEKVERPRQWANTAGIPESGDWRQKGVVSRVKDQGMCGSCWAFSTTGS 216
Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNN-GCYGGFMDDAFKYIIQNKGITNDAVYSYE 207
EGINAI +RLV LSEQ LVDCAT +N GC GGFMD+AF+YII NKGI ++A Y Y
Sbjct: 217 TEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRYIIDNKGIDSEASYPYV 276
Query: 208 GMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGV 265
+ G C + + + +P DE++LL A A QP+SV IDA + QFYS GV
Sbjct: 277 A-ADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVGIDAGRPSFQFYSKGV 335
Query: 266 FN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
+N C T LNHGV VG+G E G YWL+KNSWGQ WG DGY ++ RD + QCG
Sbjct: 336 YNEPECSSTELNHGVLIVGWGV-ERGQAYWLVKNSWGQTWGMDGYIKMSRDKNN---QCG 391
Query: 324 IAMFASFP 331
IA AS+P
Sbjct: 392 IATLASYP 399
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 117/228 (51%), Positives = 147/228 (64%), Gaps = 14/228 (6%)
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P SV+W E GAV PVK Q C VAAVEGIN I L+SLSEQ+LVDC T +
Sbjct: 6 LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDT-E 64
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
+ GC GG MD AF +II+N G+ + Y Y G G C+ I YEDVPP
Sbjct: 65 YDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFD-GECNLSGKSSKVVSIDGYEDVPPF 123
Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE++L KAVA+QPVSVA++A ALQ Y G+F G C T L+HG+ AVGYGT E G YW
Sbjct: 124 DEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT-ENGTDYW 182
Query: 294 LIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPS 340
+++NSWG WGE+GY R++R++ D G+CGIAM AS+P+ K PS
Sbjct: 183 IVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI-KNGENPS 229
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 194/343 (56%), Gaps = 45/343 (13%)
Query: 15 SCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIG 74
S A ++ +R+ +E + +E+ A++G+ Y E +RF+I K+NL VE+ N G
Sbjct: 35 SHADKSGWRSDEE--VMSIYEEXLAKHGKVYNAIDEMEERFQISKENLKFVEQHN---AG 89
Query: 75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA 134
NR+Y + LN+FAD + +M SS A P + S + SV+W ++GA
Sbjct: 90 NRTYKVGLNRFADRS-----------RMMTRPSSRYA---PRV--SDNLSESVDWRKEGA 133
Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
V VK Q +C +AAVEGIN I L +LS DC N GC GG D
Sbjct: 134 VVRVKTQSECESCRTFTVIAAVEGINKIVTGNLTALS-----DC-DRTVNAGCSGGLADY 187
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
A ++II N GI + Y ++G + GICD K + YE VP DE +L KAVANQ
Sbjct: 188 ALEFIINNGGIDTEEDYPFQG-AVGICDQYKIN----AVDGYERVPAYDELALKKAVANQ 242
Query: 248 PVSVA-IDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
PVSVA I+A Q Y G+F G C T ++HGVTAVGYGT E GI YW++KNSWG++WG
Sbjct: 243 PVSVAYIEAYGKEFQLYESGIFTGKCGTSIDHGVTAVGYGT-ENGIDYWIVKNSWGENWG 301
Query: 305 EDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
E GY R++R+ + G+CGIA+ +P+ K PS+ D SS
Sbjct: 302 EAGYVRMERNTAEDTAGKCGIAILTLYPI-KSGQNPSNPDNSS 343
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 181/313 (57%), Gaps = 29/313 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
F+ +K ++G+TYK AE +KRF IF++NL +E N G SYT +NKFAD+T E
Sbjct: 26 FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85
Query: 93 F---IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
F +A+Q K S+ A T L VP S++W + VTP+K Q QC
Sbjct: 86 FKAMLATQVKTK-----PSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWA 140
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
V + EG A+ +L SEQQLVDC T D N GC GG++DD F YI Q G+ ++
Sbjct: 141 FAVVGSTEGAYALSTGKLTRFSEQQLVDC-TTDLNYGCDGGYLDDTFPYI-QTNGLELES 198
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQFY 261
Y Y G G C S ++ ++++Y VP N E++LL+AV PV++AI+A LQFY
Sbjct: 199 DYPYTGYD-GYC-SYESSKVVTKVSSYVSVPAN-EQALLEAVGTAGPVAIAINADDLQFY 255
Query: 262 SGGVFNG-YCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
G+ + YC+ +L+HGV AVGY SE G YWLIKNSWG DWGE GYFR R Q
Sbjct: 256 FSGIIDDKYCDPEYLDHGVLAVGY-DSENGRDYWLIKNSWGADWGESGYFRFLRG----Q 310
Query: 320 GQCGIAMFASFPV 332
CG+ A +P+
Sbjct: 311 NICGVKEDAVYPL 323
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 135/343 (39%), Positives = 192/343 (55%), Gaps = 34/343 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
+A +F ++V+ IS S + + + KF+ +K ++G+TY AE SKRF IF D
Sbjct: 3 VAIFFSLLVVAISASISEE----------LGAKFQAFKLEHGKTYLNQAEESKRFNIFTD 52
Query: 61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ A+E N G SY +NKF D++ +EF +T +S S T ++
Sbjct: 53 NVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEF---KTMLTLS-ASRKPTLETTSYVKT 108
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
++P SV+W ++G VT VK QG C + EG A K +LVSLSEQQL+DC
Sbjct: 109 GVEIPSSVDWRKEGRVTGVKDQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDCC 168
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
T D + GC GG +DD FKY++++ G+ ++ Y+Y+G G C +++ Y +
Sbjct: 169 T-DTSAGCDGGSLDDNFKYVMKD-GLQSEESYTYKG-EDGAC-KYNVASVVTKVSKYTSI 224
Query: 233 PPNDEESLLKAVAN-QPVSVAIDASALQFYSGGVFNGY-CE-TFLNHGVTAVGYGTSEEG 289
P DE++LL+AVA PVSV +DAS L Y G++ C LNH + AVGYGT E G
Sbjct: 225 PAEDEDALLEAVATVGPVSVGMDASYLSSYDSGIYEDQDCSPAGLNHAILAVGYGT-ENG 283
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+IKNSWG WGE GYFRL R + QCGI+ +P
Sbjct: 284 KDYWIIKNSWGASWGEQGYFRLAR----GKNQCGISEDTVYPT 322
>gi|414887427|tpg|DAA63441.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
Length = 355
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 126/332 (37%), Positives = 181/332 (54%), Gaps = 34/332 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L++VL+ + + ++ + ++F W+A Y R+Y +AE +RFE+++ N+ +
Sbjct: 15 LMLVLMAGAASGGRVD---VEDMLMMDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELI 71
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT-------GFKMSDHSSSLKANGTPFLY 118
E N A SY L F DLT +EF+A+ T H + + P
Sbjct: 72 EATNRRA--ELSYQLSETPFTDLTSEEFLATHTMSTRLHASEAARRHRELITTHAGPVSD 129
Query: 119 KSSQ-----------VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRL 160
Q VP SV+W KGAVT VK QG C VAA+EG++ I+ +L
Sbjct: 130 GGRQWNRRNYTTDLDVPESVDWRTKGAVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQL 189
Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
VSLSEQ+++DC++ NN GC+GG A ++ N G+T ++ Y YEG G C KA
Sbjct: 190 VSLSEQEVLDCSSPPNN-GCHGGNPAAAIDWVSANGGLTTESDYPYEGRQ-GKCKLDKAR 247
Query: 221 DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQ-FYSGGVFNGYCETF-LNHGV 278
+H A+I + V N+E +L AVA QPV+V ++ +Q Y GVF+G C+ LNH V
Sbjct: 248 NHVAKIRGRKLVDQNNEAALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGPCDPEDLNHAV 307
Query: 279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
T VGYG G KYW++KNSWG+ WGE GYFR
Sbjct: 308 TMVGYGAESGGRKYWIVKNSWGEKWGEKGYFR 339
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 129/345 (37%), Positives = 197/345 (57%), Gaps = 29/345 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F+++ L ++G A+ + D G + +EQWK+ +G++Y++ E +R +++ +L
Sbjct: 5 FVVLSLCLAGGLAAP----SLDPG-LDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRV 58
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E N ++G S+ L +N F D+ +EF G+K + K G+ FL + +
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYK--QTHKKLQGSHFLEPNFLE 116
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP V+W ++G VTPVK QGQC A+EG + + +LVSLSEQ LV+C+ +
Sbjct: 117 VPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPE 176
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF+Y+ N GI ++ Y Y G C + +AA T + D+P
Sbjct: 177 GNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPC-HYNPQYNAANDTGFVDIPSG 235
Query: 236 DEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSE--- 287
E +L+KA+A PVSVAIDA ++ QFY G+ F C T L+HGV VGYG +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYW++KNSW + WG++GY + +D D CGIA AS+P+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDN---HCGIATAASYPL 337
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 137/347 (39%), Positives = 190/347 (54%), Gaps = 30/347 (8%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
Y I+ L S A+ + ++ + + WK+ + + Y E E +R I++ NL
Sbjct: 3 YLCILALSFGASFAAPGL-----DPALNDHWLSWKSWHSKKYHEKEEGWRRM-IWEKNLK 56
Query: 64 AVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
+E N + ++G SY L +N F D+T +EF GFK S S K G+ FL +
Sbjct: 57 MIELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQS--RSQRKYKGSQFLEPNFL 114
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
Q P SV+W EKG VTPVK QGQC A A+EG + K +LVSLSEQ L+DC+
Sbjct: 115 QAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGP 174
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
+ N GC GG MD AF+YI N GI ++ Y Y G C K E ++A T + D+P
Sbjct: 175 EGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDC-LYKPEYNSANDTGFVDIPE 233
Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYG----T 285
E +L+KAVA P+SVAIDAS + QFY GV + C + L+HGV VGYG
Sbjct: 234 GRERALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVGYGYEGTD 293
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ +YW++KNSW + WG+ GY + +D CGIA AS+P+
Sbjct: 294 DDNKKRYWIVKNSWSEKWGDQGYIHMAKD---RSNNCGIASAASYPM 337
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 135/323 (41%), Positives = 185/323 (57%), Gaps = 20/323 (6%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
SI E F+QW+ ++ + YK + E KRF FK NL + R + + LNKFADL
Sbjct: 38 SIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLR-HRVGLNKFADL 96
Query: 89 TPQEFI-ASQTGFKMSDHSSSLKA-NGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
+ +EF + K + + + A + + +S P S++W +KG VT VK QG C
Sbjct: 97 SNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCGS 156
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
A+EGINAI + L+SLSEQ+LVDC T N GC GG+MD AF+++I N GI
Sbjct: 157 CWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVINNGGID 214
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL- 258
+A Y Y G+ G C++ K E I Y+DV D +LL A A QP+SV ID SA+
Sbjct: 215 TEANYPYTGVD-GTCNTAKEEIKVVSIDGYKDVDETDS-ALLCAAAQQPISVGIDGSAID 272
Query: 259 -QFYSGGVFNGYCETFLN---HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
Q Y+GG+++G C + H V VGYG SE G YW++KNSWG WG +GYF ++R+
Sbjct: 273 FQLYTGGIYDGDCSDDPDDIDHAVLIVGYG-SENGEDYWIVKNSWGTSWGIEGYFYIKRN 331
Query: 315 IDQPQGQCGIAMFASFPVSKESA 337
D P G C I AS+P + SA
Sbjct: 332 TDLPYGVCAINAMASYPTKEASA 354
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 129/343 (37%), Positives = 190/343 (55%), Gaps = 25/343 (7%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
++ L++ +C S + + E ++ WK+ + + Y E E +R +++ NL +E
Sbjct: 1 MLPLLVLTACLSSVLSAPVLDAQLNEHWDLWKSWHSKKYHEKEEGWRRM-VWEKNLQKIE 59
Query: 67 RFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
N ++G S+ L +N F D+T +EF G+K+ + K G+ F+ + P
Sbjct: 60 LHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLK---TQRKFTGSLFMEPNFMTAP 116
Query: 126 S-VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
S V+W EKG VTPVK QGQC A+EG K +LVSLSEQ LVDC+ + N
Sbjct: 117 SAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGN 176
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MD AF+Y+ N+G+ ++ Y Y G C + +A T + DVP E
Sbjct: 177 EGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYN-SANDTGFVDVPSGKE 235
Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEE---G 289
+L+KAVA+ PVSVAIDA + QFY G+ + C + L+HGV AVGYG E G
Sbjct: 236 HALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKMG 295
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
K+W++KNSWG+ WG+ GY + +D + CGIA AS+P+
Sbjct: 296 KKFWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYPL 335
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 131/345 (37%), Positives = 188/345 (54%), Gaps = 29/345 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M FL+ L + A+ +FD +E+WK ++G+TY + E KR ++++
Sbjct: 1 MTPIFLLATLCLGMISAAPTHDPSFDT-----VWEEWKTKHGKTYNTNEEGQKR-AVWEN 54
Query: 61 NLVAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ + N + G ++L +N F DLT EF TGF+ + +K PFL
Sbjct: 55 NMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ-GQKTKMMKVFPEPFL-- 111
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
VP +V+W + G VTPVK QG C AV ++EG K +LV LSEQ LVDC+
Sbjct: 112 -GDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCS 170
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ N GC GG D AF+Y+ N G+ Y YE ++ G C + AA++ + +
Sbjct: 171 WSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALN-GTC-RYNPKYSAAKVVGFMSI 228
Query: 233 PPNDEESLLKAVAN-QPVSVAID--ASALQFYSGGVF--NGYCETFLNHGVTAVGYGTSE 287
PP+ E +L+KAVA P+SV ID + QFY GG++ T LNH V VGYG
Sbjct: 229 PPS-ENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEES 287
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYWL+KNSWG+DWG DGY ++ +D + CGIA AS+P+
Sbjct: 288 DGRKYWLVKNSWGRDWGMDGYIKMAKDWNN---NCGIASDASYPI 329
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 181/316 (57%), Gaps = 24/316 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
++ WK+ + + Y E E +R +++ NL +E N + A+G SY L +N+F D+T +E
Sbjct: 134 WQLWKSWHRKDYHEREEGWRRV-VWEKNLKMIEIHNLDHALGKHSYKLGMNQFGDMTTEE 192
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA------ 145
F G+ S K G+ FL + + P SV+W EKG VTPVK QGQC
Sbjct: 193 FRQLMNGYVHK--KSERKYRGSQFLEPNFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFS 250
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A+EG + K +LVSLSEQ LVDC+ + N GC GG MD AF+Y+ N GI ++ Y
Sbjct: 251 TTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESY 310
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
Y C KAE +AA T + D+P E +L+KAVA PVSVAIDA S+ QFY
Sbjct: 311 PYTAKDDEDC-RYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFY 369
Query: 262 SGGV-FNGYCETF-LNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
G+ + C + L+HGV VGYG E G KYW++KNSWG+ WG+ GY + +D
Sbjct: 370 QSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKD-- 427
Query: 317 QPQGQCGIAMFASFPV 332
+ CGIA AS+P+
Sbjct: 428 -RKNHCGIATAASYPL 442
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 189/324 (58%), Gaps = 23/324 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E+F+ W+A+Y RTY E +RF I+ +N+ ++ N + G+ SY L N+F DLT
Sbjct: 60 LLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGS-SYELGENQFTDLT 118
Query: 90 PQEFIAS---------QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKY 140
+EF + M ++ G + + P SV+W KGAVT VK
Sbjct: 119 EEEFKDTYLMKLDEQPPAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKD 178
Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
Q QC VA++EG++ IK RLVSLSEQ++VDC N+NGC GG A +++
Sbjct: 179 QQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVT 238
Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
+N G+T ++ Y Y G S C S K HAA+I Y+ V N+E L +AVA QPV+V +
Sbjct: 239 RNGGLTTESDYPYVG-SQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAGQPVAVFV 297
Query: 254 DAS-ALQFYSGGVFNGYCE-TFLNHGVTAVGYGTS---EEGIKYWLIKNSWGQDWGEDGY 308
DAS A QFY GVF+G C+ T +NH VT VGYG++ G KYW++KNSWGQ WGE+GY
Sbjct: 298 DASRAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGY 357
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
R+ R + +G C IA+ +PV
Sbjct: 358 VRMARRVRAREGMCAIAIEPYYPV 381
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 132/344 (38%), Positives = 190/344 (55%), Gaps = 23/344 (6%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
+++VL + S + +E I E++ +K Q+ + Y++ E + R +++ DN + +
Sbjct: 3 VVIVLGLVAFAISSVSSINLNE-VIEEEWSLFKMQFKKLYEDIKEETFRKKVYLDNKLKI 61
Query: 66 ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMS---DHSSSLKANGTPFLYKSS 121
R N G +Y L +N F DL E+ GFK S S+ G FL +
Sbjct: 62 ARHNKLYESGEETYALEMNHFGDLMQHEYSKMMNGFKPSLAGGDSNFTNDEGVTFLKSEN 121
Query: 122 QV-PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
V P S++W +KG VTPVK QGQC A ++EG + K LVSLSEQ L+DC+
Sbjct: 122 VVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
NNGC GG MD AFKYI NKG+ + Y YE ++ A + D+P
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKC--RYNPDNSGATDNGFVDIP 239
Query: 234 PNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEE 288
DEE+L+ A+A PVS+AIDAS+ QFY GVF N C T L+HGV AVG+ T ++
Sbjct: 240 EGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRTDKK 299
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G YW++KNSWG+ WG++GY + R+ + CG+A AS+P+
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340
>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 191/343 (55%), Gaps = 34/343 (9%)
Query: 10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
L ++ C A+ + S+ E++ QWK+ Y + Y + E+ +R +++ N+ +ER N
Sbjct: 5 LFLTALCLGIASAAQKHDESLDEQWYQWKSLYKKPYAANEEDWRR-AVWEKNMKMIERHN 63
Query: 70 NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--SQVPPS 126
+ G +T+ +N F D+T +EF GF+ + K LY+ +P S
Sbjct: 64 QEYSQGKHGFTMTMNAFGDMTNEEFRQVMNGFQ------NQKRIQGKLLYEPVFGHIPKS 117
Query: 127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
V+W +KG VTPVK QGQC A A+EG K +LVSLSEQ LVDC+ + N G
Sbjct: 118 VDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEG 177
Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
C GG MD+AF+YI N G+ ++ Y Y M C + AA T + D+PP E++
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYTAMDKQDC-RYNPKYSAANDTGFVDIPPQ-EKA 235
Query: 240 LLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGI---- 290
L+KAVA P+SVA+DA + QFY G+ ++ C + LNHGV VGYG EGI
Sbjct: 236 LMKAVATVGPISVAVDAGHESFQFYKSGIYYDSNCSSKDLNHGVLVVGYGF--EGIDSAN 293
Query: 291 -KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+YWL+KNSWG WG DGY ++ +D + CGIA AS+P
Sbjct: 294 NRYWLVKNSWGTGWGTDGYIKMAKDRNN---HCGIATAASYPT 333
>gi|293334761|ref|NP_001168296.1| uncharacterized protein LOC100382061 [Zea mays]
gi|223947281|gb|ACN27724.1| unknown [Zea mays]
Length = 322
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 123/308 (39%), Positives = 171/308 (55%), Gaps = 31/308 (10%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ ++F W+A Y R+Y +AE +RFE+++ N+ +E N A SY L F DLT
Sbjct: 3 MMDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRA--ELSYQLSETPFTDLT 60
Query: 90 PQEFIASQT-------GFKMSDHSSSLKANGTPFLYKSSQ-----------VPPSVNWIE 131
+EF+A+ T H + + P Q VP SV+W
Sbjct: 61 SEEFLATHTMSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRT 120
Query: 132 KGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
KGAVT VK QG C VAA+EG++ I+ +LVSLSEQ+++DC++ NN GC+GG
Sbjct: 121 KGAVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNN-GCHGGN 179
Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV 244
A ++ N G+T ++ Y YEG G C KA +H A+I + V N+E +L AV
Sbjct: 180 PAAAIDWVSANGGLTTESDYPYEGRQ-GKCKLDKARNHVAKIRGRKLVDQNNEAALEVAV 238
Query: 245 ANQPVSVAIDASALQ-FYSGGVFNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
A QPV+V ++ +Q Y GVF+G C+ LNH VT VGYG G KYW++KNSWG+
Sbjct: 239 AQQPVAVGMNVHPIQQHYKSGVFHGPCDPEDLNHAVTMVGYGAESGGRKYWIVKNSWGEK 298
Query: 303 WGEDGYFR 310
WGE GYFR
Sbjct: 299 WGEKGYFR 306
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 136/339 (40%), Positives = 189/339 (55%), Gaps = 28/339 (8%)
Query: 10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
L+++ C A+ + S+ + +WKA + + Y + E +R I++ N+ +ER N
Sbjct: 5 LLLAAFCLGIASAAPRHDHSLDADWYKWKATHRKLYGLNEEGRRR-AIWEKNMKMIERHN 63
Query: 70 -NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SV 127
G S+T+ +N F D+T +EF + GF+ H G FL S + P SV
Sbjct: 64 WEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQNQKHKK-----GKVFLDAGSALTPHSV 118
Query: 128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
+W EKG VT VK QG C A A+EG K ++L+SLSEQ LVDC+ + N GC
Sbjct: 119 DWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGC 178
Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
GG MD+AF+YI N G+ ++ Y Y G G C K + AA T Y D+ P E++L
Sbjct: 179 NGGLMDNAFQYIKDNGGLDSEESYPYFG-KDGSC-KYKPQSSAANDTGYVDI-PKQEKAL 235
Query: 241 LKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGT--SEEGIKYW 293
+KAVA P+SV IDAS + QFYS G+ F C + L+HGV VGYG + KYW
Sbjct: 236 MKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYW 295
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
L+KNSWG WG DGY ++ +D + CGIA AS+PV
Sbjct: 296 LVKNSWGNTWGMDGYIKMTKDQNN---HCGIATMASYPV 331
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 191/344 (55%), Gaps = 23/344 (6%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
+++VL + S + +E I E++ +K Q+ + Y++ E + R +++ DN + +
Sbjct: 3 VVIVLGLVAFAISTVSSINLNE-VIEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLKI 61
Query: 66 ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMS----DHSSSLKANGTPFLYKS 120
R N G +Y L +N F DL E+ GFK S D + + T ++
Sbjct: 62 ARHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSEN 121
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P SV+W +KG VTPVK QGQC A ++EG + K LVSLSEQ L+DC+
Sbjct: 122 VVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
NNGC GG MD AFKYI NKG+ + Y YE E+ A + D+P
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKC--RYNPENSGATDKGFVDIP 239
Query: 234 PNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEE 288
DE++L+ A+A PVS+AIDAS+ QFY GVF N C T L+HGV AVG+G+ ++
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G YW++KNSWG+ WG++GY + R+ + CG+A AS+P+
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 192/342 (56%), Gaps = 27/342 (7%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+ L+V+L S + A T D ++ WK YG+ Y E E R I++ NL
Sbjct: 11 KWLLLVLLGCSSAMAQLHKDPTLDH-----HWDLWKKTYGKQYTEENEEVTRRFIWEKNL 65
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
V N ++G SY L +N AD+T +E + + ++ S + N T +
Sbjct: 66 KYVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLRVP---SQWQRNVTFKSNPNQ 122
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P S++W +KG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 123 KLPDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCSTG 182
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+N GC GGFM +AF+YII N GI ++A Y Y+ M G C ++ AA + Y ++P
Sbjct: 183 KYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMD-GKCQ-YDVKNRAATCSKYVELP 240
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
+EE+L +AVAN+ PVSVAIDAS F+ SG ++ C +NHGV AVGYG + G
Sbjct: 241 FGNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVGYG-NYNG 299
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG +GE GY R+ R+ CGIA + S+P
Sbjct: 300 KDYWLVKNSWGLHFGEQGYIRMARN---SGNHCGIASYPSYP 338
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 181/318 (56%), Gaps = 21/318 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ ++ +KA++G++Y E R +I+ +N + + N A G Y++ +N+F D+
Sbjct: 23 LGAEWSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDM 82
Query: 89 TPQEFIASQTGFKMS--DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
EF++++ GFK + D P + +P +V+W KGAVTPVK QGQC
Sbjct: 83 LHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGS 142
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
A ++EG + K +VSLSEQ LV C+T+ NNGC GG MDDAFKYI NKGI
Sbjct: 143 CWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGID 202
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
+ Y Y G + G C K A + + D+ E L KAVA P+SVAIDAS
Sbjct: 203 TEKSYPYNG-TDGTC-HFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHE 260
Query: 257 ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
+ QFYS GV++ C++ L+HGV VGYGT G YW +KNSWG WG++GY R+ R+
Sbjct: 261 SFQFYSDGVYDEPECDSESLDHGVLVVGYGTL-NGTDYWFVKNSWGTTWGDEGYIRMSRN 319
Query: 315 IDQPQGQCGIAMFASFPV 332
+ QCGIA AS P+
Sbjct: 320 ---KKNQCGIASSASIPL 334
>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 184/319 (57%), Gaps = 22/319 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
E + ++ WK +G+ Y+ E+ R E+++ NL+ + N A++G +Y L +N
Sbjct: 27 EPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLMLITMHNLEASMGLHTYELSMNHM 86
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC 144
DLT +E + S F + ++ +PF + + VP +++W EKG VT VK QG C
Sbjct: 87 GDLTQEEIMQS---FATLSPPTDIQRAASPFAGTTGADVPDTMDWREKGCVTSVKMQGSC 143
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A A+EG A +LV LS Q LVDC+T N+GC GGFM AF+Y+I N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHQAFQYVIDNQG 203
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
I +DA Y Y G G C ++ AA + Y +P +E +L +A+AN P+SVAIDA+
Sbjct: 204 IDSDASYPYTG-RNGEC-RYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAIDAT 261
Query: 257 --ALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
FY GV+N C +NHGV AVGYGT +G YWL+KNSWG+ +G+ GY R+ R
Sbjct: 262 RPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTL-DGQDYWLVKNSWGKTFGDQGYIRMSR 320
Query: 314 DIDQPQGQCGIAMFASFPV 332
+ + QCGIA++ +P+
Sbjct: 321 NKND---QCGIALYGCYPI 336
>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/330 (39%), Positives = 186/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NLV ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +++W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDIDMEEKDAIDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT D NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ ++ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + ++PV
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGTYNTYPV 324
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 125/313 (39%), Positives = 180/313 (57%), Gaps = 22/313 (7%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
++E +K ++G+ Y S E S R +F D L ++ N G +Y L++N F+DLT +
Sbjct: 19 EWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHE 78
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
E +A++TG H S+ P ++ + V+W KGAVTPVK QGQC
Sbjct: 79 EVLATKTGMTRRRHPLSVLPKSAP----TTPMAADVDWRNKGAVTPVKDQGQCGSCWAFS 134
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
AVAA+EG + +K LVSLSEQ LVDC+++ N GC GG+ A++YII N+GI ++ Y
Sbjct: 135 AVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSY 194
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQF--Y 261
Y+ + A + A +++Y + DE +L AV N+ PVSV IDA F Y
Sbjct: 195 PYKAIDDNC--RYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSY 252
Query: 262 SGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
GGV + C++ + NH VTAVGYGT G YW++KNSWG WGE GY ++ R+ D
Sbjct: 253 GGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDN-- 310
Query: 320 GQCGIAMFASFPV 332
C IA ++ +PV
Sbjct: 311 -NCAIATYSVYPV 322
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 178/305 (58%), Gaps = 26/305 (8%)
Query: 48 SAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-H 105
S E+++ FE+F+ NL + + N G +SY + LN FA LT +EF A G+ ++
Sbjct: 45 SPESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYGGAEVE 104
Query: 106 SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKIN 158
+ G S++P SV+W EKGAV VK QG C AVAA+EG + +
Sbjct: 105 QPKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSG 164
Query: 159 RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV--YSYEGMSTGICDS 216
L+SLSEQQLVDC+ N+GC GG+MD+AF+Y + N G +D+ Y Y+GM G C
Sbjct: 165 ELISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMD-GKC-K 222
Query: 217 IKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA-SALQFYSGGVFNGY---CE 271
A+ A I+ Y DV +E LL AVAN PVSVAI A +ALQFY GVFNG C
Sbjct: 223 FSADGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHAGAALQFYLRGVFNGVAGTCF 282
Query: 272 TFLNHGVTAVGYGTSE----EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
LNHGVTAVGYGT+ + YW+IKNSWG WGE G+ R R + CG+A
Sbjct: 283 GPLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARG----KNLCGVANG 338
Query: 328 ASFPV 332
AS+P+
Sbjct: 339 ASYPL 343
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 138/347 (39%), Positives = 195/347 (56%), Gaps = 33/347 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
+ K+ ++V+L S + A T D ++ WK YG+ YKE E R I++
Sbjct: 11 IMKWLVLVLLGCSSAMAQLHKDPTLDR-----HWDLWKKTYGKQYKEKNEEGVRRLIWEK 65
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL V N ++G SY L +N D+T +E A + ++ S + N T YK
Sbjct: 66 NLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLRVP---SQWQRNVT---YK 119
Query: 120 SS---QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
S+ ++P SV+W +KG VT VKYQG C AV A+E +K +LVSLS Q LV
Sbjct: 120 SNPNQKLPDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLV 179
Query: 170 DCATND-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
DC+ +N GC GGFM +AF+YII N GI ++A Y Y+ M G C ++ AA +
Sbjct: 180 DCSVGKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMD-GKC-QYDSKYRAATCSR 237
Query: 229 YEDVPPNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYG 284
Y ++P + E++L +AVAN+ PVSVAIDAS F+ SG ++ C +NHGV VGYG
Sbjct: 238 YTELPEDSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYG 297
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+ G YWL+KNSWG +G+ GY R+ R+ CGIA +AS+P
Sbjct: 298 -NLNGKDYWLVKNSWGLHFGDQGYIRMARN---SGNHCGIASYASYP 340
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 133/323 (41%), Positives = 182/323 (56%), Gaps = 27/323 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K ++ + Y++ E R +IF +N + + N A G S+ L +NK+ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84
Query: 89 TPQEFIASQTGFKMSDH----SSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
EF GF + H S+ G F+ + +P SV+W KGAVT VK QG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITN--YEDVPPNDEESLLKAVAN-QPVSVAI 253
GI + Y YE I DS A T+ + D+P DE+ + +AVA PV+VAI
Sbjct: 205 GIDTEKSYPYE----AIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAI 260
Query: 254 DAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
DAS + QFYS GV+N C+ L+HGV VGYGT E G YWL+KNSWG WG+ G+
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGFI 320
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
++ R+ D QCGIA +S+P+
Sbjct: 321 KMLRNKDN---QCGIASASSYPL 340
>gi|428170119|gb|EKX39047.1| hypothetical protein GUITHDRAFT_154556 [Guillardia theta CCMP2712]
Length = 352
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/351 (37%), Positives = 201/351 (57%), Gaps = 27/351 (7%)
Query: 6 LIVVLIISGSCA---SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
++++ I+ +CA S A+ + + I F WK ++ + Y + AE+ RF +FK N+
Sbjct: 4 ILLLAAIAATCAIPTSPASKTSSVDDEIHLAFISWKNKFEKVY-DGAEHLARFAVFKANM 62
Query: 63 VAVERFNNAA--IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA---NGTPFL 117
+ R +NA +G ++++ N+FAD+T +EF + G+K L +G
Sbjct: 63 EII-RAHNALYELGEETFSMAANQFADMTAEEFKRTVLGYKPELKGKRLLQGLNSGKNCT 121
Query: 118 YKS--SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
++S S P +++W K AVTPVK QGQC AVEG + + L+SLSE++L
Sbjct: 122 HRSNNSTRPKAIDWRTKSAVTPVKNQGQCGSCWSFSTTGAVEGAWVVAGHPLISLSEEEL 181
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY--EGMSTGICDSIKAEDHAAQI 226
V C T ++ GC GG MD+A+ +IIQN GI + VY Y +TG+C A I
Sbjct: 182 VQCDTK-SDQGCNGGLMDNAYAWIIQNGGIAAEDVYPYISGNGTTGVCHVAFLSKKVASI 240
Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGY-CETFLNHGVTAVGY 283
+++ D+ P DE L A+ QPV+VAI+A S+ QFY+GGV C T L+HGV AVGY
Sbjct: 241 SDWCDLKPEDESDLELALVQQPVAVAIEADQSSFQFYNGGVLPAKKCGTKLDHGVLAVGY 300
Query: 284 G-TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
G + + YW++KNSWG +WG++GY RL++ + + CGIA AS+P
Sbjct: 301 GYDKKHKMHYWIVKNSWGAEWGDEGYIRLEKMPKKTKHSACGIAKAASYPT 351
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 134/301 (44%), Positives = 173/301 (57%), Gaps = 21/301 (6%)
Query: 47 ESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH 105
E E S+R EIF++N + NN A +G +Y L N+FA +T EF+A+ G + D
Sbjct: 12 EGKEESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGGCLLDR 71
Query: 106 SSSLKANGTPFLYKSS--QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIK 156
++S Y S+ ++P +V+W KG VTPVK Q QC ++EG K
Sbjct: 72 NASKSTADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKK 131
Query: 157 INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS 216
+LVSLSEQ LVDC+ N GC GG MDDAFKYI N GI + Y YE G C
Sbjct: 132 TGKLVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARD-GKC-R 189
Query: 217 IKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYC-E 271
K D A +T Y D+ DE +L +AVA P+SVAIDAS Q YS GV + C
Sbjct: 190 FKPADVGATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSS 249
Query: 272 TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
T L+HGV AVGYGT E G YWL+KNSWG+ WG++GY + R+ + QCGIA AS+P
Sbjct: 250 TELDHGVLAVGYGT-EGGKDYWLVKNSWGEVWGQNGYIMMSRNKNN---QCGIATSASYP 305
Query: 332 V 332
+
Sbjct: 306 L 306
>gi|129353|sp|P22895.1|P34_SOYBN RecName: Full=P34 probable thiol protease; Flags: Precursor
Length = 379
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/332 (38%), Positives = 177/332 (53%), Gaps = 24/332 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
++ F+ WK+++GR Y E +KR EIFK+N + N S+ L LNKFAD+T
Sbjct: 40 VSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKSPHSHRLGLNKFADIT 99
Query: 90 PQEFIAS--QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
PQEF Q +S Y P S +W +KG +T VKYQG C
Sbjct: 100 PQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGRG 159
Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
A A+E +AI LVSLSEQ+LVDC + + G Y G+ +F++++++ GI
Sbjct: 160 WAFSATGAIEAAHAIATGDLVSLSEQELVDCV--EESEGSYNGWQYQSFEWVLEHGGIAT 217
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE-------ESLLKAVANQPVSVAI 253
D Y Y G C + K +D I YE + +DE ++ L A+ QP+SV+I
Sbjct: 218 DDDYPYRA-KEGRCKANKIQDKVT-IDGYETLIMSDESTESETEQAFLSAILEQPISVSI 275
Query: 254 DASALQFYSGGVFNGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
DA Y+GG+++G T +NH V VGYG S +G+ YW+ KNSWG DWGEDGY
Sbjct: 276 DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYG-SADGVDYWIAKNSWGFDWGEDGYIW 334
Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
+QR+ G CG+ FAS+P +ES SA
Sbjct: 335 IQRNTGNLLGVCGMNYFASYPTKEESETLVSA 366
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/345 (38%), Positives = 192/345 (55%), Gaps = 28/345 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F +VVL + C + A + + E + WK + + Y E E +R +++ NL
Sbjct: 2 FPVVVLAL---CVTAALSAPSLDPQLDEHWNLWKDWHSKKYHEKEEGWRRM-VWEKNLKK 57
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+E N ++G +Y+L +N F D+T +EF G+K+ S K G+ F+ + +
Sbjct: 58 IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLK---SQRKLRGSLFMEPNFLE 114
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
P SV+W +KG VTPVK QGQC A+EG + K LVSLSEQ LVDC+ +
Sbjct: 115 APRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRPE 174
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF+YI N G+ ++ Y Y G G C + + +A T + DVP
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYN-SANDTGFVDVPSG 233
Query: 236 DEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGY---GTSE 287
E +L+KAVA+ PVSVAIDA + QFY G+ ++ C + L+HGV VGY G
Sbjct: 234 SERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKDV 293
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYW++KNSW ++WG+ GY + +D + CGIA AS+P+
Sbjct: 294 DGKKYWIVKNSWSENWGDKGYIYMAKD---KKNHCGIATAASYPL 335
>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
Length = 333
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/306 (42%), Positives = 179/306 (58%), Gaps = 24/306 (7%)
Query: 41 YGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTG 99
Y +TY+ + E R+ ++KDN +A+ R N+ A G +Y L +N++ DLT +E+ +TG
Sbjct: 37 YNKTYR-AHEEPVRYSVWKDNFLAINRHNSKADQGFHTYWLAMNEYGDLTNEEYFRLRTG 95
Query: 100 FKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEG 151
K++ ++++ G F Y + S+ P V+W KG VTPVK QG C A AVEG
Sbjct: 96 LKIN---ANIERRGLVFKYTNLSEYPSEVDWRSKGYVTPVKNQGGCGSCYAFSATGAVEG 152
Query: 152 INAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST 211
+ K +LVSLSEQ +VDC+ + N GC GG MD +F YI N GI + Y YE
Sbjct: 153 QHFRKTGKLVSLSEQNIVDCSFKEGNKGCRGGLMDKSFTYIKDNNGIDTEEAYPYEARD- 211
Query: 212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVF-N 267
G C ++E A + Y D+P NDE +L AV P+SVAID +FY GVF N
Sbjct: 212 GPCRFRRSEV-GATVRGYVDLPENDEIALQHAVTTIGPISVAIDGHHFNFRFYHHGVFDN 270
Query: 268 GYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAM 326
C +T +NHGV VGYGT +G+ YWL+KNSWG+ WG +GY + R+ D QC I
Sbjct: 271 PNCSKTKINHGVLVVGYGT-RDGLDYWLVKNSWGERWGAEGYILMSRNNDN---QCCITC 326
Query: 327 FASFPV 332
AS+P+
Sbjct: 327 AASYPI 332
>gi|242072384|ref|XP_002446128.1| hypothetical protein SORBIDRAFT_06g002110 [Sorghum bicolor]
gi|241937311|gb|EES10456.1| hypothetical protein SORBIDRAFT_06g002110 [Sorghum bicolor]
Length = 186
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 102/187 (54%), Positives = 133/187 (71%), Gaps = 3/187 (1%)
Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
+EG I +LVSLSEQ+LVDC N + GC GG MDDAF++++ N G+T ++ Y Y G
Sbjct: 1 MEGAVKISTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFEFVVDNGGLTTESKYPYTG 60
Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVF 266
S G C+S +A++ AA IT YEDVP NDE SL KAVANQPVSVA+D + +FY GGV
Sbjct: 61 -SDGNCNSDEAKNDAASITGYEDVPANDETSLRKAVANQPVSVAVDGGDNLFRFYKGGVL 119
Query: 267 NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAM 326
+G C T L+HG+ AVGYG + +G K+WL+KNSWG WGE GY R++RDI +G CG+AM
Sbjct: 120 SGACGTELDHGIAAVGYGVAGDGTKFWLMKNSWGTSWGEAGYIRMERDIADDEGLCGLAM 179
Query: 327 FASFPVS 333
S+P +
Sbjct: 180 QPSYPTA 186
>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/330 (39%), Positives = 186/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NLV ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT D NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ ++ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324
>gi|115435294|ref|NP_001042405.1| Os01g0217300 [Oryza sativa Japonica Group]
gi|7523481|dbj|BAA94209.1| putative cysteine proteinase Mir3 [Oryza sativa Japonica Group]
gi|10800061|dbj|BAB16481.1| putative cysteine proteinase Mir3 [Oryza sativa Japonica Group]
gi|113531936|dbj|BAF04319.1| Os01g0217300 [Oryza sativa Japonica Group]
gi|125524918|gb|EAY73032.1| hypothetical protein OsI_00905 [Oryza sativa Indica Group]
Length = 366
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/325 (39%), Positives = 172/325 (52%), Gaps = 32/325 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
F +W AQYG+ Y E+ KR++I+KDN + F + A ++ T + +
Sbjct: 49 FSRWMAQYGKAYSWPIEHEKRYQIWKDNSNFIGSFRSETEISSGVGAFAPQTVTDSFVGM 108
Query: 83 NKFADLTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
N+F DLTP EF TGF + H++ S +P V+W GAVT VK
Sbjct: 109 NRFGDLTPGEFAEQFTGFNATGGLLHAAPPPCPIP----PDSWLPCCVDWRSSGAVTGVK 164
Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
+Q CA AA+EG+N I+ LVSLSEQ +VDC T ++GC GG D A +
Sbjct: 165 FQRSCASCWAFAAAAAIEGLNKIRTGELVSLSEQVMVDCDTG--SSGCSGGRADTALGLV 222
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
G+ ++ Y Y G+ G CD K H+A ++ + VPPNDE L AVA QPV+
Sbjct: 223 AARGGVASEEEYPYTGVRGG-CDVGKLLSGHSASLSGFRAVPPNDERQLALAVARQPVTA 281
Query: 252 AIDASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
IDA A FY GGV+ G C +NH V VGY G KYW+ KNSWG DWGE GY
Sbjct: 282 YIDAGAREFMFYKGGVYRGPCSAERVNHAVAIVGYCEGFGGDKYWIAKNSWGSDWGEQGY 341
Query: 309 FRLQRDIDQPQGQCGIAMFASFPVS 333
L +D+ PQG CG+A +P +
Sbjct: 342 VYLAKDVWWPQGTCGLATSPFYPTA 366
>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/330 (39%), Positives = 186/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NLV ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT D NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ ++ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324
>gi|125569526|gb|EAZ11041.1| hypothetical protein OsJ_00885 [Oryza sativa Japonica Group]
Length = 366
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/325 (39%), Positives = 172/325 (52%), Gaps = 32/325 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
F +W AQYG+ Y E+ KR++I+KDN + F + A ++ T + +
Sbjct: 49 FSRWMAQYGKAYSWPIEHEKRYQIWKDNSNFIGSFRSETEISSGVCAFAPQTVTDSFVGM 108
Query: 83 NKFADLTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
N+F DLTP EF TGF + H++ S +P V+W GAVT VK
Sbjct: 109 NRFGDLTPGEFAEQFTGFNATGGLLHAAPPPCPIP----PDSWLPCCVDWRSSGAVTGVK 164
Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
+Q CA AA+EG+N I+ LVSLSEQ +VDC T ++GC GG D A +
Sbjct: 165 FQRSCASCWAFAAAAAIEGLNKIRTGELVSLSEQVMVDCDTG--SSGCSGGRADTALGLV 222
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
G+ ++ Y Y G+ G CD K H+A ++ + VPPNDE L AVA QPV+
Sbjct: 223 AARGGVASEEEYPYTGVRGG-CDVGKLLSGHSASLSGFRAVPPNDERQLALAVARQPVTA 281
Query: 252 AIDASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
IDA A FY GGV+ G C +NH V VGY G KYW+ KNSWG DWGE GY
Sbjct: 282 YIDAGAREFMFYKGGVYRGPCSAERVNHAVAIVGYCEGFGGDKYWIAKNSWGSDWGEQGY 341
Query: 309 FRLQRDIDQPQGQCGIAMFASFPVS 333
L +D+ PQG CG+A +P +
Sbjct: 342 VYLAKDVWWPQGTCGLATSPFYPTA 366
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/309 (41%), Positives = 173/309 (55%), Gaps = 22/309 (7%)
Query: 37 WKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIA 95
+KA +G+ Y+ E R ++F DN ++ N +G SY +++N DL EF A
Sbjct: 16 FKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMVHEFKA 75
Query: 96 SQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAA 148
GFK + ++ + NG ++ + +P SV+W ++GAVTPVK QG C A +
Sbjct: 76 LMNGFKKTPNA---ERNGKIYVPSNENLPKSVDWRQRGAVTPVKDQGHCGSCWSFSATGS 132
Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
+EG +K RLVSLSEQ LVDC+ N+GC GG M+ AF+Y+ NKGI +A Y YE
Sbjct: 133 LEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTEASYPYEA 192
Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV 265
K + Y D+ E+ L AVA P+SV IDAS + QFYS GV
Sbjct: 193 RENNC--RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQFYSEGV 250
Query: 266 FN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
+ YC + L+HGV VGYGT E G YWL+KNSWG WGE GY ++ R+ + CG
Sbjct: 251 YKEQYCSPSQLDHGVLTVGYGT-ENGQDYWLVKNSWGPSWGESGYIKIARN---HKNHCG 306
Query: 324 IAMFASFPV 332
IA AS+PV
Sbjct: 307 IASMASYPV 315
>gi|118136313|gb|ABK62794.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
gi|118136315|gb|ABK62795.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
Length = 335
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/342 (38%), Positives = 182/342 (53%), Gaps = 25/342 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F + +L+ + S TY F++ S QWK +Y + Y S + + + NL
Sbjct: 4 FSVFLLLCVATALSVPTYPLFNQWS------QWKVKYQKDYLSSEDELNKLLTWSKNLET 57
Query: 65 VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
V + N A G +SYTL +N ADL+ +EF A K K +
Sbjct: 58 VRKHNELYAQGKKSYTLAMNHMADLSSEEFKALYLVPKFDATKVPRKGKAAGEHRQIKND 117
Query: 124 PPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
PPS ++W+ KG VT VK Q QC + ++EG +L+S SEQQLVDC+T
Sbjct: 118 PPSEIDWVRKGHVTAVKNQAQCGSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAF 177
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N+GC GG MD++F Y+I NKG+ ++A Y YE C KA I+++ DV
Sbjct: 178 GNHGCNGGIMDNSFNYLIHNKGLESEASYPYEAQKKE-CRYKKALSKGT-ISSFTDVSQF 235
Query: 236 DEESLLKAVA-NQPVSVAIDASALQF--YSGGVFN--GYCETFLNHGVTAVGYGTSEEGI 290
DE+ L +AV PVS+AIDAS F Y GV++ +T LNHGV AVGYGT+ EG+
Sbjct: 236 DEKDLKRAVGLVGPVSIAIDASQFSFHLYDSGVYDEEDCSQTMLNHGVLAVGYGTTPEGL 295
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW +KNSW WG +GY + R+ D QCG+A AS+P+
Sbjct: 296 DYWKVKNSWTNTWGMEGYILMSRNKDN---QCGVATVASYPI 334
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII NKGI +DA Y Y+ M ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDLKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L +AVAN+ PVSV +DA F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG ++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII NKGI +DA Y Y+ M ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L +AVAN+ PVSV +DA F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG ++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/286 (42%), Positives = 166/286 (58%), Gaps = 25/286 (8%)
Query: 55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
F NL +E N GN S+T+ + +FADLT EF A F M+ +
Sbjct: 48 FRCHLANLRVIEAHN---AGNSSFTMGITQFADLTAAEFSAYVKRFPMNVTRPRNE---- 100
Query: 115 PFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQ 167
++ + V+W +K AVT +K QGQC +VEG +AI +LVSLSEQQ
Sbjct: 101 --VWITEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQ 158
Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
L+DC+T N+GC GG MD AF+Y+I N G+ + Y Y G C++ K + HAA+I
Sbjct: 159 LMDCSTRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTA-EDGKCNTEKEKKHAAEIH 217
Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGT 285
+ +VP E+ L AV+ PVSVAI+A + Q Y+ GVF+G C T L+HGV VGY
Sbjct: 218 GFRNVPKEHEDQLAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD 277
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YW++KNSWG+ WGE+GY RL+R +D+ +G CGI M AS+P
Sbjct: 278 D-----YWIVKNSWGKSWGEEGYIRLKRGVDK-KGMCGITMQASYP 317
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII NKGI +DA Y Y+ M ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L +AVAN+ PVSV +DA F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG ++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 178/311 (57%), Gaps = 22/311 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F W ++ R Y E + R++ FK+N+ + ++N+ L L KFADLT +E+
Sbjct: 33 FIGWMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQ---ESDTVLGLTKFADLTNEEY 88
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
G K+ + +L A + P S++W EKGAV+ VK QGQC
Sbjct: 89 KKHYLGIKV-NVKKNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTT 147
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
AVEG + IK +VSLSEQ LVDC+ N GC GG M +AF+YII N GI ++ Y Y
Sbjct: 148 GAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPY 207
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGG 264
+ G C K+ + A I Y+++P +E+SL A+A QPVSVAIDAS + Q YS G
Sbjct: 208 TA-AQGRCKFTKSMN-GANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSG 265
Query: 265 VFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
V++ L+HGV AVGYGT EG Y++IKNSWG WG+DGY + R+ Q QC
Sbjct: 266 VYDEPACSSEALDHGVLAVGYGTL-EGKDYYIIKNSWGPTWGQDGYIFMSRN---AQNQC 321
Query: 323 GIAMFASFPVS 333
G+A AS+P+S
Sbjct: 322 GVATMASYPIS 332
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 137/341 (40%), Positives = 188/341 (55%), Gaps = 26/341 (7%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
+ VL+++ S + D E + QWK ++G+ Y E + R I++ NL V
Sbjct: 4 LSVLLVAACVVSSLSMSFID---FDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIVI 60
Query: 67 RFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--QV 123
+ N +G+ +Y L +N+FADL +EF++ GF+ +SS G+ FL S+ +
Sbjct: 61 KHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFR---GNSSKATRGSTFLPPSNVFDM 117
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P V+W KG VTPVK Q QC A ++EG + K +LVSLSEQ LVDC+ +
Sbjct: 118 PTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKEG 177
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG MD AF+YI+ GI + Y Y M G C KA + A T Y DV
Sbjct: 178 NMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMD-GQCHFNKA-NIGATDTGYTDVTTGS 235
Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIK 291
E +L AVA+ P+SVAIDAS + Q Y GV+N T L+HGV AVGYGTS +G
Sbjct: 236 ESALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSDGTD 295
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
Y+ +SWG WG +GY + R+ D QCGIA AS+P+
Sbjct: 296 YFFFFHSWGAAWGMNGYLWMSRNKDN---QCGIATKASYPL 333
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/339 (39%), Positives = 193/339 (56%), Gaps = 26/339 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ ++ WK YG+ YKE E R I++ NL V
Sbjct: 4 LVWVLLLCSSAMAQ----LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
N ++G SY L +N D+T +E I+ + ++ S N T + ++P
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVP---SQWPRNVTYKSNPNQKLP 116
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-N 176
S++W EKG VT VKYQG C AV A+E +K RLVSLS Q LVDC+T
Sbjct: 117 DSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYR 176
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GGFM +AF+YII N GI ++A Y Y+ + G C +++ AA + Y ++P D
Sbjct: 177 NKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVD-GKC-KYDSKNRAATCSRYTELPFAD 234
Query: 237 EESLLKAVANQ-PVSVAIDA--SALQFYSGGV-FNGYCETFLNHGVTAVGYGTSEEGIKY 292
E +L +AVAN+ PVSVAIDA S+ FY GV ++ C +NHGV VGYG + G Y
Sbjct: 235 EYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYG-NLNGKDY 293
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
WL+KNSWG ++G+ GY R+ R+ + CGIA + S+P
Sbjct: 294 WLVKNSWGLNFGDGGYIRMARN---SENHCGIANYPSYP 329
>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
Length = 229
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 109/204 (53%), Positives = 136/204 (66%), Gaps = 8/204 (3%)
Query: 141 QGQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
+G C A+AAVEG+N I +LVSLSEQ+LVDC DN GC GG MD AF+YI +N
Sbjct: 12 EGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQ-GCDGGLMDYAFQYIQRNG 70
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
G+T ++ Y Y C+ K H I YEDVP N+E++L KAVA+QPV+VAI+AS
Sbjct: 71 GVTTESNYPYLAEQRS-CNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEAS 129
Query: 257 A--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
QFYS GVF G C T L+HGV AVGYGT+ +G KYW +KNSWG+DWGE GY R+QR
Sbjct: 130 GQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRG 189
Query: 315 IDQPQGQCGIAMFASFPVSKESAQ 338
+ +G CGIAM S+P K +
Sbjct: 190 VPDSRGLCGIAMEPSYPTKKPAGH 213
>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
Length = 331
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNW 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII NKGI +DA Y Y+ M ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L +AVAN+ PVSV +DA F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG ++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|410990008|ref|XP_004001242.1| PREDICTED: cathepsin L1 isoform 1 [Felis catus]
Length = 333
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/339 (37%), Positives = 186/339 (54%), Gaps = 25/339 (7%)
Query: 9 VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
+L ++G C A+ S+ ++ QWKA +G+ Y + E +R +++ N+ +E+
Sbjct: 4 LLFLAGLCLGVASAAPQLYQSLDARWSQWKATHGKLYGMNDEVWRR-AVWERNMKMIEQH 62
Query: 69 NNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
N + G ++T+ +N F D+T +EF G K+ K PF ++P SV
Sbjct: 63 NREHSQGKHTFTMAMNAFGDMTNEEFRQVMNGLKIQKRKK-WKVFQAPFFV---EIPSSV 118
Query: 128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
+W EKG VTPVK QG C A A+EG K +LVSLSEQ LVDC+ + N G
Sbjct: 119 DWREKGYVTPVKDQGYCLCCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQTEGNEGY 178
Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
GG +DDAF+Y+ N G+ ++ Y Y + G + E+ A +T+Y D+P + E +
Sbjct: 179 SGGLIDDAFQYVKDNGGLDSEESYPYH--AQGDSCKYRPENSVANVTDYWDIPSKENELM 236
Query: 241 LKAVANQPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYW 293
+ A P+S AIDAS +FY G+ ++ C + ++HGV VGY GT E KYW
Sbjct: 237 ITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTETENKKYW 296
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+IKNSWG DWG DGY ++ +D D CGIA ASFP
Sbjct: 297 IIKNSWGTDWGMDGYIKMAKDRDN---HCGIASLASFPT 332
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 134/340 (39%), Positives = 191/340 (56%), Gaps = 24/340 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
+ VV+ + +CAS Y D + +E WK YG+ Y+E + R I++ NL V
Sbjct: 16 MKVVIWMFLACASTTAYLRHDP-MLDNHWELWKKTYGKQYEEQNQEVTRRLIWEKNLKFV 74
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
N ++G SY L +N +D+T +E + + ++ + S N T L + ++P
Sbjct: 75 TLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIPNQWSR---NTTYRLNSNQKLP 131
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN- 176
SV+W +KG VT VKYQG C AV A+E +K +LVSLS Q LVDC+TN+
Sbjct: 132 DSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTNEKY 191
Query: 177 -NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N+GC GG M +AF+YII N GI +DA Y Y+ G C A + AA + Y ++P
Sbjct: 192 ENHGCNGGCMTEAFQYIIDNNGIDSDASYPYKA-KDGKCQYNPA-NRAATCSRYTELPYG 249
Query: 236 DEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
E++L +AVAN+ PVSV IDAS F+ SG ++ C +NHGV GYG + +G
Sbjct: 250 SEDALKEAVANKGPVSVGIDASLPSFFLYKSGVYYDPSCTQNVNHGVLVTGYG-NLDGKD 308
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG +G+ GY R+ R+ CGIA F S+P
Sbjct: 309 YWLVKNSWGLSFGDKGYIRIARNRGN---HCGIANFPSYP 345
>gi|414591039|tpg|DAA41610.1| TPA: hypothetical protein ZEAMMB73_356414 [Zea mays]
Length = 376
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/330 (38%), Positives = 177/330 (53%), Gaps = 30/330 (9%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
E S+ +E+W++ + + ++ E RFE FK N + FN + Y L LNKFA
Sbjct: 38 EESMWSLYERWRSVHTVS-RDLREKQSRFEAFKANARHIGEFNKRK--DVPYKLGLNKFA 94
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV-----------NWIEKGAV 135
DLT +EF++ TG K+ D ++ + + S + PP + +W + GAV
Sbjct: 95 DLTQEEFVSKYTGAKVVDSEAAARLASGVRVSSSDESPPQLAASVGDAPDAWDWRDHGAV 154
Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
T VK QGQC AV AVE +NAI L++LSEQQ++DC+ + YGG+ A
Sbjct: 155 TAVKDQGQCGSCWAFSAVGAVESVNAIVTGNLLTLSEQQMLDCSGAGDCT--YGGYTYYA 212
Query: 189 FKYIIQNKGITNDAV------YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
Y I N G+T D Y+ C + +I + + DE +L +
Sbjct: 213 MLYAISN-GLTLDQCGKTPYYQRYDAQQHLPCRFDAKKPPVVKIDSMYVMNNADEAALKR 271
Query: 243 AVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
AV QPVSV IDA + +YS GVF G C T LNH V VGYG + +G KYW++KNSWG D
Sbjct: 272 AVYKQPVSVLIDAGGIGYYSEGVFTGPCGTSLNHAVLLVGYGATADGTKYWIVKNSWGAD 331
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL+RD+ G CGI M+ +P+
Sbjct: 332 WGEKGYFRLKRDVGTQGGLCGITMYPIYPI 361
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 194/348 (55%), Gaps = 31/348 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+ + + + ++GS A FD + E++ +K + + Y+ E R +IF +N
Sbjct: 2 KFLIFLAICVAGSQA----VSFFD--LVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 63 VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA----NGTPFL 117
V + N A G S+ L +NK+AD+ EF+ GF + S L++ + FL
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRT--KSGLRSGESDDSVTFL 113
Query: 118 YKSS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
++ Q+P ++W +KGAVTPVK QGQC A ++EG + K +LVSLSEQ LV
Sbjct: 114 PPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLV 173
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC+ NNGC GG MD+AF+YI N GI + Y Y+ K ++ A Y
Sbjct: 174 DCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC--HYKPKNKGATDRGY 231
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCE-TFLNHGVTAVGYG 284
D+ +E+ L AVA PVSVAIDAS + Q YSGGV + C + L+HGV VGYG
Sbjct: 232 VDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYG 291
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
T ++G YWL+KNSWG+ WG+ GY ++ R+ D CGIA AS+P+
Sbjct: 292 TEDDGTDYWLVKNSWGKSWGDQGYIKMARNRDN---NCGIATEASYPL 336
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 133/348 (38%), Positives = 188/348 (54%), Gaps = 30/348 (8%)
Query: 2 AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
A Y ++VL +S CA+ FD + + + WK + + Y S E +R +++ N
Sbjct: 3 ALYLAVLVLCVSAVCAAP----RFD-SQLEDHWHLWKNWHSKNYHASEEGWRRM-VWEKN 56
Query: 62 LVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
L +E N +G S+ L +N F D+T +EF + G+K + + K G+ F+ +
Sbjct: 57 LKKIEIHNLEHTMGKHSHRLGMNHFGDMTNEEFRQTMNGYKQT---TERKFKGSLFMEPN 113
Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
Q P +V+W EKG VTPVK QG C A+EG K +LVSLSEQ LVDC+
Sbjct: 114 YLQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCS 173
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ N GC GG MD AF+YI N G+ + Y Y G C K E AA T + D+
Sbjct: 174 RPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPC-HYKPEFSAANETGFVDI 232
Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSE 287
P E +++KAVA PVSVAIDA + QFY G+ + C + L+HGV VGYG
Sbjct: 233 PSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG 292
Query: 288 E---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G KYW++KNSW + WG+ GY + +D + CGIA +S+P+
Sbjct: 293 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYPL 337
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 131/342 (38%), Positives = 190/342 (55%), Gaps = 26/342 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL+ +++ S A T +A+++ +KA + + Y E R +I+ +N
Sbjct: 8 FLLAAVLVQLSAALSLT------NLLADEWHLFKATHKKEYPSQLEEKLRMKIYLENKHK 61
Query: 65 VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-Q 122
V + N G +SY + +NKF DL EF + G++ +SS + F+ ++ +
Sbjct: 62 VAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVE 121
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP SV+W EKGA+TPVK QGQC + A+EG K +LVSLSEQ L+DC+
Sbjct: 122 VPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKY 181
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF+YI NKGI + Y YE G+C + A + D+P
Sbjct: 182 GNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEA-EDGVC-RYNPRNRGAVDRGFVDIPSG 239
Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGI 290
+E+ L AVA PVSVAIDAS + QFYS G + C++ L+HGV VGYG S+ G
Sbjct: 240 EEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYG-SDNGE 298
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSW + WG++GY ++ R+ + CG+A AS+P+
Sbjct: 299 DYWLVKNSWSEHWGDEGYIKIARN---RKNHCGVATAASYPL 337
>gi|297663703|ref|XP_002810310.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin S [Pongo abelii]
Length = 330
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 138/342 (40%), Positives = 188/342 (54%), Gaps = 33/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MIHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII NKGI +DA Y Y+ M DS + AA + Y D
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMVKCQYDS---KYRAATCSKYTDFX 230
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L +AVAN+ PVSV +DA F+ SG + C +NHGV VGYG G
Sbjct: 231 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 289
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG+++GE+GY R+ R+ CGIA F SFP
Sbjct: 290 KEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSFP 328
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 194/348 (55%), Gaps = 31/348 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+ + + + ++GS A FD + E++ +K + + Y+ E R +IF +N
Sbjct: 2 KFLIFLAICVAGSQA----VSFFD--LVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 63 VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA----NGTPFL 117
V + N A G S+ L +NK+AD+ EF+ GF + S L++ + FL
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRT--KSGLRSGESDDSVTFL 113
Query: 118 YKSS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
++ Q+P ++W +KGAVTPVK QGQC A ++EG + K +LVSLSEQ LV
Sbjct: 114 PPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLV 173
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC+ NNGC GG MD+AF+YI N GI + Y Y+ K ++ A Y
Sbjct: 174 DCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC--HYKPKNKGATDRGY 231
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCE-TFLNHGVTAVGYG 284
D+ +E+ L AVA PVSVAIDAS + Q YSGGV + C + L+HGV VGYG
Sbjct: 232 VDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYG 291
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
T ++G YWL+KNSWG+ WG+ GY ++ R+ D CGIA AS+P+
Sbjct: 292 TEDDGTDYWLVKNSWGKSWGDQGYIKMARNRDN---NCGIATEASYPL 336
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 133/326 (40%), Positives = 183/326 (56%), Gaps = 33/326 (10%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K ++ + Y++ E R +IF +N + + N A G S+ L +NK+ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN-----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQG 142
EF GF + H L+A G F+ + +P SV+W KGAVT VK QG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQ-LRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQG 143
Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
C + A+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 196 KGITNDAVYSYEGMSTGICD----SIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVS 250
GI + Y YE + C +I A D + D+P DE+ + +AVA PVS
Sbjct: 204 GGIDTEKSYPYEAIDDS-CHFNKGTIGATDRG-----FTDIPQGDEKKMAEAVATVGPVS 257
Query: 251 VAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
VAIDAS + QFYS GV+N C+ L+HGV VG+GT E G YWL+KNSWG WG+
Sbjct: 258 VAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDK 317
Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPV 332
G+ ++ R+ D QCGIA +S+P+
Sbjct: 318 GFIKMLRNKDN---QCGIASASSYPL 340
>gi|344271939|ref|XP_003407794.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 335
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 130/350 (37%), Positives = 188/350 (53%), Gaps = 34/350 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M + L + + A+ +T D ++ QWK+ Y + Y + E R +++
Sbjct: 1 MTPSVFLAALCLGIASAAPKLDQTLDV-----QWNQWKSTYKKVYAANEEGLTR-AVWEK 54
Query: 61 NLVAVERFNN-AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ +ER N + G +T+ +N F D T +EF GF+ H G F +
Sbjct: 55 NMKMIERHNQEHSQGKHGFTMAMNAFGDKTNEEFRQLMNGFQSQKHKK-----GKLFHFH 109
Query: 120 S---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+P SVNW ++G VTPVK QG C A A+EG K +LVSLSEQ LV
Sbjct: 110 EPVFGHIPTSVNWTQRGYVTPVKDQGSCHSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 169
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC+ ++NNGC GG MD AF+Y+ N G+ ++ Y Y + C K E AA T +
Sbjct: 170 DCSRPESNNGCSGGLMDKAFQYVKNNGGLDSEESYPYTAKESRNC-LYKPEFSAANNTGF 228
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETFLNHGVTAVGY-- 283
++PP E++L+ AVA+ P+SVA+DAS + +FY G+ F+ C +NHGV VGY
Sbjct: 229 VNIPP-QEKALMNAVASVGPISVAVDASLKSFRFYKSGIYFDPACRLAVNHGVLVVGYGF 287
Query: 284 -GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GT + KYWL+KNSWG+ WG DGY ++ +D + CGIA AS+P
Sbjct: 288 EGTDPDKNKYWLVKNSWGKSWGADGYIKIAKDRNN---HCGIARAASYPT 334
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 130/342 (38%), Positives = 191/342 (55%), Gaps = 24/342 (7%)
Query: 8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
++L+I +CA+ F+ + +++ +K ++ + YK AE R +I+ N + + +
Sbjct: 4 ILLLIVITCAAVQAISFFE--LVNQEWINFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQ 61
Query: 68 FN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLYKSS-Q 122
N + + +Y L++NK+ D+ EF G+ + + + G F+ + +
Sbjct: 62 HNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYNRTINHTLRNERLPVGAAFIEPCNVE 121
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P V+W + GAVT VK QG C A ++EG + + LVSLSEQ L+DC+ +
Sbjct: 122 LPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSY 181
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
NNGC GG MD AF YI NKG+ + Y YEG C K A+ + + D+P
Sbjct: 182 GNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDK-CRYDKRSSGASDV-GFVDIPVG 239
Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCE-TFLNHGVTAVGYGTSEEGI 290
DE+ L AVA PVSVAIDAS + QFYS G+ F C T L+HGV VGYGT EEG
Sbjct: 240 DEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDEEGR 299
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW++KNSWG+ WGE GY ++ R+ID CGIA AS+P+
Sbjct: 300 DYWIVKNSWGESWGEKGYIKMARNIDN---HCGIASSASYPI 338
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 142/377 (37%), Positives = 195/377 (51%), Gaps = 60/377 (15%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M Y +V L + A RT D ++ QWKAQ+ R Y E+ + R I++
Sbjct: 1 MNYYLCLVSLCLGLVAAIPKLDRTLDA-----QWYQWKAQHRRDYGENED--WRRAIWEK 53
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF--- 116
NL ++E N + G S+ + +NKF D+T +EF GF S H + G F
Sbjct: 54 NLRSIEMHNLEYSAGKHSFQMEMNKFGDMTNEEFRQVMNGF--STHRVQRRTKGRLFREP 111
Query: 117 ---------------------------LYKSS---QVPPSVNWIEKGAVTPVKYQGQC-- 144
L++ Q+P SV+W +KG VTPVK QGQC
Sbjct: 112 LLVQIPKSVDWRDKGYVTPVKNQLVRRLFREPLLVQIPKSVDWRDKGYVTPVKNQGQCGS 171
Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
A ++EG K +LVSLSEQ LVDC+T N+GC GG MD+AF+Y+ +N GI
Sbjct: 172 CWAFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKENGGID 231
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--S 256
+ Y Y ++ K + A IT Y D+P E++L KAVA P+SVAIDA S
Sbjct: 232 TEESYPY--IAADDTCQYKPQYSGANITGYVDIPSRMEKALEKAVATVGPISVAIDAGHS 289
Query: 257 ALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
+ QFY GV + C + L+HGV AVGYG + KYW++KNSWG++WG+ GY + RD
Sbjct: 290 SFQFYRSGVYYEPECSSEDLDHGVLAVGYGVQGKNGKYWIVKNSWGEEWGDSGYILMARD 349
Query: 315 IDQPQGQCGIAMFASFP 331
+ CGIA AS+P
Sbjct: 350 RNN---HCGIATAASYP 363
>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 183/319 (57%), Gaps = 22/319 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
E + ++ WK +G+ Y+ E+ R E+++ NL+ + N A++G +Y L +N
Sbjct: 27 EPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLMLITMHNLEASMGLHTYELSMNHM 86
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC 144
DLT +E + S F + ++ +PF + + VP +++W EKG VT VK QG C
Sbjct: 87 GDLTQEEIMQS---FATLSPPTDIQRAASPFAGTTGADVPDTMDWREKGCVTSVKMQGSC 143
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A A+EG A +LV LS Q LVDC+T N+GC GG M AF+Y+I N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGLMHHAFQYVIDNQG 203
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
I +DA Y Y G G C ++ AA + Y +P +E +L +A+AN P+SVAIDA+
Sbjct: 204 IDSDASYPYTG-RNGEC-RYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAIDAT 261
Query: 257 --ALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
FY GV+N C +NHGV AVGYGT +G YWL+KNSWG+ +G+ GY R+ R
Sbjct: 262 RPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTL-DGQDYWLVKNSWGKTFGDQGYIRMSR 320
Query: 314 DIDQPQGQCGIAMFASFPV 332
+ + QCGIA++ +P+
Sbjct: 321 NKND---QCGIALYGCYPI 336
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 181/323 (56%), Gaps = 27/323 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K ++ + Y++ E R +IF +N + + N A G S+ L +NK+ADL
Sbjct: 59 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
EF GF + H A+ G F+ + +P SV+W KGAVT VK QG
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITN--YEDVPPNDEESLLKAVAN-QPVSVAI 253
GI + Y YE I DS T+ + D+P DE+ + +AVA PVSVAI
Sbjct: 239 GIDTEKSYPYE----AIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 294
Query: 254 DAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
DAS + QFYS GV+N C+ L+HGV VG+GT E G YWL+KNSWG WG+ G+
Sbjct: 295 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 354
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
++ R+ + QCGIA +S+P+
Sbjct: 355 KMLRN---KENQCGIASASSYPL 374
>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 186/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NL+ ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT + NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ D+ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGDYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324
>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
Length = 337
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 180/319 (56%), Gaps = 22/319 (6%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
E + ++ WK + + Y+ E R +++ NL+ + N A++G +Y L +N
Sbjct: 27 ESRLDAHWDLWKKTHEKKYQNEVEEFSRRRLWEKNLMLITMHNLEASMGLHTYELGMNHM 86
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC 144
D+TP+E S F + ++ +PF S + +P +++W EKG VT VK QG C
Sbjct: 87 GDMTPEEIWQS---FATLTPPTDIQRAPSPFAGSSGADIPDTMDWREKGCVTSVKTQGSC 143
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
AV A+EG A K +LV LS Q LVDC+T N+GC GGFMD AF+Y+I N+G
Sbjct: 144 GSCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAFQYVIDNQG 203
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
I +DA Y Y G S AA ++Y +P DE +L +A+A P+SVAIDA+
Sbjct: 204 IDSDASYPYTGRSDQC--HYNPSYRAANCSSYNFLPEGDEGALKQALATIGPISVAIDAT 261
Query: 257 ALQ--FYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
+ FY GV+N C +NHGV AVGYGT G YWL+KNSWG +G+ GY R+ R
Sbjct: 262 RPRFIFYRSGVYNDPSCSQEVNHGVLAVGYGTL-NGQDYWLVKNSWGTKFGDQGYIRMAR 320
Query: 314 DIDQPQGQCGIAMFASFPV 332
+ + QCGIAM+ +P+
Sbjct: 321 NQND---QCGIAMYGCYPI 336
>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 186/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NL+ ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT + NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ D+ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGDYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324
>gi|52546918|gb|AAU81592.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 196
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 107/185 (57%), Positives = 126/185 (68%), Gaps = 4/185 (2%)
Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
N+LVSLSEQ+LVDC N N GC GG MD AF +I + GIT + Y Y + G CD
Sbjct: 3 NKLVSLSEQELVDC-DNGENQGCNGGLMDLAFDFIKKKGGITTEENYPYMA-ADGKCDLK 60
Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLN 275
K I +EDVPPNDEESLLKAVANQPVSVAI+AS QFYS GVF G C T L+
Sbjct: 61 KRNTPVVSIDGHEDVPPNDEESLLKAVANQPVSVAIEASGSDFQFYSEGVFTGDCGTELD 120
Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKE 335
HGV VGYGT+ +G KYW ++NSWG +WGE GY R+QRDID +G CGIAM S+P+
Sbjct: 121 HGVAIVGYGTTLDGTKYWTVRNSWGPEWGEKGYIRMQRDIDAEEGLCGIAMQPSYPIKTS 180
Query: 336 SAQPS 340
S P+
Sbjct: 181 SDNPT 185
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 188/342 (54%), Gaps = 30/342 (8%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
+ ++++++ C + T D + F +W ++Y E R+ ++++N +
Sbjct: 4 ITILVLLAAICVASTLATTHDP--LTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQLI 60
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEF--IASQTGFKMSDHSSSLKA-NGTPFLYKSSQ 122
E N + N++ L +NKF DLT EF + F S H++ A P +
Sbjct: 61 EEHNRS---NKTSFLAMNKFGDLTNAEFNKLFKGLAFDYSFHANKAAAEKAVP----APG 113
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+ +W +KGAVT VK QGQC + EG N +K RL SLSEQ L+DC+ +
Sbjct: 114 LSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSY 173
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
NNGC GG MD AF+YII NKGI +A Y Y+ + C A + +T+Y DV
Sbjct: 174 GNNGCNGGLMDYAFEYIINNKGIDTEASYPYQ-TAQYTCQYNPA-NSGGSLTSYTDVSSG 231
Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIK 291
DE +LL AVA +P SVAIDAS + QFYSGGV+ + T L+HGV AVG+GT E+G
Sbjct: 232 DENALLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWGT-EDGQD 290
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YWL+KNSWG DWG GY ++ R+ CGIA AS+P +
Sbjct: 291 YWLVKNSWGADWGLAGYIKMARN---RSNNCGIATSASYPTA 329
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 127/323 (39%), Positives = 184/323 (56%), Gaps = 25/323 (7%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
+ +++ +E WK + + Y E E +R I++ NL +E N ++G SY L +N F
Sbjct: 21 DARLSDHWELWKNWHSKKYHEKEEGWRRM-IWEKNLNKIELHNLEHSMGKHSYRLGMNHF 79
Query: 86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS-VNWIEKGAVTPVKYQGQC 144
D+T +EF G++ + KA G+ F+ + V PS V+W EKG VTPVK QGQC
Sbjct: 80 GDMTHEEFRQIMNGYQ---RKTERKAIGSLFMEPNFMVAPSAVDWREKGYVTPVKDQGQC 136
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A+ZG N K+ +LVSLSEQ LVDC+ + N GC GG MD AF+Y+ N+G
Sbjct: 137 GSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPEGNEGCGGGLMDQAFQYVKDNQG 196
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA- 255
+ ++ Y Y G C + ++ T + D+P E +L+KAVA+ PVSVAIDA
Sbjct: 197 LDSEDSYPYLGTDDQPC-HYDPKYNSVNDTGFVDIPSGKEHALMKAVASVGPVSVAIDAG 255
Query: 256 -SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYF 309
+ QFY G+ + C + L+HGV AVGYG E G KYW++KNSW + WG+ GY
Sbjct: 256 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYI 315
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
+ +D + CGIA AS+P+
Sbjct: 316 YMAKD---RKNHCGIATAASYPL 335
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 175/310 (56%), Gaps = 28/310 (9%)
Query: 36 QWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIA 95
+WK + + Y E + R+ I+KDN + N + + L +N+F D+T EF
Sbjct: 29 RWKMAHNKAYSHDGEETVRYTIWKDNERRIREHN---LQGGDFLLEMNQFGDMTNNEF-K 84
Query: 96 SQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA-------VA 147
G+ H S G+ FL +S V P SV+W +G VTPVK QGQC
Sbjct: 85 DFNGYLSHKHVS-----GSTFLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTG 139
Query: 148 AVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYE 207
++EG N K +LVSLSEQ LVDC+T NNGC GG MD+AF YI +N GI ++A Y Y
Sbjct: 140 SLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYT 199
Query: 208 GMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGG 264
G C K + AA T + D+P DE L +AVA+ P+SVAIDAS + QFY G
Sbjct: 200 AKD-GKCAFTKP-NVAATDTGFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQFYRKG 257
Query: 265 VFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
V+N T L+HGV VGYGT E G YWL+KNSW WG+ GY ++ R+ + QC
Sbjct: 258 VYNERKCSSTELDHGVLVVGYGT-ESGKDYWLVKNSWNTSWGDKGYIKMSRN---AKNQC 313
Query: 323 GIAMFASFPV 332
GIA AS+P+
Sbjct: 314 GIATNASYPL 323
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 181/323 (56%), Gaps = 27/323 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K ++ + Y++ E R +IF +N + + N A G S+ L +NK+ADL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
EF GF + H A+ G F+ + +P SV+W KGAVT VK QG
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITN--YEDVPPNDEESLLKAVAN-QPVSVAI 253
GI + Y YE I DS T+ + D+P DE+ + +AVA PVSVAI
Sbjct: 235 GIDTEKSYPYE----AIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290
Query: 254 DAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
DAS + QFYS GV+N C+ L+HGV VG+GT E G YWL+KNSWG WG+ G+
Sbjct: 291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 350
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
++ R+ + QCGIA +S+P+
Sbjct: 351 KMLRN---KENQCGIASASSYPL 370
>gi|410990010|ref|XP_004001243.1| PREDICTED: cathepsin L1 isoform 2 [Felis catus]
Length = 337
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/341 (37%), Positives = 185/341 (54%), Gaps = 25/341 (7%)
Query: 9 VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
+L ++G C A+ S+ ++ QWKA +G+ Y + E +R +++ N+ +E+
Sbjct: 4 LLFLAGLCLGVASAAPQLYQSLDARWSQWKATHGKLYGMNDEVWRR-AVWERNMKMIEQH 62
Query: 69 NNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
N + G ++T+ +N F D+T +EF G K+ K PF ++P SV
Sbjct: 63 NREHSQGKHTFTMAMNAFGDMTNEEFRQVMNGLKIQKRKK-WKVFQAPFFV---EIPSSV 118
Query: 128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
+W EKG VTPVK QG C A A+EG K +LVSLSEQ LVDC+ + N G
Sbjct: 119 DWREKGYVTPVKDQGYCLCCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQTEGNEGY 178
Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK--AEDHAAQITNYEDVPPNDEE 238
GG +DDAF+Y+ N G+ ++ Y Y S K E+ A +T+Y D+P + E
Sbjct: 179 SGGLIDDAFQYVKDNGGLDSEESYPYHAQVKRASYSCKYRPENSVANVTDYWDIPSKENE 238
Query: 239 SLLKAVANQPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIK 291
++ A P+S AIDAS +FY G+ ++ C + ++HGV VGY GT E K
Sbjct: 239 LMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTETENKK 298
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+IKNSWG DWG DGY ++ +D D CGIA ASFP
Sbjct: 299 YWIIKNSWGTDWGMDGYIKMAKDRDN---HCGIASLASFPT 336
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 186/315 (59%), Gaps = 21/315 (6%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQ 91
+F WK ++GR+Y+ +E +R +I+ +N V N A G +SY L + +FAD+ +
Sbjct: 26 EFHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNE 85
Query: 92 EFIASQTGFKMSDHSSSLKANGTPF--LYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
E+ + + + ++S G+ F L + + +P +V+W +KG VT VK Q QC
Sbjct: 86 EYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWA 145
Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
A ++EG N K +LVSLSEQQLVDC+ + N GC GG MD AFKYI +N GI +
Sbjct: 146 FSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEK 205
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQ 259
Y YE G C K E+ A+ T Y DV DE++L +AVA PVSV IDA S+ Q
Sbjct: 206 SYPYEA-EDGQC-RFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQ 263
Query: 260 FYSGGVFNGY-CETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
Y GV++ C + L+HGV AVGYGT + G YWL+KNSWG WG++GY + R+ D
Sbjct: 264 LYDSGVYDEQDCSSQDLDHGVLAVGYGT-DNGQDYWLVKNSWGLGWGQEGYIMMSRNKDN 322
Query: 318 PQGQCGIAMFASFPV 332
QCGIA AS+P+
Sbjct: 323 ---QCGIATAASYPL 334
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 131/325 (40%), Positives = 182/325 (56%), Gaps = 31/325 (9%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K ++ + Y++ E R +IF +N + + N A G S+ L +NK+ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
EF GF + H A+ G F+ + +P SV+W KGAVT VK QG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 197 GITNDAVYSYEGMSTGICD----SIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSV 251
GI + Y YE + C +I A D + D+P DE+ + +AVA PVSV
Sbjct: 205 GIDTEKSYPYEAIDDS-CHFNKGTIGATDRG-----FTDIPQGDEKKMAEAVATVGPVSV 258
Query: 252 AIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
AIDAS + QFYS GV+N C+ L+HGV VG+GT E G YWL+KNSWG WG+ G
Sbjct: 259 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKG 318
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
+ ++ R+ + QCGIA +S+P+
Sbjct: 319 FIKMLRN---KENQCGIASASSYPL 340
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 181/323 (56%), Gaps = 27/323 (8%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K ++ + Y++ E R +IF +N + + N A G S+ L +NK+ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
EF GF + H A+ G F+ + +P SV+W KGAVT VK QG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITN--YEDVPPNDEESLLKAVAN-QPVSVAI 253
GI + Y YE I DS T+ + D+P DE+ + +AVA PVSVAI
Sbjct: 205 GIDTEKSYPYE----AIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260
Query: 254 DAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
DAS + QFYS GV+N C+ L+HGV VG+GT E G YWL+KNSWG WG+ G+
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 320
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
++ R+ + QCGIA +S+P+
Sbjct: 321 KMLRN---KENQCGIASASSYPL 340
>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 185/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NLV ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNSEDIDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
P K Q C AV A+EG K LVSLS Q+LVDCAT D NNGC GG M A
Sbjct: 126 PAKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ ++ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 180/323 (55%), Gaps = 34/323 (10%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNRSYTLRLNK 84
S+ ++++ +KA++GR Y E R +F+ N ++ RF N + ++TL++N+
Sbjct: 17 SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEV---TFTLQMNQ 73
Query: 85 FADLTPQEFIASQTGF---KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
F D+T +E +A+ GF ++ LKA+ +P V+W KGAVTPVK Q
Sbjct: 74 FGDMTSEEIVATMNGFLGAPTRRPAAVLKAD-------DETLPEKVDWRTKGAVTPVKDQ 126
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
QC ++EG + +K +LVSLSEQ LVDC+ N GC GG MD AF+YI
Sbjct: 127 KQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKA 186
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAI 253
NKGI + Y YE G C A + A T Y DV E +L KAVA P+SV I
Sbjct: 187 NKGIDTEDSYPYEAQD-GKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGI 244
Query: 254 DA--SALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
DA S FY GV+ + +C T L+HGV AVGYG+ E G +WL+KNSW WG+ GY
Sbjct: 245 DASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYI 304
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
++ R+ + CGIA AS+P+
Sbjct: 305 KMSRNRNN---NCGIASQASYPL 324
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 179/316 (56%), Gaps = 27/316 (8%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
++ +WK+ Y R Y + E +R +++ N+ +E N + G YT+ +N F D+T +
Sbjct: 28 QWHKWKSTYRRLYGTNEEEWRR-AVWEKNMKMIELHNGEYSEGKHGYTMEMNAFGDMTNE 86
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
EF G+K H K P + Q+P SV+W EKG VTPVK QGQC
Sbjct: 87 EFRQLVNGYKHQKHRKG-KVFQEPLML---QLPKSVDWREKGCVTPVKNQGQCGSCWAFS 142
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A A+EG +K LVSLSEQ LVDC+ + N GC GG MD AF+Y++ NKG+ ++ Y
Sbjct: 143 ACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYVLNNKGLDSEESY 202
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFY 261
YE G C K E AA T Y D+ P E++L+KAVA P+++AIDAS + QFY
Sbjct: 203 PYEA-KDGTC-KYKPEFAAANDTGYVDI-PQLEKALMKAVATVGPIAIAIDASHPSFQFY 259
Query: 262 SGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
S G+ + C + L+HGV VGY GT KYW++KNSWG WG G+F + +D +
Sbjct: 260 SSGIYYEPNCSSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSWGMGGFFHIAKDKN 319
Query: 317 QPQGQCGIAMFASFPV 332
CG+A AS+P
Sbjct: 320 N---HCGVATAASYPT 332
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 132/341 (38%), Positives = 192/341 (56%), Gaps = 28/341 (8%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
F IV +++ S A +E +I +K QY + Y+ E +R +++ NL
Sbjct: 4 FAIVAALVAVSFARVPRVGLDNEWNI------FKKQYNKLYQNEEEARRRL-VWESNLDF 56
Query: 65 VERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
+ N AA G ++ + +N++ D+T +EF + G++M + +S+ P +
Sbjct: 57 ITLHNLAADRGEHTFWVGMNEYGDMTNEEFTKTMNGYRMRNKTSNAPVFMPP--NNMGDL 114
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P +V+W KG VTP+K QGQC A ++EG K +LVSLSEQ LVDC+
Sbjct: 115 PDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKKQG 174
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N+GC GG MDDAF YI N GI +A Y Y+ G C+ K+ D A T + D+ D
Sbjct: 175 NHGCEGGLMDDAFTYIKANNGIDTEASYPYKARD-GKCE-FKSADVGATDTGFVDIKTKD 232
Query: 237 EESLLKAVAN-QPVSVAIDASAL--QFYSGGVFNGY--CETFLNHGVTAVGYGTSEEGIK 291
EE+L +AVA P+SVAIDAS + Q Y GV++ + +T L+HGV AVGYGT E+
Sbjct: 233 EEALKQAVATVGPISVAIDASHMSFQLYRTGVYHDWFCSQTKLDHGVLAVGYGT-EDSKD 291
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG+ WG+ GY ++ R+ + CGIA AS+P
Sbjct: 292 YWLVKNSWGESWGQKGYIQMSRN---RRNNCGIATSASYPT 329
>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
Length = 327
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 136/341 (39%), Positives = 193/341 (56%), Gaps = 31/341 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L++VL+ C + AT +I ++E +K +G+ Y E E++ R IF +N V
Sbjct: 3 LLIVLV----CVAVAT-------AIDNEWEAFKLLHGKQYNE-YEDTARHAIFLENCKIV 50
Query: 66 ERFNN-AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF-LYKSSQV 123
++ N AA+G ++ +R+NKF DLT +EF G + + + +A G F +V
Sbjct: 51 KQHNEEAAMGKHTFFMRMNKFGDLTNEEFRMLVIGSGLMQSNRTQQAEGGVFESIPGLKV 110
Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
+V+W +KGAVT VK Q QC ++EG + +K LVSLSEQ LVDC+ +
Sbjct: 111 NDTVDWRQKGAVTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEG 170
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG MD AFKYI N GI + Y Y+G C+ KA A ++++ DV D
Sbjct: 171 NKGCKGGLMDQAFKYIKTNGGIDTEECYPYKGRDERKCE-YKASCSGATLSSFVDVKTGD 229
Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIK 291
E++L +A A P+SV IDAS + Q Y GV++ C + L+HGV VGYGT +
Sbjct: 230 EDALKQASATIGPISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGT-QSTKD 288
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG DWG +GY + R+ D QCGIA AS+PV
Sbjct: 289 YWLVKNSWGADWGMEGYIMMSRNKDN---QCGIATQASYPV 326
>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
Length = 331
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLTSSLRVP---SQWQRNIT---YKSNPNR 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LV+LS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII NKGI +DA Y Y+ M ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L +AVAN+ PVSV +DA F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG ++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 186/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NL+ ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT + NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ D+ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGDYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIDYYNTYPI 324
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 192/342 (56%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ L++ S +Q + ++ + WK YG+ Y E E ++R I++ NL V
Sbjct: 4 LVWTLLVCCSAMAQ----LHRDPALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--- 121
N ++G SY L +N D+T +E ++ T K+ S + N T YKSS
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVPRQS---QRNVT---YKSSPNQ 113
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P S++W EKG VT VKYQG C AV A+E + +LVSLS Q LVDC+T
Sbjct: 114 KLPDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC+GGFM +AF+YII N GI ++A Y Y+ M +++ AA + Y ++P
Sbjct: 174 KYRNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKC--QYDSKNRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
EE+L +AVA++ PVSVAIDAS F+ SG + C +NHGV VGYG + G
Sbjct: 232 FGSEEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYG-NLNG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG +G+ GY R+ R+ + CGIA ++S+P
Sbjct: 291 NDYWLVKNSWGLYFGDKGYIRMARN---RENHCGIASYSSYP 329
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 199/342 (58%), Gaps = 26/342 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL+ + +++ CA+ A + + + ++ +WK + ++Y +R ++++N+
Sbjct: 6 FLVAIGLVA--CATAAFVKPTNP-DLDSRWLEWKIAHTKSYTNDMHELERRLVWEENVKM 62
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-Q 122
+ N + ++ + + L +N++ D+ E ++ G+K S+ + K G+ FL S+ Q
Sbjct: 63 INMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYKSSNVT---KVQGSTFLTPSNIQ 119
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP +V+W KG VTPVK QGQC ++EG K ++LVSLSEQ LVDC+ +
Sbjct: 120 VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTE 179
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD F+Y+I N GI ++ Y Y+ C KA +A++T + DV
Sbjct: 180 GNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDE-TC-HYKASCDSAEVTGFTDVTSG 237
Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEGI 290
DE++L++AVA+ PVSVAIDAS + Q Y GV++ C + L+HGV VGYGT + G
Sbjct: 238 DEQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYGT-DGGK 296
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG+ WG GY ++ R+ QCGIA AS+P+
Sbjct: 297 DYWLVKNSWGETWGLSGYIKMSRN---KSNQCGIATSASYPL 335
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 180/323 (55%), Gaps = 34/323 (10%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNRSYTLRLNK 84
S+ ++++ +KA++GR Y E R +F+ N ++ RF N + ++TL++N+
Sbjct: 18 SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEV---TFTLQMNQ 74
Query: 85 FADLTPQEFIASQTGF---KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
F D+T +E +A+ GF ++ LKA+ +P V+W KGAVTPVK Q
Sbjct: 75 FGDMTSEEIVATMNGFLGAPTRRPAAVLKAD-------DETLPEKVDWRTKGAVTPVKDQ 127
Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
QC ++EG + +K +LVSLSEQ LVDC+ N GC GG MD AF+YI
Sbjct: 128 KQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKA 187
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAI 253
NKGI + Y YE G C A + A T Y DV E +L KAVA P+SV I
Sbjct: 188 NKGIDTEDSYPYEAQD-GKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGI 245
Query: 254 DA--SALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
DA S FY GV+ + +C T L+HGV AVGYG+ E G +WL+KNSW WG+ GY
Sbjct: 246 DASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYI 305
Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
++ R+ + CGIA AS+P+
Sbjct: 306 KMSRNRNN---NCGIASQASYPL 325
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 134/330 (40%), Positives = 186/330 (56%), Gaps = 28/330 (8%)
Query: 18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNR 76
S A + + ++ ++ WK YG+ Y+E E R I++ NL V N ++G
Sbjct: 12 SSAMAQVHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLEHSMGMH 71
Query: 77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPSVNWIEKG 133
SY L +N D+T +E I+S + ++ S N T YKSS ++P S++W EKG
Sbjct: 72 SYELGMNHLGDMTSEEVISSMSSLRVP---SQWPRNVT---YKSSPNQKLPDSLDWREKG 125
Query: 134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT-NDNNNGCYGGFM 185
VT VKYQG C AV A+E +K +LVSLS Q LVDC+T N GC GGFM
Sbjct: 126 CVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFM 185
Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
+AF+YII N GI ++A Y Y+ M G C ++ AA + Y ++P EE+L +AVA
Sbjct: 186 TEAFQYIIDNNGIDSEASYPYKAMD-GRCQ-YDVKNRAATCSRYIELPFGSEEALKEAVA 243
Query: 246 NQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQ 301
N+ PVSV IDA F+ +G ++ C +NHGV VGYG S G YWL+KNSWG
Sbjct: 244 NKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-SLNGKDYWLVKNSWGL 302
Query: 302 DWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
++G+ GY R+ R+ CGIA F S+P
Sbjct: 303 NFGDQGYIRMARN---SGNHCGIANFPSYP 329
>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
Length = 322
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 181/315 (57%), Gaps = 28/315 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
++ +K QYGR Y + E+ R +F+ N +E N G ++TL++N+F D+T +E
Sbjct: 19 WQDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEE 78
Query: 93 FIASQTGF---KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
F A+ GF + L+A+ +P V+W KGAVTPVK Q QC
Sbjct: 79 FAATMNGFLNVPTRHPVAILEAD-------DETLPKHVDWRTKGAVTPVKDQKQCGSCWA 131
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
++EG + +K +LVSLSEQ LVDC+ N GC GG MD AFKYI +NKGI +
Sbjct: 132 FSTTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEE 191
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQ 259
Y YE G C + + A T + D+ +E SL+KAVAN P+SVAIDAS + Q
Sbjct: 192 SYPYEAQD-GKC-RFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQ 249
Query: 260 FYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
FY GV + C T L+HGV A+GYG +++G +YWL+KNSW WG+ G+ ++ R+
Sbjct: 250 FYHQGVYYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRN--- 306
Query: 318 PQGQCGIAMFASFPV 332
+ CGIA AS+P+
Sbjct: 307 KKNNCGIASQASYPL 321
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 179/317 (56%), Gaps = 19/317 (5%)
Query: 28 GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFA 86
G E ++++K +G+ Y E KRF+IF+D L +E N +G +SY + +N+F+
Sbjct: 48 GPYHETWKEFKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFS 107
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
D++ E++ G + + S + Q+ V+W +KG VTPVK QGQC
Sbjct: 108 DMSHDEYL-RHNGLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGS 166
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
++EG + + +L+SLSEQQLVDC+ N GC GG MD+AF+YI G+
Sbjct: 167 CWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLE 226
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
+ Y Y G C +K A T DV DE++L A+A+ P+SVAIDAS
Sbjct: 227 GEDDYPYTA-KQGKC-HLKKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASHA 284
Query: 257 ALQFYSGGVFNGY-CET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
+ Q Y GGV++ C + L+HGV VGYGT E G YWL+KNSWG+ WGE+GY ++ R+
Sbjct: 285 SFQSYDGGVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRN 344
Query: 315 IDQPQGQCGIAMFASFP 331
D QCGIA AS+P
Sbjct: 345 KDN---QCGIATQASYP 358
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 178/316 (56%), Gaps = 25/316 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
+EQWK +G+ Y E E +R +++ NL +E N ++G +Y L +N+F D+T +E
Sbjct: 29 WEQWKNWHGKKYHEKEEGWRRM-VWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMTHEE 87
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA------ 145
F G+K H + G+ F+ + +VP S++W EKG VTPVK QG+C
Sbjct: 88 FRQVMNGYK---HKKERRFRGSLFMEPNFLEVPNSLDWREKGYVTPVKDQGECGSCWAFS 144
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A+EG K +LVSLSEQ LVDC+ + N GC GG MD AF+YI G+ ++ Y
Sbjct: 145 TTGAMEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDQNGLDSEESY 204
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
Y G C + AA T + D+P E +L+KA+A PVSVAIDA + QFY
Sbjct: 205 PYVGTDDQPC-HYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVSVAIDAGHESFQFY 263
Query: 262 SGGV-FNGYCET-FLNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
G+ + C + L+HGV AVGYG E G KYW++KNSW ++WG+ GY + +D
Sbjct: 264 QSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKGYVYMAKD-- 321
Query: 317 QPQGQCGIAMFASFPV 332
CGIA AS+P+
Sbjct: 322 -RHNHCGIATAASYPL 336
>gi|348531513|ref|XP_003453253.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 333
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 136/347 (39%), Positives = 198/347 (57%), Gaps = 30/347 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M F++ ++ SCAS + +F WK ++ ++Y +E ++R +I+
Sbjct: 1 MKLLFVVAAVLAVSSCASISLEDM--------EFHAWKLKFEKSYDSPSEETQRKQIWLS 52
Query: 61 NLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF--L 117
N V + N A +G +SY L + FAD+ +E+ + + ++SL G+ F L
Sbjct: 53 NRKLVLKHNALADLGLKSYHLGMTYFADMENEEYKKLISQGCLGSFNASLPRRGSTFNRL 112
Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
K + +P +V+W +KG VT VK Q QC A A+EG + K RLV LSEQQLVD
Sbjct: 113 PKGTVLPDTVDWRKKGYVTKVKNQQQCGSCWAFSATGALEGQHFKKTGRLVYLSEQQLVD 172
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C+ N N GC GG+M++AFKYI N GI +A Y Y+ M G+C A Y
Sbjct: 173 CSRNFGNRGCDGGWMNNAFKYIKDNGGIQTEASYPYQAMD-GLCH-YNPNSVGAICNGYV 230
Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY-C-ETFLNHGVTAVGYGT 285
DV P DEE+L +AVA P+S+A+DAS + Q Y GV++ + C + +L+HG+ VGYGT
Sbjct: 231 DVSP-DEEALKEAVATIGPISIAMDASHESFQLYQSGVYDEHRCNDYYLSHGMLVVGYGT 289
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E G+ YWLIKNSWG WG+ GY ++ R+ + QCGIA AS+P+
Sbjct: 290 -EGGLDYWLIKNSWGLGWGKMGYIKMVRN---KRNQCGIATAASYPL 332
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 180/311 (57%), Gaps = 27/311 (8%)
Query: 37 WKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIAS 96
+K+ + ++Y++ E R IF+DNL +E FN +TL +N+FAD+T EF
Sbjct: 31 FKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNM 90
Query: 97 QTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQGQCA-------V 146
G + K G +++SS V P V+W +KG VT VK QGQC
Sbjct: 91 LLGLGGRN-----KIAGDS-VFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTT 144
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
++EG K +LVSLSEQ LVDC+T++ N GC GG MD AF YI +N GI +A Y Y
Sbjct: 145 GSLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPY 204
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL--QFYSG 263
G S G C ++ + A ++ + DV DE +L +AVA P+SVAIDAS++ QFY G
Sbjct: 205 TG-SDGTCRFLENK-VGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRG 262
Query: 264 GVFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
GV+N + T L+HGV VGYGT E G YWL+KNSWG WG GY ++ R+ + +
Sbjct: 263 GVYNPWFCSSTELDHGVLVVGYGT-EGGKDYWLVKNSWGSSWGLKGYIKMVRN---KKNR 318
Query: 322 CGIAMFASFPV 332
CGIA AS+P
Sbjct: 319 CGIATQASYPT 329
>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
Length = 306
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 183/318 (57%), Gaps = 34/318 (10%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNRSYTLRLNKFADLT 89
++ +K QYGR Y + E+ R +F+ N +E +F N + ++TL++N+F D+T
Sbjct: 3 WQDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEV---TFTLKMNQFGDMT 59
Query: 90 PQEFIASQTGF---KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
+EF A+ GF + L+A+ +P V+W KGAVTPVK Q QC
Sbjct: 60 SEEFAATMNGFLNVPTRHPVAILEAD-------DETLPKHVDWRTKGAVTPVKDQKQCGS 112
Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
++EG + +K +LVSLSEQ LVDC+ N GC GG MD AFKYI +NKGI
Sbjct: 113 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGID 172
Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
+ Y YE G C + + A T + D+ +E SL+KAVAN P+SVAIDAS
Sbjct: 173 TEESYPYEAQD-GKC-RFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHP 230
Query: 257 ALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
+ QFY GV + C T L+HGV A+GYG +++G +YWL+KNSW WG+ G+ ++ R+
Sbjct: 231 SFQFYHQGVYYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRN 290
Query: 315 IDQPQGQCGIAMFASFPV 332
+ CGIA AS+P+
Sbjct: 291 ---KKNNCGIASQASYPL 305
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 112/227 (49%), Positives = 145/227 (63%), Gaps = 14/227 (6%)
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P +V+W +KGAV +K QG C A VEGIN I L+SLSEQ+LVDC
Sbjct: 4 LPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDC-DKS 62
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF++I++N G+ + Y Y G S G C+S+ I YEDVP N
Sbjct: 63 YNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRG-SDGKCNSLLKNSKVVTIDGYEDVPTN 121
Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
DE +L +AV+ QPVSVAIDA Q Y G+F G C T ++H V AVGYG SE G+ YW
Sbjct: 122 DETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYG-SENGVDYW 180
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQP 339
+++NSWGQ WGEDGY R++R++ + G+CGIA+ AS+PV K S P
Sbjct: 181 IVRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPV-KYSPNP 226
>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 137/348 (39%), Positives = 195/348 (56%), Gaps = 32/348 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M Y + L + + A R D ++ QWKAQ+G++Y E+ E+S R I++
Sbjct: 1 MNFYLCLASLCLGLAAAIPPFDRALDS-----QWHQWKAQHGKSY-EANEDSLRRAIWEK 54
Query: 61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL +ER N G +S+ L +NKF D+T +EF Q + S+S + +L++
Sbjct: 55 NLKMIERHNQEYRAGKQSFQLGMNKFGDMTTEEF---QEAINFYNSSASQRRT-KRYLHR 110
Query: 120 S---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
+Q+P SV+W E+G VTPVK QGQC AV A+EG K LVSLS Q LV
Sbjct: 111 EPLLAQLPESVDWREEGYVTPVKNQGQCLSCWAFSAVGAIEGQWFRKTGELVSLSIQNLV 170
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC T+D+ + C+GGFMD AF+Y+ N GI + Y Y G C + E A + +
Sbjct: 171 DCTTSDSISSCHGGFMDRAFQYVQDNGGIDTEECYPYVG-EVNEC-KYQPECSGANVVGF 228
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG 284
D+P DE +L++AVA P+SVAID + +FY GV ++ C + LNH VGYG
Sbjct: 229 VDIPSMDERALMEAVATVGPISVAIDGGNPSFKFYESGVYYDPQCSSSQLNHAGLVVGYG 288
Query: 285 TSE-EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+ +G KYW++KNSWG+ WG +GY + +D D CGIA AS+P
Sbjct: 289 SEGIDGRKYWIVKNSWGELWGNNGYILMAKDEDN---HCGIATEASYP 333
>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
Length = 327
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 194/337 (57%), Gaps = 27/337 (8%)
Query: 10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
L+I C + AT +I ++E +K +G+ Y E E+ R+ IF++N V++ N
Sbjct: 3 LLIFVVCVAVAT-------AIDPQWEAFKLLHGKQYSE-YEDGARYAIFQENSRIVKQHN 54
Query: 70 N-AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF-LYKSSQVPPSV 127
AA+G ++ +R+NKF D+T +EF G + + + + G F +V +V
Sbjct: 55 EEAAMGKHTFFMRMNKFGDMTNEEFQMLVIGSGLLYSNKTQQTEGGVFESLPGLKVNDTV 114
Query: 128 NWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
+W +KGAVT VK Q QC ++EG + +K LVSLSEQ LVDC+ + N GC
Sbjct: 115 DWRQKGAVTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGC 174
Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
GG MD AFKYI N GI + Y Y+G + C+ K+ A +++Y D+ DE++L
Sbjct: 175 QGGLMDQAFKYIKTNGGIDTEECYPYKGKNERKCE-YKSSCSGATLSSYVDIKTGDEDAL 233
Query: 241 LKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLI 295
++A A P+SV IDAS + Q Y GV++ C + L+HGV VGYGT E YWL+
Sbjct: 234 MQASATIGPISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGTDGEK-DYWLV 292
Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KNSWG++WG +GY ++ R+ D QCGIA AS+PV
Sbjct: 293 KNSWGEEWGMEGYIKMSRNKDN---QCGIATQASYPV 326
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 129/344 (37%), Positives = 190/344 (55%), Gaps = 23/344 (6%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
+++VL + S + +E I E++ +K Q+ + Y++ E + R +++ DN + +
Sbjct: 3 VVIVLGLVAFAISTVSSINLNE-VIEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLKI 61
Query: 66 ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMS----DHSSSLKANGTPFLYKS 120
N G +Y L +N F DL E+ GFK S D + + T ++
Sbjct: 62 AGHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSEN 121
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
+P SV+W +KG VTPVK QGQC A ++EG + K LVSLSEQ L+DC+
Sbjct: 122 VVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
NNGC GG MD AFKYI NKG+ + Y YE E+ A + D+P
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKC--RYNPENSGATDKGFVDIP 239
Query: 234 PNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEE 288
DE++L+ A+A PVS+AIDAS+ QFY GVF N C T L+HGV AVG+G+ ++
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G YW++KNSWG+ WG++GY + R+ + CG+A AS+P+
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 185/342 (54%), Gaps = 36/342 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
LI +I C++ R F + F+ W ++ ++Y E R+ +F+DN+ V
Sbjct: 7 LIFCFLIINCCSAA---RIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDNMDIV 62
Query: 66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT---PFLYKSSQ 122
++N + L LN ADLT +EF G KAN T L S
Sbjct: 63 AKWNQKG---SNTILGLNVMADLTNEEFKKLYLG---------TKANVTYKKKTLVGVSG 110
Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
+P SV+W GAVT VK QGQC +VEGI+ I +LV LSEQQ++DC+ ++
Sbjct: 111 LPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSE 170
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
NNGC GG M ++F+YII G+ +A Y Y G G C ++ A IT Y++V
Sbjct: 171 GNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTG-EVGKC-KFNKKNIGATITGYKNVESG 228
Query: 236 DEESLLKAVANQPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIK 291
E L AVA QPVSVAIDA S+ Q Y+ GV + C T L+HGV AVGYG S+ G
Sbjct: 229 SESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYG-SQSGQD 287
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
YW++KNSWG DWGE+G+ + R+ D CGIA ASFP +
Sbjct: 288 YWIVKNSWGADWGENGFILMARNKDN---NCGIATMASFPTA 326
>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 185/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NL+ ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT + NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ D+ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGDYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + +P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNPYPI 324
>gi|157278117|ref|NP_001098157.1| cathepsin S precursor [Oryzias latipes]
gi|50251130|dbj|BAD27582.1| cathepsin S [Oryzias latipes]
Length = 327
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 177/317 (55%), Gaps = 25/317 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
+ + + WK Y +TY E R I+++NL + N ++G SY L +N DL
Sbjct: 21 LDQHWNLWKKTYSKTYSHEIEEFGRRRIWEENLEMISVHNLEVSLGLHSYELAMNHLGDL 80
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
T +E IAS TG + L+ + ++ VP SV+W E G VT VK QG+C
Sbjct: 81 TIEELIASLTG---TVAPVGLERIHYDLVKINTSVPESVDWREGGLVTSVKTQGRCGSCW 137
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
AV A+EG L SLS Q LVDC+T N GC GGFM +AF+Y+I+N+GI++D
Sbjct: 138 AFSAVGALEGQLKKTTGILTSLSPQNLVDCSTKYGNYGCKGGFMSNAFQYVIKNQGISSD 197
Query: 202 AVYSYEGMSTGICDSIK--AEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
A Y Y G D K ++ AA T Y +P DE +L VA P+SVAIDAS
Sbjct: 198 AAYPYIGKR----DKCKYDSKHRAANCTGYNFLPKGDEFALKVGVATIGPISVAIDASRP 253
Query: 257 ALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
FY GV+ + C +NHGV VGYGT E G YWL+KNSWG+ +G+ GY ++ R+
Sbjct: 254 KFLFYRHGVYKDHSCSHNVNHGVLVVGYGT-ENGEDYWLVKNSWGERYGDGGYIKMARN- 311
Query: 316 DQPQGQCGIAMFASFPV 332
+ QCGIA++A FPV
Sbjct: 312 --RRNQCGIALYACFPV 326
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 185/352 (52%), Gaps = 40/352 (11%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV- 65
+++ +++G+CA + E++ +K ++ + Y E+ R +I+ +N +
Sbjct: 6 VLLCLVAGACAVSLL------DLVREEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIA 59
Query: 66 ---ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD---------HSSSLKANG 113
+RF + SY L+ NK+AD+ EF+ + GF + HS
Sbjct: 60 KHNQRFEQRLV---SYKLKPNKYADMLHHEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRA 116
Query: 114 TPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSE 165
F+ + P V+W +KGAVT VK QG+C A+EG + K LVSLSE
Sbjct: 117 ATFIAPAHVSYPDHVDWRKKGAVTDVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSE 176
Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
Q LVDC+ NNGC GG MD+AFKYI N GI + Y YE + ++ A
Sbjct: 177 QNLVDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTEKSYPYEAVDDKC--RYNPKNSGAD 234
Query: 226 ITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTA 280
+ D+P DEE L++AVA P+SVAIDAS QFYS GV+ T L+HGV
Sbjct: 235 DVGFVDIPQGDEEKLMQAVATVGPISVAIDASQETFQFYSKGVYYDENCSSTDLDHGVMV 294
Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
VGYGT EEG YWL+KNSWG+ WGE GY ++ + + CGIA AS+P+
Sbjct: 295 VGYGTEEEGGDYWLVKNSWGRSWGELGYIKMAHNKNN---HCGIASSASYPL 343
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 194/346 (56%), Gaps = 31/346 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M FL+ L + A+ +FD +E+WK ++G+TY + E KR ++++
Sbjct: 1 MTPIFLLATLCLGMISAAPTHDPSFDT-----VWEEWKTKHGKTYNTNEEGQKR-AVWEN 54
Query: 61 NLVAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLY 118
N+ + N + G ++L +N F DLT EF TGF+ M +++ PFL
Sbjct: 55 NMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFRE--PFL- 111
Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
+P S++W E G VTPVK QGQC AV ++EG K +LVSLSEQ LVDC
Sbjct: 112 --GDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDC 169
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ + N GC GG M+ AF+Y+ +N+G+ Y+YE G+C + AA +T +
Sbjct: 170 SWSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQD-GLC-RYNPKYSAANVTGFVK 227
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGTS 286
VP + E+ L+ AVA+ PVSV ID+ + +FYSGG++ T ++H V VGYG
Sbjct: 228 VPLS-EDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEE 286
Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+G KYWL+KNSWG+DWG DGY ++ +D + CGIA +A +P
Sbjct: 287 SDGGKYWLVKNSWGEDWGMDGYIKMAKDQNN---NCGIATYAIYPT 329
>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 185/330 (56%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+++Q+K +G+TY+ E +RF +F+ NL+ ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT D NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ ++ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LNHGV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + +P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNPYPI 324
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 173/311 (55%), Gaps = 22/311 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F +K +YG+ Y E++ RF IFK N+ + N N ++ L +N+F DLT +E
Sbjct: 27 FNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR---NLTFALGVNEFTDLTQEEL 83
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
AS TG K + S L T Y + + SV+W +G VTPVK QGQC
Sbjct: 84 AASYTGLKPASLWSGLPRLST-HEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTT 142
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
A+EG A+ LVSLSEQQ VDC T D+ GC GG+MD+AF + +N I + Y Y
Sbjct: 143 GALEGAWALSTGNLVSLSEQQFVDCDTTDS--GCNGGWMDNAFSFAKKNS-ICTEGSYPY 199
Query: 207 EGMSTGICDSIKAEDHAAQ--ITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
+ G C+ + Q + Y DV + E++++ AVA QPVS+AI+A + Q YS
Sbjct: 200 TA-TDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYS 258
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GV C T L+HGV AVGYG SE G YW +KNSWG WGE GY RLQR G+C
Sbjct: 259 SGVLTASCGTRLDHGVLAVGYG-SEAGTDYWKVKNSWGSSWGEQGYVRLQRG-KGGAGEC 316
Query: 323 G-IAMFASFPV 332
G +A S+PV
Sbjct: 317 GLLAGPPSYPV 327
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 136/349 (38%), Positives = 192/349 (55%), Gaps = 28/349 (8%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
KY +VLI + AS ++ T + E++E +K ++ + Y+ E + R +IF +N
Sbjct: 2 KYLCALVLIAVAASASAVSFFTV----VMEEWESFKFEHSKKYESDTEETFRMKIFAENK 57
Query: 63 VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN----GTPFL 117
+ N G+++Y L +NK+ D+ EF+ GF+ + + KAN G F+
Sbjct: 58 QKIAAHNKLYHTGSKTYKLGMNKYGDMLHHEFVNMMNGFRANTSGAGYKANRGFQGAHFV 117
Query: 118 YKSSQV--PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
V P SV+W EKGAVT VK QG C A A+EG + + LVSLSEQ L
Sbjct: 118 EPPEDVVMPKSVDWREKGAVTEVKDQGSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNL 177
Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
VDC++ NNGC GG MD+AF+YI N GI + Y YE + A
Sbjct: 178 VDCSSKFGNNGCNGGLMDNAFQYIKVNGGIDTEKSYPYEAEDEPC--RYNPANAGADDRG 235
Query: 229 YEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGY 283
+ DV +E +L KA+A PVSVAIDAS + QFY GV+ + C L+HGV AVGY
Sbjct: 236 FVDVREGNENALKKAIATIGPVSVAIDASQDSFQFYQHGVYSDPDCSAENLDHGVLAVGY 295
Query: 284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GT+E+G YWL+KNSW + WG+ GY ++ R+ + CGIA AS+P+
Sbjct: 296 GTTEDGQDYWLVKNSWSKSWGDQGYIKIARNQNN---MCGIASAASYPL 341
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 179/311 (57%), Gaps = 23/311 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
+E WK+ +G+ Y E+ R +F N+ + N ++ + +N+F+DLT +EF
Sbjct: 25 WEAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHN----AKSTFKMAINEFSDLTRKEF 80
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
+ + G+++S S+ K + T ++ +P V+W ++G VTP+K QG+C
Sbjct: 81 VKTYNGYRLSMKKSTNKPS-TFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTT 139
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
++EG + K +LVSLSEQ L+DC+ + N+GC GGFMDDAF+YI N GI +A Y Y
Sbjct: 140 GSLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGIDTEASYPY 199
Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSG 263
EG IC K + A T Y D+ E+ L AVA P+SVAIDAS + Y
Sbjct: 200 EGRDD-IC-RYKKTNKGAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHT 257
Query: 264 GVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
GV++ +T L+HGV VGYGT E G YWL+KNSWG DWG +GY ++ R+
Sbjct: 258 GVYHEPECSQTVLDHGVLVVGYGT-ENGEDYWLVKNSWGTDWGMNGYIKMSRN---RSNN 313
Query: 322 CGIAMFASFPV 332
CGIA AS+P+
Sbjct: 314 CGIATNASYPL 324
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 177/318 (55%), Gaps = 29/318 (9%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
++E WK G+ Y E R I++ N V +NA +TL +N FADL E
Sbjct: 22 EWELWKRTNGKDYSSEKEELYRQTIWEANKKIVLE-HNANADKWGWTLEMNAFADLESSE 80
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA------ 145
F A G++ S+ K+N T + + +P +V+W KGAVTPVK Q QC
Sbjct: 81 FAAMYNGYR----RSARKSNATRYHVPTGNALPDTVDWRTKGAVTPVKNQKQCGSCWAFS 136
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
++EG +K L SLSEQQLVDC+ N+GC GG MD+AFKYI N GI ++A Y
Sbjct: 137 TTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYGNHGCQGGLMDNAFKYIEANGGIDSEASY 196
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
YE G C + AA T Y+D+P +D + L AVAN P+SVA+DA S+ Q Y
Sbjct: 197 PYEA-KNGKC-RFQQSAVAATCTGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSFQLY 254
Query: 262 SGGVFNGYC--ETFLNHGVTAVGYGTSEEGI-----KYWLIKNSWGQDWGEDGYFRLQRD 314
+ GV++ T L+HGV AVGYGT G+ YWL+KNSWG DWG+ GYF++ R
Sbjct: 255 AAGVYDPLLCSSTRLDHGVLAVGYGTEPSGLFHEEKPYWLVKNSWGPDWGQQGYFKIVRK 314
Query: 315 IDQPQGQCGIAMFASFPV 332
+CGIA AS+P
Sbjct: 315 ----DNKCGIATDASYPT 328
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 136/347 (39%), Positives = 189/347 (54%), Gaps = 27/347 (7%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K FLI+ + I + + + + ++ + K E KA YK E R +IF DN
Sbjct: 2 KLFLILFITIFATVHAVSFFELVNQEWMTFKMEHKKA-----YKSDVEERFRMKIFMDNK 56
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGTPFLY 118
+ + N N + SY L++NK+ D+ EF+ GF S ++ S G F+
Sbjct: 57 HKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASFIE 116
Query: 119 KSS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
++ +P V+W ++GAVTPVK QG C A A+EG + + LVSLSEQ L+D
Sbjct: 117 PANVALPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLID 176
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C+ NNGC GG MD AF+YI NKG+ +A Y YE C A A + Y
Sbjct: 177 CSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEA-ENDKCRYNPANSGAIDV-GYI 234
Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGT 285
D+P +E+ L AVA PVSVAIDAS + QFYS GV + C + L+HGV +GYGT
Sbjct: 235 DIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGT 294
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+E G YWL+KNSWG+ WG +GY ++ R+ CGIA AS+P+
Sbjct: 295 NENGEDYWLVKNSWGETWGNNGYIKMARN---KLNHCGIASSASYPL 338
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 130/325 (40%), Positives = 182/325 (56%), Gaps = 31/325 (9%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
+ E++ +K ++ + Y++ E R +IF +N + + N A G S+ L +NK+ADL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 89 TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
EF GF + H A+ G F+ + +P SV+W KGAVT VK QG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C + A+EG + K LVSLSEQ LVDC+T NNGC GG MD+AF+YI N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 197 GITNDAVYSYEGMSTGICD----SIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSV 251
GI + Y YE + C +I A D + D+P DE+ + +AVA PV+V
Sbjct: 205 GIDTEKSYPYEAIDDS-CHFNKGTIGATDRG-----FTDIPQGDEKKMAEAVATVGPVAV 258
Query: 252 AIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
AIDAS + QFYS GV+N C+ L+HGV VG+GT E G YWL+KNSWG WG+ G
Sbjct: 259 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKG 318
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
+ ++ R+ + QCGIA +S+P+
Sbjct: 319 FIKMLRN---KENQCGIASASSYPL 340
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 183/320 (57%), Gaps = 25/320 (7%)
Query: 28 GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFA 86
G + ++ +K +G++Y E+ +R ++F ++ + N +G +Y + LNKF
Sbjct: 13 GLASANWDLYKKVHGKSYGHDEEHFRR-QLFYKSVAKINAHNLRHDLGLTTYRMGLNKFT 71
Query: 87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQGQC 144
D+T +EF + G K ++ K NGT F + +P V+W EKG VTPVK QGQC
Sbjct: 72 DMTSEEF-RNFKGLKFD--ATKTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQC 128
Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
++EG + +LVSLSEQ LVDC+ + NNGC GG MD+ F YI QN G
Sbjct: 129 GSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGG 188
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
I + Y Y G G C + A++ + DVP DE +L AVA+ PVSVAIDAS
Sbjct: 189 IDTEESYPYTG-KDGDC-AFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDAS 246
Query: 257 --ALQFYSGGVFNGYCETF--LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
+ Q+Y GV++ +F L+HGV VGYGT E G+ YWL+KNSWG WG+DGY ++
Sbjct: 247 NDSFQYYKEGVYDEPSCSFSQLDHGVLVVGYGT-ENGVDYWLVKNSWGPTWGQDGYIKMM 305
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R+ + QCGIA AS+P
Sbjct: 306 RN---KENQCGIASMASYPT 322
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 180/320 (56%), Gaps = 22/320 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
+ +++ +K ++ + YK E R +IF DN + + N N + SY L++NK+ D+
Sbjct: 30 VNQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDM 89
Query: 89 TPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQV-PPSVNWIEKGAVTPVKYQGQC 144
EF+ GF S ++ S G F+ ++ V P V+W ++GAVTPVK QG C
Sbjct: 90 LHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKEGAVTPVKDQGHC 149
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
A A+EG + + LVSLSEQ L+DC+ NNGC GG MD AF+YI NKG
Sbjct: 150 GSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKG 209
Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
+ +A Y YE C A A + Y D+P DE+ L AVA PVSVAIDAS
Sbjct: 210 LDTEASYPYEA-ENDKCRYNPANSGAIDV-GYIDIPTGDEKLLKAAVATIGPVSVAIDAS 267
Query: 257 --ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
+ QFYS GV + C + L+HGV +GYGT+E G YWL+KNSWG+ WG +GY ++
Sbjct: 268 HQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYIKMA 327
Query: 313 RDIDQPQGQCGIAMFASFPV 332
R+ CGIA AS+P+
Sbjct: 328 RN---KLNHCGIASSASYPL 344
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/349 (36%), Positives = 197/349 (56%), Gaps = 29/349 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M F+++ L ++G A+ + D G + +EQWK+ +G++Y++ E +R +++
Sbjct: 1 MRLPFVVLSLCLAGGLAAP----SLDPG-LDTHWEQWKSWHGKSYEQKEETWRRM-VWEK 54
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
+L +E N ++G S+ L +N F D+ +EF G+K + K G+ FL
Sbjct: 55 HLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYK--QTHKKLQGSHFLEP 112
Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
+ +VP V+W ++G VTPVK QGQC A+EG + + +LVSLSEQ LV+C
Sbjct: 113 NFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVEC 172
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ + N GC GG MD AF+Y+ N GI ++ Y Y G C + +AA T + D
Sbjct: 173 SKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPC-HYNPQYNAANDTGFVD 231
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTS 286
+P E +L+KA+A PVSVAIDA ++ QFY G+ F C T L+HGV VGYG
Sbjct: 232 IPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVE 291
Query: 287 E---EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ +G KYW++KNSW + G++GY + +D D CGIA AS+P+
Sbjct: 292 KRDTDGKKYWIVKNSWSEKLGQNGYILMAKDKDN---HCGIATAASYPL 337
>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
Length = 333
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 179/316 (56%), Gaps = 27/316 (8%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
++ +WK+ + R Y + E +R +++ N+ +E N + G +T+ +N F D+T +
Sbjct: 28 QWHKWKSTHRRLYDTNEEEWRR-AVWEKNMKMIELHNGEYSEGKHGFTMEMNAFGDMTNE 86
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
EF G+K H K P + Q+P SV+W EKG VTPVK QGQC
Sbjct: 87 EFRQLVNGYKHQKHRKG-KLFQEPLML---QLPKSVDWREKGCVTPVKNQGQCGSCWAFS 142
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A A+EG +K LVSLSEQ LVDC+ + N GC GG MD AF+Y++ NKG+ ++ Y
Sbjct: 143 ACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYVLNNKGLDSEESY 202
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFY 261
YE G C K E AA T Y D+ P E++L+KAVA P++VAIDAS + QFY
Sbjct: 203 PYEA-KDGTC-KYKPEFAAANDTGYVDI-PQLEKALMKAVATVGPIAVAIDASHPSFQFY 259
Query: 262 SGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
S G+ F C + L+HGV +GY GT KYW++KNSWG WG G+F + +D +
Sbjct: 260 SSGIYFEPNCSSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGWGMGGFFHIAKDKN 319
Query: 317 QPQGQCGIAMFASFPV 332
CGIA AS+P
Sbjct: 320 N---HCGIATAASYPT 332
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 190/316 (60%), Gaps = 29/316 (9%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQE 92
++++K + +TY E S+RFEIF++N+ +E N +G +SY L +N+F+DL +E
Sbjct: 56 WKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEE 115
Query: 93 FIASQTGFKMSDHSSSLKANG-TPFLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA----- 145
F+ G K +SLK G + +L ++ V P SV+W +KG VT VK QGQC
Sbjct: 116 FV-KYNGLK----KTSLKDGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSF 170
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
++EG + K +LVSLSE QLVDC+ + N GC GG MD+AFKYI G+ ++
Sbjct: 171 STTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEED 230
Query: 204 YSYEGMSTGIC--DSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--AL 258
Y Y+ G C D K AA T DV E +L KAV+ PVSVAIDAS +
Sbjct: 231 YPYK-PKQGTCKFDDTKV---AATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSF 286
Query: 259 QFYSGGVFNG-YCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
Q Y+GGV++ C + L+HGV VGYGT ++G YW++KNSWG +WGEDGY ++ R+
Sbjct: 287 QSYAGGVYDEPECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRN-- 344
Query: 317 QPQGQCGIAMFASFPV 332
+ QCGIA AS+P+
Sbjct: 345 -KKNQCGIATQASYPL 359
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 173/311 (55%), Gaps = 22/311 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F +K +YG+ Y E++ RF IFK N+ + N N ++ L +N+F DLT +EF
Sbjct: 27 FNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR---NLTFALGVNEFTDLTQEEF 83
Query: 94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
AS TG K + S L T Y + + SV+W +G VTPVK QGQC
Sbjct: 84 AASYTGLKPASLWSGLPRLST-HEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTT 142
Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
A+EG A+ LVSLSEQQ DC T D+ GC GG+MD+AF + +N I + Y Y
Sbjct: 143 GALEGAWALSTGNLVSLSEQQFEDCDTTDS--GCNGGWMDNAFSFAKKNS-ICTEGSYPY 199
Query: 207 EGMSTGICDSIKAEDHAAQ--ITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
+ G C+ + Q + Y DV + E++++ AVA QPVS+AI+A + Q YS
Sbjct: 200 TA-TDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYS 258
Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
GV C T L+HGV AVGYG SE G YW +KNSWG WGE GY RLQR G+C
Sbjct: 259 SGVLTASCGTRLDHGVLAVGYG-SEAGTDYWKVKNSWGSSWGEQGYVRLQRG-KGGAGEC 316
Query: 323 G-IAMFASFPV 332
G +A S+PV
Sbjct: 317 GLLAGPPSYPV 327
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 172/309 (55%), Gaps = 23/309 (7%)
Query: 39 AQYGRTYKESAENSK---RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIA 95
A + RT+ +S N + R+ ++++N ++ N N SY L +NKF DLT EF
Sbjct: 31 ADWMRTHTKSYSNEEFVFRWNVWRENYNFIQEENRK---NNSYYLTMNKFGDLTNAEFNK 87
Query: 96 SQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAA 148
G + LKA + +P + +W +KGAVT VK QGQC +
Sbjct: 88 VYKGLAFDYSAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCGSCWSFSTTGS 147
Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
EG N +K LVSLSEQ L+DC+ + NNGC GG MD AF+YII NKGI +A Y YE
Sbjct: 148 TEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYET 207
Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVF 266
+ +T+Y DV DE +LL AVA +P SVAIDAS + QFYSGGV+
Sbjct: 208 AQYNC--RYNPANSGGSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNSFQFYSGGVY 265
Query: 267 --NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
+ T L+HGV AVG+GT E G YWL+KNSWG DWG GY ++ R+ CGI
Sbjct: 266 YESSCSSTQLDHGVLAVGWGT-ENGQDYWLVKNSWGADWGLQGYIKMARN---RHNNCGI 321
Query: 325 AMFASFPVS 333
A AS+P +
Sbjct: 322 ATAASYPTA 330
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 129/340 (37%), Positives = 189/340 (55%), Gaps = 26/340 (7%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
++ VL ++ SC FD + + ++ WK + Y ++ E+ +R ++ NL V
Sbjct: 6 VLAVLALAFSCT-----LAFD-AKLNQHWKLWKEANNKRYSDAEEHVRR-ATWEGNLQKV 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
+ N A +G +Y L +NK+AD+T EF+ G+ + + T +P
Sbjct: 59 QEHNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALP 118
Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
+V+W +KG VT VK QGQC A+EG + + +LVSLSEQ LVDC+ N
Sbjct: 119 DTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDCSGKQGN 178
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MD AF+YI +N GI + Y YE + KA + A T + D+ DE
Sbjct: 179 MGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQC--RFKAANVGATDTGFTDITSKDE 236
Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGVFNG-YC-ETFLNHGVTAVGYGTSEEGIKY 292
+L +AVA P+SVAIDA ++ Q Y GV+N +C +T L+HGV AVGYGT + G Y
Sbjct: 237 SALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYGT-DSGKDY 295
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WL+KNSWG+ WG+ GY ++ R+ + QCGIA AS+P+
Sbjct: 296 WLVKNSWGEGWGDKGYIKMTRN---KRNQCGIATAASYPL 332
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/343 (36%), Positives = 191/343 (55%), Gaps = 29/343 (8%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+V L IS A+ + D+ + WK+Q+G++Y E E +R I+++NL +
Sbjct: 5 LLVTLYISAVFAAPSIDIQLDD-----HWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQV 123
E+ N ++GN ++ + +N+F D+T +EF + G+K H + + G F+ K
Sbjct: 59 EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYK---HDPNRTSQGPLFMEPKFFAA 115
Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
P V+W ++G VTPVK Q QC + A+EG K +L+S+SEQ LVDC+
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175
Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
N GC GG MD AF+Y+ +NKG+ ++ Y Y C + A+IT + D+P +
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPC-RYDPRFNVAKITGFVDIPKGN 234
Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETFLNHGVTAVGY---GTSEEG 289
E +L+ AVA PVSVAIDAS +LQFY G+ + C + L+H V VGY G G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAG 294
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+YW++KNSW WG+ GY + +D + CGIA AS+P+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNN---HCGIATMASYPL 334
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/341 (38%), Positives = 187/341 (54%), Gaps = 31/341 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL + S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLFVCSSAVAQ----LLKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--- 121
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNQ 113
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+
Sbjct: 114 MLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEK 173
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
N GC GGFM +AF+YII NKGI ++A Y Y+ M ++ AA + Y ++P
Sbjct: 174 YGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKAMDQKC--QYDSKYRAATCSKYTELPY 231
Query: 235 NDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
E+ L +AVAN+ PV V +DAS F+ SG ++ C +NHGV +GYG G
Sbjct: 232 GREDVLKEAVANKGPVCVGVDASHSSFFLYRSGVYYDPACTQNVNHGVLVIGYG-DLNGE 290
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG ++GE GY R+ R+ CGIA + S+P
Sbjct: 291 EYWLVKNSWGSNFGERGYIRMARN---KGNHCGIASYPSYP 328
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 191/343 (55%), Gaps = 32/343 (9%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+L+ L++ S +Q + ++ ++ WK YG+ YKE E R I++ NL
Sbjct: 14 WLVWALLLCSSAMAQ----VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 69
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-- 121
V N ++G SY L +N D+T +E I+ + ++ S N T YKS
Sbjct: 70 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP---SQWPRNVT---YKSDPN 123
Query: 122 -QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
++P S++W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 124 QKLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 183
Query: 174 ND-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N GC GGFM +AF+YII N GI ++A Y Y+ M G C ++ AA + Y ++
Sbjct: 184 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMD-GKCQ-YDVKNRAATCSRYIEL 241
Query: 233 PPNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEE 288
P EE+L +AVAN+ PVSV IDAS F+ +G ++ C +NHGV VGYG + +
Sbjct: 242 PFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-NLD 300
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G YWL+KNSWG +G+ GY R+ R+ CGIA + S+P
Sbjct: 301 GKDYWLVKNSWGLHFGDQGYIRMARN---SGNHCGIASYPSYP 340
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 128/340 (37%), Positives = 189/340 (55%), Gaps = 20/340 (5%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
I ++ + G+ Q + +A+++ +KA + + Y E R +I+ +N V
Sbjct: 4 ITLIFLLGAVLVQLSAALSLTNLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHKVA 63
Query: 67 RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVP 124
+ N G +SY + +NKF DL EF + G++ +SS + F+ ++ +VP
Sbjct: 64 KHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVP 123
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
SV+W EKGA+TPVK QGQC + A+EG K +L+SLSEQ L+DC+ N
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MD AF+YI NKGI + Y YE +C + A + D+P +E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEA-EDDVC-RYNPRNRGAVDRGFVDIPSGEE 241
Query: 238 ESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKY 292
+ L AVA PVSVAIDAS + QFYS GV + C++ L+HGV VGYG S+ G Y
Sbjct: 242 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDY 300
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WL+KNSW + WG++GY ++ R+ + CG+A AS+P+
Sbjct: 301 WLVKNSWSEHWGDEGYIKIARN---RKNHCGVATAASYPL 337
>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
Length = 331
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 187/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNQ 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII NKGI +DA Y Y+ ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKATDQKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L +AVAN+ PVSV +DA F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDALHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG ++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
FD+ AE + QWK+ + R Y + E +R I++ N+ ++ N + G +++ +N
Sbjct: 21 FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
F D+T +EF G++ H + P + K +P SV+W EKG VTPVK QGQ
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A +EG +K +L+SLSEQ LVDC+ N GC GG MD AF+YI +N
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
G+ ++ Y YE G C +AE A T + D+ P EE+L+KAVA P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANDTGFVDI-PQQEEALMKAVATVGPISVAMDA 251
Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
S +LQFYS G+ + C + L+HGV VGY GT KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
++ +D D CG+A AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 128/341 (37%), Positives = 188/341 (55%), Gaps = 27/341 (7%)
Query: 8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
++LI++ C + + +GS+ + +WKA++ + Y E +R +++ N+ +E
Sbjct: 3 LLLILAAFCVGITSATSMFDGSLNAHWYRWKAKHRKLYGMREEGWRR-AVWEKNMKMIEV 61
Query: 68 FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
N + G +T+ +N F D+T +EF GF+ H FL +VP S
Sbjct: 62 HNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFRNQKHKKGKVFQEPSFL----EVPKS 117
Query: 127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
V+W EKG VTPVK QGQC A A+EG K +L+SLSEQ LVDC+ N G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRPQGNEG 177
Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
C GG MD AF+YI +N G+ ++ Y Y+ M + E A T + D+ P +E++
Sbjct: 178 CDGGLMDYAFQYIKENGGLDSEESYPYDAMDESC--KYRPEYSVANDTGFVDI-PKEEKA 234
Query: 240 LLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYG---TSEEGIK 291
L+KAVA P+SVAIDA + QFY GV F C + ++HGV VGYG T + K
Sbjct: 235 LMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSDNVDHGVLVVGYGYEETESDNNK 294
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+WL+KNSWG++WG GY ++ +D + CGIA AS+P
Sbjct: 295 FWLVKNSWGEEWGLGGYIKMTKD---QKNHCGIATAASYPT 332
>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
Length = 333
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/347 (38%), Positives = 185/347 (53%), Gaps = 33/347 (9%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+ V+ ++ AS+ EG + F +K ++GR+Y E R +F NL
Sbjct: 2 KFVFAVLALVFAPTASE----LISEGELEAHFNLFKTRFGRSYANFEEEIFRKRVFASNL 57
Query: 63 VAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
+ N GN+++ + +N F D++ EF A G + S S+ P ++ +S
Sbjct: 58 EFIFNHNREFFAGNKNFNVAVNNFTDMSNTEFRARFNGLRHSGVQSA------PAIHSAS 111
Query: 122 Q--VPPSVNWIE-KGAVTPVKYQGQC--------AVAAVEGINAIKINRLVSLSEQQLVD 170
+P +V+W + K VTP+K Q QC AVA++EG + +K +LVSLSEQ LVD
Sbjct: 112 AEGLPATVDWTKVKNVVTPIKNQEQCGSCWAFFSAVASMEGQHGLKTGKLVSLSEQNLVD 171
Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
C+ + N GC GG MD AF+Y+I NKGI + Y Y+ + K A I +Y
Sbjct: 172 CSAAEGNMGCEGGLMDQAFQYVIANKGIDTEMSYPYKAIDESW--EFKKNSVGATIKSYV 229
Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN--GYCETFLNHGVTAVGYGT 285
DV E SL AVA P+SV IDAS L QFYS GV+ T L+HGVTAVGYG
Sbjct: 230 DVKTGSESSLQSAVATVGPISVGIDASQLSFQFYSSGVYEEPACSTTILDHGVTAVGYG- 288
Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ G YW +KNSWG WG GY + R+ Q QCGIA AS+PV
Sbjct: 289 ALNGTPYWKVKNSWGTSWGMSGYIFMSRN---KQNQCGIATAASWPV 332
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 112/209 (53%), Positives = 137/209 (65%), Gaps = 10/209 (4%)
Query: 137 PVKYQGQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
P G C +AAVEGIN I L+SLSEQ+LVDC T+ N GC GG MD AF++I
Sbjct: 708 PFAVAGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFI 766
Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
I N GI + Y Y+G + G CD + I +YEDVP NDE+SL KAVANQPVSVA
Sbjct: 767 INNGGIDTEKDYPYKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVA 825
Query: 253 IDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
I+A + Q YS G+F G C T L+HGVT VGYGT E G YW++KNSWG WGE GY R
Sbjct: 826 IEAAGTTFQLYSSGIFTGSCGTALDHGVTVVGYGT-ENGKDYWIMKNSWGSSWGESGYVR 884
Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQP 339
++R+I G+CGIA+ S+P+ KE A P
Sbjct: 885 MERNIKASSGKCGIAVEPSYPL-KEGANP 912
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 185/344 (53%), Gaps = 26/344 (7%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M V+L IS + A D ++ WK+ +G+ Y E + R I+++
Sbjct: 1 MEAVIFAVLLCISSALAMPPMEPLQDP-----NWKAWKSFHGKEYPNKNEETMRNFIWQN 55
Query: 61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
NL + N G S+ L +N D+T E + G K+ H+ S T +
Sbjct: 56 NLKKIVTHNE---GKHSFKLAMNHLGDMTSLEISQTLLGLKLKKHAESQPKGATFLPPAN 112
Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
+V S++W KG VTPVK QGQC A+EG + K +LVSLSEQ LVDC+
Sbjct: 113 VKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSG 172
Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
NNGC GG MD+AF+YI +N GI + Y Y G+C K+ A+ T + D+P
Sbjct: 173 KYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLA-KDGVCHYNKSAI-GAKDTGFVDIP 230
Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEE 288
DE +L +A+A+ P+S+AIDA S FY GV++ T L+HGV AVGYGT ++
Sbjct: 231 TGDENALQQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSSTRLDHGVLAVGYGT-DD 289
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G YWL+KNSWG WGE+GY ++ R+ +CG+A AS+P+
Sbjct: 290 GKDYWLVKNSWGPSWGEEGYIKIARN---DHDKCGVASKASYPL 330
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/293 (41%), Positives = 170/293 (58%), Gaps = 24/293 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
F ++A YG++Y E KR+ IFK+NL + N SY+L++N F DL+ +EF
Sbjct: 119 FGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGY---SYSLKMNHFGDLSREEF 175
Query: 94 IASQTGFKMSDHSSSLKAN----GTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC---- 144
G+ + S +LK+N T L S S VP +V+W EKG VTPVK Q C
Sbjct: 176 RRKYLGY---NKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCW 232
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
A A+EG + K L+SLSEQ+LVDC+ + N GC GG M+DAF+Y++ + G+ ++
Sbjct: 233 AFSATGALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSE 292
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--Q 259
Y Y G C +A I+ ++DVP E ++ A+A+ PVS+AI+A L Q
Sbjct: 293 EGYPYLARD-GECK--RACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQ 349
Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGYFRL 311
FY GVF+ C T L+HGV VGYGT +E K +W++KNSWG WG DGY +
Sbjct: 350 FYHEGVFDASCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYM 402
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 183/318 (57%), Gaps = 23/318 (7%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLRLNKFAD 87
S+ ++E WK YG+ Y + E + R I+ NL ++ N + G +YT +N+F D
Sbjct: 17 SVNTEWESWKRTYGKEYTQK-EEALRHMIWNVNLKMIQMHNEKYMSGKSTYTQNMNQFGD 75
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
LT +E+ G+K S+ + K + T L + + P S++W +G VT VK QG C
Sbjct: 76 LTNEEYRELMCGYKKSNKTVISKPS-TFLLPSNYRAPASIDWRTQGYVTDVKDQGACGSC 134
Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
+ ++EG K +LV LSEQQLVDC+ + N GC GG+MD AF YI ++KG +
Sbjct: 135 WAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCSGDYGNMGCGGGWMDQAFSYI-KDKGEES 193
Query: 201 DAVYSYEGMS-TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--S 256
+ Y Y G T + D+ K A T Y D+P DE +L +AVA P+SVAIDA S
Sbjct: 194 EDGYPYTGTDDTCVYDASKV---VATDTGYTDIPEMDENALQQAVATVGPISVAIDATHS 250
Query: 257 ALQFYSGGVFNG--YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
+ QFY GV++ +T L+H V AVGYGTSEEG+ YW++KNSW WG GY + R+
Sbjct: 251 SFQFYESGVYDEPECSQTNLDHAVLAVGYGTSEEGLDYWIVKNSWSTGWGMQGYIEMSRN 310
Query: 315 IDQPQGQCGIAMFASFPV 332
D QCGIA AS+PV
Sbjct: 311 KDN---QCGIASKASYPV 325
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 133/349 (38%), Positives = 192/349 (55%), Gaps = 35/349 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
LI+ L+ + A +Y I E++ +K ++ + Y++ E R +IF +N +
Sbjct: 5 LILPLLALVAVAQAVSYAEV----IQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 66 ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN----GTPFLY-K 119
+ N A G S+ + +NK+AD+ EF ++ GF + H A+ G F+ +
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+P V+W KGAVT VK QG C + A+EG + K LVSLSEQ LVDC+
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD----SIKAEDHAAQITN 228
T NNGC GG MD+AF+YI N GI + Y YE + C SI A D
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDS-CHFNKGSIGATDRG----- 234
Query: 229 YEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGY 283
+ D+P +E+ + +AVA PV+VAIDAS + QFYS GV+N C+ L+HGV VG+
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294
Query: 284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GT E G YWL+KNSWG WG+ G+ ++ R+ + QCGIA +S+P+
Sbjct: 295 GTDESGEDYWLVKNSWGTTWGDKGFIKMLRN---KENQCGIASASSYPL 340
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 187/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
LI VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LICVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNANQ 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII N GI +DA Y Y+ ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L + VAN+ PVSV +DAS F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG+++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 187/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--- 121
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNQ 113
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 MLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII N GI +DA Y Y+ ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L + VAN+ PVSV +DAS F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG+++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 190/343 (55%), Gaps = 32/343 (9%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
+L+ L++ S A + ++ ++ WK YG+ YKE E R I++ NL
Sbjct: 3 WLVWALLL----CSSAMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-- 121
V N ++G SY L +N D+T +E I+ + ++ S N T YKS
Sbjct: 59 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP---SQWPRNVT---YKSDPN 112
Query: 122 -QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
++P S++W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 113 QKLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172
Query: 174 ND-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N GC GGFM +AF+YII N GI ++A Y Y+ M G C ++ AA + Y ++
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMD-GKC-QYDVKNRAATCSRYIEL 230
Query: 233 PPNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEE 288
P EE+L +AVAN+ PVSV IDAS F+ +G ++ C +NHGV VGYG + +
Sbjct: 231 PFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-NLD 289
Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
G YWL+KNSWG +G+ GY R+ R+ CGIA + S+P
Sbjct: 290 GKDYWLVKNSWGLHFGDQGYIRMARN---SGNHCGIANYPSYP 329
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 112/216 (51%), Positives = 138/216 (63%), Gaps = 12/216 (5%)
Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
SV+W +KG VT +K QG C A+AAVEG+ + LVSLSEQ+LVDC T N
Sbjct: 1 SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTT-VNQ 59
Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
GC GG MD AF+Y+I+N GIT+ + Y Y G CD K + HAA I ++ +PP EE
Sbjct: 60 GCDGGMMDYAFQYMIRNGGITSQSNYPYR-AQRGACDKDKVKYHAATINGFQAIPPQSEE 118
Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
LL+AVANQPVSVAI+A Q YS GVF G C + L+HGV VGYGT G +YWL+K
Sbjct: 119 LLLRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVK 178
Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
NSWG WGE GY R++R G CGI + AS+P
Sbjct: 179 NSWGSGWGESGYVRMERQ-GPGAGVCGINLDASYPT 213
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 130/339 (38%), Positives = 184/339 (54%), Gaps = 27/339 (7%)
Query: 10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
L ++ C A+ S+ E + QWKA +G+ Y E +R E++K N+ + + N
Sbjct: 5 LFLAALCLGIASAAPQLNQSLDELWSQWKATHGKLYGMDEEGWRR-EVWKKNMKMIRQHN 63
Query: 70 -NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
+ G S+T+ +N F D+T +EF G +M H K P K +P SV+
Sbjct: 64 WEHSQGKHSFTVAMNGFGDMTNEEFKQVMNGLQMQKHKKG-KMFQAPLFAK---IPSSVD 119
Query: 129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
W EKG VTPVK QG C A A+EG K +LVSLSEQ LVDC+ + N GC
Sbjct: 120 WREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQAEGNEGCN 179
Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
GG M++AF+Y+ N G+ ++ Y Y K +D AA T + D+ P E++L+
Sbjct: 180 GGLMNNAFQYVKDNGGLDSEESYPYHAQDESC--KYKPQDSAANDTGFFDI-PQQEKALM 236
Query: 242 KAVANQ-PVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGTS---EEGIKYW 293
AVA + P+SV IDAS QFY G+ ++ C + L+HGV +GYGT YW
Sbjct: 237 VAVATKGPISVGIDASHFTFQFYHEGIYYDPDCSSEDLDHGVLVIGYGTEIGQSINKTYW 296
Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
++KNSWG +WG DGY ++ +D + CGIA ASFPV
Sbjct: 297 IVKNSWGANWGIDGYIKMAKD---RKNHCGIATMASFPV 332
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 124/313 (39%), Positives = 174/313 (55%), Gaps = 18/313 (5%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
+F Q++ + + Y E KR+ IFK+NL + N + SY L++NKF DLT +E
Sbjct: 88 QFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHN---MQGYSYVLKMNKFGDLTLEE 144
Query: 93 FIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
F G+K D + + + T + + +P V+W ++G VT VK QG C
Sbjct: 145 FRQRYLGYKKPDLRTPPREVDTTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFS 204
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
A A+EG+ K +LV+LS+QQLVDC+ N GC GG M++AF+Y+++N GI + Y
Sbjct: 205 ATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENY 264
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDA--SALQFY 261
Y G+C S + A IT Y VP E+S+ A+A PVSVAI A +A QFY
Sbjct: 265 PYM-RKDGVCKSSQCTS-VATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFY 322
Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGI-KYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
G+F+ C T L+HGV VGY G YW++KNSWG WG+ GY L P G
Sbjct: 323 YDGIFDAPCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYM-LMAMHKGPAG 381
Query: 321 QCGIAMFASFPVS 333
QCG+ + SFPV+
Sbjct: 382 QCGVLLDGSFPVA 394
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/348 (37%), Positives = 194/348 (55%), Gaps = 31/348 (8%)
Query: 8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
+VL++ A A + FD + E++ +K Q+ YK E++ R +I+ ++ + +
Sbjct: 4 LVLLLCAVAAVSAV-QFFD--LVKEEWSAFKLQHRLNYKSEVEDNFRMKIYAEHKHIIAK 60
Query: 68 FNNA-AIGNRSYTLRLNKF---ADLTPQEFIASQTGF-KMSDHSSSL-----KANGTPFL 117
N +G SY L +N + D+ EF+ + GF K + H+ +L G F+
Sbjct: 61 HNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 120
Query: 118 YKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
++ ++P V+W + GAVT +K QG+C A+EG + + LVSLSEQ L+
Sbjct: 121 SPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLI 180
Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
DC+ NNGC GG MD+AFKYI N GI + Y YEG+ ++ A+ +
Sbjct: 181 DCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKC--RYNPKNTGAEDVGF 238
Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY--CETFLNHGVTAVGYG 284
D+P DE+ L++AVA PVSVAIDAS Q YS GV+N T L+HGV VGYG
Sbjct: 239 VDIPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSSTDLDHGVLVVGYG 298
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
T E+G+ YWL+KNSWG+ WGE GY ++ R+ +CGIA AS+P+
Sbjct: 299 TDEQGVDYWLVKNSWGRSWGELGYIKMIRN---KNNRCGIASSASYPL 343
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 179/311 (57%), Gaps = 24/311 (7%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
KF+ +K ++G+TYK E + RF IFKDNL A+E+ N G SY +N+F D+T +
Sbjct: 24 KFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQE 83
Query: 92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
EF A F S N T + VP S++W KG VT VK QG C
Sbjct: 84 EFRA----FLTLSSSKKPHFNTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFS 139
Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
+ E K +LVSLSEQQLVDC+T D N GC GG++D+ F Y +++KG+ ++ Y
Sbjct: 140 VTGSTEAAYYRKAGKLVSLSEQQLVDCST-DINAGCNGGYLDETFTY-VKSKGLEAESTY 197
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQFYSG 263
Y+G + G C A +++ ++ + DE +LL AV N PVSVAIDA+ L Y
Sbjct: 198 PYKG-TDGSC-KYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYLSSYES 255
Query: 264 GVF-NGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
G++ + +C + LNHGV VGYGTS G KYW++KNSWG +GE GYFRL R + +
Sbjct: 256 GIYEDDWCSPSELNHGVLVVGYGTS-NGKKYWIVKNSWGGSFGESGYFRLLRG----KNE 310
Query: 322 CGIAMFASFPV 332
CG+A +P+
Sbjct: 311 CGVAEDTVYPI 321
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 189/338 (55%), Gaps = 25/338 (7%)
Query: 10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
L ++ C A+ + S+ ++ QWKA + R Y + E +R +++ N+ +E N
Sbjct: 5 LFLTALCLGIASAAPKFDQSLNAQWYQWKATHRRLYGMNEEGWRR-AVWEKNMKMIELHN 63
Query: 70 NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
+ G +T+ +N F D+T +EF GF+ H K P +++P SV+
Sbjct: 64 REYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKG-KMFQEPLF---AEIPKSVD 119
Query: 129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
W EKG VTPVK QGQC A A+EG K +LVSLSEQ LVDC+ N GC
Sbjct: 120 WREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCN 179
Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
GG MD+AF+Y+ N G+ ++ Y Y G T C+ K E AA T + D+ P E++L+
Sbjct: 180 GGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCN-YKPECSAANDTGFVDL-PQREKALM 237
Query: 242 KAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG--TSEEGIKYWL 294
KAVA P+SVAIDA + QFY G+ F+ C + L+HGV VGYG ++ K+W+
Sbjct: 238 KAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWI 297
Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+KNSWG +WG +GY ++ +D + CGIA AS+P
Sbjct: 298 VKNSWGPEWGWNGYVKMAKDQNN---HCGIATAASYPT 332
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 132/342 (38%), Positives = 181/342 (52%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
+ L + A+ Y++ D ++ QWKA +G+ Y E+ E +R +++ NL +
Sbjct: 6 FLAALCLGIVSAAPKLYQSLDA-----RWSQWKAAHGKLYDENEEGWRR-AVWEKNLKVI 59
Query: 66 ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
++ N + G S+T+ +N F DLT +EF G K PF ++ P
Sbjct: 60 KQHNQEYSQGKHSFTMAMNAFGDLTNEEFKQVMNGLKSQKRKEGNVFQAPPF----AETP 115
Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
SV+W +KG VTPVK QG C A A+EG K RLVSLSEQ LVDC+ + N
Sbjct: 116 SSVDWRKKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGN 175
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MD AF+Y+ N G+ ++ Y Y K E AA T + D+ P +E
Sbjct: 176 EGCSGGLMDYAFQYVKDNGGLDSEESYPYRAQDESC--KYKPEQSAANDTGFMDIHP-EE 232
Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCETF-LNHGVTAVGYGT---SEEG 289
ESL AVA P+S AIDA S QFY G+ ++ C + L+HG+ VGYG+ E
Sbjct: 233 ESLKLAVATVGPISAAIDASLSTFQFYHKGIYYDPDCSSENLDHGILVVGYGSQGEDSEK 292
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
KYW++KNSWG DWG GY + +D D CGIA ASFP
Sbjct: 293 QKYWIVKNSWGTDWGTQGYILMAKDRDN---HCGIATAASFP 331
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 187/342 (54%), Gaps = 32/342 (9%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
L+ VL++ S +Q + ++ + WK YG+ YKE E + R I++ NL V
Sbjct: 4 LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59
Query: 66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
N ++G SY L +N D+T +E ++ + ++ S + N T YKS+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNANQ 113
Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
+P SV+W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+T
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
N GC GGFM AF+YII N GI +DA Y Y+ ++ AA + Y ++P
Sbjct: 174 KYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC--QYDSKYRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E+ L + VAN+ PVSV +DAS F+ SG + C +NHGV VGYG G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
+YWL+KNSWG+++GE+GY R+ R+ CGIA F S+P
Sbjct: 291 KEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSYP 329
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 178/312 (57%), Gaps = 22/312 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
++ WK Q+G+ YK E R E+++ NL + N A++G +Y L +N D+T +E
Sbjct: 30 WQMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDMTEEE 89
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------- 144
+ S K+ + LK + F+ S + VP +V+W +KG VT VK QG C
Sbjct: 90 ILQSFASLKVP---ADLKREPSAFVASSGTPVPDTVDWRQKGYVTQVKNQGSCGSCWAFS 146
Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
+V A+EG +L+ LS Q LVDC++ N GC GGFM +AF+Y+I NKGI +D Y
Sbjct: 147 SVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGIDSDTSY 206
Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQF--Y 261
Y+G+ G C +A T Y +P DE +L +AVA P+SVAIDA+ F +
Sbjct: 207 PYQGVQ-GTC-HYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSFILW 264
Query: 262 SGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
GV+N C +NH V VGYGT +G YWL+KNSWG +GE+GY R+ R+ +
Sbjct: 265 RSGVYNDLTCTQKINHAVLVVGYGTL-DGQDYWLVKNSWGTRFGENGYIRMSRNRNN--- 320
Query: 321 QCGIAMFASFPV 332
QCGIA++ +P+
Sbjct: 321 QCGIALYGCYPI 332
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 131/324 (40%), Positives = 183/324 (56%), Gaps = 28/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
FD+ ++ ++ QWKA + R Y + E +R +++ N+ +E N + G +T+ +N
Sbjct: 21 FDQ-NLDTQWYQWKATHKRLYGLNEEGWRR-AVWEKNMRMIELHNGEYSQGKHGFTMGMN 78
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
+ D+T +EF GF+ H K P L Q P SV+W EKG VTPVK QGQ
Sbjct: 79 AYGDMTNEEFRQVMNGFQNQKHKKG-KMFRDPLLL---QYPKSVDWREKGYVTPVKNQGQ 134
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A A+EG K +L+SLSEQ LVDC+ N GC GG MD AF+Y+ N
Sbjct: 135 CGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSHPQGNQGCNGGLMDYAFQYVKDNS 194
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
G+ ++ Y YEGM G C K E A T + D+P + E++LL+AVA P+S AIDA
Sbjct: 195 GLDSEESYPYEGMD-GTC-KYKPECSVANDTGFVDIPGH-EKALLRAVATVGPISAAIDA 251
Query: 256 SAL--QFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
+ QFY G+ ++ C + L+HG+ VGY GT+ KYWL+KNSWG WG++GY
Sbjct: 252 GHMSFQFYKSGIYYDPDCSSKDLDHGILVVGYGFEGTNSNATKYWLVKNSWGTTWGDEGY 311
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
++ RD D CGIA AS+P
Sbjct: 312 VKIIRDKDN---HCGIATAASYPT 332
>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
Length = 480
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 178/342 (52%), Gaps = 39/342 (11%)
Query: 23 RTFDEGSIAEK----FEQWKAQYGRTYKES--AENSKRFEIFKDNLVAVERFNNAAIGNR 76
R +EG + ++ W A+ G + E+ +RF +F DNL V+ N A
Sbjct: 37 RGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERG 96
Query: 77 SYTLRLNKFA---------DL---------------TPQEFIASQTGFKMSDHSSSLKAN 112
+ L +N+ DL P G + + +
Sbjct: 97 GFRLGMNRLRRSHQRGVPRDLPRRQGRREEPRRRGEVPPRRGGGAAGVRRLEGEGRRRPR 156
Query: 113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCA 172
P +S V SV + +G+ AV+ VE IN + +++LSEQ+LV+C+
Sbjct: 157 QEPGPMRSFSVHLSVKYFGQGSCWAFS-----AVSTVESINQLVTGEMITLSEQELVECS 211
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
TN N+GC GG MDDAF +II+N GI + Y Y+ + G CD + I +EDV
Sbjct: 212 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVD-GKCDINRENAKVVSIDGFEDV 270
Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
P NDE+SL KAVA+QPVSVAI+A Q Y GVF+G C T L+HGV AVGYGT + G
Sbjct: 271 PQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT-DNGK 329
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YW+++NSWG WGE GY R++R+I+ G+CGIAM AS+P
Sbjct: 330 DYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPT 371
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 131/342 (38%), Positives = 189/342 (55%), Gaps = 26/342 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
FL+ +++ S A T +A+++ +KA + + Y E R +I+ +N
Sbjct: 4 FLLGAVLVQLSAALSLT------NLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHK 57
Query: 65 VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-Q 122
V + N G +SY + +NKF DL EF + G++ +SS + F+ ++
Sbjct: 58 VAKHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVT 117
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP SV+W EKGA+TPVK QGQC + A+EG K +LVSLSEQ L+DC+
Sbjct: 118 VPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKY 177
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N GC GG MD AF+YI NKGI + Y YE +C + A + D+P
Sbjct: 178 GNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEA-EDDVC-RYNPRNRGAVDRGFVDIPSG 235
Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGI 290
+E+ L AVA PVSVAIDAS + QFYS GV + C++ L+HGV VGYG S+ G
Sbjct: 236 EEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGK 294
Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSW + WG++GY ++ R+ + CG+A AS+P+
Sbjct: 295 DYWLVKNSWSEHWGDEGYIKMARN---RKNHCGVASAASYPL 333
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 186/343 (54%), Gaps = 25/343 (7%)
Query: 7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
++ L + C S A + + + +E WK+ + + Y E E +R +++ NL +E
Sbjct: 1 MLPLAVVALCLSAALSAPSLDPQLDDHWELWKSWHSKKYHEKEEGWRRM-VWEKNLKKIE 59
Query: 67 RFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVP 124
N ++G SY L +N F D+T +EF G+K + KA G+ FL + + P
Sbjct: 60 LHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYK---RKAETKARGSLFLEPNFLEAP 116
Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
SV+W + G VTPVK QGQC A+EG + K +LVSLSEQ LVDC+ + N
Sbjct: 117 KSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGN 176
Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
GC GG MD AF+Y+ N+G+ ++ Y Y G C ++ T + D+P E
Sbjct: 177 EGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPC-HYDPTYNSVNDTGFVDIPSGKE 235
Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEE---G 289
+L+KAVA PVSVAIDA + QFY G+ + C + L+HGV VGYG E G
Sbjct: 236 RALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVDG 295
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
KYW++KNSW + WG+ GY + +D + CGIA AS+P+
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATAASYPL 335
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 126/345 (36%), Positives = 191/345 (55%), Gaps = 29/345 (8%)
Query: 4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
+ L+V L IS A+ + D+ + WK+Q+G++Y E E +R I+++NL
Sbjct: 3 FALLVTLCISAVFAASSIDIQLDD-----HWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56
Query: 64 AVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS- 121
+E+ N + GN ++ + +N+F D+T +EF + G+K H + + G F+ S
Sbjct: 57 KIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYK---HDPNRTSQGPLFMEPSFF 113
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
P V+W ++G VTPVK Q QC + A+EG K +L+S+SEQ LVDC+
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
N GC GG MD AF+Y+ +NKG+ ++ Y Y C + A+IT + D+P
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPC-RYDPRFNVAKITGFVDIPR 232
Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETFLNHGVTAVGY---GTSE 287
+E +L+ AVA PVSVAIDAS +LQFY G+ + C + L+H V VGY G
Sbjct: 233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADV 292
Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
G +YW++KNSW WG+ GY + +D + CGIA AS+P+
Sbjct: 293 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNN---HCGIATMASYPL 334
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 132/349 (37%), Positives = 192/349 (55%), Gaps = 35/349 (10%)
Query: 6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
LI+ L+ + A +Y I E++ +K ++ + Y++ E R +IF +N +
Sbjct: 5 LILPLLALVAVAQAVSYAEV----IQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 66 ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN----GTPFLY-K 119
+ N A G S+ + +NK+AD+ EF ++ GF + H A+ G F+ +
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
+P V+W KGAVT VK QG C + A+EG + K LVSLSEQ LVDC+
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD----SIKAEDHAAQITN 228
T NNGC GG MD+AF+YI N GI + Y YE + C +I A D
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDS-CHFNKGTIGATDRG----- 234
Query: 229 YEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGY 283
+ D+P +E+ + +AVA PV+VAIDAS + QFYS GV+N C+ L+HGV VG+
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294
Query: 284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
GT E G YWL+KNSWG WG+ G+ ++ R+ + QCGIA +S+P+
Sbjct: 295 GTDESGQDYWLVKNSWGTTWGDKGFIKMLRN---KENQCGIASASSYPL 340
>gi|342305192|dbj|BAK55650.1| cathepsin S [Oplegnathus fasciatus]
Length = 337
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 190/341 (55%), Gaps = 25/341 (7%)
Query: 5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
++ L++ C A FD G + +E WK + + Y+ E+ +R E+++ NL+
Sbjct: 8 LMLGSLLLVSLCVGAAA--MFD-GRLDVHWELWKRTHEKKYQNEGEDVRRRELWEKNLML 64
Query: 65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
+ N A++G +Y L +N DLT +E + S F + ++ +PF S +
Sbjct: 65 ITMHNLEASMGLHTYELSMNHMGDLTQEEILQS---FATLSPPTDIQRAPSPFAGTSGAA 121
Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
VP +V+W EKG VT VK QG C A A+EG A +L+ LS Q LVDC++
Sbjct: 122 VPDTVDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDCSSKY 181
Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
N+GC GGFM AF+Y+I N+GI +DA Y Y G S C A AA + Y +P
Sbjct: 182 GNHGCNGGFMHRAFQYVIDNQGIDSDASYPYTGQSQQ-CHYNPAY-RAANCSRYSFLPEG 239
Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIK 291
DE +L +A+A P+SVAIDA+ + FY GV++ C +NHGV AVGYGT G
Sbjct: 240 DEGALKEALATIGPISVAIDATRPSFTFYRSGVYDDQTCTRNVNHGVLAVGYGTL-NGKD 298
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
YWL+KNSWG +G+ G+ R+ R+ + QCGIA++ +P+
Sbjct: 299 YWLVKNSWGSTFGDKGFIRMARNKND---QCGIALYGCYPI 336
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 187/324 (57%), Gaps = 28/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
FD+ + ++ QWK+ + R Y + E +R +++ N+ ++ N + G +T+ +N
Sbjct: 21 FDQ-TFNAQWHQWKSTHRRLYGTNEEEWRR-AVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
F D+T +EF G++ H + P + Q+P +V+W EKG VTPVK QGQ
Sbjct: 79 AFGDMTNEEFRQIVNGYRHQKHKKG-RLFQEPLML---QIPKTVDWREKGCVTPVKNQGQ 134
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A +EG +K +L+SLSEQ LVDC+ + N GC GG MD AF+YI +N
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENG 194
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
G+ ++ Y YE G C +AE A T + D+ P E++L+KAVA P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDA 251
Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
S +LQFYS G+ + C + L+HGV VGY GT KYWL+KNSWG++WG DGY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGY 311
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
++ +D + CG+A AS+P+
Sbjct: 312 IKIAKDRNN---HCGLATAASYPI 332
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 133/350 (38%), Positives = 190/350 (54%), Gaps = 30/350 (8%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M Y + L + A+ + + ++ + ++ WK + + Y + E +R I++
Sbjct: 1 MKVYLCALALFLEACFAAPSL-----DSALDDHWQAWKTWHSKKYHQQEEGWRRM-IWEK 54
Query: 61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL ++ N + ++G SY L +N F D+T +EF G+K S + K G+ FL
Sbjct: 55 NLKMIQLHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGYKHS--KTEKKYRGSEFLEP 112
Query: 120 SSQV-PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
+ V P SV+W EKG VTPVK QGQC ++EG + K +LVSLSEQ LVDC
Sbjct: 113 NFLVVPKSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDC 172
Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
+ + N GC GG MD AF+YI N GI ++ Y Y C K+E +AA T + D
Sbjct: 173 SRPEGNQGCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDC-LYKSEFNAANDTGFVD 231
Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYG-- 284
VP E +L+KAVA PVSVAIDA S QFY G+ ++ C + L+HGV VGYG
Sbjct: 232 VPEGHERALMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELDHGVLVVGYGFE 291
Query: 285 --TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ KYW++KNSW WG+ GY + +D + CGIA AS+P+
Sbjct: 292 GTDDDNKKKYWIVKNSWSDKWGDKGYILMAKDRNN---HCGIATAASYPL 338
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 197/348 (56%), Gaps = 32/348 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M L+ VL + + A+ FD S+ ++E WKA + + Y + E ++ ++K
Sbjct: 1 MNPSLLLTVLCLGIASAAP----KFDH-SLNTQWELWKAVHRKPYDLNEEGWRK-AVWKK 54
Query: 61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
N+ +E N + G S+++ +N F DLT +EF GF+ ++ + T F
Sbjct: 55 NMKMIELHNQEYSQGKHSFSMAMNAFGDLTSEEFRQMMNGFQRQENKKGKVFHETIF--- 111
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
+ +PPSV+W EKG VTPVK QG+C A+EG K +LVSLSEQ LVDC+
Sbjct: 112 -ASIPPSVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCS 170
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
+ N GC+GG MD+AF+Y++ G+ ++ Y Y G+ G C+ ++ AA T + D+
Sbjct: 171 QPEGNRGCHGGLMDNAFQYVLDVGGLDSEESYPYTGL-VGTCN-YNPKNSAANETGFVDL 228
Query: 233 PPNDEESLLKAVANQ-PVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGY---G 284
P E +L+KAVA P+SVA+DAS + QFY G+ + C++ ++HGV VGY G
Sbjct: 229 -PKQENALMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKSESVDHGVLVVGYGFEG 287
Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+ KYWL+KNSWG+ WG +GY ++ +D + CGIA AS+P
Sbjct: 288 ADSDDNKYWLVKNSWGKHWGINGYIKMAKDQNN---HCGIATMASYPT 332
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 132/342 (38%), Positives = 189/342 (55%), Gaps = 27/342 (7%)
Query: 3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
K+ L V L+ S + A R + ++ ++ WK Y + YKE E R I++ NL
Sbjct: 2 KWLLWVALVCSSAMA-----RLHKDPTLDNHWDLWKKTYSKQYKEKNEEVARRLIWEKNL 56
Query: 63 VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
V N ++G SY L +N D+T +E ++ + ++ S + N T +
Sbjct: 57 KFVMLHNLEHSMGMHSYDLSMNHLGDMTSEEVMSLMSSLRVP---SQWQRNVTFKSNPNQ 113
Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
++P S++W EKG VT VKYQG C AV A+E +K +LVSLS Q LVDC+
Sbjct: 114 KLPDSLDWREKGCVTDVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGE 173
Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
+N GC GGFM AF+YII N GI ++A Y Y+ + G C ++ AA + Y ++P
Sbjct: 174 KYSNKGCNGGFMTRAFQYIIDNNGIDSEASYPYKA-TDGKC-QYDPKNRAATCSKYTELP 231
Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
E++L +AVAN+ PVSV IDAS F+ SG ++ C +NHGV VGYG + G
Sbjct: 232 YGSEDALKEAVANKGPVSVGIDASRPSFFLYKSGVYYDPSCTDNVNHGVLVVGYG-NLNG 290
Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
YWL+KNSWG ++GE GY R+ R+ CGIA F S+P
Sbjct: 291 KDYWLVKNSWGLNFGEQGYIRMARN---SGNHCGIASFPSYP 329
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 127/307 (41%), Positives = 180/307 (58%), Gaps = 26/307 (8%)
Query: 41 YGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTG 99
+G+ Y + E ++R I++ NL +E+ N AA G+ S+ L +N++ D+T +EF ++ G
Sbjct: 34 HGKQYG-AEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRSTMNG 92
Query: 100 FKMSDHSSSLKANGTPFLYKSS--QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVE 150
+KM + +S G+ +L S+ +P +V+W KG VTP+K QGQC A ++E
Sbjct: 93 YKMRNGTS----RGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSLE 148
Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
G K +L SLSEQ LVDC+ N+GC GG MDDAF+YI N GI ++ Y YE
Sbjct: 149 GQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGIDTESSYPYEA-K 207
Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN 267
G C A + A + + D+ E L AVA P+SVAIDAS + Q Y GV++
Sbjct: 208 NGKC-RFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMSFQLYRSGVYH 266
Query: 268 GY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
+ ET L+HGV AVGYGT E G YWL+KNSWG+ WG+ GY + R+ + CGIA
Sbjct: 267 EFFCSETRLDHGVLAVGYGT-ESGKDYWLVKNSWGESWGQKGYIMMSRN---KRNNCGIA 322
Query: 326 MFASFPV 332
AS+P
Sbjct: 323 TSASYPT 329
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 116/254 (45%), Positives = 153/254 (60%), Gaps = 16/254 (6%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
+ E FE W +++G+ Y+ E RFEIFKDNL ++ N +Y L LN+FADL+
Sbjct: 4 LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVV---SNYWLGLNEFADLS 60
Query: 90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
EF G K+ S+ + + F Y+ +P SV+W +KGAVT +K QG C
Sbjct: 61 HHEFKKQYLGLKVD--FSTRRESSEEFTYRDVDLPKSVDWRKKGAVTNIKNQGSCGSCWA 118
Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
VAAVEGIN I L SLSEQ+L+DC N+GC GG MD AF +I++N G+ +
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRT-YNSGCNGGLMDYAFSFIVENGGLHKED 177
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
Y Y M G C+ K E I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS QF
Sbjct: 178 DYPYI-MEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 236
Query: 261 YSGGVFNGYCETFL 274
YSGGVF+G+C T L
Sbjct: 237 YSGGVFDGHCGTQL 250
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 118/268 (44%), Positives = 157/268 (58%), Gaps = 16/268 (5%)
Query: 75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS-VNWIEKG 133
NRSY + LN+FADLT +EF ++ GF + + + P + SQV PS V+W G
Sbjct: 12 NRSYKVGLNQFADLTGEEFRSTYLGFTGGSNKTKVSNRYEP---RVSQVLPSYVDWRSAG 68
Query: 134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
AV +K QG+C A+A VEGIN I L+SLSEQ+L+ C N GC GG++
Sbjct: 69 AVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQNTRGCNGGYIT 128
Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
D F++II N GI Y Y G C+ + I Y +VP N+E +L AV
Sbjct: 129 DGFQFIINNGGINTGENYPYTAQD-GECNLDLQNEKYVTIDTYGNVPYNNEWALQTAVTY 187
Query: 247 QPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
QPVSVA+DA+ A + YS G+F G C T ++H VT VGYGT E GI YW+++NSW WG
Sbjct: 188 QPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVENSWDTTWG 246
Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPV 332
E+GY R+ R++ G CGIA S+PV
Sbjct: 247 EEGYMRILRNVGG-AGTCGIATMPSYPV 273
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 142/348 (40%), Positives = 187/348 (53%), Gaps = 32/348 (9%)
Query: 1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
M F++ L + A +T D +++QWKA +GR Y + E +R +++
Sbjct: 1 MTPSFVLAALCLGIVSALPKLDQTLDA-----QWDQWKAAHGRLYGLNEEGWRR-AVWEK 54
Query: 61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
NL +E N + G S+TL +N F D+T +EF GF+ H + K P L
Sbjct: 55 NLRMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQHQKHKTG-KMYQEPLLL- 112
Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
Q+P SV+W EKG VT VK QGQC A ++EG K LVSLSEQ LVDC+
Sbjct: 113 --QLPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCS 170
Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
N GC GG MD AF+Y+ NKG+ + Y Y G G C K E AA T + DV
Sbjct: 171 RPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVG-KDGEC-KYKPELSAANDTGFVDV 228
Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGT-- 285
P E+ + KA+A P+SVAIDA + QFY G++ G LNHGV VGYGT
Sbjct: 229 -PQREKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDA 287
Query: 286 SEEGI-KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
SE G YWLIKNSWG WG DGY ++ R+ + CG+A AS+P+
Sbjct: 288 SETGKGDYWLIKNSWGTTWGADGYVKIARNRNN---HCGVATAASYPL 332
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
FD+ AE + QWK+ + R Y + E +R I++ N+ ++ N + G +++ +N
Sbjct: 21 FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
F D+T +EF G++ H + P + K +P SV+W EKG VTPVK QGQ
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A +EG +K +L+SLSEQ LVDC+ N GC GG MD AF+YI +N
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYIKENG 194
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
G+ ++ Y YE G C +AE A T + D+ P E++L+KAVA P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDA 251
Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
S +LQFYS G+ + C + L+HGV VGY GT KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
++ +D D CG+A AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
FD+ AE + QWK+ + R Y + E +R I++ N+ ++ N + G +++ +N
Sbjct: 21 FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
F D+T +EF G++ H + P + K +P SV+W EKG VTPVK QGQ
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A +EG +K +L+SLSEQ LVDC+ N GC GG MD AF+YI +N
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
G+ ++ Y YE G C +AE A T + D+ P E++L+KAVA P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDA 251
Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
S +LQFYS G+ + C + L+HGV VGY GT KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
++ +D D CG+A AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
FD+ AE + QWK+ + R Y + E +R I++ N+ ++ N + G +++ +N
Sbjct: 21 FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
F D+T +EF G++ H + P + K +P SV+W EKG VTPVK QGQ
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A +EG +K +L+SLSEQ LVDC+ N GC GG MD AF+YI +N
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
G+ ++ Y YE G C +AE A T + D+ P E++L+KAVA P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANGTGFVDI-PQQEKALMKAVATVGPISVAMDA 251
Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
S +LQFYS G+ + C + L+HGV VGY GT KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
++ +D D CG+A AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 131/328 (39%), Positives = 184/328 (56%), Gaps = 22/328 (6%)
Query: 17 ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGN 75
+S A + ++ ++ + WK YGR Y+E E R I++ NL +V N ++G
Sbjct: 19 SSYAVAQVQNDPTLDHHWNLWKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGM 78
Query: 76 RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
SY L +N AD+T +E + + ++ S +AN T + ++P SV+W EKG V
Sbjct: 79 HSYDLGMNHLADMTSEEVSSLMSSLRVP---SQWQANVTYKSNSNQKLPDSVDWREKGCV 135
Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDD 187
T VKYQG C AV A+E +K LVSLS Q LVDC+T N GC GGFM
Sbjct: 136 TEVKYQGACGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTK 195
Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
AF+YII N GI ++ Y Y+ M G C ++ AA + Y ++P E++L +AVAN+
Sbjct: 196 AFQYIIDNNGIDSEVSYPYKAMD-GNC-RYDSKHRAATCSKYTELPFGSEDALKEAVANK 253
Query: 248 -PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
PVSVAIDA F+ SG ++ C +NHGV VGYG + G YWL+KNSWG ++
Sbjct: 254 GPVSVAIDAKHSSFFLYKSGVYYDPSCTQNVNHGVLVVGYG-NLNGRDYWLVKNSWGLNF 312
Query: 304 GEDGYFRLQRDIDQPQGQCGIAMFASFP 331
GE GY R+ R+ CGIA + S+P
Sbjct: 313 GEQGYIRMARN---SGNHCGIASYPSYP 337
>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 127/341 (37%), Positives = 186/341 (54%), Gaps = 26/341 (7%)
Query: 8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
+ L ++ C A+ + S+ ++ QW++ Y + Y + E+ +R +++ N+ +ER
Sbjct: 3 LSLFLAALCLGVASAAPKLDQSLDVQWNQWRSTYKKPYAVNEEDWRR-AVWEKNVKMIER 61
Query: 68 FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
N + G +T+ +N F D+T +EF GF+ H K P +P S
Sbjct: 62 HNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKG-KLFYEPVF---GHIPTS 117
Query: 127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
V+W +KG VTPVK QGQC A A+EG K +LVSLSEQ LVDC+ + N G
Sbjct: 118 VDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEG 177
Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
C GG MD+AF+Y+ N G+ ++ Y Y T C+ K E AA T + D+P E++
Sbjct: 178 CNGGLMDNAFQYVQDNGGLDSEESYPYLATDTHTCN-YKPECSAANDTGFVDIPQR-EKA 235
Query: 240 LLKAVAN-QPVSVAIDA--SALQFYSGGVF--NGYCETFLNHGVTAVGY---GTSEEGIK 291
L+KAVA P+SVAIDA + QFY G++ G L+HGV VGY G E K
Sbjct: 236 LMKAVATVGPISVAIDAGHESFQFYKSGIYYEPGCSSKDLDHGVLLVGYGFEGKDSENNK 295
Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
+W++KNSWG WG +GY ++ +D + CGIA AS+P
Sbjct: 296 FWIVKNSWGTSWGTNGYVKMAKDQNN---HCGIATAASYPT 333
>gi|119389039|pdb|2C0Y|A Chain A, The Crystal Structure Of A Cys25ala Mutant Of Human
Procathepsin S
Length = 315
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 178/319 (55%), Gaps = 28/319 (8%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFAD 87
++ + WK YG+ YKE E + R I++ NL V N ++G SY L +N D
Sbjct: 7 TLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGD 66
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC 144
+T +E ++ + ++ S + N T YKS+ +P SV+W EKG VT VKYQG C
Sbjct: 67 MTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNRILPDSVDWREKGCVTEVKYQGSC 120
Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDAFKYIIQNK 196
AV A+E +K +LVSLS Q LVDC+T N GC GGFM AF+YII NK
Sbjct: 121 GAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 180
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
GI +DA Y Y+ M ++ AA + Y ++P E+ L +AVAN+ PVSV +DA
Sbjct: 181 GIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDA 238
Query: 256 SALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
F+ SG + C +NHGV VGYG G +YWL+KNSWG ++GE+GY R+
Sbjct: 239 RHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNGKEYWLVKNSWGHNFGEEGYIRMA 297
Query: 313 RDIDQPQGQCGIAMFASFP 331
R+ CGIA F S+P
Sbjct: 298 RN---KGNHCGIASFPSYP 313
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 127/326 (38%), Positives = 177/326 (54%), Gaps = 26/326 (7%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFE------IFKDNLVAVERFNNAAIGNRSY 78
F ++A + + + +E+ +++ RF I++ N+ E N N+SY
Sbjct: 14 FVASTLAATHDPLTGVFAKWMRENTKSNYRFVYSNEEFIYRWNVWRDEEHNRQ---NKSY 70
Query: 79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPV 138
L +N+F DLT EF G D+S K + ++ +P +W +KGAVT V
Sbjct: 71 FLAMNQFGDLTNAEFNRLFKGLAF-DYSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHV 129
Query: 139 KYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
K QGQC + EG N +K RLVSLSEQ L+DC+ + NNGC GG MD AF+Y
Sbjct: 130 KNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEY 189
Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
II N+GI +A Y Y+ C A + +T Y DV DE +LL A +PVSV
Sbjct: 190 IINNRGIDTEASYPYQTAGPLTCQ-YNAANKGGSLTGYTDVTSGDENALLNAAVKEPVSV 248
Query: 252 AIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
AIDAS + QFYSGGV+ + T L+HGV VG+G SE G +W +KNSWG WG +G
Sbjct: 249 AIDASHNSFQFYSGGVYYESACSSTQLDHGVLVVGWG-SENGQDFWWVKNSWGASWGLNG 307
Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVS 333
Y ++ R+ + CGIA AS+P +
Sbjct: 308 YIKMSRNQNN---NCGIATAASYPTA 330
>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
Length = 215
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 108/198 (54%), Positives = 136/198 (68%), Gaps = 7/198 (3%)
Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
V VEGIN IK +LVSLSEQ+LVDC T+ N GC GG M++A+++I ++ GIT + +Y
Sbjct: 12 VVGVEGINKIKTGQLVSLSEQELVDCETD--NEGCNGGLMENAYEFIKKSGGITTERLYP 69
Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSG 263
Y+ G CDS K A I +E VP NDE +L+KAVANQPVSVAIDAS +QFYS
Sbjct: 70 YKARD-GSCDSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSE 128
Query: 264 GVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ- 321
GV+ G C L+HGV VGYGT+ +G KYW++KNSWG WGE GY R+QR +D +G
Sbjct: 129 GVYTGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGV 188
Query: 322 CGIAMFASFPVSKESAQP 339
CGIAM AS+P+ S P
Sbjct: 189 CGIAMEASYPLKLSSHNP 206
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 187/316 (59%), Gaps = 28/316 (8%)
Query: 33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV----ERFNNAAIGNRSYTLRLNKFADL 88
+++Q+KA+YG+ Y+ + E+S R +++ N + E++ N + S+TL +N+F D+
Sbjct: 21 EWQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLV---SFTLAMNQFGDM 77
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
T +E A+ GF + GT + ++P +V+W +KGAVTPVK Q C
Sbjct: 78 TTEEINAAMNGFLSAGKKV---PRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACGSCW 134
Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
A ++EG + + +LVSLSEQ LVDC+ N GC GG MD+AF+YI N GI +
Sbjct: 135 AFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTE 194
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA--SAL 258
Y YE G C +++ A +++Y D+ E+ L KAVA + PVSVAIDA S
Sbjct: 195 ESYPYEA-KNGPC-RFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTF 252
Query: 259 QFYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
FYS G+ ++ C +FL+HGV AVGYGT ++ YWL+KNSW + WG+ GY ++ R+ +
Sbjct: 253 HFYSRGIYYDEKCSSSFLDHGVLAVGYGT-DDSSDYWLVKNSWNETWGDSGYIKMSRNRN 311
Query: 317 QPQGQCGIAMFASFPV 332
CGIA AS+PV
Sbjct: 312 N---NCGIASQASYPV 324
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)
Query: 25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
FD+ AE + QWK+ + R Y + E +R I++ N+ ++ N + G +++ +N
Sbjct: 21 FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRIIQLHNGEYSNGQHGFSMEMN 78
Query: 84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
F D+T +EF G++ H + P + K +P SV+W EKG VTPVK QGQ
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134
Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
C A +EG +K +L+SLSEQ LVDC+ N GC GG MD AF+YI +N
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194
Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
G+ ++ Y YE G C +AE A T + D+ P E++L+KAVA P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDA 251
Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
S +LQFYS G+ + C + L+HGV VGY GT KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311
Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
++ +D D CG+A AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332
>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 128/330 (38%), Positives = 184/330 (55%), Gaps = 49/330 (14%)
Query: 29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
S+ E+ +Q+K +G+TY+ E +RF +F+ NLV ++ N G S+ ++ +FAD
Sbjct: 18 SVYEEGQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77
Query: 88 LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
+T +EF+ LK G P L ++ + +V+W E+GAVT
Sbjct: 78 MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVT 125
Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
PVK Q C AV A+EG K LVSLS Q+LVDCAT D NNGC GG M A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185
Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
F ++ Q++GI + Y YEG + C K+ ++ ++ Y V P DE+ + + VA +
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239
Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
PV+VAI+AS L FY G+ + C LN GV VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNPGVLVVGYG-SENGVDYWIVKNSWGAD 298
Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WGE GYFRL++D+ CGI + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324
>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
Length = 333
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 195/340 (57%), Gaps = 29/340 (8%)
Query: 10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
L+++ C A+ + S+ +++ WKA + + Y + E ++ ++K N+ +E N
Sbjct: 5 LLLTALCLGIASAAPKFDHSLDTQWKLWKAAHRKPYDLNEEGWRK-AVWKKNMKMIELHN 63
Query: 70 NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
+ G S+++ +N F D+T +EF + GF+ + + + T F + +PPSV+
Sbjct: 64 QEYSQGKHSFSMAMNAFGDMTNEEFRHTMNGFQRQKNKKGKEFHETIF----ASIPPSVD 119
Query: 129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
W EKG VTPVK QG+C A A+EG K +LVSLSEQ LVDC+ + N GC+
Sbjct: 120 WREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCH 179
Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
GGF+D+AF+Y++ G+ ++ Y Y G+ G C + AA T + D+ P E++L+
Sbjct: 180 GGFIDNAFQYVLDVGGLDSEESYPYTGL-VGTC-LYNPNNSAANETGFVDL-PKQEKALM 236
Query: 242 KAVANQ-PVSVAIDAS--ALQFYSGGVF---NGYCETFLNHGVTAVGY---GTSEEGIKY 292
KAVAN P+SVA+DA + QFY G++ N E+ ++H V VGY G + KY
Sbjct: 237 KAVANLGPISVAVDAHNPSFQFYKSGIYYEPNCSSES-VDHAVLVVGYGFEGADSDDNKY 295
Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
WL+KNSWG+ WG +GY ++ +D + CGIA AS+P
Sbjct: 296 WLVKNSWGEHWGMNGYIKMAKDRNN---HCGIATMASYPT 332
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/314 (40%), Positives = 184/314 (58%), Gaps = 22/314 (7%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN-AAIGNRSYTLRLNKFADLTPQE 92
++ +K + R Y E+ E +R E+F++NL +E N + G SY + +N+FAD+ +E
Sbjct: 44 WQDFKTVHERNYGET-EEMQRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKE 102
Query: 93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQGQCA----- 145
F + GF+M++ + + ++ + V P V+W ++G VTP+K QG C
Sbjct: 103 FASVVNGFRMNNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSF 162
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
A+EG + K +LVSLSEQ L+DC+T+ NNGC GG MD AF+YI N G +
Sbjct: 163 STTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDS 222
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQF 260
Y YE + G C K E A T Y D+P DEE + +AVA PVSVAIDAS + Q
Sbjct: 223 YPYEA-ADGPC-RFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQM 280
Query: 261 YSGGVFNGY-CET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
Y GV++ C+ L+HGV VGYGT E G YWL+KNSWG WG++GY ++ R+ +
Sbjct: 281 YQSGVYDEVECDPEGLDHGVLVVGYGT-ELGQDYWLVKNSWGTKWGDEGYIKMSRNKNN- 338
Query: 319 QGQCGIAMFASFPV 332
QCGI+ AS+P+
Sbjct: 339 --QCGISSMASYPL 350
>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 179/315 (56%), Gaps = 27/315 (8%)
Query: 35 EQW---KAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTP 90
+QW K +G+TYK E RF IF+ NL +E N G SY L + FADLT
Sbjct: 21 DQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTH 80
Query: 91 QEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------ 144
EF + +++A F + +VP S++W +KGAV VKYQG C
Sbjct: 81 DEFKDKLR--RQIKTKPNVEATLAVFP-EGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAF 137
Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC-YGGFMDDAFKYIIQNKGITNDA 202
A A+EG NAI N + LSEQQL+DC+ N+ C +GG M AF Y++ +KGI D+
Sbjct: 138 SATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVL-DKGIEADS 196
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQFY 261
Y Y+G+ T A+ +I Y +V ++EE L KAV PVSVAIDA +Q Y
Sbjct: 197 SYPYKGIDTPC--QYDAKKTVLKIKGYRNVSISEEE-LKKAVGTVGPVSVAIDADPIQLY 253
Query: 262 SGGVFNG-YCETFLNHGVTAVGYGTSEEGI---KYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
SGG+ +G +C LNHGV AVGYG + K+W +KNSWG+DWGE GYFR++RD +
Sbjct: 254 SGGILDGLFCTHNLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWGEQGYFRIKRDANN 313
Query: 318 PQGQCGIAMFASFPV 332
CGIA AS+P+
Sbjct: 314 ---LCGIADKASYPI 325
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 186/316 (58%), Gaps = 26/316 (8%)
Query: 34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN-AAIGNRSYTLRLNKFADLTPQE 92
++ +K + RTY E+ E S+R E+F++NL ++ N+ G Y + +N+FAD+ E
Sbjct: 43 WQDFKTVHERTYGET-EESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEANE 101
Query: 93 FIASQTGFKMSDHSS---SLKANG-TPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
F + GF+M++ + L AN +P + S VP V+W ++G VTPVK QGQC
Sbjct: 102 FASIMNGFRMNNRTEVRDHLHANYISPAIPVS--VPAEVDWRKEGYVTPVKNQGQCGSCW 159
Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
++EG + K +LVSLSEQ LVDC+T+ N GC GG +D AF+YI N G +
Sbjct: 160 AFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTE 219
Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDA--SAL 258
A Y YE + G C K+ A T Y D+P DE + +AVA PVSVAIDA S+
Sbjct: 220 ACYPYEAVD-GTC-RFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSSF 277
Query: 259 QFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
Q Y G++ C L+H V VGYGT E+G YWL+KNSWG WG++GY ++ R++D
Sbjct: 278 QMYQSGIYVEQECSPKQLDHAVLVVGYGT-EQGQDYWLVKNSWGTTWGDEGYIKMARNMD 336
Query: 317 QPQGQCGIAMFASFPV 332
QCGIA AS+P+
Sbjct: 337 N---QCGIASQASYPL 349
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/320 (38%), Positives = 179/320 (55%), Gaps = 25/320 (7%)
Query: 30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
+ + ++ WK + + Y E E +R +++ NL +E N ++G SY L +N F D+
Sbjct: 24 LDQHWQLWKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDM 82
Query: 89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-- 145
T +EF G+K + K +G+ F+ + + P +V+W +KG VTPVK QGQC
Sbjct: 83 THEEFRQIMNGYKRREQR---KYSGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSC 139
Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
A+EG K +LVSLSEQ LVDC+ + N GC GG MD AF+Y+ N+G+ +
Sbjct: 140 WAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDS 199
Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SA 257
+ Y Y+G C A+ A T + D+P E +L+KAVA+ PVSVAIDA +
Sbjct: 200 EDFYPYKGTDDQPC-QYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHES 258
Query: 258 LQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYFRLQ 312
QFY G+ F C + L+HGV VGYG E G KYW++KNSW + WG+ G+ +
Sbjct: 259 FQFYQSGIYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFIYMA 318
Query: 313 RDIDQPQGQCGIAMFASFPV 332
+D CGIA AS+P+
Sbjct: 319 KD---RHNHCGIATAASYPL 335
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/307 (41%), Positives = 180/307 (58%), Gaps = 26/307 (8%)
Query: 41 YGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTG 99
+G+ Y + E ++R I++ NL +E+ N AA G+ S+ L +N++ D+T +EF ++ G
Sbjct: 34 HGKQYG-AEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRSTMNG 92
Query: 100 FKMSDHSSSLKANGTPFLYKSS--QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVE 150
+KM + +S G+ +L S+ +P +V+W KG VTP+K QGQC A ++E
Sbjct: 93 YKMRNGTS----RGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSLE 148
Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
G K +L SLSEQ LVDC+ N+GC GG MDDAF+YI N GI ++ Y YE
Sbjct: 149 GQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGIDTESSYPYEA-K 207
Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN 267
G C A + A + + D+ E L AVA P++VAIDAS + Q Y GV++
Sbjct: 208 NGKC-RFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSGVYH 266
Query: 268 GY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
+ ET L+HGV AVGYGT E G YWL+KNSWG+ WG+ GY + R+ + CGIA
Sbjct: 267 EFFCSETRLDHGVLAVGYGT-ESGKDYWLVKNSWGESWGQKGYIMMSRN---KRNNCGIA 322
Query: 326 MFASFPV 332
AS+P
Sbjct: 323 TSASYPT 329
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 180/314 (57%), Gaps = 25/314 (7%)
Query: 31 AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP 90
AE++ WK +YG+TY+ E++ R +I+ N V N+ + S+ L +N+FADLT
Sbjct: 26 AEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSM---DSSFQLEVNEFADLTA 82
Query: 91 QEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
+EF + G+ + + N T + Y +P SV+W KG VTPVK Q QC
Sbjct: 83 EEFSSIYNGYGKGRNREN-HENTTIYRYTGGAIPDSVDWRTKGLVTPVKNQKQCGSCWAF 141
Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
++EG +A K +LVSLSEQ LVDC D+ GC GG M AFKYI +NKGI +
Sbjct: 142 STTGSLEGAHAKKTGKLVSLSEQNLVDCDKKDH--GCQGGLMTTAFKYIEENKGIDTEES 199
Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQF 260
Y Y+ G C+ K +D A + + + D E+L KAVA P+SVA+DAS + Q
Sbjct: 200 YPYKA-KNGRCE-FKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASHSSFQL 257
Query: 261 YSGGVFN-GYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
Y G+++ C + L+HGV VGYG E+G +YWL+KNSWG++WG +GYF+ I
Sbjct: 258 YKSGIYDPKICSSRKLDHGVLVVGYG-KEDGEEYWLVKNSWGKNWGMEGYFK----IASK 312
Query: 319 QGQCGIAMFASFPV 332
+ CGI A +PV
Sbjct: 313 KNLCGICTSACYPV 326
>gi|42564159|gb|AAS20591.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 178/315 (56%), Gaps = 27/315 (8%)
Query: 35 EQW---KAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTP 90
+QW K +G+TYK E RF IF+ NL +E N G SY L + FADLT
Sbjct: 21 DQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTH 80
Query: 91 QEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------ 144
EF + +++A F + +VP S++W +KGAV VKYQG C
Sbjct: 81 DEFKDELR--RQIKTKPNVEATLAVFP-EGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAF 137
Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC-YGGFMDDAFKYIIQNKGITNDA 202
A A+EG NAI N + LSEQQL+DC+ N+ C +GG M AF Y++ +KGI D+
Sbjct: 138 SATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVL-DKGIEADS 196
Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQFY 261
Y Y+G+ T A+ +I Y++V N EE L KAV PVSVAIDA +Q Y
Sbjct: 197 SYPYKGIDTPC--QYDAKKTVLKIKGYKNVS-NSEEELKKAVGTVGPVSVAIDADPIQLY 253
Query: 262 SGGVFNG-YCETFLNHGVTAVGYGTSEEGI---KYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
GG+ +G +C LNHGV AVGYG + K+W +KNSWG+DWGE GYFR++RD +
Sbjct: 254 FGGILDGLFCTHNLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWGEQGYFRIKRDANN 313
Query: 318 PQGQCGIAMFASFPV 332
CGIA AS+P+
Sbjct: 314 ---LCGIADKASYPI 325
>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
Length = 334
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 177/322 (54%), Gaps = 28/322 (8%)
Query: 27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
E ++ +E WK +G++YK EN+ R E++ +NL + N A++G +Y L +N
Sbjct: 24 ESTLDAHWELWKKTHGKSYKNDVENAHRRELWGNNLKMITVHNLEASMGLHTYELGMNHM 83
Query: 86 ADLTPQE---FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
DLT +E F AS T + ++ +PF S S +P +++W EKG VT VK Q
Sbjct: 84 GDLTEEEIMQFFASLT------PPTDIQRAPSPFAGASGSGIPDTMDWREKGCVTKVKMQ 137
Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
G C A A+EG A +LV LS Q LVDC+ N+GC GGFM AF+Y+I
Sbjct: 138 GACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVID 197
Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAI 253
N GI +DA Y Y G AA ++Y+ +P DE +L + +A P+SVAI
Sbjct: 198 NHGIDSDASYPYIGRDDQC--HYNPATRAANCSSYQFLPEGDENALKQGLATVGPISVAI 255
Query: 254 DA--SALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
DA FY GV+N C +NHGV AVGYGT G YWL+KNSWG +G+ GY R
Sbjct: 256 DARRPRFSFYRSGVYNDPSCTQKVNHGVLAVGYGTL-NGQDYWLVKNSWGTTFGDQGYIR 314
Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
+ R+ QCGIA++ +PV
Sbjct: 315 MARNTGN---QCGIALYPCYPV 333
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.131 0.388
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,400,993,642
Number of Sequences: 23463169
Number of extensions: 222004660
Number of successful extensions: 556233
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6262
Number of HSP's successfully gapped in prelim test: 1194
Number of HSP's that attempted gapping in prelim test: 526424
Number of HSP's gapped (non-prelim): 8379
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)