BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044448
(308 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 213 bits (541), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 175/325 (53%), Gaps = 33/325 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W++++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F ++Y G+ SNR ++ + Y +DW GAV +K Q
Sbjct: 91 FADLTDEEFRSTYLGFTSGSNKTKVSNR---YEPRVGQVLPSY--VDWRSAGAVVDIKSQ 145
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIR 165
G C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC GC ++ + F++I
Sbjct: 146 GE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFII 204
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ +E YPY QD C+ + KY I Y+ V E LQ V+ QPVSV
Sbjct: 205 NNGGINTEENYPYTA-QDGECNL--DLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A+DA F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E
Sbjct: 262 ALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEE 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G MRI R VGG+G C IA +YP+
Sbjct: 318 GYMRILRNVGGAGTCGIATMPSYPV 342
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 211 bits (538), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 128/300 (42%), Positives = 171/300 (57%), Gaps = 32/300 (10%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTG----YKPPPTDHP 74
EK RF +FK N HE L+LNKF D+T E+F +Y G +
Sbjct: 53 EKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEK 112
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
+ +S + N+N+ S+DW + GAVTPVK+QG CWAF+ V VEG+N+IRT
Sbjct: 113 KATKSFMYANVNT----LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168
Query: 134 QLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+L + S+ +LVDC T GC ++ AFE+I++ L SE VYPY+ D CD +
Sbjct: 169 KLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKA-SDETCDTNKE 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+A +I G++ V +E+ L V+ QPVSVAIDA + F FY GVFTG CG N
Sbjct: 228 NAP--VVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELN 285
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
HGV +VGYGTT + YW+VKN WG W E G +R+ RG+ GLC IA A+YPL
Sbjct: 286 HGVAVVGYGTTIDG---TKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 209 bits (532), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 176/310 (56%), Gaps = 27/310 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+ E+WM E+ R YKD EK RF+IFK N + + +N+F D+T+ +F+
Sbjct: 36 RFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P S F ++N S + SIDW + GAV VK+Q CW+F
Sbjct: 96 AQYTGVSLPLNIEREPVVS--FDDVNISAVP--QSIDWRDYGAVNEVKNQNPCGSCWSFA 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVEG+ KI+TG LV+ S+ +++DC+ GC ++ A+++I + +E YPY
Sbjct: 152 AIATVEGIYKIKTGYLVSLSEQEVLDCAVSYGCKGGWVNKAYDFIISNNGVTTEENYPYL 211
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGG 238
Q C+ +++ I GY YV+ E + VS QP++ IDA+ F +Y+GG
Sbjct: 212 AYQG-TCN---ANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGG 267
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGL 297
VF+GPCG + NH +TI+GYG + YW+V+N WG++W EGG +R+ RGV SG+
Sbjct: 268 VFSGPCGTSLNHAITIIGYGQDSSG---TKYWIVRNSWGSSWGEGGYVRMARGVSSSSGV 324
Query: 298 CNIAANAAYP 307
C IA +P
Sbjct: 325 CGIAMAPLFP 334
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 208 bits (530), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 174/325 (53%), Gaps = 33/325 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W++++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F ++Y + SNR ++ + Y +DW GAV +K Q
Sbjct: 91 FADLTDEEFRSTYLRFTSGSNKTKVSNR---YEPRVGQVLPSY--VDWRSAGAVVDIKSQ 145
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIR 165
G C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC GC ++ + F++I
Sbjct: 146 GE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFII 204
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ +E YPY QD C+ + KY I Y+ V E LQ V+ QPVSV
Sbjct: 205 NNGGINTEENYPYTA-QDGECN--VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 226 AIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A+DA F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E
Sbjct: 262 ALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEE 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G MRI R VGG+G C IA +YP+
Sbjct: 318 GYMRILRNVGGAGTCGIATMPSYPV 342
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 208 bits (529), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 127/315 (40%), Positives = 175/315 (55%), Gaps = 28/315 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R + AEK RF FK N F L LN+F D+ + +F A
Sbjct: 46 YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDMDQAEFRA 104
Query: 62 SYTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
++ G + P S + LN S + S+DW ++GAVT VKDQG CWAF+
Sbjct: 105 TFVGDLRRDTPSKPPSVPGFMYAALNVSDLP--PSVDWRQKGAVTGVKDQGKCGSCWAFS 162
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYP 177
V +VEG+N IRTG LV+ S+ +L+DC T +GC ++NAFEYI+ L +E YP
Sbjct: 163 TVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYP 222
Query: 178 YQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNF 234
Y+ + C+ R++ + I G+Q V +EE L V+ QPVSVA++A+ F F
Sbjct: 223 YRAARG-TCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMF 281
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGV +VGYG AE + YW VKN WG +W E G +R+ + G
Sbjct: 282 YSEGVFTGECGTELDHGVAVVGYGV---AEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGA 338
Query: 295 S-GLCNIAANAAYPL 308
S GLC IA A+YP+
Sbjct: 339 SGGLCGIAMEASYPV 353
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 207 bits (527), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 119/310 (38%), Positives = 170/310 (54%), Gaps = 27/310 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ E+WM E+ R YKD EK +RF+IFK N L +N+F D+T +F+
Sbjct: 36 QFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P S F +++ S S SIDW + GAVT VK+QG CWAF
Sbjct: 96 AQYTGLSLPLNIKREPVVS--FDDVDIS--SVPQSIDWRDSGAVTSVKNQGRCGSCWAFA 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
++ATVE + KI+ G LV+ S+ Q++DC+ GC ++ A+ +I + +AS +YPY+
Sbjct: 152 SIATVESIYKIKRGNLVSLSEQQVLDCAVSYGCKGGWINKAYSFIISNKGVASAAIYPYK 211
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGG 238
+ C +++ I Y YVQ E + VS QP++ A+DA+ F Y G
Sbjct: 212 AAKG-TC---KTNGVPNSAYITRYTYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRG 267
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GL 297
VFTGPCG NH + I+GYG + + +W+V+N WG W EGG +R+ R V S GL
Sbjct: 268 VFTGPCGTRLNHAIVIIGYGQDSSG---KKFWIVRNSWGAGWGEGGYIRLARDVSSSFGL 324
Query: 298 CNIAANAAYP 307
C IA + YP
Sbjct: 325 CGIAMDPLYP 334
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 207 bits (526), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 127/315 (40%), Positives = 175/315 (55%), Gaps = 28/315 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R + AEK RF FK N F L LN+F D+ + +F A
Sbjct: 46 YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDMDQAEFRA 104
Query: 62 SYTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
++ G + P S + LN S + S+DW ++GAVT VKDQG CWAF+
Sbjct: 105 TFVGDLRRDTPAKPPSVPGFMYAALNVSDLP--PSVDWRQKGAVTGVKDQGKCGSCWAFS 162
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYP 177
V +VEG+N IRTG LV+ S+ +L+DC T +GC ++NAFEYI+ L +E YP
Sbjct: 163 TVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYP 222
Query: 178 YQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNF 234
Y+ + C+ R++ + I G+Q V +EE L V+ QPVSVA++A+ F F
Sbjct: 223 YRAARG-TCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMF 281
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGV +VGYG AE + YW VKN WG +W E G +R+ + G
Sbjct: 282 YSEGVFTGDCGTELDHGVAVVGYGV---AEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGA 338
Query: 295 S-GLCNIAANAAYPL 308
S GLC IA A+YP+
Sbjct: 339 SGGLCGIAMEASYPV 353
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 206 bits (523), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 173/316 (54%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +W E ++Y E+E R+ F+ N H F L LN+FADLT E+
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ +Y G + + P R + L + + +S+DW +GAV +KDQG CWA
Sbjct: 100 YRDTYLGLR----NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG L++ S+ +LVDC T GC ++ AF++I + +E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTEDD 215
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G+ D CD R +A K I Y+ V P +E LQ V+ QPVSVAI+A F
Sbjct: 216 YPYKGK-DERCDVNRKNA--KVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQ 272
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R +
Sbjct: 273 LYSSGIFTGKCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 205 bits (522), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 165/312 (52%), Gaps = 28/312 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WMV+ + Y AEKE R IF+ N F L L FADL+ ++
Sbjct: 50 ESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVC 109
Query: 64 TGYKP-PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
G P PP +H S+ +K S+ S+DW GAVT VKDQG +C CWAF+
Sbjct: 110 HGADPRPPRNHVFMTSSDRYKT--SADDVLPKSVDWRNEGAVTEVKDQG-HCRSCWAFST 166
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
V VEGLNKI TG+LVT S+ L++C+ NGC LE A+E+I + L ++ YPY+
Sbjct: 167 VGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYK 226
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
+ CD R + K I GY+ + E L V+ QPV+ ID++ F Y
Sbjct: 227 A-VNGVCD-GRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYES 284
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G CG NHGV +VGYGT E + YWLVKN G W E G M++ R + G
Sbjct: 285 GVFDGSCGTNLNHGVVVVGYGT----ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRG 340
Query: 297 LCNIAANAAYPL 308
LC IA A+YPL
Sbjct: 341 LCGIAMRASYPL 352
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 203 bits (516), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/300 (41%), Positives = 166/300 (55%), Gaps = 30/300 (10%)
Query: 30 AEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSN 77
EK RF +FK N L+LNKFAD+T +F ++Y G K +HP
Sbjct: 54 GEKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSK---VNHPRMF 110
Query: 78 RSNWFKN---LNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
R +N + +S S+DW ++GAVT VKDQG CWAF+ V VEG+N+I+T
Sbjct: 111 RGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTN 170
Query: 134 QLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+LV S+ +LVDC GC +E+AFE+I+Q + +E YPY+ Q+ CD S
Sbjct: 171 KLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKA-QEGTCD--AS 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+ +I G++ V E+ L V+ QPVSVAIDA + F FY GVFTG C N
Sbjct: 228 KVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLN 287
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
HGV IVGYGTT + YW+V+N WG W E G +R+ R + GLC IA +YP+
Sbjct: 288 HGVAIVGYGTTVDGTN---YWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 203 bits (516), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 162/311 (52%), Gaps = 26/311 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WMV+ + Y AEKE R IF+ N F L LN+FADL+ ++
Sbjct: 57 ESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEIC 116
Query: 64 TGYKP-PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCWAFTAV 121
G P PP +H SN +K + + S+DW GAVT VKDQG CWAF+ V
Sbjct: 117 HGADPRPPRNHVFMTSSNRYKTSDGDVLP--KSVDWRNEGAVTEVKDQGLCRSCWAFSTV 174
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEGLNKI TG+LVT S+ L++C+ NGC +E A+E+I L ++ YPY+
Sbjct: 175 GAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKA 234
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGG 238
C+ R K I GY+ + E L V+ QPV+ +D++ F Y G
Sbjct: 235 LNG-VCE-GRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESG 292
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG NHGV +VGYGT E + YW+VKN G W E G M++ R + GL
Sbjct: 293 VFDGTCGTNLNHGVVVVGYGT----ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGL 348
Query: 298 CNIAANAAYPL 308
C IA A+YPL
Sbjct: 349 CGIAMRASYPL 359
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 202 bits (514), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 122/300 (40%), Positives = 168/300 (56%), Gaps = 32/300 (10%)
Query: 31 EKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDH----P 74
EK+ RF +FK N L+LNKFAD+T +F +Y+G K P
Sbjct: 53 EKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRNTYSGSKVKHHRMFRGGP 112
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
N + ++ +++ S +DW ++GAVT VKDQG CWAF+ + VEG+N+I+T
Sbjct: 113 RGNGTFMYEKVDTVPAS----VDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTN 168
Query: 134 QLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+LV+ S+ +LVDC T GC ++ AFE+I+Q + +E YPY+ D CD +
Sbjct: 169 KLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEA-YDGTCDVSKE 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+A +I G++ V E L V+ QPVSVAIDA + F FY GVFTG CG +
Sbjct: 228 NAPAV--SIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELD 285
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
HGV IVGYGTT + YW VKN WG W E G +R+ RG+ GLC IA A+YP+
Sbjct: 286 HGVAIVGYGTTIDG---TKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 342
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 200 bits (509), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 175/320 (54%), Gaps = 40/320 (12%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLA 61
+E+W +D EK RF +FK+N HEF L LNKF D+T ++F +
Sbjct: 40 YEKWRTHHT-VARDLDEKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRS 98
Query: 62 SYTGYKPPPTDHPHSNR-------SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
Y G K H S R S ++N+ S + SIDW +GAVT VKDQG
Sbjct: 99 KYAGSK---IQHHRSQRGIQKNTGSFMYENVGSLPAA---SIDWRAKGAVTGVKDQGQCG 152
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ +A+VEG+N+I+TG+LV+ S+ +LVDC T GC ++ AFE+I Q +
Sbjct: 153 SCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFI-QKNGIT 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT- 230
+E YPY QD C + + +I G+Q V E L V+ QP+SV+I+A+
Sbjct: 212 TEDSYPY-AEQDGTCA--SNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASG 268
Query: 231 -WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GVFTG CG +HGV IVGYG T + YW+VKN WG W E G +R+
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDG---TKYWIVKNSWGEEWGESGYIRMQ 325
Query: 290 RGVGGS-GLCNIAANAAYPL 308
RG+ G C IA A+YP+
Sbjct: 326 RGISDKRGKCGIAMEASYPI 345
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 199 bits (507), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/300 (41%), Positives = 170/300 (56%), Gaps = 31/300 (10%)
Query: 31 EKEMRFKIF-----------KKNHEF-LRLNKFADLTREKFLASYTG----YKPPPTDHP 74
E+E RF +F KKN + L+LNKFADLT +F +YTG +
Sbjct: 53 EREKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPK 112
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
++ + + N SK+ S+DW ++GAVT +K+QG CWAF+ VA VEG+NKI+T
Sbjct: 113 RGSKQFMYDHENLSKLP--SSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTN 170
Query: 134 QLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+LV+ S+ +LVDC T GC +E AFE+I++ + +E YPY+G D CD S
Sbjct: 171 KLVSLSEQELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEG-IDGKCD--AS 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+G I G++ V E L V+ QPVSVAIDA + F FY GVFTG CG N
Sbjct: 228 KDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELN 287
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
HGV VGYG+ E + YW+V+N WG W EGG ++I R + G C IA A+YP+
Sbjct: 288 HGVAAVGYGS----ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPI 343
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 199 bits (505), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 121/280 (43%), Positives = 158/280 (56%), Gaps = 22/280 (7%)
Query: 40 KKNHEF-LRLNKFADLTREKFLASYTGYKPPPTDHPHSNR-----SNWFKNLNSSKMSFY 93
KKN + L++N+FAD+T +F +SY G H R S F N +++
Sbjct: 73 KKNKPYKLKINRFADITHHEFRSSYAGSN---VKHHRMLRGPKRGSGGFMYENVTRVP-- 127
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-- 150
S+DW E+GAVT VK+Q CWAF+ VA VEG+NKIRT +LV+ S+ +LVDC T
Sbjct: 128 SSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQ 187
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210
GCA +E AFE+I+ + +E YPY +C +S G+ I G+++V
Sbjct: 188 GCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCR--ANSIGGETVTIDGHEHVPEND 245
Query: 211 EEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
EE L V+ QPVSVAIDA + F Y GVF G CG NHGV IVGYG T
Sbjct: 246 EEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNG---TK 302
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYP 307
YW+V+N WG W EGG +RI RG+ G C IA A+YP
Sbjct: 303 YWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP 342
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 197 bits (500), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 163/305 (53%), Gaps = 40/305 (13%)
Query: 30 AEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSN 77
EK RF +FK N H L+LNKFAD+T +F ++Y G K N
Sbjct: 54 GEKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKV--------N 105
Query: 78 RSNWFKNLNSSKMSFY--------DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLN 128
F+ +F S+DW ++GAVT VKDQG CWAF+ + VEG+N
Sbjct: 106 HHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGIN 165
Query: 129 KIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYC 186
+I+T +LV+ S+ +LVDC GC +E+AFE+I+Q + +E YPY Q+ C
Sbjct: 166 QIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYTA-QEGTC 224
Query: 187 DWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPC 244
D S + +I G++ V E L V+ QPVSVAIDA + F FY GVFTG C
Sbjct: 225 D--ESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDC 282
Query: 245 GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAAN 303
NHGV IVGYGTT + YW+V+N WG W E G +R+ R + GLC IA
Sbjct: 283 NTDLNHGVAIVGYGTTVDGTN---YWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMM 339
Query: 304 AAYPL 308
A+YP+
Sbjct: 340 ASYPI 344
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 195 bits (496), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 168/313 (53%), Gaps = 29/313 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLA 61
+EQW+VE + Y EKE RFKIFK N +F+ L +FADLT E+F A
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRA 103
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y K T +K + D +DW GAV VKDQG+ CWAF+A
Sbjct: 104 IYLRKKMERTKDSVKTERYLYKEGDV----LPDEVDWRANGAVVSVKDQGNCGSCWAFSA 159
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYP 177
V VEG+N+I TG+L++ S+ +LVDC GC + AFE+I + + ++ YP
Sbjct: 160 VGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYP 219
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFY 235
Y C+ +++ + + I GY+ V E+ L+ V+ QPVSVAI+A+ F Y
Sbjct: 220 YNANDLGLCNADKNNNT-RVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLY 278
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GV TG CG + +HGV +VGYG+T+ + YW+++N WG NW + G +++ R +
Sbjct: 279 KSGVMTGTCGISLDHGVVVVGYGSTS----GEDYWIIRNSWGLNWGDSGYVKLQRNIDDP 334
Query: 296 -GLCNIAANAAYP 307
G C IA +YP
Sbjct: 335 FGKCGIAMMPSYP 347
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 194 bits (492), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 170/319 (53%), Gaps = 30/319 (9%)
Query: 8 TGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFK--------KNHEF----LRLNKFADLT 55
T + E WM E ++ YK EK RF++F+ +N+E L LN+FADLT
Sbjct: 44 TDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLT 103
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
E+F Y G P +N+ ++++ S+DW ++GAV PVKDQG
Sbjct: 104 HEEFKGRYLGLAKPQFSRKRQPSANFRYRDITD----LPKSVDWRKKGAVAPVKDQGQCG 159
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ VA VEG+N+I TG L + S+ +L+DC T +GC ++ AF+YI L
Sbjct: 160 SCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLH 219
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E YPY ++ C + + I GY+ V +E L ++ QPVSVAI+A+
Sbjct: 220 KEDDYPYL-MEEGICQEQKEDV--ERVTISGYEDVPENDDESLVKALAHQPVSVAIEASG 276
Query: 232 --FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GGVF G CG +HGV VGYG++ ++ Y +VKN WG W E G +R+
Sbjct: 277 RDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSD----YVIVKNSWGPRWGEKGFIRMK 332
Query: 290 RGVGG-SGLCNIAANAAYP 307
R G GLC I A+YP
Sbjct: 333 RNTGKPEGLCGINKMASYP 351
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 193 bits (491), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/318 (37%), Positives = 170/318 (53%), Gaps = 31/318 (9%)
Query: 11 IAAKHEQWMVEF--ARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTR 56
+ + +E W+V+ A++ EK+ RF+IFK N F L L +FADLT
Sbjct: 46 VMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTN 105
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+++ + Y G K R + +SIDW ++GAV VKDQG C
Sbjct: 106 DEYRSKYLGAKM----EKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSC 161
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ + VEG+N+I TG L+T S+ +LVDC T GC ++ AFE+I + + ++
Sbjct: 162 WAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTD 221
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+G D CD R +A K I Y+ V +EE L+ V+ QP+S+AI+A
Sbjct: 222 KDYPYKG-VDGTCDQIRKNA--KVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F G CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R
Sbjct: 279 FQLYDSGIFDGSCGTQLDHGVVAVGYGT----ENGKDYWIVRNSWGKSWGESGYLRMARN 334
Query: 292 VG-GSGLCNIAANAAYPL 308
+ SG C IA +YP+
Sbjct: 335 IASSSGKCGIAIEPSYPI 352
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 192 bits (489), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/312 (39%), Positives = 172/312 (55%), Gaps = 36/312 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WM++ + YK+ EK RF+IFK N ++ L LN FAD++ ++F Y
Sbjct: 49 ESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAV 121
TG + + ++ + LN ++ + +DW ++GAVTPVK+QGS C CWAF+AV
Sbjct: 109 TG---SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGS-CGSCWAFSAV 164
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQG 180
T+EG+ KIRTG L S+ +L+DC + GC + +A + + QY + YPY+G
Sbjct: 165 VTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYG-IHYRNTYPYEG 223
Query: 181 RQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
Q YC RS G Y A G + VQP E L ++ QPVSV ++A F Y G
Sbjct: 224 VQR-YC---RSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRG 279
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
G+F GPCGN +H V VGYG Y L+KN WGT W E G +RI RG G S G
Sbjct: 280 GIFVGPCGNKVDHAVAAVGYGPN--------YILIKNSWGTGWGENGYIRIKRGTGNSYG 331
Query: 297 LCNIAANAAYPL 308
+C + ++ YP+
Sbjct: 332 VCGLYTSSFYPV 343
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 191 bits (485), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 120/312 (38%), Positives = 170/312 (54%), Gaps = 31/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+ F + Y+ EK +RF++FK N + +L LN+FADL+ E+F Y
Sbjct: 52 ENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY 111
Query: 64 TGYKPPPT--DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G K D S ++++ + S +DW ++GAV VK+QGS CWAF+
Sbjct: 112 LGLKTDIVRRDEERSYAEFAYRDVEAVPKS----VDWRKKGAVAEVKNQGSCGSCWAFST 167
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPY 178
VA VEG+NKI TG L T S+ +L+DC T NGC ++ AFEYI + L E YPY
Sbjct: 168 VAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPY 227
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
++ C+ + + + I G+Q V E+ L ++ QP+SVAIDA+ F FY
Sbjct: 228 S-MEEGTCEMQKDES--ETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-S 295
GGVF G CG +HGV VGYG++ ++ Y +VKN WG W E G +R+ R G
Sbjct: 285 GGVFDGRCGVDLDHGVAAVGYGSSKGSD----YIIVKNSWGPKWGEKGYIRLKRNTGKPE 340
Query: 296 GLCNIAANAAYP 307
GLC I A++P
Sbjct: 341 GLCGINKMASFP 352
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 191 bits (484), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 178/319 (55%), Gaps = 36/319 (11%)
Query: 13 AKHEQWMVEFARTYKDQ--AEKEMRFKIFKKNHEF---------------LRLNKFADLT 55
A ++ W+ E + E E RF +F N +F L +N+FADLT
Sbjct: 50 AAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLT 109
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
E+F A++ G K +R+ + + +S+DW E+GAV PVK+QG
Sbjct: 110 NEEFRATFLGAKVA-----ERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGS 164
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLA 171
CWAF+AV+TVE +N++ TG+++T S+ +LV+CST +GC +++AF++I + +
Sbjct: 165 CWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGID 224
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
+E YPY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A
Sbjct: 225 TEDDYPYKA-VDGKCDINRENA--KVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGG 281
Query: 232 --FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F YH GVF+G CG + +HGV VGYGT + + YW+V+N WG W E G +R+
Sbjct: 282 REFQLYHSGVFSGRCGTSLDHGVVAVGYGT----DNGKDYWIVRNSWGPKWGESGYVRME 337
Query: 290 RGVG-GSGLCNIAANAAYP 307
R + +G C IA A+YP
Sbjct: 338 RNINVTTGKCGIAMMASYP 356
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 190 bits (482), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/327 (36%), Positives = 166/327 (50%), Gaps = 28/327 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
+ + G + +EQW+VE + Y EKE RFKIFK N + + L
Sbjct: 28 TESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP-V 107
NKF+DLT ++F ASY G K +K + D +DW ERGAV P V
Sbjct: 88 NKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYKEGDV----LPDEVDWRERGAVVPRV 143
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEY 163
K QG CWAF A VEG+N+I TG+LV+ S+ +L+DC N GCA AFE+
Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I++ + S+ VY Y G C + + I G++ V E L+ V+ QP+
Sbjct: 204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTT-RVVTINGHEVVPVNDEMSLKKAVAYQPI 262
Query: 224 SVAIDATWFNFYHGGVFTGPCGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
SV I A + Y GV+ G C N +H V IVGYGT+++ + YWL++N WG W E
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSD---EGDYWLIRNSWGPEWGE 319
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYPL 308
GG +R+ R +G C +A YP+
Sbjct: 320 GGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 182 bits (463), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 163/301 (54%), Gaps = 33/301 (10%)
Query: 31 EKEMRFKIFKKNHEF---------------LRLNKFADLTREKFLASYTGYKPPPTDHPH 75
E E RF++F N +F L +N+FADLT +F A+Y G P
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPA-----G 138
Query: 76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVT-PVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
R + + DS+DW ++GAV PVK+QG CWAF+AVA VEG+NKI TG
Sbjct: 139 RGRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG 198
Query: 134 QLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
+LV+ S+ +LV+C+ +GC +++AF +I + L +E YPY D C+ +
Sbjct: 199 ELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTA-MDGKCNLAK 257
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
S K +I G++ V E LQ V+ QPVSVAIDA F Y GVFTG CG
Sbjct: 258 RSR--KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNL 315
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
+HGV VGYG T+A YW V+N WG +W E G +R+ R V +G C IA A+YP
Sbjct: 316 DHGVVAVGYG--TDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 373
Query: 308 L 308
+
Sbjct: 374 I 374
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 182 bits (461), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 116/323 (35%), Positives = 168/323 (52%), Gaps = 43/323 (13%)
Query: 17 QWMVEFARTYKDQA----EKEMRFKIFKKNHEF--------------LRLNKFADLTREK 58
QW E +T + +++ RF IFK N F L L KF DLT ++
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDE 110
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNS------SKMSFYDSIDWNERGAVTPVKDQGS 112
+ Y G + P + R KN+N + +++DW ++GAV P+KDQG+
Sbjct: 111 YRKLYLGARTEP-----ARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGT 165
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ A VEG+NKI TG+L++ S+ +LVDC S GC ++ AF++I +
Sbjct: 166 CGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 225
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY+G + ++S + +I GY+ V E L+ +S QPVSVAI+A
Sbjct: 226 LNTEKDYPYRGFGGKCNSFLKNS---RVVSIDGYEDVPTKDETALKKAISYQPVSVAIEA 282
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y G+FTG CG +H V VGYG+ E YW+V+N WG W E G +R
Sbjct: 283 GGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS----ENGVDYWIVRNSWGPRWGEEGYIR 338
Query: 288 IFRGVGG--SGLCNIAANAAYPL 308
+ R + SG C IA A+YP+
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 181 bits (458), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 116/322 (36%), Positives = 167/322 (51%), Gaps = 42/322 (13%)
Query: 17 QWMVEFARTYKDQA----EKEMRFKIFKKNHEFLRLNK--------------FADLTREK 58
+W +E ++ + +++ RF IFK N F+ L+ FA+LT ++
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGS 112
+ + Y G + P R KN+N + + ++DW ++GAV +KDQG+
Sbjct: 66 YRSLYLGARTEPV-----RRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGT 120
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ A VEG+NKI TG+LV+ S+ +LVDC S GC ++ AF++I +
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY G ++S + I GY+ V E L+ VS QPVSVAIDA
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNS---RVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDA 237
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y G+FTG CG +H V VGYG+ E YW+V+N WGT W E G +R
Sbjct: 238 GGRAFQHYQSGIFTGKCGTNMDHAVVAVGYGS----ENGVDYWIVRNSWGTRWGEDGYIR 293
Query: 288 IFRGVGG-SGLCNIAANAAYPL 308
+ R V SG C IA A+YP+
Sbjct: 294 MERNVASKSGKCGIAIEASYPV 315
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 177 bits (448), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 116/312 (37%), Positives = 170/312 (54%), Gaps = 31/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIF-----------KKNHEF-LRLNKFADLTREKFLASY 63
+ WM++ + Y+ EK RF+IF KKN+ + L LN FADL+ ++F Y
Sbjct: 49 DSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKY 108
Query: 64 TGYKPPP-TDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G+ T H + ++ +K++ + + SIDW +GAVTPVK+QG+ CWAF+
Sbjct: 109 VGFVAEDFTGLEHFDNEDFTYKHVTN----YPQSIDWRAKGAVTPVKNQGACGSCWAFST 164
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+ATVEG+NKI TG L+ S+ +LVDC + GC + + +Y+ + + VYPYQ
Sbjct: 165 IATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVAN-NGVHTSKVYPYQ 223
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+Q Y C + G I GY+ V E ++ QP+SV ++A F Y
Sbjct: 224 AKQ-YKCR--ATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKS 280
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
GVF GPCG +H VT VGYGT+ +G+ Y ++KN WG NW E G MR+ R G S G
Sbjct: 281 GVFDGPCGTKLDHAVTAVGYGTS---DGKN-YIIIKNSWGPNWGEKGYMRLKRQSGNSQG 336
Query: 297 LCNIAANAAYPL 308
C + ++ YP
Sbjct: 337 TCGVYKSSYYPF 348
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 175 bits (444), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 99/220 (45%), Positives = 132/220 (60%), Gaps = 13/220 (5%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
DSIDW E GAV PVK+QG CWAF+ VA VEG+N+I TG L++ S+ QLVDC+T N G
Sbjct: 5 DSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTANHG 64
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AF++I + SE YPY+G QD C+ S+ + +I Y+ V E
Sbjct: 65 CRGGWMNPAFQFIVNNGGINSEETYPYRG-QDGICN---STVNAPVVSIDSYENVPSHNE 120
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVSV +DA F Y G+FTG C + NH +T+VGYGT E + +
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT----ENDKDF 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
W+VKN WG NW E G +R R + G C I A+YP+
Sbjct: 177 WIVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 172 bits (437), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 161/307 (52%), Gaps = 29/307 (9%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG 65
WM+ + Y++ EK RF+IFK N + L LN+FADL+ ++F Y G
Sbjct: 51 WMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVG 110
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
T + + +N ++ +++DW ++GAVTPV+ QGS CWAF+AVATV
Sbjct: 111 SLIDATIEQSYDE----EFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATV 166
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
EG+NKIRTG+LV S+ +LVDC + GC + A EY+ + + YPY+ +Q
Sbjct: 167 EGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAK-NGIHLRSKYPYKAKQG 225
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
G G VQP E L + +++QPVSV +++ F Y GG+F
Sbjct: 226 ---TCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFE 282
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNI 300
GPCG +H VT V G + Y L+KN WGT W E G +RI R G S G+C +
Sbjct: 283 GPCGTKVDHAVTAV----GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 338
Query: 301 AANAAYP 307
++ YP
Sbjct: 339 YKSSYYP 345
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 171 bits (432), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 111/310 (35%), Positives = 161/310 (51%), Gaps = 33/310 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG 65
WM++ + YK+ EK RF+IFK N ++ L LN+F+DL+ ++F Y G
Sbjct: 51 WMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYWLGLNEFSDLSNDEFKEKYVG 110
Query: 66 YKPPP-TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVA 122
P T+ P+ +N + +S+DW +GAVTPVK QG YC CWAF+ VA
Sbjct: 111 SLPEDYTNQPYDEEF-----VNEDIVDLPESVDWRAKGAVTPVKHQG-YCESCWAFSTVA 164
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
TVEG+NKI+TG LV S+ +LVDC + GC + + + +Y+ Q + YPY +
Sbjct: 165 TVEGINKIKTGNLVELSEQELVDCDKQSYGCNRGYQSTSLQYVAQ-NGIHLRAKYPYIAK 223
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHGGV 239
Q + G G VQ E L + ++ QPVSV +++ +F Y GG+
Sbjct: 224 QQ---TCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGI 280
Query: 240 FTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLC 298
F G CG +H VT V G + Y L+KN WG W E G +RI R G S G+C
Sbjct: 281 FEGSCGTKVDHAVTAV----GYGKSGGKGYILIKNSWGPGWGENGYIRIRRASGNSPGVC 336
Query: 299 NIAANAAYPL 308
+ ++ YP+
Sbjct: 337 GVYRSSYYPI 346
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 166 bits (419), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 127/218 (58%), Gaps = 19/218 (8%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
+ IDW ++GAVTPVK+QGS CWAF+ V+TVE +N+IRTG L++ S+ +LVDC N G
Sbjct: 3 EQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKNHG 62
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C A++YI + ++ YPY+ Q A+ K +I GY V E
Sbjct: 63 CLGGAFVFAYQYIINNGGIDTQANYPYKAVQG------PCQAASKVVSIDGYNGVPFCNE 116
Query: 212 EGLQDVVSRQPVSVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
L+ V+ QP +VAIDA+ F Y G+F+GPCG NHGVTIVGY Q Y
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGY--------QANY 168
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
W+V+N WG W E G +R+ R VGG GLC IA YP
Sbjct: 169 WIVRNSWGRYWGEKGYIRMLR-VGGCGLCGIARLPYYP 205
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 166 bits (419), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 96/224 (42%), Positives = 132/224 (58%), Gaps = 13/224 (5%)
Query: 91 SFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--S 147
S +SIDW E+G + VKDQGS CWAF+AVA +E +N I TG L++ S+ +LVDC S
Sbjct: 17 SLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS 76
Query: 148 TLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQ 207
GC ++ AFE++ + + +E YPY+ R CD +R +A K I Y+ V
Sbjct: 77 YNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNG-VCDQYRKNA--KVVKIDSYEDVP 133
Query: 208 PATEEGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEG 265
E+ LQ V+ QPVS+A++A +F H G+FTG CG +HGV I GYGT E
Sbjct: 134 VNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT----EN 189
Query: 266 QQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
YW+V+N WG N E G +R+ R V SGLC +A +YP+
Sbjct: 190 GMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 164 bits (415), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 97/219 (44%), Positives = 129/219 (58%), Gaps = 18/219 (8%)
Query: 96 IDWNERGAVTPVKDQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
+DW +GAV +K+Q GS CWAF+AVA VE +NKIRTGQL++ S+ +LVDC T + G
Sbjct: 5 VDWRSKGAVNSIKNQKQCGS--CWAFSAVAAVESINKIRTGQLISLSEQELVDCDTASHG 62
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ NAF+YI + ++ YPY Q C +R + +I G+Q V E
Sbjct: 63 CNGGWMNNAFQYIITNGGIDTQQNYPYSAVQG-SCKPYRL----RVVSINGFQRVTRNNE 117
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
LQ V+ QPVSV ++A F Y G+FTGPCG NHGV IVGYGT + + Y
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGT----QSGKNY 173
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
W+V+N WG NW G + + R V S GLC IA +YP
Sbjct: 174 WIVRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYP 212
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 164 bits (415), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 127/220 (57%), Gaps = 13/220 (5%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
DSIDW E+GAV PVK+QG CWAF A+A VEG+N+I TG L++ S+ QLVDCST N G
Sbjct: 5 DSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTRNHG 64
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C + AF+YI + SE YPY G + CD + + +I Y+ V E
Sbjct: 65 CEGGWPYRAFQYIINNGGINSEEHYPYTG-TNGTCD---TKENAHVVSIDSYRNVPSNDE 120
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVSV +DA F Y G+FTG C + NH T+ G E E + Y
Sbjct: 121 KSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGG----RETENDKDY 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W VKN WG NW E G +R+ R + SG C IA + +YP+
Sbjct: 177 WTVKNSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPI 216
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 160 bits (406), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 171/327 (52%), Gaps = 47/327 (14%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E + Y+D+ E+ R KIF K N F L +NK+ADL
Sbjct: 57 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 116
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW-FKN---LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F G+ T H ++ FK ++ + ++ S+DW +GAVT VKDQG
Sbjct: 117 HEFRQLMNGFNY--TLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQG- 173
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 174 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 233
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSV 225
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PVSV
Sbjct: 234 GGIDTEKSYPYEAIDD-SCHFNK----GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSV 288
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY GV+ P + N HGV +VG+GT E + YWLVKN WGT W
Sbjct: 289 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTD---ESGEDYWLVKNSWGTTWG 345
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ G +++ R C IA+ ++YPL
Sbjct: 346 DKGFIKMLR--NKENQCGIASASSYPL 370
>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330
Score = 159 bits (402), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 165/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W + + Y + ++ R I++KN H + L +N D+T E+
Sbjct: 28 ELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H SN + + + DSID+ ++G VTPVK+QG CWAF
Sbjct: 88 VQKMTGLKVPPS-HSRSNDTLYIPDWEGRTP---DSIDYRKKGYVTPVKNQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 144 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R PVSVAIDA T F F
Sbjct: 204 YVG-QDENCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + + +W++KN WG NW G + + R
Sbjct: 260 YSKGVYYDENCNSDNLNHAVLAVGYGI----QKGKKHWIIKNSWGENWGNKGYILMARNK 315
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 316 NNA--CGIANLASFP 328
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 158 bits (400), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 164/330 (49%), Gaps = 47/330 (14%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W +++ YK++ E+ R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + P S W +N+ ++S DS+DW E+G VT
Sbjct: 79 HLGDMTGEEVISLMGSLRVP---------SQWQRNVTYRSNSNQKLPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
+YI + SE YPY+ C R + + Y + +E+ L++ V+ +
Sbjct: 190 QYIIDNNGIDSEASYPYKAMNG-KC---RYDSKKRAATCSKYTELPFGSEDALKEAVANK 245
Query: 222 -PVSVAIDATWFNF--YHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
PVSVAIDA+ ++F Y GV+ P C NHGV +VGYG + YWLVKN WG
Sbjct: 246 GPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKD----YWLVKNSWG 301
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
N+ + G +R+ R G C IA+ +YP
Sbjct: 302 LNFGDQGYIRMARNSGNH--CGIASYPSYP 329
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 158 bits (399), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 165/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
+ W + + Y + ++ R I++KN H + L +N D+T E+
Sbjct: 28 DLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H SN + + + S DS+D+ ++G VTPVK+QG CWAF
Sbjct: 88 VQKMTGLKVPPS-HSRSNDTLYIPDWESRAP---DSVDYRKKGYVTPVKNQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 144 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R P+SVAIDA T F F
Sbjct: 204 YVG-QDESCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 260 YSKGVYYDENCNSDNLNHAVLAVGYGI----QKGNKHWIIKNSWGENWGNKGYILMARNK 315
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 316 NNA--CGIANLASFP 328
>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
Length = 444
Score = 157 bits (396), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 100/301 (33%), Positives = 151/301 (50%), Gaps = 28/301 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKRHA----AQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC + AF+++ Q L +
Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E YPY Y + SS GA I G+ + + + + P+++A+DA+
Sbjct: 211 EDSYPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASS 270
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ G
Sbjct: 271 FMSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMG 326
Query: 292 V 292
V
Sbjct: 327 V 327
>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK PE=1 SV=1
Length = 329
Score = 156 bits (395), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 164/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W +++ Y + ++ R I++KN H + L +N D+T E+
Sbjct: 27 ELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 86
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ HSN + + + DSID+ ++G VTPVK+QG CWAF
Sbjct: 87 VQKMTGLKVPPS-RSHSNDTLYIPDWEGRTP---DSIDYRKKGYVTPVKNQGQCGSCWAF 142
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 143 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTNAFQYVQRNRGIDSEDAYP 202
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R PVSVAIDA T F F
Sbjct: 203 YVG-QDESCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQF 258
Query: 235 YHGGVF--TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ + NH V VGYG + +W++KN WG +W G + + R
Sbjct: 259 YSKGVYYDENCSSDNVNHAVLAVGYGI----QKGNKHWIIKNSWGESWGNKGYILMARNK 314
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 315 NNA--CGIANLASFP 327
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 156 bits (394), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 166/318 (52%), Gaps = 44/318 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E++ +F R Y D E+ R +F N ++ L +N+F+D+T EKF
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A GYK P P + F + +++ S +DW +GAVTPVKDQG CWAF
Sbjct: 81 NAVMKGYKKGP--RPAA----VFTSTDAAPES--TEVDWRTKGAVTPVKDQGQCGSCWAF 132
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN----GCAKNFLENAFEYIRQYQRLASEC 174
+ +EG + ++TG+LV+ S+ QLVDC+ + GC ++E A Y+R + +E
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTES 192
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATWFN 233
YPY+ R D C R +++ GY + +E L+ P+SVAIDA+ +
Sbjct: 193 SYPYEAR-DNTC---RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRS 248
Query: 234 F--YHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y+ GV+ P C ++ +H V VGYG+ EG Q +WLVKN W T+W E G +++
Sbjct: 249 FQSYYTGVYYEPSCSSSQLDHAVLAVGYGS----EGGQDFWLVKNSWATSWGESGYIKMA 304
Query: 290 RGVGGSGLCNIAANAAYP 307
R + C IA +A YP
Sbjct: 305 RNRNNN--CGIATDACYP 320
>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
Length = 443
Score = 155 bits (393), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 149/300 (49%), Gaps = 27/300 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKRHA----AQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC + AF+++ Q L +
Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I G+ + + + + P+++A+DA+ F
Sbjct: 211 EDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSF 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ GV
Sbjct: 271 MSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMGV 326
>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 214
Score = 155 bits (392), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 128/225 (56%), Gaps = 23/225 (10%)
Query: 92 FYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TL 149
+ +SIDW E+GAVTPVK+Q CWAF+ VAT+EG+NKI TGQL++ S+ +L+DC
Sbjct: 1 YPESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYRS 60
Query: 150 NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA---IRGYQYV 206
+GC + + +Y+ + +E YPY+ +Q R A K G I GY+YV
Sbjct: 61 HGCDGGYQTPSLQYVVD-NGVHTEREYPYEKKQG------RCRAKDKKGPKVYITGYKYV 113
Query: 207 QPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAE 264
E L ++ QPVSV D+ F FY GG++ GPCG +H VT VGYG T
Sbjct: 114 PANDEISLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT---- 169
Query: 265 GQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
Y L+KN WG NW E G +RI R G S G C + ++ +P+
Sbjct: 170 ----YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPI 210
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 155 bits (392), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 161/318 (50%), Gaps = 41/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + ++ R Y D E R IF++N ++ L +NKF D+T E+F
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A G P + S ++ + + +DW +GAVTPVKDQG CWAF
Sbjct: 81 NAVMKGNIP----RRSAPVSVFYPKKETGPQA--TEVDWRTKGAVTPVKDQGQCGSCWAF 134
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECV 175
+ ++EG + ++TG L++ ++ QLVDCS GC ++ +AF+YI+ + +E
Sbjct: 135 STTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAA 194
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA--TWF 232
YPY+ R D C + +S + G+ + +E GLQ V P+SV IDA + F
Sbjct: 195 YPYEAR-DGSCRFDSNSVA---ATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSF 250
Query: 233 NFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GV+ P + +H V VGYG+ EG Q +WLVKN W T+W + G +++ R
Sbjct: 251 QFYSSGVYYEPSCSPSYLDHAVLAVGYGS----EGGQDFWLVKNSWATSWGDAGYIKMSR 306
Query: 291 GVGGSGLCNIAANAAYPL 308
+ C IA A+YPL
Sbjct: 307 NRNNN--CGIATVASYPL 322
>sp|P83443|MDO1_PSEMR Macrodontain-1 OS=Pseudananas macrodontes PE=1 SV=1
Length = 213
Score = 155 bits (391), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 132/219 (60%), Gaps = 19/219 (8%)
Query: 95 SIDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGC 152
SIDW + GAV VK+QG C CWAF A+ATVEG+ KIR G LV S+ +++DC+ GC
Sbjct: 5 SIDWRDYGAVNEVKNQGP-CGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVSYGC 63
Query: 153 AKNFLENAFEYIRQYQRLASECVYPYQGRQDYY-CDWWRSSASGKYGAIRGYQYVQPATE 211
++ A+++I + ++ YPY+ Q +++ +SA I GY YV+ E
Sbjct: 64 KGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAY-----ITGYSYVRRNDE 118
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ VS QP++ IDA+ F +Y GGV++GPCG + NH +TI+GY G+ Y
Sbjct: 119 SHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGY-------GRDSY 171
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
W+V+N WG++W +GG +RI R V S G+C IA + +P
Sbjct: 172 WIVRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFP 210
>sp|Q90686|CATK_CHICK Cathepsin K OS=Gallus gallus GN=CTSK PE=2 SV=1
Length = 334
Score = 154 bits (390), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 105/273 (38%), Positives = 149/273 (54%), Gaps = 22/273 (8%)
Query: 43 HEF-LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNER 101
H F L +N D+T E+ + + TG + P P N + + + +S + ++DW +
Sbjct: 74 HSFQLAMNYLGDMTSEEVVRTMTGLRVP-RSRPRPNGTLYVPDWSSRAPA---AVDWRRK 129
Query: 102 GAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC-STLNGCAKNFLEN 159
G VTPVKDQG CWAF++V +EG K RTG+L++ S LV C S NGC ++ N
Sbjct: 130 GYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSNNNGCGGGYMTN 189
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AFEY+R + + SE YPY G QD C + S +GK RGY+ + E+ L+ V+
Sbjct: 190 AFEYVRLNRGIDSEDAYPYIG-QDESCMY---SPTGKAAKCRGYREIPEDNEKALKRAVA 245
Query: 220 R-QPVSVAIDATW--FNFYHGGVF--TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
R PVSV IDA+ F FY GV+ TG NH V VGYG A+ +W++KN
Sbjct: 246 RIGPVSVGIDASLPSFQFYSRGVYYDTGCNPENINHAVLAVGYG----AQKGTKHWIIKN 301
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WGT W G + + R + + C IA A++P
Sbjct: 302 SWGTEWGNKGYVLLARNMKQT--CGIANLASFP 332
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 154 bits (390), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 111/333 (33%), Positives = 164/333 (49%), Gaps = 53/333 (15%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + + P S W +N+ ++ DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------SQWQRNITYKSNPNRILPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
+YI + + S+ YPY+ D C + +R++ KY + E+ L++ V
Sbjct: 190 QYIIDNKGIDSDASYPYKA-MDQKCQYDSKYRAATCSKYTEL------PYGREDVLKEAV 242
Query: 219 SRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
+ + PVSV +DA F Y GV+ P C NHGV +VGYG E YWLVKN
Sbjct: 243 ANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKE----YWLVKN 298
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG N+ E G +R+ R G C IA+ +YP
Sbjct: 299 SWGHNFGEEGYIRMARNKGNH--CGIASFPSYP 329
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 154 bits (389), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 104/311 (33%), Positives = 162/311 (52%), Gaps = 31/311 (9%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG 65
WM + Y + E R++ FKKN ++ L LN+ ADL+ E++ +Y G
Sbjct: 37 WMRSNNKAYTHK-EFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLG 95
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
+ + + R N LN + ++DW E+ AVTPVKDQG C++F+ +V
Sbjct: 96 TRAHIKLNGYHKR-NLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSV 154
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
EG+ I+TG+LV+ S+ ++DCS+ GC + NAFEYI + L SE YPY+ +
Sbjct: 155 EGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMK 214
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGV 239
+ C + S + K I Y+ ++ E LQ+ + PVSVAIDA+ F Y GV
Sbjct: 215 VNDECKFQEGSVAAK---ITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGV 271
Query: 240 FTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
+ P ++ +HGV VG GT + + Y++VKN WG +W G + + R +
Sbjct: 272 YYEPACSSEDLDHGVLAVGMGT----DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNN-- 325
Query: 298 CNIAANAAYPL 308
C I+ A+YP+
Sbjct: 326 CGISTMASYPI 336
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 154 bits (388), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 110/334 (32%), Positives = 165/334 (49%), Gaps = 56/334 (16%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL----NSSKMSFYDSIDWNERGAVT 105
D+T E+ ++ + + P + W +N+ N ++M DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------NQWQRNITYKSNPNQM-LPDSVDWREKGCVT 128
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCS GC F+ AF
Sbjct: 129 EVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAF 188
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGL--QD 216
+YI + + SE YPY+ D C + +R++ KY + P E + +
Sbjct: 189 QYIIDNKGIDSEASYPYKA-TDQKCQYDSKYRAATCSKYTEL-------PYGREDVLKEA 240
Query: 217 VVSRQPVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVK 273
V ++ PV V +DA+ F Y GV+ P C NHGV ++GYG E YWLVK
Sbjct: 241 VANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDLNGKE----YWLVK 296
Query: 274 NRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
N WG+N+ E G +R+ R G C IA+ +YP
Sbjct: 297 NSWGSNFGEQGYIRMARNKGNH--CGIASYPSYP 328
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 154 bits (388), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/319 (33%), Positives = 163/319 (51%), Gaps = 47/319 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR----------------LNKFADLTREKF 59
+ + ++ R Y D E+ R ++F++N + + +N+F D+T E+F
Sbjct: 21 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCW 116
A GYK P + + +DW + VTPVKDQ GS CW
Sbjct: 81 NAVMKGYKKGSRGEPKAV-------FTAEAGPMAADVDWRTKALVTPVKDQEQCGS--CW 131
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF+A +EG + ++ +LV+ S+ QLVDCST +GC ++ +AF+YI+ + +E
Sbjct: 132 AFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTE 191
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDATWF 232
YPY+ +D C + +S GAI TEE LQ+ VS P+SVAIDA+ F
Sbjct: 192 SSYPYEA-EDRSCRFDANS----IGAICTGSVEVQHTEEALQEAVSGVGPISVAIDASHF 246
Query: 233 N--FYHGGV-FTGPCGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
+ FY GV + C T +HGV VGYGT E + YWLVKN WG++W + G +++
Sbjct: 247 SFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT----ESTKDYWLVKNSWGSSWGDAGYIKM 302
Query: 289 FRGVGGSGLCNIAANAAYP 307
R + C IA+ +YP
Sbjct: 303 SRNRDNN--CGIASEPSYP 319
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.133 0.434
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 121,383,992
Number of Sequences: 539616
Number of extensions: 5102007
Number of successful extensions: 11522
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 202
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 10472
Number of HSP's gapped (non-prelim): 236
length of query: 308
length of database: 191,569,459
effective HSP length: 117
effective length of query: 191
effective length of database: 128,434,387
effective search space: 24530967917
effective search space used: 24530967917
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)
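
Note on the statistics footer: the effective lengths and the bit values in parentheses can be reproduced from the other numbers printed above. The sketch below is a cross-check, not BLAST's own code. It assumes the usual layout in which the first "Lambda K H" row holds the ungapped parameters and the second row the gapped ones, and it uses the standard Karlin-Altschul conversion bits = (lambda * S - ln K) / ln 2. The variable and function names are illustrative only.

```python
import math

# Values copied from the statistics footer of this report.
query_length = 308
db_letters = 191_569_459
db_sequences = 539_616
effective_hsp_length = 117

# Each sequence (query and every database entry) is shortened by the
# effective HSP length to correct for edge effects.
effective_query_length = query_length - effective_hsp_length
effective_db_length = db_letters - db_sequences * effective_hsp_length
effective_search_space = effective_query_length * effective_db_length


def to_bits(raw_score, lam, k):
    """Convert a raw score to a bit score (Karlin-Altschul normalisation)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Assumption: first Lambda/K row is ungapped, second row is gapped.
ungapped = (0.319, 0.133)
gapped = (0.267, 0.0410)

print(effective_query_length)   # compare with "effective length of query"
print(effective_db_length)      # compare with "effective length of database"
print(effective_search_space)   # compare with "effective search space"
print(round(to_bits(41, *ungapped), 1))  # compare with the S1 line
print(round(to_bits(61, *gapped), 1))    # compare with the S2 line
```

The bit scores on individual hits will not always match this conversion exactly, because the footer prints Lambda and K rounded to three significant figures and the hits use compositional matrix adjustment.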