BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018781
(350 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 585 bits (1509), Expect = e-166, Method: Compositional matrix adjust.
Identities = 264/341 (77%), Positives = 306/341 (89%), Gaps = 1/341 (0%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
LL+++S S C + A DFSIVGY+PEHLT+ DKL+ELFESWMS+H K YK +EEK+HRF
Sbjct: 13 LLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRF 72
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGL-KPQFPTRRQPSAEFSYR 128
E+F+ENL HIDQRN E+ SYWLGLNEFAD++HEEFK +YLGL KPQF +RQPSA F YR
Sbjct: 73 EVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYR 132
Query: 129 DVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC 188
D+ LPKSVDWRKKGAV PVK+QG CGSCWAFSTVAAVEGINQI +GNL+SLSEQELIDC
Sbjct: 133 DITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192
Query: 189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP 248
DT+FN+GCNGGLMDYAF+YI+++GGLHKE+DYPYLMEEG C+++KE++E VTISGY+DVP
Sbjct: 193 DTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVP 252
Query: 249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDY 308
END++SL+KALAHQPVSVAIEASG DFQFY GGVF G CG +LDHGVAAVGYG SKGSDY
Sbjct: 253 ENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDY 312
Query: 309 IIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+IVKNSWGP+WGE+G+IRMKRNTGKPEGLCGINKMAS P K
Sbjct: 313 VIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPTK 353
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 535 bits (1378), Expect = e-151, Method: Compositional matrix adjust.
Identities = 253/326 (77%), Positives = 285/326 (87%), Gaps = 2/326 (0%)
Query: 26 HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
HD+SIVGYSPE L S DKLIELFE+W+S K Y+ +EEK RFE+FK+NLKHID+ NK+
Sbjct: 29 HDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKK 88
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS--AEFSYRDVKALPKSVDWRKKG 143
SYWLGLNEFAD+SHEEFK YLGLK R + AEF+YRDV+A+PKSVDWRKKG
Sbjct: 89 GKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKG 148
Query: 144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
AV VKNQGSCGSCWAFSTVAAVEGIN+IV+GNLT+LSEQELIDCDT++NNGCNGGLMDY
Sbjct: 149 AVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDY 208
Query: 204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
AF+YIV +GGL KEEDYPY MEEGTCE +K+E E VTI+G+QDVP NDE+SLLKALAHQP
Sbjct: 209 AFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQP 268
Query: 264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERG 323
+SVAI+ASG +FQFYSGGVF G CG +LDHGVAAVGYG SKGSDYIIVKNSWGPKWGE+G
Sbjct: 269 LSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKG 328
Query: 324 YIRMKRNTGKPEGLCGINKMASIPLK 349
YIR+KRNTGKPEGLCGINKMAS P K
Sbjct: 329 YIRLKRNTGKPEGLCGINKMASFPTK 354
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 380 bits (975), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 245/347 (70%), Gaps = 5/347 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SK++ L+ L + S A DF VGYS + LTS+++LI+LF+SWM KH K Y+ I+E
Sbjct: 6 SISKIIFLATCLIIHMGLSSA-DFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ--PS 122
K++RFEIF++NL +ID+ NK+ SYWLGLN FAD+S++EFK KY+G + T + +
Sbjct: 65 KIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDN 124
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+F+Y+ V P+S+DWR KGAVTPVKNQG+CGSCWAFST+A VEGIN+IV+GNL LSE
Sbjct: 125 EDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCD + GC GG + +Y VA+ G+H + YPY ++ C + V I+
Sbjct: 185 QELVDCD-KHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKIT 242
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
GY+ VP N E S L ALA+QP+SV +EA G FQ Y GVF GPCG +LDH V AVGYG
Sbjct: 243 GYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGT 302
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
S G +YII+KNSWGP WGE+GY+R+KR +G +G CG+ K + P K
Sbjct: 303 SDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 377 bits (969), Expect = e-104, Method: Compositional matrix adjust.
Identities = 183/357 (51%), Positives = 245/357 (68%), Gaps = 12/357 (3%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD------KLIELFESWMSK 54
M F + +L L++ A SS A D SI+ Y +H S +++ ++E+W+ K
Sbjct: 1 MGFLKPTMAILF---LAMVAVSS-AVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVK 56
Query: 55 HGK--TYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLK 112
HGK + + EK RFEIFK+NL+ +D+ N++ SY LGL FAD++++E+++KYLG K
Sbjct: 57 HGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAK 116
Query: 113 PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 172
+ R+ S + R LP+S+DWRKKGAV VK+QG CGSCWAFST+ AVEGINQI
Sbjct: 117 MEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQI 176
Query: 173 VSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK 232
V+G+L +LSEQEL+DCDTS+N GCNGGLMDYAF++I+ +GG+ ++DYPY +GTC+
Sbjct: 177 VTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQI 236
Query: 233 KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD 292
++ +VVTI Y+DVP E+SL KA+AHQP+S+AIEA G FQ Y G+F G CG +LD
Sbjct: 237 RKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLD 296
Query: 293 HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
HGV AVGYG G DY IV+NSWG WGE GY+RM RN G CGI S P+K
Sbjct: 297 HGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 371 bits (953), Expect = e-102, Method: Compositional matrix adjust.
Identities = 188/347 (54%), Positives = 236/347 (68%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
+ ++L+L + + ++ DF + + S + L EL+E W S H + +EEK
Sbjct: 3 RFIVLALCMLMVLETTKGLDFH-----NKDVESENSLWELYERWRSHH-TVARSLEEKAK 56
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQ----FPTRRQPSA 123
RF +FK N+KHI + NK+ SY L LN+F DM+ EEF+ Y G + F ++ +
Sbjct: 57 RFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATK 116
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y +V LP SVDWRK GAVTPVKNQG CGSCWAFSTV AVEGINQI + LTSLSEQ
Sbjct: 117 SFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQ 176
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCDT+ N GCNGGLMD AF++I GGL E YPY + TC+ KE VV+I G
Sbjct: 177 ELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDG 236
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++DVP+N E L+KA+A+QPVSVAI+A G+DFQFYS GVFTG CG EL+HGVA VGYG +
Sbjct: 237 HEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTT 296
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G+ Y IVKNSWG +WGE+GYIRM+R EGLCGI AS PLK
Sbjct: 297 IDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPLK 343
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 370 bits (949), Expect = e-101, Method: Compositional matrix adjust.
Identities = 178/328 (54%), Positives = 230/328 (70%), Gaps = 7/328 (2%)
Query: 27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE- 85
D SIV Y S ++ L+ W ++HGK+Y + E+ R+ F++NL++ID+ N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 86 ---VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK 142
V S+ LGLN FAD+++EE+++ YLGL+ + R+ S + D +ALP+SVDWR K
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTK 138
Query: 143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMD 202
GAV +K+QG CGSCWAFS +AAVEGINQIV+G+L SLSEQEL+DCDTS+N GCNGGLMD
Sbjct: 139 GAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 198
Query: 203 YAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
YAF +I+ +GG+ E+DYPY ++ C+ ++ +VVTI Y+DV N E SL KA+A+Q
Sbjct: 199 YAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQ 258
Query: 263 PVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGER 322
PVSVAIEA G FQ YS G+FTG CG LDHGVAAVGYG G DY IV+NSWG WGE
Sbjct: 259 PVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGES 318
Query: 323 GYIRMKRNTGKPEGLCGINKMASIPLKK 350
GY+RM+RN G CGI S PLKK
Sbjct: 319 GYVRMERNIKASSGKCGIAVEPSYPLKK 346
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 369 bits (948), Expect = e-101, Method: Compositional matrix adjust.
Identities = 190/346 (54%), Positives = 246/346 (71%), Gaps = 5/346 (1%)
Query: 5 SHSKLLLLSLSLSLFACSSLAH-DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIE 63
S SKLL +++ L F SL++ DFSIVGYS + LTS ++LI+LF SWM KH K YK ++
Sbjct: 6 SFSKLLFVAICL--FGHMSLSYCDFSIVGYSQDDLTSTERLIQLFNSWMLKHNKNYKNVD 63
Query: 64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA 123
EKL+RFEIFK+NLK+ID+RNK + YWLGLNEF+D+S++EFK KY+G P+ T +
Sbjct: 64 EKLYRFEIFKDNLKYIDERNKMINGYWLGLNEFSDLSNDEFKEKYVGSLPEDYTNQPYDE 123
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
EF D+ LP+SVDWR KGAVTPVK+QG C SCWAFSTVA VEGIN+I +GNL LSEQ
Sbjct: 124 EFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTGNLVELSEQ 183
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD + GCN G + +Y VA G+H YPY+ ++ TC + V +G
Sbjct: 184 ELVDCDKQ-SYGCNRGYQSTSLQY-VAQNGIHLRAKYPYIAKQQTCRANQVGGPKVKTNG 241
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
V N+E SLL A+AHQPVSV +E++G DFQ Y GG+F G CG ++DH V AVGYGKS
Sbjct: 242 VGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYGKS 301
Query: 304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G YI++KNSWGP WGE GYIR++R +G G+CG+ + + P+K
Sbjct: 302 GGKGYILIKNSWGPGWGENGYIRIRRASGNSPGVCGVYRSSYYPIK 347
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 360 bits (923), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 178/309 (57%), Positives = 216/309 (69%), Gaps = 6/309 (1%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
L+E W S H + + EK RF +FK N H+ NK Y L LN+FADM++ EF+N
Sbjct: 37 LYERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRN 95
Query: 107 KYLGLKPQ----FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
Y G K + F + + F Y V +P SVDWRKKGAVT VK+QG CGSCWAFST
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
+ AVEGINQI + L SLSEQEL+DCDT N GCNGGLMDYAF++I GG+ E +YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+GTC+ KE V+I G+++VPENDE +LLKA+A+QPVSVAI+A G+DFQFYS GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 283 FTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
FTG CG ELDHGVA VGYG + G+ Y VKNSWGP+WGE+GYIRM+R EGLCGI
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIA 335
Query: 342 KMASIPLKK 350
AS P+KK
Sbjct: 336 MEASYPIKK 344
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 360 bits (923), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 185/347 (53%), Positives = 234/347 (67%), Gaps = 11/347 (3%)
Query: 8 KLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLH 67
KLL + LS SL + + DF + L S + L +L+E W S H + + EK
Sbjct: 5 KLLWVVLSFSLVLGVANSFDFH-----DKDLASEESLWDLYERWRSHH-TVSRSLGEKHK 58
Query: 68 RFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT--RRQP--SA 123
RF +FK NL H+ NK Y L LN+FADM++ EF++ Y G K P R P +
Sbjct: 59 RFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENG 118
Query: 124 EFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQ 183
F Y V ++P SVDWRKKGAVT VK+QG CGSCWAFSTV AVEGINQI + L +LSEQ
Sbjct: 119 AFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQ 178
Query: 184 ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISG 243
EL+DCD N GCNGGLM+ AF++I GG+ E +YPY +EGTC+ K V+I G
Sbjct: 179 ELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDG 238
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
+++VP NDE +LLKA+A+QPVSVAI+A G+DFQFYS GVFTG C +L+HGVA VGYG +
Sbjct: 239 HENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTT 298
Query: 304 -KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G++Y IV+NSWGP+WGE GYIRM+RN K EGLCGI + S P+K
Sbjct: 299 VDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIK 345
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 359 bits (921), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 176/322 (54%), Positives = 225/322 (69%), Gaps = 6/322 (1%)
Query: 33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
+ + L S + L +L+E W S H + + EK RF +FK N+ H+ NK Y L
Sbjct: 25 FHEKDLESEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLK 83
Query: 93 LNEFADMSHEEFKNKYLGLK----PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
LN+FADM++ EF++ Y G K F + S F Y V ++P SVDWRKKGAVT V
Sbjct: 84 LNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDV 143
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
K+QG CGSCWAFST+ AVEGINQI + L SLSEQEL+DCD N GCNGGLM+ AF++I
Sbjct: 144 KDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFI 203
Query: 209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
GG+ E +YPY +EGTC++ K V+I G+++VP NDE +LLKA+A+QPVSVAI
Sbjct: 204 KQKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAI 263
Query: 269 EASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRM 327
+A G+DFQFYS GVFTG C +L+HGVA VGYG + G++Y IV+NSWGP+WGE+GYIRM
Sbjct: 264 DAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRM 323
Query: 328 KRNTGKPEGLCGINKMASIPLK 349
+RN K EGLCGI MAS P+K
Sbjct: 324 QRNISKKEGLCGIAMMASYPIK 345
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 356 bits (914), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 185/345 (53%), Positives = 237/345 (68%), Gaps = 3/345 (0%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKLL +++ L + S DFSIVGYS + LTS ++LI+LF SWM H K Y+ ++E
Sbjct: 6 SISKLLFVAICLFVHMSVSFG-DFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAE 124
KL+RFEIFK+NL +ID+ NK+ SYWLGLNEFAD+S++EF KY+G + E
Sbjct: 65 KLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEE 124
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
F D LP++VDWRKKGAVTPV++QGSCGSCWAFS VA VEGIN+I +G L LSEQE
Sbjct: 125 FINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQE 184
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
L+DC+ ++GC GG YA +Y VA G+H YPY ++GTC K+ +V SG
Sbjct: 185 LVDCERR-SHGCKGGYPPYALEY-VAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGV 242
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
V N+E +LL A+A QPVSV +E+ G FQ Y GG+F GPCG ++DH V AVGYGKS
Sbjct: 243 GRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSG 302
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G YI++KNSWG WGE+GYIR+KR G G+CG+ K + P K
Sbjct: 303 GKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTK 347
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 353 bits (905), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 177/347 (51%), Positives = 229/347 (65%), Gaps = 15/347 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
+ L SL + AC Y + + S + L L++ W S H + + E+ R
Sbjct: 7 IFLFSLVILQTACG--------FDYDDKEIESEEGLSTLYDRWRSHHS-VPRSLNEREKR 57
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP------QFPTRRQPS 122
F +F+ N+ H+ NK+ SY L LN+FAD++ EFKN Y G Q P R
Sbjct: 58 FNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQ 117
Query: 123 AEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSE 182
+ + ++ LP SVDWRKKGAVT +KNQG CGSCWAFSTVAAVEGIN+I + L SLSE
Sbjct: 118 FMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSE 177
Query: 183 QELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTIS 242
QEL+DCDT N GCNGGLM+ AF++I +GG+ E+ YPY +G C+ K+ +VTI
Sbjct: 178 QELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTID 237
Query: 243 GYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK 302
G++DVPENDE +LLKA+A+QPVSVAI+A +DFQFYS GVFTG CG EL+HGVAAVGYG
Sbjct: 238 GHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGS 297
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+G Y IV+NSWG +WGE GYI+++R +PEG CGI AS P+K
Sbjct: 298 ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIK 344
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 351 bits (900), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 182/343 (53%), Positives = 238/343 (69%), Gaps = 10/343 (2%)
Query: 14 LSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFK 73
++L+L A S L+ SI ++ + L S D L L+E W + H + ++EK RF +FK
Sbjct: 7 IALALVALSFLSIAQSIP-FTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFK 64
Query: 74 ENLKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ----PSAEFSYR 128
EN+K I + N K+ Y L LN+F DM+++EF++KY G K Q ++ + F Y
Sbjct: 65 ENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYE 124
Query: 129 DVKALPK-SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
+V +LP S+DWR KGAVT VK+QG CGSCWAFST+A+VEGINQI +G L SLSEQEL+D
Sbjct: 125 NVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVD 184
Query: 188 CDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDV 247
CDTS+N GCNGGLMDYAF++I G+ E+ YPY ++GTC VV+I G+QDV
Sbjct: 185 CDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDV 243
Query: 248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GS 306
P N+E +L++A+A+QP+SV+IEASG FQFYS GVFTG CG ELDHGVA VGYG ++ G+
Sbjct: 244 PANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGT 303
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
Y IVKNSWG +WGE GYIRM+R G CGI AS P+K
Sbjct: 304 KYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIK 346
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 350 bits (897), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 182/351 (51%), Positives = 236/351 (67%), Gaps = 18/351 (5%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEE 64
S SKLL +++ L ++ S DFSIVGYS LTS ++LI+LFESWM KH K YK I+E
Sbjct: 6 SISKLLFVAICLFVYMGLSFG-DFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDE 64
Query: 65 KLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSA 123
K++RFEIFK+NLK+ID+ NK+ SYWLGLN FADMS++EFK KY G + + T
Sbjct: 65 KIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTT-----T 119
Query: 124 EFSYRDV-----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
E SY +V +P+ VDWR+KGAVTPVKNQGSCGSCWAFS V +EGI +I +GNL
Sbjct: 120 ELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLN 179
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEV 238
SEQEL+DCD + GCNGG A + +VA G+H YPY + C +++
Sbjct: 180 EYSEQELLDCDRR-SYGCNGGYPWSALQ-LVAQYGIHYRNTYPYEGVQRYCRSREKGPYA 237
Query: 239 VTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAV 298
G + V +E +LL ++A+QPVSV +EA+G DFQ Y GG+F GPCG ++DH VAAV
Sbjct: 238 AKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAV 297
Query: 299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
GY G +YI++KNSWG WGE GYIR+KR TG G+CG+ + P+K
Sbjct: 298 GY----GPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVK 344
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 340 bits (873), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 227/345 (65%), Gaps = 14/345 (4%)
Query: 18 LFACSSLAHDFSIVGYSPEHLT-------SMDKLIELFESWMSKHG--KTYKCIEEKLHR 68
+ ++ A D SI+ Y+ EH + + ++ W++++G E R
Sbjct: 15 IVGAATAAPDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERR 74
Query: 69 FEIFKENLKHIDQRN---KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEF 125
F +F +NLK +D N E + LG+N FAD+++EEF+ +LG K R +
Sbjct: 75 FLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKVA-ERSRAAGERY 133
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+ V+ LP+SVDWR+KGAV PVKNQG CGSCWAFS V+ VE INQ+V+G + +LSEQEL
Sbjct: 134 RHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQEL 193
Query: 186 IDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
++C T+ N+GCNGGLMD AF +I+ +GG+ E+DYPY +G C+ +E +VV+I G+
Sbjct: 194 VECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGF 253
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK 304
+DVP+NDE+SL KA+AHQPVSVAIEA G +FQ Y GVF+G CG LDHGV AVGYG
Sbjct: 254 EDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDN 313
Query: 305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
G DY IV+NSWGPKWGE GY+RM+RN G CGI MAS P K
Sbjct: 314 GKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPTK 358
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 337 bits (863), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 170/361 (47%), Positives = 226/361 (62%), Gaps = 16/361 (4%)
Query: 4 FSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMD----------KLIELFESWMS 53
++ S +L+ L+L + +C++ A D S+V + H + + +FESWM
Sbjct: 3 YAKSAMLIFLLALVIASCAT-AMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMV 61
Query: 54 KHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP 113
KHGK Y + EK R IF++NL+ I RN E SY LGLN FAD+S E+ G P
Sbjct: 62 KHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEICHGADP 121
Query: 114 QFPTRR---QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN 170
+ P S + D LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N
Sbjct: 122 RPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLN 181
Query: 171 QIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE 230
+IV+G L +LSEQ+LI+C+ NNGC GG ++ A+++I+ +GGL + DYPY G CE
Sbjct: 182 KIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCE 240
Query: 231 DK-KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA 289
+ KE+ + V I GY+++P NDE +L+KA+AHQPV+ +++S +FQ Y GVF G CG
Sbjct: 241 GRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGT 300
Query: 290 ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
L+HGV VGYG G DY IVKNS G WGE GY++M RN P GLCGI AS PLK
Sbjct: 301 NLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPLK 360
Query: 350 K 350
Sbjct: 361 N 361
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 336 bits (862), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 166/321 (51%), Positives = 221/321 (68%), Gaps = 14/321 (4%)
Query: 42 DKLIELFESWMSKHGKTYKCIE----EKLHRFEIFKENLKHIDQRNKEV--TSYWLGLNE 95
+++ ++ W ++HGKT ++ RF IFK+NL+ ID N++ +Y LGL +
Sbjct: 43 EEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTK 102
Query: 96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR------DVKALPKSVDWRKKGAVTPVK 149
F D++++E++ YLG + + P RR A+ + + K +P++VDWR+KGAV P+K
Sbjct: 103 FTDLTNDEYRKLYLGARTE-PARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIK 161
Query: 150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
+QG+CGSCWAFST AAVEGIN+IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I+
Sbjct: 162 DQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIM 221
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
+GGL+ E+DYPY G C + VV+I GY+DVP DE +L KA+++QPVSVAIE
Sbjct: 222 KNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIE 281
Query: 270 ASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
A G FQ Y G+FTG CG LDH V AVGYG G DY IV+NSWGP+WGE GYIRM+R
Sbjct: 282 AGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMER 341
Query: 330 N-TGKPEGLCGINKMASIPLK 349
N G CGI AS P+K
Sbjct: 342 NLAASKSGKCGIAVEASYPVK 362
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 336 bits (861), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 162/317 (51%), Positives = 216/317 (68%), Gaps = 13/317 (4%)
Query: 45 IELFESWMSKHGKTYK----CIEEKLHRFEIFKENLKHID--QRNKEVTSYWLGLNEFAD 98
+ ++ W +HGK+ I ++ RF IFK+NL+ ID N + +Y LGL FA+
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 MSHEEFKNKYLGLKPQFPTRRQPSAE------FSYRDVKALPKSVDWRKKGAVTPVKNQG 152
++++E+++ YLG + + P RR A+ + +V +P +VDWR+KGAV +K+QG
Sbjct: 61 LTNDEYRSLYLGARTE-PVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASG 212
+CGSCWAFST AAVEGIN+IV+G L SLSEQEL+DCD S+N GCNGGLMDYAF++I+ +G
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
GL+ E+DYPY G C + VVTI GY+DVP DE +L +A+++QPVSVAI+A G
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 273 TDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
FQ Y G+FTG CG +DH V AVGYG G DY IV+NSWG +WGE GYIRM+RN
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299
Query: 333 KPEGLCGINKMASIPLK 349
G CGI AS P+K
Sbjct: 300 SKSGKCGIAIEASYPVK 316
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 332 bits (850), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 224/353 (63%), Gaps = 9/353 (2%)
Query: 5 SHSKLLLLSLSLSLFACSSLAHDFSIVGYSPE---HLTSMDKLIELFESWMSKHGKTYKC 61
+ S +L+L +++ + +C++ A D S+V Y H + +FESWM KHGK Y
Sbjct: 4 AKSAMLILLVAMVIASCAT-AIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGS 62
Query: 62 IEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR-- 119
+ EK R IF++NL+ I+ RN E SY LGL FAD+S E+K G P+ P
Sbjct: 63 VAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVF 122
Query: 120 -QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
S + LPKSVDWR +GAVT VK+QG C SCWAFSTV AVEG+N+IV+G L
Sbjct: 123 MTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELV 182
Query: 179 SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK-KEEME 237
+LSEQ+LI+C+ NNGC GG ++ A+++I+ +GGL + DYPY G C+ + KE +
Sbjct: 183 TLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNK 241
Query: 238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAA 297
V I GY+++P NDE +L+KA+AHQPV+ I++S +FQ Y GVF G CG L+HGV
Sbjct: 242 NVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVV 301
Query: 298 VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
VGYG G DY +VKNS G WGE GY++M RN P GLCGI AS PLK
Sbjct: 302 VGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLKN 354
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 332 bits (850), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 159/307 (51%), Positives = 208/307 (67%), Gaps = 4/307 (1%)
Query: 47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-EVTSYWLGLNEFADMSHEEFK 105
++E W+ ++ K Y + EK RF+IFK+NLK +D+ N ++ +GL FAD+++EEF+
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
YL K + + + Y++ LP VDWR GAV VK+QG+CGSCWAFS V A
Sbjct: 103 AIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGA 162
Query: 166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
VEGINQI +G L SLSEQEL+DCD F N GC+GG+M+YAF++I+ +GG+ ++DYPY
Sbjct: 163 VEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNA 222
Query: 225 EE-GTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
+ G C DK VVTI GY+DVP +DE+SL KA+AHQPVSVAIEAS FQ Y GV
Sbjct: 223 NDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGV 282
Query: 283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
TG CG LDHGV VGYG + G DY I++NSWG WG+ GY++++RN P G CGI
Sbjct: 283 MTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAM 342
Query: 343 MASIPLK 349
M S P K
Sbjct: 343 MPSYPTK 349
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 330 bits (847), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 168/294 (57%), Positives = 202/294 (68%), Gaps = 8/294 (2%)
Query: 62 IEEKLHRFEIFKENLKHIDQRNK---EVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
I E RF +F +NLK +D N E + LG+N FAD+++ EF+ YLG P R
Sbjct: 82 IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGR 141
Query: 119 RQPSAEFSYRDVKALPKSVDWRKKGAVT-PVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
R A + + V+ALP SVDWR KGAV PVKNQG CGSCWAFS VAAVEGIN+IV+G L
Sbjct: 142 RVGEA-YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200
Query: 178 TSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQEL++C + N+GCNGG+MD AF +I +GGL EEDYPY +G C K
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSR 260
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+VV+I G++DVPENDE SL KA+AHQPVSVAI+A G +FQ Y GVFTG CG LDHGV
Sbjct: 261 KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVV 320
Query: 297 AVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
AVGYG + G+ Y V+NSWGP WGE GYIRM+RN G CGI MAS P+
Sbjct: 321 AVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 323 bits (827), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 171/347 (49%), Positives = 226/347 (65%), Gaps = 14/347 (4%)
Query: 9 LLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHR 68
++L+S LSL S DF + L + + + +L+E W H + + E + R
Sbjct: 6 IVLISF-LSLLQASK-GFDFD-----EKELETEENVWKLYERWRGHHSVS-RASHEAIKR 57
Query: 69 FEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLG--LKPQFPTR--RQPSAE 124
F +F+ N+ H+ + NK+ Y L +N FAD++H EF++ Y G +K R ++ S
Sbjct: 58 FNVFRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGG 117
Query: 125 FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQE 184
F Y +V +P SVDWR+KGAVT VKNQ CGSCWAFSTVAAVEGIN+I + L SLSEQE
Sbjct: 118 FMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQE 177
Query: 185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT-CEDKKEEMEVVTISG 243
L+DCDT N GC GGLM+ AF++I +GG+ EE YPY + C E VTI G
Sbjct: 178 LVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDG 237
Query: 244 YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS 303
++ VPENDE+ LLKA+AHQPVSVAI+A +DFQ YS GVF G CG +L+HGV VGYG++
Sbjct: 238 HEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGET 297
Query: 304 K-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
K G+ Y IV+NSWGP+WGE GY+R++R + EG CGI AS P K
Sbjct: 298 KNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTK 344
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 322 bits (824), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 172/320 (53%), Positives = 209/320 (65%), Gaps = 9/320 (2%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEF 96
L S + L +L+E W S H + + EK RF FK N I NK Y L LN F
Sbjct: 36 LESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRF 94
Query: 97 ADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYR--DVKALPKSVDWRKKGAVTPVKNQGS 153
DM EF+ ++G L+ P++ F Y +V LP SVDWR+KGAVT VK+QG
Sbjct: 95 GDMDQAEFRATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
CGSCWAFSTV +VEGIN I +G+L SLSEQELIDCDT+ N+GC GGLMD AF+YI +GG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 214 LHKEEDYPYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
L E YPY GTC + VV I G+QDVP N E+ L +A+A+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKR 329
SG F FYS GVFTG CG ELDHGVA VGYG ++ G Y VKNSWGP WGE+GYIR+++
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334
Query: 330 NTGKPEGLCGINKMASIPLK 349
++G GLCGI AS P+K
Sbjct: 335 DSGASGGLCGIAMEASYPVK 354
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 321 bits (822), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 172/320 (53%), Positives = 208/320 (65%), Gaps = 9/320 (2%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEF 96
L S + L +L+E W S H + + EK RF FK N I NK Y L LN F
Sbjct: 36 LESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRF 94
Query: 97 ADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYR--DVKALPKSVDWRKKGAVTPVKNQGS 153
DM EF+ ++G L+ P + F Y +V LP SVDWR+KGAVT VK+QG
Sbjct: 95 GDMDQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
CGSCWAFSTV +VEGIN I +G+L SLSEQELIDCDT+ N+GC GGLMD AF+YI +GG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 214 LHKEEDYPYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
L E YPY GTC + VV I G+QDVP N E+ L +A+A+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKR 329
SG F FYS GVFTG CG ELDHGVA VGYG ++ G Y VKNSWGP WGE+GYIR+++
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334
Query: 330 NTGKPEGLCGINKMASIPLK 349
++G GLCGI AS P+K
Sbjct: 335 DSGASGGLCGIAMEASYPVK 354
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 317 bits (811), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 162/353 (45%), Positives = 224/353 (63%), Gaps = 23/353 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKT 58
M+ S LL+LSL+ ++ ++LT + D++ ++ESW+ K+GK+
Sbjct: 10 MSLLFFSTLLILSLA-----------------FNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y + E RFEIFKE L+ ID+ N + SY +GLN+FAD++ EEF++ YLG
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSG-SN 111
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + S + R + LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L
Sbjct: 112 KTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVL 171
Query: 178 TSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDC + N GCNGG + F++I+ +GG++ EE+YPY ++G C +
Sbjct: 172 ISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNE 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ VTI Y++VP N+E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V
Sbjct: 232 KYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVT 291
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 292 IVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 343
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 313 bits (801), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 161/353 (45%), Positives = 223/353 (63%), Gaps = 23/353 (6%)
Query: 1 MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLT--SMDKLIELFESWMSKHGKT 58
M+ S LL+LSL+ ++ ++LT + D++ ++ESW+ K+GK+
Sbjct: 10 MSLLFFSTLLILSLA-----------------FNAKNLTQRTNDEVKAMYESWLIKYGKS 52
Query: 59 YKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKNKYLGLKPQFPT 117
Y + E RFEIFKE L+ ID+ N + SY +GLN+FAD++ EEF++ YL
Sbjct: 53 YNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSG-SN 111
Query: 118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
+ + S + R + LP VDWR GAV +K+QG CG CWAFS +A VEGIN+IV+G L
Sbjct: 112 KTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVL 171
Query: 178 TSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEM 236
SLSEQELIDC + N GCNGG + F++I+ +GG++ EE+YPY ++G C +
Sbjct: 172 ISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNE 231
Query: 237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVA 296
+ VTI Y++VP N+E +L A+ +QPVSVA++A+G F+ YS G+FTGPCG +DH V
Sbjct: 232 KYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVT 291
Query: 297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
VGYG G DY IVKNSW WGE GY+R+ RN G G CGI M S P+K
Sbjct: 292 IVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVK 343
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 300 bits (767), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 145/334 (43%), Positives = 212/334 (63%), Gaps = 11/334 (3%)
Query: 16 LSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKEN 75
L LF C+ A + P D +++ FE WM+++G+ YK +EK+ RF+IFK N
Sbjct: 10 LFLFLCAMWASPSAASRDEPN-----DPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNN 64
Query: 76 LKHIDQRN-KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALP 134
+KHI+ N + SY LG+N+F DM+ EF +Y G+ R+P F ++ A+P
Sbjct: 65 VKHIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVP 124
Query: 135 KSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN 194
+S+DWR GAV VKNQ CGSCW+F+ +A VEGI +I +G L SLSEQE++DC S+
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY-- 182
Query: 195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
GC GG ++ A+ +I+++ G+ EE+YPYL +GTC + I+GY V NDE+S
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTC-NANSFPNSAYITGYSYVRRNDERS 241
Query: 255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKN 313
++ A+++QP++ I+AS +FQ+Y+GGVF+GPCG L+H + +GYG+ S G+ Y IV+N
Sbjct: 242 MMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRN 300
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
SWG WGE GY+RM R G+CGI P
Sbjct: 301 SWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 288 bits (738), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 135/301 (44%), Positives = 200/301 (66%), Gaps = 6/301 (1%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMS 100
D +++ FE WM+++G+ YK +EK+ RF+IFK N+ HI+ N+ SY LG+N+F DM+
Sbjct: 31 DPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMT 90
Query: 101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
+ EF +Y GL +R+P F D+ ++P+S+DWR GAVT VKNQG CGSCWAF
Sbjct: 91 NNEFVAQYTGLSLPLNIKREPVVSFDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAF 150
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
+++A VE I +I GNL SLSEQ+++DC S+ GC GG ++ A+ +I+++ G+ Y
Sbjct: 151 ASIATVESIYKIKRGNLVSLSEQQVLDCAVSY--GCKGGWINKAYSFIISNKGVASAAIY 208
Query: 221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
PY +GTC+ I+ Y V N+E++++ A+++QP++ A++ASG +FQ Y
Sbjct: 209 PYKAAKGTCKTNGVPNSAY-ITRYTYVQRNNERNMMYAVSNQPIAAALDASG-NFQHYKR 266
Query: 281 GVFTGPCGAELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
GVFTGPCG L+H + +GYG+ S G + IV+NSWG WGE GYIR+ R+ GLCG
Sbjct: 267 GVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGLCG 326
Query: 340 I 340
I
Sbjct: 327 I 327
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 278 bits (710), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 125/218 (57%), Positives = 161/218 (73%)
Query: 132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
+LP+S+DWR+KG + VK+QGSCGSCWAFS VAA+E IN IV+GNL SLSEQEL+DCD S
Sbjct: 17 SLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS 76
Query: 192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
+N GC+GGLMDYAF++++ +GG+ EEDYPY G C+ ++ +VV I Y+DVP N+
Sbjct: 77 YNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNN 136
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
E++L KA+AHQPVS+A+EA G DFQ Y G+FTG CG +DHGV GYG G DY IV
Sbjct: 137 EKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIV 196
Query: 312 KNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
+NSWG E GY+R++RN GLCG+ S P+K
Sbjct: 197 RNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPVK 234
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 270 bits (690), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 126/218 (57%), Positives = 161/218 (73%), Gaps = 2/218 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP S+DWR+ GAV PVKNQG CGSCWAFSTVAAVEGINQIV+G+L SLSEQ+L+DC T+
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TTA 61
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N+GC GG M+ AF++IV +GG++ EE YPY ++G C + VV+I Y++VP ++E
Sbjct: 62 NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGIC-NSTVNAPVVSIDSYENVPSHNE 120
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
QSL KA+A+QPVSV ++A+G DFQ Y G+FTG C +H + VGYG D+ IVK
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVK 180
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
NSWG WGE GYIR +RN P+G CGI + AS P+KK
Sbjct: 181 NSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVKK 218
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 270 bits (689), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 203/315 (64%), Gaps = 11/315 (3%)
Query: 43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSH 101
+++ ++E W+ ++GK Y + EK RF+IFK+NLK I++ N + SY GLN+F+D++
Sbjct: 36 EVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTA 95
Query: 102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP-VKNQGSCGSCWAF 160
+EF+ YLG K + + + + Y++ LP VDWR++GAV P VK QG CGSCWAF
Sbjct: 96 DEFQASYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAF 155
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
+ AVEGINQI +G L SLSEQELIDCD +N GC GG +AF++I +GG+ +E
Sbjct: 156 AATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEV 215
Query: 220 YPYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
Y Y E+ T K EM+ VVTI+G++ VP NDE SL KA+A+QP+SV I A+ +
Sbjct: 216 YGYTGED-TAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA--NMS 272
Query: 277 FYSGGVFTGPCGAEL-DHGVAAVGYGKSKG-SDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
Y GV+ G C DH V VGYG S DY +++NSWGP+WGE GY+R++RN +P
Sbjct: 273 DYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEP 332
Query: 335 EGLCGINKMASIPLK 349
G C + P+K
Sbjct: 333 TGKCAVAVAPVYPIK 347
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 261 bits (668), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 150/346 (43%), Positives = 202/346 (58%), Gaps = 17/346 (4%)
Query: 10 LLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRF 69
+ LS++L +F L+ F G H D I+ WM + K Y +E + R+
Sbjct: 1 MRLSITL-IFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTH-KEFMPRY 54
Query: 70 EIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPT----RRQPSAEF 125
E FK+N+ ++ N + + LGLN+ AD+S+EE++ YLG + +R
Sbjct: 55 EEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRL 114
Query: 126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
+ K P +VDWR+K AVTPVK+QG CGSC++FST +VEG+ I +G L SLSEQ +
Sbjct: 115 NRPQFKQ-PLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNI 173
Query: 186 IDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
+DC +SF N GCNGGLM AF+YI+ + GL+ EE YPY M+ +E I+ Y
Sbjct: 174 LDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSY 233
Query: 245 QDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGK 302
+++ DE L AL PVSVAI+AS FQ Y+ GV+ P C +E LDHGV AVG G
Sbjct: 234 KEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGT 293
Query: 303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
G DY IVKNSWGP WG GYI M RN + CGI+ MAS P+
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARN---KDNNCGISTMASYPI 336
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 261 bits (666), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 121/217 (55%), Positives = 155/217 (71%), Gaps = 3/217 (1%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP VDWR KGAV +KNQ CGSCWAFS VAAVE IN+I +G L SLSEQEL+DCDT+
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
++GCNGG M+ AF+YI+ +GG+ +++YPY +G+C K + VV+I+G+Q V N+E
Sbjct: 60 SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSC--KPYRLRVVSINGFQRVTRNNE 117
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+L A+A QPVSV +EA+G FQ YS G+FTGPCG +HGV VGYG G +Y IV+
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVR 177
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
NSWG WG +GYI M+RN GLCGI ++ S P K
Sbjct: 178 NSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPTK 214
>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 214
Score = 255 bits (651), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 117/216 (54%), Positives = 161/216 (74%), Gaps = 6/216 (2%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR+KGAVTPVKNQ CGSCWAFSTVA +EGIN+I++G L SLSEQEL+DC+ +
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYR-S 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GC+GG + +Y+V +G +H E +YPY ++G C K ++ V I+GY+ VP NDE
Sbjct: 61 HGCDGGYQTPSLQYVVDNG-VHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEI 119
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
SL++A+A+QPVSV ++ G FQFY GG++ GPCG DH V AVGYGK+ Y+++KN
Sbjct: 120 SLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT----YLLLKN 175
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE+GYIR+KR +G+ +G CG+ + P+K
Sbjct: 176 SWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPIK 211
>sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 215
Score = 254 bits (650), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 118/216 (54%), Positives = 158/216 (73%), Gaps = 5/216 (2%)
Query: 134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
P+S+DWR KGAVTPVKNQ CGSCWAFSTVA VEGIN+I +G L SLSEQEL+DCD +
Sbjct: 2 PESIDWRDKGAVTPVKNQNPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR-S 60
Query: 194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
+GC GG + +Y+ +GG+H E++YPY ++G C K+++ V I+GY+ VP NDE
Sbjct: 61 HGCKGGYQTGSIQYVADNGGVHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEI 120
Query: 254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
SL++ + +QPVSV E+ G FQ Y GG+F GPCG + DH V A+GYGK++ ++ KN
Sbjct: 121 SLIQGIGNQPVSVLHESKGRAFQLYKGGIFNGPCGYKNDHAVTAIGYGKAQ----LLDKN 176
Query: 314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
SWGP WGE+GYI++KR +GK EG CG+ K + P+K
Sbjct: 177 SWGPNWGEKGYIKIKRASGKSEGTCGVYKSSYFPIK 212
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 252 bits (644), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 122/218 (55%), Positives = 151/218 (69%), Gaps = 2/218 (0%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP S+DWR+KGAV PVKNQG CGSCWAF +AAVEGINQIV+G+L SLSEQ+L+DC T
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR- 61
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N+GC GG AF+YI+ +GG++ EE YPY GTC D KE VV+I Y++VP NDE
Sbjct: 62 NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTC-DTKENAHVVSIDSYRNVPSNDE 120
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+SL KA+A+QPVSV ++A+G DFQ Y G+FTG C +H G DY VK
Sbjct: 121 KSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWTVK 180
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
NSWG WGE GYIR++RN + G CGI S P+K+
Sbjct: 181 NSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPIKE 218
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 248 bits (634), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 197/323 (60%), Gaps = 21/323 (6%)
Query: 42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFA 97
D ++E + ++ +H K Y+ E+ R +IF EN I + N+ S+ L +N++A
Sbjct: 53 DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 112
Query: 98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK-------ALPKSVDWRKKGAVTPVKN 150
D+ H EF+ G + + + E S++ V LPKSVDWR KGAVT VK+
Sbjct: 113 DLLHHEFRQLMNGFNYTLHKQLRAADE-SFKGVTFISPAHVTLPKSVDWRTKGAVTAVKD 171
Query: 151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
QG CGSCWAFS+ A+EG + SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct: 172 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 231
Query: 210 ASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAI 268
+GG+ E+ YPY + +C K + T G+ D+P+ DE+ + +A+A PVSVAI
Sbjct: 232 DNGGIDTEKSYPYEAIDDSCHFNKGTVG-ATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290
Query: 269 EASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYI 325
+AS FQFYS GV+ P C A+ LDHGV VG+G + G DY +VKNSWG WG++G+I
Sbjct: 291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 350
Query: 326 RMKRNTGKPEGLCGINKMASIPL 348
+M RN E CGI +S PL
Sbjct: 351 KMLRN---KENQCGIASASSYPL 370
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 247 bits (631), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 180/322 (55%), Gaps = 29/322 (9%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F WM H K+Y EE R+ IFK N+ ++ Q N + + LGLN FAD+++EE++N
Sbjct: 30 FTDWMITHQKSYTS-EEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRNT 88
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
YLG K + E + A K DWR +GAVTPVKNQG CG CW+FST + E
Sbjct: 89 YLGTKFDASSLIGTQEEKVFTTSSAASK--DWRSEGAVTPVKNQGQCGGCWSFSTTGSTE 146
Query: 168 GINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG 227
G + G L SLSEQ LIDC T N+GC+GGLM YAF+YI+ + G+ E YPY E G
Sbjct: 147 GAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG 205
Query: 228 TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP- 286
CE K E T+S Y+ V E SL A+ PVSVAI+AS FQ Y+ G++ P
Sbjct: 206 KCEYKSEN-SGATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPE 264
Query: 287 CGAE-LDHGVAAVGY-------------------GKSKGSDYIIVKNSWGPKWGERGYIR 326
C +E LDHGV AVGY S ++Y IVKNSWG WG GYI
Sbjct: 265 CSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYIL 324
Query: 327 MKRNTGKPEGLCGINKMASIPL 348
M RN + CGI AS P+
Sbjct: 325 MSRN---RDNNCGIASSASFPV 343
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 247 bits (630), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 193/324 (59%), Gaps = 18/324 (5%)
Query: 38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGL 93
++ +D + E + ++ +H K Y E+ R +IF EN I + N+ SY LGL
Sbjct: 18 ISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGL 77
Query: 94 NEFADMSHEEFK---NKYLGLKPQFPTRRQPSAEFSYRDVK--ALPKSVDWRKKGAVTPV 148
N++ADM H EFK N Y Q R +Y +PKSVDWR+ GAVT V
Sbjct: 78 NKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGV 137
Query: 149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKY 207
K+QG CGSCWAFS+ A+EG + +G L SLSEQ L+DC T + NNGCNGGLMD AF+Y
Sbjct: 138 KDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRY 197
Query: 208 IVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSV 266
I +GG+ E+ YPY + +C K + T +G+ D+PE DE+ + KA+A PVSV
Sbjct: 198 IKDNGGIDTEKSYPYEGIDDSCHFNKATIG-ATDTGFVDIPEGDEEKMKKAVATMGPVSV 256
Query: 267 AIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERG 323
AI+AS FQ YS GV+ P C + LDHGV VGYG + G DY +VKNSWG WGE+G
Sbjct: 257 AIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQG 316
Query: 324 YIRMKRNTGKPEGLCGINKMASIP 347
YI+M RN CGI +S P
Sbjct: 317 YIKMARNQNNQ---CGIATASSYP 337
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 241 bits (616), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 121/217 (55%), Positives = 155/217 (71%), Gaps = 10/217 (4%)
Query: 133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
LP+ +DWRKKGAVTPVKNQGSCGSCWAFSTV+ VE INQI +GNL SLSEQEL+DCD
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
N+GC GG +A++YI+ +GG+ + +YPY +G C+ +VV+I GY VP +E
Sbjct: 60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAAS---KVVSIDGYNGVPFCNE 116
Query: 253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
+L +A+A QP +VAI+AS FQ YS G+F+GPCG +L+HGV VGY ++Y IV+
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGY----QANYWIVR 172
Query: 313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
NSWG WGE+GYIRM R G GLCGI ++ P K
Sbjct: 173 NSWGRYWGEKGYIRMLRVGGC--GLCGIARLPYYPTK 207
>sp|Q54TR1|CFAD_DICDI Counting factor associated protein D OS=Dictyostelium discoideum
GN=cfaD PE=1 SV=1
Length = 531
Score = 241 bits (616), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 183/311 (58%), Gaps = 14/311 (4%)
Query: 46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
LF+ + +++ K Y +E RF FK K I N + +SY LG+N +AD+S++EF
Sbjct: 223 NLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYADLSNKEFN 282
Query: 106 NKYLGLKPQFPTRRQPSAEFSYRD--VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
+KP+ A+ + D ++++P +VDWR + VTPVK+QG CGSCW F +
Sbjct: 283 TL---VKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGST 339
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
++EG N + +G L SLSEQ+L+DC + + GC GG AF+Y++ G L E +YPY
Sbjct: 340 GSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPY 399
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
LM+ G C D+ V+I+GY +V E +L A+A PV++AI+AS DF++Y G
Sbjct: 400 LMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSG 459
Query: 282 VFTGPCGA----ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
V+ P +LDH V A+GYG +G DY +VKNSW WG GY+ M RN L
Sbjct: 460 VYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYVYMARNDNN---L 516
Query: 338 CGINKMASIPL 348
CG++ A+ P+
Sbjct: 517 CGVSSQATYPI 527
>sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max PE=1 SV=1
Length = 379
Score = 238 bits (608), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 196/344 (56%), Gaps = 32/344 (9%)
Query: 29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN---KE 85
SI+ T+ ++ LF+ W S+HG+ Y EE+ R EIFK N +I N K
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKS 84
Query: 86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA-------LPKSVD 138
S+ LGLN+FAD++ +EF KYL Q P + + + +K P S D
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYL----QAPKDVSQQIKMANKKMKKEQYSCDHPPASWD 140
Query: 139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
WRKKG +T VK QG CG WAFS A+E + I +G+L SLSEQEL+DC + G
Sbjct: 141 WRKKGVITQVKYQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDC-VEESEGSYN 199
Query: 199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND------- 251
G +F++++ GG+ ++DYPY +EG C+ K + + VTI GY+ + +D
Sbjct: 200 GWQYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKIQ-DKVTIDGYETLIMSDESTESET 258
Query: 252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-----PCGAELDHGVAAVGYGKSKGS 306
EQ+ L A+ QP+SV+I+A DF Y+GG++ G P G ++H V VGYG + G
Sbjct: 259 EQAFLSAILEQPISVSIDAK--DFHLYTGGIYDGENCTSPYG--INHFVLLVGYGSADGV 314
Query: 307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
DY I KNSWG WGE GYI ++RNTG G+CG+N AS P K+
Sbjct: 315 DYWIAKNSWGFDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTKE 358
>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
Length = 335
Score = 237 bits (605), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 185/308 (60%), Gaps = 17/308 (5%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
F+SWM +H K Y EE HR + F NL+ I+ N ++ +GLN+F+DMS +E K K
Sbjct: 35 FQSWMVQHQKKYSS-EEYYHRLQAFASNLREINAHNARNHTFKMGLNQFSDMSFDELKRK 93
Query: 108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAAV 166
YL +PQ + + + R P S+DWRKKG VTPVKNQGSCGSCW FST A+
Sbjct: 94 YLWSEPQNCSATKSN---YLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTTGAL 150
Query: 167 EGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
E I +G L L+EQ+L+DC +FNN GC GGL AF+YI + G+ E+ YPY +
Sbjct: 151 ESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGQ 210
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVFT 284
+G C+ + + + + ++ NDE+++++A+A H PVS A E + DF Y G+++
Sbjct: 211 DGDCKYQPSK-AIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTA-DFMMYRKGIYS 268
Query: 285 GP----CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
+++H V AVGYG+ KG Y IVKNSWGP WG +GY ++R + +CG+
Sbjct: 269 STSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERG----KNMCGL 324
Query: 341 NKMASIPL 348
AS P+
Sbjct: 325 AACASFPI 332
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 235 bits (599), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 184/309 (59%), Gaps = 20/309 (6%)
Query: 51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKN 106
W + HG+ Y EE R ++++N+K I+ N+E + + + +N F DM++EEF+
Sbjct: 32 WKATHGRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQ 90
Query: 107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
G + Q + + F V +PKSVDWR+KG VT VKNQG CGSCWAFS A+
Sbjct: 91 VMNGFQNQ---KHKKGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGAL 147
Query: 167 EGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
EG +G L SLSEQ L+DC N GCNGGLMD AF+Y+ +GGL EE YPYL
Sbjct: 148 EGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGR 207
Query: 226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
E K E +G+ D+P+ E++L+KA+A P+SVAI+A + FQFY G++
Sbjct: 208 ETNSCTYKPECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYY 266
Query: 285 GP-CGA-ELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
P C + +LDHGV VGYG S S + IVKNSWGP+WG GY++M ++ C
Sbjct: 267 DPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNH---C 323
Query: 339 GINKMASIP 347
GI+ AS P
Sbjct: 324 GISTAASYP 332
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 233 bits (595), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 185/312 (59%), Gaps = 20/312 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
+ W + H + Y EE+ R ++++N K ID N+E + + + +N F DM++EE
Sbjct: 29 WHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEE 87
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
F+ G + Q + + F + +PKSVDW KKG VTPVKNQG CGSCWAFS
Sbjct: 88 FRQVMNGFQNQ---KHKKGKLFHEPLLVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSAT 144
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTS-FNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
A+EG +G L SLSEQ L+DC + N GCNGGLMD AF+YI +GGL EE YPY
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPY 204
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
L + + K E +G+ D+P+ E++L+KA+A P+SVAI+A T FQFY G
Sbjct: 205 LATDTNSCNYKPECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHTSFQFYKSG 263
Query: 282 VFTGP-CGA-ELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
++ P C + +LDHGV VGYG S + + IVKNSWGP+WG GY++M ++
Sbjct: 264 IYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQ---N 320
Query: 336 GLCGINKMASIP 347
CGI AS P
Sbjct: 321 NHCGIATAASYP 332
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 232 bits (591), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 192/329 (58%), Gaps = 21/329 (6%)
Query: 31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT--- 87
+G + LT L + W + H + Y EE R ++++N+K I+ N+E +
Sbjct: 12 LGIASATLTFNHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGK 70
Query: 88 -SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVT 146
S+ + +N F DM+ EEF+ G + + P + + E + + P+SVDWR+KG VT
Sbjct: 71 HSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA---PRSVDWREKGYVT 127
Query: 147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAF 205
PVKNQG CGSCWAFS A+EG +G L SLSEQ L+DC N GCNGGLMDYAF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEGCNGGLMDYAF 187
Query: 206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPV 264
+Y+ +GGL EE YPY E +C+ E V +G+ D+P+ E++L+KA+A P+
Sbjct: 188 QYVADNGGLDSEESYPYEATEESCK-YNPEYSVANDTGFVDIPK-QEKALMKAVATVGPI 245
Query: 265 SVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPK 318
SVAI+A F FY G++ P C +E +DHGV VGYG +S S Y +VKNSWG +
Sbjct: 246 SVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNSKYWLVKNSWGEE 305
Query: 319 WGERGYIRMKRNTGKPEGLCGINKMASIP 347
WG GYI+M ++ CGI AS P
Sbjct: 306 WGMGGYIKMAKDR---RNHCGIASAASYP 331
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 232 bits (591), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 186/312 (59%), Gaps = 20/312 (6%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEFADMSHE 102
+E + K+G+ Y EE +R IF++N K+I++ NK EVT + L +N+F DM+ E
Sbjct: 20 WEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVT-FNLAMNKFGDMTLE 78
Query: 103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS--VDWRKKGAVTPVKNQGSCGSCWAF 160
EF +K P R P + F Y + P++ VDWR KGAVTPVK+QG CGSCWAF
Sbjct: 79 EFNAV---MKGNIPRRSAPVSVF-YPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAF 134
Query: 161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFN-NGCNGGLMDYAFKYIVASGGLHKEED 219
ST ++EG + + +G+L SL+EQ+L+DC + GCNGG M+ AF YI A+ G+ E
Sbjct: 135 STTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAA 194
Query: 220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFY 278
YPY +G+C + T SG+ ++ E L +A+ P+SV I+A+ + FQFY
Sbjct: 195 YPYEARDGSCRFDSNSV-AATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFY 253
Query: 279 SGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
S GV+ P C LDH V AVGYG G D+ +VKNSW WG+ GYI+M RN
Sbjct: 254 SSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNN-- 311
Query: 337 LCGINKMASIPL 348
CGI +AS PL
Sbjct: 312 -CGIATVASYPL 322
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 232 bits (591), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 187/308 (60%), Gaps = 14/308 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
+E W H K Y +++ R I+++NLK+I N E V +Y L +N DM++EE
Sbjct: 26 WELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTNEE 85
Query: 104 FKNKYLGLK-PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
K GLK P +R + + +A P SVD+RKKG VTPVKNQG CGSCWAFS+
Sbjct: 86 VVQKMTGLKVPASHSRSNDTLYIPDWEGRA-PDSVDYRKKGYVTPVKNQGQCGSCWAFSS 144
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
V A+EG + +G L +LS Q L+DC S N+GC GG M AF+Y+ + G+ E+ YPY
Sbjct: 145 VGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPY 203
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
+ +E +C + GY+++PE +E++L +A+A PVSVAI+AS T FQFYS G
Sbjct: 204 VGQEESCM-YNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKG 262
Query: 282 VFTG-PCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
V+ C ++ L+H V AVGYG KG+ + I+KNSWG WG +GYI M RN CG
Sbjct: 263 VYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA---CG 319
Query: 340 INKMASIP 347
I +AS P
Sbjct: 320 IANLASFP 327
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 232 bits (591), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 187/308 (60%), Gaps = 14/308 (4%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
+E W H K Y +++ R I+++NLK+I N E V +Y L +N DM++EE
Sbjct: 26 WELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTNEE 85
Query: 104 FKNKYLGLK-PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
K GLK P +R + + +A P SVD+RKKG VTPVKNQG CGSCWAFS+
Sbjct: 86 VVQKMTGLKVPASHSRSNDTLYIPDWEGRA-PDSVDYRKKGYVTPVKNQGQCGSCWAFSS 144
Query: 163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
V A+EG + +G L +LS Q L+DC S N+GC GG M AF+Y+ + G+ E+ YPY
Sbjct: 145 VGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPY 203
Query: 223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
+ +E +C + GY+++PE +E++L +A+A PVSVAI+AS T FQFYS G
Sbjct: 204 VGQEESCM-YNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKG 262
Query: 282 VFTG-PCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
V+ C ++ L+H V AVGYG KG+ + I+KNSWG WG +GYI M RN CG
Sbjct: 263 VYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA---CG 319
Query: 340 INKMASIP 347
I +AS P
Sbjct: 320 IANLASFP 327
>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329
Score = 231 bits (590), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 181/307 (58%), Gaps = 12/307 (3%)
Query: 48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
+E W H K Y +++ R I+++NLK+I N E V +Y L +N DM+ EE
Sbjct: 26 WELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEE 85
Query: 104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
K GLK R + P SVD+RKKG VTPVKNQG CGSCWAFS+V
Sbjct: 86 VVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSV 145
Query: 164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
A+EG + +G L +LS Q L+DC S N+GC GG M AF+Y+ + G+ E+ YPY+
Sbjct: 146 GALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGV 282
+E +C + GY+++PE +E++L +A+A PVSVAI+AS T FQFYS GV
Sbjct: 205 GQEESCM-YNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGV 263
Query: 283 FTG-PCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
+ C ++ L+H V AVGYG KG+ + I+KNSWG WG +GYI M RN CGI
Sbjct: 264 YYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA---CGI 320
Query: 341 NKMASIP 347
+AS P
Sbjct: 321 ANLASFP 327
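The percentages on the "Identities / Positives / Gaps" lines can be cross-checked against their fractions. The following is a minimal Python sketch, not part of BLAST itself: the helper name `parse_hsp_summary` is hypothetical, and the assumption that percentages are truncated rather than rounded is inferred from the figures in this report (for example, 134/307 is shown as 43%).

```python
import re

# Matches "Label = count/length (pct%)" fields in an HSP summary line.
_FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_summary(line):
    """Return {label: (count, alignment_length, reported_percent)}."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in _FIELD.finditer(line)
    }


# Summary line copied from the CATK_HUMAN hit above.
summary = "Identities = 134/307 (43%), Positives = 181/307 (58%), Gaps = 12/307 (3%)"
fields = parse_hsp_summary(summary)

for label, (count, length, reported) in fields.items():
    # Assumption: the report truncates, so use integer division.
    computed = 100 * count // length
    print(f"{label}: {count}/{length} -> {computed}% (reported {reported}%)")
```

The same check holds for the two 308-column Cathepsin K alignments (136/308, 187/308, 14/308), which suggests truncation is applied consistently in this report.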
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.134 0.406
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 137,725,183
Number of Sequences: 539616
Number of extensions: 6055596
Number of successful extensions: 16485
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 222
Number of HSP's successfully gapped in prelim test: 34
Number of HSP's that attempted gapping in prelim test: 15491
Number of HSP's gapped (non-prelim): 292
length of query: 350
length of database: 191,569,459
effective HSP length: 118
effective length of query: 232
effective length of database: 127,894,771
effective search space: 29671586872
effective search space used: 29671586872
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)
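The length and score figures in this footer are mutually consistent and can be re-derived by hand. The sketch below assumes the standard Karlin-Altschul conversion, bits = (lambda * S - ln K) / ln 2 for score thresholds and bits = lambda * X / ln 2 for the X-drop values. It also assumes that the Lambda/K pair 0.316/0.134 is the ungapped set and 0.267/0.0410 the gapped set, which is consistent with the bit values printed above. All variable and function names are illustrative.

```python
import math

# Values copied from the report footer.
query_len = 350
db_letters = 191_569_459
db_seqs = 539_616
hsp_len = 118  # "effective HSP length"

# Assumption: first Lambda/K pair is ungapped, second is gapped.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.316, 0.134
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410

# Effective lengths: subtract the HSP length once per sequence.
eff_query = query_len - hsp_len
eff_db = db_letters - db_seqs * hsp_len
search_space = eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def xdrop_bits(raw, lam):
    """Convert an X-drop value to bits: lambda * X / ln 2."""
    return lam * raw / math.log(2)


print("effective query length:", eff_query)
print("effective database length:", eff_db)
print("effective search space:", search_space)
print("X1 bits:", round(xdrop_bits(16, LAMBDA_UNGAPPED), 1))
print("X2 bits:", round(xdrop_bits(38, LAMBDA_GAPPED), 1))
print("X3 bits:", round(xdrop_bits(64, LAMBDA_GAPPED), 1))
print("S1 bits:", round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
print("S2 bits:", round(bit_score(62, LAMBDA_GAPPED, K_GAPPED), 1))
```

Applying `bit_score` with the gapped parameters to a raw score of 591 gives roughly 232 bits, in line with the Cathepsin K hits. Exact agreement with every reported bit score and E-value should not be expected, since Lambda and K are printed to only three significant figures and the hits use compositional matrix adjustment.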