BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018968
(348 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 315 bits (806), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 215/314 (68%), Gaps = 9/314 (2%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T+ ++E+ E WM++H ++YK EK RF++F+ENL +I++ N E N +Y LG N F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR K AV P+KDQ +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFS VAAVEGI +I+ NL LSEQ+L+DC T N+GC GG M+ AF+YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 ATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
ED+YPY +G C ++ IS YE+VP D+++L+KA++ QPVS+ I A +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD--- 333
F+ YK G+FNG CGT LDH V VG+G+++ G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337
Query: 334 -EGLCGIGTQSSYP 346
EGLCGI +SYP
Sbjct: 338 PEGLCGINKMASYP 351
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 312 bits (800), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 156/348 (44%), Positives = 221/348 (63%), Gaps = 19/348 (5%)
Query: 12 KINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHG--RSYKDELEKEMRFK 69
K+ I +F ++IL +C E+ + ++++W + H RS E+E RF
Sbjct: 3 KLLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFN 59
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-----YKMPSPSHRSTTSS 124
+F+ N+ ++ NK+ NR+YKL N+F+DLT +EF+ YTG ++M R +
Sbjct: 60 VFRHNVMHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQF 118
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
+ ++NLS +P+S+DWR K AVT IK+Q +CG CWAFS VAAVEGI KI L+ LS
Sbjct: 119 MYDHENLSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLS 176
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKI 243
EQ+LVDC T N GC GG ME AFE+I +N GI TED YPY+ + G C A++ I
Sbjct: 177 EQELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTI 236
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+E+VP DE ALLKAV+ QPVS+ I A +++F+ Y EG+F G CGT+L+H V VG+G
Sbjct: 237 DGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYG 296
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+E G YW+++NSWG WG+ GY+KI R+ EG CGI ++SYP+
Sbjct: 297 -SERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPI 343
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 312 bits (800), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 158/350 (45%), Positives = 226/350 (64%), Gaps = 25/350 (7%)
Query: 18 MFIIIILLVSCASQV----VSSRSTH---------EQSVVEMHEKWMAQHGR--SYKDEL 62
M I+ + +V+ +S V +S H E V+ ++E W+ +HG+ S +
Sbjct: 8 MAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLV 67
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTT 122
EK+ RF+IFK+NL ++++ N E N +Y+LG RF+DLTNDE+R+ Y G KM R T+
Sbjct: 68 EKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 126
Query: 123 SSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQ 182
+Y+ ++P S+DWR K AV +KDQ CG CWAFS + AVEGI +I +LI
Sbjct: 127 ---LRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183
Query: 183 LSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAA 241
LSEQ+LVDC T+ N GC GG M+ AFE+II+N GI T+ +YPY+ V GTC +K A
Sbjct: 184 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVV 243
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVG 301
I +YE+VP+ E++L KAV+ QP+SI I A F+ Y GIF+G CGTQLDH V VG
Sbjct: 244 TIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVG 303
Query: 302 FGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+G TE+G +YW+++NSWG +WG++GY+++ R+ G CGI + SYP+
Sbjct: 304 YG-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPI 352
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 311 bits (796), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 157/341 (46%), Positives = 211/341 (61%), Gaps = 14/341 (4%)
Query: 19 FIIIILLVSCASQVVSSRSTH------EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFK 72
+ ++L S V +S H E+S+ +++E+W + H S + EK RF +FK
Sbjct: 6 LLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFNVFK 64
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNL 131
NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ P R T +
Sbjct: 65 ANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYE 123
Query: 132 SMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDC 191
+ VP S+DWR K AVT +KDQ +CG CWAFS V AVEGI +I L+ LSEQ+LVDC
Sbjct: 124 KVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDC 183
Query: 192 STNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVP 250
N GC GG ME AFE+I Q GI TE YPY+A +GTC A++ A I +E VP
Sbjct: 184 DKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVP 243
Query: 251 SGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+ DE ALLKAV+ QPVS+ I A ++F+ Y EG+F G C T L+H V IVG+GTT DG N
Sbjct: 244 ANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTN 303
Query: 311 YWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
YW+++NSWG WG+ GY+++ R+ EGLCGI SYP+
Sbjct: 304 YWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 306 bits (785), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/312 (49%), Positives = 203/312 (65%), Gaps = 16/312 (5%)
Query: 46 MHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFR 105
++E+W + H S + EK+ RF +FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 37 LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94
Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCC 160
Y+G K+ HR + TF Y+ + VP S+DWR K AVT +KDQ +CG C
Sbjct: 95 NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150
Query: 161 WAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATE 220
WAFS + AVEGI +I L+ LSEQ+LVDC T+ N GC GG M+ AFE+I Q GI TE
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210
Query: 221 DEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKS 279
YPY+A GTC +++ A A I +E VP DE ALLKAV+ QPVS+ I A ++F+
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQF 270
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEG 335
Y EG+F G CGT+LDH V IVG+GTT DG YW +KNSWG WG+ GY+++ R EG
Sbjct: 271 YSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEG 330
Query: 336 LCGIGTQSSYPL 347
LCGI ++SYP+
Sbjct: 331 LCGIAMEASYPI 342
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 305 bits (781), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 202/317 (63%), Gaps = 11/317 (3%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ +++E+W + H R + EK RF FK N +I NK G+ Y+L NRF D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
EFRA + G + +PS + + F Y L+++D+P S+DWR K AVT +KDQ +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS V +VEGI I +L+ LSEQ+L+DC T N+GC GG M+ AFEYI N G+
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216
Query: 219 TEDEYPYQAVQGTCSAAQKA----AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
TE YPY+A +GTC+ A+ A I +++VP+ E+ L +AV+ QPVS+ + A
Sbjct: 217 TEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASG 276
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE 334
F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 277 KAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDS 336
Query: 335 ----GLCGIGTQSSYPL 347
GLCGI ++SYP+
Sbjct: 337 GASGGLCGIAMEASYPV 353
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 304 bits (779), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 198/316 (62%), Gaps = 9/316 (2%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+++ +++E+W + H R + EK RF FK N +I NK G+ Y+L NRF D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
EFRA + G + + F Y L+++D+P S+DWR K AVT +KDQ +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIAT 219
CWAFS V +VEGI I +L+ LSEQ+L+DC T N+GC GG M+ AFEYI N G+ T
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLIT 217
Query: 220 EDEYPYQAVQGTCSAAQKA----AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
E YPY+A +GTC+ A+ A I +++VP+ E+ L +AV+ QPVS+ + A
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE- 334
F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 335 ---GLCGIGTQSSYPL 347
GLCGI ++SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 302 bits (774), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 203/318 (63%), Gaps = 16/318 (5%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDL 99
E+S+ +++E+W + H S + EK RF +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR K AVT +KDQ
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
+CG CWAFS + AVEGI +I L+ LSEQ+LVDC N GC GG ME AFE+I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGIATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
GI TE YPY A +GTC ++ A I +E VP DE ALLKAV+ QPVS+ I A
Sbjct: 207 GGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD 333
++F+ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 334 ----EGLCGIGTQSSYPL 347
EGLCGI +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 302 bits (774), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 151/353 (42%), Positives = 220/353 (62%), Gaps = 26/353 (7%)
Query: 18 MFIIIILLVSCAS----QVVSSRSTHE--------QSVVE-----MHEKWMAQHGRSYKD 60
+F++ +++ SCA+ VVSS H Q + + M E WM +HG+ Y
Sbjct: 10 IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69
Query: 61 ELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRS 120
EKE R IF++NL +I N E N +Y+LG NRF+DL+ E+ + G P +
Sbjct: 70 VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANL 180
+S+ +Y+ +P S+DWR++ AVT +KDQ C CWAFS V AVEG+ KI L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188
Query: 181 IQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA-- 238
+ LSEQ L++C+ NNGCGGG +E A+E+I+ N G+ T+++YPY+A+ G C K
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247
Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVT 298
I YE +P+ DE AL+KAV+ QPV+ + + + EF+ Y+ G+F+G CGT L+H V
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307
Query: 299 IVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+VG+G TE+G +YW++KNS GDTWG+AGYMK+ R+ GLCGI ++SYPL
Sbjct: 308 VVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 302 bits (773), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 156/348 (44%), Positives = 220/348 (63%), Gaps = 25/348 (7%)
Query: 18 MFIIIILLVSCASQVVSSRSTH--------EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
M + I+L+S S + +S+ E++V +++E+W H S + E RF
Sbjct: 1 MKLFFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKRFN 59
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-----YKMPSPSHRSTTSS 124
+F+ N+ ++ + NK+ N+ YKL NRF+D+T+ EFR+ Y G ++M R S
Sbjct: 60 VFRHNVLHVHRTNKK-NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRG--SG 116
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
F Y+N+ T VP+S+DWR+K AVT +K+QQ+CG CWAFS VAAVEGI KI L+ LS
Sbjct: 117 GFMYENV--TRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLS 174
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQA--VQGTCSAAQKAAAAK 242
EQ+LVDC T N GC GG ME AFE+I N GI TE+ YPY + VQ + +
Sbjct: 175 EQELVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVT 234
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGF 302
I +E VP DE+ LLKAV+ QPVS+ I A +++F+ Y EG+F G CGTQL+H V IVG+
Sbjct: 235 IDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGY 294
Query: 303 GTTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
G T++G YW+++NSWG WG+ GY++I R +EG CGI ++SYP
Sbjct: 295 GETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP 342
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 300 bits (768), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 218/348 (62%), Gaps = 22/348 (6%)
Query: 16 IPMFIIIILLVSCASQVVSSRSTH------EQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+ FI++ L + + H E S+ E++E+W + H + E EK RF
Sbjct: 1 MKRFIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFN 59
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTG-----YKMPSPSHRSTTSS 124
+FK N+++I + NK+ +++YKL N+F D+T++EFR Y G ++M ++T S
Sbjct: 60 VFKHNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS- 117
Query: 125 TFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
F Y N++ +PTS+DWR AVTP+K+Q +CG CWAFS V AVEGI +I L LS
Sbjct: 118 -FMYANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLS 174
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKI 243
EQ+LVDC TN N GC GG M+ AFE+I + G+ +E YPY+A TC + + A I
Sbjct: 175 EQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSI 234
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+E+VP E L+KAV+ QPVS+ I A ++F+ Y EG+F G CGT+L+H V +VG+G
Sbjct: 235 DGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYG 294
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
TT DG YW++KNSWG+ WG+ GY+++ R EGLCGI ++SYPL
Sbjct: 295 TTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 299 bits (765), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 208/335 (62%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + V M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+YK+G N+F+DLT++EFR+ Y G+ S S+++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+DC T
Sbjct: 129 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG + F++II N GI TE+ YPY A G C+ Q I YE VP +E
Sbjct: 189 RGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 308 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 298 bits (762), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 217/348 (62%), Gaps = 23/348 (6%)
Query: 17 PMFIIIILL----VSCASQVVSSRS--THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKI 70
P FI + L+ +S A + + E S+ ++EKW H + +D EK RF +
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
FKEN+++I + N++ + YKL N+F D+TN EFR+ Y G K+ HRS + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120
Query: 126 FKYQNLSMTDVPT-SLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLS 184
F Y+N+ +P S+DWR K AVT +KDQ +CG CWAFS +A+VEGI +I L+ LS
Sbjct: 121 FMYENVG--SLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178
Query: 185 EQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTC-SAAQKAAAAKI 243
EQ+LVDC T+ N GC GG M+ AFE+I Q GI TED YPY GTC S + I
Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVSI 237
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
+++VP+ +E AL++AV+ QP+S+ I A F+ Y EG+F G CGT+LDH V IVG+G
Sbjct: 238 DGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYG 297
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILR----DEGLCGIGTQSSYPL 347
T DG YW++KNSWG+ WG++GY+++ R G CGI ++SYP+
Sbjct: 298 ATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 297 bits (761), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 154/335 (45%), Positives = 207/335 (61%), Gaps = 10/335 (2%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F +L++S A + V M+E W+ ++G+SY E E RF+IFKE L +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I++ N + NR+YK+G N+F+DLT++EFR+ Y + S S+++ S+ +Y+ +P
Sbjct: 73 IDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRVGQVLP 128
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-TNGN 196
+ +DWR AV IK Q ECG CWAFSA+A VEGI KI LI LSEQ+L+DC T
Sbjct: 129 SYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNT 188
Query: 197 NGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQ 255
GC GG + F++II N GI TE+ YPY A G C+ Q I YE VP +E
Sbjct: 189 RGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEW 248
Query: 256 ALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL AV+ QPVS+ + A FK Y GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYG-TEGGIDYWIVK 307
Query: 316 NSWGDTWGDAGYMKILRD---EGLCGIGTQSSYPL 347
NSW TWG+ GYM+ILR+ G CGI T SYP+
Sbjct: 308 NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 295 bits (755), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 208/333 (62%), Gaps = 10/333 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L V AS +S +++ E+WMA++GR YKD EK +RF+IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG N+F+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
S+DWRD AVT +K+Q CG CWAF+++A VE I KI NL+ LSEQQ++DC+ +
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV--SY 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG + KA+ +II N+G+A+ YPY+A +GTC +A I+ Y V +E+ +
Sbjct: 183 GCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNM 242
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+ AVS QP++ + A + F+ YK G+F G CGT+L+HA+ I+G+G G +W+++NS
Sbjct: 243 MYAVSNQPIAAALDA-SGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNS 301
Query: 318 WGDTWGDAGYMKILRDE----GLCGIGTQSSYP 346
WG WG+ GY+++ RD GLCGI YP
Sbjct: 302 WGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 295 bits (755), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 210/333 (63%), Gaps = 10/333 (3%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
+F+ + L AS +SR +++ E+WMA++GR YKD+ EK RF+IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 78 IEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG N+F+D+T EF A YTG +P R S + +++++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124
Query: 138 TSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNN 197
S+DWRD AV +K+Q CG CW+F+A+A VEGI KI L+ LSEQ+++DC+ +
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--SY 182
Query: 198 GCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQAL 257
GC GG + KA+++II N G+ TE+ YPY A QGTC+A +A I+ Y V DE+++
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSM 242
Query: 258 LKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+ AVS QP++ I A + F+ Y G+F+G CGT L+HA+TI+G+G G YW+++NS
Sbjct: 243 MYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNS 301
Query: 318 WGDTWGDAGYMKILR----DEGLCGIGTQSSYP 346
WG +WG+ GY+++ R G+CGI +P
Sbjct: 302 WGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 295 bits (754), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 149/348 (42%), Positives = 220/348 (63%), Gaps = 21/348 (6%)
Query: 18 MFIIIILLV--SCASQVVSSRSTHE-----QSVVE-----MHEKWMAQHGRSYKDELEKE 65
M I+++ +V SCA+ + S +++ SV + + E WM +HG+ Y EKE
Sbjct: 8 MLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEKE 67
Query: 66 MRFKIFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSST 125
R IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +S+
Sbjct: 68 RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTSS 126
Query: 126 FKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSE 185
+Y+ + +P S+DWR++ AVT +KDQ C CWAFS V AVEG+ KI L+ LSE
Sbjct: 127 DRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSE 186
Query: 186 QQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA--AAAKI 243
Q L++C+ NNGCGGG +E A+E+I++N G+ T+++YPY+AV G C K I
Sbjct: 187 QDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMI 245
Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFG 303
YE +P+ DE AL+KAV+ QPV+ I + + EF+ Y+ G+F+G CGT L+H V +VG+G
Sbjct: 246 DGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYG 305
Query: 304 TTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
TE+G +YWL+KNS G TWG+AGYMK+ R+ GLCGI ++SYPL
Sbjct: 306 -TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 292 bits (748), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R+ F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAV 148
++LG NRF+DLTN+E+R Y G + R + N ++ P S+DWR K AV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAF 208
IKDQ CG CWAFSA+AAVEGI +I +LI LSEQ+LVDC T+ N GC GG M+ AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 EYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
++II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261
Query: 268 IGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGY 327
+ I A F+ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKILRD----EGLCGIGTQSSYPL 347
+++ R+ G CGI + SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 292 bits (747), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 209/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V ++ +W A+HG++ + +++ RF IFK+NL +I+ N++ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101
Query: 95 RFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKKAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR K AV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYI 211
KDQ CG CWAFS AAVEGI KI LI LSEQ+LVDC + N GC GG M+ AF++I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGI 270
++N G+ TE +YPY+ G C++ K + I YE+VP+ DE AL KA+S QPVS+ I
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 271 AAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKI 330
A F+ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 LRD-----EGLCGIGTQSSYPL 347
R+ G CGI ++SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 290 bits (742), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 215/315 (68%), Gaps = 11/315 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
+H++ ++E+ E W++ ++Y+ EK +RF++FK+NL++I++ NK+G ++Y LG N F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR K AV +K+Q
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQG 216
CG CWAFS VAAVEGI KI NL LSEQ+L+DC T NNGC GG M+ AFEYI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IATEDEYPYQAVQGTCSAAQ-KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTT 275
+ E++YPY +GTC + ++ I+ +++VP+ DE++LLKA++ QP+S+ I A
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278
Query: 276 EFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-- 333
EF+ Y G+F+G CG LDH V VG+G+++ G++Y ++KNSWG WG+ GY+++ R+
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTG 337
Query: 334 --EGLCGIGTQSSYP 346
EGLCGI +S+P
Sbjct: 338 KPEGLCGINKMASFP 352
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 289 bits (740), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 210/310 (67%), Gaps = 15/310 (4%)
Query: 47 HEKWMAQHGRSYKDEL--EKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTND 102
++ W+A++G + L E E RF +F +NL++++ N + ++LG NRF+DLTN+
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWA 162
EFRA + G K+ + RS + +Y++ + ++P S+DWR+K AV P+K+Q +CG CWA
Sbjct: 112 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 163 FSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATED 221
FSAV+ VE I ++ +I LSEQ+LV+CSTNG N+GC GG M+ AF++II+N GI TED
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 222 EYPYQAVQGTCSA-AQKAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSY 280
+YPY+AV G C + A I +E+VP DE++L KAV+ QPVS+ I A EF+ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGL 336
G+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346
Query: 337 CGIGTQSSYP 346
CGI +SYP
Sbjct: 347 CGIAMMASYP 356
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 287 bits (735), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 202/316 (63%), Gaps = 12/316 (3%)
Query: 39 HEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFSD 98
+E V M+E+W+ ++ ++Y EKE RFKIFK+NL+++++ N +RT+++G RF+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR AV +KDQ CG
Sbjct: 96 LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGI 217
CWAFSAV AVEGI +I+ LI LSEQ+LVDC N GC GG M AFE+I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 218 ATEDEYPYQAVQ-GTCSAAQ--KAAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
T+ +YPY A G C+A + I YE+VP DE++L KAV+ QPVS+ I A +
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ YK G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331
Query: 334 ---EGLCGIGTQSSYP 346
G CGI SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 286 bits (733), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 204/317 (64%), Gaps = 15/317 (4%)
Query: 44 VEMHEKWMAQHGRSYKDEL----EKEMRFKIFKENLEYIEKANKEG-NRTYKLGTNRFSD 98
+ ++ +W +HG+S + +++ RF IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKKAVTPIKDQQ 155
LTNDE+R+LY G + P R T + KY +++ +VP ++DWR K AV IKDQ
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS AAVEGI KI L+ LSEQ+LVDC + N GC GG M+ AF++I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYT 274
G+ TE +YPY G C++ K + I YE+VPS DE AL +AVS QPVS+ I A
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 275 TEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F+ Y+ GIF G CGT +DHAV VG+G +E+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298
Query: 334 ---EGLCGIGTQSSYPL 347
G CGI ++SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 274 bits (701), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 138/295 (46%), Positives = 192/295 (65%), Gaps = 14/295 (4%)
Query: 63 EKEMRFKIFKENLEYIEKANKEGNRT--YKLGTNRFSDLTNDEFRALYTGYKMPSPSHRS 120
E E RF++F +NL++++ N + ++LG NRF+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKKAVT-PIKDQQECGCCWAFSAVAAVEGITKISGAN 179
+ Y++ + +P S+DWRDK AV P+K+Q +CG CWAFSAVAAVEGI KI
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLVDCSTNG-NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKA 238
L+ LSEQ+LV+C+ NG N+GC GG M+ AF +I +N G+ TE++YPY A+ G C+ A+++
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAV 297
I +E+VP DE +L KAV+ QPVS+ I A EF+ Y G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 298 TIVGFGT-TEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
VG+GT GA YW ++NSWG WG+ GY+++ R+ G CGI +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 274 bits (700), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 199/318 (62%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ + WM +H + Y+ EK RF+IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR K AVTP+K+Q
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 QECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQN 214
CG CWAFS +A VEGI KI NL++LSEQ+LVDC + + GC GG + +Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211
Query: 215 QGIATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAY 273
G+ T YPYQA Q C A K KI+ Y+ VPS E + L A++ QP+S+ + A
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271
Query: 274 TTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR- 332
F+ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTQSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 270 bits (691), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 190/316 (60%), Gaps = 11/316 (3%)
Query: 43 VVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDL 99
V+E + +H ++Y+DE E+ R KIF EN I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKKAVTPIKDQQE 156
+ EFR L G+ +FK + + + +P S+DWR K AVT +KDQ
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 157 CGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQ 215
CG CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 216 GIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYT 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+ I A
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 294
Query: 275 TEFKSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR 332
F+ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K+LR
Sbjct: 295 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 354
Query: 333 D-EGLCGIGTQSSYPL 347
+ E CGI + SSYPL
Sbjct: 355 NKENQCGIASASSYPL 370
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 261 bits (668), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 143/351 (40%), Positives = 208/351 (59%), Gaps = 21/351 (5%)
Query: 10 SFKINTIPMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
SF+ + ++++ +S + +E V+ M+E+W+ ++G++Y EKE RFK
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKKAVTP-IKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQL 188
+P +DWR++ AV P +K Q ECG CWAF+A AVEGI +I+ L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 VDCST-NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAK----- 242
+DC N N GC GG AFE+I +N GI +++ Y Y G +AA KA K
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGY---TGEDTAACKAIEMKTTRVV 237
Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQL-DHAVTIV 300
I+ +E VP DE +L KAV+ QP+S+ I+A YK G++ G C DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295
Query: 301 GFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
G+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C + YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 258 bits (658), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 185/312 (59%), Gaps = 13/312 (4%)
Query: 48 EKWMA---QHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSDLTN 101
E+W QH ++Y +E+E+ R KIF EN I K N+ +G +YKLG N+++D+ +
Sbjct: 26 EEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLH 85
Query: 102 DEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGC 159
EF+ GY + T Y + VP S+DWR+ AVT +KDQ CG
Sbjct: 86 HEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGS 145
Query: 160 CWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIA 218
CWAFS+ A+EG L+ LSEQ LVDCST GNNGC GG M+ AF YI N GI
Sbjct: 146 CWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGID 205
Query: 219 TEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEF 277
TE YPY+ + +C + A + + ++P GDE+ + KAV +M PVS+ I A F
Sbjct: 206 TEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESF 265
Query: 278 KSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE- 334
+ Y EG++N C Q LDH V +VG+GT E G +YWL+KNSWG TWG+ GY+K+ R++
Sbjct: 266 QLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARNQN 325
Query: 335 GLCGIGTQSSYP 346
CGI T SSYP
Sbjct: 326 NQCGIATASSYP 337
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 249 bits (636), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 194/315 (61%), Gaps = 15/315 (4%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ E WM +H + YK+ EK RF+IFK+NL+YI++ NK+ N +Y LG N F+
Sbjct: 39 TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
D++NDEF+ YTG + ++ +T S + N ++P +DWR K AVTP+K+Q C
Sbjct: 98 DMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSC 155
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFSAV +EGI KI NL + SEQ+L+DC + GC GG A + + Q GI
Sbjct: 156 GSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQ-YGI 213
Query: 218 ATEDEYPYQAVQGTCSAAQKAA-AAKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+ YPY+ VQ C + +K AAK +V +E ALL +++ QPVS+ + A +
Sbjct: 214 HYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKD 273
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ Y+ GIF G CG ++DHAV VG+ G NY LIKNSWG WG+ GY++I R
Sbjct: 274 FQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIKNSWGTGWGENGYIRIKRGTGN 328
Query: 333 DEGLCGIGTQSSYPL 347
G+CG+ T S YP+
Sbjct: 329 SYGVCGLYTSSFYPV 343
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 248 bits (634), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 189/314 (60%), Gaps = 12/314 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ WM H + Y++ EK RF+IFK+NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+NDEF Y G + + +S ++ N ++P ++DWR K AVTP++ Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
G CWAFSAVA VEGI KI L++LSEQ+LVDC ++GC GG A EY+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212
Query: 218 ATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+YPY+A QGTC A Q K S V +E LL A++ QPVS+ + +
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F+ YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331
Query: 333 DEGLCGIGTQSSYP 346
G+CG+ S YP
Sbjct: 332 SPGVCGLYKSSYYP 345
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 247 bits (630), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 190/315 (60%), Gaps = 12/315 (3%)
Query: 38 THEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKEGNRTYKLGTNRFS 97
T + ++++ WM +H ++YK+ EK RF+IFK+NL+YI++ NK N Y LG N FS
Sbjct: 39 TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQEC 157
DL+NDEF+ Y G P + ++ N + D+P S+DWR K AVTP+K Q C
Sbjct: 98 DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154
Query: 158 GCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNGNNGCGGGTMEKAFEYIIQNQGI 217
CWAFS VA VEGI KI NL++LSEQ+LVDC + GC G + +Y+ QN GI
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN-GI 212
Query: 218 ATEDEYPYQAVQGTCSAAQKAAA-AKISNYEEVPSGDEQALLKAVSMQPVSIGIAAYTTE 276
+YPY A Q TC A Q K + V S +E +LL A++ QPVS+ + + +
Sbjct: 213 HLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRD 272
Query: 277 FKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR---- 332
F++YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGN 331
Query: 333 DEGLCGIGTQSSYPL 347
G+CG+ S YP+
Sbjct: 332 SPGVCGVYRSSYYPI 346
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 243 bits (620), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 210/344 (61%), Gaps = 13/344 (3%)
Query: 11 FKINTIPMFIIIILLVSCASQ-VVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFK 69
+++ +F +I+L +S S V S ++ S ++ WM + ++Y + E R++
Sbjct: 1 MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTHK-EFMPRYE 55
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
FK+N++Y+ N +G++T LG N+ +DL+N+E+R Y G + + +
Sbjct: 56 EFKKNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRL 114
Query: 130 NLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
N P ++DWR+K AVTP+KDQ +CG C++FS +VEG+T I L+ LSEQ ++
Sbjct: 115 NRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174
Query: 190 DCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQ-AVQGTCSAAQKAAAAKISNYE 247
DCS++ GN GC GG M AFEYII+N G+ +E++YPY+ V C + + AAKI++Y+
Sbjct: 175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYK 234
Query: 248 EVPSGDEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTT 305
E+ +GDE L A+ + PVS+ I A F+ Y G+ + C ++ LDH V VG G T
Sbjct: 235 EIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMG-T 293
Query: 306 EDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPLA 348
++G +Y+++KNSWG +WG GY+ + R+ + CGI T +SYP+A
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASYPIA 337
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 243 bits (619), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 198/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ L+ C+S + H ++ H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MNWLVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
+ N E G +Y+LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 115
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M +AF+YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E+AL +AV+ + PVS+GI A + F YK G+ ++ C ++H V +VG+G DG
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI SYP
Sbjct: 292 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 242 bits (617), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 131/306 (42%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAV 166
++P S ++ T +++ S +P ++DWR+K VT +K Q CG CWAFSAV
Sbjct: 99 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKISGANLIQLSEQQLVDCSTN---GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A+EG K+ LI LS Q LVDCS GN GCGGG M +AF+YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+GI A + F YK
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TQSSYP 346
+ SYP
Sbjct: 333 SYCSYP 338
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 238 bits (607), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 192/338 (56%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ LL C+ V H+ ++ H W + + YK+E E+ R I+++NL+
Sbjct: 1 MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T +E +L ++PS R+ T Y++ S
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAATCSKYTELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E AL +AV+ + PVS+ I A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 292 DYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 237 bits (604), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 197/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
+P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPS 251
GN GC GG M AF+YII N+GI ++ YPY+A+ C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+G+ A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 236 bits (601), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 184/307 (59%), Gaps = 16/307 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
+ + Q+GR Y D E+ R ++F++N + IE NK+ G T+K+ N+F D+TN+EF
Sbjct: 21 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A+ GYK S R + F + M +DWR K VTP+KDQ++CG CWAFS
Sbjct: 81 NAVMKGYKKGS---RGEPKAVFTAEAGPMA---ADVDWRTKALVTPVKDQEQCGSCWAFS 134
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
A A+EG + L+ LSEQQLVDCST+ GN+GCGGG M AF+YI N GI TE Y
Sbjct: 135 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 194
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKE 282
PY+A +C + A + EV E+AL +AVS + P+S+ I A F+ Y
Sbjct: 195 PYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSS 253
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGI 339
G++ T LDH V VG+G TE +YWL+KNSWG +WGDAGY+K+ R+ + CGI
Sbjct: 254 GVYYEQNCSPTFLDHGVLAVGYG-TESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGI 312
Query: 340 GTQSSYP 346
++ SYP
Sbjct: 313 ASEPSYP 319
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 234 bits (597), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 195/337 (57%), Gaps = 17/337 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M ++ +L C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++P+ R+ T + Q L
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNPNQML-- 115
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
P S+DWR+K VT +K Q CG CWAFSAV A+E K+ L+ LS Q LVDCS
Sbjct: 116 ---PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172
Query: 194 N-GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M +AF+YII N+GI +E YPY+A C K AA S Y E+P G
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQYDSKYRAATCSKYTELPYG 232
Query: 253 DEQALLKAVSMQ-PVSIGIAAYTTEFKSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGAN 310
E L +AV+ + PV +G+ A F Y+ G+ ++ C +++H V ++G+G +G
Sbjct: 233 REDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDL-NGKE 291
Query: 311 YWLIKNSWGDTWGDAGYMKILRDEG-LCGIGTQSSYP 346
YWL+KNSWG +G+ GY+++ R++G CGI + SYP
Sbjct: 292 YWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 233 bits (595), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 180/308 (58%), Gaps = 14/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
E + ++GR Y D E R IF++N +YIE+ NK+ G T+ L N+F D+T +EF
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAFS 164
A+ G +P RS S F Y T +DWR K AVTP+KDQ +CG CWAFS
Sbjct: 81 NAVMKG-NIP---RRSAPVSVF-YPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFS 135
Query: 165 AVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGIATEDEY 223
++EG + +LI L+EQQLVDCS G GC GG M AF+YI N GI TE Y
Sbjct: 136 TTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY 195
Query: 224 PYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSYKE 282
PY+A G+C + AA S + + SG E L +AV + P+S+ I A + F+ Y
Sbjct: 196 PYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSS 255
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGI 339
G++ + LDHAV VG+G +E G ++WL+KNSW +WGDAGY+K+ R+ CGI
Sbjct: 256 GVYYEPSCSPSYLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGI 314
Query: 340 GTQSSYPL 347
T +SYPL
Sbjct: 315 ATVASYPL 322
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 233 bits (593), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/341 (38%), Positives = 194/341 (56%), Gaps = 23/341 (6%)
Query: 17 PMFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLE 76
P FI+ L + AS + T S+ KW A H R Y E+ R ++++N++
Sbjct: 3 PTFILAALCLGIASATL----TFNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N+E G ++ + N F D+T++EFR + G++ P +Q
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPLF 111
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS- 192
+ P S+DWR+K VTP+K+Q +CG CWAFSA A+EG L+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
GN GC GG M+ AF+Y+ N G+ +E+ YPY+A + +C + + A + + ++P
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIPK- 230
Query: 253 DEQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTE 306
E+AL+KAV ++ P+S+ I A F YKEGI F C ++ +DH V +VG+G T
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
D + YWL+KNSWG+ WG GY+K+ +D CGI + +SYP
Sbjct: 291 DNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYP 331
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 232 bits (592), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 188/314 (59%), Gaps = 14/314 (4%)
Query: 42 SVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANK---EGNRTYKLGTNRFSD 98
S + E + + G+ Y + E+ R +F + L++I++ N+ +G TY L N FSD
Sbjct: 15 SAIGEWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSD 74
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECG 158
LT++E A TG + R S ++ T + +DWR+K AVTP+KDQ +CG
Sbjct: 75 LTHEEVLATKTGM-----TRRRHPLSVLP-KSAPTTPMAADVDWRNKGAVTPVKDQGQCG 128
Query: 159 CCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN-GNNGCGGGTMEKAFEYIIQNQGI 217
CWAFSAVAA+EG + +L+ LSEQ LVDCS++ GN GC GG +A++YII N+GI
Sbjct: 129 SCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGI 188
Query: 218 ATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIGIAAYTTE 276
TE YPY+A+ C A +S+Y E SGDE AL AV + PVS+ I A +
Sbjct: 189 DTESSYPYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSS 248
Query: 277 FKSYKEGI-FNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRD- 333
F SY G+ + C + +HAVT VG+GT +G +YW++KNSWG WG++GY+K+ R+
Sbjct: 249 FGSYGGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNR 308
Query: 334 EGLCGIGTQSSYPL 347
+ C I T S YP+
Sbjct: 309 DNNCAIATYSVYPV 322
>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
Length = 333
Score = 231 bits (590), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 131/340 (38%), Positives = 193/340 (56%), Gaps = 20/340 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M +IL C + S+ T + S+ KW A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 78 IEKAN---KEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N +EG ++ + N F D+T++EFR + G++ P +Q
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPLFY 112
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS-T 193
+ P S+DWR+K VTP+K+Q +CG CWAFSA A+EG LI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
GN GC GG M+ AF+Y+ N G+ +E+ YPY+A + +C K + A + + ++P
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-Q 231
Query: 254 EQALLKAV-SMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTED 307
E+AL+KAV ++ P+S+ I A F YKEGI F C ++ +DH V +VG+G T D
Sbjct: 232 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD 291
Query: 308 GANYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYP 346
YWL+KNSWG+ WG GY+K+ +D CGI + +SYP
Sbjct: 292 NNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYP 331
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 231 bits (588), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 176/309 (56%), Gaps = 19/309 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRFKIFKENLEYIEKANKE---GNRTYKLGTNRFSDLTNDEF 104
E++ + GR Y D E+ R +F +NL+YIE+ NK+ G TY L N+FSD+TN++F
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 105 RALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKKAVTPIKDQQECGCCWAF 163
A+ GYK P P+ T++ T +DWR K AVTP+KDQ +CG CWAF
Sbjct: 81 NAVMKGYKKGPRPAAVFTSTDA--------APESTEVDWRTKGAVTPVKDQGQCGSCWAF 132
Query: 164 SAVAAVEGITKISGANLIQLSEQQLVDCSTNG--NNGCGGGTMEKAFEYIIQNQGIATED 221
S +EG + L+ LSEQQLVDC+ N GC GG +E+A Y+ N G+ TE
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTES 192
Query: 222 EYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDEQALLKAV-SMQPVSIGIAAYTTEFKSY 280
YPY+A TC A + Y + G E AL A + P+S+ I A F+SY
Sbjct: 193 SYPYEARDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSY 252
Query: 281 KEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGDTWGDAGYMKILRDE-GLC 337
G++ +QLDHAV VG+G +E G ++WL+KNSW +WG++GY+K+ R+ C
Sbjct: 253 YTGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGESGYIKMARNRNNNC 311
Query: 338 GIGTQSSYP 346
GI T + YP
Sbjct: 312 GIATDACYP 320
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 230 bits (586), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 131/341 (38%), Positives = 200/341 (58%), Gaps = 26/341 (7%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M+ + +LL+ V S + + + +++ H E W H + Y ++++ R I+++NL+
Sbjct: 1 MWGLKVLLLP-----VMSFALYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLK 55
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
YI N E G TY+L N D+TN+E TG K+P+ RS + L +
Sbjct: 56 YISIHNLEASLGVHTYELAMNHLGDMTNEEVVQKMTGLKVPASHSRSNDT-------LYI 108
Query: 134 TD----VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
D P S+D+R K VTP+K+Q +CG CWAFS+V A+EG K L+ LS Q LV
Sbjct: 109 PDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLV 168
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
DC + N+GCGGG M AF+Y+ +N+GI +ED YPY + +C AAK Y E+
Sbjct: 169 DCVSE-NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREI 227
Query: 250 PSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTE 306
P G+E+AL +AV+ + PVS+ I A T F+ Y +G+ ++ C + L+HAV VG+G +
Sbjct: 228 PEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQ 286
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
G +W+IKNSWG+ WG+ GY+ + R++ CGI +S+P
Sbjct: 287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 230 bits (586), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 131/341 (38%), Positives = 200/341 (58%), Gaps = 26/341 (7%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M+ + +LL+ V S + + + +++ H E W H + Y ++++ R I+++NL+
Sbjct: 1 MWGLKVLLLP-----VMSFALYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLK 55
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
YI N E G TY+L N D+TN+E TG K+P+ RS + L +
Sbjct: 56 YISIHNLEASLGVHTYELAMNHLGDMTNEEVVQKMTGLKVPASHSRSNDT-------LYI 108
Query: 134 TD----VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLV 189
D P S+D+R K VTP+K+Q +CG CWAFS+V A+EG K L+ LS Q LV
Sbjct: 109 PDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLV 168
Query: 190 DCSTNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEV 249
DC + N+GCGGG M AF+Y+ +N+GI +ED YPY + +C AAK Y E+
Sbjct: 169 DCVSE-NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREI 227
Query: 250 PSGDEQALLKAVS-MQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTE 306
P G+E+AL +AV+ + PVS+ I A T F+ Y +G+ ++ C + L+HAV VG+G +
Sbjct: 228 PEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQ 286
Query: 307 DGANYWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
G +W+IKNSWG+ WG+ GY+ + R++ CGI +S+P
Sbjct: 287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
Length = 329
Score = 228 bits (582), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 198/336 (58%), Gaps = 16/336 (4%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEKWMAQHGRSYKDELEKEMRFKIFKENLEY 77
M++ LL+ VVS + E+++ E W HG+ Y ++++ R I+++NL+
Sbjct: 1 MWVFKFLLLP----VVSFALSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKK 56
Query: 78 IEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
I N E G TY+L N D+T++E TG ++P PS RS ++ T Y
Sbjct: 57 ISVHNLEASLGAHTYELAMNHLGDMTSEEVVQKMTGLRVP-PS-RSFSNDTL-YTPEWEG 113
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
VP S+D+R K VTP+K+Q +CG CWAFS+ A+EG K L+ LS Q LVDC +
Sbjct: 114 RVPDSIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSE 173
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
N GCGGG M AF+Y+ QN GI +ED YPY +C A AAK Y E+P G+E
Sbjct: 174 -NYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNE 232
Query: 255 QALLKAVS-MQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANY 311
+AL +AV+ + PVS+ I A T F+ Y G+ ++ C ++HAV +VG+G T+ G Y
Sbjct: 233 KALKRAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYG-TQKGNKY 291
Query: 312 WLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
W+IKNSWG++WG+ GY+ + R++ CGI +S+P
Sbjct: 292 WIIKNSWGESWGNKGYVLLARNKNNACGITNLASFP 327
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 227 bits (579), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 107/217 (49%), Positives = 149/217 (68%), Gaps = 6/217 (2%)
Query: 135 DVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTN 194
D+P S+DWR+ AV P+K+Q CG CWAFS VAAVEGI +I +LI LSEQQLVDC+T
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT- 60
Query: 195 GNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGDE 254
N+GC GG M AF++I+ N GI +E+ YPY+ G C++ A I +YE VPS +E
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNE 120
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
Q+L KAV+ QPVS+ + A +F+ Y+ GIF G C +HA+T+VG+GT D ++W++
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWIV 179
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
KNSWG WG++GY++ R+ +G CGI +SYP+
Sbjct: 180 KNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 227 bits (579), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 109/217 (50%), Positives = 148/217 (68%), Gaps = 6/217 (2%)
Query: 136 VPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCSTNG 195
+P S+DWR+K + +KDQ CG CWAFSAVAA+E I I NLI LSEQ+LVDC +
Sbjct: 18 LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSY 77
Query: 196 NNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQK-AAAAKISNYEEVPSGDE 254
N GC GG M+ AFE++I+N GI TE++YPY+ G C +K A KI +YE+VP +E
Sbjct: 78 NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNE 137
Query: 255 QALLKAVSMQPVSIGIAAYTTEFKSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
+AL KAV+ QPVSI + A +F+ YK GIF G CGT +DH V I G+G TE+G +YW++
Sbjct: 138 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYG-TENGMDYWIV 196
Query: 315 KNSWGDTWGDAGYMKILRD----EGLCGIGTQSSYPL 347
+NSWG + GY+++ R+ GLCG+ + SYP+
Sbjct: 197 RNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 226 bits (577), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 134/356 (37%), Positives = 195/356 (54%), Gaps = 42/356 (11%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMHEK-----WMAQHGRSYKDELEKEMRFKIFK 72
+ + +LLVS A T +Q E+ + WM H +SY E E R+ IFK
Sbjct: 4 LSFLCVLLVSVA--------TAKQQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFK 54
Query: 73 ENLEYIEKANKEGNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N++Y+++ N +G+ T LG N F+D+TN+E+R Y G K + S T + + +
Sbjct: 55 ANMDYVQQWNSKGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGT-----QEEKVF 108
Query: 133 MTDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCS 192
T S DWR + AVTP+K+Q +CG CW+FS + EG S L+ LSEQ L+DCS
Sbjct: 109 TTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCS 168
Query: 193 TNGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSG 252
T N+GC GG M AFEYII N GI TE YPY+A G C + + A +S+Y+ V +G
Sbjct: 169 TE-NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENGKCEYKSENSGATLSSYKTVTAG 227
Query: 253 DEQALLKAVSMQPVSIGIAAYTTEFKSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGA- 309
E +L AV++ PVS+ I A F+ Y GI + C ++ LDH V VG+G+ +
Sbjct: 228 SESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSS 287
Query: 310 -----------------NYWLIKNSWGDTWGDAGYMKILRD-EGLCGIGTQSSYPL 347
YW++KNSWG +WG GY+ + R+ + CGI + +S+P+
Sbjct: 288 GQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPV 343
>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329
Score = 226 bits (576), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/337 (38%), Positives = 198/337 (58%), Gaps = 18/337 (5%)
Query: 18 MFIIIILLVSCASQVVSSRSTHEQSVVEMH-EKWMAQHGRSYKDELEKEMRFKIFKENLE 76
M+ + +LL+ V S + + + +++ H E W H + Y +++++ R I+++NL+
Sbjct: 1 MWGLKVLLLP-----VVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLK 55
Query: 77 YIEKANKE---GNRTYKLGTNRFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
YI N E G TY+L N D+T++E TG K+P RS + Y
Sbjct: 56 YISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTL---YIPEWE 112
Query: 134 TDVPTSLDWRDKKAVTPIKDQQECGCCWAFSAVAAVEGITKISGANLIQLSEQQLVDCST 193
P S+D+R K VTP+K+Q +CG CWAFS+V A+EG K L+ LS Q LVDC +
Sbjct: 113 GRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS 172
Query: 194 NGNNGCGGGTMEKAFEYIIQNQGIATEDEYPYQAVQGTCSAAQKAAAAKISNYEEVPSGD 253
N+GCGGG M AF+Y+ +N+GI +ED YPY + +C AAK Y E+P G+
Sbjct: 173 E-NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGN 231
Query: 254 EQALLKAVS-MQPVSIGIAAYTTEFKSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGAN 310
E+AL +AV+ + PVS+ I A T F+ Y +G+ ++ C + L+HAV VG+G + G
Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNK 290
Query: 311 YWLIKNSWGDTWGDAGYMKILRDE-GLCGIGTQSSYP 346
+W+IKNSWG+ WG+ GY+ + R++ CGI +S+P
Sbjct: 291 HWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.130 0.387
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 126,277,168
Number of Sequences: 539616
Number of extensions: 5217784
Number of successful extensions: 16073
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 224
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 14958
Number of HSP's gapped (non-prelim): 300
length of query: 348
length of database: 191,569,459
effective HSP length: 118
effective length of query: 230
effective length of database: 127,894,771
effective search space: 29415797330
effective search space used: 29415797330
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)
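
The length figures in the statistics footer above appear to be related by simple arithmetic: the effective HSP length is subtracted once from the query and once per database sequence, and the effective search space is the product of the two effective lengths. The sketch below assumes that relationship and only checks it against the numbers printed in this report; the function name `effective_lengths` is hypothetical and is not part of BLAST.

```python
# Values copied from the statistics footer of this report.
QUERY_LENGTH = 348
DB_LETTERS = 191_569_459
DB_SEQUENCES = 539_616
EFFECTIVE_HSP_LENGTH = 118


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumption: the length adjustment is applied once to the query and
    once to every database sequence, as the footer values suggest.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, search_space = effective_lengths(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query)     # 230
print(eff_db)        # 127894771
print(search_space)  # 29415797330
```

The three printed values agree with the "effective length of query", "effective length of database", and "effective search space" lines of the footer.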