BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018958
(348 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 312 bits (799), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 215/314 (68%), Gaps = 9/314 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T+ ++E+ E WM++H ++YK EK R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N+GC GG + AF YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
ED+YPY G C ++ IS YE+VP D+++L+KA++ QPVS+AI A +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
FQ YK G+FNG CGT LDH V VG+G+++ G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337
Query: 334 -EGLCGIGTRSSYP 346
EGLCGI +SYP
Sbjct: 338 PEGLCGINKMASYP 351
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 308 bits (789), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 217/342 (63%), Gaps = 19/342 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
+F ++ L +C E+ + ++++W + H RS E+E R +F+ N+
Sbjct: 9 LFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFNVFRHNV 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQN 130
++ NK+ NR+YKL N+F+DLT +EF+ YTG ++M R + + ++N
Sbjct: 66 MHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHEN 124
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
LS +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++ L+ LSEQ+L+D
Sbjct: 125 LSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEV 249
C T N GC GG E AF +I +N GI TED YPY+ + G C A++ I +E+V
Sbjct: 183 CDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDV 242
Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
P DE ALLKAV+ QPVS+AI A S++FQ Y EG+F G CGT+L+H V VG+G +E G
Sbjct: 243 PENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYG-SERGK 301
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
YW+++NSWG WG+ GY+KI R+ EG CGI +SYP+
Sbjct: 302 KYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPI 343
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 308 bits (788), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 211/333 (63%), Gaps = 10/333 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +S +++ E+WMA++GR YKD EK +R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S+DWRD GAVT +KNQ CG CWAFA++A VE I KI+ GNL+ LSEQQ+LDC+ +
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV--SY 182
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG KA+++II N+G+A+ YPY+A GTC P +A I+ Y V +E+ +
Sbjct: 183 GCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNM 242
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+ AVS QP++ A+ A S FQ YK G+F G CGT+L+HA+ I+G+G G +W+++NS
Sbjct: 243 MYAVSNQPIAAALDA-SGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNS 301
Query: 318 WGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
WG WG+ GY+++ RD GLCGI YP
Sbjct: 302 WGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 307 bits (787), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 216/323 (66%), Gaps = 12/323 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
V ++ E V+ I+E W+ +HG+ S +EK+ R +IFK+NL ++++ N E N +Y
Sbjct: 35 VSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSY 93
Query: 90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
+LG +F+DLTNDE+R+ Y G KM R T+ +Y+ ++P S+DWR KGAV
Sbjct: 94 RLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVA 150
Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
+K+Q CG CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 151 EVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFE 210
Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
+II+N GI T+ +YPY+ V GTC +K A I +YE+VP+ E++L KAV+ QP+SI
Sbjct: 211 FIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270
Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
AI A FQ Y GIF+G CGTQLDH V VG+G TE+G +YW+++NSWG +WG++GY+
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYL 329
Query: 329 KIVRD----EGLCGIGTRSSYPL 347
++ R+ G CGI SYP+
Sbjct: 330 RMARNIASSSGKCGIAIEPSYPI 352
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 305 bits (782), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 146/333 (43%), Positives = 212/333 (63%), Gaps = 10/333 (3%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L AS +SR +++ E+WMA++GR YKD+ EK R +IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+T EF A YTG +P R S + +++++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
S+DWRD GAV +KNQ CG CW+FAA+A VEGI KI++G L+ LSEQ++LDC+ +
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--SY 182
Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
GC GG KA+ +II N G+ TE+ YPY A GTC+A P +A I+ Y V DE+++
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSM 242
Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
+ AVS QP++ I A S FQ Y G+F+G CGT L+HA+TI+G+G G YW+++NS
Sbjct: 243 MYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNS 301
Query: 318 WGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
WG++WG+ GY+++ R G+CGI +P
Sbjct: 302 WGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 305 bits (780), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 211/344 (61%), Gaps = 14/344 (4%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
T + + L S V +S H E+S+ +++E+W + H S + EK R
Sbjct: 3 TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFN 61
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKY 128
+FK NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ P R T +
Sbjct: 62 VFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAF 120
Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+ VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+L
Sbjct: 121 MYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQEL 180
Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYE 247
+DC N GC GG E AF +I Q GI TE YPY+A GTC A++ A I +E
Sbjct: 181 VDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 240
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
VP+ DE ALLKAV+ QPVS+AI A ++FQ Y EG+F G C T L+H V IVG+GTT D
Sbjct: 241 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 300
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G NYW+++NSWG WG+ GY+++ R+ EGLCGI SYP+
Sbjct: 301 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 302 bits (773), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 202/317 (63%), Gaps = 11/317 (3%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W + H R + EK R FK N +I NK G+ Y+L N+F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
EFRA + G + +PS + + F Y L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T N+GC GG + AF YI N G+
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216
Query: 219 TEDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
TE YPY+A GTC+ A+ P I +++VP+ E+ L +AV+ QPVS+A+ A
Sbjct: 217 TEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASG 276
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 277 KAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDS 336
Query: 335 ----GLCGIGTRSSYPL 347
GLCGI +SYP+
Sbjct: 337 GASGGLCGIAMEASYPV 353
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 301 bits (772), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 9/316 (2%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W + H R + EK R FK N +I NK G+ Y+L N+F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
EFRA + G + + F Y L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T N+GC GG + AF YI N G+ T
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLIT 217
Query: 220 EDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
E YPY+A GTC+ A+ P I +++VP+ E+ L +AV+ QPVS+A+ A
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
F Y EG+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 335 ---GLCGIGTRSSYPL 347
GLCGI +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 301 bits (771), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 203/312 (65%), Gaps = 16/312 (5%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++E+W + H S + EK+ R +FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 37 LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94
Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
Y+G K+ HR + TF Y+ + VP S+DWR KGAVT +K+Q +CG C
Sbjct: 95 NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
WAF+ + AVEGI +I++ L+ LSEQ+L+DC T+ N GC GG + AF +I Q GI TE
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210
Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
YPY+A GTC +++ A A I +E VP DE ALLKAV+ QPVS+AI A ++FQ
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQF 270
Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
Y EG+F G CGT+LDH V IVG+GTT DG YW +KNSWG WG+ GY+++ R EG
Sbjct: 271 YSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEG 330
Query: 336 LCGIGTRSSYPL 347
LCGI +SYP+
Sbjct: 331 LCGIAMEASYPI 342
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 299 bits (765), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 151/345 (43%), Positives = 216/345 (62%), Gaps = 22/345 (6%)
Query: 19 FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
FI++ L + + H E S+ E++E+W + H + E EK R +FK
Sbjct: 4 FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFNVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFK 127
N+++I + NK+ +++YKL N+F D+T++EFR Y G ++M ++T S F
Sbjct: 63 HNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS--FM 119
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y N++ +PTS+DWR GAVTP+KNQ +CG CWAF+ V AVEGI +IR+ L LSEQ+
Sbjct: 120 YANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
L+DC TN N GC GG + AF +I + G+ +E YPY+A TC ++ A I +
Sbjct: 178 LVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237
Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
E+VP E L+KAV+ QPVS+AI A ++FQ Y EG+F G CGT+L+H V +VG+GTT
Sbjct: 238 EDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTI 297
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
DG YW++KNSWG WG+ GY+++ R EGLCGI +SYPL
Sbjct: 298 DGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 298 bits (763), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 220/353 (62%), Gaps = 26/353 (7%)
Query: 18 MFIIITLLVSCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGRSYKD 60
+F++ ++ SCA+ VVSS H Q + + + E WM +HG+ Y
Sbjct: 10 IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EKE RL IF++NL +I N E N +Y+LG N+F+DL+ E+ + G P +
Sbjct: 70 VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
+S+ +Y+ +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-- 238
+ LSEQ L++C+ NNGC GG E A+ +I+ N G+ T+++YPY+A+ G C K
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247
Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
I YE +P+ DE AL+KAV+ QPV+ + + S EFQ Y+ G+F+G CGT L+H V
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307
Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+VG+G TE+G +YW++KNS G+TWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 308 VVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 297 bits (761), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 203/318 (63%), Gaps = 16/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR KGAVT +K+Q
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC N GC GG E AF +I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
GI TE YPY A GTC ++ A I +E VP DE ALLKAV+ QPVS+AI A
Sbjct: 207 GGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
++FQ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 334 ----EGLCGIGTRSSYPL 347
EGLCGI +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 297 bits (761), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 152/350 (43%), Positives = 221/350 (63%), Gaps = 21/350 (6%)
Query: 16 TPMFIIITLLV--SCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELE 63
+ M I++ +V SCA+ + S +++ SV + I E WM +HG+ Y E
Sbjct: 6 SAMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 65
Query: 64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
KE RL IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +
Sbjct: 66 KERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMT 124
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
S+ +Y+ + +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--AAA 241
SEQ L++C+ NNGC GG E A+ +I++N G+ T+++YPY+AV G C K
Sbjct: 185 SEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNV 243
Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
I YE +P+ DE AL+KAV+ QPV+ I + S EFQ Y+ G+F+G CGT L+H V +VG
Sbjct: 244 MIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVG 303
Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
+G TE+G +YWL+KNS G TWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 304 YG-TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 296 bits (757), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC GG F +II N GI TE+ YPY A G C+ Q I YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 295 bits (754), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 205/318 (64%), Gaps = 17/318 (5%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E++V +++E+W H S + E R +F+ N+ ++ + NK+ N+ YKL N+F+D+
Sbjct: 31 EENVWKLYERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
T+ EFR+ Y G ++M R S F Y+N+ T VP+S+DWR+KGAVT +KNQ
Sbjct: 89 THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 144
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
++CG CWAF+ VAAVEGI KIR+ L+ LSEQ+L+DC T N GC GG E AF +I N
Sbjct: 145 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 204
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA--AKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
GI TE+ YPY + A I +E VP DE+ LLKAV+ QPVS+AI A
Sbjct: 205 GGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDA 264
Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
S++FQ Y EG+F G CGTQL+H V IVG+G T++G YW+++NSWG WG+ GY++I R
Sbjct: 265 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER 324
Query: 333 ----DEGLCGIGTRSSYP 346
+EG CGI +SYP
Sbjct: 325 GISENEGRCGIAMEASYP 342
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 294 bits (752), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 153/340 (45%), Positives = 211/340 (62%), Gaps = 10/340 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y + S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
T GC GG F +II N GI TE+ YPY A G C+ Q I YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVP 243
Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
+E AL AV+ QPVS+A+ A F+ Y GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYG-TEGGID 302
Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
YW++KNSW TWG+ GYM+I+R+ G CGI T SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 292 bits (747), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 219/349 (62%), Gaps = 25/349 (7%)
Query: 17 PMFIIITLL----VSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
P FI + L+ +S A + + E S+ ++EKW H + +D EK R +
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
FKEN+++I + N++ + YKL N+F D+TN EFR+ Y G K+ HRS + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120
Query: 126 FKYQNLSMTDVPT-SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
F Y+N+ +P S+DWR KGAVT +K+Q +CG CWAF+ +A+VEGI +I++G L+ LS
Sbjct: 121 FMYENVG--SLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAK 242
EQ+L+DC T+ N GC GG + AF +I Q GI TED YPY GTC++ P +
Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVS- 236
Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
I +++VP+ +E AL++AV+ QP+S++I A FQ Y EG+F G CGT+LDH V IVG+
Sbjct: 237 IDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGY 296
Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
G T DG YW++KNSWG WG++GY+++ R G CGI +SYP+
Sbjct: 297 GATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 291 bits (744), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 215/315 (68%), Gaps = 11/315 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+H++ ++E+ E W++ ++Y+ EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR KGAV +KNQ
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI KI +GNL LSEQ+L+DC T NNGC GG + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
+ E++YPY GTC + + I+ +++VP+ DE++LLKA++ QP+S+AI A
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278
Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
EFQ Y G+F+G CG LDH V VG+G+++ G++Y ++KNSWG WG+ GY+++ R+
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTG 337
Query: 334 --EGLCGIGTRSSYP 346
EGLCGI +S+P
Sbjct: 338 KPEGLCGINKMASFP 352
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 290 bits (741), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 209/322 (64%), Gaps = 16/322 (4%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V I+ +W A+HG++ + +++ R IFK+NL +I+ N++ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
++N G+ TE +YPY+ G C++ K + I YE+VP+ DE AL KA+S QPVS+AI
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
A FQ Y+ GIF G CGT LDHAV VG+G +E+G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339
Query: 331 VRD-----EGLCGIGTRSSYPL 347
R+ G CGI +SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 287 bits (735), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
+II N GI TED+YPY+ C +K A I +YE+V E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261
Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
+AI A FQ Y GIF G CGT LDH V VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320
Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
+++ R+ G CGI SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 287 bits (735), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 212/310 (68%), Gaps = 15/310 (4%)
Query: 47 HEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTND 102
++ W+A++G + L E E R +F +NL++++ N + ++LG N+F+DLTN+
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
EFRA + G K+ + RS + +Y++ + ++P S+DWR+KGAV P+KNQ +CG CWA
Sbjct: 112 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
F+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG + AF +II+N GI TED
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
+YPY+AV G C ++ A I +E+VP DE++L KAV+ QPVS+AI A EFQ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
G+F+G CGT LDH V VG+G T++G +YW+++NSWG WG++GY+++ R+ G
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346
Query: 337 CGIGTRSSYP 346
CGI +SYP
Sbjct: 347 CGIAMMASYP 356
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 285 bits (728), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 204/317 (64%), Gaps = 15/317 (4%)
Query: 44 VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
+ I+ +W +HG+S + +++ R IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
LTNDE+R+LY G + P R T + KY +++ +VP ++DWR KGAV IK+Q
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ AAVEGI KI +G L+ LSEQ+L+DC + N GC GG + AF +I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
G+ TE +YPY G C++ K + I YE+VPS DE AL +AVS QPVS+AI A
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ Y+ GIF G CGT +DHAV VG+G +E+G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298
Query: 334 ---EGLCGIGTRSSYPL 347
G CGI +SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 280 bits (717), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 201/318 (63%), Gaps = 16/318 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC + + GC GG + + Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211
Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
G+ T YPYQA C A KP KI+ Y+ VPS E + L A++ QP+S+ + A
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271
Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330
Query: 333 ---DEGLCGIGTRSSYPL 347
+G CG+ S YP
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 280 bits (715), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 202/316 (63%), Gaps = 12/316 (3%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E V ++E+W+ ++ ++Y EKE R KIFK+NL+++++ N +RT+++G +F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR GAV +K+Q CG
Sbjct: 96 LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV AVEGI +I +G LI LSEQ+L+DC N GC GG AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 218 ATEDEYPYQAVP-GTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
T+ +YPY A G C+A + I YE+VP DE++L KAV+ QPVS+AI A S
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
FQ YK G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331
Query: 334 ---EGLCGIGTRSSYP 346
G CGI SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 275 bits (703), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 139/295 (47%), Positives = 194/295 (65%), Gaps = 14/295 (4%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
E E R ++F +NL++++ N + ++LG N+F+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ Y++ + +P S+DWRDKGAV P+KNQ +CG CWAF+AVAAVEGI KI +G
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
L+ LSEQ+L++C+ NG N+GC GG + AFA+I +N G+ TE++YPY A+ G C+ A++
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
I +E+VP DE +L KAV+ QPVS+AI A EFQ Y G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 298 TIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
VG+GT GA YW ++NSWG WG+ GY+++ R+ G CGI +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 273 bits (697), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+DE E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
CG CWAF++ A+EG +SG L+ LSEQ L+DCST GNNGC GG + AF YI N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
GI TE YPY+A+ +C + A + ++P GDE+ + +AV ++ PVS+AI A
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 294
Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
FQ Y EG++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 295 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 354
Query: 333 D-EGLCGIGTRSSYPL 347
+ E CGI + SSYPL
Sbjct: 355 NKENQCGIASASSYPL 370
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 261 bits (666), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 191/314 (60%), Gaps = 13/314 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
I E+W QH ++Y +E+E+ R+KIF EN I K N+ +G +YKLG N+++D+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
+ EF+ GY + T Y + VP S+DWR+ GAVT +K+Q C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
G CWAF++ A+EG ++G L+ LSEQ L+DCST GNNGC GG + AF YI N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYST 275
I TE YPY+ + +C + A + + ++P GDE+ + KAV +M PVS+AI A
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHE 263
Query: 276 EFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
FQ Y EG++N C Q LDH V +VG+GT E G +YWL+KNSWG TWG+ GY+K+ R+
Sbjct: 264 SFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARN 323
Query: 334 E-GLCGIGTRSSYP 346
+ CGI T SSYP
Sbjct: 324 QNNQCGIATASSYP 337
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 260 bits (665), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 142/351 (40%), Positives = 205/351 (58%), Gaps = 21/351 (5%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
SF+ ++ + +S + +E V+ ++E+W+ ++G++Y EKE R K
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P +DWR++GAV P +K Q ECG CWAFAA AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK----- 242
+DC N N GC GG AF +I +N GI +++ Y Y G +AA K K
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYT---GEDTAACKAIEMKTTRVV 237
Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIV 300
I+ +E VP DE +L KAV+ QP+S+ I+A YK G++ G C DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295
Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
G+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C + YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 259 bits (662), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 206/337 (61%), Gaps = 17/337 (5%)
Query: 18 MFIIITLLVSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+F+ + L S V S++ T + ++++ E WM +H + YK+ EK R +IFK+NL
Sbjct: 17 LFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNL 76
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+YI++ NK+ N +Y LG N F+D++NDEF+ YTG + ++ +T S + N +
Sbjct: 77 KYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVN 133
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAVTP+KNQ CG CWAF+AV +EGI KIR+GNL + SEQ+LLDC
Sbjct: 134 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR- 192
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDE 254
+ GC GG A + Q GI + YPY+ V C + +K P AAK +V +E
Sbjct: 193 SYGCNGGYPWSALQLVAQ-YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNE 251
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
ALL +++ QPVS+ + A +FQ Y+ GIF G CG ++DHAV VG+ G NY LI
Sbjct: 252 GALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILI 306
Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
KNSWG WG+ GY++I R G+CG+ T S YP+
Sbjct: 307 KNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 251 bits (640), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 192/314 (61%), Gaps = 12/314 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM H + Y++ EK R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF Y G + + +S ++ N ++P ++DWR KGAVTP+++Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC ++GC GG A Y+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+YPY+A GTC A Q K S V +E LL A++ QPVS+ + +
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331
Query: 333 DEGLCGIGTRSSYP 346
G+CG+ S YP
Sbjct: 332 SPGVCGLYKSSYYP 345
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 248 bits (633), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 194/315 (61%), Gaps = 12/315 (3%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM +H ++YK+ EK R +IFK+NL+YI++ NK N Y LG N+FS
Sbjct: 39 TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF+ Y G P + ++ N + D+P S+DWR KGAVTP+K+Q C
Sbjct: 98 DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
CWAF+ VA VEGI KI++GNL++LSEQ+L+DC + GC G + + Y+ QN GI
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN-GI 212
Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
+YPY A TC A Q K + V S +E +LL A++ QPVS+ + + +
Sbjct: 213 HLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRD 272
Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
FQ+YK GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 273 FQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGN 331
Query: 333 DEGLCGIGTRSSYPL 347
G+CG+ S YP+
Sbjct: 332 SPGVCGVYRSSYYPI 346
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 241 bits (616), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 132/344 (38%), Positives = 210/344 (61%), Gaps = 13/344 (3%)
Query: 11 FKINTTPMFIIITLLVSCASQ-VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
+++ T +F +I L +S S V S ++ S ++ WM + ++Y + E R +
Sbjct: 1 MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTHK-EFMPRYE 55
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
FK+N++Y+ N +G++T LG NQ +DL+N+E+R Y G + + +
Sbjct: 56 EFKKNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRL 114
Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
N P ++DWR+K AVTP+K+Q +CG C++F+ +VEG+T I++G L+ LSEQ +L
Sbjct: 115 NRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174
Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQKPAAAKISNYE 247
DCS++ GN GC GG AF YII+N G+ +E++YPY+ V C + AAKI++Y+
Sbjct: 175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYK 234
Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTT 305
E+ +GDE L A+ + PVS+AI A FQ Y G+ + C ++ LDH V VG G T
Sbjct: 235 EIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMG-T 293
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
++G +Y+++KNSWG +WG GY+ + R+ + CGI T +SYP+A
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASYPIA 337
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 240 bits (613), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 199/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ L+ C+S + H ++ H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MNWLVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
+ N E G +Y+LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 115
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG +AF YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E+AL +AV+ + PVS+ I A + F YK G+ ++ C ++H V +VG+G DG
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGK 291
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI SYP
Sbjct: 292 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 240 bits (613), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 195/338 (57%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ LL C+ V H+ ++ H W + + YK+E E+ R I+++NL+
Sbjct: 1 MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T +E +L ++PS R+ T Y++ S
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG AF YII N GI +E YPY+A+ G C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAATCSKYTELPF 232
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E AL +AV+ + PVS+AI A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL-NGK 291
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
+YWL+KNSWG +GD GY+++ R+ G CGI + SYP
Sbjct: 292 DYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 238 bits (608), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)
Query: 50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
W H + YKD+ E+E+R I+++NL++I N E G TY++G N D+TN+E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98
Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
++P S ++ T +++ S +P ++DWR+KG VT +K Q CG CWAF+AV
Sbjct: 99 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A+EG K+++G LI LS Q L+DCS GN GC GG +AF YII N GI + Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A C K AA S Y ++P GDE AL +AV+ + PVS+ I A + F YK
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273
Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R ++ CGI
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332
Query: 341 TRSSYP 346
+ SYP
Sbjct: 333 SYCSYP 338
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 238 bits (608), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 129/309 (41%), Positives = 181/309 (58%), Gaps = 19/309 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y D E+ RL +F +NL+YIE+ NK+ G TY L NQFSD+TN++F
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 105 RALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
A+ GYK P P+ T++ T +DWR KGAVTP+K+Q +CG CWAF
Sbjct: 81 NAVMKGYKKGPRPAAVFTSTDA--------APESTEVDWRTKGAVTPVKDQGQCGSCWAF 132
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLGGSREKAFAYIIQNQGIATED 221
+ +EG +++G L+ LSEQQL+DC+ N GC GG E+A Y+ N G+ TE
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTES 192
Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSY 280
YPY+A TC A + Y + G E AL A + P+S+AI A FQSY
Sbjct: 193 SYPYEARDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSY 252
Query: 281 KEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLC 337
G++ +QLDHAV VG+G +E G ++WL+KNSW +WG++GY+K+ R+ C
Sbjct: 253 YTGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGESGYIKMARNRNNNC 311
Query: 338 GIGTRSSYP 346
GI T + YP
Sbjct: 312 GIATDACYP 320
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 235 bits (600), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 128/338 (37%), Positives = 199/338 (58%), Gaps = 18/338 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG AF YII N+GI ++ YPY+A+ C K AA S Y E+P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232
Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
G E L +AV+ + PVS+ + A F Y+ G+ + C ++H V +VG+G +G
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291
Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
YWL+KNSWG+ +G+ GY+++ R++G CGI + SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 235 bits (599), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/341 (38%), Positives = 195/341 (57%), Gaps = 23/341 (6%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
P FI+ L + AS + T S+ KW A H R Y E+ R ++++N++
Sbjct: 3 PTFILAALCLGIASATL----TFNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
IE N+E G ++ + N F D+T++EFR + G++ P +Q
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPLF 111
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
+ P S+DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
GN GC GG + AF Y+ N G+ +E+ YPY+A +C + + A + + ++P
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIPK- 230
Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTE 306
E+AL+KAV ++ P+S+AI A F YKEGI F C ++ +DH V +VG+G T
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
D + YWL+KNSWG WG GY+K+ +D CGI + +SYP
Sbjct: 291 DNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYP 331
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 234 bits (598), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 190/308 (61%), Gaps = 14/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEF 104
E + + G+ Y + E+ R+ +F + L++I++ N+ +G TY L N FSDLT++E
Sbjct: 21 ENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEV 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A TG + R S ++ T + +DWR+KGAVTP+K+Q +CG CWAF+
Sbjct: 81 LATKTGM-----TRRRHPLSVLP-KSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFS 134
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
AVAA+EG +++G+L+ LSEQ L+DCS++ GN GC GG +A+ YII N+GI TE Y
Sbjct: 135 AVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSY 194
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
PY+A+ C A +S+Y E SGDE AL AV + PVS+ I A + F SY
Sbjct: 195 PYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSYGG 254
Query: 283 GI-FNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
G+ + C + +HAVT VG+GT +G +YW++KNSWG WG++GY+K+ R+ + C I
Sbjct: 255 GVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDNNCAI 314
Query: 340 GTRSSYPL 347
T S YP+
Sbjct: 315 ATYSVYPV 322
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 233 bits (594), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 181/308 (58%), Gaps = 14/308 (4%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E + ++GR Y D E R IF++N +YIE+ NK+ G T+ L N+F D+T +EF
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ G +P RS S F Y T +DWR KGAVTP+K+Q +CG CWAF+
Sbjct: 81 NAVMKG-NIP---RRSAPVSVF-YPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFS 135
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
++EG +++G+LI L+EQQL+DCS G GC GG AF YI N GI TE Y
Sbjct: 136 TTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY 195
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A G+C AA S + + SG E L +AV + P+S+ I A + FQ Y
Sbjct: 196 PYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSS 255
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 339
G++ + LDHAV VG+G +E G ++WL+KNSW +WGDAGY+K+ R+ CGI
Sbjct: 256 GVYYEPSCSPSYLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGI 314
Query: 340 GTRSSYPL 347
T +SYPL
Sbjct: 315 ATVASYPL 322
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 232 bits (592), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 126/337 (37%), Positives = 197/337 (58%), Gaps = 17/337 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++P+ R+ T + Q L
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNPNQML-- 115
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCS
Sbjct: 116 ---PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172
Query: 194 N-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
GN GC GG +AF YII N+GI +E YPY+A C K AA S Y E+P G
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQYDSKYRAATCSKYTELPYG 232
Query: 253 DEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGAN 310
E L +AV+ + PV + + A F Y+ G+ ++ C +++H V ++G+G +G
Sbjct: 233 REDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDL-NGKE 291
Query: 311 YWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
YWL+KNSWG+ +G+ GY+++ R++G CGI + SYP
Sbjct: 292 YWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328
>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
Length = 333
Score = 232 bits (592), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/342 (38%), Positives = 193/342 (56%), Gaps = 23/342 (6%)
Query: 16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
P I+ + AS + T + S+ KW A H R Y E+ R ++++N+
Sbjct: 2 NPTLILAAFCLGIASATL----TFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNM 56
Query: 76 EYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ IE N +EG ++ + N F D+T++EFR + G++ P +Q
Sbjct: 57 KMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ P S+DWR+KG VTP+KNQ +CG CWAF+A A+EG ++G LI LSEQ L+DCS
Sbjct: 111 FYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCS 170
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
GN GC GG + AF Y+ N G+ +E+ YPY+A +C K + A + + ++P
Sbjct: 171 GPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK 230
Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TT 305
E+AL+KAV ++ P+S+AI A F YKEGI F C ++ +DH V +VG+G T
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
D YWL+KNSWG WG GY+K+ +D CGI + +SYP
Sbjct: 290 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYP 331
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 231 bits (589), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 132/307 (42%), Positives = 183/307 (59%), Gaps = 16/307 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+ + Q+GR Y D E+ R ++F++N + IE NK+ G T+K+ NQF D+TN+EF
Sbjct: 21 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GYK S R + F + M +DWR K VTP+K+Q++CG CWAF+
Sbjct: 81 NAVMKGYKKGS---RGEPKAVFTAEAGPMA---ADVDWRTKALVTPVKDQEQCGSCWAFS 134
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A A+EG +++ L+ LSEQQL+DCST+ GN+GC GG AF YI N GI TE Y
Sbjct: 135 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 194
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
PY+A +C A + EV E+AL +AVS + P+S+AI A FQ Y
Sbjct: 195 PYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSS 253
Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
G++ T LDH V VG+G TE +YWL+KNSWG++WGDAGY+K+ R+ + CGI
Sbjct: 254 GVYYEQNCSPTFLDHGVLAVGYG-TESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGI 312
Query: 340 GTRSSYP 346
+ SYP
Sbjct: 313 ASEPSYP 319
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 229 bits (585), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 107/217 (49%), Positives = 149/217 (68%), Gaps = 6/217 (2%)
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
D+P S+DWR+ GAV P+KNQ CG CWAF+ VAAVEGI +I +G+LI LSEQQL+DC+T
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT- 60
Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
N+GC GG AF +I+ N GI +E+ YPY+ G C++ I +YE VPS +E
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNE 120
Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
Q+L KAV+ QPVS+ + A +FQ Y+ GIF G C +HA+T+VG+GT D ++W++
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWIV 179
Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
KNSWG WG++GY++ R+ +G CGI +SYP+
Sbjct: 180 KNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 229 bits (583), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 193/326 (59%), Gaps = 21/326 (6%)
Query: 33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
V S + + + +++ H E W H + Y ++++ R I+++NL+YI N E G T
Sbjct: 11 VMSFALYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----VPTSLDWRD 144
Y+L N D+TN+E TG K+P+ RS + L + D P S+D+R
Sbjct: 71 YELAMNHLGDMTNEEVVQKMTGLKVPASHSRSNDT-------LYIPDWEGRAPDSVDYRK 123
Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSR 204
KG VTP+KNQ +CG CWAF++V A+EG K ++G L+ LS Q L+DC + N+GC GG
Sbjct: 124 KGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYM 182
Query: 205 EKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-M 263
AF Y+ +N+GI +ED YPY +C AAK Y E+P G+E+AL +AV+ +
Sbjct: 183 TNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARV 242
Query: 264 QPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
PVS+AI A T FQ Y +G+ ++ C + L+HAV VG+G + G +W+IKNSWG
Sbjct: 243 GPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNKHWIIKNSWGEN 301
Query: 322 WGDAGYMKIVRDE-GLCGIGTRSSYP 346
WG+ GY+ + R++ CGI +S+P
Sbjct: 302 WGNKGYILMARNKNNACGIANLASFP 327
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 229 bits (583), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 193/326 (59%), Gaps = 21/326 (6%)
Query: 33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
V S + + + +++ H E W H + Y ++++ R I+++NL+YI N E G T
Sbjct: 11 VMSFALYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----VPTSLDWRD 144
Y+L N D+TN+E TG K+P+ RS + L + D P S+D+R
Sbjct: 71 YELAMNHLGDMTNEEVVQKMTGLKVPASHSRSNDT-------LYIPDWEGRAPDSVDYRK 123
Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSR 204
KG VTP+KNQ +CG CWAF++V A+EG K ++G L+ LS Q L+DC + N+GC GG
Sbjct: 124 KGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYM 182
Query: 205 EKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-M 263
AF Y+ +N+GI +ED YPY +C AAK Y E+P G+E+AL +AV+ +
Sbjct: 183 TNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARV 242
Query: 264 QPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
PVS+AI A T FQ Y +G+ ++ C + L+HAV VG+G + G +W+IKNSWG
Sbjct: 243 GPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNKHWIIKNSWGEN 301
Query: 322 WGDAGYMKIVRDE-GLCGIGTRSSYP 346
WG+ GY+ + R++ CGI +S+P
Sbjct: 302 WGNKGYILMARNKNNACGIANLASFP 327
>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
Length = 334
Score = 228 bits (581), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 130/340 (38%), Positives = 194/340 (57%), Gaps = 19/340 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M + + L C + S+ +Q++ +W A H R Y E+ R ++++N++
Sbjct: 1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAN-EEGWRRAVWEKNMKM 58
Query: 78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
IE N E G + + N F D+TN+EFR + ++ + + F+
Sbjct: 59 IELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR----NQKFRKGKVFR--EPLFL 112
Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
D+P S+DWR KG VTP+KNQK+CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
GN GC GG +AF Y+ +N G+ +E+ YPY AV C + + A + + V G
Sbjct: 173 QGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGK 232
Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTED 307
E+AL+KAV ++ P+S+A+ A + FQ YK GI F C ++ LDH V +VG+ G +
Sbjct: 233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSN 292
Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
+ YWL+KNSWG WG GY+KI +D+ CGI T +SYP
Sbjct: 293 NSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYP 332
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 227 bits (579), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 185/311 (59%), Gaps = 19/311 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+W + H R Y E+E R I+++N+ I+ N E G + + N F D+TN+EF
Sbjct: 30 HQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R + GY+ H+ +Q M +P S+DWR+KG VTP+KNQ +CG CWAF+
Sbjct: 89 RQVVNGYR-----HQKHKKGRL-FQEPLMLKIPKSVDWREKGCVTPVKNQGQCGSCWAFS 142
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A +EG +++G LI LSEQ L+DCS GN GC GG + AF YI +N G+ +E+ Y
Sbjct: 143 ASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESY 202
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A G+C + A A + + ++P E+AL+KAV ++ P+S+A+ A Q Y
Sbjct: 203 PYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 283 GI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGL 336
GI + C ++ LDH V +VG+ GT + YWL+KNSWG+ WG GY+KI +D +
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321
Query: 337 CGIGTRSSYPL 347
CG+ T +SYP+
Sbjct: 322 CGLATAASYPV 332
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 226 bits (575), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 184/311 (59%), Gaps = 19/311 (6%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+W + H R Y E+E R ++++N+ I+ N E G + + N F D+TN+EF
Sbjct: 30 HQWKSTHRRLYGTN-EEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEF 88
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
R + GY+ H+ +Q M +P ++DWR+KG VTP+KNQ +CG CWAF+
Sbjct: 89 RQIVNGYR-----HQKHKKGRL-FQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWAFS 142
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
A +EG +++G LI LSEQ L+DCS + GN GC GG + AF YI +N G+ +E+ Y
Sbjct: 143 ASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESY 202
Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
PY+A G+C + A A + + ++P E+AL+KAV ++ P+S+A+ A Q Y
Sbjct: 203 PYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 283 GI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
GI + C ++ LDH V +VG+ GT + YWL+KNSWG WG GY+KI +D
Sbjct: 262 GIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNH 321
Query: 337 CGIGTRSSYPL 347
CG+ T +SYP+
Sbjct: 322 CGLATAASYPI 332
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 226 bits (575), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 112/213 (52%), Positives = 145/213 (68%), Gaps = 10/213 (4%)
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAVTP+KNQ CG CWAF+ V+ VE I +IR+GNLI LSEQ+L+DC
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
N+GCLGG+ A+ YII N GI T+ YPY+AV G C AA K I Y VP +E
Sbjct: 60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASK--VVSIDGYNGVPFCNEX 117
Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
AL +AV++QP ++AI A S +FQ Y GIF+G CGT+L+H VTIVG+ ANYW+++
Sbjct: 118 ALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGY-----QANYWIVR 172
Query: 316 NSWGNTWGDAGYMKIVR--DEGLCGIGTRSSYP 346
NSWG WG+ GY++++R GLCGI YP
Sbjct: 173 NSWGRYWGEKGYIRMLRVGGCGLCGIARLPYYP 205
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.130 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 127,069,099
Number of Sequences: 539616
Number of extensions: 5202453
Number of successful extensions: 16415
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 223
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 15280
Number of HSP's gapped (non-prelim): 291
length of query: 348
length of database: 191,569,459
effective HSP length: 118
effective length of query: 230
effective length of database: 127,894,771
effective search space: 29415797330
effective search space used: 29415797330
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)