BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023507
(281 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 221 bits (562), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 120/330 (36%), Positives = 168/330 (50%), Gaps = 71/330 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L V AS +S +++ E+WMA++GR YKD EK +R +IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+TN+EF A YTG +P R S + ++ ++ VP
Sbjct: 68 IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD------- 190
S+DWRD GAVT +KNQ CG CWAFA++A VE I KI+ GNL+ LSEQQ+LD
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAVSYGC 184
Query: 191 --------------------------------CSTNGNNGCLGGSR--------EKAFAY 210
C TNG +R E+ Y
Sbjct: 185 KGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNMMY 244
Query: 211 IIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ NQ G+F G CGT+L+HA+ I+G+G G +W+++NSWG
Sbjct: 245 AVSNQPIAAALDASGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGA 304
Query: 254 TWGDAGYMKIVRDE----GLCGIGTRSSYP 279
WG+ GY+++ RD GLCGI YP
Sbjct: 305 GWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 220 bits (561), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 117/330 (35%), Positives = 173/330 (52%), Gaps = 71/330 (21%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+F+ + L AS +SR +++ E+WMA++GR YKD+ EK R +IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
IE N +Y LG NQF+D+T EF A YTG +P R S + +++++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--- 194
S+DWRD GAV +KNQ CG CW+FAA+A VEGI KI++G L+ LSEQ++LDC+ +
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSYGC 184
Query: 195 -------------GNNG---------------CLGGS----------------REKAFAY 210
NNG C S E++ Y
Sbjct: 185 KGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSMMY 244
Query: 211 IIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ NQ G+F+G CGT L+HA+TI+G+G G YW+++NSWG+
Sbjct: 245 AVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGS 304
Query: 254 TWGDAGYMKIVR----DEGLCGIGTRSSYP 279
+WG+ GY+++ R G+CGI +P
Sbjct: 305 SWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 205 bits (521), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 169/313 (53%), Gaps = 74/313 (23%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T+ ++E+ E WM++H ++YK EK R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLT++EF+ Y G P S + S+ F+Y+++ TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ VAAVEGI +I +GNL LSEQ+L+DC T N+GC GG + AF YII G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 218 FN----------GVCGTQLDHA--VTIVGF------------------------------ 235
G+C Q + VTI G+
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 236 -------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD---- 266
GT D G++Y ++KNSWG WG+ G++++ R+
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKP 338
Query: 267 EGLCGIGTRSSYP 279
EGLCGI +SYP
Sbjct: 339 EGLCGINKMASYP 351
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 196 bits (497), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 126/336 (37%), Positives = 174/336 (51%), Gaps = 82/336 (24%)
Query: 18 MFIIITLLVSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
+F+ + L S V S++ T + ++++ E WM +H + YK+ EK R +IFK+NL
Sbjct: 17 LFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNL 76
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
+YI++ NK+ N +Y LG N F+D++NDEF+ YTG + ++ +T S + N +
Sbjct: 77 KYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVN 133
Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
+P +DWR KGAVTP+KNQ CG CWAF+AV +EGI KIR+GNL + SEQ+LLDC
Sbjct: 134 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR- 192
Query: 196 NNGCLGG------------------------------SREK------------------- 206
+ GC GG SREK
Sbjct: 193 SYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEG 252
Query: 207 AFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 248
A Y I NQ GIF G CG ++DHAV VG+ G NY LIK
Sbjct: 253 ALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIK 307
Query: 249 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 280
NSWG WG+ GY++I R G+CG+ T S YP+
Sbjct: 308 NSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 180 bits (457), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 111/321 (34%), Positives = 165/321 (51%), Gaps = 81/321 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
++ V I+ +W A+HG++ + +++ R IFK+NL +I+ N++ N TYKLG
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101
Query: 95 QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
+F+DLTNDE+R LY G + P+ R + KY ++ +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
K+Q CG CWAF+ AAVEGI KI +G LI LSEQ+L+DC + N GC GG + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220
Query: 212 IQNQGI----------FNGVCGTQLDHA--VTIVGF------------------------ 235
++N G+ F G C + L ++ V+I G+
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280
Query: 236 -------------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIV 264
GT D G +YW+++NSWG WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRME 340
Query: 265 RD-----EGLCGIGTRSSYPL 280
R+ G CGI +SYP+
Sbjct: 341 RNLAASKSGKCGIAVEASYPV 361
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 174 bits (442), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 161/316 (50%), Gaps = 80/316 (25%)
Query: 44 VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
+ I+ +W +HG+S + +++ R IFK+NL +I+ N+ N TYKLG F++
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
LTNDE+R+LY G + P R T + KY +++ +VP ++DWR KGAV IK+Q
Sbjct: 61 LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
CG CWAF+ AAVEGI KI +G L+ LSEQ+L+DC + N GC GG + AF +I++N
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 216 GI----------FNGVCGTQLDHA--VTIVGF---------------------------- 235
G+ NG C + L ++ VTI G+
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239
Query: 236 ---------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD-- 266
GT D G +YW+++NSWG WG+ GY+++ R+
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299
Query: 267 --EGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 300 SKSGKCGIAIEASYPV 315
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 174 bits (441), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 106/316 (33%), Positives = 167/316 (52%), Gaps = 80/316 (25%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQ 95
E ++ W+A++G + L E E R +F +NL++++ N + ++LG N+
Sbjct: 45 EAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNR 104
Query: 96 FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
F+DLTN+EFRA + G K+ + RS + +Y++ + ++P S+DWR+KGAV P+KNQ
Sbjct: 105 FADLTNEEFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQG 160
Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQN 214
+CG CWAF+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG + AF +II+N
Sbjct: 161 QCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKN 220
Query: 215 QGI----------FNGVCGTQLDHA--VTIVGF--------------------------- 235
GI +G C ++A V+I GF
Sbjct: 221 GGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAG 280
Query: 236 ----------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD- 266
GT+ D G +YW+++NSWG WG++GY+++ R+
Sbjct: 281 GREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNI 340
Query: 267 ---EGLCGIGTRSSYP 279
G CGI +SYP
Sbjct: 341 NVTTGKCGIAMMASYP 356
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 173 bits (438), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/216 (42%), Positives = 134/216 (62%), Gaps = 15/216 (6%)
Query: 40 EQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
E V+ I+E W+ +HG+ S +EK+ R +IFK+NL ++++ N E N +Y+LG +F+
Sbjct: 43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFA 101
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DLTNDE+R+ Y G KM R T+ +Y+ ++P S+DWR KGAV +K+Q C
Sbjct: 102 DLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVAEVKDQGGC 158
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
G CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF +II+N GI
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218
Query: 218 -------FNGVCGT--QLDHAVTIVGFGTTEDGANY 244
+ GV GT Q+ +V + ED Y
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTY 254
Score = 89.4 bits (220), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 55/76 (72%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF+G CGTQLDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 278 AFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARNIA 336
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYP+
Sbjct: 337 SSSGKCGIAIEPSYPI 352
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 171 bits (434), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 169/352 (48%), Gaps = 91/352 (25%)
Query: 18 MFIIITLLVSCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGRSYKD 60
+F++ ++ SCA+ VVSS H Q + + + E WM +HG+ Y
Sbjct: 10 IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69
Query: 61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
EKE RL IF++NL +I N E N +Y+LG N+F+DL+ E+ + G P +
Sbjct: 70 VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
+S+ +Y+ +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188
Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL---D 227
+ LSEQ L++C+ NNGC GG E A+ +I+ N G+ NGVC +L +
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247
Query: 228 HAVTIVGF----------------------------------------GTTEDGANYWLI 247
V I G+ GT N+ ++
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307
Query: 248 KNSWG---------------NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
+G +TWG+AGYMK+ R+ GLCGI R+SYPL
Sbjct: 308 VVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 169 bits (429), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 128/210 (60%), Gaps = 5/210 (2%)
Query: 10 SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
SF+ ++ + +S + +E V+ ++E+W+ ++G++Y EKE R K
Sbjct: 4 SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63
Query: 70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM +S + +YQ
Sbjct: 64 IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120
Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
+P +DWR++GAV P +K Q ECG CWAFAA AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180
Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGI 217
+DC N N GC GG AF +I +N GI
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGI 210
Score = 68.2 bits (165), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 42/70 (60%), Gaps = 5/70 (7%)
Query: 216 GIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
G++ G C DH V IVG+GT+ D +YWLI+NSWG WG+ GY+++ R+ G C
Sbjct: 277 GVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKC 336
Query: 271 GIGTRSSYPL 280
+ YP+
Sbjct: 337 AVAVAPVYPI 346
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 168 bits (426), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 132/206 (64%), Gaps = 15/206 (7%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
+H++ ++E+ E W++ ++Y+ EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
DL+++EF+ +Y G K S + F Y+++ VP S+DWR KGAV +KNQ
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
CG CWAF+ VAAVEGI KI +GNL LSEQ+L+DC T NNGC GG + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
Query: 217 IFN----------GVCGTQLDHAVTI 232
+ G C Q D + T+
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETV 244
Score = 76.3 bits (186), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 49/68 (72%), Gaps = 5/68 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
G+F+G CG LDH V VG+G+++ G++Y ++KNSWG WG+ GY+++ R+ EGLCG
Sbjct: 286 GVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCG 344
Query: 272 IGTRSSYP 279
I +S+P
Sbjct: 345 INKMASFP 352
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 168 bits (426), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 106/317 (33%), Positives = 154/317 (48%), Gaps = 83/317 (26%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
+ E+W +H ++Y+DE E+ RLKIF EN I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
+ EFR L G+ +FK + + + +P S+DWR KGAVT +K+Q
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------------------------------ 186
CG CWAF++ A+EG +SG L+ LSEQ
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 187 ----------QLLDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
+ +D S + N G +G + ++ F I Q
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGAT-DRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293
Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
++G++N C Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKML 353
Query: 265 RD-EGLCGIGTRSSYPL 280
R+ E CGI + SSYPL
Sbjct: 354 RNKENQCGIASASSYPL 370
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 164 bits (416), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 133/232 (57%), Gaps = 23/232 (9%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
+F ++ L +C E+ + ++++W + H RS E+E R +F+ N+
Sbjct: 9 LFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFNVFRHNV 65
Query: 76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQN 130
++ NK+ NR+YKL N+F+DLT +EF+ YTG ++M R + + ++N
Sbjct: 66 MHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHEN 124
Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
LS +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++ L+ LSEQ+L+D
Sbjct: 125 LSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTI 232
C T N GC GG E AF +I +N GI +G C D+ V +
Sbjct: 183 CDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLV 234
Score = 82.8 bits (203), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 49/70 (70%), Gaps = 5/70 (7%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
+G+F G CGT+L+H V VG+G+ E G YW+++NSWG WG+ GY+KI R+ EG C
Sbjct: 275 EGVFTGSCGTELNHGVAAVGYGS-ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRC 333
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 334 GIAMEASYPI 343
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 164 bits (416), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 114/180 (63%), Gaps = 3/180 (1%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W + H R + EK R FK N +I NK G+ Y+L N+F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
EFRA + G + +PS + + F Y L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF 218
CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T N+GC GG + AF YI N G+
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216
Score = 95.5 bits (236), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
A++ ++G+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 278 AFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 268 ---GLCGIGTRSSYPL 280
GLCGI +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 164 bits (415), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 75/179 (41%), Positives = 110/179 (61%), Gaps = 1/179 (0%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+++ +++E+W + H R + EK R FK N +I NK G+ Y+L N+F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
EFRA + G + + F Y L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98 DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF 218
CWAF+ V +VEGI IR+G+L+ LSEQ+L+DC T N+GC GG + AF YI N G+
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216
Score = 94.7 bits (234), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
A++ ++G+F G CGT+LDH V +VG+G EDG YW +KNSWG +WG+ GY+++ +D
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 268 ---GLCGIGTRSSYPL 280
GLCGI +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 164 bits (414), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 127/206 (61%), Gaps = 5/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+ S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC GG F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209
Score = 90.1 bits (222), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 276 GIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 335 ATMPSYPV 342
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 162 bits (411), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 117/189 (61%), Gaps = 6/189 (3%)
Query: 32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
+VS E+ ++ +W A+HG+SY E+E R F++NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
++LG N+F+DLTN+E+R Y G + R + N ++ P S+DWR KGAV
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
IK+Q CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201
Query: 209 AYIIQNQGI 217
+II N GI
Sbjct: 202 DFIINNGGI 210
Score = 85.1 bits (209), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 5/76 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + + GIF G CGT LDH V VG+GT E+G +YW+++NSWG +WG++GY+++ R+
Sbjct: 270 AFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 267 --EGLCGIGTRSSYPL 280
G CGI SYPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 161 bits (408), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 126/206 (61%), Gaps = 5/206 (2%)
Query: 13 INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
++ + +F L++S A + V ++E W+ ++G+SY E E R +IFK
Sbjct: 8 VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y + S S+++ S+ +Y+
Sbjct: 68 ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRV 123
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+P+ +DWR GAV IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183
Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
T GC GG F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209
Score = 90.1 bits (222), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
GIF G CGT +DHAVTIVG+GT E G +YW++KNSW TWG+ GYM+I+R+ G CGI
Sbjct: 276 GIFTGPCGTAVDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334
Query: 273 GTRSSYPL 280
T SYP+
Sbjct: 335 ATMPSYPV 342
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 161 bits (408), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 119/183 (65%), Gaps = 11/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E++V +++E+W H S + E R +F+ N+ ++ + NK+ N+ YKL N+F+D+
Sbjct: 31 EENVWKLYERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
T+ EFR+ Y G ++M R S F Y+N+ T VP+S+DWR+KGAVT +KNQ
Sbjct: 89 THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 144
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
++CG CWAF+ VAAVEGI KIR+ L+ LSEQ+L+DC T N GC GG E AF +I N
Sbjct: 145 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 204
Query: 215 QGI 217
GI
Sbjct: 205 GGI 207
Score = 88.6 bits (218), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 36/74 (48%), Positives = 53/74 (71%), Gaps = 4/74 (5%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 265
+ + ++G+F G CGTQL+H V IVG+G T++G YW+++NSWG WG+ GY++I R
Sbjct: 269 FQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISE 328
Query: 266 DEGLCGIGTRSSYP 279
+EG CGI +SYP
Sbjct: 329 NEGRCGIAMEASYP 342
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 158 bits (399), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/210 (40%), Positives = 127/210 (60%), Gaps = 17/210 (8%)
Query: 19 FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
FI++ L + + H E S+ E++E+W + H + E EK R +FK
Sbjct: 4 FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFNVFK 62
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFK 127
N+++I + NK+ +++YKL N+F D+T++EFR Y G ++M ++T S F
Sbjct: 63 HNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS--FM 119
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
Y N++ +PTS+DWR GAVTP+KNQ +CG CWAF+ V AVEGI +IR+ L LSEQ+
Sbjct: 120 YANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC TN N GC GG + AF +I + G+
Sbjct: 178 LVDCDTNQNQGCNGGLMDLAFEFIKEKGGL 207
Score = 94.7 bits (234), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 82/151 (54%), Gaps = 23/151 (15%)
Query: 143 RDKGAVT-----PIKNQKE-CGCCWAFAAVAAVEG---ITKIRSGNLIQLSEQQLLDCST 193
++KG +T P K E C A V +++G + K +L++ Q + +
Sbjct: 202 KEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAI 261
Query: 194 NGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
+ GGS + ++G+F G CGT+L+H V +VG+GTT DG YW++KNSWG
Sbjct: 262 DA-----GGS-----DFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGE 311
Query: 254 TWGDAGYMKIVR----DEGLCGIGTRSSYPL 280
WG+ GY+++ R EGLCGI +SYPL
Sbjct: 312 EWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 157 bits (396), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 127/209 (60%), Gaps = 15/209 (7%)
Query: 17 PMFIIITLL----VSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
P FI + L+ +S A + + E S+ ++EKW H + +D EK R +
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62
Query: 71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
FKEN+++I + N++ + YKL N+F D+TN EFR+ Y G K+ HRS + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120
Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
F Y+N+ S+DWR KGAVT +K+Q +CG CWAF+ +A+VEGI +I++G L+ LSE
Sbjct: 121 FMYENVGSLPA-ASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSE 179
Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
Q+L+DC T+ N GC GG + AF +I +N
Sbjct: 180 QELVDCDTSYNEGCNGGLMDYAFEFIQKN 208
Score = 91.7 bits (226), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 52/77 (67%), Gaps = 4/77 (5%)
Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 265
+ + ++G+F G CGT+LDH V IVG+G T DG YW++KNSWG WG++GY+++ R
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGI 328
Query: 266 --DEGLCGIGTRSSYPL 280
G CGI +SYP+
Sbjct: 329 SDKRGKCGIAMEASYPI 345
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 157 bits (396), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/339 (31%), Positives = 154/339 (45%), Gaps = 85/339 (25%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHE----KWMAQHGRSYKDELEKEMRLKIFKE 73
M+ + LL + A ++S+ +T E +V I + WM QH ++Y E RL++F
Sbjct: 1 MWTALPLLCAGA-WLLSAGATAELTVNAIEKFHFTSWMKQHQKTYSSR-EYSHRLQVFAN 58
Query: 74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
N I+ N+ N T+K+G NQFSD++ F + Y P + S T S +
Sbjct: 59 NWRKIQAHNQR-NHTFKMGLNQFSDMS---FAEIKHKYLWSEPQNCSATKSNYL---RGT 111
Query: 134 TDVPTSLDWRDKG-AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
P+S+DWR KG V+P+KNQ CG CW F+ A+E I SG ++ L+EQQL+DC+
Sbjct: 112 GPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCA 171
Query: 193 TNGNN-GCLGGSREKAFAYIIQNQGIF----------NGVCGTQLDHAVTIV-------- 233
N NN GC GG +AF YI+ N+GI NG C + AV V
Sbjct: 172 QNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNVVNITL 231
Query: 234 ------------------GFGTTED----------------------------------G 241
F TED G
Sbjct: 232 NDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNG 291
Query: 242 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
YW++KNSWG+ WG+ GY I R + +CG+ +SYP+
Sbjct: 292 LLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASYPI 330
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 156 bits (395), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 78/177 (44%), Positives = 114/177 (64%), Gaps = 5/177 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM +H ++YK+ EK R +IFK+NL+YI++ NK N Y LG N+FS
Sbjct: 39 TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF+ Y G P + ++ N + D+P S+DWR KGAVTP+K+Q C
Sbjct: 98 DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CWAF+ VA VEGI KI++GNL++LSEQ+L+DC + GC G + + Y+ QN
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN 210
Score = 60.8 bits (146), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/69 (49%), Positives = 44/69 (63%), Gaps = 5/69 (7%)
Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R G+CG
Sbjct: 279 GIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGNSPGVCG 337
Query: 272 IGTRSSYPL 280
+ S YP+
Sbjct: 338 VYRSSYYPI 346
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 156 bits (395), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 76/177 (42%), Positives = 115/177 (64%), Gaps = 5/177 (2%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ WM H + Y++ EK R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 39 TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97
Query: 98 DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
DL+NDEF Y G + + +S ++ N ++P ++DWR KGAVTP+++Q C
Sbjct: 98 DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC ++GC GG A Y+ +N
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN 210
Score = 61.6 bits (148), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 46/78 (58%), Gaps = 5/78 (6%)
Query: 206 KAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
K + + GIF G CGT++DHAVT VG+G + LIKNSWG WG+ GY++I R
Sbjct: 269 KGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKR 327
Query: 266 ----DEGLCGIGTRSSYP 279
G+CG+ S YP
Sbjct: 328 APGNSPGVCGLYKSSYYP 345
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 155 bits (393), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 115/180 (63%), Gaps = 9/180 (5%)
Query: 38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
T + ++++ + WM +H + Y+ EK R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39 TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97
Query: 98 DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
DL+NDEF+ Y G+ H T+K+ +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98 DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC + + GC GG + + Y+ N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANN 212
Score = 83.2 bits (204), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 38/73 (52%), Positives = 50/73 (68%), Gaps = 5/73 (6%)
Query: 212 IQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 267
+ G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG WG+ GYM++ R +
Sbjct: 277 LYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ 335
Query: 268 GLCGIGTRSSYPL 280
G CG+ S YP
Sbjct: 336 GTCGVYKSSYYPF 348
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 155 bits (393), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/210 (40%), Positives = 119/210 (56%), Gaps = 9/210 (4%)
Query: 15 TTPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
T + + L S V +S H E+S+ +++E+W + H S + EK R
Sbjct: 2 ATKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRF 60
Query: 69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFK 127
+FK NL ++ NK ++ YKL N+F+D+TN EFR+ Y G K+ P R T
Sbjct: 61 NVFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGA 119
Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
+ + VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++ L+ LSEQ+
Sbjct: 120 FMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQE 179
Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
L+DC N GC GG E AF +I Q GI
Sbjct: 180 LVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Score = 90.1 bits (222), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 50/70 (71%), Gaps = 4/70 (5%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
+G+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+ EGLC
Sbjct: 275 EGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLC 334
Query: 271 GIGTRSSYPL 280
GI SYP+
Sbjct: 335 GIAMLPSYPI 344
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 155 bits (392), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/233 (37%), Positives = 134/233 (57%), Gaps = 24/233 (10%)
Query: 16 TPMFIIITLLV--SCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELE 63
+ M I++ +V SCA+ + S +++ SV + I E WM +HG+ Y E
Sbjct: 6 SAMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 65
Query: 64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
KE RL IF++NL +I N E N +Y+LG F+DL+ E++ + G P + +
Sbjct: 66 KERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMT 124
Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
S+ +Y+ + +P S+DWR++GAVT +K+Q C CWAF+ V AVEG+ KI +G L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184
Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL 226
SEQ L++C+ NNGC GG E A+ +I++N G+ NGVC +L
Sbjct: 185 SEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRL 236
Score = 91.7 bits (226), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 41/75 (54%), Positives = 55/75 (73%), Gaps = 5/75 (6%)
Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
+ + G+F+G CGT L+H V +VG+GT E+G +YWL+KNS G TWG+AGYMK+ R+
Sbjct: 279 FQLYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIAN 337
Query: 267 -EGLCGIGTRSSYPL 280
GLCGI R+SYPL
Sbjct: 338 PRGLCGIAMRASYPL 352
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 154 bits (389), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 143/303 (47%), Gaps = 76/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E R +IF E+L+ + N++G +Y+LG N+F+D++ +EFRA
Sbjct: 61 RFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKG-LSYRLGINRFADMSWEEFRAT- 118
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
++ + + S T + + +P + DWR+ G V+P+KNQ CG CW F+ A
Sbjct: 119 ---RLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGA 175
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G I LSEQQL+DC NN GC GG +AF YI N G+
Sbjct: 176 LEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQG 235
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L AV +V GF
Sbjct: 236 VNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTS 295
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GTT EDG YWLIKNSWG WGD GY K+ + +CG+ T +S
Sbjct: 296 DHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCAS 355
Query: 278 YPL 280
YP+
Sbjct: 356 YPI 358
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 154 bits (389), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 117/180 (65%), Gaps = 4/180 (2%)
Query: 39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
+E V ++E+W+ ++ ++Y EKE R KIFK+NL+++++ N +RT+++G +F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 99 LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
LTN+EFRA+Y KM + S + + Y+ + +P +DWR GAV +K+Q CG
Sbjct: 96 LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152
Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
CWAF+AV AVEGI +I +G LI LSEQ+L+DC N GC GG AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Score = 79.0 bits (193), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 48/75 (64%), Gaps = 5/75 (6%)
Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
A+ + G+ G CG LDH V +VG+G+T G +YW+I+NSWG WGD+GY+K+ R+
Sbjct: 274 AFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNID 332
Query: 267 --EGLCGIGTRSSYP 279
G CGI SYP
Sbjct: 333 DPFGKCGIAMMPSYP 347
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 152 bits (384), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 143/306 (46%), Gaps = 83/306 (27%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ ++G+SY+ E R +IF E+LE + N++G Y+LG N+FSD++ +EF+A
Sbjct: 63 RFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKG-LPYRLGINRFSDMSWEEFQATR 121
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
G T S+T +L M D +P + DWR+ G V+P+KNQ CG CW F+
Sbjct: 122 LGAAQ-------TCSATLAGNHL-MRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFST 173
Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI------- 217
A+E +G I LSEQQL+DC+ NN GC GG +AF YI N GI
Sbjct: 174 TGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYP 233
Query: 218 ---FNGVCG---------------------TQLDHAVTIV-----------GF------- 235
NGVC +L +AV +V GF
Sbjct: 234 YKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGV 293
Query: 236 ------GTTEDGAN---------------YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
GTT D N YWLIKNSWG WGD GY K+ + +C I T
Sbjct: 294 YTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAIAT 353
Query: 275 RSSYPL 280
+SYP+
Sbjct: 354 CASYPV 359
>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
PE=2 SV=2
Length = 362
Score = 151 bits (381), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 142/303 (46%), Gaps = 77/303 (25%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +HG+ Y D E + R +IF E+LE + N+ G Y+LG N+F+D++ +EF+A
Sbjct: 64 RFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRG-LPYRLGINRFADMSWEEFQASR 122
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
G + + + ++ +P + DWR+ G V+P+K+Q CG CW F+ +
Sbjct: 123 LG-----AAQNCSATLAGNHRMRDAAALPETKDWREDGIVSPVKDQGHCGSCWTFSTTGS 177
Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
+E +G + LSEQQL+DC+T NN GC GG +AF YI N G+
Sbjct: 178 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTG 237
Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
NG+C +L +AV +V GF
Sbjct: 238 VNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTS 297
Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
GT+ E+G YWLIKNSWG WGD GY K+ + +CGI T +S
Sbjct: 298 DHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 357
Query: 278 YPL 280
YP+
Sbjct: 358 YPI 360
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 150 bits (379), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 79/183 (43%), Positives = 113/183 (61%), Gaps = 11/183 (6%)
Query: 40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
E+S+ +++E+W + H S + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
TN EFR+ Y G +KM S S TF Y+ + VP S+DWR KGAVT +K+Q
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146
Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
+CG CWAF+ + AVEGI +I++ L+ LSEQ+L+DC N GC GG E AF +I Q
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 215 QGI 217
GI
Sbjct: 207 GGI 209
Score = 90.9 bits (224), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 51/70 (72%), Gaps = 4/70 (5%)
Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
+G+F G C T L+H V IVG+GTT DG NYW+++NSWG WG+ GY+++ R+ EGLC
Sbjct: 275 EGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLC 334
Query: 271 GIGTRSSYPL 280
GI +SYP+
Sbjct: 335 GIAMMASYPI 344
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 150 bits (379), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 80/202 (39%), Positives = 119/202 (58%), Gaps = 21/202 (10%)
Query: 46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
++E+W + H S + EK+ R +FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 37 LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94
Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
Y+G K+ HR + TF Y+ + VP S+DWR KGAVT +K+Q +CG C
Sbjct: 95 NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150
Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI--- 217
WAF+ + AVEGI +I++ L+ LSEQ+L+DC T+ N GC GG + AF +I Q GI
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210
Query: 218 -------FNGVCGTQLDHAVTI 232
++G C ++A +
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAV 232
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 147 bits (372), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 108/159 (67%), Gaps = 8/159 (5%)
Query: 63 EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
E E R ++F +NL++++ N + ++LG N+F+DLTN EFRA Y G P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
+ Y++ + +P S+DWRDKGAV P+KNQ +CG CWAF+AVAAVEGI KI +G
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
L+ LSEQ+L++C+ NG N+GC GG + AFA+I +N G+
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGL 238
Score = 76.3 bits (186), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 51/84 (60%), Gaps = 9/84 (10%)
Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGY 260
G RE + + + G+F G CGT LDH V VG+GT GA YW ++NSWG WG+ GY
Sbjct: 295 GGRE----FQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGY 350
Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
+++ R+ G CGI +SYP+
Sbjct: 351 IRMERNVTARTGKCGIAMMASYPI 374
>sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium discoideum GN=cprF PE=2 SV=1
Length = 434
Score = 143 bits (361), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 111/203 (54%), Gaps = 10/203 (4%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M ++ L V S + + E WM H R Y E E R IFK N++Y
Sbjct: 1 MKVLSALCVLLVSVATAKQQLSELQYRNAFTNWMIAHQRHYSSE-EFNGRFNIFKANMDY 59
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
I + N +G+ T LG N F+D+TN+E+RA Y G + S T S + +
Sbjct: 60 INEWNTKGSETV-LGLNVFADITNEEYRATYLGTPFDASSLEMTPS-----EKVFGGVQA 113
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG--NLIQLSEQQLLDCS-TN 194
S+DWR KGAVTPIKNQ ECG CW+F+A A EG I +G +L +SEQQL+DCS +
Sbjct: 114 NSVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSY 173
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GNNGC GG AF YII N GI
Sbjct: 174 GNNGCEGGLMTLAFEYIINNGGI 196
Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 281
NYW++KNSWG WG GY+ + +D + CGI T +S P A
Sbjct: 388 NYWIVKNSWGLDWGINGYILMSKDKDNQCGIATMASIPQA 427
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 143 bits (360), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 144/305 (47%), Gaps = 82/305 (26%)
Query: 49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
++ +H + Y E + R +IF +NL+ I N++G +YKLG N+F+DLT DEFR
Sbjct: 59 RFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKG-LSYKLGINEFTDLTWDEFRK-- 115
Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
+K+ + + S T+ NL +T+V P + DWR G V+P+K Q +CG CW F+
Sbjct: 116 --HKLGASQNCSATTKG----NLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTT 169
Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI-------- 217
A+E G I LSEQQL+DC+ NN GC GG +AF YI N G+
Sbjct: 170 GALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPY 229
Query: 218 --FNGVCG---------------------TQLDHAVTIV-----------GF-------- 235
NG+C +L +AV +V GF
Sbjct: 230 TGKNGICKFSQANIGVKVISSVNITLGAEYELKYAVALVRPVSVAFEVVKGFKQYKSGVY 289
Query: 236 GTTE--------------------DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 275
+TE +G YWLIKNSWG WG+ GY K+ + +CG+ T
Sbjct: 290 ASTECGDTPMDVNHAVLAVGYGVENGTPYWLIKNSWGADWGEDGYFKMEMGKNMCGVATC 349
Query: 276 SSYPL 280
+SYP+
Sbjct: 350 ASYPI 354
>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
Length = 335
Score = 143 bits (360), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 138/312 (44%), Gaps = 81/312 (25%)
Query: 42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
S+ + H K WM++H ++Y E E RL+ F N I A+ GN T+K+ NQFSD++
Sbjct: 29 SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86
Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGC 159
E + Y P + S T S + P S+DWR KG V+P+KNQ CG
Sbjct: 87 FAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGS 140
Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF 218
CW F+ A+E I +G ++ L+EQQL+DC+ + NN GC GG +AF YI+ N+GI
Sbjct: 141 CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIM 200
Query: 219 ----------NGVCGTQLDHAVTIV--------------------------GFGTTED-- 240
+G C Q A+ V F T+D
Sbjct: 201 GEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFM 260
Query: 241 --------------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEG 268
G YW++KNSWG WG GY I R +
Sbjct: 261 MYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN 320
Query: 269 LCGIGTRSSYPL 280
+CG+ +SYP+
Sbjct: 321 MCGLAACASYPI 332
>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
Length = 334
Score = 142 bits (359), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/342 (29%), Positives = 155/342 (45%), Gaps = 91/342 (26%)
Query: 17 PMFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENL 75
P F + L + V+S + ++ H +W A H R Y E+E R ++++N
Sbjct: 3 PSFFLTVLCLG-----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNK 56
Query: 76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
+ I+ N+E G +++ N F D+TN+EFR + G++ + H+ +
Sbjct: 57 KIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ--NQKHKKGKL----FHEPL 110
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
+ DVP S+DW KG VTP+KNQ +CG CWAF+A A+EG ++G L+ LSEQ L+DCS
Sbjct: 111 LVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Query: 193 -------TNG-----------NNGCLGGS------------------------------- 203
NG +NGCL
Sbjct: 171 RAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIP 230
Query: 204 -REKAFAYIIQNQG--------------------IFNGVCGTQ-LDHAVTIVGFG---TT 238
REKA + G ++ C ++ LDH V +VG+G T
Sbjct: 231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290
Query: 239 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
+ +W++KNSWG WG GY+K+ +D+ CGI T +SYP
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYP 332
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 142 bits (359), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/176 (42%), Positives = 106/176 (60%), Gaps = 14/176 (7%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
E++ + GR Y D E+ RL +F +NL+YIE+ NK+ G TY L NQFSD+TN++F
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 105 RALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
A+ GYK P P+ T++ T +DWR KGAVTP+K+Q +CG CWAF
Sbjct: 81 NAVMKGYKKGPRPAAVFTSTDA--------APESTEVDWRTKGAVTPVKDQGQCGSCWAF 132
Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLGGSREKAFAYIIQNQGI 217
+ +EG +++G L+ LSEQQL+DC+ N GC GG E+A Y+ N G+
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGV 188
Score = 66.6 bits (161), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 42/57 (73%), Gaps = 2/57 (3%)
Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
+QLDHAV VG+G+ E G ++WL+KNSW +WG++GY+K+ R+ CGI T + YP
Sbjct: 265 SQLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATDACYP 320
>sp|Q94504|CYSP7_DICDI Cysteine proteinase 7 OS=Dictyostelium discoideum GN=cprG PE=1 SV=1
Length = 460
Score = 142 bits (358), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 82/203 (40%), Positives = 110/203 (54%), Gaps = 12/203 (5%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
M ++ L V S + + E WM H R Y E E R IFK N++Y
Sbjct: 1 MKVLSALCVLLVSVATAKQQLSEVEYRNAFTNWMIAHQRHYSSE-EFNGRYNIFKANMDY 59
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
+ + N +G+ T LG N F+D++N+E+RA Y G + S T S + D
Sbjct: 60 VNEWNTKGSETV-LGLNVFADISNEEYRATYLGTPFDASSLEMTESD-------KIFDAS 111
Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG--NLIQLSEQQLLDCS-TN 194
+DWR +GAVTPIKNQ +CG CW+F+ A EG + +G NL+ LSEQ L+DCS +
Sbjct: 112 AQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSY 171
Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
GNNGC GG AF YII N+GI
Sbjct: 172 GNNGCEGGLMTLAFEYIINNKGI 194
Score = 44.7 bits (104), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 243 NYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPLA 281
+YW++KNSWG +WG GY+ + + + CGI T +S P A
Sbjct: 417 DYWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPTA 456
>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
PE=3 SV=1
Length = 337
Score = 142 bits (358), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 144/332 (43%), Gaps = 79/332 (23%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
+ +I T+L+ +SQ+ E ++ + + Y D K R KIFK+NLE
Sbjct: 3 LLMIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNLED 62
Query: 78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
I + NK N + N+FSDL+ +E YTG PS+ ++S F N+ D
Sbjct: 63 INEKNKL-NDSAIYNINKFSDLSKNELLTKYTGLTSKKPSNMVRSTSNF--CNVIHLDAP 119
Query: 136 ------VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
+P + DWR +T +K+Q CG CWA AAV +E + I+ LI LSEQQL+
Sbjct: 120 PDVHDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLI 179
Query: 190 DCSTNGNNGCLGGSRE-------------------------------KAFA--------Y 210
DC + N C GG K FA Y
Sbjct: 180 DCDS-ANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTKGVCKIDNKKFALSVSSCKRY 238
Query: 211 IIQNQ---------------------------GIFNGVCGTQLDHAVTIVGFGTTEDGAN 243
I QN+ GI + L+HAV +VG+GT E G +
Sbjct: 239 IFQNEENLKKELITMGPIAMAIDAASISTYSKGIIHFCENLGLNHAVLLVGYGT-EGGVS 297
Query: 244 YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 275
YW +KNSWG+ WG+ GY ++ R+ CG+ +
Sbjct: 298 YWTLKNSWGSDWGEDGYFRVKRNINACGLNNQ 329
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 141 bits (356), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L+ C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++PS R+ T Y++
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198
Score = 62.8 bits (151), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 50/78 (64%), Gaps = 3/78 (3%)
Query: 203 SREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
+R +F ++ ++ + C ++H V +VG+G +G YWL+KNSWG+ +G+ GY++
Sbjct: 254 ARHPSF-FLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGHNFGEEGYIR 311
Query: 263 IVRDEG-LCGIGTRSSYP 279
+ R++G CGI + SYP
Sbjct: 312 MARNKGNHCGIASFPSYP 329
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 140 bits (354), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 106/174 (60%), Gaps = 10/174 (5%)
Query: 48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
+ + Q+GR Y D E+ R ++F++N + IE NK+ G T+K+ NQF D+TN+EF
Sbjct: 21 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80
Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
A+ GYK S R + F + M +DWR K VTP+K+Q++CG CWAF+
Sbjct: 81 NAVMKGYKKGS---RGEPKAVFTAEAGPMA---ADVDWRTKALVTPVKDQEQCGSCWAFS 134
Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
A A+EG +++ L+ LSEQQL+DCST+ GN+GC GG AF YI N GI
Sbjct: 135 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 188
Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 49/75 (65%), Gaps = 4/75 (5%)
Query: 208 FAYIIQNQGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
F++ + G++ T LDH V VG+GT E +YWL+KNSWG++WGDAGY+K+ R
Sbjct: 246 FSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSR 304
Query: 266 D-EGLCGIGTRSSYP 279
+ + CGI + SYP
Sbjct: 305 NRDNNCGIASEPSYP 319
>sp|P54639|CYSP4_DICDI Cysteine proteinase 4 OS=Dictyostelium discoideum GN=cprD PE=2 SV=2
Length = 442
Score = 138 bits (348), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 86/202 (42%), Positives = 114/202 (56%), Gaps = 15/202 (7%)
Query: 20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
+ LLVS AS + + E WM H R+Y E E R +IFK N++Y+
Sbjct: 6 FLCLLLVSYAS---AKQQFSELQYRNAFTNWMQAHQRTYSSE-EFNARYQIFKSNMDYVH 61
Query: 80 KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
+ N +G T LG N F+D+TN E+R Y G +P S T + + + T PT
Sbjct: 62 QWNSKGGETV-LGLNVFADITNQEYRTTYLG----TPFDGSALIGT-EEEKIFSTPAPT- 114
Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG---NLIQLSEQQLLDCSTN-G 195
+DWR +GAVTPIKNQ +CG CW+F+ + EG I SG +L+ LSEQ L+DCS + G
Sbjct: 115 VDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG 174
Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
NNGC GG AF YII N+GI
Sbjct: 175 NNGCEGGLMTLAFEYIINNKGI 196
Score = 48.9 bits (115), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 28/40 (70%), Gaps = 1/40 (2%)
Query: 243 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 281
NYW++KNSWG +WG GY+ + +D CGI T +S+P A
Sbjct: 400 NYWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPTA 439
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 138 bits (347), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/205 (37%), Positives = 119/205 (58%), Gaps = 13/205 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ +L C+S V H+ ++ H W +G+ YK++ E+ +R I+++NL+
Sbjct: 1 MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T++E +L + ++P+ R+ T + Q L
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNPNQML-- 115
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCS
Sbjct: 116 ---PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172
Query: 194 N-GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG +AF YII N+GI
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGI 197
Score = 65.9 bits (159), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 55/96 (57%), Gaps = 2/96 (2%)
Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANY 244
E L + N C+G ++ ++ ++ C +++H V ++G+G +G Y
Sbjct: 234 EDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDL-NGKEY 292
Query: 245 WLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 279
WL+KNSWG+ +G+ GY+++ R++G CGI + SYP
Sbjct: 293 WLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 138 bits (347), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/181 (41%), Positives = 106/181 (58%), Gaps = 9/181 (4%)
Query: 46 IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
I E+W QH ++Y +E+E+ R+KIF EN I K N+ +G +YKLG N+++D+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
+ EF+ GY + T Y + VP S+DWR+ GAVT +K+Q C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
G CWAF++ A+EG ++G L+ LSEQ L+DCST GNNGC GG + AF YI N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 217 I 217
I
Sbjct: 204 I 204
Score = 80.9 bits (198), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 53/74 (71%), Gaps = 3/74 (4%)
Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
++ + ++G++N C Q LDH V +VG+GT E G +YWL+KNSWG TWG+ GY+K+ R+
Sbjct: 264 SFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARN 323
Query: 267 E-GLCGIGTRSSYP 279
+ CGI T SSYP
Sbjct: 324 QNNQCGIATASSYP 337
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 137 bits (346), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/192 (39%), Positives = 111/192 (57%), Gaps = 12/192 (6%)
Query: 33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
V+ ++ H + W H + YKD+ E+E+R I+++NL++I N E G T
Sbjct: 21 VAMEQLQRDPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHT 80
Query: 89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
Y++G N D+TN+E ++P S ++ T +++ S +P ++DWR+KG V
Sbjct: 81 YQVGMNDMGDMTNEEILCRMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCV 135
Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSRE 205
T +K Q CG CWAF+AV A+EG K+++G LI LS Q L+DCS GN GC GG
Sbjct: 136 TEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMT 195
Query: 206 KAFAYIIQNQGI 217
+AF YII N GI
Sbjct: 196 EAFQYIIDNGGI 207
Score = 63.9 bits (154), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 47/73 (64%), Gaps = 3/73 (4%)
Query: 209 AYIIQNQGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-D 266
++ G+++ C ++H V +VG+GT DG +YWL+KNSWG +GD GY+++ R +
Sbjct: 267 SFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNN 325
Query: 267 EGLCGIGTRSSYP 279
+ CGI + SYP
Sbjct: 326 KNHCGIASYCSYP 338
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 137 bits (346), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 79/206 (38%), Positives = 115/206 (55%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ LL C+ V H+ ++ H W + + YK+E E+ R I+++NL+
Sbjct: 1 MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
++ N E G +Y LG N D+T +E +L ++PS R+ T Y++ S
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSN 112
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
+P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG AF YII N GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGI 198
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 137 bits (346), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/205 (40%), Positives = 112/205 (54%), Gaps = 21/205 (10%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK-----WMAQHGRSYKDELEKEMRLKIFK 72
+ + LLVS A T +Q E+ + WM H +SY E E R IFK
Sbjct: 4 LSFLCVLLVSVA--------TAKQQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFK 54
Query: 73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
N++Y+++ N +G+ T LG N F+D+TN+E+R Y G K + S T + + +
Sbjct: 55 ANMDYVQQWNSKGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGT-----QEEKVF 108
Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
T S DWR +GAVTP+KNQ +CG CW+F+ + EG G L+ LSEQ L+DCS
Sbjct: 109 TTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCS 168
Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
T N+GC GG AF YII N GI
Sbjct: 169 TE-NSGCDGGLMTYAFEYIINNNGI 192
Score = 45.8 bits (107), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 28/38 (73%), Gaps = 1/38 (2%)
Query: 244 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
YW++KNSWG +WG GY+ + R+ + CGI + +S+P+
Sbjct: 306 YWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPV 343
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 137 bits (345), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 118/206 (57%), Gaps = 14/206 (6%)
Query: 18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
M ++ L+ C+S + H ++ H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MNWLVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
+ N E G +Y+LG N D+T++E +L + ++PS R+ T + Q L
Sbjct: 58 TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 115
Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
P S+DWR+KG VT +K Q CG CWAF+AV A+E K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172
Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
GN GC GG +AF YII N GI
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGI 198
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.132 0.402
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 102,919,239
Number of Sequences: 539616
Number of extensions: 4214831
Number of successful extensions: 12963
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 224
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 11927
Number of HSP's gapped (non-prelim): 536
length of query: 281
length of database: 191,569,459
effective HSP length: 116
effective length of query: 165
effective length of database: 128,974,003
effective search space: 21280710495
effective search space used: 21280710495
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.7 bits)